BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011482
(484 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/462 (70%), Positives = 384/462 (83%), Gaps = 7/462 (1%)
Query: 23 LLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIV 82
+ G CF+GKK L +HK QW+Q S +SS+C+S Q++R E GA LE+KHK+ CSGKI+
Sbjct: 24 IFDNGVQCFQGKKVLSMHKFQWKQGS-NSSTCLS-QETRWENGATILEMKHKDSCSGKIL 81
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISG-NIKDVSNTEIPLTSGIRLQTLNYIATIEL 141
DWN++ + LI+D+ ++ LQSR+K++ISG NI D + IPLTSGIRLQTLNYI T+EL
Sbjct: 82 DWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDDSVDAPIPLTSGIRLQTLNYIVTVEL 141
Query: 142 GGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
GGR MTVIVDTGSDL+WVQCQPCK CYNQQDPVF+PS SPSY+ VLC+S TC +L+ ATG
Sbjct: 142 GGRKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATG 201
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGV 260
N GVC S+ PP CNY V+YGDGSYTRGELG EHL LG ++ VN+FIFGCGRNN+GLFGG
Sbjct: 202 NLGVCGSN-PPSCNYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCGRNNQGLFGGA 260
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
SGL+GLGRS LSL+SQTS +FGG+FSYCLP T+ ASGSL++GGNSSV+KN+TPI+YT
Sbjct: 261 SGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETE-ASGSLVMGGNSSVYKNTTPISYTR 319
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
MIPNPQL FY LNLTGI++G +QA F K G++IDSGTVITRLPPSIY ALK EF+K
Sbjct: 320 MIPNPQLP-FYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVK 378
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
QFSGFPSAP F ILDTCFNLS YQEV IP +KM FEGNAE+ VDVTG+ YFVK+DASQVC
Sbjct: 379 QFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVC 438
Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
LA+ASLSYE+E GIIGNYQQKNQRVIYDTK S LGFA E C+
Sbjct: 439 LAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEACT 480
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 302/414 (72%), Positives = 354/414 (85%), Gaps = 4/414 (0%)
Query: 71 LKHKNYC--SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI 128
+KH+++C SGK DWN++ Q LILD+ V+ LQSRIK++ SGN D +++IPL+SG+
Sbjct: 1 MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60
Query: 129 RLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
RLQTLNYI T+E+GGRNMTVIVDTGSDLTWVQCQPC+ CYNQQDP+F+PS SPSY+ +LC
Sbjct: 61 RLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILC 120
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
NSSTC +L++ATGN GVC S++P CNY V+YGDGSYTRG+LG E L LG V++FIFG
Sbjct: 121 NSSTCQSLQYATGNLGVCGSNTP-TCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFG 179
Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS 308
CGRNNKGLFGG SGLMGLG+SDLSLVSQTS IF G+FSYCLP+T A ASGSLILGGNSS
Sbjct: 180 CGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTA-ADASGSLILGGNSS 238
Query: 309 VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPP 368
V+KN+TPI+YT MI NPQL TFY LNLTGISIGG LQA + + GILIDSGTVITRLPP
Sbjct: 239 VYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSGTVITRLPP 298
Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
+Y LKAEFLKQFSGFPSAP FSILDTCFNL+ Y EV+IP ++M+FEGNAE+TVDVTGI
Sbjct: 299 PVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAELTVDVTGI 358
Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
YFVK+DASQVCLALASLS++DE IIGNYQQ+NQRVIY+TK S+LGFA E CS
Sbjct: 359 FYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 298/413 (72%), Positives = 351/413 (84%), Gaps = 4/413 (0%)
Query: 71 LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIR 129
+KHK+ CSGKI+DWN++ Q RLI+DN ++ LQSRIKN+I SGNI D +T+IPLTSGIR
Sbjct: 1 MKHKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIR 60
Query: 130 LQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
LQ+LNYI T+ELGGR MTVIVDTGSDL+WVQCQPC CYNQQDPVF+PS SPSY+ VLCN
Sbjct: 61 LQSLNYIVTVELGGRKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCN 120
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
S TC +L+ ATGNSGVC S+ PP CNY V+YGDGSYT GE+G EHL LG +VN+FIFGC
Sbjct: 121 SLTCRSLQLATGNSGVCGSN-PPTCNYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGC 179
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
GR N+GLFGG SGL+GLGR+DLSL+SQ S +FGG+FSYCLP+T+ A ASGSL++GGNSSV
Sbjct: 180 GRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTE-AEASGSLVMGGNSSV 238
Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPS 369
+KN+TPI+YT MI NP L FY LNLTGI++GG ++QA F K ++IDSGTVI+RLPPS
Sbjct: 239 YKNTTPISYTRMIHNP-LLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVISRLPPS 297
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
IY ALKAEF+KQFSG+PSAP F ILD+CFNLS YQEV IP +KM FEG+AE+ VDVTG+
Sbjct: 298 IYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVF 357
Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
Y VK+DASQVCLA+ASL YEDE GIIGNYQQKNQR+IYDTK S LGFA E CS
Sbjct: 358 YSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 586 bits (1510), Expect = e-164, Method: Compositional matrix adjust.
Identities = 292/459 (63%), Positives = 350/459 (76%), Gaps = 9/459 (1%)
Query: 29 HCFEGKKKLHLHKLQWQQKSG--SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNE 86
H KK L +H W K +SSSC S + + TLE+KH+ CSGK +DW +
Sbjct: 30 HGVGEKKILSVHNNIWSPKKSYEASSSCFSRSLGKGRE-STTLEMKHRELCSGKTIDWGK 88
Query: 87 QQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPLTSGIRLQTLNYIATIELGGRN 145
+ + L+LDN+ VQ LQ RIK M S + VS T+IPLTSGI+L+TLNYI T+ELGG+N
Sbjct: 89 KMRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGGKN 148
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
M++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK V CNSSTC L ATGNSG
Sbjct: 149 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGP 208
Query: 206 CSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSG 262
C + C Y VSYGDGSYTRG+L E + LG + + +FGCGRNNKGLFGG SG
Sbjct: 209 CGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNNKGLFGGASG 268
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
LMGLGRS +SLVSQT + F G+FSYCLPS +D GASG+L G + SV+KNST + YT ++
Sbjct: 269 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGTLSFGNDFSVYKNSTSVFYTPLV 327
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
NPQL +FYILNLTG SIGG +L+ F +G ILIDSGTVITRLPPSIY A+K EFLKQF
Sbjct: 328 QNPQLRSFYILNLTGASIGGVELKTLSFGRG-ILIDSGTVITRLPPSIYKAVKTEFLKQF 386
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
SGFPSAPG+SILDTCFNL++Y++++IP +KM FEGNAE+ VDVTG+ YFVK DAS VCLA
Sbjct: 387 SGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDASLVCLA 446
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
LASLSYE+E GIIGNYQQKNQRVIYDT +LG AGE+C
Sbjct: 447 LASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 281/413 (68%), Positives = 335/413 (81%), Gaps = 2/413 (0%)
Query: 71 LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
+K + +CS K +DWN + Q +LILD+L V+ +Q+RI+ + S + + S T+IPL+SGI L
Sbjct: 1 MKDRGHCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINL 60
Query: 131 QTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
QTLNYI T+ LG +NMTVI+DTGSDLTWVQC+PC SCYNQQ P+F PS S SY+ V CNS
Sbjct: 61 QTLNYIVTMGLGSKNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
STC +L+FATGN+G C SS+P CNY V+YGDGSYT GELG E L G SV+DF+FGCG
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVSDFVFGCG 180
Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
RNNKGLFGGVSGLMGLGRS LSLVSQT+ FGG+FSYCLP+T+ AG+SGSL++G SSVF
Sbjct: 181 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTE-AGSSGSLVMGNESSVF 239
Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPS 369
KN+ PITYT M+ NPQL+ FYILNLTGI +GG L+A F GGILIDSGTVITRLP S
Sbjct: 240 KNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITRLPSS 299
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
+Y ALKAEFLK+F+GFPSAPGFSILDTCFNL+ Y EV+IP + + FEGNA++ VD TG
Sbjct: 300 VYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDATGTF 359
Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
Y VK DASQVCLALASLS +T IIGNYQQ+NQRVIYDTK S++GFA E CS
Sbjct: 360 YVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 573 bits (1476), Expect = e-161, Method: Compositional matrix adjust.
Identities = 281/412 (68%), Positives = 331/412 (80%), Gaps = 2/412 (0%)
Query: 71 LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
+K + +CS K +DWN + Q +LI D+L V+ +Q+RI+ ++S + + S T+IPL+SGI L
Sbjct: 1 MKDRGHCSEKKIDWNRRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINL 60
Query: 131 QTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
QTLNYI T+ LG NMTVI+DTGSDLTWVQC+PC SCYNQQ P+F PS S SY+ V CNS
Sbjct: 61 QTLNYIVTMGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
STC +L+FATGN+G C S+ P CNY V+YGDGSYT GELG E L G SV+DF+FGCG
Sbjct: 121 STCQSLQFATGNTGACGSN-PSTCNYVVNYGDGSYTNGELGVEQLSFGGVSVSDFVFGCG 179
Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
RNNKGLFGGVSGLMGLGRS LSLVSQT+ FGG+FSYCLP+T+ +GASGSL++G SSVF
Sbjct: 180 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTE-SGASGSLVMGNESSVF 238
Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSI 370
KN TPITYT M+PNPQL+ FYILNLTGI + G LQ F GG+LIDSGTVITRLP S+
Sbjct: 239 KNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLPSSV 298
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
Y ALKA FLKQF+GFPSAPGFSILDTCFNL+ Y EV+IP + M FEGNAE+ VD TG Y
Sbjct: 299 YKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATGTFY 358
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
VK DASQVCLALASLS +T IIGNYQQ+NQRVIYDTK S++GFA E CS
Sbjct: 359 VVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 285/476 (59%), Positives = 357/476 (75%), Gaps = 6/476 (1%)
Query: 9 TILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
T+L L + F++A G E KK + LQ + GS + +SR E GAI
Sbjct: 7 TMLPFFLSFVFLYFIIANGGCELEQKKMFKVQMLQRNHQFGSKGCILP--ESRKEKGAIV 64
Query: 69 LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN--IKDVSNTEIPLTS 126
LE+K + YCS + ++WN + Q +LI D+L V+ +Q+RI+ +SG+ + S +IPL S
Sbjct: 65 LEMKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSSEIQIPLAS 124
Query: 127 GIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKV 186
GI L+TLNYI TI LG +NMTVI+DTGSDLTWVQC PC SCY+QQ PVF+PS S SY +
Sbjct: 125 GINLETLNYIVTIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSL 184
Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
LCNSSTC L+F TGN+ C S++P CN+ VSYGDGS+T GELG EHL G SV++F+
Sbjct: 185 LCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNFV 244
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
FGCGRNNKGLFGGVSG+MGLGRS+LS++SQT+ FGG+FSYCLP+T D+GASGSL++G
Sbjct: 245 FGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTT-DSGASGSLVIGNE 303
Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRL 366
SS+FKN TPI YT+M+ NPQL+ FY+LNLTGI +GG +Q + F GGILIDSGTVITRL
Sbjct: 304 SSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITRL 363
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
PS+Y+ALKAEFLKQFSG+P AP SILDTCFNL+ +EV+IP + M FE N ++ VD
Sbjct: 364 APSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDAV 423
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
GI+Y K D SQVCLALASLS E++ IIGNYQQ+NQRVIYD K S++GFA EDCS
Sbjct: 424 GILYMPK-DGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 563 bits (1452), Expect = e-158, Method: Compositional matrix adjust.
Identities = 288/456 (63%), Positives = 343/456 (75%), Gaps = 22/456 (4%)
Query: 35 KKLHLH-KLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSG--KIVDWNEQQQNR 91
K HL KLQ + C+ Q SR E GAI LE+K + CS + DW E+Q
Sbjct: 27 KTFHLQRKLQH-----GTPECLLPQ-SRKEKGAIILEMKDRGECSESERKGDWVEKQ--- 77
Query: 92 LILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIV 150
L+LD LHV+ +Q+ I K S I D S T++PLTSGI+ QTLNYI T+ LG +NM+VIV
Sbjct: 78 LVLDGLHVRSIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYIVTMGLGSQNMSVIV 137
Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS-- 208
DTGSDLTWVQC+PC+SCYNQ P+F PS SPSY+ +LCNS+TC +LE G C S
Sbjct: 138 DTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLEL-----GACGSDP 192
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
S+ C+Y V+YGDGSYT GELG E LG G SV++F+FGCGRNNKGLFGG SGLMGLGR
Sbjct: 193 STSATCDYVVNYGDGSYTSGELGIEKLGFGGISVSNFVFGCGRNNKGLFGGASGLMGLGR 252
Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
S+LS++SQT+ FGG+FSYCLPST AGASGSL++G S VFKN TPI YT M+PN QL+
Sbjct: 253 SELSMISQTNATFGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLS 312
Query: 329 TFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
FYILNLTGI +GG L QAS F GG+++DSGTVI+RL PS+Y ALKA+FL+QFSGFP
Sbjct: 313 NFYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFP 372
Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
SAPGFSILDTCFNL+ Y +VNIP + M FEGNAE+ VD TGI Y VK DAS+VCLALASL
Sbjct: 373 SAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASL 432
Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S E E GIIGNYQQ+NQRV+YD K SQ+GFA E C+
Sbjct: 433 SDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEPCT 468
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 287/460 (62%), Positives = 354/460 (76%), Gaps = 9/460 (1%)
Query: 28 AHCFEGKKKLHLHKLQWQQKSG--SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWN 85
H + KK L +H W K +S+SC S + + TLE+KH+ CSGK +D
Sbjct: 26 VHGVDEKKILSVHNNIWSPKKSYEASTSCFSRSLGK-GRESTTLEMKHRELCSGKTIDLG 84
Query: 86 EQQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
++ + L+LDN+ VQ LQ +IK M S + VS T+IPLTSGI+L++LNYI T+ELGG+
Sbjct: 85 KKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGK 144
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
NM++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK V CNSSTC L AT NSG
Sbjct: 145 NMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSG 204
Query: 205 VCSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
C ++ C Y VSYGDGSYTRG+L E + LG + +F+FGCGRNNKGLFGG S
Sbjct: 205 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSS 264
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
GLMGLGRS +SLVSQT + F G+FSYCLPS +D GASGSL G +SSV+ NST ++YT +
Sbjct: 265 GLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGSLSFGNDSSVYTNSTSVSYTPL 323
Query: 322 IPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
+ NPQL +FYILNLTG SIGG +L++S F +G ILIDSGTVITRLPPSIY A+K EFLKQ
Sbjct: 324 VQNPQLRSFYILNLTGASIGGVELKSSSFGRG-ILIDSGTVITRLPPSIYKAVKIEFLKQ 382
Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
FSGFP+APG+SILDTCFNL++Y++++IP++KM F+GNAE+ VDVTG+ YFVK DAS VCL
Sbjct: 383 FSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCL 442
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
ALASLSYE+E GIIGNYQQKNQRVIYDT +LG GE+C
Sbjct: 443 ALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 559 bits (1440), Expect = e-156, Method: Compositional matrix adjust.
Identities = 286/460 (62%), Positives = 354/460 (76%), Gaps = 9/460 (1%)
Query: 28 AHCFEGKKKLHLHKLQWQQKSG--SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWN 85
H + KK L +H W K +S+SC S + + TLE+KH+ CSGK +D
Sbjct: 26 VHGVDEKKILSVHNNIWSPKKSYEASTSCFSRSLGK-GRESTTLEMKHRELCSGKTIDLG 84
Query: 86 EQQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
++ + L+LDN+ VQ LQ +IK M S + VS T+IPLTSGI+L++LNYI T+ELGG+
Sbjct: 85 KKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGK 144
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
NM++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK V CNSSTC L AT NSG
Sbjct: 145 NMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSG 204
Query: 205 VCSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
C ++ C Y VSYGDGSYTRG+L E + LG + +F+FGCGRNNKGLFGG S
Sbjct: 205 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSS 264
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
GLMGLGRS +SLVSQT + F G+FSYCLPS +D GASGSL G +SSV+ NST ++YT +
Sbjct: 265 GLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGSLSFGNDSSVYTNSTSVSYTPL 323
Query: 322 IPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
+ NPQL +FYILNLTG SIGG +L++S F +G ILIDSGTVITRLPPSIY A+K EFLKQ
Sbjct: 324 VQNPQLRSFYILNLTGASIGGVELKSSSFGRG-ILIDSGTVITRLPPSIYKAVKIEFLKQ 382
Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
FSGFP+APG+SILDTCFNL++Y++++IP++KM F+GNAE+ VDVTG+ YFVK DAS VCL
Sbjct: 383 FSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCL 442
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
ALASLSYE+E GIIGNYQQKNQRVIYD+ +LG GE+C
Sbjct: 443 ALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 276/420 (65%), Positives = 338/420 (80%), Gaps = 6/420 (1%)
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPL 124
+ TLE+KH+ CSGK +D ++ + L+LDN+ VQ LQ +IK M S + VS T+IPL
Sbjct: 17 STTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPL 76
Query: 125 TSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
TSGI+L++LNYI T+ELGG+NM++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK
Sbjct: 77 TSGIKLESLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYK 136
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
V CNSSTC L AT NSG C ++ C Y VSYGDGSYTRG+L E + LG
Sbjct: 137 TVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK 196
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
+ +F+FGCGRNNKGLFGG SGLMGLGRS +SLVSQT + F G+FSYCLPS +D GASGSL
Sbjct: 197 LENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGSL 255
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
G +SSV+ NST ++YT ++ NPQL +FYILNLTG SIGG +L++S F +G ILIDSGT
Sbjct: 256 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRG-ILIDSGT 314
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
VITRLPPSIY A+K EFLKQFSGFP+APG+SILDTCFNL++Y++++IP++KM F+GNAE+
Sbjct: 315 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAEL 374
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
VDVTG+ YFVK DAS VCLALASLSYE+E GIIGNYQQKNQRVIYDT +LG GE+C
Sbjct: 375 EVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 281/468 (60%), Positives = 350/468 (74%), Gaps = 8/468 (1%)
Query: 20 SLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSG 79
S F L G + +G +L W++ + +C+ QK +I G TLE+K ++YCSG
Sbjct: 31 SSFNLGNGDNHEKGLLQL-FQNFPWKEHGEAVVNCI-FQKPKITKGITTLEMKQRDYCSG 88
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIRLQTLNYIAT 138
KI DW + QNR+ILD ++V L S K+ I G +S+++IP++SG RLQTLNYI T
Sbjct: 89 KITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVT 148
Query: 139 IELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
+ +GG+N T+IVDTGSDLTWVQC PC+ CYNQQ+P+F+PS S S+ + CNS TC AL+
Sbjct: 149 VGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQP 208
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
G+SG+CS+ + C+Y + YGDGSY+RGELG E L LGK +++FIFGCGRNNKGLFG
Sbjct: 209 TAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFG 268
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG-NSSVFKNSTPIT 317
G SGLMGL RS+LSLVSQTS +FG +FSYCLP+T G+SGSL LGG + S FKN +PI+
Sbjct: 269 GASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT-GVGSSGSLTLGGADFSNFKNISPIS 327
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GI--LIDSGTVITRLPPSIYSAL 374
YT MI NPQ++ FY LNLTGISIGG L + G+ L+DSGTVITRL PSIY A
Sbjct: 328 YTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAF 387
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
KAEF KQFSG+ + PGFSIL+TCFNL+ Y+EVNIP VK FEGNAEM VDV G+ YFVKS
Sbjct: 388 KAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKS 447
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
DASQ+CLA ASL YED+T IIGNYQQKNQRVIY++K S++GFAGE CS
Sbjct: 448 DASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 533 bits (1374), Expect = e-149, Method: Compositional matrix adjust.
Identities = 266/417 (63%), Positives = 326/417 (78%), Gaps = 6/417 (1%)
Query: 71 LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIR 129
+K ++YCSGKI DW + QNR+ILD ++V L S K+ I G +S+++IP++SG R
Sbjct: 1 MKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGAR 60
Query: 130 LQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
LQTLNYI T+ +GG+N T+IVDTGSDLTWVQC PC+ CYNQQ+P+F+PS S S+ + CN
Sbjct: 61 LQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCN 120
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
S TC AL+ G+SG+CS+ + C+Y + YGDGSY+RGELG E L LGK +++FIFGC
Sbjct: 121 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGC 180
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG-NSS 308
GRNNKGLFGG SGLMGL RS+LSLVSQTS +FG +FSYCLP+T G+SGSL LGG + S
Sbjct: 181 GRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT-GVGSSGSLTLGGADFS 239
Query: 309 VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GI--LIDSGTVITR 365
FKN +PI+YT MI NPQ++ FY LNLTGISIGG L + G+ L+DSGTVITR
Sbjct: 240 NFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITR 299
Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
L PSIY A KAEF KQFSG+ + PGFSIL+TCFNL+ Y+EVNIP VK FEGNAEM VDV
Sbjct: 300 LSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDV 359
Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
G+ YFVKSDASQ+CLA ASL YED+T IIGNYQQKNQRVIY++K S++GFAGE CS
Sbjct: 360 EGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 236/458 (51%), Positives = 313/458 (68%), Gaps = 23/458 (5%)
Query: 44 WQQKSGSSSSCV---SHQKSRIEMGAIT-LELKHKNYCSGKIVDWNEQQQNR-----LIL 94
W ++ G + H+K+ A T LELK + + I D + +R L
Sbjct: 86 WSRRYGDAKLAEMLGEHKKAGAARTATTVLELKRHSLVA--IPDDDPAAHDRYLRRLLAA 143
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELGG-------RNM 146
D Q RI+N + S + E+PLTSGIR QTLNY+ TI LGG N+
Sbjct: 144 DESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANL 203
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHA-LEFATGNSGV 205
TVIVDTGSDLTWVQC+PC +CY Q+DP+FDP+ S +Y V CN+S C A L+ ATG G
Sbjct: 204 TVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKAATGTPGS 263
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
C + C Y ++YGDGS++RG L + + LG AS++ F+FGCG +N+GLFGG +GLMG
Sbjct: 264 CGGGNE-RCYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCGLSNRGLFGGTAGLMG 322
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LGR++LSLVSQT+ +GG+FSYCLP+T ASGSL LGG++S ++N+TP+ YT MI +P
Sbjct: 323 LGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADP 382
Query: 326 QLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF--S 383
FY LN+TG ++GG L A G +LIDSGTVITRL PS+Y ++AEF +QF +
Sbjct: 383 AQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAA 442
Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
G+P+APGFSILDTC++L+ + EV +PL+ + EG AE+TVD G+++ V+ D SQVCLA+
Sbjct: 443 GYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAM 502
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
ASLSYED+T IIGNYQQKN+RV+YDT S+LGFA EDC
Sbjct: 503 ASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 236/478 (49%), Positives = 316/478 (66%), Gaps = 37/478 (7%)
Query: 37 LHLHKLQWQQKSGSSSSCVS---------------HQKSRIEMGAIT--LELKHKNYCS- 78
L L +L +++S +++ S H+K+ GA T LELK + +
Sbjct: 31 LSLRELDGRRRSAATTDTRSSRYYVDAMLAETLGEHKKA----GAATSVLELKRHSLTAI 86
Query: 79 -GKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIA 137
V + + L D Q R + ++ E+PLTSGIRLQTLNY+
Sbjct: 87 PEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRLQTLNYVT 146
Query: 138 TIELGGR------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
TI LGG N+TVIVDTGSDLTWVQC+PC +CY Q+DP+FDP+ S +Y V CN+S
Sbjct: 147 TISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNAS 206
Query: 192 TC-HALEFATGNSGVCSSSSP--PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
C +L ATG G C S+ C Y ++YGDGS++RG L + + LG AS+ F+FG
Sbjct: 207 ACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVFG 266
Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN-- 306
CG +N+GLFGG +GLMGLGR++LSLVSQT+ +GG+FSYCLP+ ASGSL LGG
Sbjct: 267 CGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDD 326
Query: 307 -SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITR 365
+S ++N+TP+ YT MI +P FY LN+TG ++GG L A G +LIDSGTVITR
Sbjct: 327 AASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITR 386
Query: 366 LPPSIYSALKAEFLKQF--SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
L PS+Y A++AEF++QF +G+P+APGFSILDTC++L+ + EV +PL+ + EG A++TV
Sbjct: 387 LAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGADVTV 446
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
D G+++ V+ D SQVCLA+ASLSYEDET IIGNYQQKN+RV+YDT S+LGFA EDC
Sbjct: 447 DAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLGSRLGFADEDC 504
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 228/430 (53%), Positives = 301/430 (70%), Gaps = 8/430 (1%)
Query: 59 KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKN---MISGNIK 115
+SR E GA LEL+H S E+ L D V LQ RI + + S +
Sbjct: 33 RSRAESGATVLELRHHASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAA 92
Query: 116 DVSN-TEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV 174
S ++P+TSG RL+TLNY+AT+ +GG TVIVDT S+LTWVQC+PC +C++QQ+P+
Sbjct: 93 SASKLAQVPVTSGARLRTLNYVATVGIGGGEATVIVDTASELTWVQCEPCDACHDQQEPL 152
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
FDPS SPSY V CNSS+C AL ATG SG P C+Y +SY DGSY+RG L +
Sbjct: 153 FDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDR 212
Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
L L + F+FGCG +N+G FGG SGLMGLGRS LSL+SQT + FGG+FSYCLP ++
Sbjct: 213 LSLAGEDIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP-PKE 271
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG 354
+G+SGSL+LG ++SV++NSTPI YT M+ +P FY+ NLTGI++GG+ +Q+ GF+ GG
Sbjct: 272 SGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGG 331
Query: 355 ---ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
++DSGT+IT L PS+Y+A++AEF+ Q + +P A FSILDTCF+L+ +EV +P +
Sbjct: 332 GGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSL 391
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
K+ F+G AE+ VD G++Y V DASQVCLALASL E +T IIGNYQQKN RVI+DT
Sbjct: 392 KLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVG 451
Query: 472 SQLGFAGEDC 481
SQ+GFA E C
Sbjct: 452 SQIGFAQETC 461
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 236/477 (49%), Positives = 324/477 (67%), Gaps = 30/477 (6%)
Query: 27 GAHCF----EGKKKLHLHK--LQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGK 80
G HC E ++ HL + LQ +Q+ H +SR GA LEL+H ++
Sbjct: 29 GVHCLDLDLEEGRRHHLSRRALQGRQRR-------HHLRSRAVGGATVLELRHHSFSPAP 81
Query: 81 IVDWNEQQQNRLILDNLHVQYLQSRIKN---MISGNIKDV----SNTEIPLTSGIRLQTL 133
E+ L D V LQ RI++ + + +V S ++P++SG RL+TL
Sbjct: 82 ANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVSSGARLRTL 141
Query: 134 NYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
NY+AT+ LGG TVIVDT S+LTWVQC PC+SC++QQ P+FDPS SPSY V C+S +C
Sbjct: 142 NYVATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSC 201
Query: 194 HALE--FATG---NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
AL+ ATG + C + P C+Y +SY DGSY+RG L + L L ++ F+FG
Sbjct: 202 DALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGEVIDGFVFG 261
Query: 249 CGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
CG +N+G FGG SGLMGLGRS LSLVSQT + FGG+FSYCLP ++++ ASGSL+LG +
Sbjct: 262 CGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGDDP 321
Query: 308 SVFKNSTPITYTNMIPN--PQL-ATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVIT 364
S ++NSTP+ YT+M+ N P L FY++NLTGI++GG++++++GF+ I +DSGTVIT
Sbjct: 322 SAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAI-VDSGTVIT 380
Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
L PS+Y+A++AEF+ Q + +P APGFSILDTCFN++ +EV +P + + F+G AE+ VD
Sbjct: 381 SLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVD 440
Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
G++YFV SD+SQVCLA+ASL EDET IIGNYQQKN RV++DT SQ+GFA E C
Sbjct: 441 SGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 242/480 (50%), Positives = 323/480 (67%), Gaps = 26/480 (5%)
Query: 27 GAHCF----EGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIV 82
G HC +G + H H L + H +SR GA LEL+H+++ S
Sbjct: 31 GVHCLHLEEDGSRHRHQHHLSRRALRQGRQRHPHHLRSRAVGGATVLELRHRSFSSAPPA 90
Query: 83 DWNEQQQNRLI-LDNLHVQYLQSRIKN----MISGNIKDVSNT-----EIPLTSGIRLQT 132
E++ + L+ D V LQ RI MI+ + + ++P+TSG +L+T
Sbjct: 91 SSREEEVDGLLSTDAARVSSLQRRIDRYRRLMITSSAEVAVAVAASKAQVPVTSGAKLRT 150
Query: 133 LNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
LNY+AT+ LGG TVIVDT S+LTWVQC PC+SC++QQDP+FDPS SPSY V CNSS+
Sbjct: 151 LNYVATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSS 210
Query: 193 CHALEFATGN----SGVCS--SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
C AL+ ATG + C S C+Y +SY DGSY+RG L + L L ++ F+
Sbjct: 211 CDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEVIDGFV 270
Query: 247 FGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
FGCG +N+G FGG SGLMGLGRS LSLVSQT + FGG+FSYCLP +++ +SGSL++G
Sbjct: 271 FGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLP-LKESDSSGSLVIGD 329
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG----ILIDSGT 361
+SSV++NSTPI Y +M+ +P FY +NLTGI++GG+++++SGF+ GG +IDSGT
Sbjct: 330 DSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGT 389
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
VIT L PSIY+A+KAEFL QF+ +P APGFSILDTCFN++ +EV +P +K+ F+G E+
Sbjct: 390 VITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREVQVPSLKLVFDGGVEV 449
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
VD G++YFV SD+SQVCLA+A L E ET IIGNYQQKN RVI+DT SQ+GFA E C
Sbjct: 450 EVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 229/435 (52%), Positives = 300/435 (68%), Gaps = 14/435 (3%)
Query: 57 HQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL------DNLHVQYLQSRIKN-- 108
H +SR E GA LEL+H G + + L D V LQ R
Sbjct: 40 HLRSRAESGATILELRHHGGGGGGGSGKSGGRSREEELGGLFSSDAARVSSLQRRAGGGS 99
Query: 109 -MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSC 167
+ +P+TSG RL+TLNY+AT+ LGG TVIVDT S+LTWVQC PC SC
Sbjct: 100 WAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEATVIVDTASELTWVQCAPCASC 159
Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYT 226
++QQ P+FDP+ SPSY + CNSS+C AL+ ATG++ P C+Y +SY DGSY+
Sbjct: 160 HDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYS 219
Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
+G L + L L ++ F+FGCG +N+G FGG SGLMGLGRS LSL+SQT + FGG+FS
Sbjct: 220 QGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFS 279
Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
YCLP +++ +SGSL+LG ++SV++NSTPI YT M+ +P FY +NLTGI+IGG++++
Sbjct: 280 YCLP-LKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE 338
Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
+S G +++DSGT+IT L PS+Y+A+KAEFL QF+ +P APGFSILDTCFNL+ ++EV
Sbjct: 339 SSA---GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREV 395
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
IP +K FEGN E+ VD +G++YFV SD+SQVCLALASL E ET IIGNYQQKN RVI
Sbjct: 396 QIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVI 455
Query: 467 YDTKNSQLGFAGEDC 481
+DT SQ+GFA E C
Sbjct: 456 FDTLGSQIGFAQETC 470
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 229/435 (52%), Positives = 300/435 (68%), Gaps = 14/435 (3%)
Query: 57 HQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL------DNLHVQYLQSRIKN-- 108
H +SR E GA LEL+H G + + L D V LQ R
Sbjct: 39 HLRSRAESGATILELRHHGGGGGGGSGKSGGRSREEELGGLFSSDAARVSSLQRRAGGGS 98
Query: 109 -MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSC 167
+ +P+TSG RL+TLNY+AT+ LGG TVIVDT S+LTWVQC PC SC
Sbjct: 99 WAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEATVIVDTASELTWVQCAPCASC 158
Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYT 226
++QQ P+FDP+ SPSY + CNSS+C AL+ ATG++ P C+Y +SY DGSY+
Sbjct: 159 HDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYS 218
Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
+G L + L L ++ F+FGCG +N+G FGG SGLMGLGRS LSL+SQT + FGG+FS
Sbjct: 219 QGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFS 278
Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
YCLP +++ +SGSL+LG ++SV++NSTPI YT M+ +P FY +NLTGI+IGG++++
Sbjct: 279 YCLP-LKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE 337
Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
+S G +++DSGT+IT L PS+Y+A+KAEFL QF+ +P APGFSILDTCFNL+ ++EV
Sbjct: 338 SSA---GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREV 394
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
IP +K FEGN E+ VD +G++YFV SD+SQVCLALASL E ET IIGNYQQKN RVI
Sbjct: 395 QIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVI 454
Query: 467 YDTKNSQLGFAGEDC 481
+DT SQ+GFA E C
Sbjct: 455 FDTLGSQIGFAQETC 469
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 229/441 (51%), Positives = 303/441 (68%), Gaps = 19/441 (4%)
Query: 59 KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNR-------LILDNLHVQYLQSRIKNMIS 111
+SR E G+ LEL+H S + +R L D V LQ RI++ S
Sbjct: 32 RSRTESGSTILELRHHISSSFSPGPNRPSKTSRGEVDGGVLSSDAARVSSLQRRIESYRS 91
Query: 112 GNIKDVSNT-----EIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKS 166
+ + ++P+TSG L+TLNY+AT+ LG TV+VDT S+LTWVQCQPC+S
Sbjct: 92 SSEGEEEEASKLALQVPITSGANLRTLNYVATVGLGAAEATVVVDTASELTWVQCQPCES 151
Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSP--PDCNYFVSYGDG 223
C++QQDP+FDPS SPSY V CNSS+C AL A + C+ + P C+Y +SY DG
Sbjct: 152 CHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDG 211
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFG 282
SY+RG L R+ L L + F+FGCG +N+G FGG SGLMGLGRS +SLVSQT + FG
Sbjct: 212 SYSRGVLARDKLRLAGQDIEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFG 271
Query: 283 GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN--PQLATFYILNLTGISI 340
G+FSYCLP +++G+SGSL+LG +SS ++NSTPI YT M+ + P FY LNLTGI++
Sbjct: 272 GVFSYCLP-MRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITV 330
Query: 341 GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
GG+++++ F+ G ++IDSGT+IT L PS+Y+A++AEFL Q + +P AP FSILDTCFNL
Sbjct: 331 GGQEVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCFNL 390
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
+ +EV +P +K FEG+ E+ VD G++YFV SDASQVCLALASL E +T IIGNYQQ
Sbjct: 391 TGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQ 450
Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
KN RVI+DT SQ+GFA E C
Sbjct: 451 KNLRVIFDTLGSQIGFAQETC 471
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 213/287 (74%), Positives = 240/287 (83%), Gaps = 2/287 (0%)
Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKG 255
+ +GNSGVC S++P CNY ++YGDGS+TRGELG E L G V DFIFGCGRNNKG
Sbjct: 116 IPVTSGNSGVCGSAAP-ICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKG 174
Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
LFGGVSGLMGLGRSDLSL+SQTS IFGG+FSYCLPST+ G SGSLILGGNSSV++NS+P
Sbjct: 175 LFGGVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSP 233
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALK 375
I+Y MI NPQL FY +NLTGISIGG LQA IL+DSGTVITRLPP+IY ALK
Sbjct: 234 ISYAKMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALK 293
Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
AEFLKQF+GFP AP FSILDTCFNLSAYQEV+IP +KM FEGNAE+TVDVTG+ YFVKSD
Sbjct: 294 AEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD 353
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
ASQVCLALASL Y+DE I+GNYQQKN RVIYDTK +++GFA E CS
Sbjct: 354 ASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/98 (52%), Positives = 68/98 (69%), Gaps = 2/98 (2%)
Query: 30 CFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQ 89
C E K+ L L K+Q + +S + +SC S QKSR EMGA LE+KH+++CSG DWNE+ Q
Sbjct: 26 CLEEKRVLSLQKVQPKLQS-TDTSCFS-QKSRREMGATILEMKHRDHCSGVTRDWNEKLQ 83
Query: 90 NRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
RL +D V+ LQSRIK + N +DVSN +IP+TSG
Sbjct: 84 KRLTMDAERVKSLQSRIKRTVPSNTEDVSNAQIPVTSG 121
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 240/493 (48%), Positives = 314/493 (63%), Gaps = 53/493 (10%)
Query: 37 LHLHKLQWQQKSGSSSSCVSHQKSRIE------------------MGAITLELKHKNYCS 78
L L +LQW GSS Q R E LELKH + +
Sbjct: 36 LQLRELQW----GSSGQVRYSQSKRFEKKMTGEHKKAAAAARTRTRSTTVLELKHHSLTA 91
Query: 79 GKIVDWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT-------EIPLTSG 127
I D Q+ RL+ D LQ R K + + K + E+PLTSG
Sbjct: 92 --IPDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAEVPLTSG 149
Query: 128 IRLQTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
IR QTLNY+ TI LGG N+TVIVDTGSDLTWVQC+PC CY Q+DP+FDPS
Sbjct: 150 IRFQTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSG 209
Query: 180 SPSYKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSYGDGSYTRGELG 231
S SY V CN+S C A L+ ATG G C++ C Y ++YGDGS++RG L
Sbjct: 210 SASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLA 269
Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
+ + LG ASV+ F+FGCG +N+GLFGG +GLMGLGR++LSLVSQT+ FGG+FSYCLP+
Sbjct: 270 TDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPA 329
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
A+GSL LGG++S ++N+TP++YT MI +P FY +N+TG S+GG + A+G
Sbjct: 330 ATSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLG 389
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIP 409
+L+DSGTVITRL PS+Y A++AEF +QF +P+AP FS+LD C+NL+ + EV +P
Sbjct: 390 AANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVP 449
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
L+ + EG A+MTVD G+++ + D SQVCLA+ASLS+ED+T IIGNYQQKN+RV+YDT
Sbjct: 450 LLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDT 509
Query: 470 KNSQLGFAGEDCS 482
S+LGFA EDCS
Sbjct: 510 VGSRLGFADEDCS 522
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 238/490 (48%), Positives = 314/490 (64%), Gaps = 46/490 (9%)
Query: 37 LHLHKLQW---------QQKSGSSSSCVSHQKSRIEMGAI-----TLELKHKNYCSGKIV 82
L L +LQW Q K H+K+ LELKH + + I
Sbjct: 36 LQLRELQWGSSGQVRYSQSKHFEKKMTGEHKKAAAAARTRTRSTTVLELKHHSLTA--IP 93
Query: 83 DWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT--------EIPLTSGIRL 130
D Q+ RL+ D LQ R K + + K + E+PLTSGIR
Sbjct: 94 DHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAAGAEVPLTSGIRF 153
Query: 131 QTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
QTLNY+ TI LGG N+TVIVDTGSDLTWVQC+PC CY Q+DP+FDPS S S
Sbjct: 154 QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSAS 213
Query: 183 YKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSYGDGSYTRGELGREH 234
Y V CN+S C A L+ ATG G C++ C Y ++YGDGS++RG L +
Sbjct: 214 YAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDT 273
Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
+ LG ASV+ F+FGCG +N+GLFGG +GLMGLGR++LSLVSQT+ FGG+FSYCLP+
Sbjct: 274 VALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATS 333
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG 354
A+GSL LGG++S ++N+TP++YT MI +P FY +N+TG S+GG + A+G
Sbjct: 334 GDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN 393
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVK 412
+L+DSGTVITRL PS+Y A++AEF +QF +P+AP FS+LD C+NL+ + EV +PL+
Sbjct: 394 VLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLT 453
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ EG A+MTVD G+++ + D SQVCLA+ASLS+ED+T IIGNYQQKN+RV+YDT S
Sbjct: 454 LRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGS 513
Query: 473 QLGFAGEDCS 482
+LGFA EDCS
Sbjct: 514 RLGFADEDCS 523
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 207/274 (75%), Positives = 232/274 (84%), Gaps = 2/274 (0%)
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG 259
+GNSGVC S++P CNY ++YGDGS+TRGELG E L G V DFIFGCGRNNKGLFGG
Sbjct: 63 SGNSGVCGSAAPI-CNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGG 121
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
VSGLMGLGRSDLSL+SQTS IFGG+FSYCLPST+ G SGSLILGGNSSV++NS+PI+Y
Sbjct: 122 VSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSPISYA 180
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
MI NPQL FY +NLTGISIGG LQA IL+DSGTVITRLPP+IY ALKAEFL
Sbjct: 181 KMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFL 240
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
KQF+GFP AP FSILDTCFNLSAYQEV+IP +KM FEGNAE+TVDVTG+ YFVKSDASQV
Sbjct: 241 KQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQV 300
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
CLALASL Y+DE I+GNYQQKN RVIYDTK ++
Sbjct: 301 CLALASLEYQDEVAILGNYQQKNLRVIYDTKETK 334
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 46/64 (71%)
Query: 64 MGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP 123
MGA LE+KH+++CSG DWNE+ Q RL +D V+ LQSRIK + N +DVSN +IP
Sbjct: 1 MGATILEMKHRDHCSGVTRDWNEKLQKRLTMDAERVKSLQSRIKRTVPSNTEDVSNAQIP 60
Query: 124 LTSG 127
+TSG
Sbjct: 61 VTSG 64
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 227/434 (52%), Positives = 297/434 (68%), Gaps = 28/434 (6%)
Query: 69 LELKHKNYCSGKIVDWNEQQQNR-----LILDNLHVQYLQSRIKN---MISGNIKDVSNT 120
LELKH + S V + + R L D+ LQ R + +
Sbjct: 108 LELKH--HSSTATVPDHPAARERYLKHLLAADSARAASLQLRKPKPASSTTTTQASAAAA 165
Query: 121 EIPLTSGIRLQTLNYIATIELGG---RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVF 175
E+PL SGIR QTLNY+ TI LGG +N+TVIVDTGSDLTWVQC+PC SCY Q+DP+F
Sbjct: 166 EVPLGSGIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLF 225
Query: 176 DPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSS---SPPDCNYFVSYGDGSYTRGELG 231
DP+ SP++ V C S C A L+ ATG G C+ S S C Y +SYGDGS++RG L
Sbjct: 226 DPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLA 285
Query: 232 REHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
++ LGLG + ++ F+FGCG +N+GLFGG +GLMGLGR+DLSLVSQT+ FGG+FSYCLP
Sbjct: 286 QDTLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLP 345
Query: 291 STQDAGASGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ-LQAS 348
+T + +GSL LG G SS F N + YT MI +P FY +N+TG ++GG L A
Sbjct: 346 ATTTS--TGSLSLGPGPSSSFPN---MAYTRMIADPTQPPFYFINITGAAVGGGAALTAP 400
Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI 408
GF G +L+DSGTVITRL PS+Y A++AEF ++F +P+APGFSILD C++L+ EVN+
Sbjct: 401 GFGAGNVLVDSGTVITRLAPSVYKAVRAEFARRFE-YPAAPGFSILDACYDLTGRDEVNV 459
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
PL+ + EG A++TVD G+++ V+ D SQVCLA+ASL YED+T IIGNYQQ+N+RV+YD
Sbjct: 460 PLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYD 519
Query: 469 TKNSQLGFAGEDCS 482
T S+LGFA EDC+
Sbjct: 520 TVGSRLGFADEDCT 533
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 204/340 (60%), Positives = 259/340 (76%), Gaps = 11/340 (3%)
Query: 2 VTKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSR 61
+ KVK L ++SL L ++A G FE KK +L LQ +Q+ GS C+ H +SR
Sbjct: 21 MVKVKALLLVSLCL-------IIANGVSSFEEKKVFNLQILQRKQQLGSLG-CL-HPESR 71
Query: 62 IEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE 121
E GAI LE+K ++YCS K V+W+ + N+L LD+LHV+ +Q+R++ M+S + +VS +
Sbjct: 72 QEKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQ 131
Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
IPL SG+ QTLNYI T+ELGG++MTVI+DTGSDLTWVQC+PC SCYNQQ PVF PS S
Sbjct: 132 IPLASGVNFQTLNYIVTMELGGQDMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSS 191
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
SY+ + CNSSTC +L+ TGN+G C S+P +C+Y V+YGDGSYT GELG EHL G S
Sbjct: 192 SYQSIPCNSSTCQSLQLTTGNAGAC-ESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGIS 250
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V++F+FGCG+NNKGLFGGVSGLMGLGRS+LSL+SQT+ FGG+FSYCLP T DAGASGSL
Sbjct: 251 VSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPT-DAGASGSL 309
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
+G SSVFKN TPI YT M+PNPQL+ FY+LNLTGI +G
Sbjct: 310 AMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 362 bits (928), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 206/444 (46%), Positives = 275/444 (61%), Gaps = 52/444 (11%)
Query: 69 LELKHKNYCSGKIVDWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT---- 120
LELKH + + I D Q+ RL+ D LQ R K + + K +
Sbjct: 27 LELKHHSLTA--IPDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAA 84
Query: 121 ----EIPLTSGIRLQTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCY 168
E+PLTSGIR QTLNY+ TI LGG N+TVIVDTGSDLTWVQC+PC CY
Sbjct: 85 AAGAEVPLTSGIRFQTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCY 144
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSY 220
Q+DP+FDPS S SY V CN+S C A L+ ATG G C++ C Y ++Y
Sbjct: 145 AQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAY 204
Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
GDGS++RG L + + LG ASV+ F+FGCG +N+GL R + S T+
Sbjct: 205 GDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGL----------RRPGSAASSPTAS- 253
Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
P A+GSL LGG++S ++N+TP++YT MI +P FY +N+TG S+
Sbjct: 254 ---------PPGTSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASV 304
Query: 341 GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCF 398
GG + A+G +L+DSGTVITRL PS+Y A++AEF +QF +P+AP FS+LD C+
Sbjct: 305 GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACY 364
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
NL+ + EV +PL+ + E A+MTVD G+++ + D SQVCLA+ASLS+ED+T IIGNY
Sbjct: 365 NLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNY 424
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
QQKN+RV+YDT S+LGFA EDCS
Sbjct: 425 QQKNKRVVYDTVGSRLGFADEDCS 448
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 357 bits (917), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 182/283 (64%), Positives = 202/283 (71%), Gaps = 45/283 (15%)
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG 259
+GNSGVC S++P CNY ++YGDGS+TRGELG E L G V DFIFGCGRNNKGLFGG
Sbjct: 120 SGNSGVCGSAAPI-CNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGG 178
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
VSGLMGLGRSDLSL+SQTSE
Sbjct: 179 VSGLMGLGRSDLSLISQTSE---------------------------------------- 198
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
NPQL FY +NLTGISIGG LQA IL+DSGTVITRLPP+IY ALKAEFL
Sbjct: 199 ----NPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFL 254
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
KQF+GFP AP FSILDTCFNLSAYQEV+IP +KM FEGNAE+TVDVTG+ YFVKSDASQV
Sbjct: 255 KQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQV 314
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
CLALASL Y+DE I+GNYQQKN RVIYDTK +++GFA E CS
Sbjct: 315 CLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/98 (52%), Positives = 68/98 (69%), Gaps = 2/98 (2%)
Query: 30 CFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQ 89
C E K+ L L K+Q + +S + +SC S QKSR EMGA LE+KH+++CSG DWNE+ Q
Sbjct: 26 CLEEKRVLSLQKVQPKLQS-TDTSCFS-QKSRREMGATILEMKHRDHCSGVTRDWNEKLQ 83
Query: 90 NRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
RL +D V+ LQSRIK + N +DVSN +IP+TSG
Sbjct: 84 KRLTMDAERVKSLQSRIKRTVPSNTEDVSNAQIPVTSG 121
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 341 bits (875), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 211/493 (42%), Positives = 273/493 (55%), Gaps = 100/493 (20%)
Query: 37 LHLHKLQWQQKSGSSSSCVSHQKSRIE------------------MGAITLELKHKNYCS 78
L L +LQW GSS Q R E LELKH + +
Sbjct: 36 LQLRELQW----GSSGQVRYSQSKRFEKKMTGEHKKAAAAARTRTRSTTVLELKHHSLTA 91
Query: 79 GKIVDWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT-------EIPLTSG 127
I D Q+ RL+ D LQ R K + + K + E+PLTSG
Sbjct: 92 --IPDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAEVPLTSG 149
Query: 128 IRLQTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
IR QTLNY+ TI LGG N+TVIVDTGSDLTWVQC+PC CY Q+DP+FDPS
Sbjct: 150 IRFQTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSG 209
Query: 180 SPSYKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSYGDGSYTRGELG 231
S SY V CN+S C A L+ ATG G C++ C Y ++YGDGS++RG L
Sbjct: 210 SASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLA 269
Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
+ + LG ASV+ F+FGCG +N+GLFGG +GLMGLG P
Sbjct: 270 TDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLG----------------------PD 307
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
AG +P+ FY +N+TG S+GG + A+G
Sbjct: 308 GALAG-------------------------LPDGAPPPFYFMNVTGASVGGAAVAAAGLG 342
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIP 409
+L+DSGTVITRL PS+Y A++AEF +QF +P+AP FS+LD C+NL+ + EV +P
Sbjct: 343 AANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVP 402
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
L+ + EG A+MTVD G+++ + D SQVCLA+ASLS+ED+T IIGNYQQKN+RV+YDT
Sbjct: 403 LLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDT 462
Query: 470 KNSQLGFAGEDCS 482
S+LGFA EDCS
Sbjct: 463 VGSRLGFADEDCS 475
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 337 bits (864), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 204/490 (41%), Positives = 286/490 (58%), Gaps = 28/490 (5%)
Query: 7 PLTILSLLLPLMVSLFLLAKGAHCFEGKKKL-----HLHKLQWQQKSGSSSSCVSHQKSR 61
P++ + LL L+ S L +K F+G+K LH + SS V +
Sbjct: 4 PISTIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSS---VCSPSPK 60
Query: 62 IEMGAITLELKHKNYCSGKIVDWNEQQQNR---LILDNLHVQYLQSRI-KNMISGNIKDV 117
+ +LE+ HK+ K+ + +R L D V ++SR+ KN G
Sbjct: 61 GDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKG 120
Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPV 174
S +P SG + T NY+ T+ LG R++T I DTGSDLTW QC+PC + CY+QQ+P+
Sbjct: 121 SKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPI 180
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
F+PS S SY + C+S TC L+ TGNS CS+S+ C Y + YGD SY+ G ++
Sbjct: 181 FNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAST---CVYGIQYGDQSYSVGFFAQDK 237
Query: 235 LGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
L L V N+F+FGCG+NN+GLF GV+GL+GLGR+ LSLVSQT++ +G LFSYCLPST
Sbjct: 238 LALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPST- 296
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA 351
+ ++G L G S + +T + N Q +FY LNL IS+GG++L AS F+
Sbjct: 297 -SSSTGYLTFGSGGGT---SKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFS 352
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
G +IDSGTVI+RLPP+ YS L+A F +Q S +P A SILDTC++ S Y V++P +
Sbjct: 353 TAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKI 412
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ F AEM +D +GI Y + + SQVCLA A S + I+GN QQK V+YD
Sbjct: 413 NLYFSDGAEMDLDPSGIFYIL--NISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAG 470
Query: 472 SQLGFAGEDC 481
++GFA C
Sbjct: 471 GRIGFAPGGC 480
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 199/489 (40%), Positives = 281/489 (57%), Gaps = 24/489 (4%)
Query: 7 PLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKS------GSSSSCVSHQKS 60
P++ + LL L+ + L K EG++ H +Q + SS+C K
Sbjct: 11 PISTICLLRFLLYASLLSLKSGFAIEGRESAESHHVQPIHHNVHITSLMPSSACSPSPKG 70
Query: 61 RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRI-KNMISGNIKDVS 118
+ ++ + KH + N +++ D V +QSR+ KN+ G+ S
Sbjct: 71 HDQRASLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKAS 130
Query: 119 NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVF 175
+P S L + NY+ T+ LG R++T I DTGSDLTW QC+PC CY Q++ +F
Sbjct: 131 KATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIF 190
Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
DPS S SY V C+S +C LE ATGNS CSSS+ C Y + YGDGSY+ G RE L
Sbjct: 191 DPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSST---CLYGIRYGDGSYSIGFFAREKL 247
Query: 236 GLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
L V N+F FGCG+NN+GLFGG +GL+GL R+ LSLVSQT++ +G +FSYCLPS+
Sbjct: 248 SLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSS 307
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK 352
+ +G L G +S + +T N +FY L++ GIS+G ++L S F+
Sbjct: 308 S--TGYLSFGSGDG---DSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFST 362
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
G +IDSGTVI+RLPP++YS+++ F + S +P G SILDTC++LS Y+ V +P +
Sbjct: 363 AGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKII 422
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F G AEM + GI+Y +K SQVCLA A S +DE IIGN QQK V+YD
Sbjct: 423 LYFSGGAEMDLAPEGIIYVLK--VSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEG 480
Query: 473 QLGFAGEDC 481
++GFA C
Sbjct: 481 RVGFAPSGC 489
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 209/514 (40%), Positives = 290/514 (56%), Gaps = 49/514 (9%)
Query: 1 MVTKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKL--HLHKLQWQQKSGSSSSCVSHQ 58
M T+ L S L++ F + K +H E K+ + H H LQ SSSC +
Sbjct: 6 MATRSYFLLFSSFTFLLILLSFPVEK-SHALEAKETIESHFHTLQLTSLL-PSSSCNTAT 63
Query: 59 KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL------DNLHVQYLQSRIKNM--- 109
K + GA +LE+ ++ G N++ L D V +Q+R+ +
Sbjct: 64 KGK-RRGA-SLEVVNRQ---GPCTQLNQKGAKAPTLTEILAHDQARVDSIQARVTDQSYD 118
Query: 110 ----------ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLT 157
S +P SG+ L T NYI + LG +++++I DTGSDLT
Sbjct: 119 LFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLT 178
Query: 158 WVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNY 216
W QCQPC KSCY QQ P+FDPS S +Y + C S+ C L+ ATGNS CSSS +C Y
Sbjct: 179 WTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSPGCSSS---NCVY 235
Query: 217 FVSYGDGSYTRGELGREHLGLGKASVND-FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVS 275
+ YGD S+T G ++ L L + V D F+FGCG+NN+GLFG +GL+GLGR LS+V
Sbjct: 236 GIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQ 295
Query: 276 QTSEIFGGLFSYCLPSTQDAGASGSLILG-GN----SSVFKNSTPITYTNMIPNPQLATF 330
QT++ FG FSYCLP+++ G++G L G GN S KN IT+T + Q ATF
Sbjct: 296 QTAQKFGKYFSYCLPTSR--GSNGHLTFGNGNGVKTSKAVKNG--ITFTPF-ASSQGATF 350
Query: 331 YILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
Y +++ GIS+GGK L S F G +IDSGTVITRLP ++Y +LK+ F + S +P+A
Sbjct: 351 YFIDVLGISVGGKALSISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTA 410
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
P S+LDTC++LS Y ++IP + F GNA + ++ GI+ + + ASQVCLA A
Sbjct: 411 PALSLLDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGIL--ITNGASQVCLAFAGNGD 468
Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+D GI GN QQ+ V+YD QLGF + CS
Sbjct: 469 DDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 315 bits (806), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 177/373 (47%), Positives = 238/373 (63%), Gaps = 21/373 (5%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDP 177
+P SG+ L T NYI + LG +++++I DTGSDLTW QCQPC KSCY QQ P+FDP
Sbjct: 140 NLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDP 199
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S S +Y + C S+ C +L+ ATGNS CSSS +C Y + YGD S+T G ++ L L
Sbjct: 200 STSKTYSNISCTSAACSSLKSATGNSPGCSSS---NCVYGIQYGDSSFTIGFFAKDKLTL 256
Query: 238 GKASVND-FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
+ V D F+FGCG+NNKGLFG +GL+GLGR LS+V QT++ FG FSYCLP+++ G
Sbjct: 257 TQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSR--G 314
Query: 297 ASGSLILG-GN----SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG-- 349
++G L G GN S KN IT+T + Q +Y +++ GIS+GGK L S
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNG--ITFTPF-ASSQGTAYYFIDVLGISVGGKALSISPML 371
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
F G +IDSGTVITRLP + Y +LK+ F + S +P+AP S+LDTC++LS Y ++IP
Sbjct: 372 FQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIP 431
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ F GNA + +D GI+ + + ASQVCLA A +D GI GN QQ+ V+YD
Sbjct: 432 KISFNFNGNANVELDPNGIL--ITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDV 489
Query: 470 KNSQLGFAGEDCS 482
QLGF + CS
Sbjct: 490 AGGQLGFGYKGCS 502
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 311 bits (797), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 152/234 (64%), Positives = 189/234 (80%), Gaps = 1/234 (0%)
Query: 71 LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
+K + +CS K +DWN + Q +LILD+L V+ +Q+RI+ + S + + S T+IPL+SGI L
Sbjct: 1 MKDRGHCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINL 60
Query: 131 QTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
QTLNYI T+ LG +NMTVI+DT SDLTWVQC+PC SCYNQQ P+F PS S SY+ V CNS
Sbjct: 61 QTLNYIVTMGLGSKNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
STC +L+FATGN+G C SS+P CNY V+YGDGSYT G+LG E L G SV+DF+FGCG
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGVSVSDFVFGCG 180
Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
RNNKGLFGGVSGLMGLGRS LSLVSQT+ FGG+FSYCLP+T +AG+SGSL++G
Sbjct: 181 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT-EAGSSGSLVMG 233
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 308 bits (788), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 188/489 (38%), Positives = 267/489 (54%), Gaps = 38/489 (7%)
Query: 11 LSLLLPLMVSLFLLAKG-AHCFEGKKKLHLHKLQWQQKSGSSSSC-VSHQKSRIEMGAIT 68
+ ++ LM+ L+ A E + + L+W+ K + C S +
Sbjct: 13 IRVVAALMLQCLLMGSSTALDHENYHTISVDILKWKWKPPGFAKCPASFAGQEALKPGVK 72
Query: 69 LELKH--------KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
+ L H + S +D Q +R DN + + S KN +G +SN
Sbjct: 73 IRLDHIHGACSPLRPINSSSWIDMVSQSFDR---DNDRLNTIWS--KN--NGTYSTMSN- 124
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+PL G ++ T NYI T G +N +I+DTGSD+TW+QC+PC CY+Q DP+F+P
Sbjct: 125 -LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQ 183
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S SYK + C SS C E T N C C Y ++YGDGS ++G+ +E L LG
Sbjct: 184 QSSSYKHLSCLSSAC--TELTTMNH--CRLGG---CVYEINYGDGSRSQGDFSQETLTLG 236
Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
S F FGCG N GLF G +GL+GLGR+ LS SQT +GG FSYCLP + ++
Sbjct: 237 SDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTST 296
Query: 299 GSLILGGNSSVFKNSTPIT--YTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG 354
GS +G + S P T + ++ N +FY + L GIS+GG++L + +GG
Sbjct: 297 GSFSVG------QGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGG 350
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
++DSGTVITRL P Y ALK F + PSA FSILDTC++LS+Y +V IP +
Sbjct: 351 TIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFH 410
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F+ NA++ V GI++ ++SD SQVCLA AS S T IIGN+QQ+ RV +DT ++
Sbjct: 411 FQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRI 470
Query: 475 GFAGEDCSS 483
GFA C++
Sbjct: 471 GFAPGSCAT 479
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 168/398 (42%), Positives = 241/398 (60%), Gaps = 21/398 (5%)
Query: 94 LDNLHVQYLQSRIKNMISG--NIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
LDN V+Y+QSR+ + G +K++ +T +P SG + + +Y + LG R++++I
Sbjct: 97 LDNERVKYIQSRLSKNLGGENRVKELDSTTLPAKSGRLIGSADYYVVVGLGTPKRDLSLI 156
Query: 150 VDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
DTGS LTW QC+PC SCY QQDP+FDPS S SY + C SS C S CSS
Sbjct: 157 FDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCTQFR-----SAGCSS 211
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLG 267
S+ C Y V YGD S +RG L +E L + V+DF+FGCG++N+GLF G +GLMGL
Sbjct: 212 STDASCIYDVKYGDNSISRGFLSQERLTITATDIVHDFLFGCGQDNEGLFRGTAGLMGLS 271
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
R +S V QTS I+ +FSYCLPST + G L G +++ N + YT
Sbjct: 272 RHPISFVQQTSSIYNKIFSYCLPSTPSS--LGHLTFGASAATNAN---LKYTPFSTISGE 326
Query: 328 ATFYILNLTGISIGGKQLQA---SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+FY L++ GIS+GG +L A S F+ GG +IDSGTVITRLPP+ Y+AL++ F +
Sbjct: 327 NSFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLPPTAYAALRSAFRQFMMK 386
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+P A G +LDTC++ S Y+E+++P + EF G ++ + + GI+Y A Q+CLA A
Sbjct: 387 YPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAGGVKVELPLVGILY--GESAQQLCLAFA 444
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ ++ I GN QQK V+YD + ++GF C+
Sbjct: 445 ANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGCN 482
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 305 bits (780), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 179/445 (40%), Positives = 262/445 (58%), Gaps = 21/445 (4%)
Query: 48 SGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNR---LILDNLHVQYLQS 104
S S+ S + H + +L + H++ ++ + + L LD V + S
Sbjct: 13 SKSALSSLHHHHLVFFLPESSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHS 72
Query: 105 RI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
++ K + + ++ + +T++P G L + NYI T+ LG ++++I DTGSDLTW QC
Sbjct: 73 KLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQC 132
Query: 162 QPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
QPC ++CY+Q++P+F+PS S SY V C+S+ C +L ATGN+G CS+S +C Y + Y
Sbjct: 133 QPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS---NCIYGIQY 189
Query: 221 GDGSYTRGELGREHLGLGKASVNDFI-FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSE 279
GD S++ G L +E L + V D + FGCG NN+GLF GV+GL+GLGR LS SQT+
Sbjct: 190 GDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTAT 249
Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
+ +FSYCLPS+ A +G L G S+ S T + I + +FY LN+ I+
Sbjct: 250 AYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDG--TSFYGLNIVAIT 303
Query: 340 IGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
+GG++L ++ F+ G LIDSGTVITRLPP Y+AL++ F + S +P+ G SILDTC
Sbjct: 304 VGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 363
Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
F+LS ++ V IP V F G A + + GI Y K SQVCLA A S + I GN
Sbjct: 364 FDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFAGNSDDSNAAIFGN 421
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
QQ+ V+YD ++GFA CS
Sbjct: 422 VQQQTLEVVYDGAGGRVGFAPNGCS 446
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 305 bits (780), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 173/398 (43%), Positives = 245/398 (61%), Gaps = 18/398 (4%)
Query: 92 LILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTV 148
L LD V + S++ K + + ++ + +T++P G L + NYI T+ LG ++++
Sbjct: 88 LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSL 147
Query: 149 IVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
I DTGSDLTW QCQPC ++CY+Q++P+F+PS S SY V C+S+ C +L ATGN+G CS
Sbjct: 148 IFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCS 207
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-FGCGRNNKGLFGGVSGLMGL 266
+S +C Y + YGD S++ G L +E L + V D + FGCG NN+GLF GV+GL+GL
Sbjct: 208 AS---NCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGL 264
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
GR LS SQT+ + +FSYCLPS+ A +G L G S+ S T + I +
Sbjct: 265 GRDKLSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDG- 319
Query: 327 LATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+FY LN+ I++GG++L ++ F+ G LIDSGTVITRLPP Y+AL++ F + S
Sbjct: 320 -TSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSK 378
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+P+ G SILDTCF+LS ++ V IP V F G A + + GI Y K SQVCLA A
Sbjct: 379 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFA 436
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S + I GN QQ+ V+YD ++GFA CS
Sbjct: 437 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 173/422 (40%), Positives = 245/422 (58%), Gaps = 18/422 (4%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG--NIKDVSNTEIPLT 125
+LE+ H++ G V L+ D V ++ S+I + ++ T+IP
Sbjct: 62 SLEVIHRHGPCGDEVSNAPTAAEMLVKDQSRVDFIHSKIAGELESVDRLRGSKATKIPAK 121
Query: 126 SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPS 182
SG + + NYI ++ LG + +++I DTGSDLTW QCQPC + CYNQ+DPVF PS S +
Sbjct: 122 SGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTT 181
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV 242
Y + C+S C LE TGN CS++ C Y + YGD S++ G +E L L V
Sbjct: 182 YSNISCSSPDCSQLESGTGNQPGCSAARA--CIYGIQYGDQSFSVGYFAKETLTLTSTDV 239
Query: 243 -NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
+F+FGCG+NN+GLFG +GL+GLG+ +S+V QT++ +G +FSYCLP T + +
Sbjct: 240 IENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTF 299
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDS 359
GG K TPIT + + N FY +++ G+ +GG Q+ +S F+ G +IDS
Sbjct: 300 GGGGGGGALKY-TPITKAHGVAN-----FYGVDIVGMKVGGTQIPISSSVFSTSGAIIDS 353
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
GTVITRLPP YSALK+ F K + +P AP SILDTC++LS Y + IP V F+G
Sbjct: 354 GTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGE 413
Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
E+ +D GI+Y + SQVCLA A IIGN QQK +V+YD ++GF
Sbjct: 414 ELDLDGIGIMY--GASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYN 471
Query: 480 DC 481
C
Sbjct: 472 GC 473
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 169/415 (40%), Positives = 257/415 (61%), Gaps = 24/415 (5%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNR------LILDNLHVQYLQSRI-KNM-ISGNIKDVSN 119
+LE+ HK+ ++ + + + +++ L D V+Y+ SRI KN+ ++ ++ +
Sbjct: 70 SLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDS 129
Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFD 176
+P SG + + NY + LG R++++I DTGSDLTW QC+PC +SCY QQD +FD
Sbjct: 130 VTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFD 189
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
PS S SY + C S+ C L ATGN CS+S+ C Y + YGD S++ G RE L
Sbjct: 190 PSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKA-CIYGIQYGDSSFSVGYFSRERLS 248
Query: 237 LGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
+ V++F+FGCG+NN+GLFGG +GL+GLGR +S V QT+ ++ +FSYCLP+T +
Sbjct: 249 VTATDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLPAT--S 306
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKG 353
++G L G ++ + TP + + + ++FY L++TGIS+GG +L +S F+ G
Sbjct: 307 SSTGRLSFGTTTTSYVKYTPFSTIS-----RGSSFYGLDITGISVGGAKLPVSSSTFSTG 361
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G +IDSGTVITRLPP+ Y+AL++ F + S +PSA SILDTC++LS Y+ +IP +
Sbjct: 362 GAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDF 421
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
F G + + GI+Y + A QVCLA A+ + + I GN QQK V+YD
Sbjct: 422 SFAGGVTVQLPPQGILYV--ASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYD 474
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 301 bits (771), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 172/398 (43%), Positives = 244/398 (61%), Gaps = 18/398 (4%)
Query: 92 LILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTV 148
L LD V + S++ K + + ++ +T++P G L + NYI T+ LG ++++
Sbjct: 89 LRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSL 148
Query: 149 IVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
I DTGSDLTW QCQPC ++CY+Q++P+F+PS S SY V C+S+ C +L ATGN+G CS
Sbjct: 149 IFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCS 208
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-FGCGRNNKGLFGGVSGLMGL 266
+S +C Y + YGD S++ G L ++ L + V D + FGCG NN+GLF GV+GL+GL
Sbjct: 209 AS---NCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGL 265
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
GR LS SQT+ + +FSYCLPS+ A +G L G S+ S T + I +
Sbjct: 266 GRDKLSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDG- 320
Query: 327 LATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+FY LN+ I++GG++L ++ F+ G LIDSGTVITRLPP Y+AL++ F + S
Sbjct: 321 -TSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSK 379
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+P+ G SILDTCF+LS ++ V IP V F G A + + GI Y K SQVCLA A
Sbjct: 380 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK--ISQVCLAFA 437
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S + I GN QQ+ V+YD ++GFA CS
Sbjct: 438 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 300 bits (769), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 167/397 (42%), Positives = 246/397 (61%), Gaps = 17/397 (4%)
Query: 94 LDNLHVQYLQSRI-KNMISGN-IKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
LDN V+Y+QSR+ KN+ N +KD+ +T +P SG + + NY+ + LG R+++++
Sbjct: 3 LDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLSLV 62
Query: 150 VDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
DTGSDLTW QC+PC SCY QQD +FDPS S SY + C SS C L + G CSS
Sbjct: 63 FDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLT-SDGIKSECSS 121
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLG 267
S+ C Y YGD S + G L +E L + V+DF+FGCG++N+GLF G +GLMGLG
Sbjct: 122 STDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGLG 181
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
R +S+V QTS + +FSYCLP+T + + G L G +++ + + YT +
Sbjct: 182 RHPISIVQQTSSNYNKIFSYCLPAT--SSSLGHLTFGASAAT---NASLIYTPLSTISGD 236
Query: 328 ATFYILNLTGISIGGKQLQA---SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+FY L++ IS+GG +L A S F+ GG +IDSGTVITRL P++Y+AL++ F +
Sbjct: 237 NSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEK 296
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+P A +LDTC++LS Y+E+++P + EF G + + GI+ V+S+ QVCLA A
Sbjct: 297 YPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILX-VESE-QQVCLAFA 354
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ +++ + GN QQK V+YD K ++GF C
Sbjct: 355 ANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 297 bits (760), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 168/397 (42%), Positives = 243/397 (61%), Gaps = 18/397 (4%)
Query: 94 LDNLHVQYLQSRI-KNMISGN-IKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
LDN V+Y+QSR+ KN+ N +K++ +T +P SG + + NY + LG R+++++
Sbjct: 93 LDNERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLSLV 152
Query: 150 VDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
DTGSDLTW QC+PC SCY QQD +FDPS S SY + C SS C L A G CSS
Sbjct: 153 FDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSA-GIKSRCSS 211
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLG 267
S+ C Y + YGD S + G L +E L + V+DF+FGCG++N+GLF G +GL+GLG
Sbjct: 212 STTA-CIYGIQYGDKSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFSGSAGLIGLG 270
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
R +S V QTS I+ +FSYCLPST + + G L G +++ N + YT +
Sbjct: 271 RHPISFVQQTSSIYNKIFSYCLPST--SSSLGHLTFGASAATNAN---LKYTPLSTISGD 325
Query: 328 ATFYILNLTGISIGGKQLQA---SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
TFY L++ GIS+GG +L A S F+ GG +IDSGTVITRL P+ Y+AL++ F +
Sbjct: 326 NTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEK 385
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+P A + DTC++ S Y+E+++P + EF G + + + GI+ + A QVCLA A
Sbjct: 386 YPVANEDGLFDTCYDFSGYKEISVPKIDFEFAGGVTVELPLVGIL--IGRSAQQVCLAFA 443
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ +++ I GN QQK V+YD + ++GF C
Sbjct: 444 ANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 297 bits (760), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 202/479 (42%), Positives = 285/479 (59%), Gaps = 27/479 (5%)
Query: 18 MVSLFLLAKGAHC--FEGKK---KLHLHKLQWQQKSGSSSSCV-SHQKSRIEMGAITLEL 71
+SL+LL +C FEG+K H H ++SC S Q IE A L++
Sbjct: 29 FLSLWLLFSFNNCYAFEGRKFAESQHTHTTIHLTSLLPAASCKPSTQVPSIENKAF-LKV 87
Query: 72 KHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIR 129
HK+ CS + Q L+ D V + S++ K+ ++K + T +P G
Sbjct: 88 VHKHGPCSDLRQGHKAEAQYILLQDQSRVDSIHSKLSKDSGLSDVKATAATTLPAKDGSI 147
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKV 186
+ + NY T+ LG ++ ++I DTGSDLTW QC+PC KSCYNQ++ +F+PS S SY +
Sbjct: 148 IGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANI 207
Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDF 245
C S+ C +L ATGN C+SS+ C Y + YGD S++ G G+E L L V NDF
Sbjct: 208 SCGSTLCDSLASATGNIFNCASST---CVYGIQYGDSSFSIGFFGKEKLSLTATDVFNDF 264
Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
FGCG+NNKGLFGG +GL+GLGR LSLVSQT++ + +FSYCLPS+ + ++G L GG
Sbjct: 265 YFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSS--SSSTGFLTFGG 322
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVI 363
++S + TP+ + ++FY L+LTGIS+GG++L S F+ G +IDSGTVI
Sbjct: 323 STSKSASFTPLATIS-----GGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTVI 377
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
TRLPP+ YSAL + F K S +P+AP SILDTCF+ S + +++P + + F G + +
Sbjct: 378 TRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDI 437
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
D TGI Y +D +QVCLA A S + I GN QQK V+YD ++GFA CS
Sbjct: 438 DKTGIFYV--NDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 180/483 (37%), Positives = 261/483 (54%), Gaps = 23/483 (4%)
Query: 11 LSLLLPLMVSLFLLAKGAHCFEGKKKLHL---HKLQWQQKSGSSSSCVSHQKSRIEMGAI 67
+S++ LM+ L+ G+ HL +W+ G + S +
Sbjct: 13 ISVVAVLMLQCLLM--GSSVAPDHDNYHLIPVENFKWKDPQGFAKCPASSAGQEALKPGV 70
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQ-SRIKNMISGNIKDVSNTEIPLTS 126
+ L H + + N L+ + + + I++ SG +SN +PL S
Sbjct: 71 KIRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSN--LPLQS 128
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G + T NYI T G +N +I+DTGSDLTW+QC+PC CY+Q D +F+P S SYK
Sbjct: 129 GTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYK 188
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
+ C S+TC L + N C C Y ++YGDGS ++G+ +E L LG S +
Sbjct: 189 TLPCLSATCTELITSESNPTPCLLGG---CVYEINYGDGSSSQGDFSQETLTLGSDSFQN 245
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
F FGCG N GLF G SGL+GLG++ LS SQ+ +GG F+YCLP + ++GS +G
Sbjct: 246 FAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVG 305
Query: 305 GNSSVFKNSTPIT--YTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSG 360
K S P + +T ++ N TFY + L GIS+GG +L + +G ++DSG
Sbjct: 306 ------KGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSG 359
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
TVITRL P Y+ALK F + PSA FSILDTC++LS + +V IP + F+ NA+
Sbjct: 360 TVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHFQNNAD 419
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ V GI+ V++ SQVCLA AS S D IIGN+QQ+ RV +DT ++GFA
Sbjct: 420 VAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFASGS 479
Query: 481 CSS 483
C++
Sbjct: 480 CAA 482
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 182/489 (37%), Positives = 273/489 (55%), Gaps = 42/489 (8%)
Query: 8 LTILSLLLPLMVSLFLLAKGAHCFEGKK---KLHLHKLQWQQKSGSSSSCVSHQKSRIEM 64
L S LL L + + L A FEG+K + HL + + S S +++
Sbjct: 3 LISFSHLLCLCLVISLSTTYAFGFEGRKIAQENHLQLIHAIEISNLLPSADCEHSTKVAQ 62
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNR------LILDNLHVQYLQSRIKNMISGNIKDVS 118
+L++ HK+ G N+Q N L+ D V + +++ + +K+
Sbjct: 63 NKASLKVVHKH---GPCSQLNQQNGNAPNLVEILLEDQSRVDSIHAKLSDH--SGVKETD 117
Query: 119 NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFD 176
++P SG+ L T NYI +I LG +++ +I DTGSDLTW +C ++ FD
Sbjct: 118 AAKLPTKSGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET--------FD 169
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
P+ S SY V C++ C ++ ATGN C++S+ C Y + YGDGSY+ G LG+E L
Sbjct: 170 PTKSTSYANVSCSTPLCSSVISATGNPSRCAAST---CVYGIQYGDGSYSIGFLGKERLT 226
Query: 237 LGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
+G + N+F FGCG++ GLFG +GL+GLGR LS+VSQT+ + LFSYCLPS+
Sbjct: 227 IGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSS-- 284
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKG 353
++G L G + S TP++ P ++FY L+LTGI++GG++L S F+
Sbjct: 285 -STGFLSFGSSQSKSAKFTPLS-----SGP--SSFYNLDLTGITVGGQKLAIPLSVFSTA 336
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G +IDSGTV+TRLPP+ YSAL++ F K + +P SILDTC++ S Y+ + +P + +
Sbjct: 337 GTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVI 396
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F G ++ VD GI FV + QVCLA A + +T I GN QQ+N V+YD +
Sbjct: 397 SFSGGVDVDVDQAGI--FVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGK 454
Query: 474 LGFAGEDCS 482
+GFA CS
Sbjct: 455 VGFAPASCS 463
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 166/415 (40%), Positives = 251/415 (60%), Gaps = 23/415 (5%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNR------LILDNLHVQYLQSRI-KNM-ISGNIKDVSN 119
+LE+ HK+ ++ D + + ++ L D V+Y+ SR+ KN+ +++++ +
Sbjct: 71 SLEVVHKHGPCSQLNDHDGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQDSSVEELDS 130
Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFD 176
+P SG + + NY + LG R++++I DTGSDLTW QC+PC +SCY QQD +FD
Sbjct: 131 ATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFD 190
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
PS S SY + C S+ C L ATGN CS+S+ C Y + YGD S++ G RE L
Sbjct: 191 PSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKA-CIYGIQYGDSSFSVGYFSRERLT 249
Query: 237 LGKASVND-FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
+ V D F+FGCG+NN+GLFGG +GL+GLGR +S V QT+ + +FSYCLPST +
Sbjct: 250 VTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPST--S 307
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKG 353
++G L G ++ + YT + ++FY L++T I++GG +L +S F+ G
Sbjct: 308 SSTGHLSFGPAAT----GRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTG 363
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G +IDSGTVITRLPP+ Y AL++ F + S +PSA SILDTC++LS Y+ +IP ++
Sbjct: 364 GAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTIEF 423
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
F G + + GI++ + QVCLA A+ + + I GN QQ+ V+YD
Sbjct: 424 SFAGGVTVKLPPQGILFVASTK--QVCLAFAANGDDSDVTIYGNVQQRTIEVVYD 476
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 287 bits (735), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 165/399 (41%), Positives = 239/399 (59%), Gaps = 27/399 (6%)
Query: 89 QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
Q+R +D++H + + S + +P+ SG + + +Y T+ LG +
Sbjct: 95 QDRHRVDSIHAR--------LSSHGVFQEKQATLPVQSGASIGSGDYAVTVGLGTPKKEF 146
Query: 147 TVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
T+I DTGSDLTW QC+PC K+CY Q++P DP+ S SYK + C+S+ C L+ G S
Sbjct: 147 TLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGES-- 204
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLM 264
CSS P C Y V YGDGSY+ G E L L ++V +F+FGCG+ N GLF G +GL+
Sbjct: 205 CSS---PTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLL 261
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
GLGR+ LSL SQT++ + LFSYCLP++ + + G L GG + S + +T + +
Sbjct: 262 GLGRTKLSLPSQTAQKYKKLFSYCLPAS--SSSKGYLSFGG-----QVSKTVKFTPLSED 314
Query: 325 PQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
+ FY L++T +S+GG +L AS F+ G +IDSGTVITRLP + YSAL + F K
Sbjct: 315 FKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPSTAYSALSSAFQKLM 374
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+ +PS G+SI DTC++ S + + IP V + F+G EM +DV+GI+Y V +VCLA
Sbjct: 375 TDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNG-LKKVCLA 433
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A + + I GN QQK +V+YD ++GFA C
Sbjct: 434 FAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 161/398 (40%), Positives = 219/398 (55%), Gaps = 26/398 (6%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTE---IPLTSGIRLQTLNYIATIELG--GRNMTVI 149
D V + +I S + + +P GI L T NY+ ++ LG R+MTV+
Sbjct: 103 DQARVDSIHRKIAAAASPVLDQARGKKGVTLPAQRGISLGTGNYVVSMGLGTPARDMTVV 162
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
DTGSDL+WVQC PC CY Q+DP+FDP+ S +Y V C S C L+ S CS
Sbjct: 163 FDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPECQGLD-----SRSCSRD 217
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGR 268
C Y V YGD S T G L R+ L L ++ V F+FGCG + GLFG GL+GLGR
Sbjct: 218 K--KCRYEVVYGDQSQTDGALARDTLTLTQSDVLPGFVFGCGEQDTGLFGRADGLVGLGR 275
Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
+SL SQ + +G FSYCLPS+ A+G L LGG + +T M
Sbjct: 276 EKVSLSSQAASKYGAGFSYCLPSSPS--AAGYLSLGGPAPANAR-----FTAMETRHDSP 328
Query: 329 TFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--G 384
+FY + L G+ + G+ ++ S F+ G +IDSGTVITRLPP +Y+AL++ F + G
Sbjct: 329 SFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYG 388
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+ AP SILDTC++ + + V IP V + F G A + +D +G++Y K SQ CLA A
Sbjct: 389 YKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVLYVAK--VSQACLAFA 446
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ GIIGN QQK V+YD ++GF CS
Sbjct: 447 PNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 285 bits (728), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 154/324 (47%), Positives = 202/324 (62%), Gaps = 20/324 (6%)
Query: 52 SSCVSHQKSRIEMGAIT--LELKHKNYCS--GKIVDWNEQQQNRLILDNLHVQYLQSRIK 107
SS H+K+ GA T LELK + + V + + L D Q R
Sbjct: 9 SSSGEHKKA----GAATSVLELKRHSLTAIPEDPVARDRYLRRLLAADESRANSFQPRRN 64
Query: 108 NMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGR------NMTVIVDTGSDLTWVQC 161
+ ++ E+PLTSGIRLQTLNY+ TI LGG N+TVIVDTGSDLTWVQC
Sbjct: 65 KDRASASTQSASAEVPLTSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQC 124
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC-HALEFATGNSGVCSSSSP--PDCNYFV 218
+PC +CY Q+DP+FDP+ S +Y V CN+S C +L ATG G C S+ C Y +
Sbjct: 125 KPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYAL 184
Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
+YGDGS++RG L + + LG AS+ F+FGCG +N+GLFGG +GLMGLGR++LSLVSQT+
Sbjct: 185 AYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTA 244
Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGN---SSVFKNSTPITYTNMIPNPQLATFYILNL 335
+GG+FSYCLP+ ASGSL LGG +S ++N+TP+ YT MI +P FY LN+
Sbjct: 245 SRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNV 304
Query: 336 TGISIGGKQLQASGFAKGGILIDS 359
TG ++GG L A G +LIDS
Sbjct: 305 TGAAVGGTALAAQGLGASNVLIDS 328
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 161/403 (39%), Positives = 226/403 (56%), Gaps = 30/403 (7%)
Query: 86 EQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG-- 143
++ Q+R+ D++H + +G +P G+RL T NYI ++ LG
Sbjct: 145 DRDQDRV--DSIH----RMTAGPWTAGQSSASKGVSLPAHRGLRLGTANYIVSVGLGTPR 198
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
R++ V+ DTGSDL+WVQC+PC +CY Q DP+FDPS S +Y V C + C +S
Sbjct: 199 RDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQECL-------DS 251
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVS 261
G CSS C Y V YGD S T G L R+ L LG +S + F+FGCG ++ GLFG
Sbjct: 252 GTCSSGK---CRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRAD 308
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
GL GLGR +SL SQ + +G FSYCLPS+ A G L LG ++ +T M
Sbjct: 309 GLFGLGRDRVSLASQAAARYGAGFSYCLPSSWR--AEGYLSLGSAAA----PPHAQFTAM 362
Query: 322 IPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFL 379
+ +FY L+L GI + G+ ++ + F G +IDSGTVITRLP YSAL++ F
Sbjct: 363 VTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRLPSRAYSALRSSFA 422
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+ AP SILDTC++ + +V IP V + F+G A + + G++Y ++ SQ
Sbjct: 423 GFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYV--ANRSQA 480
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
CLA AS + GI+GN QQK V+YD N ++GF + CS
Sbjct: 481 CLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 240/441 (54%), Gaps = 28/441 (6%)
Query: 52 SSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
S C + + GA T+ L H++ CS + RL D L Y+Q +
Sbjct: 43 SVCSESKAVKSSTGAATVPLHHRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGG 102
Query: 111 -------SGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
+G+++ S+ +P T G L TL Y+ T+ LG G++ T+++DTGSD++WVQC
Sbjct: 103 VNGSRGGAGDVQQ-SHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQC 161
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
+PC C++Q DP+FDPS S +Y C+S+ C L GN CSSS C Y V+YG
Sbjct: 162 KPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQLG-QEGNG--CSSS---QCQYTVTYG 215
Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
DGS T G + L LG +V F FGC G GLMGLG SLVSQT+ F
Sbjct: 216 DGSSTTGTYSSDTLALGSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTF 275
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G FSYCLP+T + +SG L LG +S F T M+ + Q+ TFY + + I +G
Sbjct: 276 GAAFSYCLPAT--SSSSGFLTLGAGTSGFVK------TPMLRSSQVPTFYGVRIQAIRVG 327
Query: 342 GKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
G+QL + G ++DSGTV+TRLPP+ YSAL + F +PSAP ILDTCF+
Sbjct: 328 GRQLSIPTSVFSAGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDF 387
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
S V+IP V + F G A + + GI+ +++ S +CLA A+ S + GIIGN QQ
Sbjct: 388 SGQSSVSIPTVALVFSGGAVVDIASDGIM--LQTSNSILCLAFAANSDDSSLGIIGNVQQ 445
Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
+ V+YD +GF C
Sbjct: 446 RTFEVLYDVGGGAVGFKAGAC 466
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 185/490 (37%), Positives = 274/490 (55%), Gaps = 27/490 (5%)
Query: 1 MVTKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKK--KLHLHKLQWQQKSGSSSSCVSHQ 58
+++ +K + + L + L L KG + E + K ++H L+ S S Q
Sbjct: 4 LISSIKFTGFIYVFLLFLCPLCSLKKG-YAVEANEHIKKYVHTLEVNSLLASDSC---DQ 59
Query: 59 KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVS 118
S++ A +L++ HK ++++ + L+ D L V +Q+R+ + I +
Sbjct: 60 SSKVIDKASSLQVLHKYGPCMQVLN-DRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEM 118
Query: 119 NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVF 175
T++P SGI + T NY+ T+ LG + T++ DTGS +TW QCQPC SCY Q++ F
Sbjct: 119 VTKLPAQSGIAIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKF 178
Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
DP+ S SY V C+S++C+ L T G +S+S C Y + YGD SY++G E L
Sbjct: 179 DPTKSTSYNNVSCSSASCNLL--PTSERGCSASNS--TCLYQIIYGDQSYSQGFFATETL 234
Query: 236 GLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
+ + V +F+FGCG++N GLFG +GL+GL S +SL SQT+E + FSYCLPST
Sbjct: 235 TISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPST-- 292
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAK 352
++G L GG S TPI+ P ++FY +++ GIS+ G QL S F
Sbjct: 293 PSSTGYLNFGGKVSQTAGFTPIS-------PAFSSFYGIDIVGISVAGSQLPIDPSIFTT 345
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
G +IDSGTVITRLPP+ Y ALK F ++ S +P G +LDTC++ S Y V+ P V
Sbjct: 346 SGAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVS 405
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F+G E+ +D +GI+Y V VCLA A+ + E GI GN+QQK V+YD
Sbjct: 406 VSFKGGVEVDIDASGILYLVNG-VKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKG 464
Query: 473 QLGFAGEDCS 482
+GFA CS
Sbjct: 465 MIGFAAGACS 474
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 163/367 (44%), Positives = 217/367 (59%), Gaps = 25/367 (6%)
Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
IP G+ + T NY+ T+ G +N TVI DTGS++ W+QC+PC SCY QQ+P+FDP
Sbjct: 2 SIPARIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDP 61
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
++S +Y+ + C S+ C L S CS S+ C Y V+YGDGS T G L E L
Sbjct: 62 TLSSTYRNISCTSAACTGLS-----SRGCSGST---CVYGVTYGDGSSTVGFLATETFTL 113
Query: 238 GKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
+V N+FIFGCG+NN+GLF G +GL+GLGRS SL SQ + G +FSYCLPST
Sbjct: 114 AAGNVFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSS-- 171
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
A+G L +G TP YT M+ N + T Y ++L GIS+GG +L S F G
Sbjct: 172 ATGYLNIGN-----PLRTP-GYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVG 225
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
+IDSGTVITRLPP+ Y AL+ F + + A SILDTC++ S V P +K+
Sbjct: 226 TIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLH 285
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
+ G ++T+ G+ Y + S SQVCLA A S + GIIGN QQ+ V YD ++
Sbjct: 286 YTG-LDVTIPGAGVFYVISS--SQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRI 342
Query: 475 GFAGEDC 481
GFA C
Sbjct: 343 GFAAGAC 349
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 278 bits (710), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 163/411 (39%), Positives = 233/411 (56%), Gaps = 35/411 (8%)
Query: 86 EQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
++ Q+R+ D++H + +R + +P G+ L T NYI ++ LG
Sbjct: 92 DRDQDRV--DSIH-RLAAARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGTPK 148
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
R++ V+ DTGSDL+WVQC+PC CY Q DP+FDPS S +Y V C + C L+ S
Sbjct: 149 RDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLD-----S 203
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-------VNDFIFGCGRNNKGL 256
G CSS C Y V YGD S T G L R+ L LG +S + +F+FGCG ++ GL
Sbjct: 204 GSCSSGK---CRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGL 260
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
FG GL GLGR +SL SQ + +G FSYCLPS+ + A G L LG S+ N+
Sbjct: 261 FGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSS--STAEGYLSLG--SAAPPNA--- 313
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSAL 374
+T M+ +FY LNL GI + G+ ++ S F G +IDSGTVITRLP Y+AL
Sbjct: 314 RFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLPSRAYAAL 373
Query: 375 KAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
++ F ++++S + AP SILDTC++ + +V IP V + F+G A + + ++Y
Sbjct: 374 RSSFAGLMRRYS-YKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYV 432
Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++ SQ CLA AS + I+GN QQK V+YD N ++GF + CS
Sbjct: 433 --ANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 277 bits (708), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 162/404 (40%), Positives = 228/404 (56%), Gaps = 26/404 (6%)
Query: 86 EQQQNRLILDNLHVQYLQSR-IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG-- 142
E+ Q R+ D++H + + +++ +P GI L T NY+ ++ LG
Sbjct: 101 ERDQARV--DSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGTP 158
Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
+ VI DTGSDL+WVQC+PC CY QQDP+FDPS+S +Y V C + C L+ A+G
Sbjct: 159 AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELD-ASG- 216
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVS 261
CSS S C Y V YGD S T G L R+ L L + ++ F+FGCG N GLFG V
Sbjct: 217 ---CSSDS--RCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVD 271
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
GL GLGR +SL SQ + +G F+YCLPS+ G L LGG T +
Sbjct: 272 GLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSG--RGYLSLGGAPPANAQFTALA---- 325
Query: 322 IPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
+ +FY ++L GI +GG+ ++ + A GG +IDSGTVITRLPP Y+ L+A F
Sbjct: 326 --DGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAF 383
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
+ + + AP SILDTC++ + ++ IP V++ F G A +++D TG++Y K SQ
Sbjct: 384 ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSK--VSQ 441
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
CLA A + + I+GN QQK V YD N ++GF + CS
Sbjct: 442 ACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 277 bits (708), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 162/404 (40%), Positives = 228/404 (56%), Gaps = 26/404 (6%)
Query: 86 EQQQNRLILDNLHVQYLQSR-IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG-- 142
E+ Q R+ D++H + + +++ +P GI L T NY+ ++ LG
Sbjct: 101 ERDQARV--DSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGTP 158
Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
+ VI DTGSDL+WVQC+PC CY QQDP+FDPS+S +Y V C + C L+ A+G
Sbjct: 159 AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELD-ASG- 216
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVS 261
CSS S C Y V YGD S T G L R+ L L + ++ F+FGCG N GLFG V
Sbjct: 217 ---CSSDS--RCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVD 271
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
GL GLGR +SL SQ + +G F+YCLPS+ G L LGG T +
Sbjct: 272 GLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSG--RGYLSLGGAPPANAQFTALA---- 325
Query: 322 IPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
+ +FY ++L GI +GG+ ++ + A GG +IDSGTVITRLPP Y+ L+A F
Sbjct: 326 --DGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAF 383
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
+ + + AP SILDTC++ + ++ IP V++ F G A +++D TG++Y K SQ
Sbjct: 384 ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSK--VSQ 441
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
CLA A + + I+GN QQK V YD N ++GF + CS
Sbjct: 442 ACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 157/371 (42%), Positives = 222/371 (59%), Gaps = 28/371 (7%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDP 177
++P + G+ L T NY+ + LG TV+ DTGSD TWVQCQPC + CY Q++P+FDP
Sbjct: 147 DLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDP 206
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
+ S +Y + C+SS C L + +G SG C Y + YGDGSYT G ++ L L
Sbjct: 207 TKSATYANISCSSSYCSDL-YVSGCSGG-------HCLYGIQYGDGSYTIGFYAQDTLTL 258
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
++ +F FGCG N+GLFG +GL+GLGR SL Q + +GG+F+YCLP+T +
Sbjct: 259 AYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPAT--SAG 316
Query: 298 SGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
+G L LG G + TP+ + TFY + +TGI +GG L G F+ G
Sbjct: 317 TGFLDLGPGAPAANARLTPMLVD------RGPTFYYVGMTGIKVGGHVLPIPGSVFSTAG 370
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLK--QFSGFPSAPGFSILDTCFNLSAYQ--EVNIPL 410
L+DSGTVITRLPPS Y+ L++ F K Q G+ +AP FSILDTC++L+ ++ + +P
Sbjct: 371 TLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPA 430
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V + F+G A + VD +GI+Y +D SQ CLA A + + + I+GN QQK V+YD
Sbjct: 431 VSLVFQGGACLDVDASGILYV--ADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIG 488
Query: 471 NSQLGFAGEDC 481
+GFA C
Sbjct: 489 KKIVGFAPGAC 499
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 175/467 (37%), Positives = 249/467 (53%), Gaps = 49/467 (10%)
Query: 34 KKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKN---YCSGKIVDWNEQQQ- 89
+ +LH+ W VS R A+ L L H++ +GK
Sbjct: 28 RHRLHIQLRDWDSLR------VSAASPRNGTSAV-LRLTHRHGPCAPAGKASALGSPPSF 80
Query: 90 -NRLILDNLHVQYLQSRIKNMISG----NIKDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
+ L D +Y+Q R+ + + +P G + TL Y+ T+ LG
Sbjct: 81 LDTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTP 140
Query: 145 NM--TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FA 199
+ T+ VDTGSD++WVQC+PC S CY+Q+DP+FDP+ S SY V C +++C L ++
Sbjct: 141 AVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYS 200
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFG 258
G SG C Y VSYGDGS T G + L L G ++ F+FGCG +GLF
Sbjct: 201 NGCSGG-------QCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFA 253
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
GV GL+GLGR SLVSQ S +GG+FSYCLP TQ+ + G + LGG SS ST
Sbjct: 254 GVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN--SVGYISLGGPSSTAGFST---- 307
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKA 376
T ++ T+YI+ L GIS+GG+ L AS FA G + +D+GTV+TRLPP+ YSAL++
Sbjct: 308 TPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPTAYSALRS 366
Query: 377 EFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
F + G+PSAP ILDTC++ + Y V +P + + F G A M + +GI+
Sbjct: 367 AFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGIL----- 421
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ CLA A + + I+GN QQ++ V +D S +GF C
Sbjct: 422 --TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 464
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 171/476 (35%), Positives = 248/476 (52%), Gaps = 60/476 (12%)
Query: 48 SGSSSSCVSHQKSRIEMGAIT-LELKHKNYCSGKIVDWNEQQQNR-----LILDNLHVQY 101
S +++SC + ++ R E G T + + H++ + D ++ L+ D V+Y
Sbjct: 46 SAAAASCHTPEQ-RPEAGTATRMPIVHQHGPCSPLADDKHGKKAPSHTEILVADQRRVEY 104
Query: 102 LQSRIKNMISGNIKDVSNTE----------------------------IPLTSGIRLQTL 133
+ R+ +G ++ ++ +P SG+ L T
Sbjct: 105 IHRRVSE-TTGRVRRQKHSAPVVELRPGTPSSTRSSSSSLSSSATSTNLPAKSGLSLNTG 163
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNS 190
NY+ I LG TV+ DTGSD TWVQCQPC + CY Q++P+F P+ S +Y + C S
Sbjct: 164 NYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTS 223
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
S C L+ + G C Y V YGDGSYT G ++ L LG +V DF FGCG
Sbjct: 224 SYCSDLDTRGCSGG--------HCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFRFGCG 275
Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
N+GLFG +GLMGLGR S+ Q + + G+F+YC+P+T G ++
Sbjct: 276 EKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLDFGPGAPAAAN 335
Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPP 368
TP+ N TFY + +TGI +GG L A+ F+ G L+DSGTVITRLPP
Sbjct: 336 ARLTPMLVDNG------PTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPP 389
Query: 369 SIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDV 425
S Y L++ F K G+ +AP FSILDTC++L+ YQ + +P V + F+G A + VD
Sbjct: 390 SAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDA 449
Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+GI+Y +D SQ CLA A+ + + I+GN QQK V+YD +GFA C
Sbjct: 450 SGILYV--ADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/400 (44%), Positives = 249/400 (62%), Gaps = 21/400 (5%)
Query: 92 LILDNLHVQYLQSRIKNMISGNIKDV---SNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
L+ D V+ + SR+ N + KDV +T IP G + + NYI T+ LG +++
Sbjct: 103 LLQDQSRVKSIHSRLSNSKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDL 162
Query: 147 TVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
++I DTGSD+TW QCQPC +SCY Q++ +FDPS S SY + C+SS C++L ATGN+
Sbjct: 163 SLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPG 222
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLM 264
C+SS+ C Y + YGD S++ G G E L L + N+ FGCG+NN+GLFGG +GL+
Sbjct: 223 CASSA---CVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLL 279
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
GLGR LS+VSQT++ + +FSYCLPS+ + +G L GG++S TP++ + P
Sbjct: 280 GLGRDKLSVVSQTAQKYNKIFSYCLPSSSSS--TGFLTFGGSASKNAKFTPLSTISAGP- 336
Query: 325 PQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
+FY L+ TGIS+GGK+L AS F+ G +IDSGTVITRLPP+ YSAL+A F
Sbjct: 337 ----SFYGLDFTGISVGGKKLAISASVFSTAGAIIDSGTVITRLPPAAYSALRASFRNLM 392
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
S +P SILDTC++ S+Y +++P + F E+ +D TGI+Y S SQVCLA
Sbjct: 393 SKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGIEVDIDATGILY--ASSLSQVCLA 450
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
A S + I GN QQK V YD ++GFA CS
Sbjct: 451 FAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGCS 490
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 157/371 (42%), Positives = 222/371 (59%), Gaps = 28/371 (7%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDP 177
++P + G+ L T NY+ + LG TV+ DTGSD TWVQCQPC + CY Q++P+FDP
Sbjct: 82 DLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDP 141
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
+ S +Y + C+SS C L + +G SG C Y + YGDGSYT G ++ L L
Sbjct: 142 TKSATYANISCSSSYCSDL-YVSGCSGG-------HCLYGIQYGDGSYTIGFYAQDTLTL 193
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
++ +F FGCG N+GLFG +GL+GLGR SL Q + +GG+F+YCLP+T +
Sbjct: 194 AYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPAT--SAG 251
Query: 298 SGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
+G L LG G + TP+ + TFY + +TGI +GG L G F+ G
Sbjct: 252 TGFLDLGPGAPAANARLTPMLVD------RGPTFYYVGMTGIKVGGHVLPIPGSVFSTAG 305
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLK--QFSGFPSAPGFSILDTCFNLSAYQ--EVNIPL 410
L+DSGTVITRLPPS Y+ L++ F K Q G+ +AP FSILDTC++L+ ++ + +P
Sbjct: 306 TLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPA 365
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V + F+G A + VD +GI+Y +D SQ CLA A + + + I+GN QQK V+YD
Sbjct: 366 VSLVFQGGACLDVDASGILYV--ADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIG 423
Query: 471 NSQLGFAGEDC 481
+GFA C
Sbjct: 424 KKIVGFAPGAC 434
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 170/437 (38%), Positives = 235/437 (53%), Gaps = 60/437 (13%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRI-KN 108
SS+C K + ++ + KH + N +++ D V +QSR+ KN
Sbjct: 3 SSACSPSPKGHDQRASLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKN 62
Query: 109 MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS 166
+ G+ S +P S L + NY+ T+ LG R++T I DTGSDLTW QC+PC
Sbjct: 63 LAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVG 122
Query: 167 -CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
CY Q++ +FDPS S SY V C+S +C LE ATGNS CSSS+ C Y + YGDGSY
Sbjct: 123 YCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSST---CLYGIRYGDGSY 179
Query: 226 TRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
+ G RE L L V N+F FGCG+NN+GLFGG +GL+GL R+ LSLVSQT++ +G +
Sbjct: 180 SIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKV 239
Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
FSYCLPS+ + ++G L G +S + +T
Sbjct: 240 FSYCLPSS--SSSTGYLSFGSGDG---DSKAVKFT------------------------- 269
Query: 345 LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
RLPP++YS+++ F + S +P G SILDTC++LS Y+
Sbjct: 270 -------------------PRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYK 310
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
V +P + + F G AEM + GI+Y +K SQVCLA A S +DE IIGN QQK
Sbjct: 311 TVKVPKIILYFSGGAEMDLAPEGIIYVLK--VSQVCLAFAGNSDDDEVAIIGNVQQKTIH 368
Query: 465 VIYDTKNSQLGFAGEDC 481
V+YD ++GFA C
Sbjct: 369 VVYDDAEGRVGFAPSGC 385
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 275 bits (703), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 163/402 (40%), Positives = 224/402 (55%), Gaps = 31/402 (7%)
Query: 90 NRLILDNLHVQYLQSRIKNMISGNIKD--VSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
+ L D +++ R+ + + D + +P G + T NY+ T LG G
Sbjct: 90 DTLRADQRRAEHILRRVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPGMA 149
Query: 146 MTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
T+ VDTGSDL+WVQC+PC SCY Q+DP+FDP+ S SY V C S C L +
Sbjct: 150 QTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGI---YA 206
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR-NNKGLFGGVS 261
CS++ C Y VSYGDGS T G + L L A+V F+FGCG + GLF G+
Sbjct: 207 SACSAA---QCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGID 263
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
GL+G GR SLV QT+ +GG+FSYCLP+ +G L LGG S V + T +
Sbjct: 264 GLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSST--TGYLTLGGPSGVAPG---FSTTQL 318
Query: 322 IPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
+P+P T+Y++ LTGIS+GG+ L AS FA G ++D+GTVITRLPP+ Y+AL++ F
Sbjct: 319 LPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAG-TVVDTGTVITRLPPAAYAALRSAFR 377
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+ +PSAP ILDTC++ + Y VN+ V + F A MT+ GI+ F
Sbjct: 378 SGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIMSF-------G 430
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA AS + I+GN QQ++ V D S +GF C
Sbjct: 431 CLAFASSGSDGSMAILGNVQQRSFEVRID--GSSVGFRPSSC 470
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 275 bits (703), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 161/400 (40%), Positives = 223/400 (55%), Gaps = 24/400 (6%)
Query: 89 QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
+++L VQ + + G+ S +P TSG + T NY+ T+ LG
Sbjct: 117 RDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKY 176
Query: 147 TVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
TV+ DTGSD TWVQC+PC CY Q++P+FDP+ S +Y V C S C L+ G
Sbjct: 177 TVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADLDTNGCTGG- 235
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
C Y V YGDGSYT G ++ L + ++ F FGCG N GLFG +GLMG
Sbjct: 236 -------HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMG 288
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LGR SL Q +GG F+YCLP+ +G L G S+ N+ + T M+ +
Sbjct: 289 LGRGKTSLTVQAYNKYGGAFAYCLPALTT--GTGYLDFGPGSA--GNNARL--TPMLTD- 341
Query: 326 QLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF- 382
+ TFY + +TGI +GG+Q+ S F+ G L+DSGTVITRLP + Y+AL + F K
Sbjct: 342 KGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVML 401
Query: 383 -SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
G+ APG+SILDTC++ + +V +P V + F+G A + VDV+GIVY + +QVCL
Sbjct: 402 ARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISE--AQVCL 459
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A AS ++ I+GN QQK V+YD +GFA C
Sbjct: 460 AFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 274 bits (701), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 163/406 (40%), Positives = 230/406 (56%), Gaps = 37/406 (9%)
Query: 90 NRLILDNLHVQYLQSRIKNMISG----NIKDVSNTEIPLTSGIRLQTLNYIATIELGGRN 145
+ L D +Y+Q R+ + + +P G + TL Y+ T+ LG
Sbjct: 93 DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPA 152
Query: 146 M--TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FAT 200
+ T+ VDTGSD++WVQC+PC S CY+Q+DP+FDP+ S SY V C +++C L ++
Sbjct: 153 VAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSN 212
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGG 259
G SG C Y VSYGDGS T G + L L G ++ F+FGCG +GLF G
Sbjct: 213 GCSGG-------QCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAG 265
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
V GL+GLGR SLVSQ S +GG+FSYCLP TQ+ + G + LGG SS ST T
Sbjct: 266 VDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN--SVGYISLGGPSSTAGFST----T 319
Query: 320 NMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
++ T+YI+ L GIS+GG+ L AS FA G + +D+GTV+TRLPP+ YSAL++
Sbjct: 320 PLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPTAYSALRSA 378
Query: 378 FLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
F + G+PSAP ILDTC++ + Y V +P + + F G A M + +GI+
Sbjct: 379 FRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGIL------ 432
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ CLA A + + I+GN QQ++ V +D S +GF C
Sbjct: 433 -TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 161/400 (40%), Positives = 222/400 (55%), Gaps = 24/400 (6%)
Query: 89 QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
+++L VQ + + G+ S +P TSG + T NY+ T+ LG
Sbjct: 117 RDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKY 176
Query: 147 TVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
TV+ DTGSD TWVQC+PC CY Q+ P+FDP+ S +Y V C S C L+ G
Sbjct: 177 TVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADLDTNGCTGG- 235
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
C Y V YGDGSYT G ++ L + ++ F FGCG N GLFG +GLMG
Sbjct: 236 -------HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMG 288
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LGR SL Q +GG F+YCLP+ +G L G S+ N+ + T M+ +
Sbjct: 289 LGRGKTSLTVQAYNKYGGAFAYCLPALTT--GTGYLDFGPGSA--GNNARL--TPMLTD- 341
Query: 326 QLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF- 382
+ TFY + +TGI +GG+Q+ S F+ G L+DSGTVITRLP + Y+AL + F K
Sbjct: 342 KGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVML 401
Query: 383 -SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
G+ APG+SILDTC++ + +V +P V + F+G A + VDV+GIVY + +QVCL
Sbjct: 402 ARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISE--AQVCL 459
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A AS ++ I+GN QQK V+YD +GFA C
Sbjct: 460 AFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 181/496 (36%), Positives = 251/496 (50%), Gaps = 46/496 (9%)
Query: 5 VKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVS----HQKS 60
V+ +LSL+ + + GA G + + + SS+C S Q+
Sbjct: 6 VRRALLLSLICAGALGFLPCSHGAAVAPGYVTVSAARFR------PSSTCSSLDPVAQRR 59
Query: 61 RIEMGAITLELKHKN-YCSGKIVD--WNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD- 116
R A+ L L HK+ C+ + L D +Y+ R+ + + D
Sbjct: 60 RNGTSAV-LRLTHKHGPCAPSRASSLATPSVADTLRADQRRAEYILRRVSGRGTPQLWDS 118
Query: 117 ---VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYN 169
+ +P G + TLNY+ T+ LG G T+ VDTGSDL+WVQC PC + CY+
Sbjct: 119 KAEAATATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYS 178
Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
Q+DP+FDP+ S SY V C C L SS S C Y VSYGDGS T G
Sbjct: 179 QKDPLFDPAQSSSYAAVPCGGPVCGGLGI------YASSCSAAQCGYVVSYGDGSKTTGV 232
Query: 230 LGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
+ L L +V F FGCG G F G GL+GLGR + SLV QT+ +GG+FSYC
Sbjct: 233 YSSDTLTLSPNDAVRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFSYC 291
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA- 347
LP+ +G L LGG S + T ++ +P AT+Y++ LTGIS+GG+QL
Sbjct: 292 LPTRPST--TGYLTLGGPSGAAPPG--FSTTQLLSSPNAATYYVVMLTGISVGGQQLSVP 347
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQE 405
S GG ++D+GTVITRLPP+ Y+AL++ F + G+PSAP ILDTC+N S Y
Sbjct: 348 SSVFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGT 407
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
V +P V + F G A +T+ GI+ F CLA A + I+GN QQ++ V
Sbjct: 408 VTLPNVALTFSGGATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEV 460
Query: 466 IYDTKNSQLGFAGEDC 481
D + +GF C
Sbjct: 461 RID--GTSVGFKPSSC 474
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 170/443 (38%), Positives = 232/443 (52%), Gaps = 37/443 (8%)
Query: 52 SSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
S C + R GA T+ L H++ CS ++RL D L Y IK
Sbjct: 42 SVCSESKAVRSSSGATTVPLHHRHGPCSPLPTKKMPSLEDRLHRDQLRAAY----IKRKF 97
Query: 111 SGNIK---------DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWV 159
SG++K + S+ +P T G L TL Y+ T+ LG + TV++D+GSD++WV
Sbjct: 98 SGDVKKDGQGAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWV 157
Query: 160 QCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVS 219
QC+PC C++Q DP+FDPS+S +Y C+S+ C L GN CSSSS C Y V
Sbjct: 158 QCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAACAQLG-QDGNG--CSSSS--QCQYIVR 212
Query: 220 YGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSE 279
Y DGS T G + L LG ++++F FGC G GLMGLG SL SQT+
Sbjct: 213 YADGSSTTGTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAG 272
Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
FG FSYCLP T +SG L LG +S F TP+ ++ +P TFY + L I
Sbjct: 273 TFGTAFSYCLPPTPS--SSGFLTLGAGTSGFVK-TPMLRSSPVP-----TFYGVRLEAIR 324
Query: 340 IGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF 398
+GG QL + G+++DSGT+ITRLP + YSAL + F + AP SI+DTCF
Sbjct: 325 VGGTQLSIPTSVFSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCF 384
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
+ S V +P V + F G A + +D GI+ CLA A+ S + GI+GN
Sbjct: 385 DFSGQSSVRLPSVALVFSGGAVVNLDANGIIL-------GNCLAFAANSDDSSPGIVGNV 437
Query: 459 QQKNQRVIYDTKNSQLGFAGEDC 481
QQ+ V+YD +GF C
Sbjct: 438 QQRTFEVLYDVGGGAVGFKAGAC 460
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 167/411 (40%), Positives = 226/411 (54%), Gaps = 38/411 (9%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
D V + I N + +DVS +P GI + T NY+ ++ LG R++TV+ DT
Sbjct: 48 DQARVDSIHRMIANETAVVGQDVS---LPAERGISVGTGNYVVSVGLGTPARDLTVVFDT 104
Query: 153 GSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
GSDL+WVQC PC S CY+QQDP+F PS S ++ V C C + CSSS
Sbjct: 105 GSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEPECPRARQS------CSSSP 158
Query: 211 PPD-CNYFVSYGDGSYTRGELGREHLGLG-----KASVND------FIFGCGRNNKGLFG 258
D C Y V YGD S T G LG + L LG AS N+ F+FGCG NN GLFG
Sbjct: 159 GDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFG 218
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
GL GLGR +SL SQ + +G FSYCLPS+ + A G L LG + ++ +
Sbjct: 219 KADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSS-SNAHGYLSLGTPAPAPAHA---RF 274
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQAS---GFAKGGILIDSGTVITRLPPSIYSALK 375
T M+ +FY + L GI + G+ ++ S G+++DSGTVITRL P YSAL+
Sbjct: 275 TPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDSGTVITRLAPRAYSALR 334
Query: 376 AEFLKQFS--GFPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYF 431
FL G+ AP SILDTC++ +A+ V+IP V + F G A ++VD +G++Y
Sbjct: 335 TAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYV 394
Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
K +Q CLA A GI+GN QQ+ V+YD ++GFA + CS
Sbjct: 395 AK--VAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGCS 443
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 169/434 (38%), Positives = 238/434 (54%), Gaps = 53/434 (12%)
Query: 69 LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI------------KNMISGNIKD 116
L L H+ S + E Q+ D V+Y+Q R+ + + +G+
Sbjct: 75 LRLAHRCGPSTASASFAEVQRA----DEQRVEYIQRRVSGGGARGAKGALQQLATGS--- 127
Query: 117 VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQD 172
+ +P T G+ T Y+ T+ LG G + TV VDTGSD++WVQC+PC + C +Q+D
Sbjct: 128 -RSATVPTTMGV--GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRD 184
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
+FDP+ S +Y V C + C L CS S C Y VSYGDGS T G G
Sbjct: 185 QLFDPAKSSTYSAVPCGADACSELRIYEAG---CSGS---QCGYVVSYGDGSNTTGVYGS 238
Query: 233 EHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
+ L L +V F+FGCG G+F G+ GL+ LGR +SL SQ + +GG+FSYCLPS
Sbjct: 239 DTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPS 298
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASG 349
Q A+G L LGG SS +T T ++ TFY++ LTGIS+GG+Q+ AS
Sbjct: 299 KQS--AAGYLTLGGPSSASGFAT----TGLLTAWAAPTFYMVMLTGISVGGQQVAVPASA 352
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVN 407
FA GG ++D+GTVITRLPP+ Y+AL++ F + G+PSAP ILDTC++ S Y V
Sbjct: 353 FA-GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVT 411
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+P V + F G A + ++ GI+ S CLA A + + I+GN QQ++ V +
Sbjct: 412 LPTVALTFSGGATLALEAPGIL-------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRF 464
Query: 468 DTKNSQLGFAGEDC 481
D S +GF C
Sbjct: 465 D--GSTVGFMPGAC 476
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 168/434 (38%), Positives = 238/434 (54%), Gaps = 53/434 (12%)
Query: 69 LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI------------KNMISGNIKD 116
L L H+ S + E Q+ D V+Y+Q R+ + + +G+
Sbjct: 75 LRLAHRCGPSTASASFAEVQRA----DEQRVEYIQRRVSGGGARGAKGALQQLATGS--- 127
Query: 117 VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQD 172
+ +P T G+ T Y+ T+ LG G + TV VDTGSD++WVQC+PC + C +Q+D
Sbjct: 128 -RSATVPTTMGV--GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRD 184
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
+FDP+ S +Y V C + C L CS S C Y VSYGDGS T G G
Sbjct: 185 QLFDPAKSSTYSAVPCGADACSELRIYEAG---CSGS---QCGYVVSYGDGSNTTGVYGS 238
Query: 233 EHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
+ L L +V F+FGCG G+F G+ GL+ LGR +SL SQ + +GG+FSYCLPS
Sbjct: 239 DTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPS 298
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASG 349
Q A+G L LGG +S +T T ++ TFY++ LTGIS+GG+Q+ AS
Sbjct: 299 KQS--AAGYLTLGGPTSASGFAT----TGLLTAWAAPTFYMVMLTGISVGGQQVAVPASA 352
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVN 407
FA GG ++D+GTVITRLPP+ Y+AL++ F + G+PSAP ILDTC++ S Y V
Sbjct: 353 FA-GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVT 411
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+P V + F G A + ++ GI+ S CLA A + + I+GN QQ++ V +
Sbjct: 412 LPTVALTFSGGATLALEAPGIL-------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRF 464
Query: 468 DTKNSQLGFAGEDC 481
D S +GF C
Sbjct: 465 D--GSTVGFMPGAC 476
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 172/402 (42%), Positives = 244/402 (60%), Gaps = 27/402 (6%)
Query: 87 QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
+ QNR+ D++H + L SR G + T +P+ SG + +Y+ T+ LG +
Sbjct: 32 RDQNRV--DSIHAR-LSSR------GMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKK 82
Query: 145 NMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
T+I DTGSD+TW QC+PC K+CY Q++P +PS S SYK + C+S+ C + S
Sbjct: 83 EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFS 142
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSG 262
CSSS+ C Y V YGDGSY+ G E L L ++V +F+FGCG+ N GLFGG +G
Sbjct: 143 QSCSSST---CLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAG 199
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
L+GLGR+ L+L SQT++ + LFSYCLP++ + + G L LGG S TP++ +
Sbjct: 200 LLGLGRTKLALPSQTAKTYKKLFSYCLPAS--SSSKGYLSLGGQVSKSVKFTPLS-ADFD 256
Query: 323 PNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
P FY L++TG+S+GG+QL S F+ G +IDSGTVITRL P+ YS L + F
Sbjct: 257 STP----FYGLDITGLSVGGRQLSIDESAFS-AGTVIDSGTVITRLSPTAYSELSSAFQN 311
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
+ +PS G+SI DTC++ S Y V IP V + F+G EM +DV+GI+Y V +VC
Sbjct: 312 LMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNG-LKKVC 370
Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
LA A + +T I GN QQ+ +V+YD ++GFA CS
Sbjct: 371 LAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 171/402 (42%), Positives = 244/402 (60%), Gaps = 27/402 (6%)
Query: 87 QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
+ QNR+ D++H + L SR G + T +P+ SG + +Y+ T+ LG +
Sbjct: 92 RDQNRV--DSIHAR-LSSR------GMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKK 142
Query: 145 NMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
T+I DTGSD+TW QC+PC K+CY Q++P +PS S SYK + C+S+ C + S
Sbjct: 143 EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFS 202
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSG 262
CSSS+ C Y V YGDGSY+ G E L L ++V +F+FGCG+ N GLFGG +G
Sbjct: 203 QSCSSST---CLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAG 259
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
L+GLGR+ L+L SQT++ + LFSYCLP++ + + G L LGG S TP++ +
Sbjct: 260 LLGLGRTKLALPSQTAKTYKKLFSYCLPAS--SSSKGYLSLGGQVSKSVKFTPLS-ADFD 316
Query: 323 PNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
P FY L++TG+S+GG++L S F+ G +IDSGTVITRL P+ YS L + F
Sbjct: 317 STP----FYGLDITGLSVGGRKLSIDESAFS-AGTVIDSGTVITRLSPTAYSELSSAFQN 371
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
+ +PS G+SI DTC++ S Y V IP V + F+G EM +DV+GI+Y V +VC
Sbjct: 372 LMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNG-LKKVC 430
Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
LA A + +T I GN QQ+ +V+YD ++GFA CS
Sbjct: 431 LAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 472
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 171/402 (42%), Positives = 244/402 (60%), Gaps = 27/402 (6%)
Query: 87 QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
+ QNR+ D++H + L SR G + T +P+ SG + +Y+ T+ LG +
Sbjct: 80 RDQNRV--DSIHAR-LSSR------GMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKK 130
Query: 145 NMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
T+I DTGSD+TW QC+PC K+CY Q++P +PS S SYK + C+S+ C + S
Sbjct: 131 EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFS 190
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSG 262
CSSS+ C Y V YGDGSY+ G E L L ++V +F+FGCG+ N GLFGG +G
Sbjct: 191 QSCSSST---CLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAG 247
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
L+GLGR+ L+L SQT++ + LFSYCLP++ + + G L LGG S TP++ +
Sbjct: 248 LLGLGRTKLALPSQTAKTYKKLFSYCLPAS--SSSKGYLSLGGQVSKSVKFTPLS-ADFD 304
Query: 323 PNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
P FY L++TG+S+GG++L S F+ G +IDSGTVITRL P+ YS L + F
Sbjct: 305 STP----FYGLDITGLSVGGRKLSIDESAFS-AGTVIDSGTVITRLSPTAYSELSSAFQN 359
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
+ +PS G+SI DTC++ S Y V IP V + F+G EM +DV+GI+Y V +VC
Sbjct: 360 LMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNG-LKKVC 418
Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
LA A + +T I GN QQ+ +V+YD ++GFA CS
Sbjct: 419 LAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 164/408 (40%), Positives = 222/408 (54%), Gaps = 35/408 (8%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
D V + I N S VS +P GI + T NY+ ++ LG R++TV+ DT
Sbjct: 117 DQARVDSILGMITNETSAVGPGVS---LPAERGISVGTGNYVVSVGLGTPARDLTVVFDT 173
Query: 153 GSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
GSDL+WVQC PC S CY QQDP+F PS S ++ V C + C A + G+ G
Sbjct: 174 GSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGARECRARQSCGGSPG------ 227
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLG------KASVND-----FIFGCGRNNKGLFGG 259
C Y V YGD S T+G LG + L LG ++ ND F+FGCG NN GLFG
Sbjct: 228 DDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQ 287
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
GL GLGR +SL SQ + FG FSYCLPS+ + A G L LG ++ +T
Sbjct: 288 ADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSS-APGYLSLGTPVPAPAHA---QFT 343
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAEF 378
M+ +FY + L GI + G+ ++ S +++DSGTVITRL P Y AL+A F
Sbjct: 344 PMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVITRLAPRAYRALRAAF 403
Query: 379 LKQFS--GFPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
L G+ AP SILDTC++ +A+ V+IP V + F G A ++VD +G++Y K
Sbjct: 404 LSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAK- 462
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+Q CLA A GI+GN QQ+ V+YD ++GFA + CS
Sbjct: 463 -VAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAAKGCS 509
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 161/409 (39%), Positives = 223/409 (54%), Gaps = 39/409 (9%)
Query: 90 NRLILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELG 142
+ L D +Y+ R+ SG + +++ +P + G + TLNY+ T LG
Sbjct: 92 DTLRADQRRAEYILRRV----SGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLG 147
Query: 143 --GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
G T+ VDTGSDL+WVQC+PC SCY+Q+DP+FDP+ S SY V C C L
Sbjct: 148 TPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLG 207
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGL 256
++ + Y VSYGDGS T G + L L +S V F FGCG GL
Sbjct: 208 IYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGL 262
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
F GV GL+GLGR SLV QT+ +GG+FSYCLP+ +L LGG S
Sbjct: 263 FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSGAAPG---F 319
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSAL 374
+ T ++P+P T+Y++ LTGIS+GG+QL AS FA GG ++D+GTVITRLPP+ Y+AL
Sbjct: 320 STTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA-GGTVVDTGTVITRLPPTAYAAL 378
Query: 375 KAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
++ F + G+P+AP ILDTC+N + Y V +P V + F A + + GI+ F
Sbjct: 379 RSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVMLGADGILSF- 437
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + I+GN QQ++ V D + +GF C
Sbjct: 438 ------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 167/426 (39%), Positives = 228/426 (53%), Gaps = 27/426 (6%)
Query: 65 GAITLELKHKN-YCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
G IT+ L H++ CS V N+ + RL D L Y++ + G+++
Sbjct: 59 GGITVPLHHRHGPCS--PVPSNKMPASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSDAA 116
Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+P T G L TL Y+ T+ +G +T + +DTGSD++WVQC+PC C+++ D +FDPS
Sbjct: 117 TVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPS 176
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +Y C+S+ C L + +G CSSS C Y VSY DGS T G + L LG
Sbjct: 177 ASSTYSPFSCSSAACVQLSQSQQGNG-CSSS---QCQYIVSYVDGSSTTGTYSSDTLTLG 232
Query: 239 KASVNDFIFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
++ F FGC ++ G F GLMGLG SLVSQT+ FG FSYCLP T G+
Sbjct: 233 SNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPT--PGS 290
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
SG L LG S TP+ + IP T+Y + L I +GG+QL S F+ G +
Sbjct: 291 SGFLTLGAASRSGFVKTPMLRSTQIP-----TYYGVLLEAIRVGGQQLNIPTSVFSAGSV 345
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+ DSGTVITRLPP+ YSAL + F +P A ILDTCF+ S V+IP V + F
Sbjct: 346 M-DSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 404
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G A + +D GI+ + CLA A+ S + G IGN QQ+ V+YD +G
Sbjct: 405 SGGAVVNLDFNGIML----ELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVG 460
Query: 476 FAGEDC 481
F C
Sbjct: 461 FRAGAC 466
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 156/407 (38%), Positives = 239/407 (58%), Gaps = 37/407 (9%)
Query: 95 DNLHVQYLQSRIK----------NMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG- 143
D HV++L SR++ SG++ + ++ IPL G+ + + NY + LG
Sbjct: 70 DEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKLGLGSP 129
Query: 144 -RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
+ T+I+DTGS L+W+QC+PC C++Q DP+F+PS S +Y+ + C+SS C L+ AT
Sbjct: 130 PKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLLKAATL 189
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGV 260
N +C++S C Y SYGD SY+ G L R+ L L + ++ F +GCG++N+GLFG
Sbjct: 190 NDPLCTASG--VCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQDNEGLFGKA 247
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS-TPITYT 319
+G++GL R LS+++Q S +G FSYCLP++ +G GG S+ K S + +T
Sbjct: 248 AGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSG-------GGFLSIGKISPSSYKFT 300
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG----ILIDSGTVITRLPPSIYSALK 375
MI N Q + Y L L I++ G+ + G A G +IDSGTV+TRLP SIY+AL+
Sbjct: 301 PMIRNSQNPSLYFLRLAAITVAGRPV---GVAAAGYQVPTIIDSGTVVTRLPISIYAALR 357
Query: 376 AEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
F+K S + AP +SILDTCF S P ++M F+G A++++ I+ +++
Sbjct: 358 EAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNIL--IEA 415
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
D CLA AS ++ IIGN+QQ+ + YD S++GFA C
Sbjct: 416 DKGIACLAFAS---SNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 162/442 (36%), Positives = 239/442 (54%), Gaps = 31/442 (7%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNM 109
S C H+ + + G+ TL L H++ CS I + L D L Y+Q+++ +
Sbjct: 43 SEVCSGHKVTPSKNGS-TLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSR 101
Query: 110 ISGNIKDV--SNTEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPC- 164
+ K++ S IP +SG L T Y+ T+ +G +T + +DTGSD++WVQC PC
Sbjct: 102 YNNVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCA 161
Query: 165 -KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-CSSSSPPDCNYFVSYGD 222
+SC +Q+D +FDP++S +Y C S+ C L G+ G C S C Y V YGD
Sbjct: 162 AQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQL----GDEGNGCLKS---QCQYIVKYGD 214
Query: 223 GSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
GS T G G + L L + +V F FGC G G + GLMGLG SLVSQT+ +
Sbjct: 215 GSNTAGTYGSDTLSLTSSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATY 274
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G FSYCLP +G G L LG +S+ ++T M+ + TFY + L GI++
Sbjct: 275 GKAFSYCLPPPSSSGG-GFLTLGAAGGA--SSSRYSHTPMV-RFSVPTFYGVFLQGITVA 330
Query: 342 GKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
G L AS F+ G ++DSGTVIT+LPP+ Y AL+ F K+ +PSA LDTCF+
Sbjct: 331 GTMLNVPASVFS-GASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFD 389
Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQ 459
S + + +P V + F A M +D++GI+Y CLA + +++ +TGI+GN Q
Sbjct: 390 FSGFNTITVPTVTLTFSRGAAMDLDISGILY-------AGCLAFTATAHDGDTGILGNVQ 442
Query: 460 QKNQRVIYDTKNSQLGFAGEDC 481
Q+ +++D +GF C
Sbjct: 443 QRTFEMLFDVGGRTIGFRSGAC 464
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 169/424 (39%), Positives = 226/424 (53%), Gaps = 25/424 (5%)
Query: 65 GAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
GA T+ L H++ CS + L D L Y+Q + DV S+
Sbjct: 56 GAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDAT 114
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
+P G L TL Y+ T+ LG + T+++DTGSD++WVQC+PC C++Q DP+FDPS
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S +Y C S+ C L GN CSSSS C Y V+YGDGS T G + L LG
Sbjct: 175 SSTYSPFSCGSAACAQLG-QEGNG--CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGS 229
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
++V F FGC G GLMGLG SLVSQT+ G FSYCLP T +SG
Sbjct: 230 SAVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSG 287
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILI 357
L L ++ ++ T M+ + Q+ TFY + L I +GG+QL AS F+ G ++
Sbjct: 288 FLTL--GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM- 344
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
DSGTVITRLPP+ YSAL + F +P A ILDTCF+ S V+IP V + F G
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A +++D +GI+ CLA A+ S + GIIGN QQ+ V+YD +GF
Sbjct: 405 GAVVSLDASGIIL-------SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457
Query: 478 GEDC 481
C
Sbjct: 458 AGAC 461
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 152/425 (35%), Positives = 230/425 (54%), Gaps = 26/425 (6%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNI-KDVSNTEIPLT 125
+L L H++ SG Q L+ DN V++L+ R+ S + +D+ + +P
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVP-- 121
Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
G+ + Y + +G + ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+
Sbjct: 122 -GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSF 180
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
V C S+ C L G + C+Y V+YGDGSYT+GEL E L LG +V
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQ 236
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
GCG N GLF G +GL+GLG +SLV Q GG+FSYCL +++ AG +GSL+L
Sbjct: 237 GVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL-ASRGAGGAGSLVL 295
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
G +V + + ++ N Q ++FY + LTGI +GG++L Q + GG++
Sbjct: 296 GRTEAVPVGA---VWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 352
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+D+GT +TRLP Y+AL+ F P +P S+LDTC++LS Y V +P V F+
Sbjct: 353 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 412
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
A +T+ + V+ + CLA A S I+GN QQ+ ++ D+ N +GF
Sbjct: 413 QGAVLTLPARNL--LVEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGF 468
Query: 477 AGEDC 481
C
Sbjct: 469 GPNTC 473
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 169/424 (39%), Positives = 225/424 (53%), Gaps = 25/424 (5%)
Query: 65 GAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
GA T+ L H++ CS + L D L Y+Q + DV S+
Sbjct: 126 GAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDAT 184
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
+P G L TL Y+ T+ LG + T+++DTGSD++WVQC+PC C++Q DP+FDPS
Sbjct: 185 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 244
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S +Y C S+ C L GN CSSSS C Y V+YGDGS T G + L LG
Sbjct: 245 SSTYSPFSCGSADCAQLG-QEGNG--CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGS 299
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
++V F FGC G GLMGLG SLVSQT+ G FSYCLP T +SG
Sbjct: 300 SAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSG 357
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILI 357
L L ++ ++ T M+ + Q+ TFY + L I +GG+QL AS F+ G ++
Sbjct: 358 FLTL--GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM- 414
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
DSGTVITRLPP+ YSAL + F +P A ILDTCF+ S V+IP V + F G
Sbjct: 415 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 474
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A +++D +GI+ CLA A S + GIIGN QQ+ V+YD +GF
Sbjct: 475 GAVVSLDASGIIL-------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 527
Query: 478 GEDC 481
C
Sbjct: 528 AGAC 531
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 169/424 (39%), Positives = 225/424 (53%), Gaps = 25/424 (5%)
Query: 65 GAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
GA T+ L H++ CS + L D L Y+Q + DV S+
Sbjct: 56 GAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDAT 114
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
+P G L TL Y+ T+ LG + T+++DTGSD++WVQC+PC C++Q DP+FDPS
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S +Y C S+ C L GN CSSSS C Y V+YGDGS T G + L LG
Sbjct: 175 SSTYSPFSCGSADCAQLG-QEGNG--CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGS 229
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
++V F FGC G GLMGLG SLVSQT+ G FSYCLP T +SG
Sbjct: 230 SAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSG 287
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILI 357
L L ++ ++ T M+ + Q+ TFY + L I +GG+QL AS F+ G ++
Sbjct: 288 FLTL--GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM- 344
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
DSGTVITRLPP+ YSAL + F +P A ILDTCF+ S V+IP V + F G
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A +++D +GI+ CLA A S + GIIGN QQ+ V+YD +GF
Sbjct: 405 GAVVSLDASGIIL-------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457
Query: 478 GEDC 481
C
Sbjct: 458 AGAC 461
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 150/424 (35%), Positives = 228/424 (53%), Gaps = 24/424 (5%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
+L L H++ SG Q L+ DN V++L+ R+ S + + +E+
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEV--VP 121
Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G+ + Y + +G + ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+
Sbjct: 122 GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFS 181
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
V C S+ C L G + C+Y V+YGDGSYT+GEL E L LG +V
Sbjct: 182 GVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQG 237
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
GCG N GLF G +GL+GLG +SL+ Q GG+FSYCL +++ AG +GSL+LG
Sbjct: 238 VAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCL-ASRGAGGAGSLVLG 296
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILI 357
+V + + ++ N Q ++FY + LTGI +GG++L Q + GG+++
Sbjct: 297 RTEAVPVGA---VWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVM 353
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
D+GT +TRLP Y+AL+ F P +P S+LDTC++LS Y V +P V F+
Sbjct: 354 DTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQ 413
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A +T+ + V+ + CLA A S I+GN QQ+ ++ D+ N +GF
Sbjct: 414 GAVLTLPARNL--LVEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGFG 469
Query: 478 GEDC 481
C
Sbjct: 470 PNTC 473
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 153/403 (37%), Positives = 232/403 (57%), Gaps = 23/403 (5%)
Query: 95 DNLHVQYLQSRIKNM----------ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG- 143
D HV+ L R+ N SG++ + ++ IPL G+ + + NY + LG
Sbjct: 75 DEEHVKALSDRLANKGLGSGSAKPPKSGHLLEPNSASIPLNPGLSIGSGNYYVKLGLGTP 134
Query: 144 -RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
+ +I+DTGS L+W+QCQPC C+ Q DP++DPS+S +YKK+ C S C L+ AT
Sbjct: 135 PKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATL 194
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGV 260
N +C + S C Y SYGD S++ G L ++ L L + ++ F +GCG++N+GLFG
Sbjct: 195 NDPLCETDSN-ACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRA 253
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+G++GL R LS+++Q S +G FSYCLP+ + G + G+ S T +T
Sbjct: 254 AGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSIS----PTSYKFTP 309
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
M+ + + + Y L LT I++ G+ L A+ + LIDSGTVITRLP S+Y+AL+ F+
Sbjct: 310 MLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFV 369
Query: 380 KQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
K S + AP +SILDTCF S +P +KM F+G A++T+ I+ +++D
Sbjct: 370 KIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSIL--IEADKGI 427
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A S ++ IIGN QQ+ + YD S++GFA C
Sbjct: 428 TCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 163/480 (33%), Positives = 251/480 (52%), Gaps = 37/480 (7%)
Query: 15 LPLMVSLFLLAKGAHCFEGKKKLH-LHKLQWQQKSGSSSSCVSHQKSRIEMGA--ITLEL 71
LPL+V L + G ++ H L + + S +++C + + ++ G+ +++ L
Sbjct: 4 LPLLVCFILCTYNSLAHGGNEEEHVLVAVPTSRYSEPAATCSTSRVRWLDEGSNTVSVPL 63
Query: 72 KHKN-YCSGKIVDWNEQQ-QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR 129
H++ C+ +E RL +Y+ SR SN IP G
Sbjct: 64 VHRHGPCAPSTRSSDEPSLSERLRRSRARSKYIMSRASK---------SNVSIPTHLGGS 114
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKK 185
+ +L Y+ T+ LG + +++DTGSDL+WVQC PC S CY Q+DP+FDPS S +Y
Sbjct: 115 VDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAP 174
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSP--PDCNYFVSYGDGSYTRGELGREHLGLGKA-SV 242
+ CN+ C L G C+S S C Y ++YGDGS T G E L + +V
Sbjct: 175 IPCNTDACRDLT-RDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTV 233
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
DF FGCG + G GL+GLG + SLV QTS ++GG FSYCLP+ D +G L
Sbjct: 234 KDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAAND--QAGFLA 291
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGT 361
LG + +++ +T M+ Q TFY++N+TGI++GG+ + A GG++IDSGT
Sbjct: 292 LG---APVNDASGFVFTPMVREQQ--TFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGT 346
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
V+T L + Y+AL+A F K + +P P LDTC+N + + V +P V + F G A +
Sbjct: 347 VVTELQHTAYAALQAAFRKAMAAYPLLPNGE-LDTCYNFTGHSNVTVPRVALTFSGGATV 405
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+DV + CLA +++ GI+GN Q+ V+YD + ++GF + C
Sbjct: 406 DLDVPDGILLDN------CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 148/397 (37%), Positives = 216/397 (54%), Gaps = 30/397 (7%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
DN +YL SR+ S +E + SG+ + Y + +G ++VD+
Sbjct: 88 DNARAEYLASRLSPAAY-QPTGFSGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDS 146
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSP 211
GSD+ WVQC+PC CY Q DP+FDP+ S ++ V C S+ C L + G+SG
Sbjct: 147 GSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCRTLRTSGCGDSG------- 199
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
C+Y VSYGDGSYT+G L E L LG +V GCG N+GLF G +GL+GLG +
Sbjct: 200 -GCDYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPM 258
Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
SLV Q GG FSYCL S +GSL+LG + +V + + + ++ NPQ +FY
Sbjct: 259 SLVGQLGGAAGGAFSYCLASR----GAGSLVLGRSEAVPEGA---VWVPLVRNPQAPSFY 311
Query: 332 ILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+ L+GI +G ++L Q + GG+++D+GT +TRLP Y+AL+ F+
Sbjct: 312 YVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGA 371
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
P APG S+LDTC++LS Y V +P V F+G A +T+ ++ ++ D CLA A
Sbjct: 372 LPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLL--LEVDGGIYCLAFA 429
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S I+GN QQ+ ++ D+ N +GF C
Sbjct: 430 PSS--SGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 157/368 (42%), Positives = 212/368 (57%), Gaps = 26/368 (7%)
Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
IP G+ + + NY+ T+ G R TV+ DTGSD+ W+QC+PC CY QQ+P+FDP
Sbjct: 2 SIPARIGLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDP 61
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S+S +Y+ V C C L ++ CSSS+ C Y V YGDGS T G L + L
Sbjct: 62 SLSSTYRNVSCTEPACVGL-----STRGCSSST---CLYGVFYGDGSSTIGFLAMDTFML 113
Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDA 295
A +FIFGCG+NN GLF G +GL+GLGRS SL SQ + G +FSYCLPST A
Sbjct: 114 TPAQKFKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSA 173
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKG 353
+G L +G +TP YT M+ + ++ T Y ++L GIS+GG +L S F
Sbjct: 174 --TGYLNIGN-----PQNTP-GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSV 225
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G +IDSGTVITRLPP+ YSALK + + AP +ILDTC++ S V P++ +
Sbjct: 226 GTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVL 285
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F G ++ + TG+ + S SQVCLA A + GIIGN QQ V YD + +
Sbjct: 286 HFAG-LDVRIPATGVFFVFNS--SQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKR 342
Query: 474 LGFAGEDC 481
+GF+ C
Sbjct: 343 IGFSAGAC 350
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 149/402 (37%), Positives = 218/402 (54%), Gaps = 32/402 (7%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
DN +YL SR+ D +E + SG+ + Y + +G ++VD+
Sbjct: 87 DNARAEYLASRLSPAY--QPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDS 144
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSP 211
GSD+ WVQC+PC CY Q DP+FDP+ S ++ V C S+ C L + G+SG
Sbjct: 145 GSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDSG------- 197
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
C Y VSYGDGSYT+G L E L LG +V GCG N+GLF G +GL+GLG +
Sbjct: 198 -GCEYEVSYGDGSYTKGTLALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPM 256
Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAG-----ASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
SLV Q GG FSYCL S +G A+GSL+LG + +V + + + ++ NPQ
Sbjct: 257 SLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPEGA---VWVPLVRNPQ 313
Query: 327 LATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
+FY + ++GI +G ++L Q + GG+++D+GT +TRLP Y+AL+ F+
Sbjct: 314 APSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFV 373
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
P APG S+LDTC++LS Y V +P V F+G A +T+ ++ ++ D
Sbjct: 374 GAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLL--LEVDGGIY 431
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A S I+GN QQ+ ++ D+ N +GF C
Sbjct: 432 CLAFAPSS--SGLSILGNIQQEGIQITVDSANGYIGFGPATC 471
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 153/368 (41%), Positives = 209/368 (56%), Gaps = 29/368 (7%)
Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
+SG L T NY+ TI LG TV+ DTGSD TWVQCQPC CY QQ+ +FDP+ S
Sbjct: 172 SSGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSS 231
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
+Y V C + C L + G SG C Y V YGDGSY+ G + L L
Sbjct: 232 TYANVSCAAPACSDL-YTRGCSGG-------HCLYSVQYGDGSYSIGFFAMDTLTLSSYD 283
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V F FGCG N+GLFG +GL+GLGR SL QT + +GG+F++CLP+ +G
Sbjct: 284 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS--GTGY 341
Query: 301 LILGGNSSV---FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
L G S + +TP+ N TFY + +TGI +GG+ L S F+ G
Sbjct: 342 LDFGPGSPAAVGARQTTPMLTDNG------PTFYYVGMTGIRVGGQLLSIPQSVFSTAGT 395
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
++DSGTVITRLPP+ YS+L++ F + G+ AP S+LDTC++ + EV IP V +
Sbjct: 396 IVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSL 455
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F+G A + V+ +GI+Y + SQVCL A+ +D+ GI+GN Q K V+YD
Sbjct: 456 LFQGGAYLDVNASGIMY--AASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKT 513
Query: 474 LGFAGEDC 481
+GF+ C
Sbjct: 514 VGFSPGAC 521
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 206/370 (55%), Gaps = 25/370 (6%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
+PLT G + NY+ + LG + ++VDTGS LTW+QC PC+ SC+ Q PVFDP
Sbjct: 103 SVPLTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDP 162
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S SY V C+S C L AT N VCS S+ C Y SYGD S++ G L ++ +
Sbjct: 163 KTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSN--VCIYQASYGDSSFSVGYLSKDTVSF 220
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG- 296
G SV +F +GCG++N+GLFG +GLMGL R+ LSL+ Q + G FSYCLPST +G
Sbjct: 221 GANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSGY 280
Query: 297 -ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKG 353
+ GS GG S YT M+ N + Y ++L+G+++ GK L S +
Sbjct: 281 LSIGSYNPGGYS----------YTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSL 330
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVK 412
+IDSGTVITRLP S+Y+AL G A +SILDTCF A + +P V
Sbjct: 331 PTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVS 390
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
M F G A + + ++ V D + CLA A IIGN QQ+ V+YD K++
Sbjct: 391 MAFSGGATLKLSAGNLL--VDVDGATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKSN 445
Query: 473 QLGFAGEDCS 482
++GFA CS
Sbjct: 446 RIGFAAAGCS 455
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 160/429 (37%), Positives = 229/429 (53%), Gaps = 33/429 (7%)
Query: 69 LELKHKNYCSGKIVDWNE----QQQNRLILDNLHVQYLQSRIKNM--ISGNIKDVSNTEI 122
+ + H++ + D ++ + L D + +Q R+ +S + +
Sbjct: 89 MPIVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSL 148
Query: 123 PLTSGIRLQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
P +SG L T NY+ TI LG GR TV+ DTGSD TWVQC+PC CY QQ+ +FDP+
Sbjct: 149 PASSGSALGTGNYVVTIGLGTPAGR-YTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPA 207
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +Y + C + C L + G SG C Y V YGDGSY+ G + L L
Sbjct: 208 RSSTYANISCAAPACSDL-YIKGCSG-------GHCLYGVQYGDGSYSIGFFAMDTLTLS 259
Query: 239 K-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
++ F FGCG N+GL+G +GL+GLGR SL Q + +GG+F++C P A +
Sbjct: 260 SYDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFP----ARS 315
Query: 298 SGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG 354
SG+ L G S+ S +T ++ N TFY + LTGI +GGK L S F G
Sbjct: 316 SGTGYLDFGPGSLPAVSAKLTTPMLVDNGP--TFYYVGLTGIRVGGKLLSIPQSVFTTSG 373
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
++DSGTVITRLPP+ YS+L++ F + G+ AP S+LDTC++ + EV IP V
Sbjct: 374 TIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVS 433
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F+G A + V +GI+Y + SQ CL A +D+ GI+GN Q K V+YD
Sbjct: 434 LLFQGGASLDVHASGIIY--AASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKK 491
Query: 473 QLGFAGEDC 481
+GF C
Sbjct: 492 VVGFCPGAC 500
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 161/440 (36%), Positives = 230/440 (52%), Gaps = 42/440 (9%)
Query: 69 LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNM------------------ 109
L L H ++ CS + + L D+ +L SR+
Sbjct: 47 LTLHHPQSPCSPAPLPSDLPFSTVLTHDDARAAHLASRLATTSNAPSRRPTTSLRKPKAA 106
Query: 110 --ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK 165
SG D S +PLT G + NY+ + LG + ++VDTGS LTW+QC PC
Sbjct: 107 AGASGGPLDDSLASVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCV 166
Query: 166 -SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
SC+ Q P++DP S +Y V C++S C L+ AT N CS + C Y SYGD S
Sbjct: 167 VSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRN--VCIYQASYGDSS 224
Query: 225 YTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
++ G L R+ + G S +F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G
Sbjct: 225 FSVGYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYS 284
Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
FSYCLP+ ++G L +G +S + TP+ +++ A+ Y + L+G+S+GG
Sbjct: 285 FSYCLPT---PASTGYLSIGPYTSGHYSYTPMASSSLD-----ASLYFVTLSGMSVGGSP 336
Query: 345 LQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
L S ++ +IDSGTVITRLP ++Y+AL G SAP FSILDTCF A
Sbjct: 337 LAVSPAEYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTCFQGQA 396
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
Q + +P V M F G A + + ++ + D S CLA A D T IIGN QQ+
Sbjct: 397 SQ-LRVPAVAMAFAGGATLKLATQNVL--IDVDDSTTCLAFAP---TDSTTIIGNTQQQT 450
Query: 463 QRVIYDTKNSQLGFAGEDCS 482
V+YD S++GFA CS
Sbjct: 451 FSVVYDVAQSRIGFAAGGCS 470
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 152/364 (41%), Positives = 205/364 (56%), Gaps = 27/364 (7%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSY 183
G L T NY+ T+ LG TV+ DTGSD TWVQCQPC CY Q++ +FDP+ S +Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
V C + C L+ + G C Y V YGDGSY+ G + L L +V
Sbjct: 231 ANVSCAAPACSDLDTRGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 282
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
F FGCG N+GLFG +GL+GLGR SL QT + +GG+F++CLP+ +G L
Sbjct: 283 KGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARST--GTGYLD 340
Query: 303 LGGNSSVFK-NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDS 359
G S + +TP+ N P TFY + LTGI +GG+ L S FA G ++DS
Sbjct: 341 FGAGSPAARLTTTPMLVDN---GP---TFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDS 394
Query: 360 GTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
GTVITRLPP+ YS+L++ F S G+ AP S+LDTC++ + +V IP V + F+G
Sbjct: 395 GTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQG 454
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A + VD +GI+Y + ASQVCLA A+ + GI+GN Q K V YD + F+
Sbjct: 455 GARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFS 512
Query: 478 GEDC 481
C
Sbjct: 513 PGAC 516
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 152/363 (41%), Positives = 208/363 (57%), Gaps = 22/363 (6%)
Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
G+ L T NY+ I LG TV+ DTGSD TWVQC+PC SCY Q+D +FDP+ S +Y
Sbjct: 155 GLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTY 214
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
V C C L+ + N+G C Y + YGDGSYT G ++ L + + ++
Sbjct: 215 ANVSCADPACADLDASGCNAG--------HCLYGIQYGDGSYTVGFFAKDTLAVAQDAIK 266
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
F FGCG N+GLFG +GL+GLGR S+ Q E +GG FSYCLP++ + A+G L
Sbjct: 267 GFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPAS--SAATGYLEF 324
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA---SGFAKGGILIDSG 360
G S S T T M+ + + TFY + LTGI +GGKQL A S F+ G L+DSG
Sbjct: 325 GPLSPSSSGSNAKT-TPMLTD-KGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTLVDSG 382
Query: 361 TVITRLPPS--IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
TVITRLP + + SG+ A +SILDTC++ + +V++P V + F+G
Sbjct: 383 TVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGG 442
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
A + +D +GIVY + SQVCL AS ++ GI+GN QQ+ V+YD +GFA
Sbjct: 443 ACLDLDASGIVYAISQ--SQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAP 500
Query: 479 EDC 481
C
Sbjct: 501 GAC 503
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/362 (39%), Positives = 205/362 (56%), Gaps = 16/362 (4%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G L T NY ++ LG ++ V +DTGSD +W+QC+PC CY Q + +FDPS S +Y
Sbjct: 126 GKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYS 185
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVN 243
+ C+S C L G+S + SS C Y ++Y D SYT G L R+ L L +V
Sbjct: 186 DITCSSRECQEL----GSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAVP 241
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
F+FGCG NN G FG + GL+GLGR SL SQ + +G FSYCLPS+ A+G L
Sbjct: 242 GFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPS--ATGYLSF 299
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA-KGGILIDSG 360
G ++ T +T M+ Q +FY LNLTGI++ G+ ++ S FA G +IDSG
Sbjct: 300 SGAAA--AAPTNAQFTEMVAG-QHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSG 356
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T + LPPS Y+AL++ + AP +I DTC++L+ ++ V IP V + F A
Sbjct: 357 TAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGAT 416
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + +G++Y S+ SQ CLA + G++GN QQ+ VIYD N ++GF
Sbjct: 417 VHLHPSGVLY-TWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANG 475
Query: 481 CS 482
C+
Sbjct: 476 CA 477
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 150/425 (35%), Positives = 227/425 (53%), Gaps = 35/425 (8%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNI-KDVSNTEIPLT 125
+L L H++ SG Q L+ DN V++L+ R+ S + +D+ + +P
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVP-- 121
Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
G+ + Y + +G + ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+
Sbjct: 122 -GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSF 180
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
V C S+ C L G + C+Y V+YGDGSYT+GEL E L LG +V
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQ 236
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
GCG N GLF G +GL+GLG +SLV Q GG+FSYCL +++ AG +GSL+L
Sbjct: 237 GVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL-ASRGAGGAGSLVL 295
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
G +V + + ++FY + LTGI +GG++L Q + GG++
Sbjct: 296 GRTEAVPRGR------------RASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 343
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+D+GT +TRLP Y+AL+ F P +P S+LDTC++LS Y V +P V F+
Sbjct: 344 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 403
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
A +T+ ++ V+ + CLA A S I+GN QQ+ ++ D+ N +GF
Sbjct: 404 QGAVLTLPARNLL--VEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGF 459
Query: 477 AGEDC 481
C
Sbjct: 460 GPNTC 464
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 162/442 (36%), Positives = 236/442 (53%), Gaps = 44/442 (9%)
Query: 69 LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI----------------KNMIS 111
L L H ++ CS + + L D+ V +L SR+ K
Sbjct: 46 LTLHHPQSPCSPAPLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAG 105
Query: 112 G-----NIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
G ++ D S +PL+ G + NY+ + LG + ++VDTGS LTW+QC PC
Sbjct: 106 GASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC 165
Query: 165 K-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
SC+ Q P+FDP S +Y V C++S C L+ AT N CS+S+ C Y SYGD
Sbjct: 166 VVSCHRQVGPLFDPRASSTYTSVRCSASQCDELQAATLNPSACSASN--VCIYQASYGDS 223
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
S++ G L + + G S F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G
Sbjct: 224 SFSVGYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGY 283
Query: 284 LFSYCLPSTQDAGASGSLILGG-NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
FSYCLP+ A ++G L +G N+ + + TP+ +++ A+ Y + L+G+S+GG
Sbjct: 284 SFSYCLPT---AASTGYLSIGPYNTGHYYSYTPMASSSLD-----ASLYFITLSGMSVGG 335
Query: 343 KQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
L S ++ +IDSGTVITRLP ++++AL + +G AP FSILDTCF
Sbjct: 336 SPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEG 395
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
A Q + +P V M F G A M + ++ + D S CLA A D T IIGN QQ
Sbjct: 396 QASQ-LRVPTVVMAFAGGASMKLTTRNVL--IDVDDSTTCLAFAP---TDSTAIIGNTQQ 449
Query: 461 KNQRVIYDTKNSQLGFAGEDCS 482
+ VIYD S++GF+ CS
Sbjct: 450 QTFSVIYDVAQSRIGFSAGGCS 471
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 165/444 (37%), Positives = 235/444 (52%), Gaps = 35/444 (7%)
Query: 52 SSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
S S QK TL L H++ CS + + L D L + +++ +
Sbjct: 44 SEVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEETLGRDQLRAANIHAKLSSPR 103
Query: 111 SGNIKDV--SNTEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPC-- 164
+ + K++ S IP +SG L T Y+ T+ LG +T + +DTGSD++WVQC PC
Sbjct: 104 NSSAKELQQSGVTIPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAA 163
Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
+SC +Q+D +FDP+ S +Y C+S+ C L G C +S C Y V Y D S
Sbjct: 164 QSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLG---GEGNGCLNS---HCQYIVKYVDHS 217
Query: 225 YTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
T G G + LGL + +V +F FGC G G + GLMGLG SLVSQT+ +G
Sbjct: 218 NTTGTYGSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGK 277
Query: 284 LFSYCLPSTQDAGASGSLILG----GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
FSYCLP + + A G L LG G SS + TP+ N + TFY + L I+
Sbjct: 278 AFSYCLPPSSSS-AGGFLTLGAAAGGTSSSRYSRTPLVRFN------VPTFYGVFLQAIT 330
Query: 340 IGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
+ G +L AS F+ G ++DSGTVIT+LPP+ Y AL+ F K+ +PSA ILDTC
Sbjct: 331 VAGTKLNVPASVFS-GASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTC 389
Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
F+ S + V +P+V + F A M +DV+GI Y CLA + + + +TGI+GN
Sbjct: 390 FDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFY-------AGCLAFTATAQDGDTGILGN 442
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
QQ+ +++D S LGF C
Sbjct: 443 VQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 161/442 (36%), Positives = 235/442 (53%), Gaps = 44/442 (9%)
Query: 69 LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI----------------KNMIS 111
L L H ++ CS + + L D+ V +L SR+ K
Sbjct: 46 LTLHHPQSPCSPAPLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAG 105
Query: 112 G-----NIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
G ++ D S +PL+ G + NY+ + LG + ++VDTGS LTW+QC PC
Sbjct: 106 GASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC 165
Query: 165 K-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
SC+ Q P+FDP S +Y V C++S C L+ AT N CS+S+ C Y SYGD
Sbjct: 166 VVSCHRQVGPLFDPRASSTYASVRCSASQCDELQAATLNPSACSASN--VCIYQASYGDS 223
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
S++ G L + + G F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G
Sbjct: 224 SFSVGSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGY 283
Query: 284 LFSYCLPSTQDAGASGSLILGG-NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
FSYCLP+ A ++G L +G N+ + + TP+ +++ A+ Y + L+G+S+GG
Sbjct: 284 SFSYCLPT---AASTGYLSIGPYNTGHYYSYTPMASSSLD-----ASLYFITLSGMSVGG 335
Query: 343 KQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
L S ++ +IDSGTVITRLP ++++AL + +G AP FSILDTCF
Sbjct: 336 SPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEG 395
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
A Q + +P V M F G A M + ++ + D S CLA A D T IIGN QQ
Sbjct: 396 QASQ-LRVPTVAMAFAGGASMKLTTRNVL--IDVDDSTTCLAFAP---TDSTAIIGNTQQ 449
Query: 461 KNQRVIYDTKNSQLGFAGEDCS 482
+ VIYD S++GF+ CS
Sbjct: 450 QTFSVIYDVAQSRIGFSAGGCS 471
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 149/399 (37%), Positives = 217/399 (54%), Gaps = 19/399 (4%)
Query: 90 NRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT-- 147
+RL + +Y+ SR+ + G+ DVS IP G + +L Y+ T+ LG +++
Sbjct: 82 DRLRRNRARSKYIMSRVSKGMMGDDADVS---IPTHLGGSVDSLEYVVTVGLGTPSVSQV 138
Query: 148 VIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
+++DTGSDL+WVQCQPC S CY Q+DP+FDPS S +Y + CN+ C L G
Sbjct: 139 LLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNTDACRDLTDDGYGGGC 198
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLM 264
S C + ++YGDGS TRG E L L +V DF FGCG + G GL+
Sbjct: 199 ASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLL 258
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-AGASGSLILGGNSSVFKNSTPITYTNMIP 323
GLG + SLV QT+ ++GG FSYCLP+ + G G S N++ +T MI
Sbjct: 259 GLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIR 318
Query: 324 NPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
+ TFY++N+TGI++GG+ + A GG++IDSGTV+T L + Y+AL+A F K
Sbjct: 319 EEE--TFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQHTAYNALQAAFRKAM 376
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+ +P LDTC++ S Y V +P V + F G A + +DV + CLA
Sbjct: 377 AAYPLVRNGE-LDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGILL------DDCLA 429
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+D+ GI+GN Q+ V+YD ++GF C
Sbjct: 430 FQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 162/399 (40%), Positives = 215/399 (53%), Gaps = 24/399 (6%)
Query: 89 QNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTEIPLTSGIRLQTLNYIATIELG--GR 144
+ L D L Y+Q + DV S+ +P G L TL Y+ T+ LG
Sbjct: 5 EETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPAT 63
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
+ T+++DTGSD++WVQC+PC C++Q DP+FDPS S +Y C S+ C L GN
Sbjct: 64 SQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLG-QEGNG- 121
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
CSSSS C Y V+YGDGS T G + L LG ++V F FGC G GLM
Sbjct: 122 -CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLM 178
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
GLG SLVSQT+ G FSYCLP T +SG L L ++ ++ T M+ +
Sbjct: 179 GLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSGFLTL--GAAGGSGTSGFVKTPMLRS 234
Query: 325 PQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
Q+ TFY + L I +GG+QL AS F+ G ++ DSGTVITRLPP+ YSAL + F
Sbjct: 235 SQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM-DSGTVITRLPPTAYSALSSAFKAGM 293
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+P A ILDTCF+ S V+IP V + F G A +++D +GI+ CLA
Sbjct: 294 KQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-------SNCLA 346
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A S + GIIGN QQ+ V+YD +GF C
Sbjct: 347 FAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 163/454 (35%), Positives = 234/454 (51%), Gaps = 32/454 (7%)
Query: 45 QQKSGSSSSCVSHQKSRIE--MGAITLELKHK-------NYCSGKIVDWNEQ-QQNRLIL 94
Q++S S + S K +E +++ L H+ Y + +E +++R
Sbjct: 31 QRRSYDSETVCSASKVNLEPSSATVSMSLVHRYGPCAPSQYSNVPTPSISETLRRSRART 90
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDT 152
+ + Q +S M S D + IP G + +L Y+ T+ G ++ +++DT
Sbjct: 91 NYIMSQASKSMGMGMASTPDDDDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDT 150
Query: 153 GSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
GSD++WVQC PC S CY Q+DP+FDPS S +Y + CN+ C L N C+S
Sbjct: 151 GSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNG--CTSGG 208
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
C Y V Y DGS++RG E L L +V DF FGCGR+ +G GL+GLG +
Sbjct: 209 T-QCGYSVEYADGSHSRGVYSNETLTLAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGA 267
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
+SLV QTS ++GG FSYCLP+ +G L+LG S N + +T M P AT
Sbjct: 268 PVSLVVQTSSVYGGAFSYCLPALNS--EAGFLVLGSPPS--GNKSAFVFTPMRHLPGYAT 323
Query: 330 FYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
FY++ +TGIS+GGK L A +GG++IDSGTV T LP + Y+AL+A K +P
Sbjct: 324 FYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLV 383
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLS 447
P DTC+N + Y + +P V F G A + +DV GI+ CLA
Sbjct: 384 PS-DDFDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILV-------NDCLAFQESG 435
Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+D GIIGN Q+ V+YD +GF C
Sbjct: 436 PDDGLGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 151/369 (40%), Positives = 205/369 (55%), Gaps = 31/369 (8%)
Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
+SG L T NY+ T+ LG TV+ DTGSD TWVQCQPC CY QQ+ +FDP+ S
Sbjct: 169 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSS 228
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
+Y V C + C L+ + G C Y V YGDGSY+ G + L L
Sbjct: 229 TYANVSCAAPACFDLDTRGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 280
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V F FGCG N+GLFG +GL+GLGR SL QT + +GG+F++CLP A +SG+
Sbjct: 281 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSSGT 336
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
L F +P + P L TFY + +TGI +GG+ L S FA G
Sbjct: 337 GYLD-----FGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 391
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
++DSGTVITRLPP YS+L++ F+ + G+ AP S+LDTC++ + +V IP V
Sbjct: 392 TIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 451
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F+G A + VD +GI+Y + SQVCL A+ + GI+GN Q K V YD
Sbjct: 452 LLFQGGAILDVDASGIMY--AASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 509
Query: 473 QLGFAGEDC 481
+GF+ C
Sbjct: 510 VVGFSPGAC 518
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 163/447 (36%), Positives = 226/447 (50%), Gaps = 33/447 (7%)
Query: 50 SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQ--QQNRLILDNLHVQYLQSRIK 107
S + C + T+ L H++ + ++ ++ L D L +++Q +
Sbjct: 35 SEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFA 94
Query: 108 -NMISGNIKDVSNTEI----PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
N D+ +++ P G L TL Y+ ++ LG TV +DTGSD++WVQ
Sbjct: 95 MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154
Query: 161 CQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
C PC + C+ Q +FDP+ S +Y+ V C ++ C LE G + +C Y V
Sbjct: 155 CNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNY----ECQYGV 210
Query: 219 SYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
YGDGS T G R+ L L AS V F FGC G GLMGLG SLVSQ
Sbjct: 211 QYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQ 270
Query: 277 TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
T+ +G FSYCLP T SGS S +T T M+ + Q+ TFY L
Sbjct: 271 TAAAYGNSFSYCLPPT-----SGSSGFLTLGGGGGASGFVT-TRMLRSKQIPTFYGARLQ 324
Query: 337 GISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
I++GGKQL S FA G + +DSGT+ITRLPP+ YSAL + F + SAP SIL
Sbjct: 325 DIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSIL 383
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
DTCF+ + +++IP V + F G A + +D GI+Y CLA A+ + TGI
Sbjct: 384 DTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY-------GNCLAFAATGDDGTTGI 436
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IGN QQ+ V+YD +S LGF C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 215/368 (58%), Gaps = 25/368 (6%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
PL SGI + +Y A I +G R++ ++ DTGSD++W+QC PC+ CY QQDP+F+PS+S
Sbjct: 69 PLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLS 128
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
S+K + C SS C L+ CS + +C Y VSYGDGS+T G+ E L G+
Sbjct: 129 SSFKPLACASSICGKLKIKG-----CSRKN--ECMYQVSYGDGSFTVGDFSTETLSFGEH 181
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V GCGRNN+GLF G +GL+GLGR LS SQT + +FSYCLP + A A+ S
Sbjct: 182 AVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA-S 240
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KG 353
L+ G ++ K +T ++PN +L T+Y + L I + G + FA G
Sbjct: 241 LVFGPSAVPEK----ARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTG 296
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G+++DSGT I+RL Y+AL+ F + FPSAPG S+ DTC++LS+ + +P V +
Sbjct: 297 GVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKTATLPAVVL 355
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
+F+G A M + GI+ V D CLA A E+ IIGN QQ+ R+ D + Q
Sbjct: 356 DFDGGASMPLPADGILVNVD-DEGTYCLAFA--PEEEAFSIIGNVQQQTFRISIDNQKEQ 412
Query: 474 LGFAGEDC 481
+G A + C
Sbjct: 413 MGIAPDQC 420
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 157/432 (36%), Positives = 233/432 (53%), Gaps = 32/432 (7%)
Query: 68 TLELKHKN-YCSGKIVDWNEQQQ----NRLILDNLHVQYLQSRI--KNMISGNIKDVSNT 120
++ L H++ C+ K ++++ RL D ++ + + M+S +
Sbjct: 55 SVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMS----EGGGA 110
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFD 176
IP G + +L Y+ T+ +G TV++DTGSDL+WVQC+PC + CY Q+DP+FD
Sbjct: 111 SIPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFD 170
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS--PPDCNYFVSYGDGSYTRGELGREH 234
PS S ++ + C S C L ++G +++S PP C Y + YG+G+ T G E
Sbjct: 171 PSKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTET 230
Query: 235 LGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
L LG A V F FGCG + G + GL+GLG + SLVSQT+ ++GG FSYCLP
Sbjct: 231 LALGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLN 290
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIP-NPQLATFYILNLTGISIGGKQLQ--ASGF 350
+G L LG +S +++ +T M +P++ATFY++ LTGIS+GGK L + F
Sbjct: 291 S--GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVF 348
Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFNLSAYQEVNIP 409
AKG I +DSGTVIT +P + Y AL+ F + +P P S LDTC+N + + V +P
Sbjct: 349 AKGNI-VDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVP 407
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
V + F G A + +DV V + CLA A + GIIGN + V+YD+
Sbjct: 408 KVALTFVGGATVDLDVPSGVLV------EDCLAFADAG-DGSFGIIGNVNTRTIEVLYDS 460
Query: 470 KNSQLGFAGEDC 481
LGF C
Sbjct: 461 GKGHLGFRAGAC 472
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 148/390 (37%), Positives = 211/390 (54%), Gaps = 19/390 (4%)
Query: 98 HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSD 155
Y++SR ++ D + T +P G + +L Y+ T+ G ++ +++DTGSD
Sbjct: 89 RTNYIKSRASTGMASTPDDAAVT-VPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSD 147
Query: 156 LTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD 213
++WVQC PC S CY Q+DP+FDPS S +Y + C + C+ L N C+S
Sbjct: 148 VSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNG--CTSGGT-Q 204
Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
C Y V YGDGS TRG E + +V DF FGCG + +G GL+GLG + S
Sbjct: 205 CGYRVEYGDGSSTRGVYSNETITFAPGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPES 264
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
LV QT+ ++GG FSYCLP+ +G L LG S N++ +T M P AT Y+
Sbjct: 265 LVVQTASVYGGAFSYCLPALNS--EAGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYM 322
Query: 333 LNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+N+TGIS+GGK L A +GG+LIDSGT++T LP + Y+AL A K F+ +P
Sbjct: 323 VNMTGISVGGKPLDIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMV-AS 381
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
DTC+N + Y V +P V + F G A + +DV + VK CLA +
Sbjct: 382 EDFDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGI-LVKD-----CLAFRESGPDVG 435
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
GIIGN Q+ V+YD + ++GF C
Sbjct: 436 LGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 150/364 (41%), Positives = 206/364 (56%), Gaps = 31/364 (8%)
Query: 130 LQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKK 185
L T NY+ TI LG GR TV+ DTGSD TWVQC+PC CY QQ+ +FDP+ S +
Sbjct: 181 LGTGNYVVTIGLGTPAGR-YTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDAN 239
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVND 244
+ C + C L + G SG C Y V YGDGSY+ G + L L ++
Sbjct: 240 ISCAAPACSDL-YTKGCSGG-------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKG 291
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
F FGCG N+GLFG +GL+GLGR SL Q + +GG+F++C P+ + +G L G
Sbjct: 292 FRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPAR--SSGTGYLDFG 349
Query: 305 GNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDS 359
SS K +TP+ N + TFY + LTGI +GGK L S F G ++DS
Sbjct: 350 PGSSPAVSTKLTTPMLVDNGL------TFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDS 403
Query: 360 GTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
GTVITRLPP+ YS+L++ F + G+ AP S+LDTC++ + +V IP V + F+G
Sbjct: 404 GTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQG 463
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A + VD +GI+Y + SQ CL A+ +D+ GI+GN Q K V+YD +GF+
Sbjct: 464 GASLDVDASGIIY--AASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFS 521
Query: 478 GEDC 481
C
Sbjct: 522 PGAC 525
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 148/369 (40%), Positives = 205/369 (55%), Gaps = 31/369 (8%)
Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
+SG L T NY+ T+ LG TV+ DTGSD TWVQCQPC CY Q++ +FDP+ S
Sbjct: 170 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSS 229
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
+Y + C + C L+ + G +C Y V YGDGSY+ G + L L
Sbjct: 230 TYANISCAAPACSDLDTRGCSGG--------NCLYGVQYGDGSYSIGFFAMDTLTLSSYD 281
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V F FGCG N+GLFG +GL+GLGR SL QT + +GG+F++CLP A +SG+
Sbjct: 282 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSSGT 337
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
L F +P + P L TFY + +TGI +GG+ L S F G
Sbjct: 338 GYLD-----FGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAG 392
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
++DSGTVITRLPP+ YS+L++ F + G+ AP S+LDTC++ + +V IP V
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 452
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F+G A + VD +GI+Y + SQVCL A+ + GI+GN Q K V YD
Sbjct: 453 LLFQGGARLDVDASGIMY--AASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 510
Query: 473 QLGFAGEDC 481
+GF+ C
Sbjct: 511 VVGFSPGAC 519
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 163/447 (36%), Positives = 224/447 (50%), Gaps = 33/447 (7%)
Query: 50 SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQ--QQNRLILDNLHVQYLQSRIK 107
S + C + T+ L H++ + ++ ++ L D L +++Q +
Sbjct: 35 SEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFA 94
Query: 108 -NMISGNIKDVSNTEI----PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
N D+ +++ P G L TL Y+ ++ LG TV +DTGSD++WVQ
Sbjct: 95 MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154
Query: 161 CQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
C PC + CY Q +FDP+ S +Y+ V C ++ C LE G + +C Y V
Sbjct: 155 CNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNY----ECQYGV 210
Query: 219 SYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
YGDGS T G R+ L L AS V F FGC G GLMGLG SLVSQ
Sbjct: 211 QYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQ 270
Query: 277 TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
T+ +G FSYCLP T SGS S T M+ + Q+ TFY L
Sbjct: 271 TAAAYGNSFSYCLPPT-----SGSSGFLTLGGGGGVSG-FVTTRMLRSRQIPTFYGARLQ 324
Query: 337 GISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
I++GGKQL S FA G + +DSGT+ITRLPP+ YSAL + F + SAP SIL
Sbjct: 325 DIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSIL 383
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
DTCF+ + +++IP V + F G A + +D GI+Y CLA A+ + TGI
Sbjct: 384 DTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY-------GNCLAFAATGDDGTTGI 436
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IGN QQ+ V+YD +S LGF C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 151/363 (41%), Positives = 203/363 (55%), Gaps = 26/363 (7%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
G L T NY+ T+ LG TV+ DTGSD TWVQCQPC +CY Q++ +FDP+ S +Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
V C + C L+ + + G C Y V YGDGSY+ G + L L +V
Sbjct: 231 ANVSCAAPACSDLDVSGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 282
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
F FGCG N GLFG +GL+GLGR SL QT +GG+F++CLP+ +G L
Sbjct: 283 KGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARST--GTGYLD 340
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
G S +TP+ N P TFY + +TGI +GG+ L S FA G ++DSG
Sbjct: 341 FGAGSPPATTTTPMLTGN---GP---TFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSG 394
Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
TVITRLPP+ YS+L++ F + G+ A S+LDTC++ + +V IP V + F+G
Sbjct: 395 TVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGG 454
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
A + VD +GI+Y V ASQVCLA A + GI+GN Q K V YD +GF+
Sbjct: 455 AALDVDASGIMYTVS--ASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSP 512
Query: 479 EDC 481
C
Sbjct: 513 GAC 515
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 172/478 (35%), Positives = 245/478 (51%), Gaps = 27/478 (5%)
Query: 9 TILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
+I LL L+ S L AH + ++ HK+ SS+ S K +T
Sbjct: 3 SISKFLLALLFSYHTLI--AHAADDRR----HKVLSVGSLMKSSTACSEPKVTPPSTGVT 56
Query: 69 LELKHK-NYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
+ L H+ + CS + RL D L Y++ + +G+I+ +P T G
Sbjct: 57 VPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSG--AGDIEQSDAATVPTTLG 114
Query: 128 IRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
L TL Y+ T+ +G +T + +DTGSD++WVQC+PC C+++ D +FDPS S +Y
Sbjct: 115 TSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSP 174
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
C+S+ C L + +G SS C Y V+YGD S T G + L LG +++ DF
Sbjct: 175 FSCSSAPCAQLSQSQEGNGCMSS----QCQYIVNYGDSSSTTGTYSSDTLTLGSSAMTDF 230
Query: 246 IFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGC ++ G F GLMGLG SL SQT+ FG FSYCLP T +G+SG L LG
Sbjct: 231 QFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPT--SGSSGFLTLG 288
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVI 363
SS F T M+ + Q+ T+Y++ L I +G +QL + G L+DSGT+I
Sbjct: 289 TGSSGFVK------TPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTII 342
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
TRLPP+ YSAL + F +P A ILDTCF+ S ++IP V + F G A + +
Sbjct: 343 TRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDL 402
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
GI+ + S S CLA + GIIGN QQ+ V+YD +GF C
Sbjct: 403 AFDGIMLEISS--SIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 151/363 (41%), Positives = 203/363 (55%), Gaps = 26/363 (7%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
G L T NY+ T+ LG TV+ DTGSD TWVQCQPC +CY Q++ +FDP+ S +Y
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
V C + C L+ + + G C Y V YGDGSY+ G + L L +V
Sbjct: 235 ANVSCAAPACSDLDVSGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 286
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
F FGCG N GLFG +GL+GLGR SL QT +GG+F++CLP+ +G L
Sbjct: 287 KGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARST--GTGYLD 344
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
G S +TP+ N P TFY + +TGI +GG+ L S FA G ++DSG
Sbjct: 345 FGAGSPPATTTTPMLTGN---GP---TFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSG 398
Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
TVITRLPP+ YS+L++ F + G+ A S+LDTC++ + +V IP V + F+G
Sbjct: 399 TVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGG 458
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
A + VD +GI+Y V ASQVCLA A + GI+GN Q K V YD +GF+
Sbjct: 459 AALDVDASGIMYTVS--ASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSP 516
Query: 479 EDC 481
C
Sbjct: 517 GAC 519
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 151/368 (41%), Positives = 204/368 (55%), Gaps = 29/368 (7%)
Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
+SG L T NY+ T+ LG TV+ DTGSD TWVQCQPC CY Q++ +FDP+ S
Sbjct: 170 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSS 229
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
+Y V C + C L + G C Y V YGDGSY+ G + L L
Sbjct: 230 TYANVSCAAPACSDLNIHGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 281
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V F FGCG N+GLFG +GL+GLGR SL QT + +GG+F++CLP+ +G
Sbjct: 282 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARST--GTGY 339
Query: 301 LILGGNS---SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
L G S + + +TP+ N P TFY + +TGI +GG+ L S FA G
Sbjct: 340 LDFGAGSLAAARARLTTPMLTEN---GP---TFYYVGMTGIRVGGQLLSIPQSVFATAGT 393
Query: 356 LIDSGTVITRLPPSIYSALK--AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
++DSGTVITRLPP+ YS+L+ G+ AP S+LDTC++ + +V IP V +
Sbjct: 394 IVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSL 453
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F+G A + VD +GI+Y + ASQVCLA A+ + GI+GN Q K V YD
Sbjct: 454 LFQGGARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKV 511
Query: 474 LGFAGEDC 481
+GF C
Sbjct: 512 VGFYPGAC 519
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 151/363 (41%), Positives = 202/363 (55%), Gaps = 26/363 (7%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
G L T NY+ T+ LG TV+ DTGSD TWVQCQPC +CY Q++ +FDP+ S +Y
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
V C + C L+ + + G C Y V YGDGSY+ G + L L +V
Sbjct: 232 ANVSCAAPACSDLDVSGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 283
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
F FGCG N GLFG +GL+GLGR SL QT +GG+F++CLP +G L
Sbjct: 284 KGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRST--GTGYLD 341
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
G S +TP+ N P TFY + +TGI +GG+ L S FA G ++DSG
Sbjct: 342 FGAGSPPATTTTPMLTGN---GP---TFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSG 395
Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
TVITRLPP+ YS+L++ F + G+ A S+LDTC++ + +V IP V + F+G
Sbjct: 396 TVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGG 455
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
A + VD +GI+Y V ASQVCLA A + GI+GN Q K V YD +GF+
Sbjct: 456 AALDVDASGIMYTVS--ASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSP 513
Query: 479 EDC 481
C
Sbjct: 514 GAC 516
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 214/368 (58%), Gaps = 25/368 (6%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
PL SGI + +Y A I +G R++ ++ DTGSD++W+QC PC+ CY QQDP+F+PS+S
Sbjct: 2 PLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLS 61
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
S+K + C SS C L+ CS + C Y VSYGDGS+T G+ E L G+
Sbjct: 62 SSFKPLACASSICGKLKIKG-----CSRKN--KCMYQVSYGDGSFTVGDFSTETLSFGEH 114
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V GCGRNN+GLF G +GL+GLGR LS SQT + +FSYCLP + A A+ S
Sbjct: 115 AVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA-S 173
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KG 353
L+ G ++ K +T ++PN +L T+Y + L I + G + FA G
Sbjct: 174 LVFGPSAVPEK----ARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTG 229
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G+++DSGT I+RL Y+AL+ F + FPSAPG S+ DTC++LS+ + +P V +
Sbjct: 230 GVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKTATLPAVVL 288
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
+F+G A M + GI+ V D CLA A E+ IIGN QQ+ R+ D + Q
Sbjct: 289 DFDGGASMPLPADGILVNVD-DEGTYCLAFA--PEEEAFSIIGNVQQQTFRISIDNQKEQ 345
Query: 474 LGFAGEDC 481
+G A + C
Sbjct: 346 MGIAPDQC 353
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 167/449 (37%), Positives = 240/449 (53%), Gaps = 35/449 (7%)
Query: 59 KSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN---- 113
+SR T+ L H++ CS + RL D L Y+ ++
Sbjct: 54 ESRAPAVHATVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGA 113
Query: 114 -----IKDVSNTEIPLTSGIRLQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPC- 164
++ +P T G L TL Y+ T+ LG G++ T+++DTGSD++WV+C+PC
Sbjct: 114 GGDVVVQQSHAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCW 173
Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
+ C Q DP+FDPS+S +Y C+S+ C L F GN+ CSSS C Y YGDGS
Sbjct: 174 QQCRPQVDPLFDPSLSSTYSPFSCSSAACAQL-FQEGNANGCSSSG--QCQYIAMYGDGS 230
Query: 225 Y-TRGELGREHLGLGKAS----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSE 279
T G + L LG S V+ F FGC G+ G +GLMGLG SLVSQT+
Sbjct: 231 VGTTGTYSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAG 290
Query: 280 IFGG-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
FG FSYCLP T + SG L LG + +S T M+ + Q+ FY + L I
Sbjct: 291 TFGTTAFSYCLPPTPSS--SGFLTLGAAGT---SSAGFVKTPMLRSSQVPAFYGVRLEAI 345
Query: 339 SIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEF---LKQFSGFPSAPGFSIL 394
+GG+QL + G+++DSGTV+TRLPP+ YS+L + F +KQ+ PS+ G L
Sbjct: 346 RVGGRQLSIPTTVFSAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFL 405
Query: 395 DTCFNLSAYQEVNIPLVKMEFE--GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
DTCF++S V++P V + F G A + +D +GI+ +++ +S CLA + S + T
Sbjct: 406 DTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMET-SSIFCLAFVATSDDGST 464
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
GIIGN QQ+ +V+YD +GF C
Sbjct: 465 GIIGNVQQRTFQVLYDVAGGAVGFKAGAC 493
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 147/368 (39%), Positives = 219/368 (59%), Gaps = 24/368 (6%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSI 179
PL G + + NY + LG R ++IVDTGS L+W+QC+PC C+ Q DP+FDPS
Sbjct: 1 PLNPGASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSA 60
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S +YK + C SS C +L AT N+ +C +SS C Y SYGD SY+ G L ++ L L
Sbjct: 61 SKTYKSLSCTSSQCSSLVDATLNNPLCETSSN-VCVYTASYGDSSYSMGYLSQDLLTLAP 119
Query: 240 A-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
+ ++ F++GCG++++GLFG +G++GLGR+ LS++ Q S FG FSYCLP+ G
Sbjct: 120 SQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR---GGG 176
Query: 299 GSLILGGNS---SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGG 354
G L +G S S +K +T M +P + Y L LT I++GG+ L A+ +
Sbjct: 177 GFLSIGKASLAGSAYK------FTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP 230
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
+IDSGTVITRLP S+Y+ + F+K S + APGFSILDTCF + ++P V++
Sbjct: 231 TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRL 290
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F+G A++ + ++ ++ D CLA A + IIGN+QQ+ +V +D ++
Sbjct: 291 IFQGGADLNLRPVNVL--LQVDEGLTCLAFAG---NNGVAIIGNHQQQTFKVAHDISTAR 345
Query: 474 LGFAGEDC 481
+GFA C
Sbjct: 346 IGFATGGC 353
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 145/372 (38%), Positives = 203/372 (54%), Gaps = 21/372 (5%)
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
D S +PLT G NY+ + LG + ++VDTGS LTW+QC PC+ SC+ Q
Sbjct: 118 DGSLASVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSG 177
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
PVFDP S SY V C++ C+ L AT N CSSS C Y SYGD S++ G L +
Sbjct: 178 PVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSD--VCIYQASYGDSSFSVGYLSK 235
Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
+ + G SV +F +GCG++N+GLFG +GLMGL R+ LSL+ Q + G FSYCLPS+
Sbjct: 236 DTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSS 295
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--F 350
+G N +YT M+ + + Y + L+G+++ GK L S +
Sbjct: 296 SSSGYLSIGSY--------NPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEY 347
Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
+ +IDSGTVITRLP ++Y AL G A +SILDTCF + + +P
Sbjct: 348 SSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCF-VGQASSLRVPA 406
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V M F G A + + ++ V D+S CLA A IIGN QQ+ V+YD K
Sbjct: 407 VSMAFSGGAALKLSAQNLL--VDVDSSTTCLAFAP---ARSAAIIGNTQQQTFSVVYDVK 461
Query: 471 NSQLGFAGEDCS 482
++++GFA C+
Sbjct: 462 SNRIGFAAGGCT 473
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 204/338 (60%), Gaps = 11/338 (3%)
Query: 148 VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+I+DTGS L+W+QCQPC C+ Q DP++DPS+S +YKK+ C S C L+ AT N +C
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMG 265
+ S C Y SYGD S++ G L ++ L L + ++ F +GCG++N+GLFG +G++G
Sbjct: 61 ETDSNA-CLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIG 119
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
L R LS+++Q S +G FSYCLP+ + G + G+ S T +T M+ +
Sbjct: 120 LARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSIS----PTSYKFTPMLTDS 175
Query: 326 QLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS- 383
+ + Y L LT I++ G+ L A+ + LIDSGTVITRLP S+Y+AL+ F+K S
Sbjct: 176 KNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMST 235
Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
+ AP +SILDTCF S +P +KM F+G A++T+ I+ +++D CLA
Sbjct: 236 KYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSIL--IEADKGITCLAF 293
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A S ++ IIGN QQ+ + YD S++GFA C
Sbjct: 294 AGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 159/409 (38%), Positives = 223/409 (54%), Gaps = 39/409 (9%)
Query: 90 NRLILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELG 142
+ L D +Y+ R+ SG + +++ +P + G + TLNY+ T LG
Sbjct: 92 DTLRADQRRAEYILRRV----SGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLG 147
Query: 143 --GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
G T+ VDTGSDL+WVQC+PC SCY+Q+DP+FDP+ S SY V C C L
Sbjct: 148 TPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLG 207
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGL 256
++ + Y VSYGDGS T G + L L +S V F FGCG GL
Sbjct: 208 IYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGL 262
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
F GV GL+GLGR SLV QT+ +GG+FSYCLP+ +L +GG S
Sbjct: 263 FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG---F 319
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSAL 374
+ T ++P+P T+Y++ LTGIS+GG+QL AS FA G ++D+GTV+TRLPP+ Y+AL
Sbjct: 320 STTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDTGTVVTRLPPTAYAAL 378
Query: 375 KAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
++ F + G+P+AP ILDTC+N + Y V +P V + F A +T+ GI+ F
Sbjct: 379 RSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSF- 437
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + I+GN QQ++ V D + +GF C
Sbjct: 438 ------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 148/425 (34%), Positives = 219/425 (51%), Gaps = 48/425 (11%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNI-KDVSNTEIPLT 125
+L L H++ SG Q L+ DN V++L+ R+ S + +D+ + +P
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVP-- 121
Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
G+ + Y + +G + ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+
Sbjct: 122 -GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSF 180
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
V C S+ C L G + C+Y V+YGDGSYT+GEL E L LG +V
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQ 236
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
GCG N GLF G +GL+GLG +SLV Q GG+FSYCL S + AG +GSL
Sbjct: 237 GVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLA- 294
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
++FY + LTGI +GG++L Q + GG++
Sbjct: 295 ------------------------SSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 330
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+D+GT +TRLP Y+AL+ F P +P S+LDTC++LS Y V +P V F+
Sbjct: 331 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 390
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
A +T+ ++ V+ + CLA A S I+GN QQ+ ++ D+ N +GF
Sbjct: 391 QGAVLTLPARNLL--VEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGF 446
Query: 477 AGEDC 481
C
Sbjct: 447 GPNTC 451
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 159/405 (39%), Positives = 240/405 (59%), Gaps = 26/405 (6%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSN-------TEIPLTSGIRLQTLNYIATIELG--GRN 145
D V++L SR+ N S + ++ PL SG+ + + NY I +G +
Sbjct: 60 DEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGLSIGSGNYYVKIGVGTPAKY 119
Query: 146 MTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
++IVDTGS L+W+QCQPC C+ Q DP+F PS+S +YK + C+SS C +L+ +T N+
Sbjct: 120 FSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLNAP 179
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDFIFGCGRNNKGLFGGVSG 262
CS+++ C Y SYGD S++ G L ++ L L A + F++GCG++N+GLFG +G
Sbjct: 180 GCSNATG-ACVYKASYGDTSFSIGYLSQDVLTLTPSAAPSSGFVYGCGQDNQGLFGRSAG 238
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPST----QDAGASGSLILGGNSSVFKNSTPITY 318
++GL LS++ Q S +G FSYCLPS+ ++ SG L +G +S +S+P +
Sbjct: 239 IIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSGFLSIGASSL---SSSPYKF 295
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAE 377
T ++ NP++ + Y L LT I++ GK L S + +IDSGTVITRLP +IY+ALK
Sbjct: 296 TPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVAIYNALKKS 355
Query: 378 FLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
F+ S + APGFSILDTCF S + +P +++ F G A + + V + V+ +
Sbjct: 356 FVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNSL--VEIEK 413
Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA+A+ S + IIGNYQQ+ V YD NS++GFA C
Sbjct: 414 GTTCLAIAASS--NPISIIGNYQQQTFTVAYDVANSKIGFAPGGC 456
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 195/341 (57%), Gaps = 22/341 (6%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
M +++DTGSD+TW+QC PC CY QQD +F P+ S +YK + CNS+ C L+ S
Sbjct: 1 MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQ---SFSHS 57
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIFGCGRNNKGLFGGV 260
C +SS CNY VSYGD S TRG+ E L L SV +F FGCG NKGLF G
Sbjct: 58 CLNSS---CNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGA 114
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+GLMGLG+S + +QTS FG +FSYCLPS SG L G + + + + +T
Sbjct: 115 AGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYD---VRFTP 171
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
++ + + Y +++TGI++G + L S +++DSGTVI+R S Y L+ F +
Sbjct: 172 LVDSSSGPSQYFVSMTGINVGDELLPIS----ATVMVDSGTVISRFEQSAYERLRDAFTQ 227
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
G +A + DTCF +S ++NIPL+ + F +AE+ + I+Y V D +C
Sbjct: 228 ILPGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPV--DDGVMC 285
Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A A S ++GN+QQ+N R +YD S+LG + +C
Sbjct: 286 FAFAPSS--SGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 241 bits (614), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 144/405 (35%), Positives = 214/405 (52%), Gaps = 32/405 (7%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
DN +YL +R+ S +E + SG+ + Y+ + +G ++VD+
Sbjct: 133 DNARAEYLATRLSPAY--QPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDS 190
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
GSD+ WVQC+PC CY Q DP+FDP+ S ++ V C S+ C L + C
Sbjct: 191 GSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRILP-----TSACGDGELG 245
Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
C Y VSY DGSYT+G L E L LG +V + GCG N+GLF G +GLMGLG +S
Sbjct: 246 GCEYEVSYADGSYTKGALALETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMS 305
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGA------SGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
LV Q GG FSYCL S G+ +G L+LG + +V + + + ++ NP+
Sbjct: 306 LVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGA---VWVPLVRNPR 362
Query: 327 LATFYILNLTGISIGGKQ--LQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFL 379
+FY + L+GI +G ++ LQA F G +++D+GT +TRLP Y+AL+ F+
Sbjct: 363 APSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFV 422
Query: 380 KQFSG-FPSAPGF--SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
+G P A G S+LDTC++LS Y V +P V F+G+A + + + ++ D
Sbjct: 423 GALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFCFDGDARLILAARNV--LLEVDM 480
Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A S I+GN QQ ++ D+ N +GF +C
Sbjct: 481 GIYCLAFAPSS--SGLSIMGNTQQAGIQITVDSANGYIGFGPANC 523
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 149/369 (40%), Positives = 202/369 (54%), Gaps = 31/369 (8%)
Query: 125 TSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
+SG L T NY+ T+ LG TV+ DTGSD TWVQCQPC CY Q++ +FDP+ S
Sbjct: 170 SSGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSS 229
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
+Y V C + C L + G C Y V YGDGSY+ G + L L
Sbjct: 230 TYANVSCAAPACSDLNIHGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 281
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V F FGCG N+GLFG +GL+GLGR SL QT + +GG+F++CLP A ++G+
Sbjct: 282 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSTGT 337
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
L F + + + P L TFY + +TGI +GG+ L S FA G
Sbjct: 338 GYLD-----FGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392
Query: 355 ILIDSGTVITRLPPSIYSALK--AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
++DSGTVITRLPP+ YS+L+ G+ AP S+LDTC++ + +V IP V
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 452
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F+G A + VD +GI+Y + ASQVCLA A+ + GI+GN Q K V YD
Sbjct: 453 LLFQGGARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 510
Query: 473 QLGFAGEDC 481
+GF C
Sbjct: 511 VVGFYPGAC 519
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 150/364 (41%), Positives = 200/364 (54%), Gaps = 31/364 (8%)
Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
+SG L T NY+ T+ LG TV+ DTGSD TWVQCQPC CY QQ+ +FDP S
Sbjct: 168 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSS 227
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
+Y V C + C L + G C Y V YGDGSY+ G + L L
Sbjct: 228 TYANVSCAAPACSDLNIHGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 279
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+V F FGCG N+GLFG +GL+GLGR SL QT + +GG+F++CLP A ++G+
Sbjct: 280 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSTGT 335
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
L F +P + + P L TFY + +TGI +GG+ L S FA G
Sbjct: 336 GYLD-----FGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAG 390
Query: 355 ILIDSGTVITRLPPSIYSALK--AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
++DSGTVITRLPP YS+L+ G+ AP S+LDTC++ + +V IP V
Sbjct: 391 TIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 450
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F+G A + VD +GI+Y + ASQVCLA A+ + GI+GN Q K V YD
Sbjct: 451 LLFQGGARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 508
Query: 473 QLGF 476
+GF
Sbjct: 509 VVGF 512
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 139/367 (37%), Positives = 189/367 (51%), Gaps = 19/367 (5%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
IP ++G L TL ++ T+ G + TVI DTGSD++W+QC PC CY Q DP+FDP+
Sbjct: 122 IPDSTGTSLDTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPT 181
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +Y V C C A + CS+ + C Y V YGDGS + G L E L L
Sbjct: 182 KSATYSVVPCGHPQC-----AAADGSKCSNGT---CLYKVEYGDGSSSAGVLSHETLSLT 233
Query: 239 KA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
++ F FGCG+ N G FG V GL+GLGR LSL SQ + FGG FSYCLPS D
Sbjct: 234 STRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS--DNTT 291
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGI 355
G L +G + + + YT M+ +FY + L I IGG L F G
Sbjct: 292 HGYLTIGPTTPASNDD--VQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGT 349
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+DSGT++T LPP Y+AL+ F + + AP + DTC++ + + IP V +F
Sbjct: 350 FLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKF 409
Query: 416 EGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
+ + GI+ F A + CL + I+GN QQ+N VIYD ++
Sbjct: 410 SDGSVFDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKI 469
Query: 475 GFAGEDC 481
GFA C
Sbjct: 470 GFASASC 476
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 155/397 (39%), Positives = 233/397 (58%), Gaps = 17/397 (4%)
Query: 92 LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGR--NMTVI 149
L+ D L V+ + +R N +G+ +IP+ SGI L NY+ + LG ++++
Sbjct: 2 LLQDQLRVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLSLSLA 61
Query: 150 VDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
+DTGSD+TW QC+PC SCY Q FDP S SYK V C+SS+C + + G G SS
Sbjct: 62 LDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVSS 121
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLG 267
+ C Y V YGDGSY+ G E L + + V ++F+FGCG+ N G FG ++GL+GLG
Sbjct: 122 T----CIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLG 177
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
R LSL QTSE + LF+YCLPS + ++G L LGG + + +T + P +
Sbjct: 178 RGKLSLALQTSEKYNNLFTYCLPSFSSS-STGHLTLGG-----QVPKSVKFTPLSPAFKN 231
Query: 328 ATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
FY +++ G+S+GG L AS F+ G +IDSGTVITRL P++YSAL ++F + +
Sbjct: 232 TPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDY 291
Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
P GFSILDTC++ S + +++P + F+G E+ + GI+ + + +VCLA A
Sbjct: 292 PKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINA-WDKVCLAFAP 350
Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ + + GN QQ+ V++D ++GFA C+
Sbjct: 351 NDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGCN 387
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 151/383 (39%), Positives = 212/383 (55%), Gaps = 17/383 (4%)
Query: 107 KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
K SG +S+ IP + G + +L Y+ T+ +G TV++DTGSDL+WVQC+PC
Sbjct: 99 KAKASGRTTTLSDVSIPTSLGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPC 158
Query: 165 KS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
S CY Q+DP++DP+ S +Y V C+S C L + G +SS C Y + YG+
Sbjct: 159 NSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGN 218
Query: 223 GSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
T G E L L + SV DF FGCG +G F GL+GLG + SLVSQT+E +
Sbjct: 219 RDTTVGVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETY 278
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
GG FSYCLP +G L LG ++ ++ +T + P+ ATFY++NLTG+S+G
Sbjct: 279 GGAFSYCLPPGNS--TTGFLALGAPTN-NNDTAGFLFTPLHSLPEQATFYLVNLTGVSVG 335
Query: 342 GKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCF 398
GK L GG++IDSGT+IT LP + YSAL+ F S +P P +LDTC+
Sbjct: 336 GKPLDIPPTVLSGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCY 395
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
N + V +P V + F+G A + +DV V Q CLA A + + + GIIGN
Sbjct: 396 NFTGIANVTVPTVALTFDGGATIDLDVPSGVLI------QDCLAFAGGASDGDVGIIGNV 449
Query: 459 QQKNQRVIYDTKNSQLGFAGEDC 481
Q+ V+YD+ +GF C
Sbjct: 450 NQRTFEVLYDSGRGHVGFRPGAC 472
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 160/440 (36%), Positives = 230/440 (52%), Gaps = 41/440 (9%)
Query: 69 LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKN----------MISGNIK-- 115
L L H ++ CS + + + D+ + +L SR+ N ++ G+ K
Sbjct: 45 LTLHHPQSPCSPAPLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHRKKK 104
Query: 116 -------DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK- 165
S++ +PLT G + NY+ + LG + ++VDTGS LTW+QC PC
Sbjct: 105 AGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSV 164
Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
SC+ Q PVFDP S +Y V C+SS C L+ AT N CS S+ C Y SYGD SY
Sbjct: 165 SCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSN--VCIYQASYGDSSY 222
Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
+ G L ++ + G S F +GCG++N+GLFG +GL+GL ++ LSL+ Q + G F
Sbjct: 223 SVGYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAF 282
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
SYCLP++ + A+G L +G N +YT M + A+ Y + L+GIS+ G L
Sbjct: 283 SYCLPTS--SAAAGYLSIGS-----YNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPL 335
Query: 346 QA--SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLSA 402
S + +IDSGTVITRLPP++Y+AL + +SILDTCF SA
Sbjct: 336 AVPPSEYRSLPTIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSA 395
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
+ +P V M F G A + + ++ + D S CLA A T IIGN QQ+
Sbjct: 396 -AGLRVPRVDMAFAGGATLALSPGNVL--IDVDDSTTCLAFAP---TGGTAIIGNTQQQT 449
Query: 463 QRVIYDTKNSQLGFAGEDCS 482
V+YD S++GFA CS
Sbjct: 450 FSVVYDVAQSRIGFAAGGCS 469
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 159/431 (36%), Positives = 224/431 (51%), Gaps = 32/431 (7%)
Query: 69 LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV---------- 117
L L H ++ CS + + L D+ + L +R+ S +
Sbjct: 43 LTLHHPRSPCSPAPLPADVPFSAVLTHDHARIASLAARLAKTPSSRPTKLRRGSSSSPDA 102
Query: 118 -SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDP 173
S +PL G + NY+ + LG ++ ++VDTGS LTW+QC PC SC+ Q P
Sbjct: 103 ESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGP 162
Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE 233
VF+P S SY V C++ C AL AT N CS+S+ C Y SYGD S++ G L ++
Sbjct: 163 VFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSN--VCIYQASYGDSSFSVGYLSKD 220
Query: 234 HLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
+ G SV +F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G FSYCLP++
Sbjct: 221 TVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSS 280
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA 351
+ S N +YT M + + Y + +TGI++ GK L AS ++
Sbjct: 281 SSSGY-------LSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYS 333
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
+IDSGTVITRLP +YSAL G P A FSILDTCF A + +P V
Sbjct: 334 SLPTIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-SRLRVPQV 392
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
M F G A + + T ++ V D++ CLA A IIGN QQ+ V+YD KN
Sbjct: 393 SMAFAGGAALKLKATNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKN 447
Query: 472 SQLGFAGEDCS 482
S++GFA CS
Sbjct: 448 SKIGFAAGGCS 458
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 153/370 (41%), Positives = 209/370 (56%), Gaps = 28/370 (7%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK---SCYNQQDPVFD 176
+P + G + TLNY+ T LG G T+ VDTGSDL+WVQC+PC SCY+Q+DP+FD
Sbjct: 35 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFD 94
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
P+ S SY V C C L ++ + Y VSYGDGS T G + L
Sbjct: 95 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLT 149
Query: 237 LGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
L +S V F FGCG GLF GV GL+GLGR SLV QT+ +GG+FSYCLP+
Sbjct: 150 LSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 209
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG 353
+L +GG S + T ++P+P T+Y++ LTGIS+GG+QL AS FA
Sbjct: 210 AGYLTLGVGGPSGAAPG---FSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG- 265
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLV 411
G ++D+GTV+TRLPP+ Y+AL++ F + G+P+AP ILDTC+N + Y V +P V
Sbjct: 266 GTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNV 325
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ F A +T+ GI+ F CLA A + I+GN QQ++ V D
Sbjct: 326 ALTFGSGATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--G 376
Query: 472 SQLGFAGEDC 481
+ +GF C
Sbjct: 377 TSVGFKPSSC 386
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 158/409 (38%), Positives = 241/409 (58%), Gaps = 32/409 (7%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTE-----------IPLTSGIRLQTLNYIATIELG- 142
D V++L SR+ N S +++ + T+ PL SG+ + + NY I LG
Sbjct: 64 DEERVRFLHSRLTNKES--VRNSATTDKLRGGPSLVSTTPLKSGLSIGSGNYYVKIGLGT 121
Query: 143 -GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
+ ++IVDTGS L+W+QCQPC C+ Q DP+F PS S +YK + C+SS C +L+ +T
Sbjct: 122 PAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQCSSLKSST 181
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDFIFGCGRNNKGLFG 258
N+ CS+++ C Y SYGD S++ G L ++ L L +A + F++GCG++N+GLFG
Sbjct: 182 LNAPGCSNATG-ACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVYGCGQDNQGLFG 240
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA----SGSLILGGNSSVFKNST 314
SG++GL +S++ Q S+ +G FSYCLPS+ A SG L +G +S S+
Sbjct: 241 RSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIGASS---LTSS 297
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GILIDSGTVITRLPPSIYSA 373
P +T ++ N ++ + Y L+LT I++ GK L S + +IDSGTVITRLP ++Y+A
Sbjct: 298 PYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVAVYNA 357
Query: 374 LKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
LK F+ S + APGFSILDTCF S + +P +++ F G A + + + V
Sbjct: 358 LKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAHNSL--V 415
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ + CLA+A+ S + IIGNYQQ+ +V YD N ++GFA C
Sbjct: 416 EIEKGTTCLAIAASS--NPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 172/479 (35%), Positives = 258/479 (53%), Gaps = 28/479 (5%)
Query: 10 ILSLLLPLMVSL-FLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
LS+++ L V L + A+GA + K L + +Q SSSSCV K+ ++
Sbjct: 7 FLSMIIMLCVCLNWCFAEGAEKSDSGKVLDSYTIQVSSLFPSSSSCVPSSKASNTKSSLR 66
Query: 69 LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI 128
+ H CS D + D V+ + S++ + + + +TE+P SGI
Sbjct: 67 VVHMH-GACSHLSSDARVDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGI 125
Query: 129 RLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKK 185
L + NYI TI +G +++++ DTGSDLTW QC+PC SCY+Q++P F+PS S +Y+
Sbjct: 126 TLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQN 185
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-ND 244
V C+S C E S S +C Y + YGD S+T+G L +E L + V D
Sbjct: 186 VSCSSPMCEDAE----------SCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLED 235
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGCG NN+GLF GV+GL+GLG LSL +QT+ + +FSYCLPS + ++G L G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFT-SNSTGHLTFG 294
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS--GFAKGGILIDSGTV 362
S+ S T + P+ A Y +++ GIS+G K+L + F+ G +IDSGTV
Sbjct: 295 --SAGISESVKFTPISSFPS---AFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTV 349
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
TRLP +Y+ L++ F ++ S + S G+ + DTC++ + V P + F G+ +
Sbjct: 350 FTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVE 409
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+D +GI +K SQVCLA A +D I GN QQ V+YD ++GFA C
Sbjct: 410 LDGSGISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 160/461 (34%), Positives = 247/461 (53%), Gaps = 45/461 (9%)
Query: 36 KLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHK-------NYCSGKIVDWNE-Q 87
+ + H L+ S+ C K+ E G+ +L+L H+ + +NE
Sbjct: 32 RAYFHTLKISSLP-STEVCKESSKALNE-GSSSLKLVHRFGPCNPHRTSTAPASSFNEIL 89
Query: 88 QQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
++++L +D++ +Q+R ++ +++ + ++ +P ++ +YI + +G +
Sbjct: 90 RRDKLRVDSI----IQARRSMNLTSSVEHMKSS-VPFYGLSKITASDYIVNVGIGTPKKE 144
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
M +I DTGS L W QC+PCK+CY + PVFDP+ S S+K + C+S C ++
Sbjct: 145 MPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSSKLCQSIRQG------ 197
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGLFGGVSGL 263
CSS P C Y +Y D S + G L E + K + + GC G G SG+
Sbjct: 198 CSS---PKCTYLTAYVDNSSSTGTLATETISFSHLKYDFKNILIGCSDQVSGESLGESGI 254
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
MGL RS +SL SQT+ I+ LFSYC+PST G++G L GG +P++ T
Sbjct: 255 MGLNRSPISLASQTANIYDKLFSYCIPST--PGSTGHLTFGGKVPNDVRFSPVSKT---- 308
Query: 324 NPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
++ Y + +TGIS+GG++L AS F K IDSG V+TRLPP YSAL++ F +
Sbjct: 309 --APSSDYDIKMTGISVGGRKLLIDASAF-KIASTIDSGAVLTRLPPKAYSALRSVFREM 365
Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-C 440
G+P LDTC++ S Y V IP + + FEG EM +DV+GI++ V S+V C
Sbjct: 366 MKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVP--GSKVYC 423
Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
LA A L +DE I GN+QQK V++D ++GFA C
Sbjct: 424 LAFAEL--DDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 238 bits (607), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 163/426 (38%), Positives = 230/426 (53%), Gaps = 33/426 (7%)
Query: 65 GAITLELKHKNYCSGKIVDWNEQQ-QNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
G +T+ L H++ + N ++ L D L Y+ +R + ++G+ DV S+
Sbjct: 55 GVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYI-TRKYSGVNGSAGDVEGSDVT 113
Query: 122 IPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
+P T G L TL Y+ T+ +G + T+++DTGSD++WVQC+PC C++Q D +FDPS
Sbjct: 114 VPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSS 173
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S +Y C S+ C L G CSSS C Y V YGDGS G + L LG
Sbjct: 174 SSTYSAFSCTSAACAQLR----QRG-CSSS---QCQYTVKYGDGSTGSGTYSSDTLALGS 225
Query: 240 ASVNDFIFGCGRNNKG--LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
++V +F FGC ++ G L +GLMGLG SL +QT+ FG FSYCLP T G+
Sbjct: 226 STVENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPT--PGS 283
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
SG L LG ++S F TP+ + +P+ +Y + L I +GG+QL AS F+ G I
Sbjct: 284 SGFLTLGASTSGFVVKTPMLRSTQVPS-----YYGVLLQAIRVGGRQLNIPASAFSAGSI 338
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+ DSGT+ITRLP + YSAL + F +P A I DTCF+ S V+IP V + F
Sbjct: 339 M-DSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVF 397
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G A + + GI+ CLA A+ S + GIIGN QQ+ V+YD +G
Sbjct: 398 SGGAVVDLASDGIIL-------GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVG 450
Query: 476 FAGEDC 481
F C
Sbjct: 451 FKAGAC 456
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 172/479 (35%), Positives = 257/479 (53%), Gaps = 28/479 (5%)
Query: 10 ILSLLLPLMVSL-FLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
LS+++ L V L + A+GA + K L + +Q SSSSCV K+ ++
Sbjct: 7 FLSMIIMLCVCLNWCFAEGAEKSDSGKVLDSYTIQVSSLFPSSSSCVPSSKASNTKSSLR 66
Query: 69 LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI 128
+ H CS D + D V+ + S++ + + + +TE+P SGI
Sbjct: 67 VVHMH-GACSHLSSDARVDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGI 125
Query: 129 RLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKK 185
L + NYI TI +G +++++ DTGSDLTW QC+PC SCY+Q++P F+PS S +Y+
Sbjct: 126 TLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQN 185
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-ND 244
V C+S C E S S +C Y + YGD S+T+G L +E L + V D
Sbjct: 186 VSCSSPMCEDAE----------SCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSDVLED 235
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGCG NN+GLF GV+GL+GLG LSL +QT+ + +FSYCLPS + ++G L G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFT-SNSTGHLTFG 294
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS--GFAKGGILIDSGTV 362
S+ S T + P+ A Y +++ GIS+G K+L + F+ G +IDSGTV
Sbjct: 295 --SAGISESVKFTPISSFPS---AFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTV 349
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
TRLP +Y+ L++ F ++ S + S G+ + DTC++ + V P + F G +
Sbjct: 350 FTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVE 409
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+D +GI +K SQVCLA A +D I GN QQ V+YD ++GFA C
Sbjct: 410 LDGSGISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 237 bits (605), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 152/365 (41%), Positives = 206/365 (56%), Gaps = 28/365 (7%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISP 181
G + TLNY+ T LG G T+ VDTGSDL+WVQC+PC SCY+Q+DP+FDP+ S
Sbjct: 132 GYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSS 191
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
SY V C C L ++ + Y VSYGDGS T G + L L +S
Sbjct: 192 SYAAVPCGGPVCAGLGIYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLTLSASS 246
Query: 242 -VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
V F FGCG GLF GV GL+GLGR SLV QT+ +GG+FSYCLP+ +
Sbjct: 247 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLT 306
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILID 358
L +GG S + T ++P+P T+Y++ LTGIS+GG+QL AS FA G ++D
Sbjct: 307 LGVGGPSGAAPG---FSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVD 362
Query: 359 SGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+GTV+TRLPP+ Y+AL++ F + G+P+AP ILDTC+N + Y V +P V + F
Sbjct: 363 TGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFG 422
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
A +T+ GI+ F CLA A + I+GN QQ++ V D + +GF
Sbjct: 423 SGATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGF 473
Query: 477 AGEDC 481
C
Sbjct: 474 KPSSC 478
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 166/453 (36%), Positives = 249/453 (54%), Gaps = 46/453 (10%)
Query: 58 QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLI----LDNLHVQYLQSRIKNMISGN 113
Q S + G ++LEL H+N + + + L+ D V++++S K ++G
Sbjct: 47 QLSPRDGGTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIES--KAQLAGK 104
Query: 114 IKD-VSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCY 168
KD S+T++ P+TSG+ + Y + +G R++ ++VDTGSDL W+QCQPCKSCY
Sbjct: 105 KKDEASSTDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCY 164
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEF--ATGNSGVCSSSSPPDCNYFVSYGDGSYT 226
Q DP+FDP S S++++ C S C ALE +G+ G S C+Y V+YGDGS++
Sbjct: 165 KQADPIFDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSR-----CSYQVAYGDGSFS 219
Query: 227 RGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ-----TSEI 280
G+ + LG S FGCG +N+GLF G +GL+GLG LS SQ T+
Sbjct: 220 VGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSS 279
Query: 281 FGGLFSYCL-----PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
FSYCL P T+ +S SLI G + + + ++ NP+L TFY +
Sbjct: 280 TANSFSYCLVDRSNPMTR---SSSSLIFGAAAI----PSTAALSPLLKNPKLDTFYYAAM 332
Query: 336 TGISIGGKQ-------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
G+S+GG Q LQ S GG++IDSGT +TR P S+Y+ ++ F + PSA
Sbjct: 333 IGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSA 392
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
P +S+ DTC+N S V++P + + FE A++ + T + + + A CLA A S
Sbjct: 393 PRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINT-AGSFCLAFAPTSM 451
Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
E GIIGN QQ++ R+ +D + S L FA + C
Sbjct: 452 --ELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 482
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 153/389 (39%), Positives = 219/389 (56%), Gaps = 35/389 (8%)
Query: 108 NMISGNIK--DVSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
+++ N+ + S EI P+ SG+ L + Y + + +G R + +++DTGSD+TWVQC
Sbjct: 132 DLVPANVTAFEASAAEIQGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQC 191
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
QPC CY Q DPVFDPS+S SY V C++ CH L+ A C +S+ C Y V+YG
Sbjct: 192 QPCADCYQQSDPVFDPSLSTSYASVACDNPRCHDLDAA-----ACRNSTGA-CLYEVAYG 245
Query: 222 DGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
DGSYT G+ E L LG A V+ GCG +N+GLF G +GL+ LG LS SQ S
Sbjct: 246 DGSYTVGDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT 305
Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
FSYCL +D+ +S +L G+++ + + P +I +P+ +TFY + L+GIS+
Sbjct: 306 ---TFSYCL-VDRDSPSSSTLQF-GDAADAEVTAP-----LIRSPRTSTFYYVGLSGISV 355
Query: 341 GGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
GG+ L S FA GG+++DSGT +TRL S Y+AL+ F++ P G S+
Sbjct: 356 GGQILSIPPSAFAMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL 415
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDET 452
DTC++LS V +P V + F G E+ + Y + D A CLA A +
Sbjct: 416 FDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKN--YLIPVDGAGTYCLAFAPTNA--AV 471
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ RV +DT S +GF C
Sbjct: 472 SIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 158/402 (39%), Positives = 234/402 (58%), Gaps = 24/402 (5%)
Query: 95 DNLHVQYLQSRIKNMISGN--IKDVSN--TEIPLTSGIRLQTLNYIATIELGG--RNMTV 148
D ++Y SR+ N K V IPL SG+ + + NY + LG + T+
Sbjct: 59 DEERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTM 118
Query: 149 IVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
IVDTGS +W+QCQPC C+ Q+DPVF+PS S +YK V C+SS C +L+ AT N CS
Sbjct: 119 IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCS 178
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGL 266
S C Y SYGD S++ G L ++ L L + +++ F++GCG++N+GLFG G++GL
Sbjct: 179 KQSN-ACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDGIIGL 237
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
++LS++SQ S +G FSYCLP ST ++ G L +G +S S+ +T ++
Sbjct: 238 ANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG--TSSLTPSSSYKFTPLLK 295
Query: 324 NPQLATFYILNLTGISIGGKQL-QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
NP + Y ++L I++ G+ L A+ K +IDSGTVITRLP +Y+ LK ++
Sbjct: 296 NPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTIL 355
Query: 383 S-GFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
S + APG S+LDTCF +L+ EV P +++ F+G A++ + G V+ +
Sbjct: 356 SKKYQQAPGISLLDTCFKGSLAGISEV-APDIRIIFKGGADL--QLKGHNSLVELETGIT 412
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA+A S IIGNYQQ+ +V YD NS++GFA C
Sbjct: 413 CLAMAGSS---SIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 158/402 (39%), Positives = 234/402 (58%), Gaps = 24/402 (5%)
Query: 95 DNLHVQYLQSRIKNMISGNI--KDVSN--TEIPLTSGIRLQTLNYIATIELGG--RNMTV 148
D ++Y SR+ N K V IPL SG+ + + NY + LG + T+
Sbjct: 59 DEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTM 118
Query: 149 IVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
IVDTGS +W+QCQPC C+ Q+DPVF+PS S +YK V C+SS C +L+ AT N CS
Sbjct: 119 IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCS 178
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGL 266
S C Y SYGD S++ G L ++ L L + +++ F++GCG++N+GLFG G++GL
Sbjct: 179 KQSN-ACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDGIIGL 237
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
++LS++SQ S +G FSYCLP ST ++ G L +G +S S+ +T ++
Sbjct: 238 ANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG--TSSLTPSSSYKFTPLLK 295
Query: 324 NPQLATFYILNLTGISIGGKQL-QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
NP + Y ++L I++ G+ L A+ K +IDSGTVITRLP +Y+ LK ++
Sbjct: 296 NPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTIL 355
Query: 383 S-GFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
S + APG S+LDTCF +L+ EV P +++ F+G A++ + G V+ +
Sbjct: 356 SKKYQQAPGISLLDTCFKGSLAGISEV-APDIRIIFKGGADL--QLKGHNSLVELETGIT 412
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA+A S IIGNYQQ+ +V YD NS++GFA C
Sbjct: 413 CLAMAGSS---SIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 158/435 (36%), Positives = 228/435 (52%), Gaps = 30/435 (6%)
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIK--NMISGNIKDVSN--TE 121
++ L +H RL D Y+ ++ + + D + T
Sbjct: 18 SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 77
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDP 177
IP G + +L Y+ T+ +G TV++DTGSDL+WVQC+PC + CY Q+DP+FDP
Sbjct: 78 IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 137
Query: 178 SISPSYKKVLCNSSTCHALE---FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
S S SY V C+S C L + G +GV S + C Y + YG+ + T G E
Sbjct: 138 SSSSSYASVPCDSDACRKLAAGAYGHGCTGV-SGGAAALCEYGIEYGNRATTTGVYSTET 196
Query: 235 LGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
L L V DF FGCG + G + GL+GLG + SLVSQTS FGG FSYCLP T
Sbjct: 197 LTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPT- 255
Query: 294 DAGASGSLILGG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASG 349
+G +G L LG NSS ++ +++T M P + TFYI+ LTGIS+GG L S
Sbjct: 256 -SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSA 314
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEV 406
F+ G++IDSGTVIT LP + Y+AL++ F + ++ P + G +LDTC++ + + V
Sbjct: 315 FSS-GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHANV 372
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P + + F G A T+D+ + CLA A ++ GIIGN Q+ V+
Sbjct: 373 TVPTISLTFSGGA--TIDLAAPAGVLVDG----CLAFAGAGTDNAIGIIGNVNQRTFEVL 426
Query: 467 YDTKNSQLGFAGEDC 481
YD+ +GF C
Sbjct: 427 YDSGKGTVGFRAGAC 441
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 157/430 (36%), Positives = 221/430 (51%), Gaps = 32/430 (7%)
Query: 69 LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV---------- 117
LEL H ++ CS V + L D+ + L +R+ S +
Sbjct: 45 LELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADAGLAG 104
Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPV 174
S +PL+ G + NY+ + LG ++VDTGS LTW+QC PC SC+ Q PV
Sbjct: 105 SLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPV 164
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
F+P S +Y V C++ C L AT N CSSS+ C Y SYGD S++ G L ++
Sbjct: 165 FNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSN--VCIYQASYGDSSFSVGYLSKDT 222
Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
+ G S+ +F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G F+YCLPS+
Sbjct: 223 VSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSS 282
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG 354
+G S N +YT M+ + + Y + L+G+++ G L S A
Sbjct: 283 SGYL--------SLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSS 334
Query: 355 I--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
+ +IDSGTVITRLP S+YSAL G A +SILDTCF A V+ P V
Sbjct: 335 LPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQA-SRVSAPAVT 393
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
M F G A + + ++ V D S CLA A IIGN QQ+ V+YD K+S
Sbjct: 394 MSFAGGAALKLSAQNLL--VDVDDSTTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKSS 448
Query: 473 QLGFAGEDCS 482
++GFA CS
Sbjct: 449 RIGFAAGGCS 458
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 167/408 (40%), Positives = 230/408 (56%), Gaps = 39/408 (9%)
Query: 92 LILDNLHVQYLQSRIKNMIS-GNIKDVS------NTEIPLTSGIRLQTLNYIATIELG-- 142
L D +Y+Q R+ G ++ + + IP G + TL Y+ T+ LG
Sbjct: 450 LRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKSVTIPANIGHSIGTLQYVVTVSLGTP 509
Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYN--QQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
G TV VDTGSD++WVQC PC + Q+D +FDP+ S SY V C + C E +T
Sbjct: 510 GVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPCAADACS--ELST 567
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGG 259
G C++ S C Y VSYGDGS T G G + L L A +V F+FGCG GLF G
Sbjct: 568 YGHG-CAAGS--QCGYVVSYGDGSNTTGVYGSDTLTLTDADAVTGFLFGCGHAQAGLFAG 624
Query: 260 VSGLMGLGRSDLSLVSQTSEIF-GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
+ GL+ LGR +SL SQTS + GG+FSYCLP + ++G L LGG SS +T
Sbjct: 625 IDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS--STGFLTLGGPSSASGFAT---- 678
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSGTVITRLPPSIYSALK 375
T ++ + TFY++ LTGI +GG+QL AS FA GG ++D+GTVITRLPP+ Y+AL+
Sbjct: 679 TGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFA-GGTVVDTGTVITRLPPTAYAALR 737
Query: 376 AEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
A F + G+P+AP ILDTC+N + Y V +P V + F G A + +D G +
Sbjct: 738 AAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFSGGATLKLDAPGFL---- 793
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S CLA A+ S + + I+GN QQ++ V +D S +GF C
Sbjct: 794 ---SSGCLAFATNSGDGDPAILGNVQQRSFAVRFD--GSSVGFMPHSC 836
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 156/395 (39%), Positives = 221/395 (55%), Gaps = 22/395 (5%)
Query: 95 DNLHVQYLQSRIKNMISGNIKD-VSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVD 151
D + L SR+ KD V+ + +PL SG + NYI + LG T ++VD
Sbjct: 71 DAARIAGLASRLAT----KDKDWVAASSVPLASGASVGVGNYITRLGLGTPTTTYVMVVD 126
Query: 152 TGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
+GS LTW+QC PC SC+ Q P++DP S +Y V C++ C L+ AT N CS S
Sbjct: 127 SGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCAELQAATLNPSSCSGSG 186
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
C Y SYGDGS++ G L ++ + L + S F +GCG++N GLFG +GL+GL R+
Sbjct: 187 --VCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGRAAGLIGLARN 244
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
LSL+SQ + G F+YCLP T A ++G L G NS KN +YT+M+ + A+
Sbjct: 245 KLSLLSQLAPSVGNSFAYCLP-TSAAASAGYLSFGSNSDN-KNPGKYSYTSMVSSSLDAS 302
Query: 330 FYILNLTGISIGGKQLQASGFAKGGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
Y ++L G+S+ G L G + +IDSGTVITRLP +Y+AL ++ + PS
Sbjct: 303 LYFVSLAGMSVAGSPLAVPSSEYGSLPTIIDSGTVITRLPTPVYTAL-SKAVGAALAAPS 361
Query: 388 APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
AP +SIL TCF ++ +P V M F G A + + ++ V + + CLA A
Sbjct: 362 APAYSILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVL--VDVNETTTCLAFAP-- 416
Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
D T IIGN QQ+ V+YD K S++GFA CS
Sbjct: 417 -TDSTAIIGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 156/411 (37%), Positives = 223/411 (54%), Gaps = 30/411 (7%)
Query: 90 NRLILDNLHVQYLQSRIK--NMISGNIKDVSN--TEIPLTSGIRLQTLNYIATIELG--G 143
RL D Y+ ++ + + D + T IP G + +L Y+ T+ +G
Sbjct: 122 ERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPA 181
Query: 144 RNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALE---F 198
TV++DTGSDL+WVQC+PC + CY Q+DP+FDPS S SY V C+S C L +
Sbjct: 182 VQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAY 241
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLF 257
G +GV S + C Y + YG+ + T G E L L V DF FGCG + G +
Sbjct: 242 GHGCTGV-SGGAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPY 300
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG--NSSVFKNSTP 315
GL+GLG + SLVSQTS FGG FSYCLP T +G +G L LG NSS ++
Sbjct: 301 EKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPT--SGGAGFLTLGAPPNSSSSTAASG 358
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSA 373
+++T M P + TFYI+ LTGIS+GG L S F+ G++IDSGTVIT LP + Y+A
Sbjct: 359 LSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSS-GMVIDSGTVITGLPATAYAA 417
Query: 374 LKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
L++ F + ++ P + G +LDTC++ + + V +P + + F G A T+D+
Sbjct: 418 LRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHANVTVPTISLTFSGGA--TIDLAAPAG 474
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ CLA A ++ GIIGN Q+ V+YD+ +GF C
Sbjct: 475 VLVDG----CLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 152/389 (39%), Positives = 219/389 (56%), Gaps = 35/389 (8%)
Query: 108 NMISGNIK--DVSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
+++ N+ + S EI P+ SG+ L + Y + + +G R + +++DTGSD+TWVQC
Sbjct: 136 DLVPANVTAFEASAAEIQGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQC 195
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
QPC CY Q DPVFDPS+S SY V C++ CH L+ A C +S+ C Y V+YG
Sbjct: 196 QPCADCYQQSDPVFDPSLSTSYASVACDNPRCHDLDAA-----ACRNSTGA-CLYEVAYG 249
Query: 222 DGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
DGSYT G+ E L LG A V+ GCG +N+GLF G +GL+ LG LS SQ S
Sbjct: 250 DGSYTVGDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT 309
Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
FSYCL +D+ +S +L G+++ + + P +I +P+ +TFY + L+G+S+
Sbjct: 310 ---TFSYCL-VDRDSPSSSTLQF-GDAADAEVTAP-----LIRSPRTSTFYYVGLSGLSV 359
Query: 341 GGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
GG+ L S FA GG+++DSGT +TRL S Y+AL+ F++ P G S+
Sbjct: 360 GGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL 419
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDET 452
DTC++LS V +P V + F G E+ + Y + D A CLA A +
Sbjct: 420 FDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKN--YLIPVDGAGTYCLAFAPTNA--AV 475
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ RV +DT S +GF C
Sbjct: 476 SIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 155/423 (36%), Positives = 225/423 (53%), Gaps = 39/423 (9%)
Query: 81 IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
+ D + L D+ V+ + R+ D + T IP + G+ +L Y+ TI
Sbjct: 78 VPDHHPHYTGILRRDHNRVRSIHRRLTGA-----GDTAAT-IPASLGLAFHSLEYVVTIG 131
Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
+G RN TV+ DTGSDLTWVQC+PC SCY QQ+P+FDPS S +Y V C + C +
Sbjct: 132 IGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTPQC---K 188
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRN-NK 254
G C ++ C Y V YGD S TRG L +E L ++ +FGC +
Sbjct: 189 IGGGQDLTCGGTT---CEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHEYSS 245
Query: 255 GLFGG-----VSGLMGLGRSDLSLVSQTSE-IFGGLFSYCLPSTQDAGASGSLILGGNSS 308
G+ G V+GL+GLGR D S++SQT G +FSYCLP ++G L +G +
Sbjct: 246 GVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPR--GSSAGYLTIGAAAP 303
Query: 309 VFKNSTPITYTNMIP-NPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITR 365
N +++T ++ N QL++ Y++NL GIS+ G L AS F G + IDSGTVIT
Sbjct: 304 PQSN---LSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTV-IDSGTVITH 359
Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
+P + Y L+ EF + G+ P + LDTC++++ + V P V +EF G A + V
Sbjct: 360 MPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGARIDV 419
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDET----GIIGNYQQKNQRVIYDTKNSQLGFAGE 479
D +GI+ DAS L LA L++ IIGN QQ+ V++D + ++GF
Sbjct: 420 DASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGAN 479
Query: 480 DCS 482
CS
Sbjct: 480 GCS 482
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 145/370 (39%), Positives = 210/370 (56%), Gaps = 26/370 (7%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G L T NY+A++ LG + V +DTGSD +WVQC+PC CY Q+DPVFDP+ S +Y
Sbjct: 131 GKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYS 190
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA---- 240
V C + C L ++ + S ++ +C Y VSY D S+T G+L R+ L L +
Sbjct: 191 AVPCGARECQELASSSSSRNCSSDNN-KNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPS 249
Query: 241 ---SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
+V F+FGCG +N G FG V GL+GLG SL SQ + +G FSYCLPS+ A
Sbjct: 250 PADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS--A 307
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-KGG 354
+G L GG ++ +T M+ Q T Y LNLTGI + G+ ++ AS FA G
Sbjct: 308 AGYLSFGGAAARAN----AQFTEMVTG-QDPTSYYLNLTGIVVAGRAIKVPASAFATAAG 362
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
+IDSGT +RLPPS Y+AL++ F + AP I DTC++ + ++ V IP V+
Sbjct: 363 TIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVE 422
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F A + + +G++Y +D +Q CLA + GI+GN QQ+ VIYD +
Sbjct: 423 LVFADGATVHLHPSGVLY-TWNDVAQTCLAFVP---NHDLGILGNTQQRTLAVIYDVGSQ 478
Query: 473 QLGFAGEDCS 482
++GF + C+
Sbjct: 479 RIGFGRKGCA 488
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 156/454 (34%), Positives = 239/454 (52%), Gaps = 48/454 (10%)
Query: 58 QKSRIEMGAITLELKHKNYCSGKI-----VDWNEQQQNRLILDNLHVQYLQSRIKNMISG 112
++ +E+ ++ L H++ G + + E+ Q RL D V + SR++ ++G
Sbjct: 50 KEETLEIKPWSIPLVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLELAVNG 109
Query: 113 NIKDV-------------SNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLT 157
+ S+ + P+ SG+ + Y + I +G R+ +++DTGSD+T
Sbjct: 110 IKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVT 169
Query: 158 WVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
W+QC+PC CY Q DP+++P++S SYK V C ++ C L+ V S C Y
Sbjct: 170 WIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANLCQQLD-------VSGCSRNGSCLYQ 222
Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
VSYGDGSYT+G E L LG A + + GCG +N+GLF G +GL+GLG LS SQ
Sbjct: 223 VSYGDGSYTQGNFATETLTLGGAPLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQL 282
Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP--ITYTNMIPNPQLATFYILNL 335
++ G +FSYCL +D+ +S +L G + + P M+ N +L TFY ++L
Sbjct: 283 TDENGKIFSYCLVD-RDSESSSTLQFG------RAAVPNGAVLAPMLKNSRLDTFYYVSL 335
Query: 336 TGISIGGKQLQAS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
+GIS+GGK L S GG+++DSGT +TRL + Y +L+ F PS
Sbjct: 336 SGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPST 395
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLS 447
G S+ DTC++LS+ + V++P V F G M++ Y V D+ C A A S
Sbjct: 396 DGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAKN--YLVPVDSMGTFCFAFAPTS 453
Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I+GN QQ+ RV +D N+Q+GFA C
Sbjct: 454 --SSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 234 bits (598), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 148/428 (34%), Positives = 220/428 (51%), Gaps = 34/428 (7%)
Query: 69 LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD----VSNTEIPL 124
L L H++ S + +R+ D + V L R+ + +KD V+N +
Sbjct: 74 LNLLHRDKLS-HVHGHRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDV 132
Query: 125 TSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
SG+ + Y I +G RN +++D+GSD+ WVQC+PC CY Q DPVFDP+ S S
Sbjct: 133 ISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSS 192
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV 242
+ V C S C LE N+G C Y VSYGDGSYT+G L E L +G+ +
Sbjct: 193 FAGVSCGSDVCDRLENTGCNAG--------RCRYEVSYGDGSYTKGTLALETLTVGQVMI 244
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
D GCG N+G+F G +GL+GLG +S + Q GG FSYCL S + G++G+L
Sbjct: 245 RDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVS-RGTGSTGALE 303
Query: 303 LGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGG-------KQLQASGFAKG 353
G + + P+ T+ ++I NP+ +FY + L GI +GG + Q + +
Sbjct: 304 FG------RGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTN 357
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G+++D+GT +TR P + Y A + F Q S P APG SI DTC++L+ ++ V +P V
Sbjct: 358 GVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSF 417
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F +T+ + V + CLA A IIGN QQ+ ++ +D N
Sbjct: 418 YFSDGPVLTLPARNFLIPVDGGGT-FCLAFA--PSPSGLSIIGNIQQEGIQISFDGANGF 474
Query: 474 LGFAGEDC 481
+GF C
Sbjct: 475 VGFGPNIC 482
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 149/381 (39%), Positives = 212/381 (55%), Gaps = 33/381 (8%)
Query: 114 IKDVSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
+ + S EI P+ SG+ + Y + + +G R + +++DTGSD+TW+QCQPC CY
Sbjct: 140 VFEASAAEIQGPVVSGVGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYA 199
Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
Q DPV+DPS+S SY V C+S C L+ A C +S+ C Y V+YGDGSYT G+
Sbjct: 200 QSDPVYDPSVSTSYATVGCDSPRCRDLDAA-----ACRNST-GSCLYEVAYGDGSYTVGD 253
Query: 230 LGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
E L LG A V++ GCG +N+GLF G +GL+ LG LS SQ S FSYC
Sbjct: 254 FATETLTLGDSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT---TFSYC 310
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-- 346
L +D+ +S +L G + P +I +P+ TFY + L+GIS+GG+ L
Sbjct: 311 L-VDRDSPSSSTLQFG------DSEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIP 363
Query: 347 ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
+S FA GG+++DSGT +TRL Y AL+ F++ P A G S+ DTC++L+
Sbjct: 364 SSAFAMDDAGSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLA 423
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQ 460
V +P V + FEG E+ + Y + DA+ CLA A S IIGN QQ
Sbjct: 424 GRSSVQVPAVALWFEGGGELKLPAKN--YLIPVDAAGTYCLAFAGTS--GPVSIIGNVQQ 479
Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
+ RV +DT + +GF + C
Sbjct: 480 QGVRVSFDTAKNTVGFTADKC 500
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 156/461 (33%), Positives = 231/461 (50%), Gaps = 41/461 (8%)
Query: 43 QWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYL 102
++ ++ S+S + +R ++ L +H RL D Y+
Sbjct: 24 SFEPEAACSTSSANSDPNR---ASVPLVHRHGPCAPSAASGGKPSLAERLRRDRARANYI 80
Query: 103 QSR----------IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIV 150
++ + + + G T IP G + +L Y+ T+ +G V++
Sbjct: 81 VTKAAGGRTAATAVSDAVGGG-----GTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLI 135
Query: 151 DTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
DTGSDL+WVQC+PC + CY Q+DP+FDPS S SY V C+S C L G C+S
Sbjct: 136 DTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHG-CTS 194
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLG 267
+ C Y + YG+ + T G E L L V DF FGCG + G + GL+GLG
Sbjct: 195 GAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLG 254
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP--ITYTNMIPNP 325
+ SLVSQTS FGG FSYCLP T +G +G L LG +S ++ +T M P
Sbjct: 255 GAPESLVSQTSSQFGGPFSYCLPPT--SGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIP 312
Query: 326 QLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEF---LK 380
+ TFY++ LTGIS+GG L S F+ G++IDSGTVIT LP + Y+AL++ F +
Sbjct: 313 SVPTFYVVTLTGISVGGAPLAVPPSAFSS-GMVIDSGTVITGLPATAYAALRSAFRSAMS 371
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
++ P + G ++LDTC++ + + V +P + + F G A + + V C
Sbjct: 372 EYRLLPPSNG-AVLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVLV------DGC 424
Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
LA A +D GIIGN Q+ V+YD+ +GF C
Sbjct: 425 LAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 159/412 (38%), Positives = 232/412 (56%), Gaps = 42/412 (10%)
Query: 95 DNLHVQYLQSRIKNMISGNIKD-VSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVI 149
D V++++S+ K ++G KD S+T++ P+TSG+ + Y + LG R++ ++
Sbjct: 13 DERRVRWIESKAK--LAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPARSLFMV 70
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF--ATGNSGVCS 207
VDTGSDL W+QCQPCKSCY Q DP+FDP S S++++ C S C ALE +G+ G S
Sbjct: 71 VDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEVHSCSGSRGATS 130
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGL 266
C+Y V+YGDGS++ G+ + LG S FGCG +N+GLF G +GL+GL
Sbjct: 131 R-----CSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGL 185
Query: 267 GRSDLSLVSQ-----TSEIFGGLFSYCL-----PSTQDAGASGSLILGGNSSVFKNSTPI 316
G LS SQ T+ FSYCL P T+ +S SLI G V +
Sbjct: 186 GAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTR---SSSSLIFG----VAAIPSTA 238
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQ-------LQASGFAKGGILIDSGTVITRLPPS 369
+ ++ NP+L TFY + G+S+GG Q LQ S GG++IDSGT +TR P S
Sbjct: 239 ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTS 298
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
+Y+ ++ F PSAP +S+ DTC+N S V++P + + FE A++ + T +
Sbjct: 299 VYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYL 358
Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ + A CLA A S E GIIGN QQ++ R+ +D + S L FA + C
Sbjct: 359 IPINT-AGSFCLAFAPTSM--ELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 174/492 (35%), Positives = 252/492 (51%), Gaps = 36/492 (7%)
Query: 7 PLTILSLLLPLMVSLFLLA-----KGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSR 61
PL+ +SL L V L LL K EGK+ + ++ + + S V Q +R
Sbjct: 5 PLSPISLTFILYVFLVLLCPLCSLKKGLTVEGKETTK-NYIRTVRVNSLLPSNVCSQSTR 63
Query: 62 IEMGAITLELKHK-NYC---SGKIVDWNEQQQNRLIL-DNLHVQYLQSRIK-NMISGNIK 115
+ A +L++ +K C +G N +L D L V+ Q R+ N SG K
Sbjct: 64 VLNRASSLKVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQVRLSMNPSSGVFK 123
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQD 172
++ T IP + I Y+ T+ LG ++ T+ DTGSDLTW QC+PC C+ Q
Sbjct: 124 EMQTT-IP--ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQ 180
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
P FDP+ S SYK V C+S C + + C S++ C Y + YG G YT G L
Sbjct: 181 PKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNT---CLYGIQYGSG-YTIGFLAT 236
Query: 233 EHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
E L + + V +F+FGC ++G F G +GL+GLGRS ++L SQT+ + LFSYCLP+
Sbjct: 237 ETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPA 296
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
+ ++G L G S STPI+ P+L Y LN GIS+ G++L +G +
Sbjct: 297 SPS--STGHLSFGVEVSQAAKSTPIS-------PKLKQLYGLNTVGISVRGRELPING-S 346
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS--AYQEVNIP 409
+IDSGT T LP YSAL + F + + + G S C++ S + IP
Sbjct: 347 ISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIP 406
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ + FEG E+ +DV+GI+ V + +VCLA A + + I GNYQQK VIYD
Sbjct: 407 GISIFFEGGVEVEIDVSGIMIPV-NGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDV 465
Query: 470 KNSQLGFAGEDC 481
+GFA + C
Sbjct: 466 AKGMVGFAPKGC 477
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 143/371 (38%), Positives = 203/371 (54%), Gaps = 30/371 (8%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SG+ + Y + + +G R + +++DTGSD+TWVQCQPC CY Q DPVFDPS+S
Sbjct: 157 PVVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 216
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
SY V C+S C L+ A + + C Y V+YGDGSYT G+ E L LG +
Sbjct: 217 ASYAAVSCDSPRCRDLDTAACRNATGA------CLYEVAYGDGSYTVGDFATETLTLGDS 270
Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+ V + GCG +N+GLF G +GL+ LG LS SQ I FSYCL +D+ A+
Sbjct: 271 TPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCL-VDRDSPAAS 326
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA------ 351
+L G + + T ++ +P+ TFY + L+GIS+GG+ L +S FA
Sbjct: 327 TLQFGADGAEADTVT----APLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSG 382
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
GG+++DSGT +TRL S Y+AL+ F++ P G S+ DTC++LS V +P V
Sbjct: 383 SGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 442
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
+ FEG + + Y + D A CLA A + IIGN QQ+ RV +DT
Sbjct: 443 SLRFEGGGALRLPAKN--YLIPVDGAGTYCLAFAPTNA--AVSIIGNVQQQGTRVSFDTA 498
Query: 471 NSQLGFAGEDC 481
+GF C
Sbjct: 499 KGVVGFTPNKC 509
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 143/371 (38%), Positives = 204/371 (54%), Gaps = 30/371 (8%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SG+ + Y + + +G R + +++DTGSD+TWVQCQPC CY Q DPVFDPS+S
Sbjct: 154 PVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 213
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
SY V C+S C L+ A + + C Y V+YGDGSYT G+ E L LG +
Sbjct: 214 ASYAAVSCDSQRCRDLDTAACRNATGA------CLYEVAYGDGSYTVGDFATETLTLGDS 267
Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+ V + GCG +N+GLF G +GL+ LG LS SQ I FSYCL +D+ A+
Sbjct: 268 TPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCL-VDRDSPAAS 323
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA------ 351
+L G ++ T ++ +P+ +TFY + L+GIS+GG+ L AS FA
Sbjct: 324 TLQFGDGAAEAGTVT----APLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSG 379
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
GG+++DSGT +TRL + Y+AL+ F++ P G S+ DTC++LS V +P V
Sbjct: 380 SGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 439
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
+ FEG + + Y + D A CLA A + IIGN QQ+ RV +DT
Sbjct: 440 SLRFEGGGALRLPAKN--YLIPVDGAGTYCLAFAPTNA--AVSIIGNVQQQGTRVSFDTA 495
Query: 471 NSQLGFAGEDC 481
+GF C
Sbjct: 496 RGAVGFTPNKC 506
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 142/369 (38%), Positives = 203/369 (55%), Gaps = 25/369 (6%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+TSG+ + Y + +G + +++DTGSD+ W+QC PCKSCY Q D VFDP S
Sbjct: 3 VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S++++ C++ C L+ C+S+ C Y VSYGDGS+T G+L + + +
Sbjct: 63 SFRRLSCSTPQCKLLDVK-----ACASTDN-RCLYQVSYGDGSFTVGDLASDSFSVSRGR 116
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
+ +FGCG +N+GLF G +GL+GLG LS SQ S FSYCL S + + S
Sbjct: 117 TSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSS---RKFSYCLVSRDNGVRASSA 173
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--------ASGFAKG 353
+L G+S++ S YT ++ NP+L TFY L+GISIGG L +S +G
Sbjct: 174 LLFGDSAL-PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRG 232
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G++IDSGT +TRLP Y+ ++ F P A FS+ DTC++ SA V IP V
Sbjct: 233 GVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSF 292
Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
FEG A + + + Y V D S C A + S + IIGN QQ+ RV D +S
Sbjct: 293 HFEGGASVQLPPSN--YLVPVDTSGTFCFAFSKTSL--DLSIIGNIQQQTMRVAIDLDSS 348
Query: 473 QLGFAGEDC 481
++GFA C
Sbjct: 349 RVGFAPRQC 357
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 160/435 (36%), Positives = 229/435 (52%), Gaps = 40/435 (9%)
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNIKDVSNTEI- 122
G +L L H++ SG+ L D V+YLQ R+ TE+
Sbjct: 67 GRPSLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLS-------PTTMTTEVG 119
Query: 123 -PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
+ SGI + Y + +G ++VD+GSD+ W+QC+PC CY Q DP+FDP+
Sbjct: 120 SEVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAA 179
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S S+ V C+S C L G+SG S + C Y VSYGDGSYT+G L E L G
Sbjct: 180 SASFTAVPCDSGVCRTLP--GGSSGCADSGA---CRYQVSYGDGSYTQGVLAMETLTFGD 234
Query: 240 AS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGA 297
++ V GCG N+GLF G +GL+GLG +SLV Q GG FSYCL S DAGA
Sbjct: 235 STPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGA 294
Query: 298 SGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGF--- 350
GSL+ G + ++ P+ + ++ N Q +FY + LTG+ +GG++ LQ F
Sbjct: 295 -GSLVFGRD-----DAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLT 348
Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVN 407
GG+++D+GT +TRLPP Y+AL+ F G P APG S+LDTC++LS Y V
Sbjct: 349 EDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVR 408
Query: 408 IPLVKMEF-EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P V + F A +T+ ++ V+ CLA A+ + I+GN QQ+ ++
Sbjct: 409 VPTVALYFGRDGAALTLPARNLL--VEMGGGVYCLAFAASA--SGLSILGNIQQQGIQIT 464
Query: 467 YDTKNSQLGFAGEDC 481
D+ N +GF C
Sbjct: 465 VDSANGYVGFGPSTC 479
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 166/460 (36%), Positives = 235/460 (51%), Gaps = 41/460 (8%)
Query: 42 LQWQQKSGSSSSC-----VSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDN 96
+Q S S+++C V+ SR M L +H N ++ +
Sbjct: 31 VQTSTSSPSNAACSPAAQVTSDPSRASM---PLMYRHGPCAPASAAATNRPSPAEMLRRD 87
Query: 97 LHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGS 154
+ ++ I SG + IP + G + +L Y+ T+ G + +++DTGS
Sbjct: 88 ---RARRNHILRKASGR-RITLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTGS 143
Query: 155 DLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
DL+WVQCQPC S CY Q+DPVFDPS S +Y V C S C L+ + +G +SSS
Sbjct: 144 DLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSSGA 203
Query: 213 D-CNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGR 268
C Y + YG+G T G E L L + VN+F FGCG KG+F GL+GLG
Sbjct: 204 SLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGG 263
Query: 269 SDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
+ SLVSQT+ +GG FSYCLP ST A G+ GGN++ TP+
Sbjct: 264 APESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAGFQFTPLQVVET---- 319
Query: 326 QLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
TFY++ LTGIS+GGKQL + + FA GG++IDSGT++T LP + YSAL+ F S
Sbjct: 320 ---TFYLVKLTGISVGGKQLDIEPTVFA-GGMIIDSGTIVTGLPETAYSALRTAFRSAMS 375
Query: 384 GFPSAP--GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
+P P LDTC++ + V +P V + FEG + +DV V CL
Sbjct: 376 AYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLL------DGCL 429
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A + + + +TGIIGN Q+ V+YD+ +GF C
Sbjct: 430 AFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 149/368 (40%), Positives = 196/368 (53%), Gaps = 22/368 (5%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
IP +G L TL ++ + G + +I+DTGSDL+W+QC+PC CY Q DP FDP+
Sbjct: 124 IPDHTGTNLDTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPA 183
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S SY V C + C A G+C+ ++ C Y V YGDGS T G L R+ L
Sbjct: 184 KSSSYAAVPCGTPVCAAA------GGMCNGTT---CLYGVQYGDGSSTTGVLSRDTLTFN 234
Query: 239 KAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
+S F FGCG N G FG V GL+GLGR LSL SQ + FGG+FSYCLPS
Sbjct: 235 SSSKFTGFTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNT--T 292
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGI 355
G L +G ++ P+ YT MI PQ +FY + L I+IGG L S F K G
Sbjct: 293 PGYLNIGATKPT--STVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTGT 350
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
L+DSGT++T LPP Y++L+ F G AP + LDTC++ + + IP V F
Sbjct: 351 LLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNF 410
Query: 416 EGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
A +D GI+ F DA + CLA S I+GN QQ+ VIYD + +
Sbjct: 411 SDGAVFDLDFYGIMIF-PDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQK 469
Query: 474 LGFAGEDC 481
+GF C
Sbjct: 470 IGFIPISC 477
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 150/424 (35%), Positives = 220/424 (51%), Gaps = 43/424 (10%)
Query: 93 ILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELGG-- 143
+ D+ H + R ++ + + ++ E IP G+ Q+L Y+ TI +G
Sbjct: 73 VPDHHHYTGILRRDRHRVRSIYRRLTAAETTTTTTTIPARLGLAFQSLEYVVTIGIGTPP 132
Query: 144 RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
RN TV+ DTGSDLTWVQC PC SCY QQ+P+FDPS S +Y V C++ CH
Sbjct: 133 RNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSAPECHIGGVQQT 192
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGL 256
G S C Y V YGD S T G L E L S +FGC +
Sbjct: 193 RCGATS------CEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFGCSHEYISV 246
Query: 257 FG----GVSGLMGLGRSDLSLVSQTSEIF---GGLFSYCLPSTQDAGASGSLILGGNSSV 309
F GV+GL+GLGR D S++SQT GG+FSYCLP ++G L +GG ++
Sbjct: 247 FNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPR--GSSTGYLTIGGGAAA 304
Query: 310 FKNS-TPITYTNMIPN-PQLATFYILNLTGISIGGK--QLQASGFAKGGILIDSGTVITR 365
+ + +++T +I QL + Y++NL G+S+ G + AS F+ G + IDSGTV+T
Sbjct: 305 PQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAV-IDSGTVVTH 363
Query: 366 LPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
+P + Y L+ EF + P +LDTC++++ V P V +EF G A + V
Sbjct: 364 MPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDV 423
Query: 424 DVTGIVYFVKS-DASQVCLALASLSY--EDETG--IIGNYQQKNQRVIYDTKNSQLGFAG 478
D +GI+ + + D S L LA L++ + G I+GN QQ+ V++D ++GF
Sbjct: 424 DASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGP 483
Query: 479 EDCS 482
CS
Sbjct: 484 NGCS 487
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 151/408 (37%), Positives = 221/408 (54%), Gaps = 36/408 (8%)
Query: 99 VQYLQSRI--KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGS 154
VQ L+S++ ++I+ + + S + +Y+ TI LG + +VI DTGS
Sbjct: 2 VQALRSKLAASSLITSEVPYPPSVSTDYESPVASGGGDYVTTISLGTPAKVFSVIADTGS 61
Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
DL W+QC+PC++C+NQ+DP+FDP S SY + C + C +L + CS PDC
Sbjct: 62 DLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKS-----CS----PDC 112
Query: 215 NYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
+Y YGDGS TRG L E + L K + + FGCG N+G F SGL+GLGR
Sbjct: 113 DYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRG 172
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITY--TNMIPNPQ 326
+LS VSQ ++FG FSYCL +DA + S + G SS + + Y T MI NP
Sbjct: 173 NLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPA 232
Query: 327 LATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFL 379
+ +FY + L ISI G+ L+ A F GG++ DSGT +T LP + Y +
Sbjct: 233 MESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALR 292
Query: 380 KQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
+ S FP G S LD C+++S A ++ IP + FEG A+ + V YF+ ++
Sbjct: 293 SKIS-FPKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFEG-ADYQLPVEN--YFIAAN 348
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + LA +S + GI GN Q+N RV+YD +S++G+A C S
Sbjct: 349 DAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQCDS 396
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 152/463 (32%), Positives = 230/463 (49%), Gaps = 40/463 (8%)
Query: 37 LHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQN------ 90
L++ + + K+ +Q + G L+L H++ KI +N+ +
Sbjct: 41 LNVKEAITETKASQYQELFDNQNDTLTEGKWKLKLVHRD----KITAFNKSSYDHSHNFH 96
Query: 91 -RLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMT 147
R+ D V L R+ + + V + SG+ + Y I +G R
Sbjct: 97 ARIQRDKKRVATLIRRLSPRDATSSYSVEEFGAEVVSGMNQGSGEYFIRIGVGSPPREQY 156
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
V++D+GSD+ WVQCQPC CY+Q DPVFDP+ S S+ V C+SS C +E A ++G
Sbjct: 157 VVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSVCERIENAGCHAG--- 213
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLG 267
C Y V YGDGSYT+G L E L G+ V + GCG N+G+F G +GL+GLG
Sbjct: 214 -----GCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLG 268
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNP 325
+SLV Q GG FSYCL S + ++GSL G + + P+ + +I NP
Sbjct: 269 GGSMSLVGQLGGQTGGAFSYCLVS-RGTDSAGSLEFG------RGAMPVGAAWIPLIRNP 321
Query: 326 QLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
+ +FY + L+G+ +GG ++ Q + GG+++D+GT +TR+P Y A + F
Sbjct: 322 RAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAF 381
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
+ Q P A G SI DTC+NL+ + V +P V F G +T+ + V D
Sbjct: 382 IGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVD-DVGT 440
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
C A A + IIGN QQ+ ++ +D N +GF C
Sbjct: 441 FCFAFA--ASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 143/373 (38%), Positives = 212/373 (56%), Gaps = 30/373 (8%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SG+ + Y + I +G R + +++DTGSD+TW+QC PC CY Q DP+FDP++S
Sbjct: 184 PVVSGVGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALS 243
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL---GL 237
SY V C+S C AL+ + ++ + +S C Y V+YGDGSYT G+ E L G
Sbjct: 244 SSYATVPCDSPHCRALDASACHNNAANGNS--SCVYEVAYGDGSYTVGDFATETLTLGGD 301
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
G A+V+D GCG +N+GLF G +GL+ LG LS SQ S FSYCL +D+ +
Sbjct: 302 GSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT---EFSYCL-VDRDSPS 357
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ---ASGFA--- 351
+ +L G + +S+ +T ++ +P+ TFY + L GIS+GG+ L + FA
Sbjct: 358 ASTLQFGAS-----DSSTVT-APLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDE 411
Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
GG+++DSGT +TRL S YSAL+ F++ P A G S+ DTC++L+ V +P
Sbjct: 412 QGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVP 471
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
V + FEG E+ + Y + D A CLA A+ I+GN QQ+ RV +D
Sbjct: 472 AVSLRFEGGGELKLPAKN--YLIPVDGAGTYCLAFAATG--GAVSIVGNVQQQGIRVSFD 527
Query: 469 TKNSQLGFAGEDC 481
T + +GF+ C
Sbjct: 528 TAKNTVGFSPNKC 540
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 154/420 (36%), Positives = 226/420 (53%), Gaps = 46/420 (10%)
Query: 89 QNRLILDNLHVQYLQSRIKNMISGNIKD-----VSNT--------EIPLTSGIRLQTLNY 135
+NRL D L + + SRI ++G K + NT E PL SG+ + Y
Sbjct: 22 RNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSGEY 81
Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
++ +G R + ++ DTGSD+ W+QC PC+SCY Q DP+F+PS S +++ + C SS C
Sbjct: 82 FVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLC 141
Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
L C + C Y VSYGDGS+T GE E L G +VN GCG NN
Sbjct: 142 QQLLIRG-----CRRN---QCLYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNN 193
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
+GLF G +GL+GLG+ LS SQ +++G +FSYCLP+ + G S LI GN +V N+
Sbjct: 194 QGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SVPLIF-GNQAVASNA 251
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAKGGILIDSGTVITR 365
+T ++ NP+L TFY + + GI +GG + S GG+++DSGT +TR
Sbjct: 252 ---QFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTR 308
Query: 366 LPPSIYSALKAEFLKQFSGFPS----APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
L S Y+ ++ F +G PS GFS+ DTC++LS + +P V F G A M
Sbjct: 309 LVTSAYNPMRDAFR---AGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATM 365
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ I+ V ++ CLA A S + IIGN QQ++ R+ +D+ +++G C
Sbjct: 366 ALPAQNIMVPVD-NSGTYCLAFAPNS--ENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 142/369 (38%), Positives = 203/369 (55%), Gaps = 25/369 (6%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+TSG+ + Y + +G + +++DTGSD+ W+QC PCKSCY Q D VFDP S
Sbjct: 3 VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S++++ C++ C L+ C+S+ C Y VSYGDGS+T G+L + + +
Sbjct: 63 SFRRLSCSTPQCKLLDVK-----ACASTD-NRCLYQVSYGDGSFTVGDLASDSFLVSRGR 116
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
+ +FGCG +N+GLF G +GL+GLG LS SQ S FSYCL S + + S
Sbjct: 117 TSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSS---RKFSYCLVSRDNGVRASSA 173
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--------ASGFAKG 353
+L G+S++ S YT ++ NP+L TFY L+GISIGG L +S +G
Sbjct: 174 LLFGDSAL-PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRG 232
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G++IDSGT +TRLP Y+ ++ F P A FS+ DTC++ SA V IP V
Sbjct: 233 GVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSF 292
Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
FEG A + + + Y V D S C A + S + IIGN QQ+ RV D +S
Sbjct: 293 HFEGGASVQLPPSN--YLVPVDTSGTFCFAFSKTSL--DLSIIGNIQQQTMRVAIDLDSS 348
Query: 473 QLGFAGEDC 481
++GFA C
Sbjct: 349 RVGFAPRQC 357
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 154/420 (36%), Positives = 226/420 (53%), Gaps = 46/420 (10%)
Query: 89 QNRLILDNLHVQYLQSRIKNMISGNIKD-----VSNT--------EIPLTSGIRLQTLNY 135
+NRL D L + + SRI ++G K + NT E PL SG+ + Y
Sbjct: 22 RNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSGEY 81
Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
++ +G R + ++ DTGSD+ W+QC PC+SCY Q DP+F+PS S +++ + C SS C
Sbjct: 82 FVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLC 141
Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
L C + C Y VSYGDGS+T GE E L G +VN GCG NN
Sbjct: 142 QQLLIRG-----CRRN---QCLYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNN 193
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
+GLF G +GL+GLG+ LS SQ +++G +FSYCLP+ + G S LI GN +V N+
Sbjct: 194 QGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SVPLIF-GNQAVASNA 251
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAKGGILIDSGTVITR 365
+T ++ NP+L TFY + + GI +GG + S GG+++DSGT +TR
Sbjct: 252 ---QFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTR 308
Query: 366 LPPSIYSALKAEFLKQFSGFPS----APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
L S Y+ ++ F +G PS GFS+ DTC++LS + +P V F G A M
Sbjct: 309 LVTSAYNPMRDAFR---AGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATM 365
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ I+ V ++ CLA A S + IIGN QQ++ R+ +D+ +++G C
Sbjct: 366 ALPAQNIMVPVD-NSGTYCLAFAPNS--ENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 156/439 (35%), Positives = 230/439 (52%), Gaps = 44/439 (10%)
Query: 66 AITLELKHKNYCSG-KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD---VSNTE 121
+ +LEL + G D+ +RL D+ V+ + ++++ +SG K +TE
Sbjct: 79 SFSLELHPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPMDTE 138
Query: 122 I--------PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQ 171
I P+TSG + Y + +G + T +++DTGSD+ W+QC+PC CY Q
Sbjct: 139 ILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQV 198
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALE-FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
DP+FDP+ S S+ ++ C + C L+ FA N C Y VSYGDGSYT G+
Sbjct: 199 DPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDS---------CLYQVSYGDGSYTVGDF 249
Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
E + G + SV+ GCG +N+GLF G +GL+GLG LSL TS+I FSYCL
Sbjct: 250 ATETVSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPLSL---TSQIKASSFSYCL 306
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---- 345
+D+ S +L + PI N ++ TFY + +TG+S+GG++L
Sbjct: 307 -VNRDSVDSSTLEFNSAKPSDSVTAPI-----FKNSKVDTFYYVGITGMSVGGEKLAIPP 360
Query: 346 ---QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
+ G KGGI++D GT +TRL Y+AL+ F+K PS GF++ DTC+NLS+
Sbjct: 361 SIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSS 420
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
V +P V F+G + + + + V S A CLA A + IIGN QQ+
Sbjct: 421 RTSVRVPTVAFLFDGGKSLPLPPSNYLIPVDS-AGTFCLAFAPTTA--SLSIIGNVQQQG 477
Query: 463 QRVIYDTKNSQLGFAGEDC 481
RV YD NSQ+ F+ C
Sbjct: 478 TRVTYDLANSQVSFSSRKC 496
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 147/425 (34%), Positives = 218/425 (51%), Gaps = 31/425 (7%)
Query: 69 LELKHKNYCS-GKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
+++ H++ S G D + RL D V L R+ + G+ + V + + SG
Sbjct: 74 MKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYR-VDDFGTDVISG 132
Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
+ + Y I +G R+ +++D+GSD+ WVQCQPC CY+Q DPVFDP+ S S+
Sbjct: 133 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 192
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
V C+SS C LE A ++G C Y VSYGDGSYT+G L E L G+ V
Sbjct: 193 VSCSSSVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTFGRTMVRSV 244
Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
GCG N+G+F G +GL+GLG +S V Q GG FSYCL S + +SGSL+ G
Sbjct: 245 AIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS-RGTDSSGSLVFG- 302
Query: 306 NSSVFKNSTP--ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-------GFAKGGIL 356
+ + P + ++ NP+ +FY + L G+ +GG ++ S GG++
Sbjct: 303 -----REALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVV 357
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+D+GT +TRLP Y A + FL Q + P A G +I DTC++L + V +P V F
Sbjct: 358 MDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFS 417
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
G +T+ + DA C A A + I+GN QQ+ ++ +D N +GF
Sbjct: 418 GGPILTLPARNFL-IPMDDAGTFCFAFAPST--SGLSILGNIQQEGIQISFDGANGYVGF 474
Query: 477 AGEDC 481
C
Sbjct: 475 GPNIC 479
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 151/372 (40%), Positives = 201/372 (54%), Gaps = 28/372 (7%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFD 176
IP SG L TL ++ + LG + +I DTGSDL+WVQCQPC S C+ QQDP+FD
Sbjct: 136 IPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFD 195
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
PS S +Y V C C A G+CS + C Y V YGDGS T G L R+ L
Sbjct: 196 PSKSSTYAAVHCGEPQCAA------AGGLCSEDNT-TCLYLVHYGDGSSTTGVLSRDTLA 248
Query: 237 LGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
L + ++ F FGCG N G FG V GL+GLGR +LSL SQ + FG +FSYCLPS+
Sbjct: 249 LTSSRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS- 307
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKG 353
+G L +G + ++ YT M+ PQ +FY + L I IGG L + F +G
Sbjct: 308 -TTGYLTIGATPAT--DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRG 364
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G L+DSGTV+T LP Y L+ F + AP +LD C++ + EV +P V
Sbjct: 365 GTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSF 424
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDT 469
F A +D G++ F+ D + CLA A++ D G IIGN QQ++ VIYD
Sbjct: 425 RFGDGAVFELDFFGVMIFL--DENVGCLAFAAM---DAGGLPLSIIGNTQQRSAEVIYDV 479
Query: 470 KNSQLGFAGEDC 481
++GF C
Sbjct: 480 AAEKIGFVPASC 491
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 173/505 (34%), Positives = 256/505 (50%), Gaps = 66/505 (13%)
Query: 3 TKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGS-SSSCVSHQKSR 61
++ P + + LL ++ SL + AH HL++ Q QQ++ SSS H +SR
Sbjct: 20 SRSTPHSSKTTLLDVVSSL----QNAHNAVAFTPHHLNQHQRQQEALLLSSSFGIHLRSR 75
Query: 62 IEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE 121
A + H++Y S + +RL D+ V+ LQ+R+ ++ K VSN++
Sbjct: 76 ----ASIQKPSHRDYKSLTL--------SRLARDSARVKSLQTRLDLVL----KRVSNSD 119
Query: 122 I----------------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQP 163
+ P+ SG + Y + +G V++DTGSD++W+QC P
Sbjct: 120 LHPAESNAEFEANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAP 179
Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
C CY Q DP+FDP S SY + C++ C +L+ + +G C Y VSYGDG
Sbjct: 180 CSECYQQSDPIFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCL--------YEVSYGDG 231
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
SYT GE E + LG A+V + GCG NN+GLF G +GL+GLG LS +Q +
Sbjct: 232 SYTVGEFATETVTLGTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQ---VNAT 288
Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
FSYCL +D+ A +L NS + +N + + NP+L TFY L L GIS+GG+
Sbjct: 289 SFSYCL-VNRDSDAVSTLEF--NSPLPRN---VVTAPLRRNPELDTFYYLGLKGISVGGE 342
Query: 344 QL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT 396
L + GGI+IDSGT +TRL +Y AL+ F+K G P A G S+ DT
Sbjct: 343 ALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDT 402
Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
C++LS+ + V +P V F E+ + + V S C A A + I+G
Sbjct: 403 CYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDS-VGTFCFAFAPTT--SSLSIMG 459
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
N QQ+ RV +D NS +GF+ + C
Sbjct: 460 NVQQQGTRVGFDIANSLVGFSADSC 484
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 173/507 (34%), Positives = 252/507 (49%), Gaps = 70/507 (13%)
Query: 3 TKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSG---SSSSCVSHQK 59
++ P + + LL ++ SL + AH H +K Q QQ+S SS H +
Sbjct: 20 SRTTPHSPQTTLLDVVSSL----QNAHNVVAFTHHHPNKHQRQQESSLLTSSFGIQLHSR 75
Query: 60 SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
+ I+ + H +Y S + +RL D+ V+ LQ+R+ + K VSN
Sbjct: 76 ASIQKSS------HSDYKSLTL--------SRLARDSARVKALQTRLDLFL----KRVSN 117
Query: 120 TEI----------------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQC 161
+++ P+ SG + Y + +G V++DTGSD++W+QC
Sbjct: 118 SDLHPAESKAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC 177
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
PC CY Q DP+FDP S SY + C+ C +L+ + +G C Y VSYG
Sbjct: 178 APCSECYQQSDPIFDPISSNSYSPIRCDEPQCKSLDLSECRNGTCL--------YEVSYG 229
Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
DGSYT GE E + LG A+V + GCG NN+GLF G +GL+GLG LS +Q +
Sbjct: 230 DGSYTVGEFATETVTLGSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQ---VN 286
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
FSYCL +D+ A +L NS + +N+ ++ NP+L TFY L L GIS+G
Sbjct: 287 ATSFSYCL-VNRDSDAVSTLEF--NSPLPRNA---ATAPLMRNPELDTFYYLGLKGISVG 340
Query: 342 GKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
G+ L + GGI+IDSGT +TRL +Y AL+ F+K G P A G S+
Sbjct: 341 GEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLF 400
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
DTC++LS+ + V IP V F E+ + + V S C A A + I
Sbjct: 401 DTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDS-VGTFCFAFAPTT--SSLSI 457
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IGN QQ+ RV +D NS +GF+ + C
Sbjct: 458 IGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 152/409 (37%), Positives = 222/409 (54%), Gaps = 38/409 (9%)
Query: 99 VQYLQSRI--KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGS 154
VQ L+S++ ++I+ + + S + +Y+ TI LG + +VI DTGS
Sbjct: 2 VQALRSKLAASSLITSEVPYPPSVSTDYESPVASGGGDYVTTISLGTPAKVFSVIADTGS 61
Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
DL W+QC+PC++C+NQ+DP+FDP S SY + C + C +L + CS P+C
Sbjct: 62 DLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKS-----CS----PNC 112
Query: 215 NYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
+Y YGDGS TRG L E + L K + + FGCG N+G F SGL+GLGR
Sbjct: 113 DYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRG 172
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITY--TNMIPNPQ 326
+LS VSQ ++FG FSYCL +DA + S + G SS + + Y T MI NP
Sbjct: 173 NLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPA 232
Query: 327 LATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPPSIYS-ALKAEF 378
+ +FY + L ISI G+ L+ A F GG++ DSGT +T LP + Y L+A
Sbjct: 233 MESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-- 290
Query: 379 LKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
L+ FP G S LD C+++S A + IP + FEG A+ + V YF+ +
Sbjct: 291 LRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEG-ADHQLPVEN--YFIAA 347
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + + LA +S + GI GN Q+N RV+YD +S++G+A C S
Sbjct: 348 NDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQCDS 396
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 150/373 (40%), Positives = 200/373 (53%), Gaps = 30/373 (8%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFD 176
IP SG L TL ++ + LG + +I DTGSDL+WVQCQPC S C+ QQDP+FD
Sbjct: 131 IPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFD 190
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
PS S +Y V C C A +G S C Y V YGDGS T G L R+ L
Sbjct: 191 PSKSSTYAAVHCGEPQCAA-------AGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLA 243
Query: 237 LGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
L + ++ F FGCG N G FG V GL+GLGR +LSL SQ + FG +FSYCLPS+
Sbjct: 244 LTSSRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS- 302
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKG 353
+G L +G + ++ YT M+ PQ +FY + L I IGG L F +G
Sbjct: 303 -TTGYLTIGATPAT--DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRG 359
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G L+DSGTV+T LP Y+ L+ F + AP +LD C++ + EV +P V
Sbjct: 360 GTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSF 419
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYD 468
F A +D G++ F+ D + CLA A++ +TG IIGN QQ++ VIYD
Sbjct: 420 RFGDGAVFELDFFGVMIFL--DENVGCLAFAAM----DTGGLPLSIIGNTQQRSAEVIYD 473
Query: 469 TKNSQLGFAGEDC 481
++GF C
Sbjct: 474 VAAEKIGFVPASC 486
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 157/435 (36%), Positives = 219/435 (50%), Gaps = 31/435 (7%)
Query: 65 GAITLELKHK-NYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKN---MISGNIKDV 117
G ++ L H+ CS + E++ + L D L Y++ + +G
Sbjct: 58 GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 117
Query: 118 SNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS---CYNQQD 172
S +P T G L TL Y+ ++ LG MT V++DTGSD++WVQC+PC + C+
Sbjct: 118 SKVSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG 177
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
+FDP+ S +Y C+++ C L +G + C + S C Y V YGDGS T G
Sbjct: 178 ALFDPAASSTYAAFNCSAAACAQLG-DSGEANGCDAKS--RCQYIVKYGDGSNTTGTYSS 234
Query: 233 EHLGL-GKASVNDFIFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
+ L L G V F FGC G+ GL+GLG SLVSQT+ +G FSYCL
Sbjct: 235 DVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCL 294
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL--Q 346
P+T +SG L LG +S T M+ + ++ T+Y L I++GGK+L
Sbjct: 295 PATP--ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 352
Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
S FA G L+DSGTVITRLPP+ Y+AL + F + + A ILDTCFN + +V
Sbjct: 353 PSVFAAGS-LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKV 411
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+IP V + F G A + +D GIV S CLA A + G IGN QQ+ V+
Sbjct: 412 SIPTVALVFAGGAVVDLDAHGIV-------SGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 464
Query: 467 YDTKNSQLGFAGEDC 481
YD GF C
Sbjct: 465 YDVGGGVFGFRAGAC 479
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 224 bits (571), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 131/378 (34%), Positives = 205/378 (54%), Gaps = 29/378 (7%)
Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
E P+ SG+ T Y A + +G R+M ++VDTGSD+TW+QC PC +CY Q+D +F+PS
Sbjct: 2 EAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPS 61
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--- 235
S S+K + C+SS C L+ C S+ C Y YGDGS+T GEL +++
Sbjct: 62 SSSSFKVLDCSSSLCLNLDVMG-----CLSNK---CLYQADYGDGSFTMGELVTDNVVLD 113
Query: 236 ---GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
G G+ + + GCG +N+G FG +G++GLGR LS + +FSYCLP
Sbjct: 114 DAFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDR 173
Query: 293 QDAGASGSLILGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQL------ 345
+ S ++ G++++ +T + + + NP++AT+Y + +TGIS+GG L
Sbjct: 174 ESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPAS 233
Query: 346 --QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
Q GG + DSGT ITRL Y+A++ F SA F I DTC++ +
Sbjct: 234 VFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGM 293
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
+++P V F+G+ +M + + + V S+ + C A A+ +IGN QQ++
Sbjct: 294 NSISVPTVTFHFQGDVDMRLPPSNYIVPV-SNNNIFCFAFAA---SMGPSVIGNVQQQSF 349
Query: 464 RVIYDTKNSQLGFAGEDC 481
RVIYD + Q+G + C
Sbjct: 350 RVIYDNVHKQIGLLPDQC 367
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 224 bits (571), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 149/393 (37%), Positives = 216/393 (54%), Gaps = 31/393 (7%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN----YIATIELG--GRNMTVIVDTGSDL 156
Q R+K++ + + + S T + R+ T + Y T+ LG ++ +++ DTGSDL
Sbjct: 96 QLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDL 155
Query: 157 TWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPD 213
TW QC+PC C+ Q D FDP+ S SYK + C+S C ++ E A G CSSS+
Sbjct: 156 TWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQG----CSSSN--S 209
Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
C Y V YG G YT G L E L + + V +F+ GCG N G F G +GL+GLGRS ++
Sbjct: 210 CLYGVKYGTG-YTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTAGLLGLGRSPVA 268
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
L SQTS + LFSYCLP++ + ++G L GG S TPI T+ IP Y
Sbjct: 269 LPSQTSSTYKNLFSYCLPAS--SSSTGHLSFGGGVSQAAKFTPI--TSKIPE-----LYG 319
Query: 333 LNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
L+++GIS+GG++L S F G +IDSGT +T LP + +SAL + F + + + G
Sbjct: 320 LDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKG 379
Query: 391 FSILDTCFNLS--AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
S L C++ S A + IP + + FEG E+ +D +GI + + +VCLA
Sbjct: 380 TSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGI-FIAANGLEEVCLAFKDNGN 438
Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ + I GN QQK V+YD +GFA C
Sbjct: 439 DTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 224 bits (570), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 229/444 (51%), Gaps = 46/444 (10%)
Query: 66 AITLELKHKNY-----CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI------SGNI 114
A +++L H++ + + + + +L + V+ L+ RI+ + +G+
Sbjct: 70 AWSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSY 129
Query: 115 KDVSNTEIP----LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCY 168
++V+ + SG+ + Y I +G R +++DTGSD+ W+QC+PC+ CY
Sbjct: 130 ENVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECY 189
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
+Q DP+F+PS S S+ V C+S+ C L+ + G C Y VSYGDGSYT G
Sbjct: 190 SQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGG--------GCLYEVSYGDGSYTVG 241
Query: 229 ELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
E L G S+ + GCG +N GLF G +GL+GLG LS +Q G FSYC
Sbjct: 242 SYATETLTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYC 301
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGG---K 343
L +D+ +SG+L G S PI +T ++ NP L TFY L++ IS+GG
Sbjct: 302 L-VDRDSESSGTLEFG------PESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILD 354
Query: 344 QLQASGF------AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
+ + F +GGI+IDSGT +TRL S Y AL+ F+ P A G SI DTC
Sbjct: 355 SVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTC 414
Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
++LSA Q V+IP V F A + + + S + C A A + I+GN
Sbjct: 415 YDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGT-FCFAFAPA--DSNLSIMGN 471
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
QQ+ RV +D+ NS +GFA + C
Sbjct: 472 IQQQGIRVSFDSANSLVGFAIDQC 495
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 146/373 (39%), Positives = 195/373 (52%), Gaps = 20/373 (5%)
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQD 172
+ + IP +G L+T ++ + G T + DTGSDL+W+QCQPC CY Q D
Sbjct: 93 EAPSATIPDHTGTNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHD 152
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
PVFDP+ S SY V C ++ C A G C+ ++ C Y V YGDGS T G L R
Sbjct: 153 PVFDPAKSSSYAVVPCGTTECAAA------GGECNGTT---CVYGVEYGDGSSTTGVLAR 203
Query: 233 EHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
E L +S FIFGCG N G FG V GL+GLGR LSL SQ + FGG+FSYCLPS
Sbjct: 204 ETLTFSSSSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPS 263
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASG 349
G L +G ++ P+ YT M+ P +FY + L I+IGG L S
Sbjct: 264 YNT--TPGYLSIG--ATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSE 319
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
F K G L+DSGT++T LPP Y+AL+ F G AP + LDTC++ + + IP
Sbjct: 320 FTKTGTLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIP 379
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYD 468
V F A ++ GI+ F V CLA S + ++G+ Q++ VIYD
Sbjct: 380 GVSFNFSDGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYD 439
Query: 469 TKNSQLGFAGEDC 481
++GF C
Sbjct: 440 VPAQKIGFIPASC 452
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 160/427 (37%), Positives = 217/427 (50%), Gaps = 35/427 (8%)
Query: 68 TLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIK-NMISGN--IKDVSNTEIP 123
T+ L H++ CS L D L +Y+Q+++ N SG ++ + +P
Sbjct: 54 TVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAITLP 113
Query: 124 LTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
T G L TL Y+ T+ +G MT V++DTGSD++WV C + FDP S
Sbjct: 114 TTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSL--FFDPGKSS 171
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
+Y C+S+ C LE G CS +S C Y V YGDGS T G G + L L
Sbjct: 172 TYTPFSCSSAACTRLE---GRDNGCSLNS--TCQYTVRYGDGSNTTGTYGSDTLALNSTE 226
Query: 242 -VNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
V +F FGC + G GLMGLG SLVSQT+ +G FSYCLP+T +
Sbjct: 227 KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRS- 285
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
SG L LG ++ ++ T M + + TFY + L GI++GG + S FA G
Sbjct: 286 -SGFLTLGAST----GTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGS 340
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
I+ DSGT+ITRLPP YSAL A F +P A FSILDTCF+ + V+IP V++
Sbjct: 341 IM-DSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELV 399
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F G A + +D GI+Y CLA A + + IIGN QQ+ V++D S L
Sbjct: 400 FSGGAVVDLDADGIMY-------GSCLAFAPATGGIGS-IIGNVQQRTFEVLHDVGQSVL 451
Query: 475 GFAGEDC 481
GF C
Sbjct: 452 GFRPGAC 458
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 193/340 (56%), Gaps = 26/340 (7%)
Query: 150 VDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
VDTGSDL+WVQC+PC SCY+Q+DP+FDP+ S SY V C C L ++
Sbjct: 3 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSA 62
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMG 265
+ Y VSYGDGS T G + L L +S V F FGCG GLF GV GL+G
Sbjct: 63 AQC-----GYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLG 117
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LGR SLV QT+ +GG+FSYCLP+ +L +GG S + T ++P+P
Sbjct: 118 LGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG---FSTTQLLPSP 174
Query: 326 QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
T+Y++ LTGIS+GG+QL AS FA G ++D+GTV+TRLPP+ Y+AL++ F +
Sbjct: 175 NAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDTGTVVTRLPPTAYAALRSAFRSGMA 233
Query: 384 --GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
G+P+AP ILDTC+N + Y V +P V + F A +T+ GI+ F CL
Sbjct: 234 SYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSF-------GCL 286
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A A + I+GN QQ++ V D + +GF C
Sbjct: 287 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 324
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 150/423 (35%), Positives = 207/423 (48%), Gaps = 36/423 (8%)
Query: 84 WNEQQQNRLILDN-----LHVQYLQS--------RIKNMISGNIKDVSNTE-----IPLT 125
WN+ + RLI L + YL + R + + + E IP +
Sbjct: 51 WNKSEVPRLISRTCNGRPLPLDYLWTYGPAPSPHRPRGIPISYPPTIPPAEAPAVTIPDS 110
Query: 126 SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPS 182
+G L TL ++ T+ G + T++ DTGSD++W+QC PC CY Q DP+FDP+ S +
Sbjct: 111 TGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSAT 170
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-S 241
Y V C C A G CSS+ C Y V YGDGS T G L E L L A +
Sbjct: 171 YSAVPCGHPQCAA------AGGKCSSNG--TCLYKVQYGDGSSTAGVLSHETLSLTSARA 222
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
+ F FGCG N G FG V GL+GLGR LSL SQ + FG FSYCLPS + G L
Sbjct: 223 LPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNT--SHGYL 280
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDS 359
+G + S + YT MI +FY ++L I +GG L F + G L+DS
Sbjct: 281 TIGTTTPA-SGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDS 339
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
GTV+T LPP Y+AL+ F + + AP + DTC++ + + +PLV +F +
Sbjct: 340 GTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGS 399
Query: 420 EMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
+ G++ F A CLA I+GN QQ+N +IYD ++GF
Sbjct: 400 SFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVS 459
Query: 479 EDC 481
C
Sbjct: 460 GSC 462
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 147/368 (39%), Positives = 202/368 (54%), Gaps = 26/368 (7%)
Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
SG+ + Y I +G R + +++DTGSD+ W+QC PCK CY Q DPVFDP S S+
Sbjct: 117 SGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSF 176
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
+ C S CH L+ S C++ C Y VSYGDGS+T G+ E L + V
Sbjct: 177 ASIACRSPLCHRLD-----SPGCNTQKQ-TCMYQVSYGDGSFTFGDFSTETLTFRRTRVA 230
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
GCG +N+GLF G +GL+GLGR LS SQT F FSYCL + S++
Sbjct: 231 RVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVF 290
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKGGI 355
G+S+V + + +T ++ NP+L TFY + L GIS+GG + + AS F GG+
Sbjct: 291 -GDSAVSRTA---RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGV 346
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+IDSGT +TRL Y A + F S AP FS+ DTCF+LS EV +P V + F
Sbjct: 347 IIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHF 406
Query: 416 EGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
G A++++ + Y + D S CLA A IIGN QQ+ RV+YD S++
Sbjct: 407 RG-ADVSLPASN--YLIPVDTSGNFCLAFAGT--MGGLSIIGNIQQQGFRVVYDLAGSRV 461
Query: 475 GFAGEDCS 482
GFA C+
Sbjct: 462 GFAPHGCA 469
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 148/450 (32%), Positives = 224/450 (49%), Gaps = 48/450 (10%)
Query: 55 VSHQKSRIEMGAIT-----LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI--- 106
+ HQK I A + L+L H++ K+ +N +R N +Q R+
Sbjct: 49 LQHQKLNIATEASSPAKYKLKLVHRD----KVPTFNTSHDHRTRF-NARMQRDTKRVAAL 103
Query: 107 -KNMISGN---IKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
+++ +G ++ +++ SG+ + Y I +G RN V++D+GSD+ WVQ
Sbjct: 104 RRHLAAGKPTYAEEAFGSDV--VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQ 161
Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
C+PC CY+Q DPVF+P+ S SY V C S+ C ++ A + G C Y VSY
Sbjct: 162 CEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEG--------RCRYEVSY 213
Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
GDGSYT+G L E L G+ + + GCG +N+G+F G +GL+GLG +S V Q
Sbjct: 214 GDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQ 273
Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG- 337
GG FSYCL S + +SG L G + + P+ + +I NP+ +FY + L+G
Sbjct: 274 AGGTFSYCLVS-RGIQSSGLLQFG------REAVPVGAAWVPLIHNPRAQSFYYVGLSGL 326
Query: 338 ------ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+ I + S GG+++D+GT +TRLP + Y A + F+ Q + P A G
Sbjct: 327 GVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGV 386
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
SI DTC++L + V +P V F G +T+ + V D C A A S
Sbjct: 387 SIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVD-DVGSFCFAFAPSS--SG 443
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ + D N +GF C
Sbjct: 444 LSIIGNIQQEGIEISVDGANGFVGFGPNVC 473
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 158/431 (36%), Positives = 229/431 (53%), Gaps = 33/431 (7%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQ--QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP- 123
++++L H + S D + Q +RL+ D V+ L S + N+
Sbjct: 76 LSVQLHHIDALSS---DKSSQDLFNSRLVRDAARVKSLISLAATVGGTNLTRARGPGFSS 132
Query: 124 -LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
+ SG+ + Y + +G R + +++DTGSD+ W+QC PC CY+Q DPVFDP+ S
Sbjct: 133 SVISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKS 192
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
S+ + C S C L++ CS+ C Y VSYGDGS+T GE E L
Sbjct: 193 RSFANIPCGSPLCRRLDYPG-----CSTKKQ-ICLYQVSYGDGSFTVGEFSTETLTFRGT 246
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
V + GCG +N+GLF G +GL+GLGR LS SQ F FSYCL + A + S
Sbjct: 247 RVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCL-GDRSASSRPS 305
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AK 352
I+ G+S++ + + +T ++ NP+L TFY + L GIS+GG + + AS F
Sbjct: 306 SIVFGDSAISRTT---RFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGN 362
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
GG++IDSGT +TRL + Y AL+ FL S AP FS+ DTCF+LS EV +P V
Sbjct: 363 GGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVV 422
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ F G A++ + + Y + D S C A A + IIGN QQ+ RV+YD
Sbjct: 423 LHFRG-ADVPLPASN--YLIPVDNSGSFCFAFAGTA--SGLSIIGNIQQQGFRVVYDLAT 477
Query: 472 SQLGFAGEDCS 482
S++GFA C+
Sbjct: 478 SRVGFAPRGCA 488
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 148/372 (39%), Positives = 204/372 (54%), Gaps = 19/372 (5%)
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
D S +PL G + NY+ + LG ++ ++VDTGS LTW+QC PC SC+ Q
Sbjct: 108 DESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG 167
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
PVF+P S SY V C++ C L AT N CS+S+ C Y SYGD S++ G L +
Sbjct: 168 PVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSN--VCIYQASYGDSSFSVGYLSK 225
Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
+ + G SV +F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G FSYCLP++
Sbjct: 226 DTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTS 285
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
+ + I N + +YT M + + Y + +TGI + GK L S A
Sbjct: 286 SSSSSGYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAY 339
Query: 353 GGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
+ +IDSGTVITRLP +YSAL G P A FSILDTCF A + +P
Sbjct: 340 SSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPE 398
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V M F G A + + ++ V D++ CLA A IIGN QQ+ V+YD K
Sbjct: 399 VTMAFAGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVK 453
Query: 471 NSQLGFAGEDCS 482
NS++GFA CS
Sbjct: 454 NSKIGFAAAGCS 465
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 188/338 (55%), Gaps = 19/338 (5%)
Query: 148 VIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
++VDTGS LTW+QC PC SC+ Q PVF+P S +Y V C++ C L AT N C
Sbjct: 12 MVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSAC 71
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
SSS+ C Y SYGD S++ G L ++ + G S+ +F +GCG++N+GLFG +GL+GL
Sbjct: 72 SSSN--VCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGL 129
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
R+ LSL+ Q + G F+YCLPS+ +G S N +YT M+ +
Sbjct: 130 ARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYL--------SLGSYNPGQYSYTPMVSSSL 181
Query: 327 LATFYILNLTGISIGGKQLQASGFAKGGI--LIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+ Y + L+G+++ G L S A + +IDSGTVITRLP S+YSAL G
Sbjct: 182 DDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALSKAVAAAMKG 241
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
A +SILDTCF A V+ P V M F G A + + ++ V D S CLA A
Sbjct: 242 TSRASAYSILDTCFKGQA-SRVSAPAVTMSFAGGAALKLSAQNLL--VDVDDSTTCLAFA 298
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
IIGN QQ+ V+YD K+S++GFA CS
Sbjct: 299 P---ARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 148/372 (39%), Positives = 204/372 (54%), Gaps = 19/372 (5%)
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
D S +PL G + NY+ + LG ++ ++VDTGS LTW+QC PC SC+ Q
Sbjct: 108 DESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG 167
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
PVF+P S SY V C++ C L AT N CS+S+ C Y SYGD S++ G L +
Sbjct: 168 PVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSN--VCIYQASYGDSSFSVGYLSK 225
Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
+ + G SV +F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G FSYCLP++
Sbjct: 226 DTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTS 285
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
+ + I N + +YT M + + Y + +TGI + GK L S A
Sbjct: 286 SSSSSGYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAY 339
Query: 353 GGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
+ +IDSGTVITRLP +YSAL G P A FSILDTCF A + +P
Sbjct: 340 SSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPE 398
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V M F G A + + ++ V D++ CLA A IIGN QQ+ V+YD K
Sbjct: 399 VTMAFAGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVK 453
Query: 471 NSQLGFAGEDCS 482
NS++GFA CS
Sbjct: 454 NSKIGFAAGGCS 465
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 149/408 (36%), Positives = 209/408 (51%), Gaps = 63/408 (15%)
Query: 90 NRLILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELG 142
+ L D +Y+ R+ SG + +++ +P + G + TLNY+ T LG
Sbjct: 92 DTLRADQRRAEYILRRV----SGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLG 147
Query: 143 --GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
G T+ VDTGSDL+WVQC+PC SCY+Q+DP+FDP+ S SY V C C L
Sbjct: 148 TPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGL- 206
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF 257
G Y G +V F FGCG GLF
Sbjct: 207 -------------------------GIYAASACSAAQCG----AVQGFFFGCGHAQSGLF 237
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
GV GL+GLGR SLV QT+ +GG+FSYCLP+ +L +GG S +
Sbjct: 238 NGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG---FS 294
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALK 375
T ++P+P T+Y++ LTGIS+GG+QL AS FA G ++D+GTV+TRLPP+ Y+AL+
Sbjct: 295 TTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDTGTVVTRLPPTAYAALR 353
Query: 376 AEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
+ F + G+P+AP ILDTC+N + Y V +P V + F A +T+ GI+ F
Sbjct: 354 SAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSF-- 411
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + I+GN QQ++ V D + +GF C
Sbjct: 412 -----GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 147/370 (39%), Positives = 207/370 (55%), Gaps = 27/370 (7%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+TSG+ + Y + +G + + +++DTGSD+ W+QC PC+ CY+Q DPVFDP S
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSG 195
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S+ + C S C L+ S C+S C Y V+YGDGS+T GE E L
Sbjct: 196 SFSSISCRSPLCLRLD-----SPGCNSRQ--SCLYQVAYGDGSFTFGEFSTETLTFRGTR 248
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V GCG +N+GLF G +GL+GLGR LS +QT FG FSYCL + A + S
Sbjct: 249 VPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVD-RSASSKPSS 307
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKG 353
++ G S+V + + +T +I NP+L TFY L LTGIS+GG + + AS F G
Sbjct: 308 VVFGQSAVSRTA---VFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNG 364
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G++IDSGT +TRL Y +L+ F + AP +S+ DTCF+LS EV +P V M
Sbjct: 365 GVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVM 424
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G A++++ T Y + D + V C A A IIGN QQ+ RV++D S
Sbjct: 425 HFRG-ADVSLPATN--YLIPVDTNGVFCFAFAGT--MSGLSIIGNIQQQGFRVVFDVAAS 479
Query: 473 QLGFAGEDCS 482
++GFA C+
Sbjct: 480 RIGFAARGCA 489
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 171/498 (34%), Positives = 253/498 (50%), Gaps = 58/498 (11%)
Query: 9 TILSLLLP----LMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEM 64
++ S +LP S+ +A H + L++ + Q S SSS + SR+ +
Sbjct: 19 SVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSASSSFSL-QLHSRVSV 77
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSR----IKNMISGNIKDVS-- 118
+H +Y S + RL D V+ L +R I N+ ++K +S
Sbjct: 78 RGT----EHSDYKSLTLA--------RLNRDTARVKSLITRLDLAINNISKADLKPISTM 125
Query: 119 ------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
+ E PL SG + Y + +G R + +++DTGSD+ W+QC PC CY+Q
Sbjct: 126 YTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQ 185
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
+P+F+PS S SY+ + C++ C+ALE + C +++ C Y VSYGDGSYT G+
Sbjct: 186 TEPIFEPSSSSSYEPLSCDTPQCNALEVSE-----CRNAT---CLYEVSYGDGSYTVGDF 237
Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
E L +G V + GCG +N+GLF G +GL+GLG L+L SQ + FSYCL
Sbjct: 238 ATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTS---FSYCLV 294
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--AS 348
+D+ ++ ++ G + S P ++ N QL TFY L LTGIS+GG+ LQ S
Sbjct: 295 D-RDSDSASTVDFGTSLSPDAVVAP-----LLRNHQLDTFYYLGLTGISVGGELLQIPQS 348
Query: 349 GF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
F GGI+IDSGT +TRL IY++L+ F+K A G ++ DTC+NLSA
Sbjct: 349 SFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAK 408
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
V +P V F G + + + V S CLA A + IIGN QQ+
Sbjct: 409 TTVEVPTVAFHFPGGKMLALPAKNYMIPVDS-VGTFCLAFAPTA--SSLAIIGNVQQQGT 465
Query: 464 RVIYDTKNSQLGFAGEDC 481
RV +D NS +GF+ C
Sbjct: 466 RVTFDLANSLIGFSSNKC 483
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 146/367 (39%), Positives = 202/367 (55%), Gaps = 19/367 (5%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
+PL G + NY+ + LG ++ ++VDTGS LTW+QC PC SC+ Q PVF+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S SY V C++ C L AT N CS+S+ C Y SYGD S++ G L ++ +
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLNPASCSTSN--VCIYQASYGDSSFSVGYLSKDTVSF 232
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
G SV +F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G FSYCLP++ + +
Sbjct: 233 GSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSS 292
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI-- 355
I N + +YT M + + Y + +TGI + GK L S A +
Sbjct: 293 GYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPT 346
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+IDSGTVITRLP +YSAL G P A FSILDTCF A + +P V M F
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAF 405
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G A + + ++ V D++ CLA A IIGN QQ+ V+YD KNS++G
Sbjct: 406 AGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKNSKIG 460
Query: 476 FAGEDCS 482
FA CS
Sbjct: 461 FAAGGCS 467
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 155/449 (34%), Positives = 230/449 (51%), Gaps = 47/449 (10%)
Query: 63 EMGAITLELKHKNYCSGKIVDW--NEQQQNRLILDNLHVQYLQSRIKNMISGNIK----- 115
E +I L++ H++ S E Q RL D V + +R++ G K
Sbjct: 64 EKNSIVLQVVHRDSLSSSSNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKP 123
Query: 116 ----------DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQP 163
D + + SG+ + Y + +G R +++DTGSD+ W+QC P
Sbjct: 124 LNGSSIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLP 183
Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
C CY Q DP+F+P+ S +Y+KV C + C L+ + + C Y VSYGDG
Sbjct: 184 CAKCYGQTDPLFNPAASSTYRKVPCATPLCKKLD-------ISGCRNKRYCEYQVSYGDG 236
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
S+T G+ E L + GCG +N+GLF G +GL+GLGR LS SQT F
Sbjct: 237 SFTVGDFSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSK 296
Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
FSYCL +G + SLI G +++ K++ +T ++ NP+L TFY + L GIS+GG+
Sbjct: 297 RFSYCLVDRSASGTASSLIF-GKAAIPKSA---IFTPLLSNPKLDTFYYVELVGISVGGR 352
Query: 344 QLQ---ASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
+L AS F GG++IDSGT +TRL S YS ++ F SA GFS+ D
Sbjct: 353 RLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFD 412
Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-- 453
TC++LS + V +P + F+G A +++ T + V S A+ C A A TG
Sbjct: 413 TCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSAT-FCFAFAG-----NTGGL 466
Query: 454 -IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ RV++D+ +++GF C
Sbjct: 467 SIIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 145/414 (35%), Positives = 217/414 (52%), Gaps = 41/414 (9%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVS----------NTEIPLTSGIRLQTLNYIATIELGG- 143
DNL V + RI ++G + S + + P+ SG+ L + Y I +G
Sbjct: 8 DNLRVASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTP 67
Query: 144 -RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
R M +++DTGSD+ W+QC PC +CY+Q D +FDP S +Y + C++ C L+
Sbjct: 68 PRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDI---- 123
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDFIFGCGRNNKGL 256
G C ++ C Y V YGDGS+T GE G + + G+G+ +N GCG +N+G
Sbjct: 124 -GTCQANK---CLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGY 179
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
F G +GL+GLG+ LS +Q GG FSYCL + GS ++ G ++V
Sbjct: 180 FVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGA-- 237
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPS 369
+T N ++ TFY L +TGIS+GG L Q GG++IDSGT +TRL +
Sbjct: 238 RFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNA 297
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
Y++L+ F S GFS+ DTC++LS V++P V + F+G ++ + +
Sbjct: 298 AYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASN-- 355
Query: 430 YFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
Y + D S CLA A + IIGN QQ+ RVIYD ++Q+GF C+
Sbjct: 356 YLIPVDNSNTFCLAFAGTT---GPSIIGNIQQQGFRVIYDNLHNQVGFVPSQCN 406
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 147/368 (39%), Positives = 206/368 (55%), Gaps = 27/368 (7%)
Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
SG+ + Y I +G + + +++DTGSD+ W+QC PCK+CY+Q DPVF+P S S+
Sbjct: 120 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 179
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
KVLC + C LE S C+ C Y VSYGDGSYT GE E L + V
Sbjct: 180 AKVLCRTPLCRRLE-----SPGCNQRQ--TCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 232
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
GCG +N+GLF G +GL+GLGR LS SQ F FSYCL + A + S ++
Sbjct: 233 QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCL-VDRSASSKPSSVV 291
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKGGI 355
GNS+V + + +T ++ NP+L TFY + L GIS+GG + AS F GG+
Sbjct: 292 FGNSAVSRTA---RFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGV 348
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+ID GT +TRL Y AL+ F S SAP FS+ DTC++LS V +P V + F
Sbjct: 349 IIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHF 408
Query: 416 EGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
G A++++ + Y + D S + C A A + IIGN QQ+ RV+YD +S++
Sbjct: 409 RG-ADVSLPASN--YLIPVDGSGRFCFAFAGTT--SGLSIIGNIQQQGFRVVYDLASSRV 463
Query: 475 GFAGEDCS 482
GF+ C+
Sbjct: 464 GFSPRGCA 471
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 137/344 (39%), Positives = 191/344 (55%), Gaps = 28/344 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+++DTGSD+TWVQCQPC CY Q DPVFDPS+S SY V C+S C L+ A + +
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGL 266
C Y V+YGDGSYT G+ E L LG ++ V + GCG +N+GLF G +GL+ L
Sbjct: 61 ------CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLAL 114
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
G LS SQ I FSYCL +D+ A+ +L G ++ T ++ +P+
Sbjct: 115 GGGPLSFPSQ---ISASTFSYCL-VDRDSPAASTLQFGDGAAEAGTVT----APLVRSPR 166
Query: 327 LATFYILNLTGISIGGKQLQ--ASGFA------KGGILIDSGTVITRLPPSIYSALKAEF 378
+TFY + L+GIS+GG+ L AS FA GG+++DSGT +TRL + Y+AL+ F
Sbjct: 167 TSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAF 226
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-AS 437
++ P G S+ DTC++LS V +P V + FEG + + Y + D A
Sbjct: 227 VQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKN--YLIPVDGAG 284
Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + IIGN QQ+ RV +DT +GF C
Sbjct: 285 TYCLAFAPTNA--AVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 153/430 (35%), Positives = 227/430 (52%), Gaps = 28/430 (6%)
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT---EI 122
+ITL L H + S E +RL D+ V+ + + + N+ T
Sbjct: 71 SITLNLDHIDALSSNKTP-QELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRTGGFSS 129
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
+ SG+ + Y + +G R + +++DTGSD+ W+QC PC+ CY+Q DP+FDP S
Sbjct: 130 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
+Y + C+S C L+ A N+ + C Y VSYGDGS+T G+ E L +
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKT------CLYQVSYGDGSFTVGDFSTETLTFRRN 243
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
V GCG +N+GLF G +GL+GLG+ LS QT F FSYCL + A + S
Sbjct: 244 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCL-VDRSASSKPS 302
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AK 352
++ GN++V S +T ++ NP+L TFY + L GIS+GG + + AS F
Sbjct: 303 SVVFGNAAV---SRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGN 359
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
GG++IDSGT +TRL Y A++ F AP FS+ DTCF+LS EV +P V
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVV 419
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F G A++++ T + V ++ + C A A IIGN QQ+ RV+YD +S
Sbjct: 420 LHFRG-ADVSLPATNYLIPVDTNG-KFCFAFAGT--MGGLSIIGNIQQQGFRVVYDLASS 475
Query: 473 QLGFAGEDCS 482
++GFA C+
Sbjct: 476 RVGFAPGGCA 485
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 145/367 (39%), Positives = 202/367 (55%), Gaps = 19/367 (5%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
+PL G + NY+ + LG ++ ++VDTGS LTW+QC PC SC+ Q PVF+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S SY V C++ C L AT + CS+S+ C Y SYGD S++ G L ++ +
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLSPASCSTSN--VCIYQASYGDSSFSVGYLSKDTVSF 232
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
G SV +F +GCG++N+GLFG +GL+GL R+ LSL+ Q + G FSYCLP++ + +
Sbjct: 233 GSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSS 292
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI-- 355
I N + +YT M + + Y + +TGI + GK L S A +
Sbjct: 293 GYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPT 346
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+IDSGTVITRLP +YSAL G P A FSILDTCF A + +P V M F
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAF 405
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G A + + ++ V D++ CLA A IIGN QQ+ V+YD KNS++G
Sbjct: 406 AGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKNSKIG 460
Query: 476 FAGEDCS 482
FA CS
Sbjct: 461 FAAGGCS 467
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 147/370 (39%), Positives = 207/370 (55%), Gaps = 27/370 (7%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG+ + Y I +G + + +++DTGSD+ W+QC PCK+CY+Q DPVF+P S
Sbjct: 31 VISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSG 90
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S+ KVLC + C LE S C+ C Y VSYGDGSYT GE E L +
Sbjct: 91 SFAKVLCRTPLCRRLE-----SPGCNQRQ--TCLYQVSYGDGSYTTGEFVTETLTFRRTK 143
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V GCG +N+GLF G +GL+GLGR LS SQ F FSYCL + A + S
Sbjct: 144 VEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCL-VDRSASSKPSS 202
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKG 353
++ GNS+V + + +T ++ NP+L TFY + L GIS+GG + AS F G
Sbjct: 203 VVFGNSAVSRTA---RFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNG 259
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G++ID GT +TRL Y AL+ F S SAP FS+ DTC++LS V +P V +
Sbjct: 260 GVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVL 319
Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G A++++ + Y + D S + C A A + IIGN QQ+ RV+YD +S
Sbjct: 320 HFRG-ADVSLPASN--YLIPVDGSGRFCFAFAGTT--SGLSIIGNIQQQGFRVVYDLASS 374
Query: 473 QLGFAGEDCS 482
++GF+ C+
Sbjct: 375 RVGFSPRGCA 384
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 171/498 (34%), Positives = 252/498 (50%), Gaps = 59/498 (11%)
Query: 10 ILSLLLP----LMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMG 65
+ S +LP S+ +A H + L++ + Q S SSS + SR+ +
Sbjct: 22 VFSRILPKTSVTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSRSSSFSL-QLHSRVSVR 80
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSR----IKNMISGNIKDVS--- 118
+H +Y S + RL D V+ L +R I N+ ++K V+
Sbjct: 81 GT----EHSDYKSLTLA--------RLNRDTARVKSLITRLDLAINNISKADLKPVTTMY 128
Query: 119 ------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
+ E PL SG + Y + +G R + +++DTGSD+ W+QC PC CY+Q
Sbjct: 129 TTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQ 188
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
+P+F+PS S SY+ + C++ C+ALE + C +++ C Y VSYGDGSYT G+
Sbjct: 189 TEPIFEPSSSSSYEPLSCDTPQCNALEVSE-----CRNAT---CLYEVSYGDGSYTVGDF 240
Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
E L +G V + GCG +N+GLF G +GL+GLG L+L SQ + FSYCL
Sbjct: 241 ATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTS---FSYCLV 297
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--AS 348
+D+ ++ ++ G + P ++ N QL TFY L LTGIS+GG+ LQ S
Sbjct: 298 D-RDSDSASTVEFGTSLPPDAVVAP-----LLRNHQLDTFYYLGLTGISVGGELLQIPQS 351
Query: 349 GF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
F GGI+IDSGT +TRL IY++L+ FLK S A G ++ DTC+NLSA
Sbjct: 352 SFEMDESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAK 411
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
+ +P V F G + + + V S CLA A + IIGN QQ+
Sbjct: 412 TTIEVPTVAFHFPGGKMLALPAKNYMIPVDS-VGTFCLAFAPTA--SSLAIIGNVQQQGT 468
Query: 464 RVIYDTKNSQLGFAGEDC 481
RV +D NS +GF+ C
Sbjct: 469 RVTFDLANSLIGFSSNKC 486
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 153/409 (37%), Positives = 212/409 (51%), Gaps = 29/409 (7%)
Query: 91 RLILDNLHVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
RL D+L V+ L S +N+ + + SG+ + Y + +G
Sbjct: 87 RLQRDSLRVESLTSLAAVSAGRNVTKRPPRSAGGFSGVVISGLSQGSGEYFMRLGVGTPA 146
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
NM +++DTGSD+ W+QC PCK CYNQ DPVF+P+ S ++ V C S C L+ +S
Sbjct: 147 TNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLD----DS 202
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
C S C Y VSYGDGS+T G+ E L A V+ GCG +N+GLF G +GL
Sbjct: 203 SECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVALGCGHDNEGLFVGAAGL 262
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+GLGR LS SQT + G FSYCL S+ + S I+ GN +V K + +T
Sbjct: 263 LGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTA---VFTP 319
Query: 321 MIPNPQLATFYILNLTGISIGG--------KQLQASGFAKGGILIDSGTVITRLPPSIYS 372
++ NP+L TFY L L GIS+GG Q + GG++IDSGT +TRL S Y
Sbjct: 320 LLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYV 379
Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
AL+ F + AP +S+ DTCF+LS V +P V F G E+++ + + V
Sbjct: 380 ALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTG-GEVSLPASNYLIPV 438
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ + C A A IIGN QQ+ RV YD S++GF C
Sbjct: 439 NNQG-RFCFAFAGT--MGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/383 (31%), Positives = 202/383 (52%), Gaps = 32/383 (8%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SGI ++ Y A + +G +++DTGSDL W+QC PC+ CY Q+ VFDP S
Sbjct: 74 PVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
+Y++V C+S C AL F +SG + C Y V+YGDGS + G+L + L
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRYMVAYGDGSSSTGDLATDKLAFAND 190
Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+ VN+ GCGR+N+GLF +GL+G+GR +S+ +Q + +G +F YCL
Sbjct: 191 TYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRS 250
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------- 351
S ++ G + +T ++ NP+ + Y +++ G S+GG+++ +GF+
Sbjct: 251 SYLVFGRT---PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERV--TGFSNASLALDT 305
Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDTCFNLSAYQE 405
+GG+++DSGT I+R Y+AL+ F + S+ D C++L
Sbjct: 306 ATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPA 365
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA-----LASLSYEDETGIIGNYQQ 460
+ PL+ + F G A+M + YF+ D + A L + +D +IGN QQ
Sbjct: 366 ASAPLIVLHFAGGADMALPPEN--YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423
Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
+ RV++D + ++GFA + C+S
Sbjct: 424 QGFRVVFDVEKERIGFAPKGCTS 446
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 156/433 (36%), Positives = 231/433 (53%), Gaps = 36/433 (8%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP--- 123
ITL L H + S +E +RL D+ V+ + + + I G ++V++ P
Sbjct: 72 ITLNLDHIDALSSNKTP-DELFSSRLQRDSRRVKSIAT-LAAQIPG--RNVTHAPRPGGF 127
Query: 124 ---LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+ SG+ + Y + +G R + +++DTGSD+ W+QC PC+ CY+Q DP+FDP
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 187
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +Y + C+S C L+ A N+ C Y VSYGDGS+T G+ E L
Sbjct: 188 KSKTYATIPCSSPHCRRLDSAGCNT------RRKTCLYQVSYGDGSFTVGDFSTETLTFR 241
Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
+ V GCG +N+GLF G +GL+GLG+ LS QT F FSYCL + A +
Sbjct: 242 RNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCL-VDRSASSK 300
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF----- 350
S ++ GN++V S +T ++ NP+L TFY + L GIS+GG + + AS F
Sbjct: 301 PSSVVFGNAAV---SRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357
Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
GG++IDSGT +TRL Y A++ F AP FS+ DTCF+LS EV +P
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPT 417
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
V + F G A++++ T Y + D + + C A A IIGN QQ+ RV+YD
Sbjct: 418 VVLHFRG-ADVSLPATN--YLIPVDTNGKFCFAFAGT--MGGLSIIGNIQQQGFRVVYDL 472
Query: 470 KNSQLGFAGEDCS 482
+S++GFA C+
Sbjct: 473 ASSRVGFAPGGCA 485
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/383 (31%), Positives = 201/383 (52%), Gaps = 32/383 (8%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SGI ++ Y A + +G +++DTGSDL W+QC PC+ CY Q+ VFDP S
Sbjct: 74 PVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
+Y++V C+S C AL F +SG + C Y V+YGDGS + GEL + L
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRYMVAYGDGSSSTGELATDKLAFAND 190
Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+ VN+ GCGR+N+GLF +GL+G+ R +S+ +Q + +G +F YCL
Sbjct: 191 TYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRS 250
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------- 351
S ++ G + +T ++ NP+ + Y +++ G S+GG+++ +GF+
Sbjct: 251 SYLVFGRT---PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERV--TGFSNASLALDT 305
Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDTCFNLSAYQE 405
+GG+++DSGT I+R Y+AL+ F + S+ D C++L
Sbjct: 306 ATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPA 365
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA-----LASLSYEDETGIIGNYQQ 460
+ PL+ + F G A+M + YF+ D + A L + +D +IGN QQ
Sbjct: 366 ASAPLIVLHFAGGADMALPPEN--YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423
Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
+ RV++D + ++GFA + C+S
Sbjct: 424 QGFRVVFDVEKERIGFAPKGCTS 446
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 141/423 (33%), Positives = 207/423 (48%), Gaps = 46/423 (10%)
Query: 69 LELKHKNYCS-GKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
+++ H++ S G D + RL D V L R+ + G+ + V + + SG
Sbjct: 135 MKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYR-VDDFGTDVISG 193
Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
+ + Y I +G R+ +++D+GSD+ WVQCQPC CY+Q DPVFDP+ S S+
Sbjct: 194 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 253
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
V C+SS C LE A ++G C Y VSYGDGSYT+G L E L G+ V
Sbjct: 254 VSCSSSVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTFGRTMVRSV 305
Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
GCG N+G+F G +GL+GLG +S V Q GG FSYCL S
Sbjct: 306 AIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSA------------- 352
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-------GFAKGGILID 358
+ ++ NP+ +FY + L G+ +GG ++ S GG+++D
Sbjct: 353 -----------AWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMD 401
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
+GT +TRLP Y A + FL Q + P A G +I DTC++L + V +P V F G
Sbjct: 402 TGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGG 461
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
+T+ + DA C A A + I+GN QQ+ ++ +D N +GF
Sbjct: 462 PILTLPARNFL-IPMDDAGTFCFAFAPST--SGLSILGNIQQEGIQISFDGANGYVGFGP 518
Query: 479 EDC 481
C
Sbjct: 519 NIC 521
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 138/368 (37%), Positives = 184/368 (50%), Gaps = 18/368 (4%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
IP ++G L TL ++ T+ G +N T+ +DTGSD++W+QC PC CY Q DPVFDP+
Sbjct: 148 IPDSTGTSLDTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPT 207
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +Y V C C A NSG C Y V+YGDGS T G L E L L
Sbjct: 208 KSATYSAVPCGHPQCAAAGGKCSNSGTCL--------YKVTYGDGSSTAGVLSHETLSLS 259
Query: 239 KA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
+ F FGCG+ N G FGGV GL+GLGR LSL SQ + FG FSYCLPS
Sbjct: 260 STRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDT--T 317
Query: 298 SGSLILGGNSSVFKN-STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
G L +G + N + YT MI + Y + + I IGG L F + G
Sbjct: 318 HGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDG 377
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
L DSGT++T LPP Y++L+ F + + AP + DTC++ + + + +P V +
Sbjct: 378 TLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFK 437
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F A + I+ + A CLA IIGN QQ+ VIYD +
Sbjct: 438 FSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEK 497
Query: 474 LGFAGEDC 481
+GF C
Sbjct: 498 IGFGQFTC 505
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 162/466 (34%), Positives = 232/466 (49%), Gaps = 49/466 (10%)
Query: 42 LQWQQKSGSSSSCVSHQKSRIEMGAITLELKH----KNYCSGKIVDWNEQQQNRLILDNL 97
L W + S VS + ++++ L H ++ VD + RL D+L
Sbjct: 44 LSWPESKSFSDESVSESTT-----SLSVHLSHVDALSSFSDASPVDLFKL---RLQRDSL 95
Query: 98 HVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIV 150
V+ + S +N + + SG+ + Y + +G N+ +++
Sbjct: 96 RVKSITSLAAVSTGRNATKRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVL 155
Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
DTGSD+ W+QC PCK+CYNQ D +FDP S ++ V C S C L+ +S C +
Sbjct: 156 DTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLD----DSSECVTRR 211
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSD 270
C Y VSYGDGS+T G+ E L A V+ GCG +N+GLF G +GL+GLGR
Sbjct: 212 SKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGG 271
Query: 271 LSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
LS SQT + G FSYCL S+ + S I+ GN +V K S +T ++ NP+L
Sbjct: 272 LSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTS---VFTPLLTNPKL 328
Query: 328 ATFYILNLTGISIGG--------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
TFY L L GIS+GG Q + GG++IDSGT +TRL S Y AL+ F
Sbjct: 329 DTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFR 388
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+ AP +S+ DTCF+LS V +P V F G E+++ + + V ++ +
Sbjct: 389 LGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEG-RF 446
Query: 440 CLALA----SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
C A A SLS IIGN QQ+ RV YD S++GF C
Sbjct: 447 CFAFAGTMGSLS------IIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 142/402 (35%), Positives = 214/402 (53%), Gaps = 30/402 (7%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIP-LTSGIRLQTLNYIATIELGG--RNMTVIVD 151
D ++++ RI++ + + S + ++SG+ L + Y A + +G R+ + +D
Sbjct: 4 DEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLELD 63
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
TGSD+TW+QC PC SCY+Q DP++DPS S SY++V C S+ C AL+++ C
Sbjct: 64 TGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYS-----ACQGMG- 117
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGR 268
C+Y V YGD S + G+LG E LG S + + FGCG +N GLF G +GL+G+G
Sbjct: 118 --CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGG 175
Query: 269 SDLSLVSQTSEIFGGLFSYCLPS--TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
LS SQ + G FSYCL +Q S LI G + F +T ++ NP+
Sbjct: 176 GTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFA----ARFTPLLKNPR 231
Query: 327 LATFYILNLTGISIGG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
+ TFY LTGIS+GG Q +G GG ++DSGT +TR+ P+ Y+ L+ +
Sbjct: 232 IDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYR 291
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
P APG +LDTCFN V IP + + F+ + +M + I+ V +
Sbjct: 292 AASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGT-F 350
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A S +IGN QQ+ R+ +D + S + A +C
Sbjct: 351 CLAFAPSSM--PISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 142/364 (39%), Positives = 192/364 (52%), Gaps = 52/364 (14%)
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISP 181
G + TLNY+ T LG G T+ VDTGSDL+WVQC+PC SCY+Q+DP+FDP+ S
Sbjct: 132 GYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSS 191
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
SY V C C L G Y G +
Sbjct: 192 SYAAVPCGGPVCAGL--------------------------GIYAASACSAAQCG----A 221
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V F FGCG GLF GV GL+GLGR SLV QT+ +GG+FSYCLP+ +L
Sbjct: 222 VQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTL 281
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDS 359
+GG S + T ++P+P T+Y++ LTGIS+GG+QL AS FA G ++D+
Sbjct: 282 GVGGPSGAAPG---FSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDT 337
Query: 360 GTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
GTV+TRLPP+ Y+AL++ F + G+P+AP ILDTC+N + Y V +P V + F
Sbjct: 338 GTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGS 397
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A +T+ GI+ F CLA A + I+GN QQ++ V D + +GF
Sbjct: 398 GATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFK 448
Query: 478 GEDC 481
C
Sbjct: 449 PSSC 452
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 141/361 (39%), Positives = 194/361 (53%), Gaps = 31/361 (8%)
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y I +G R +++DTGSD+ W+QC+PC+ CY+Q DP+F+PS S S+ V C+S+
Sbjct: 7 EYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSA 66
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
C L+ + G C Y VSYGDGSYT G E L G S+ + GCG
Sbjct: 67 VCSQLDANDCHGG--------GCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGH 118
Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
+N GLF G +GL+GLG LS +Q G FSYCL +D+ +SG+L G
Sbjct: 119 DNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCL-VDRDSESSGTLEFG------P 171
Query: 312 NSTPI--TYTNMIPNPQLATFYILNLTGISIGG---KQLQASGF------AKGGILIDSG 360
S PI +T ++ NP L TFY L++ IS+GG + + F +GGI+IDSG
Sbjct: 172 ESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSG 231
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T +TRL S Y AL+ F+ P A G SI DTC++LSA Q V+IP V F A
Sbjct: 232 TAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAG 291
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + + S + C A A + I+GN QQ+ RV +D+ NS +GFA +
Sbjct: 292 FILPAKNCLIPMDSMGT-FCFAFAPA--DSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQ 348
Query: 481 C 481
C
Sbjct: 349 C 349
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 151/422 (35%), Positives = 214/422 (50%), Gaps = 31/422 (7%)
Query: 65 GAITLELKHK-NYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKN---MISGNIKDV 117
G ++ L H+ CS + E++ + L D L Y++ + +G
Sbjct: 31 GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 90
Query: 118 SNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS---CYNQQD 172
S +P T G L TL Y+ ++ LG +T V++DTGSD++WVQC+PC + C+
Sbjct: 91 SKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG 150
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
+FDP+ S +Y C+++ C L +G + C + S C Y V YGDGS T G
Sbjct: 151 ALFDPAASSTYAAFNCSAAACAQLG-DSGEANGCDAKS--RCQYIVKYGDGSNTTGTYSS 207
Query: 233 EHLGL-GKASVNDFIFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
+ L L G V F FGC G+ GL+GLG S VSQT+ +G F YCL
Sbjct: 208 DVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCL 267
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL--Q 346
P+T +SG L LG +S T M+ + ++ T+Y L I++GGK+L
Sbjct: 268 PATP--ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 325
Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
S FA G L+DSGTVITRLPP+ Y+AL + F + + A ILDTCFN + +V
Sbjct: 326 PSVFAAGS-LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKV 384
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+IP V + F G A + +D GIV S CLA A + G IGN QQ+ V+
Sbjct: 385 SIPTVALVFAGGAVVDLDAHGIV-------SGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 437
Query: 467 YD 468
YD
Sbjct: 438 YD 439
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 197/359 (54%), Gaps = 26/359 (7%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y I +G R + +++DTGSD+ W+QC PC+ CY Q D VFDP+ S +Y + C +
Sbjct: 117 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAP 176
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
C L+ S CS+ + C Y VSYGDGS+T G+ E L + V GCG
Sbjct: 177 LCRRLD-----SPGCSNKNKV-CQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVALGCGH 230
Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
+N+GLF G +GL+GLGR LS QT F FSYCL + A A S ++ G+S+V +
Sbjct: 231 DNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCL-VDRSASAKPSSVIFGDSAVSR 289
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGG---KQLQASGF-----AKGGILIDSGTVI 363
+ +T +I NP+L TFY L L GIS+GG + L AS F GG++IDSGT +
Sbjct: 290 TA---HFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSV 346
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
TRL Y AL+ F S AP FS+ DTCF+LS EV +P V + F G A++++
Sbjct: 347 TRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRG-ADVSL 405
Query: 424 DVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
T Y + D S C A A IIGN QQ+ R+ YD S++GFA C
Sbjct: 406 PATN--YLIPVDNSGSFCFAFAGTM--SGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 203/396 (51%), Gaps = 29/396 (7%)
Query: 104 SRIKNMISGN-------IKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
+R + M+ G + + +IPL SG + + NYI + G ++ ++DTGS
Sbjct: 86 ARYRAMVKGGWSAGKTMVNPQEDADIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGS 145
Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
++ W+ C PC C ++Q P F+PS S +Y + C S C L T S + +C
Sbjct: 146 NIAWIPCNPCSGCSSKQQP-FEPSKSSTYNYLTCASQQCQLLRVCT------KSDNSVNC 198
Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
+ YGD S L E L +G V +F+FGC +GL L+G GR+ LS V
Sbjct: 199 SLTQRYGDQSEVDEILSSETLSVGSQQVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFV 258
Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
SQT+ ++ FSYCLPS + +GSL+LG + ++ + +T ++ N + +FY +
Sbjct: 259 SQTATLYDSTFSYCLPSLFSSAFTGSLLLGKEA---LSAQGLKFTPLLSNSRYPSFYYVG 315
Query: 335 LTGISIGGK-------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
L GIS+G + L G +IDSGTVITRL Y+A++ F Q S
Sbjct: 316 LNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTM 375
Query: 388 APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA--LAS 445
A + DTC+N + +V PL+ + F+ N ++T+ + I+Y D S +CLA L
Sbjct: 376 ASPTDLFDTCYNRPS-GDVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPP 434
Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+D GNYQQ+ R+++D S+LG A E+C
Sbjct: 435 GGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 198/371 (53%), Gaps = 30/371 (8%)
Query: 119 NTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS--CYNQQDPV 174
+P G + +L Y+ + G + V++DTGSD++W+QC+PC S C+ Q+DP+
Sbjct: 63 KVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPL 122
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
+DPS S +Y V C S C L SG C+S C + +SY DG+ T G ++
Sbjct: 123 YDPSHSSTYSAVPCASDVCKKLAADAYGSG-CTSGK--QCGFAISYADGTSTVGAYSQDK 179
Query: 235 LGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
L L A V +F FGCG + G G++GLGR SL ++ +GG+FSYCLPS
Sbjct: 180 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS 235
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFA 351
G L LG KN + +T M P TF + L GI++GGK+ L+ S F+
Sbjct: 236 S--KPGFLALGAG----KNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS 289
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
GG+++DSGTVIT L + Y AL++ F K + P LDTC+NL+ Y+ V +P +
Sbjct: 290 -GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKI 347
Query: 412 KMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
+ F G A + +DV GI+ CLA A + G++GN Q+ V++DT
Sbjct: 348 ALTFTGGATINLDVPNGILV-------NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTS 400
Query: 471 NSQLGFAGEDC 481
S+ GF + C
Sbjct: 401 TSKFGFRAKAC 411
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 152/413 (36%), Positives = 214/413 (51%), Gaps = 37/413 (8%)
Query: 91 RLILDNLHVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
RL D+L V+ + S +N + + SG+ + Y + +G
Sbjct: 86 RLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPA 145
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
N+ +++DTGSD+ W+QC PCK+CYNQ D +FDP S ++ V C S C L+ +S
Sbjct: 146 TNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLD----DS 201
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
C + C Y VSYGDGS+T G+ E L A V+ GCG +N+GLF G +GL
Sbjct: 202 SECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGL 261
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+GLGR LS SQT + G FSYCL S+ + S I+ GN++V K S +T
Sbjct: 262 LGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS---VFTP 318
Query: 321 MIPNPQLATFYILNLTGISIGG--------KQLQASGFAKGGILIDSGTVITRLPPSIYS 372
++ NP+L TFY L L GIS+GG Q + GG++IDSGT +TRL Y
Sbjct: 319 LLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYV 378
Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
AL+ F + AP +S+ DTCF+LS V +P V F G E+++ + + V
Sbjct: 379 ALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPV 437
Query: 433 KSDASQVCLALA----SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
++ + C A A SLS IIGN QQ+ RV YD S++GF C
Sbjct: 438 NTEG-RFCFAFAGTMGSLS------IIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 144/370 (38%), Positives = 205/370 (55%), Gaps = 26/370 (7%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+TSG+ + Y + +G R + +++DTGSD+ W+QC PCK CY+Q DPVF+P+ S
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSR 195
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S+ + C S C L+ S CS+ C Y VSYGDGS+T GE E L
Sbjct: 196 SFANIPCGSPLCRRLD-----SPGCSTKKH-ICLYQVSYGDGSFTYGEFSTETLTFRGTR 249
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V GCG +N+GLF G +GL+GLGR LS SQ F FSYCL + A + S
Sbjct: 250 VGRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCL-VDRSASSKPSY 308
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKG 353
++ G+S++ + + +T ++ NP+L TFY + L G+S+GG + + AS F G
Sbjct: 309 MVFGDSAISRTA---RFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNG 365
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G++IDSGT +TRL Y AL+ F S AP FS+ DTCF+LS EV +P V +
Sbjct: 366 GVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVL 425
Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G A++++ + Y + D S C A A I+GN QQ+ RV+YD S
Sbjct: 426 HFRG-ADVSLPASN--YLIPVDNSGSFCFAFAGT--MSGLSIVGNIQQQGFRVVYDLAAS 480
Query: 473 QLGFAGEDCS 482
++GFA C+
Sbjct: 481 RVGFAPRGCA 490
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 155/433 (35%), Positives = 229/433 (52%), Gaps = 36/433 (8%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP--- 123
ITL L H + S E +RL D+ V+ + + + I G ++V++ P
Sbjct: 72 ITLNLDHIDALSSNKTP-QELFSSRLQRDSRRVRSIAT-LAAQIPG--RNVTHAPRPGGF 127
Query: 124 ---LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+ SG+ + Y + +G R + +++DTGSD+ W+QC PC+ CY+Q DP+FDP
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 187
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +Y + C+S C L+ A N+ C Y VSYGDGS+T G+ E L
Sbjct: 188 KSKTYATIPCSSPHCRRLDSAGCNT------RRKTCLYQVSYGDGSFTVGDFSTETLTFR 241
Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
+ V GCG +N+GLF G +GL+GLG+ LS QT F FSYCL + A +
Sbjct: 242 RNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCL-VDRSASSK 300
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF----- 350
S ++ GN++V S +T ++ NP+L TFY + L GIS+GG + + AS F
Sbjct: 301 PSSVVFGNAAV---SRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357
Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
GG++IDSGT +TRL Y A++ F AP FS+ DTCF+LS EV +P
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPT 417
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
V + F A++++ T Y + D + + C A A IIGN QQ+ RV+YD
Sbjct: 418 VVLHFR-RADVSLPATN--YLIPVDTNGKFCFAFAGT--MGGLSIIGNIQQQGFRVVYDL 472
Query: 470 KNSQLGFAGEDCS 482
+S++GFA C+
Sbjct: 473 ASSRVGFAPGGCA 485
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 198/371 (53%), Gaps = 30/371 (8%)
Query: 119 NTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS--CYNQQDPV 174
+P G + +L Y+ + G + V++DTGSD++W+QC+PC S C+ Q+DP+
Sbjct: 97 KVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPL 156
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
+DPS S +Y V C S C L SG C+S C + +SY DG+ T G ++
Sbjct: 157 YDPSHSSTYSAVPCASDVCKKLAADAYGSG-CTSGK--QCGFAISYADGTSTVGAYSQDK 213
Query: 235 LGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
L L A V +F FGCG + G G++GLGR SL ++ +GG+FSYCLPS
Sbjct: 214 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS 269
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFA 351
G L LG KN + +T M P TF + L GI++GGK+ L+ S F+
Sbjct: 270 S--KPGFLALGAG----KNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS 323
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
GG+++DSGTVIT L + Y AL++ F K + P LDTC+NL+ Y+ V +P +
Sbjct: 324 -GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKI 381
Query: 412 KMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
+ F G A + +DV GI+ CLA A + G++GN Q+ V++DT
Sbjct: 382 ALTFTGGATINLDVPNGILV-------NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTS 434
Query: 471 NSQLGFAGEDC 481
S+ GF + C
Sbjct: 435 TSKFGFRAKAC 445
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 158/510 (30%), Positives = 246/510 (48%), Gaps = 56/510 (10%)
Query: 7 PLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQ----WQQKSGSSSSCVSHQKSRI 62
PL + LL + + LFL + + + H L ++ ++ ++++
Sbjct: 10 PLLPFTFLLCVGMLLFLQSAQSRPISVPEVPAYHALDVASSLRETDTAAGGAEYKRETKP 69
Query: 63 EMGAITLELKHKNY-----CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
++E+ H++ + + + + +L + + V+ L+ +I+ ++ N V
Sbjct: 70 RRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPV 129
Query: 118 SNTEI----------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK 165
+ E + SG+ + Y I +G R +++DTGSD+ W+QC+PC+
Sbjct: 130 NRYENVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCR 189
Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
CY+Q DP+F+PS S S+ V C+S+ C L+ +SG C Y SYGDGSY
Sbjct: 190 ECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSG--------GCLYEASYGDGSY 241
Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
+ G E L G SV + GCG N GLF G +GL+GLG LS +Q G F
Sbjct: 242 STGSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTF 301
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGGK 343
SYCL +++ +SG L G S P+ +T + NP L TFY L++T IS+GG
Sbjct: 302 SYCL-VDRESDSSGPLQFG------PKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGA 354
Query: 344 QL-----------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
L + SG GG +IDSGTV+TRL S Y A++ F+ P S
Sbjct: 355 LLDSIPPEVFRIDETSG--HGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVS 412
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDE 451
I DTC++LS Q V++P V F A + + Y + D C A A +
Sbjct: 413 IFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKN--YLIPMDTVGTFCFAFAPAA--SS 468
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I+GN QQ++ RV +D+ NS +GFA + C
Sbjct: 469 VSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 143/389 (36%), Positives = 202/389 (51%), Gaps = 32/389 (8%)
Query: 112 GNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
G + S P+ SG+ + Y I +G +++DTGSD+ W+QC PC+ CY+
Sbjct: 119 GTRRTGSGVVAPVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYD 178
Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
Q VFDP S SY V C++ C L+ SG C C Y V+YGDGS T G+
Sbjct: 179 QSGQVFDPRRSRSYGAVGCSAPLCRRLD-----SGGCDLRRKA-CLYQVAYGDGSVTAGD 232
Query: 230 LGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
E L G A V GCG +N+GLF +GL+GLGR LS +Q S +G FSYC
Sbjct: 233 FATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYC 292
Query: 289 L----PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
L S A S ++ G S ++ ++T M+ NP++ TFY + L GIS+GG +
Sbjct: 293 LVDRTSSANPASHSSTVTFG--SGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGAR 350
Query: 345 LQASGFA-----------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFS 392
+ SG A +GG+++DSGT +TRL YSAL+ F +G +P GFS
Sbjct: 351 V--SGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFS 408
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
+ DTC++LS + V +P V M F G AE + + V S + C A A +
Sbjct: 409 LFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGT--DGGV 465
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ RV++D ++GF + C
Sbjct: 466 SIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 186/344 (54%), Gaps = 24/344 (6%)
Query: 145 NMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
+ TV+VDT SD+ WVQC PC C+ Q+DP++DP+ S ++ + C S C L + GN
Sbjct: 168 SQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGN 227
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGV- 260
CS ++ +C Y V+YGDG T G + L + V DF FGC +G F
Sbjct: 228 G--CSPTTD-ECKYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQN 284
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+G++ LG SL+ QT++ +G FSYC+P AG L LGG + S +YT
Sbjct: 285 AGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGF---LSLGGP---VEASLKFSYTP 338
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
+I N TFYI++L I + GKQL + FA G ++ DSG V+T+LPP +Y+AL+A F
Sbjct: 339 LIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVM-DSGAVVTQLPPQVYAALRAAF 397
Query: 379 LKQFSGF-PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
+ + P A LDTC++ + + +V +P V + F G A + ++ I+
Sbjct: 398 RSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL------- 450
Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A+ E+ G IGN QQ+ V+YD ++GF C
Sbjct: 451 DGCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 145/359 (40%), Positives = 197/359 (54%), Gaps = 26/359 (7%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y I +G R + +++DTGSD+ W+QC PC+ CY Q DPVFDP+ S +Y + C +
Sbjct: 128 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAP 187
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
C L+ S C++ + C Y VSYGDGS+T G+ E L + V GCG
Sbjct: 188 LCRRLD-----SPGCNNKNKV-CQYQVSYGDGSFTFGDFSTETLTFRRTRVTRVALGCGH 241
Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
+N+GLF G +GL+GLGR LS QT F FSYCL + A A S ++ G+S+V +
Sbjct: 242 DNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCL-VDRSASAKPSSVVFGDSAVSR 300
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGG---KQLQASGF-----AKGGILIDSGTVI 363
+ +T +I NP+L TFY L L GIS+GG + L AS F GG++IDSGT +
Sbjct: 301 TA---RFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSV 357
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
TRL Y AL+ F S A FS+ DTCF+LS EV +P V + F G A++++
Sbjct: 358 TRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRG-ADVSL 416
Query: 424 DVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
T Y + D S C A A IIGN QQ+ RV +D S++GFA C
Sbjct: 417 PATN--YLIPVDNSGSFCFAFAGT--MSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 151/439 (34%), Positives = 230/439 (52%), Gaps = 46/439 (10%)
Query: 67 ITLELKHKNYCSG-KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDV---- 117
+T+EL + K D+ +RL D+ V+ + +R+ I G ++K +
Sbjct: 63 LTMELHSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDS 122
Query: 118 ----SNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
+ + P+ SG + Y + + +G + V ++DTGSD+ W+QC PC CY+Q
Sbjct: 123 QFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQA 182
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
DP+F+P+ S SY + C++ C +L+ + C +++ C Y VSYGDGSYT G+
Sbjct: 183 DPIFEPASSTSYSPLSCDTKQCQSLDVSE-----CRNNT---CLYEVSYGDGSYTVGDFV 234
Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
E + LG ASV++ GCG NN+GLF G +GL+GLG LS SQ I FSYCL
Sbjct: 235 TETITLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQ---INASSFSYCL-- 289
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK-------- 343
D + + L NS++ ++ IT ++ N +L TFY + +TG+S+GG+
Sbjct: 290 -VDRDSDSASTLEFNSALLPHA--IT-APLLRNRELDTFYYVGMTGLSVGGELLSIPESM 345
Query: 344 -QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
++ SG GGI+IDSGT +TRL + Y+AL+ F+K P ++ DTC++LS
Sbjct: 346 FEMDESG--NGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSR 403
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
V +P V G + + T + V SD + C A A S IIGN QQ+
Sbjct: 404 KTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGT-FCFAFAPTS--SALSIIGNVQQQG 460
Query: 463 QRVIYDTKNSQLGFAGEDC 481
RV +D NS +GF C
Sbjct: 461 TRVGFDLANSLVGFEPRQC 479
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 150/431 (34%), Positives = 220/431 (51%), Gaps = 37/431 (8%)
Query: 68 TLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI---SGNIKDVSNTEIP 123
TL L H++ + S + + + R+ D V + RI + S + +V++
Sbjct: 60 TLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSD 119
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG+ + Y I +G R+ +++D+GSD+ WVQCQPCK CY Q DPVFDP+ S
Sbjct: 120 IVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSG 179
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
SY V C SS C +E NSG C S C Y V YGDGSYT+G L E L K
Sbjct: 180 SYTGVSCGSSVCDRIE----NSG-CHSGG---CRYEVMYGDGSYTKGTLALETLTFAKTV 231
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V + GCG N+G+F G +GL+G+G +S V Q S GG F YCL S + ++GSL
Sbjct: 232 VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSL 290
Query: 302 ILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFAK 352
+ G + + P+ ++ ++ NP+ +FY + L G I + +
Sbjct: 291 VFG------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGD 344
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
GG+++D+GT +TRLP + Y A + F Q + P A G SI DTC++LS + V +P V
Sbjct: 345 GGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVS 404
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--IIGNYQQKNQRVIYDTK 470
F +T+ + V D+ C A A+ TG IIGN QQ+ +V +D
Sbjct: 405 FYFTEGPVLTLPARNFLMPVD-DSGTYCFAFAA----SPTGLSIIGNIQQEGIQVSFDGA 459
Query: 471 NSQLGFAGEDC 481
N +GF C
Sbjct: 460 NGFVGFGPNVC 470
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 153/419 (36%), Positives = 217/419 (51%), Gaps = 40/419 (9%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTE------IPLTSGIRLQT 132
D+ RL D+ V+ L +R+ I+G ++K V PL SG +
Sbjct: 93 DYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVSGASQGS 152
Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y + + +G +++ ++VDTGSD+ WVQC PC CY Q DP+F+PS S SY + C +
Sbjct: 153 GEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCET 212
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGC 249
C +L+ + C + S C Y VSYGDGSYT G+ E + L G AS+N+ GC
Sbjct: 213 HQCKSLDVSE-----CRNDS---CLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGC 264
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
G +N+GLF G +GL+GLG LS SQ I FSYCL + AS L NS +
Sbjct: 265 GHDNEGLFVGAAGLLGLGGGSLSFPSQ---INASSFSYCLVNRDTDSAS---TLEFNSPI 318
Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTV 362
+S ++ N QL TFY L +TGI +GG+ L S F GGI++DSGT
Sbjct: 319 PSHSVT---APLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTA 375
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
+TRL +Y++L+ F++ PS G ++ DTC++LS+ V +P V F +
Sbjct: 376 VTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLA 435
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ + V S A C A A + IIGN QQ+ RV YD NS +GF+ C
Sbjct: 436 LPAKNYLIPVDS-AGTFCFAFAPTT--SALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 159/481 (33%), Positives = 252/481 (52%), Gaps = 51/481 (10%)
Query: 10 ILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITL 69
ILSL + M + +A G +C K L+ +K G S VS I
Sbjct: 14 ILSLAITFMCGVAEIAPGLNCRSSDKILN-------RKVGKRSHSVSFPLIHIYSECSPF 66
Query: 70 ELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR 129
++ W ++ D +++L+ S + K +N +P+ SG
Sbjct: 67 RPPNRT--------WESLMSEKIRGDANRLRFLK-----RTSRSSKQDANANVPVRSG-- 111
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ YI ++ G ++M ++DTGSD+ W+ C+ C+ C++ P+FDP+ S SYK
Sbjct: 112 --SGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFA 168
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
C+S C + SG C +S C + VSYGDG+ G L + + LG + +F F
Sbjct: 169 CDSQPCQEI------SGNCGGNS--KCQFEVSYGDGTQVDGTLASDAITLGSQYLPNFSF 220
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG 305
GC + GLMGLG LSL++Q T+E+FGG FSYCLPS+ + +SGSL+LG
Sbjct: 221 GCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSS--STSSGSLVLGK 278
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSGTV 362
++V +S+ + +T +I +P + TFY + L IS+G ++ G + GG +IDSGT
Sbjct: 279 EAAV--SSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTT 336
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
IT L PS Y+AL+ F +Q S P +DTC++LS+ V++P + + + N ++
Sbjct: 337 ITHLVPSAYTALRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLV 394
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ I+ + ++ CLA +S D IIGN QQ+N R+++D NSQ+GFA E C+
Sbjct: 395 LPKENIL--ITQESGLACLAFSS---TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
Query: 483 S 483
+
Sbjct: 450 A 450
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 150/432 (34%), Positives = 220/432 (50%), Gaps = 38/432 (8%)
Query: 68 TLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI----SGNIKDVSNTEI 122
TL L H++ + S + + + R+ D V + RI + S + +V++
Sbjct: 60 TLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGS 119
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
+ SG+ + Y I +G R+ +++D+GSD+ WVQCQPCK CY Q DPVFDP+ S
Sbjct: 120 DVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 179
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
SY V C SS C +E NSG C S C Y V YGDGSYT+G L E L K
Sbjct: 180 GSYTGVSCGSSVCDRIE----NSG-CHSGG---CRYEVMYGDGSYTKGTLALETLTFAKT 231
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
V + GCG N+G+F G +GL+G+G +S V Q S GG F YCL S + ++GS
Sbjct: 232 VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGS 290
Query: 301 LILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFA 351
L+ G + + P+ ++ ++ NP+ +FY + L G I + +
Sbjct: 291 LVFG------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETG 344
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
GG+++D+GT +TRLP Y+A + F Q + P A G SI DTC++LS + V +P V
Sbjct: 345 DGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTV 404
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--IIGNYQQKNQRVIYDT 469
F +T+ + V D+ C A A+ TG IIGN QQ+ +V +D
Sbjct: 405 SFYFTEGPVLTLPARNFLMPVD-DSGTYCFAFAA----SPTGLSIIGNIQQEGIQVSFDG 459
Query: 470 KNSQLGFAGEDC 481
N +GF C
Sbjct: 460 ANGFVGFGPNVC 471
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 138/372 (37%), Positives = 199/372 (53%), Gaps = 29/372 (7%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
++SG+ L + Y A + +G R+ + +DTGSD+TW+QC PC SCY+Q DP++DPS S
Sbjct: 1 ISSGLSLGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSS 60
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
SY++V C S+ C AL+++ C C+Y V YGD S + G+LG E LG S
Sbjct: 61 SYRRVYCGSALCQALDYS-----ACQGMG---CSYRVVYGDSSASSGDLGIESFYLGPNS 112
Query: 242 ---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS--TQDAG 296
+ + FGCG +N GLF G +GL+G+G LS SQ + G FSYCL +Q
Sbjct: 113 STAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQS 172
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG-------KQLQASG 349
S LI G + F +T ++ NP++ TFY LTGIS+GG Q +G
Sbjct: 173 RSSPLIFGRTAIPFA----ARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTG 228
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
GG ++DSGT +TR+ P Y+ L+ + P APG +LDTCFN V IP
Sbjct: 229 NGTGGAILDSGTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIP 288
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ + F+ +M + I+ V + CLA A S +IGN QQ+ R+ +D
Sbjct: 289 SLVLHFDNGVDMVLPGGNILIPVDRSGT-FCLAFAPSSM--PISVIGNVQQQTFRIGFDL 345
Query: 470 KNSQLGFAGEDC 481
+ S + A +C
Sbjct: 346 QRSLIAIAPREC 357
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 137/396 (34%), Positives = 204/396 (51%), Gaps = 31/396 (7%)
Query: 99 VQYLQSRIKNMISGNIK--DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
V+ + S I + SG+ +V + + SG+ + Y I LG R+ +++D+GS
Sbjct: 5 VKRVASLIHRLSSGSAAKYEVEDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYMVIDSGS 64
Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
D+ WVQC+PC CY+Q DP+FDP+ S S+ V C+S+ C +E A NSG C
Sbjct: 65 DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCNSG--------RC 116
Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
Y VSYGDGSYT+G L E L G+ V + GCG +N+G+F G +GL+GLG +S +
Sbjct: 117 RYEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFM 176
Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYI 332
Q S G FSYCL S + +G L G + P+ + ++ NP+ +FY
Sbjct: 177 GQLSGQTGNAFSYCLVS-RGTNTNGFLEFG------SEAMPVGAAWIPLVRNPRAPSFYY 229
Query: 333 LNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
+ L G+ +G ++ Q + GG+++D+GT +TR P Y A + F++Q
Sbjct: 230 IRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNL 289
Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
P A G SI DTC+NL + V +P V F G +T+ + V DA C A A
Sbjct: 290 PRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVD-DAGTFCFAFA- 347
Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I+GN QQ+ ++ D N +GF C
Sbjct: 348 -PSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 146/429 (34%), Positives = 214/429 (49%), Gaps = 57/429 (13%)
Query: 85 NEQQQN-------RLILDNLHVQYLQSRIKNMISG-NIKDVSNTEI----------PLTS 126
NEQ N RL D V L ++++ +S N D+ TE P++S
Sbjct: 89 NEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPEDLSTPVSS 148
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G + Y + + +G + +++DTGSD+ W+QC+PC CY Q DP+FDP+ S SY
Sbjct: 149 GTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYN 208
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
+ C++ C LE + +G C Y VSYGDGS+T GE E + G SVN
Sbjct: 209 PLTCDAQQCQDLEMSACRNG--------KCLYQVSYGDGSFTVGEYVTETVSFGAGSVNR 260
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
GCG +N+GLF G GL +S TS+I FSYCL +D+G S +L
Sbjct: 261 VAIGCGHDNEGLF---VGSAGLLGLGGGPLSLTSQIKATSFSYCL-VDRDSGKSSTLEFN 316
Query: 305 ----GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KG 353
G+S V ++ N ++ TFY + LTG+S+GG+ + FA G
Sbjct: 317 SPRPGDSVV---------APLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAG 367
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
G+++DSGT ITRL Y++++ F ++ S A G ++ DTC++LS+ Q V +P V
Sbjct: 368 GVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSF 427
Query: 414 EFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G+ + Y + D A C A A + IIGN QQ+ RV +D NS
Sbjct: 428 HFSGDRAWALPAKN--YLIPVDGAGTYCFAFAPTT--SSMSIIGNVQQQGTRVSFDLANS 483
Query: 473 QLGFAGEDC 481
+GF+ C
Sbjct: 484 LVGFSPNKC 492
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 140/348 (40%), Positives = 195/348 (56%), Gaps = 24/348 (6%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ + +++DTGSD+ W+QC+PC CY+Q D +FDPS S S+ + C S C L+ S
Sbjct: 141 KYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCRRLD-----S 195
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
CS + C Y VSYGDGS+T G+ E L +A+V GCG +N+GLF G +GL
Sbjct: 196 PGCSLKNN-LCQYQVSYGDGSFTFGDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGL 254
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
+GLGR LS +QT F FSYCL + + A A S I+ G+S+V + + +T ++
Sbjct: 255 LGLGRGGLSFPTQTGTRFNNKFSYCL-TDRTASAKPSSIVFGDSAVSRTA---RFTPLVK 310
Query: 324 NPQLATFYILNLTGISIGG---KQLQASGF-----AKGGILIDSGTVITRLPPSIYSALK 375
NP+L TFY + L GIS+GG + + AS F GG++IDSGT +TRL Y +L+
Sbjct: 311 NPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLR 370
Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
F S AP FS+ DTC++LS EV +P V + F G V + Y V D
Sbjct: 371 DAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRG---ADVSLPAANYLVPVD 427
Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S C A A IIGN QQ+ RV++D S++GFA C+
Sbjct: 428 NSGSFCFAFAGT--MSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGCA 473
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 149/424 (35%), Positives = 214/424 (50%), Gaps = 47/424 (11%)
Query: 84 WNEQQQNRLILDNLHVQYLQSRIKNMI------SGNIKDVSNTEIP----LTSGIRLQTL 133
+ + + L D V+ L+ RI+ + +G+ ++V+ + SG+ +
Sbjct: 136 YERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSG 195
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y I +G R +++DTGSD+ W+QC+PC CY+Q DP+F+PS+S S+ + CNS+
Sbjct: 196 EYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSA 255
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
C L+ + G C Y VSYGDGSYT G E L G SV + GCG
Sbjct: 256 VCSYLDAYNCHGG--------GCLYKVSYGDGSYTIGSFATEMLTFGTTSVRNVAIGCGH 307
Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
+N GLF G +GL+GLG LS SQ G FSYCL + + +SG+L G
Sbjct: 308 DNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCL-VDRFSESSGTLEFG------P 360
Query: 312 NSTPI--TYTNMIPNPQLATFYILNLTGISIGGKQL-----------QASGFAKGGILID 358
S P+ T ++ NP L TFY + L IS+GG L + SG +GG ++D
Sbjct: 361 ESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSG--RGGFIVD 418
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
SGT +TRL +Y A++ F+ P A G SI DTC++LS VN+P V F
Sbjct: 419 SGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHFSNG 478
Query: 419 AEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A + + Y + D C A A + + I+GN QQ+ RV +DT NS +GFA
Sbjct: 479 ASLILPAKN--YMIPMDFMGTFCFAFAPAT--SDLSIMGNIQQQGIRVSFDTANSLVGFA 534
Query: 478 GEDC 481
C
Sbjct: 535 LRQC 538
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 140/368 (38%), Positives = 192/368 (52%), Gaps = 43/368 (11%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ T+ LG ++ VIVDTGSDL WVQC PC+ CY Q P FDPS S S++K C +
Sbjct: 39 YLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNL 98
Query: 193 CH--ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL----GLGKASVNDFI 246
C+ AL + V C Y +YGD S T G+L E + G G SV +F
Sbjct: 99 CNVSALPLKACAANV--------CQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFA 150
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSLI 302
FGCG N G F G +GL+GLG+ LSL SQ S F FSYCL S AS GS+
Sbjct: 151 FGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIA 210
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA------KGG 354
N I YT+++ N + T+Y + L I +GG+ L S FA +GG
Sbjct: 211 AAAN---------IQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGG 261
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKM 413
+IDSGT IT L YSA+ + + F +P G + LD CFN++ ++P +
Sbjct: 262 TIIDSGTTITMLTLPAYSAVLRAY-ESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVF 320
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
+F+G A+ + + V + A+ +CLA+ IIGN QQ+N V+YD + +
Sbjct: 321 KFQG-ADFQMRGENLFVLVDTSATTLCLAMGG---SQGFSIIGNIQQQNHLVVYDLEAKK 376
Query: 474 LGFAGEDC 481
+GFA DC
Sbjct: 377 IGFATADC 384
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 153/458 (33%), Positives = 231/458 (50%), Gaps = 46/458 (10%)
Query: 48 SGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKI-VDWNEQQQNRLILDNLHVQYLQSR- 105
SG S + Q+ +T+EL + + +RL D+ V+ L +R
Sbjct: 49 SGPKMSPFNQQEKETTSSELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRL 108
Query: 106 ---IKNMISGNIKDVS--------NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
I ++ S ++K + + + P+ SG + Y + + +G +I+DT
Sbjct: 109 DLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDT 168
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
GSD+ WVQC PC CY Q DP+F+P+ S S+ + CN+ C +L+ + C + +
Sbjct: 169 GSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRSLDVSE-----CRNDT-- 221
Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
C Y VSYGDGSYT G+ E + LG A V++ GCG NN+GLF G +GL+GLG LS
Sbjct: 222 -CLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLS 280
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
SQ I FSYCL AS L NS++ N+ ++ N L TFY
Sbjct: 281 FPSQ---INATSFSYCLVDRDSESAS---TLEFNSTLPPNA---VSAPLLRNHHLDTFYY 331
Query: 333 LNLTGISIGGK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
+ LTG+S+GG+ Q+ SG GG+++DSGT ITRL +Y++L+ F+K+
Sbjct: 332 VGLTGLSVGGELVSIPESAFQIDESG--NGGVIVDSGTAITRLQTDVYNSLRDAFVKRTR 389
Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
PS G ++ DTC++LS+ V +P V F E+ + + + S+ + C A
Sbjct: 390 DLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGT-FCFAF 448
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A + IIGN QQ+ RV+YD N +GF C
Sbjct: 449 APTA--SSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 152/435 (34%), Positives = 222/435 (51%), Gaps = 45/435 (10%)
Query: 61 RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
R E + L+H + SG E+ Q + L +Q L ++ + S+
Sbjct: 36 RPEKTWFRVSLRHVD--SGGNYTKFERLQRAMKRGKLRLQRLSAKTASF-------ESSV 86
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
E P+ +G ++ + +G + I+DTGSDL W QC+PCK C++Q P+FDP
Sbjct: 87 EAPVHAG----NGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPK 142
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S S+ K+ C+S C AL ++ + G C Y SYGD S T+G L E G
Sbjct: 143 KSSSFSKLPCSSDLCAALPISSCSDG---------CEYLYSYGDYSSTQGVLATETFAFG 193
Query: 239 KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
ASV+ FGCG +N G F +GL+GLGR LSL+SQ E FSYCL S D+
Sbjct: 194 DASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLGE---PKFSYCLTSMDDSKG 250
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA---- 351
SL++G +++ KN+ IT T +I NP +FY L+L GIS+G L + S F+
Sbjct: 251 ISSLLVGSEATM-KNA--IT-TPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQND 306
Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA-YQEVNIP 409
GG++IDSGT IT L S ++ALK EF+ Q G + LD CF L V++P
Sbjct: 307 GSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVP 366
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ FEG A++ + + S +CL + S S I GN+QQ+N V++D
Sbjct: 367 QLVFHFEG-ADLKLPAENYI-IADSGLGVICLTMGSSS---GMSIFGNFQQQNIVVLHDL 421
Query: 470 KNSQLGFAGEDCSSM 484
+ + FA C+ +
Sbjct: 422 EKETISFAPAQCNQL 436
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 151/430 (35%), Positives = 217/430 (50%), Gaps = 44/430 (10%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQN----RLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
+TL+L H + S N+ + RL D L V L SR S
Sbjct: 54 LTLDLHHLDSLS-----LNKTPTDLFNLRLHRDTLRVHALNSRAAGFSSS---------- 98
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
+ SG+ + Y + +G R + +++DTGSD+ W+QC PC+ CY+Q DP+F+P S
Sbjct: 99 -VVSGLSQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKS 157
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
S+ + C+S C L+ S CS+ C Y VSYGDGS+T G+ E L
Sbjct: 158 KSFAGIPCSSPLCRRLD-----SSGCSTRRH-TCLYQVSYGDGSFTTGDFATETLTFRGN 211
Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
+ GCG +N+GLF G +GL+GLGR LS SQT F FSYCL + S
Sbjct: 212 KIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSS 271
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAK 352
++ G++++ S +T +I NP+L TFY + L GIS+GG +++
Sbjct: 272 MVF-GDAAI---SRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGN 327
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
GG++IDSGT +TRL Y+AL+ F P FS+ DTC++LS V +P V
Sbjct: 328 GGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVV 387
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F G A+M + T + V + S C A A IIGN QQ+ RV+YD S
Sbjct: 388 LHFRG-ADMALPATNYLIPVDENGS-FCFAFAGT--ISGLSIIGNIQQQGFRVVYDLAGS 443
Query: 473 QLGFAGEDCS 482
++GFA C+
Sbjct: 444 RIGFAPRGCT 453
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 149/458 (32%), Positives = 225/458 (49%), Gaps = 27/458 (5%)
Query: 34 KKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLI 93
++K + + Q S +SC S + +++ L H+N + E + ++
Sbjct: 29 ERKFTVVPTAFLQSSSEEASC-STPRGTPHANRVSVPLAHRNGPCSPVRGKGELPRAEML 87
Query: 94 L-DNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELGGRNM--TVI 149
D +Y+ R ++D ++ +P G + Y+AT+ LG + T+I
Sbjct: 88 RRDRERTEYIIRRASRSRR--LQDNNDAVSVPTQLGSSYDSQEYVATVGLGTPAVPQTLI 145
Query: 150 VDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+DTGS LTWVQC+PC S CY Q+ P+FDP+ S SY V C+S C AL G C+
Sbjct: 146 LDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDSQECRALAAGIDGDG-CT 204
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNN-KGLFGGVSGLMG 265
S C Y + YG G+ GE + L LG A V F FGCG + +G F G++G
Sbjct: 205 SDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVKRFHFGCGHHQQRGKFDMADGVLG 264
Query: 266 LGRSDLSLVSQTS-EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
LGR SL Q S GG+FS+CLP T ++G L LG +++ +T ++
Sbjct: 265 LGRLPQSLAWQASARRGGGVFSHCLPPT--GVSTGFLALGAP----HDTSAFVFTPLLTM 318
Query: 325 PQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
FY L T IS+ G+ L + G++ DSGTV++ L + Y+AL+ F +
Sbjct: 319 DDQPWFYQLMPTAISVAGQLLDIPPAVFREGVITDSGTVLSALQETAYTALRTAFRSAMA 378
Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
+P AP LDTCFN + Y V +P V + F G A + +D + V CLA
Sbjct: 379 EYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRGGATVHLDASSGVLM------DGCLAF 432
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S S ++ TG+IG+ Q+ V+YD ++GF C
Sbjct: 433 WS-SGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 200/375 (53%), Gaps = 25/375 (6%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SG+ + Y I +G + +++DTGSD+ W+QC PC+ CY+Q PVFDP S
Sbjct: 128 PVVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRS 187
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GK 239
SY V C + C L+ SG C C Y V+YGDGS T G+ E L G
Sbjct: 188 SSYGAVDCAAPLCRRLD-----SGGCDLRRRA-CLYQVAYGDGSVTAGDFATETLTFAGG 241
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
A V GCG +N+GLF +GL+GLGR LS +Q S +G FSYCL + +SG
Sbjct: 242 ARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSG 301
Query: 300 SLILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
+ +S+V +++ ++T M+ NP++ TFY + L GIS+GG ++
Sbjct: 302 AASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLD 361
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
+GG+++DSGT +TRL YSAL+ F +G +P GFS+ DTC++L + V
Sbjct: 362 PSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVV 421
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P V M F G AE + + V S + C A A + IIGN QQ+ RV+
Sbjct: 422 KVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVV 478
Query: 467 YDTKNSQLGFAGEDC 481
+D ++GFA + C
Sbjct: 479 FDGDGQRVGFAPKGC 493
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 143/426 (33%), Positives = 226/426 (53%), Gaps = 54/426 (12%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTEI-------PLTSGIRLQ 131
D+ +RL D+ VQ + +R++ +++G ++K + TEI P++SG
Sbjct: 97 DYKALVLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPL-QTEIQPQDLSTPVSSGTSQG 155
Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
+ Y + +G ++ +++DTGSD+ W+QCQPC CY Q DP+F P+ S SY + C+
Sbjct: 156 SGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCD 215
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFG 248
S C++L+ ++ +G C Y V+YGDGS+T G+ E + G +VN G
Sbjct: 216 SQQCNSLQMSSCRNG--------QCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIALG 267
Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS 308
CG +N+GLF G +GL+GLG LSL TS++ FSYCL +D+ AS +L
Sbjct: 268 CGHDNEGLFVGAAGLLGLGGGPLSL---TSQLKATSFSYCL-VNRDSAASSTLDF----- 318
Query: 309 VFKNSTPITYTNMIP---NPQLATFYILNLTGISIGGK---------QLQASGFAKGGIL 356
NS P+ + + P + ++ TFY + L+G+S+GG+ +L SG GG++
Sbjct: 319 ---NSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSG--DGGVI 373
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+D GT ITRL Y++L+ F+ S G ++ DTC++LS V +P V F+
Sbjct: 374 VDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFD 433
Query: 417 GNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G D+ Y + D A C A A + IIGN QQ+ RV +D N+++G
Sbjct: 434 GGKSW--DLPAANYLIPVDSAGTYCFAFAPTT--SSLSIIGNVQQQGTRVSFDLANNRVG 489
Query: 476 FAGEDC 481
F+ C
Sbjct: 490 FSTNKC 495
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 207 bits (528), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 152/419 (36%), Positives = 225/419 (53%), Gaps = 47/419 (11%)
Query: 77 CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYI 136
CSG Q D V ++ S+ SGN+K+ ++ + + + N++
Sbjct: 75 CSGSGHSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHN-----NNLFDEDGNFL 129
Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
+ G + +I+DTGS +TW QC+ C +C + FD S S +Y C ST
Sbjct: 130 VDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSCIPSTVE 189
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNN 253
NY ++YGD S + G G + + L + V F FGCGRNN
Sbjct: 190 N-------------------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNN 230
Query: 254 KGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
KG FG GV G++GLG+ LS VSQT+ F +FSYCLP + + GSL+ G ++
Sbjct: 231 KGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLP---EEDSIGSLLFGEKAT--SQ 285
Query: 313 STPITYTNMIPNP---QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLP 367
S+ + +T+++ P Q + +Y +NL+ IS+G ++L +S FA G +IDS TVITRLP
Sbjct: 286 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLP 345
Query: 368 PSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
YSALKA F K + +P + G ILDTC+NLS ++V +P + + F G A++ +
Sbjct: 346 QRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRL 405
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ T IV+ SDAS++CLA A S E IIGN QQ + V+YD + ++GF G CS
Sbjct: 406 NGTNIVW--GSDASRLCLAFAGTS---ELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCS 459
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 130/380 (34%), Positives = 200/380 (52%), Gaps = 28/380 (7%)
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
D + E P ++G ++ I LG + VI+DTGSDLTW+Q +PC++C+ Q DP
Sbjct: 10 DNESYEFPESAGYG----EFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADP 65
Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE 233
+FDPS S +Y K+ C+SS C L G + S+ +C Y YGDGS TRG +E
Sbjct: 66 IFDPSKSSTYNKIACSSSACADL------LGTQTCSAAANCIYAYGYGDGSVTRGYFSKE 119
Query: 234 HLGLGKASVNDFIFGCGRNNKGLFG--GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
+ + + FG N G FG G G++GLG+ +S+ SQ + G FSYCL
Sbjct: 120 TITATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVD 179
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL------ 345
AG+ S + G+++V S + YT ++PN T+Y + + GIS+GG L
Sbjct: 180 WLSAGSETSTMYFGDAAV--PSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSV 237
Query: 346 -QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
+ GG +IDSGT IT L +++AL A + Q +P+ + LD CFN
Sbjct: 238 YEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVR-YPTTTSATGLDLCFNTRGTG 296
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
P + + +G + +++ F+ + + +CLA AS + + I GN QQ+N
Sbjct: 297 SPVFPAMTIHLDG---VHLELPTANTFISLETNIICLAFAS-ALDFPIAIFGNIQQQNFD 352
Query: 465 VIYDTKNSQLGFAGEDCSSM 484
++YD N ++GFA DC+S+
Sbjct: 353 IVYDLDNMRIGFAPADCASL 372
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 148/418 (35%), Positives = 207/418 (49%), Gaps = 35/418 (8%)
Query: 85 NEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG-- 142
E ++RL D + K V+ P+ SG+ + Y I +G
Sbjct: 82 GELLKHRLQRDKRRAARISEAAGAGGGNGRKGVA---APVVSGLAQGSGEYFTKIGVGTP 138
Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
+++DTGSD+ WVQC PC+ CY Q PVFDP S SY V C ++ C L+
Sbjct: 139 ATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLD----- 193
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVS 261
SG C C Y V+YGDGS T G+ E L G A V GCG +N+GLF +
Sbjct: 194 SGGCDLRRGA-CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAA 252
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG-------SLILGGNSSVFKNST 314
GL+GLGR LS +Q S +G FSYCL +GA S + G SV +S
Sbjct: 253 GLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSA 312
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---------FAKGGILIDSGTVITR 365
++T M+ NP++ TFY + L GIS+GG ++ +GG+++DSGT +TR
Sbjct: 313 --SFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTR 370
Query: 366 LPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
L + YSAL+ F +G S GFS+ DTC++L + V +P V M F G AE +
Sbjct: 371 LARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAAL 430
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ V S + C A A + IIGN QQ+ RV++D ++GFA + C
Sbjct: 431 PPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 148/436 (33%), Positives = 224/436 (51%), Gaps = 47/436 (10%)
Query: 61 RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
R E + L+H + SG E+ Q + L +Q L ++ + +
Sbjct: 36 RPEKNGFRVSLRHVD--SGGNYTKFERLQRAVKRGRLRLQRLSAKTASF-------EPSV 86
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
E P+ +G ++ + +G + I+DTGSDL W QC+PCK C++Q P+FDP
Sbjct: 87 EAPVHAG----NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPE 142
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S S+ K+ C+S C AL ++ + G C Y SYGD S T+G L E G
Sbjct: 143 KSSSFSKLPCSSDLCVALPISSCSDG---------CEYRYSYGDHSSTQGVLATETFTFG 193
Query: 239 KASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
ASV+ FGCG +N+G + +GL+GLGR LSL+SQ + FSYCL S D+
Sbjct: 194 DASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ---LGVPKFSYCLTSIDDSKG 250
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA---- 351
+L++G ++V K++ P T +I NP +FY L+L GIS+G L + S F+
Sbjct: 251 ISTLLVGSEATV-KSAIP---TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDD 306
Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY-QEVNIP 409
GG++IDSGT IT L S ++ALK EF+ Q A G + L+ CF L V++P
Sbjct: 307 GSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVP 366
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYD 468
+ FEG + + + Y ++ A +V CL + S S I GN+QQ+N V++D
Sbjct: 367 QLVFHFEG---VDLKLPKENYIIEDSALRVICLTMGSSS---GMSIFGNFQQQNIVVLHD 420
Query: 469 TKNSQLGFAGEDCSSM 484
+ + FA C+ +
Sbjct: 421 LEKETISFAPAQCNQL 436
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 139/377 (36%), Positives = 203/377 (53%), Gaps = 31/377 (8%)
Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+ P+ SG+ L + Y + +G R M +++DTGSD+ W+QC PC SCY+Q D VFDP
Sbjct: 23 QAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPY 82
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
S +Y + CNS C L+ G C + C Y V YGDGS++ GE + + L
Sbjct: 83 KSSTYSTLGCNSRQCLNLDV-----GGCVGNK---CLYQVDYGDGSFSTGEFATDAVSLN 134
Query: 238 -----GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
G+ +N GCG +N+G F G +GL+GLG+ LS +Q + GG FSYCL
Sbjct: 135 STSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGR 194
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL------- 345
S ++ G+++V + +T N +++TFY L +TGIS+GG L
Sbjct: 195 DTDSTERSSLIFGDAAV--PPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAF 252
Query: 346 QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE 405
Q GG++IDSGT +TRL + Y++L+ F S FS+ DTC+NLS
Sbjct: 253 QLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSS 312
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQR 464
V++P V + F+G A++ + + Y V D +S CLA A + IIGN QQ+ R
Sbjct: 313 VDVPTVTLHFQGGADLKLPASN--YLVPVDNSSTFCLAFAGTT---GPSIIGNIQQQGFR 367
Query: 465 VIYDTKNSQLGFAGEDC 481
VIYD ++Q+GF C
Sbjct: 368 VIYDNLHNQVGFVPSQC 384
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 146/425 (34%), Positives = 216/425 (50%), Gaps = 47/425 (11%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDV------------SNTEIPLTS 126
D+ +RL D+ V+ L +RI I G +++ + + E P+ S
Sbjct: 83 DYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVS 142
Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G + Y + + +G V ++DTGSD++WVQC PC CY Q DP+F+P+ S S+
Sbjct: 143 GASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFT 202
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
+ C + C +L+ + +G C Y VSYGDGSYT G+ E + LG S+ +
Sbjct: 203 SLSCETEQCKSLDVSECRNGTCL--------YEVSYGDGSYTVGDFVTETVTLGSTSLGN 254
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
GCG NN+GLF G +GL+GLG LS SQ + FSYCL D + + L
Sbjct: 255 IAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCL---VDRDSDSTSTLD 308
Query: 305 GNSSVFKNSTPITYTNMI-PNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
NS + TP T + NP L TF+ L LTG+S+GG L Q S GGI+
Sbjct: 309 FNSPI----TPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+DSGT +TRL ++Y+ L+ F+K +A G ++ DTC++LS+ V +P V F
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFA 424
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
E+ + + V S+ + C A A + I+GN QQ+ RV +D NS +GF
Sbjct: 425 NGNELPLPAKNYLIPVDSEGT-FCFAFAPT--DSTLSILGNAQQQGTRVGFDLANSLVGF 481
Query: 477 AGEDC 481
+ C
Sbjct: 482 SPNKC 486
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 125/347 (36%), Positives = 185/347 (53%), Gaps = 32/347 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNS 203
TVI+D+GSD++WVQC+PC C+ Q+DP+FDP++S +Y V C S+ C L + G
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRG-- 226
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFGGV 260
CS+++ C + ++YGDGS G + L LG V F FGC ++G V
Sbjct: 227 --CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDV 282
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK-----NSTP 315
+G + LG SLV QT+ +G +FSYCLP T A + G L+LG + STP
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPT--ASSLGFLVLGVPPERAQLIPSFVSTP 340
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSAL 374
+ ++M P TFY + L I + G+ L +IDS T+I+RLPP+ Y AL
Sbjct: 341 LLSSSMAP-----TFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQAL 395
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
+A F + + +AP SILDTC++ + + + +P + + F+G A + +D GI+
Sbjct: 396 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 451
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + + G IGN QQK V+YD + F C
Sbjct: 452 ---GSCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 144/435 (33%), Positives = 218/435 (50%), Gaps = 53/435 (12%)
Query: 70 ELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIK----NMISGNIKDVSNTEI--- 122
++ HK+Y S + +RL D + L +R++ ++ ++K + TEI
Sbjct: 94 KIHHKDYKSLVL--------SRLHRDTVRFNSLTARLQLALEDISKSDLKPL-ETEIKPE 144
Query: 123 ----PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFD 176
P+TSG + Y + +G R +++DTGSD+ W+QCQPC CY Q DP+FD
Sbjct: 145 DLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFD 204
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
P+ S +Y V C S C +LE ++ SG C Y V+YGDGSYT G+ E +
Sbjct: 205 PTASSTYAPVTCQSQQCSSLEMSSCRSG--------QCLYQVNYGDGSYTFGDFATESVS 256
Query: 237 LGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
G + SV + GCG +N+GLF G +GL+GLG LSL T+++ FSYCL + A
Sbjct: 257 FGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVNRDSA 313
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQ 346
G+S V + P ++ N ++ TFY + L+G+S+GG+ +L
Sbjct: 314 GSSTLDFNSAQLGVDSVTAP-----LMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLD 368
Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
SG GGI++D GT ITRL Y+ L+ F++ ++ DTC++LS V
Sbjct: 369 ESG--NGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASV 426
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P V F + + V S A C A A + IIGN QQ+ RV
Sbjct: 427 RVPTVSFHFADGKSWNLPAANYLIPVDS-AGTYCFAFAPTT--SSLSIIGNVQQQGTRVT 483
Query: 467 YDTKNSQLGFAGEDC 481
+D N+++GF+ C
Sbjct: 484 FDLANNRMGFSPNKC 498
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 133/378 (35%), Positives = 194/378 (51%), Gaps = 23/378 (6%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG+ + + Y+ + +G R +I+DTGSDL W+QC PC C++Q+ PVFDP S
Sbjct: 139 VESGVAVGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMAST 198
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
SY+ V C + C L C SS C Y+ YGD S T G+L E + +
Sbjct: 199 SYRNVTCGDTRC-GLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 257
Query: 242 -----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
V+ + GCG N+GLF G +GL+GLGR LS SQ ++G FSYCL
Sbjct: 258 SSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCL--VDHGS 315
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAK 352
A GS I+ G+ +V + + YT P+ TFY + L GI +GG+ L G +K
Sbjct: 316 AVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSK 375
Query: 353 ----GGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVN 407
GG +IDSGT ++ P Y A++ F+ + +P F +L C+N+S + V
Sbjct: 376 EDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVE 435
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVI 466
+P + F A D YF++ D + CLA+ IIGNYQQ+N V+
Sbjct: 436 VPEFSLLFADGA--VWDFPAENYFIRLDTEGIMCLAVLGTP-RSAMSIIGNYQQQNFHVL 492
Query: 467 YDTKNSQLGFAGEDCSSM 484
YD +++LGFA C+ +
Sbjct: 493 YDLHHNRLGFAPRRCAEV 510
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 143/356 (40%), Positives = 190/356 (53%), Gaps = 37/356 (10%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
IVDTGSDL W QC+PC C+ Q PVFDPS S +Y V C+S+ C L +T C+
Sbjct: 115 AIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDLPTST-----CT 169
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGL-FGGVSGLM 264
S+S C Y +YGD S T+G L E LG K + FGCG N+G F +GL+
Sbjct: 170 SAS--KCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLV 227
Query: 265 GLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGG---NSSVFKNSTPITYT 319
GLGR LSLVSQ GL FSYCL S D L+LGG S + P+ T
Sbjct: 228 GLGRGPLSLVSQL-----GLDKFSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTT 282
Query: 320 NMIPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYS 372
++ NP +FY ++LTG+++G + L AS FA GG+++DSGT IT L Y
Sbjct: 283 PLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYR 342
Query: 373 ALKAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIV 429
ALK F+ Q + P+ G I LD CF A EV +P + + F+G A++ D+
Sbjct: 343 ALKKAFVAQMA-LPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADL--DLPAEN 399
Query: 430 YFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y V AS +CL +A IIGN+QQ+N + +YD L FA C+ +
Sbjct: 400 YMVLDSASGALCLTVAP---SRGLSIIGNFQQQNFQFVYDVAGDTLSFAPVQCNKL 452
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 146/425 (34%), Positives = 215/425 (50%), Gaps = 47/425 (11%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDV------------SNTEIPLTS 126
D+ +RL D+ V+ L +RI I G +++ + + E P+ S
Sbjct: 83 DYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVS 142
Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G + Y + + +G V ++DTGSD++WVQC PC CY Q DP F+P+ S S+
Sbjct: 143 GASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFT 202
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
+ C + C +L+ + +G C Y VSYGDGSYT G+ E + LG S+ +
Sbjct: 203 SLSCETEQCKSLDVSECRNGTCL--------YEVSYGDGSYTVGDFVTETVTLGSTSLGN 254
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
GCG NN+GLF G +GL+GLG LS SQ + FSYCL D + + L
Sbjct: 255 IAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCL---VDRDSDSTSTLD 308
Query: 305 GNSSVFKNSTPITYTNMI-PNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
NS + TP T + NP L TF+ L LTG+S+GG L Q S GGI+
Sbjct: 309 FNSPI----TPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+DSGT +TRL ++Y+ L+ F+K +A G ++ DTC++LS+ V +P V F
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFA 424
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
E+ + + V S+ + C A A + I+GN QQ+ RV +D NS +GF
Sbjct: 425 NGNELPLPAKNYLIPVDSEGT-FCFAFAPT--DSTLSILGNAQQQGTRVGFDLANSLVGF 481
Query: 477 AGEDC 481
+ C
Sbjct: 482 SPNKC 486
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 153/481 (31%), Positives = 246/481 (51%), Gaps = 51/481 (10%)
Query: 10 ILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITL 69
ILSL + M + +A G +C K L+ +K G S VS I
Sbjct: 14 ILSLAITFMCGVAEIAPGLNCRSSDKILN-------RKVGKRSHSVSFPLIHIYSECSPF 66
Query: 70 ELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR 129
++ W ++ D +++L+ S + K+ +N +P+ SG
Sbjct: 67 RPPNRT--------WESLMSEKIRGDANRLRFLK-----RTSRSSKEDANANVPVRSG-- 111
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ YI ++ G ++M ++DTGSD+ W+ C+ C+ C++ P+FDP+ S SYK
Sbjct: 112 --SGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFA 168
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
C+S C + SG C +S C + V YGDG+ G L + + LG + +F F
Sbjct: 169 CDSQPCQEI------SGNCGGNS--KCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSF 220
Query: 248 GCGRN-NKGLFGGVSGLMGLGRSDLSLV-SQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
GC + ++ + + G S L + T+E+FGG FSYCLPS+ + +SGSL+LG
Sbjct: 221 GCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSS--STSSGSLVLGK 278
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG-ILIDSGTV 362
++V +S+ + +T +I +P TFY + L IS+G ++ A+ A GG +IDSGT
Sbjct: 279 EAAV--SSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTT 336
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
IT L PS Y L+ F +Q S P +DTC++LS+ V++P + + + N ++
Sbjct: 337 ITYLVPSAYKDLRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLV 394
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ I+ +S S CLA +S D IIGN QQ+N R+++D NSQ+GFA E C+
Sbjct: 395 LPKENILITQESGLS--CLAFSS---TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
Query: 483 S 483
+
Sbjct: 450 A 450
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 147/436 (33%), Positives = 223/436 (51%), Gaps = 47/436 (10%)
Query: 61 RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
R E + L+H + SG E+ Q + L +Q L ++ + +
Sbjct: 36 RPEKNGFRVSLRHVD--SGGNYTKFERLQRAVKRGRLRLQRLSAKTASF-------EPSV 86
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
E P+ +G ++ + +G + I+DTGSDL W QC+PCK C++Q P+FDP
Sbjct: 87 EAPVHAG----NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPE 142
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S S+ K+ C+S C AL ++ + G C Y SYGD S T+G L E G
Sbjct: 143 KSSSFSKLPCSSDLCVALPISSCSDG---------CEYRYSYGDHSSTQGVLATETFTFG 193
Query: 239 KASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
ASV+ FGCG +N+G + +GL+GLGR LSL+SQ + FSYCL S D+
Sbjct: 194 DASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ---LGVPKFSYCLTSIDDSKG 250
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA---- 351
+L++G ++V K++ P T +I NP +FY L+L GIS+G L + S F+
Sbjct: 251 ISTLLVGSEATV-KSAIP---TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDD 306
Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY-QEVNIP 409
GG++IDSGT IT L + ++ALK EF+ Q A G + L+ CF L V +P
Sbjct: 307 GSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVP 366
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYD 468
+ FEG + + + Y ++ A +V CL + S S I GN+QQ+N V++D
Sbjct: 367 QLVFHFEG---VDLKLPKENYIIEDSALRVICLTMGSSS---GMSIFGNFQQQNIVVLHD 420
Query: 469 TKNSQLGFAGEDCSSM 484
+ + FA C+ +
Sbjct: 421 LEKETISFAPAQCNQL 436
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 162/517 (31%), Positives = 253/517 (48%), Gaps = 67/517 (12%)
Query: 10 ILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGA--- 66
LSLL + +SLFL A A L + + +S +R + A
Sbjct: 6 FLSLLTTVTLSLFLTATDASSRSLSTSTKTTVLDVVSSLQQTQTILSLDPTRSSLTATKP 65
Query: 67 --------------ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMIS 111
++LEL ++ + + D+ +RL D+ V + ++I+ +
Sbjct: 66 ESISDPVFFNSSSPLSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIRFAVE 125
Query: 112 G----NIKDVSNTEI---------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDL 156
G ++K V+N + P+ SG+ + Y + I +G + M +++DTGSD+
Sbjct: 126 GIDRSDLKPVNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDV 185
Query: 157 TWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNY 216
W+QC+PC CY Q DPVF+P+ S +YK + C++ C LE + C S+ C Y
Sbjct: 186 NWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLE-----TSACRSNK---CLY 237
Query: 217 FVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVS 275
VSYGDGS+T GEL + + G + +ND GCG +N+GLF G +GL+GLG LS+
Sbjct: 238 QVSYGDGSFTVGELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSI-- 295
Query: 276 QTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
T+++ FSYCL +D+G S SL +SV S T ++ N ++ TFY + L
Sbjct: 296 -TNQMKATSFSYCLVD-RDSGKSSSLDF---NSVQLGSGDAT-APLLRNQKIDTFYYVGL 349
Query: 336 TGISIGGKQ---------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
+G S+GG++ + ASG GG+++D GT +TRL Y++L+ FLK +
Sbjct: 350 SGFSVGGQKVMMPDAIFDVDASG--SGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLK 407
Query: 387 S-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQVCLALA 444
S+ DTC++ S+ V +P V F G + D+ Y + D C A A
Sbjct: 408 KGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDNGTFCFAFA 465
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S IIGN QQ+ R+ YD N +G +G C
Sbjct: 466 PTS--SSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 133/351 (37%), Positives = 186/351 (52%), Gaps = 30/351 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+++DTGSD+ W+QC PC+ CY Q VFDP S SY V C + C L+ SG C
Sbjct: 155 MVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPLCRRLD-----SGGCD 209
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
C Y V+YGDGS T G+ E L G A V GCG +N+GLF +GL+GL
Sbjct: 210 LRRSA-CLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGL 268
Query: 267 GRSDLSLVSQTSEIFGGLFSYCL----PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
GR LS +Q S +G FSYCL S A S ++ G S ++ ++T M+
Sbjct: 269 GRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFG--SGAVGSTVASSFTPMV 326
Query: 323 PNPQLATFYILNLTGISIGGKQL-----------QASGFAKGGILIDSGTVITRLPPSIY 371
NP++ TFY + L GIS+GG ++ +SG +GG+++DSGT +TRL Y
Sbjct: 327 KNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSG--RGGVIVDSGTSVTRLARPAY 384
Query: 372 SALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
SAL+ F +G +P GFS+ DTC++LS + V +P V M F G AE + +
Sbjct: 385 SALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLI 444
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
V S + C A A + IIGN QQ+ RV++D ++ F + C
Sbjct: 445 PVDSKGT-FCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 144/449 (32%), Positives = 218/449 (48%), Gaps = 45/449 (10%)
Query: 66 AITLELKHKNYCSGK------IVDWNEQQQNRLILDNLHVQ---------YLQSRIKNMI 110
++ L + H++ +G+ +D E+ R+ D +H + S + +
Sbjct: 73 SLKLHMTHRSAAAGETGKGSFFLDSAEKDAVRI--DTMHRRAALSGSAAARRDSAPRRAL 130
Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCY 168
S + + +P+ SG Y+ + LG R +I+DTGSDL W+QC PC C+
Sbjct: 131 SERVVATVESGVPVGSG------EYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCF 184
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALE-FATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
Q P+FDP+ S SY+ V C C + A C C Y+ YGD S T
Sbjct: 185 EQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTT 244
Query: 228 GELGREHLGL-----GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
G+L E + G V+ FGCG N+GLF G +GL+GLGR LS SQ ++G
Sbjct: 245 GDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYG 304
Query: 283 G-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G FSYCL + A+GS I+ G+ + YT P TFY L L I +G
Sbjct: 305 GHAFSYCL--VEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVG 362
Query: 342 GKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP---GFSILDT 396
G+ + S + GG +IDSGT ++ P Y A++ F+ + S PS P GF +L
Sbjct: 363 GEAVNISSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMS--PSYPLILGFPVLSP 420
Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGII 455
C+N+S ++V +P + + F A YF++ + + CLA+ II
Sbjct: 421 CYNVSGAEKVEVPELSLVFADGAAWEFPAEN--YFIRLEPEGIMCLAVLGTP-RSGMSII 477
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
GNYQQ+N V+YD ++++LGFA C+ +
Sbjct: 478 GNYQQQNFHVLYDLEHNRLGFAPRRCADV 506
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 149/445 (33%), Positives = 222/445 (49%), Gaps = 51/445 (11%)
Query: 69 LELKHKNYCSGKIVDWNEQQQNRLIL---DNLHVQY---LQSRIKNMISGNIK------- 115
LEL H+NY ++ + + Q Q +L L D L + + R K IS + K
Sbjct: 51 LEL-HENYPIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLR 109
Query: 116 --------DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK 165
V++ + SG + Y I +G R+ V++D+GSD+ WVQCQPC
Sbjct: 110 LLSSGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCS 169
Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
CY Q DPVFDP+ S +Y + C+SS C L+ A N G C Y VSYGDGSY
Sbjct: 170 ECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDG--------RCRYEVSYGDGSY 221
Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
TRG L E L G+ + + GCG N+G+F G +GL+GLG +S V Q GG F
Sbjct: 222 TRGTLALETLTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAF 281
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG------ 337
SYCL S + ++G+L G + + P+ + +I NP+ +FY + L+G
Sbjct: 282 SYCLVS-RGTESTGTLEFG------RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGI 334
Query: 338 -ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT 396
+ I + + + GG+++D+GT +TRLP Y A + F+ Q + P + SI DT
Sbjct: 335 RVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDT 394
Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
C+NL+ + V +P V F G +T+ + V + + C A A+ + IIG
Sbjct: 395 CYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGT-FCFAFAASA--SGLSIIG 451
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
N QQ+ ++ D N +GF C
Sbjct: 452 NIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 194/375 (51%), Gaps = 25/375 (6%)
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SG+ + Y I +G +++DTGSD+ W+QC PC+ CY+Q +FDP S
Sbjct: 135 PVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
SY V C + C L+ SG C C Y V+YGDGS T G+ E L
Sbjct: 195 HSYGAVDCAAPLCRRLD-----SGGCDLRRKA-CLYQVAYGDGSVTAGDFATETLTFASG 248
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
A V GCG +N+GLF +GL+GLGR LS SQ S FG FSYCL S+ +
Sbjct: 249 ARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASA 308
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----- 351
S S + S S ++T M+ NP++ TFY + L GIS+GG ++ +
Sbjct: 309 TSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLD 368
Query: 352 ----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
+GG+++DSGT +TRL Y+AL+ F +G +P GFS+ DTC++LS + V
Sbjct: 369 PSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVV 428
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P V M F G AE + + V S + C A A + IIGN QQ+ RV+
Sbjct: 429 KVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVV 485
Query: 467 YDTKNSQLGFAGEDC 481
+D +LGF + C
Sbjct: 486 FDGDGQRLGFVPKGC 500
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 145/387 (37%), Positives = 209/387 (54%), Gaps = 34/387 (8%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQ 160
+R+ ++SG K VS +P G +++L Y+AT+ G + V++DTGSDLTW+Q
Sbjct: 85 HARLSYIVSG--KKVS---VPAHLGTSVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQ 139
Query: 161 CQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
C+PC S C Q+DP+FDPS S +Y V C S C L SG CS+ P C + +
Sbjct: 140 CKPCSSGQCSPQKDPLFDPSHSSTYSAVPCASGECKKLAADAYGSG-CSNGQP--CGFAI 196
Query: 219 SYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
SY DG+ T G G++ L L A V DF FGCG + L G GL+GLGR SL +Q
Sbjct: 197 SYVDGTSTVGVYGKDKLTLAPGAIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQY 256
Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
FSYCLP+ G L G +N + +T M P TF + L G
Sbjct: 257 GGGG--GFSYCLPAVNS--KPGFLAFGAG----RNPSGFVFTPMGRVPGQPTFSTVTLAG 308
Query: 338 ISIGGKQ--LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
I++GGK+ L+ S F+ GG+++DSGTV+T L ++Y AL+A F + + G LD
Sbjct: 309 ITVGGKKLDLRPSAFS-GGMIVDSGTVVTVLQSTVYRALRAAFREAMKAYRLVHG--DLD 365
Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLSYEDETGI 454
TC++L+ Y+ V +P + + F G A + +DV GI+ CLA A + G+
Sbjct: 366 TCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILV-------NGCLAFAETGKDGTAGV 418
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+GN Q+ V++DT S+ GF + C
Sbjct: 419 LGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 151/439 (34%), Positives = 215/439 (48%), Gaps = 42/439 (9%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLIL---DNLHVQYLQSRIKNMISGNIK---DVSNTE 121
+L+L H++ SG ++ L L D V YLQ R+ S + + T
Sbjct: 58 SLQLLHRDTVSGT--KHPSRRHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTI 115
Query: 122 IPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
+ SG Y+ + +G + ++ DTGSD+ WVQC PC CY Q DP+FDP+
Sbjct: 116 VSHGSG------EYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPAN 169
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-G 238
S S+ V CNS C A A S +C Y VSYGD SYT G L E L L G
Sbjct: 170 SASFSPVPCNSGVCRA---AARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDG 226
Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP--STQDAG 296
V GCG N+GLF +GL+GLG +SLV Q GG FSYCL + +
Sbjct: 227 GTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGS 286
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-------ASG 349
SGSL+LG + T + ++ NP +FY + + G+ + G++LQ
Sbjct: 287 GSGSLVLGREDAA---PTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGD 343
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNI 408
GG+++D+GT +TRLP Y+AL+ F F G P APG S+ DTC++LS Y V +
Sbjct: 344 DGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRV 403
Query: 409 PLVKMEFEGN------AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
P V + F G A +T+ ++ V D CLA A+++ I+GN QQ+
Sbjct: 404 PTVALYFGGGGQGQEAASLTLPARNLLVPVD-DGGTYCLAFAAVA--SGPSILGNIQQQG 460
Query: 463 QRVIYDTKNSQLGFAGEDC 481
+ D+ + +GF C
Sbjct: 461 IEITVDSASGYVGFGPATC 479
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 167/491 (34%), Positives = 237/491 (48%), Gaps = 56/491 (11%)
Query: 13 LLLP--LMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLE 70
LLLP +M++ L A + K L L+ + C + T+
Sbjct: 8 LLLPCIIMITYHALVARAGDEKSYKVLSASSLK------PGAVCAEPKVRDSSSSGATVP 61
Query: 71 LKHKNYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE--IPLT 125
L H++ + ++Q L D L Y+Q + + + +E +P+
Sbjct: 62 LNHRHGPCSPVPSGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGGLQQSEATVPIA 121
Query: 126 SGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
G L TL Y+ T+ +G + T+ +DTGSD++W++C+ ++DP S +Y
Sbjct: 122 LGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------SRLYDPGTSSTY 172
Query: 184 KKVLCNSSTCHALEFATGNSGV-CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
C++ C L G G CSS S C Y V YGDGS T G G + L L S
Sbjct: 173 APFSCSAPACAQL----GRRGTGCSSGS--TCVYSVKYGDGSNTTGTYGSDTLTLAGTSE 226
Query: 242 --VNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
++ F FGC G GLMGLG S VSQT+ +G FSYCLP T + +S
Sbjct: 227 PLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWN--SS 284
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGIL 356
G L LG SS + + T M+ + Q ATFY L L GIS+GGK L+ +S F+ G I
Sbjct: 285 GFLTLGAPSSSTSAAF--STTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFSAGSI- 341
Query: 357 IDSGTVITRLPPSIYSALKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVN---IPL 410
+DSGTVITRLPP+ Y AL A F + ++ P+AP +LDTCF+ + + E N +P
Sbjct: 342 VDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAP-RGLLDTCFDFTGHGEGNNFTVPS 400
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V + +G A + + GIV CLA A+ + TGIIGN QQ+ V+YD
Sbjct: 401 VALVLDGGAVVDLHPNGIV-------QDGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVG 453
Query: 471 NSQLGFAGEDC 481
S GF C
Sbjct: 454 QSVFGFRPGAC 464
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 136/392 (34%), Positives = 200/392 (51%), Gaps = 46/392 (11%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G R+ ++I+DTGSDL W+QC PC C+ Q P +DP S
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESS 240
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
S+K + C+ CH + SS PP C YF YGD S T G+ E
Sbjct: 241 SFKNIGCHDPRCH----------LVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALE 290
Query: 234 HLGLGKAS---------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
+ S V + +FGCG N+GLF G +GL+GLGR LS SQ ++G
Sbjct: 291 TFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHS 350
Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGISI 340
FSYCL D S LI G + + N + +T+++ NP + TFY + + I +
Sbjct: 351 FSYCLVDRNSDTNVSSKLIFGEDKDLL-NHPEVNFTSLVAGKENP-VDTFYYVQIKSIMV 408
Query: 341 GGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
GG+ L+ S GG ++DSGT ++ Y +K F+K+ G+P F I
Sbjct: 409 GGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPI 468
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
LD C+N+S +++ +P ++ FE A V YF+K + + VCLA+
Sbjct: 469 LDPCYNVSGVEKMELPEFRILFEDGAVWNFPVEN--YFIKLEPEEIVCLAILGTP-RSAL 525
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGNYQQ+N ++YDTK S+LG+A C+ +
Sbjct: 526 SIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 557
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 135/396 (34%), Positives = 203/396 (51%), Gaps = 31/396 (7%)
Query: 99 VQYLQSRIKNMISGNIKD--VSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
V+ + S I+ + SG+ V + + SG+ + Y I +G R+ +++D+GS
Sbjct: 5 VKRVVSLIRRVSSGSTASYGVEDFGSEVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 64
Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
D+ WVQC+PC CY+Q DP+FDP+ S S+ V C+S+ C ++ A NSG C
Sbjct: 65 DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNSG--------RC 116
Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
Y VSYGDGS T+G L E L LG+ V + GCG N+G+F G +GL+GLG +S V
Sbjct: 117 RYEVSYGDGSSTKGTLALETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFV 176
Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYI 332
Q S G FSYCL S + ++G L G + P+ + +I NP ++Y
Sbjct: 177 GQLSRERGNAFSYCLVS-RVTNSNGFLEFG------SEAMPVGAAWIPLIRNPHSPSYYY 229
Query: 333 LNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
+ L+G+ +G ++ + + GG+++D+GT +TR P Y A + F+ Q
Sbjct: 230 IGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNL 289
Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
P A G SI DTC+NL + V +P V F G +T+ + V DA C A A
Sbjct: 290 PRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVD-DAGTFCFAFA- 347
Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I+GN QQ+ ++ D N +GF C
Sbjct: 348 -PSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 132/369 (35%), Positives = 192/369 (52%), Gaps = 29/369 (7%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG+ + Y I +G RN V++D+GSD+ WVQC+PC CY+Q DPVF+P+ S
Sbjct: 125 VVSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSS 184
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S+ V C S+ C ++ A + G C Y VSYGDGSYT+G L E + G+
Sbjct: 185 SFSGVSCASTVCSHVDNAACHEG--------RCRYEVSYGDGSYTKGTLALETITFGRTL 236
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
+ + GCG +N+G+F G +GL+GLG +S V Q GG FSYCL S + +SG L
Sbjct: 237 IRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVS-RGIESSGLL 295
Query: 302 ILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFAK 352
G + + P+ + +I NP+ +FY + L+G +SI + S
Sbjct: 296 EFG------REAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGD 349
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
GG+++D+GT +TRLP Y A + F+ Q + P A G SI DTC++L + V +P V
Sbjct: 350 GGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVS 409
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G +T+ + V D C A A S IIGN QQ+ ++ D N
Sbjct: 410 FYFSGGPILTLPARNFLIPVD-DVGTFCFAFAPSS--SGLSIIGNIQQEGIQISVDGANG 466
Query: 473 QLGFAGEDC 481
+GF C
Sbjct: 467 FVGFGPNVC 475
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 146/448 (32%), Positives = 232/448 (51%), Gaps = 60/448 (13%)
Query: 67 ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTE 121
++LEL ++ + + + D+ +RL D+ V + ++I+ + G ++K V N +
Sbjct: 80 LSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNED 139
Query: 122 I---------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
P+ SG + Y + I +G ++M +++DTGSD+ W+QC+PC CY Q
Sbjct: 140 TRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQ 199
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
DPVF+P+ S +YK + C++ C LE + C S+ C Y VSYGDGS+T GEL
Sbjct: 200 SDPVFNPTSSSTYKSLTCSAPQCSLLE-----TSACRSNK---CLYQVSYGDGSFTVGEL 251
Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
+ + G + +N+ GCG +N+GLF G +GL+GLG LS+ T+++ FSYCL
Sbjct: 252 ATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSI---TNQMKATSFSYCL 308
Query: 290 PSTQDAGASGSLI-----LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
+D+G S SL LGG + ++ N ++ TFY + L+G S+GG++
Sbjct: 309 VD-RDSGKSSSLDFNSVQLGGGDAT---------APLLRNKKIDTFYYVGLSGFSVGGEK 358
Query: 345 ---------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSIL 394
+ ASG GG+++D GT +TRL Y++L+ FLK + S+
Sbjct: 359 VVLPDAIFDVDASG--SGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLF 416
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETG 453
DTC++ S+ V +P V F G + D+ Y + D S C A A S
Sbjct: 417 DTCYDFSSLSTVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDSGTFCFAFAPTS--SSLS 472
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ R+ YD + +G +G C
Sbjct: 473 IIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 136/353 (38%), Positives = 187/353 (52%), Gaps = 30/353 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+++DTGSD+ WVQC PC+ CY Q PVFDP S SY V C ++ C L+ SG C
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLD-----SGGCD 55
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
C Y V+YGDGS T G+ E L G A V GCG +N+GLF +GL+GL
Sbjct: 56 LRRGA-CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGL 114
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG-------SLILGGNSSVFKNSTPITYT 319
GR LS +Q S +G FSYCL +GA S + G SV +S ++T
Sbjct: 115 GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSA--SFT 172
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQA---------SGFAKGGILIDSGTVITRLPPSI 370
M+ NP++ TFY + L GIS+GG ++ +GG+++DSGT +TRL +
Sbjct: 173 PMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARAS 232
Query: 371 YSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
YSAL+ F +G S GFS+ DTC++L + V +P V M F G AE +
Sbjct: 233 YSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENY 292
Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ V S + C A A + IIGN QQ+ RV++D ++GFA + C
Sbjct: 293 LIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 157/467 (33%), Positives = 238/467 (50%), Gaps = 63/467 (13%)
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNR--LILDNL-----HVQYLQSRIKNMISGNIKDVS 118
++ +ELKH+ D + +NR L+L++L +Q Q R+ ++ + +
Sbjct: 82 SLKMELKHR--------DHGQPTRNRRSLLLESLKRDITRLQSFQKRVSEKLTASANPEA 133
Query: 119 NTEIP---------------------LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
E+ + SG L Y + +G R+ +I+DTGSD
Sbjct: 134 YLEMTNSSSTKSPPSPSSSWEEVDSTVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSD 193
Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPD 213
LTW+QC+PCK+C++Q PVFDPS S S+K + CN++ C + + NS S +SP
Sbjct: 194 LTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNS---SKTSPKT 250
Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKA------SVNDFIFGCGRNNKGLFGGVSGLMGLG 267
C YF YGD S T G+L E L + + + D + GCG +NKGLF G GL+GLG
Sbjct: 251 CKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLG 310
Query: 268 RSDLSLVSQ-TSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI-PN 324
+ LS SQ S G FSYCL T + S ++ G ++ ++ + +T + N
Sbjct: 311 QGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTN 370
Query: 325 PQLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAE 377
+ TFY L + GI I + L A FA GG +IDSGT +T L Y A+++
Sbjct: 371 NSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESA 430
Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
FL + S +P A F IL C+N + V P + + F+ AE+ D+ YF++ D
Sbjct: 431 FLARIS-YPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAEL--DLPQENYFIQPDPQ 487
Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ LA L D IIGN+QQ+N +YD ++++LGFA DCS++
Sbjct: 488 EAKHCLAILP-TDGMSIIGNFQQQNIHFLYDVQHARLGFANTDCSAL 533
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 146/448 (32%), Positives = 231/448 (51%), Gaps = 60/448 (13%)
Query: 67 ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTE 121
++LEL ++ + + + D+ +RL D+ V + ++I+ + G ++K V N +
Sbjct: 80 LSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNED 139
Query: 122 I---------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
P+ SG + Y + I +G + M +++DTGSD+ W+QC+PC CY Q
Sbjct: 140 TRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQ 199
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
DPVF+P+ S +YK + C++ C LE + C S+ C Y VSYGDGS+T GEL
Sbjct: 200 SDPVFNPTSSSTYKSLTCSAPQCSLLE-----TSACRSNK---CLYQVSYGDGSFTVGEL 251
Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
+ + G + +N+ GCG +N+GLF G +GL+GLG LS+ T+++ FSYCL
Sbjct: 252 ATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSI---TNQMKATSFSYCL 308
Query: 290 PSTQDAGASGSLI-----LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
+D+G S SL LGG + ++ N ++ TFY + L+G S+GG++
Sbjct: 309 VD-RDSGKSSSLDFNSVQLGGGDAT---------APLLRNKKIDTFYYVGLSGFSVGGEK 358
Query: 345 ---------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSIL 394
+ ASG GG+++D GT +TRL Y++L+ FLK + S+
Sbjct: 359 VVLPDAIFDVDASG--SGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLF 416
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETG 453
DTC++ S+ V +P V F G + D+ Y + D S C A A S
Sbjct: 417 DTCYDFSSLSTVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDSGTFCFAFAPTS--SSLS 472
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ R+ YD + +G +G C
Sbjct: 473 IIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 143/381 (37%), Positives = 208/381 (54%), Gaps = 27/381 (7%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG L Y + +G R+ +I+DTGSDLTW+QC+PCK+C++Q PVFDPS S
Sbjct: 76 VESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQST 135
Query: 182 SYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S+K + CN++ C + + NS S +SP C YF YGD S T G+L E L +
Sbjct: 136 SFKIIPCNAAACDLVVHDECRDNS---SKTSPKTCKYFYWYGDSSRTSGDLALESLSVSL 192
Query: 240 A------SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ-TSEIFGGLFSYCL-PS 291
+ + D + GCG +NKGLF G GL+GLG+ LS SQ S G FSYCL
Sbjct: 193 SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDR 252
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMI-PNPQLATFYILNLTGISIGGKQL--QAS 348
T + S ++ G ++ ++ + +T + N + TFY L + GI I + L A
Sbjct: 253 TNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAE 312
Query: 349 GFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
FA GG +IDSGT +T L Y A+++ FL + S +P A F IL C+N +
Sbjct: 313 RFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS-YPRADPFDILGICYNATGR 371
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
V P + + F+ AE+ D+ YF++ D + LA L D IIGN+QQ+N
Sbjct: 372 AAVPFPALSIVFQNGAEL--DLPQENYFIQPDPQEAKHCLAILP-TDGMSIIGNFQQQNI 428
Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
+YD ++++LGFA DCS++
Sbjct: 429 HFLYDVQHARLGFANTDCSAL 449
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 135/392 (34%), Positives = 200/392 (51%), Gaps = 46/392 (11%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G ++ ++I+DTGSDL W+QC PC C+ Q P +DP S
Sbjct: 79 LESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESS 138
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGRE 233
S++ + C+ CH + SS PP C YF YGD S T G+ E
Sbjct: 139 SFRNIGCHDPRCH----------LVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATE 188
Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
+ GK+ V + +FGCG N+GLF G SGL+GLGR LS SQ ++G
Sbjct: 189 TFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHS 248
Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI---PNPQLATFYILNLTGISI 340
FSYCL D S LI G + + N + +T ++ NP + TFY + + I +
Sbjct: 249 FSYCLVDRNSDTNVSSKLIFGEDKDLL-NHPELNFTTLVGGKENP-VDTFYYVQIKSIMV 306
Query: 341 GGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
GG+ L + GG ++DSGT ++ Y +K F+K+ G+P F I
Sbjct: 307 GGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPI 366
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
LD C+N+S +++++P + F A V YF++ D + VCLA+
Sbjct: 367 LDPCYNVSGVEKIDLPDFGILFADGAVWNFPVEN--YFIRLDPEEVVCLAILGTP-RSAL 423
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGNYQQ+N V+YDTK S+LG+A +C+ +
Sbjct: 424 SIIGNYQQQNFHVLYDTKKSRLGYAPMNCADV 455
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 124/367 (33%), Positives = 188/367 (51%), Gaps = 37/367 (10%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG + Y I +G + +++D+GSD+ W+QC+PC CYNQ DP+F+P+ S
Sbjct: 118 VVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSA 177
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S+ V C+S+ C+ L+ + C C Y V+YGDGSYT+G L E + +G+
Sbjct: 178 SFIGVACSSNVCNQLD----DDVACRKGR---CGYQVAYGDGSYTKGTLALETITIGRTV 230
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
+ D GCG N+G+F G +GL+GLG +S V Q GG F YCL S ++
Sbjct: 231 IQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCL-------VSRAM 283
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGG 354
+G + +I NP +FY ++L+G+++GG ++ Q + GG
Sbjct: 284 PVGA-----------MWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGG 332
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
+++D+GT ITRLP Y+A + F+ Q + P APG SI DTC++L+ + V +P V
Sbjct: 333 VVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFY 392
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F G +T + D C A A IIGN QQ+ +V D N +
Sbjct: 393 FSGGQILTFPARNFL-IPADDVGTFCFAFA--PSPSGLSIIGNIQQEGIQVSIDGTNGFV 449
Query: 475 GFAGEDC 481
GF C
Sbjct: 450 GFGPNVC 456
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 137/358 (38%), Positives = 193/358 (53%), Gaps = 35/358 (9%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
R+ + I+DTGSDL W QC+PC+ C++Q P+FDP S S+ K+ C+S C AL +T
Sbjct: 377 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTST--- 433
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIFGCGRNNKGL-F 257
CSS C Y +YGD S T+G L E G + S+ FGCG +N G F
Sbjct: 434 --CSSDG---CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGF 488
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--P 315
+GL+GLGR LSLVSQ E F+YCL + D+ S SL+LG +++ ++
Sbjct: 489 SQGAGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDSKPS-SLLLGSLANITPKTSKDE 544
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPP 368
+ T +I NP +FY L+L GIS+GG QL S F GG++IDSGT IT +
Sbjct: 545 MKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVEN 604
Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTG 427
S +++LK EF+ Q + G LD CFNL A +V +P + F+G +++ G
Sbjct: 605 SAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKG---ADLELPG 661
Query: 428 IVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y + S A +CLA+ S I GN QQ+N V++D + L F C S+
Sbjct: 662 ENYMIGDSKAGLLCLAIGS---SRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 716
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 131/371 (35%), Positives = 191/371 (51%), Gaps = 33/371 (8%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+TSG + Y + +G R +++DTGSD+ W+QCQPC CY Q DP+FDP+ S
Sbjct: 8 PVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTAS 67
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
+Y V C S C +LE ++ SG C Y V+YGDGSYT G+ E + G +
Sbjct: 68 STYAPVTCQSQQCSSLEMSSCRSG--------QCLYQVNYGDGSYTFGDFATESVSFGNS 119
Query: 241 -SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
SV + GCG +N+GLF G +GL+GLG LSL T+++ FSYCL + AG+S
Sbjct: 120 GSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVNRDSAGSST 176
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGF 350
V + P ++ N ++ TFY + L+G+S+GG+ +L SG
Sbjct: 177 LDFNSAQLGVDSVTAP-----LMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESG- 230
Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
GGI++D GT ITRL Y+ L+ F++ ++ DTC++LS V +P
Sbjct: 231 -NGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPT 289
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V F + + V S A C A A + IIGN QQ+ RV +D
Sbjct: 290 VSFHFADGKSWNLPAANYLIPVDS-AGTYCFAFAPTT--SSLSIIGNVQQQGTRVTFDLA 346
Query: 471 NSQLGFAGEDC 481
N+++GF+ C
Sbjct: 347 NNRMGFSPNKC 357
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 137/358 (38%), Positives = 193/358 (53%), Gaps = 35/358 (9%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
R+ + I+DTGSDL W QC+PC+ C++Q P+FDP S S+ K+ C+S C AL +T
Sbjct: 122 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTST--- 178
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIFGCGRNNKGL-F 257
CSS C Y +YGD S T+G L E G + S+ FGCG +N G F
Sbjct: 179 --CSSDG---CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGF 233
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--P 315
+GL+GLGR LSLVSQ E F+YCL + D+ S SL+LG +++ ++
Sbjct: 234 SQGAGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDSKPS-SLLLGSLANITPKTSKDE 289
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPP 368
+ T +I NP +FY L+L GIS+GG QL S F GG++IDSGT IT +
Sbjct: 290 MKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVEN 349
Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTG 427
S +++LK EF+ Q + G LD CFNL A +V +P + F+G +++ G
Sbjct: 350 SAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKG---ADLELPG 406
Query: 428 IVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y + S A +CLA+ S I GN QQ+N V++D + L F C S+
Sbjct: 407 ENYMIGDSKAGLLCLAIGS---SRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 136/369 (36%), Positives = 188/369 (50%), Gaps = 36/369 (9%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
YIA I +G G + +DT SDLTW+QCQPC+ CY Q PVFDP S SY+++ N++
Sbjct: 137 EYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAA 196
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCG 250
C AL + G + C Y V YGDGS T G+ E L G + GCG
Sbjct: 197 DCQALGRSGGGDAKRGT-----CVYTVGYGDGSTTVGDFIEETLTFAGGVRLPRISIGCG 251
Query: 251 RNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
+NKGLFG +G++GLGR +S +Q G FSYCL S S L +
Sbjct: 252 HDNKGLFGAPAAGILGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPGSLSSTLTFGAGA 309
Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGG--------KQLQASGF-AKGGILIDSG 360
S P+++T + N + TFY + LTGIS+GG + LQ + +GG+++DSG
Sbjct: 310 VDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRGGVIVDSG 369
Query: 361 TVITRLPPSIYSALKAEF------LKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
T +TRL Y+A + F L Q S G PS GF DTC+ + +P V M
Sbjct: 370 TAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPS--GF--FDTCYTVGGRGMKKVPTVSM 425
Query: 414 EFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G+ E+ + Y + D+ VC A A+ + IIGN QQ+ R++YD
Sbjct: 426 HFAGSVEVKLQPKN--YLIPVDSMGTVCFAFAATG-DHSVSIIGNIQQQGFRIVYDI-GG 481
Query: 473 QLGFAGEDC 481
++GFA C
Sbjct: 482 RVGFAPNSC 490
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 148/421 (35%), Positives = 213/421 (50%), Gaps = 44/421 (10%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK---DVSNTEI-------PLTSGIRLQT 132
D+ +RL D+ V+ + R++ +S + + TEI P+ SG +
Sbjct: 93 DYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGS 152
Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y + + +G + +++DTGSD+ W+QCQPC CY Q DP+FDP S S+ + C S
Sbjct: 153 GEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGC 249
C ALE SG C +S C Y VSYGDGS+T GE E L G + +ND GC
Sbjct: 213 QQCQALE----TSG-CRASK---CLYQVSYGDGSFTVGEFVTETLTFGNSGMINDVAVGC 264
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
G +N+GLF G GL +S TS++ FSYCL D +S S L NS+
Sbjct: 265 GHDNEGLF---VGSAGLLGLGGGPLSLTSQMKASSFSYCL---VDRDSSSSSDLEFNSAA 318
Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGFAKGGILIDSG 360
+S ++ + ++ TFY + LTG+S+GG+ Q+ SG+ GGI++DSG
Sbjct: 319 PSDS---VNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY--GGIIVDSG 373
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T ITRL Y+ L+ F+ + GF++ DTC++LS+ V IP V EF G
Sbjct: 374 TAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKS 433
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + + V S C A A + IIGN QQ+ RV YD NS +GF+
Sbjct: 434 LQLPPKNYLIPVDS-VGTFCFAFAPTT--SSLSIIGNVQQQGTRVHYDLANSVVGFSPHK 490
Query: 481 C 481
C
Sbjct: 491 C 491
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 146/422 (34%), Positives = 217/422 (51%), Gaps = 43/422 (10%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISGNI---KDVSNTEI------PLTSG-IRLQT 132
D+N + RL D VQ+L ++ ++G + ++ + I P+ SG +
Sbjct: 86 DYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSG 145
Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFDPSISPSYKKVL 187
Y+A I +G + ++ DTGSD+TW+QCQPC S CY Q DP+FDP S SY +
Sbjct: 146 AEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLS 205
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFI 246
CNS C L+ A NS C Y V YGDGS+T GEL E L G + S+ +
Sbjct: 206 CNSQQCKLLDKANCNSDTCI--------YQVHYGDGSFTTGELATETLSFGNSNSIPNLP 257
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
GCG +N+GLF G +GL+GLG +SL SQ + FSYCL + D+ +S +L N
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQ---LKASSFSYCLVNL-DSDSSSTLEFNSN 313
Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDS 359
++P ++ N + ++ + + GIS+GGK L S GGI++DS
Sbjct: 314 MPSDSLTSP-----LVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
GT+I+RLP +Y +L+ F+K S APG S+ DTC+N S V +P +
Sbjct: 369 GTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGT 428
Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
+ + + + + A CLA + + IIG++QQ+ RV YD NS +GF+
Sbjct: 429 SLRLPARNYLIMLDT-AGTYCLAF--IKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTN 485
Query: 480 DC 481
C
Sbjct: 486 KC 487
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/350 (38%), Positives = 183/350 (52%), Gaps = 29/350 (8%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
IVDTGSDL W QC+PC C+NQ PVFDPS S +Y + C+SS C L +T C+S
Sbjct: 134 IVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSSLCSDLPTST-----CTS 188
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLG 267
++ DC Y +YGD S T+G L E L K + FGCG N+G F +GL+GLG
Sbjct: 189 AA-KDCGYTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDTNEGDGFTQGAGLVGLG 247
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNMIPNP 325
R LSLVSQ G FSYCL S D S L+ L S+ ++ I T +I NP
Sbjct: 248 RGPLSLVSQLGL---GKFSYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNP 304
Query: 326 QLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEF 378
+FY + L +++G + L S FA GG+++DSGT IT L Y LK F
Sbjct: 305 SQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAF 364
Query: 379 LKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
Q P A G ++ LD CF A +V +P + + F+G A++ D+ Y V
Sbjct: 365 AAQMK-LPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGADL--DLPAENYMVLDS 421
Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
AS +CL + IIGN+QQ+N + +YD L FA C+ +
Sbjct: 422 ASGALCLTVMG---SRGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQCAKL 468
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 135/376 (35%), Positives = 195/376 (51%), Gaps = 27/376 (7%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
PL SG+ + Y A + +G T +++DTGSD+ W+QC PC+ CY Q VFDP S
Sbjct: 116 PLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRS 175
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
SY V C + C L+ A + S C Y V+YGDGS T G+ E L +
Sbjct: 176 RSYAAVDCVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLTFARG 229
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
A V GCG +N+GLF SGL+GLGR LS SQ + FG FSYCL S+
Sbjct: 230 ARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPS 289
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
++ S + + + ++T M NP++ATFY ++L G S+GG +++
Sbjct: 290 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 349
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
+GG+++DSGT +TRL +Y A++ F G +P GFS+ DTC+NLS + V
Sbjct: 350 PTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVV 409
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRV 465
+P V M G A + + Y + D S C A+A + IIGN QQ+ RV
Sbjct: 410 KVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFRV 465
Query: 466 IYDTKNSQLGFAGEDC 481
++D ++GF + C
Sbjct: 466 VFDGDAQRVGFVPKSC 481
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 142/467 (30%), Positives = 221/467 (47%), Gaps = 37/467 (7%)
Query: 47 KSGSSSSCVSHQKSRIEMGAITLELKHKNYCSG------KIVDWNEQQQNRLILDNLHVQ 100
K G++ QK ++ L + H+ G +D E+ R+ + +H +
Sbjct: 54 KLGAAEDAADEQKPASPSSSLKLHMTHRRGAEGGRTRKGSFLDLAEKDAVRV--EAMHRR 111
Query: 101 YLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTW 158
S + + + SG+ + + Y+ + +G R +I+DTGSDL W
Sbjct: 112 VASSSSSPRRGRALSESERVVATVESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNW 171
Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC-HALEFATGNSGVCSSSSPPDCNYF 217
+QC PC C+ Q+ PVFDP+ S SY+ + C C H C C Y+
Sbjct: 172 LQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYY 231
Query: 218 VSYGDGSYTRGELGREHLGL------GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
YGD S + G+L E + + V+ +FGCG N+GLF G +GL+GLGR L
Sbjct: 232 YWYGDQSNSTGDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPL 291
Query: 272 SLVSQTSEIFGG-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLA- 328
S SQ ++GG FSYCL + + ++ G + ++ + P + YT P A
Sbjct: 292 SFASQLRAVYGGHTFSYCL-VDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPAD 350
Query: 329 TFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
TFY + LTG+ +GG+ L AS GG +IDSGT ++ Y ++ F+ +
Sbjct: 351 TFYYVRLTGVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDR 410
Query: 382 FSG-FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV- 439
SG +P P F +L C+N+S + +P + + F A D YF++ D +
Sbjct: 411 MSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLLFADGA--VWDFPAENYFIRLDPDGIM 468
Query: 440 CLALASLSYEDETG--IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CLA+ TG IIGN+QQ+N V YD N++LGFA C+ +
Sbjct: 469 CLAVLGTP---RTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRCAEV 512
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 135/376 (35%), Positives = 195/376 (51%), Gaps = 27/376 (7%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
PL SG+ + Y A + +G T +++DTGSD+ W+QC PC+ CY Q VFDP S
Sbjct: 110 PLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRS 169
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
SY V C + C L+ A + S C Y V+YGDGS T G+ E L +
Sbjct: 170 RSYAAVDCVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLTFARG 223
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
A V GCG +N+GLF SGL+GLGR LS SQ + FG FSYCL S+
Sbjct: 224 ARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPS 283
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
++ S + + + ++T M NP++ATFY ++L G S+GG +++
Sbjct: 284 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 343
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
+GG+++DSGT +TRL +Y A++ F G +P GFS+ DTC+NLS + V
Sbjct: 344 PTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVV 403
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRV 465
+P V M G A + + Y + D S C A+A + IIGN QQ+ RV
Sbjct: 404 KVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFRV 459
Query: 466 IYDTKNSQLGFAGEDC 481
++D ++GF + C
Sbjct: 460 VFDGDAQRVGFVPKSC 475
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 141/395 (35%), Positives = 204/395 (51%), Gaps = 39/395 (9%)
Query: 110 ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSC 167
++ N D +N + P G + ++ + +G + IVDTGSDL W QC+PC C
Sbjct: 87 VASNPDDTNNIKAPTHGG----SGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTEC 142
Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
++Q P+FDP S SY KV C+S C+AL + N S C Y +YGD S TR
Sbjct: 143 FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDS------CEYLYTYGDYSSTR 196
Query: 228 GELGREHLGL-GKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
G L E + S++ FGCG N+G F SGL+GLGR LSL+SQ E F
Sbjct: 197 GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET---KF 253
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNST------PITYT-NMIPNPQLATFYILNLTGI 338
SYCL S +D+ AS SL +G +S N T +T T +++ NP +FY L L GI
Sbjct: 254 SYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGI 313
Query: 339 SIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
++G K+L + S GG++IDSGT IT L + + LK EF + S G
Sbjct: 314 TVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS 373
Query: 392 SILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYE 449
+ LD CF L +A + + +P + F+G +++ G Y V ++ V CLA+ S
Sbjct: 374 TGLDLCFKLPNAAKNIAVPKLIFHFKG---ADLELPGENYMVADSSTGVLCLAMGS---S 427
Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ I GN QQ+N V++D + + F +C +
Sbjct: 428 NGMSIFGNVQQQNFNVLHDLEKETVTFVPTECGKL 462
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 137/393 (34%), Positives = 198/393 (50%), Gaps = 47/393 (11%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G ++ ++I+DTGSDL W+QC PC C+ Q P +DP S
Sbjct: 185 LESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSI 244
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGRE 233
S++ + CN C + SS PP C YF YGD S T G+ E
Sbjct: 245 SFRNITCNDPRCQ----------LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALE 294
Query: 234 HLGL-------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
+ GK+ V + +FGCG N+GLF G +GL+GLGR LS SQ ++G
Sbjct: 295 TFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGH 354
Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGIS 339
FSYCL D S LI G + + + + +T++I NP + TFY L + I
Sbjct: 355 SFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENP-VDTFYYLQIKSIF 412
Query: 340 IGGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
+GG++LQ S GG +IDSGT ++ Y +K FL++ G+ F
Sbjct: 413 VGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFP 472
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQVCLALASLSYEDE 451
IL C+N+S E+N P ++F A V YF++ VCLA+ +
Sbjct: 473 ILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVEN--YFIRIQQLDIVCLAMLGTP-KSA 529
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGNYQQ+N ++YDTKNS+LG+A C+ +
Sbjct: 530 LSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 138/445 (31%), Positives = 220/445 (49%), Gaps = 34/445 (7%)
Query: 66 AITLELKHKNYCSGKIVD---WNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE- 121
++ L +KH++ G+ ++ +++ + ++ +H + +S + M + + + +E
Sbjct: 74 SLQLRMKHRSAEGGRTRKESFLDKAEKDAVRIETMHRRAARSGVARMPASSSPRRALSER 133
Query: 122 --IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
+ SG+ + + Y+ + +G R +I+DTGSDL W+QC PC C+ Q+ PVFDP
Sbjct: 134 MVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDP 193
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
+ S SY+ V C C L C + C Y+ YGD S T G+L E +
Sbjct: 194 AASSSYRNVTCGDQRC-GLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTV 252
Query: 238 ------GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
V+ +FGCG N+GLF G +GL+GLGR LS SQ ++G FSYCL
Sbjct: 253 NLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCL-- 310
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG- 349
+ +GS ++ G + + YT P A TFY + L G+ +GG L S
Sbjct: 311 VEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSD 370
Query: 350 ------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSA 402
GG +IDSGT ++ Y ++ F+ S +P P F +L+ C+N+S
Sbjct: 371 TWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSG 430
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETG--IIGNYQ 459
+ +P + + F A D YFV+ D + CLA+ TG IIGN+Q
Sbjct: 431 VERPEVPELSLLFADGA--VWDFPAENYFVRLDPDGIMCLAVRGTP---RTGMSIIGNFQ 485
Query: 460 QKNQRVIYDTKNSQLGFAGEDCSSM 484
Q+N V+YD +N++LGFA C+ +
Sbjct: 486 QQNFHVVYDLQNNRLGFAPRRCAEV 510
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 134/376 (35%), Positives = 195/376 (51%), Gaps = 27/376 (7%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
PL SG+ + Y A + +G T +++DTGSD+ W+QC PC+ CY Q VFDP S
Sbjct: 110 PLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRS 169
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
SY V C + C L+ A + S C Y V+YGDGS T G+ E L +
Sbjct: 170 RSYAAVDCVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLTFARG 223
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
A V GCG +N+GLF SGL+GLGR LS +Q + FG FSYCL S+
Sbjct: 224 ARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPS 283
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
++ S + + + ++T M NP++ATFY ++L G S+GG +++
Sbjct: 284 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 343
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
+GG+++DSGT +TRL +Y A++ F G +P GFS+ DTC+NLS + V
Sbjct: 344 PTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVV 403
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRV 465
+P V M G A + + Y + D S C A+A + IIGN QQ+ RV
Sbjct: 404 KVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFRV 459
Query: 466 IYDTKNSQLGFAGEDC 481
++D ++GF + C
Sbjct: 460 VFDGDAQRVGFVPKSC 475
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 137/393 (34%), Positives = 198/393 (50%), Gaps = 47/393 (11%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G ++ ++I+DTGSDL W+QC PC C+ Q P +DP S
Sbjct: 185 LESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSI 244
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGRE 233
S++ + CN C + SS PP C YF YGD S T G+ E
Sbjct: 245 SFRNITCNDPRCQ----------LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALE 294
Query: 234 HLGL-------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
+ GK+ V + +FGCG N+GLF G +GL+GLGR LS SQ ++G
Sbjct: 295 TFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGH 354
Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGIS 339
FSYCL D S LI G + + + + +T++I NP + TFY L + I
Sbjct: 355 SFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENP-VDTFYYLQIKSIF 412
Query: 340 IGGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
+GG++LQ S GG +IDSGT ++ Y +K FL++ G+ F
Sbjct: 413 VGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFP 472
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQVCLALASLSYEDE 451
IL C+N+S E+N P ++F A V YF++ VCLA+ +
Sbjct: 473 ILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVEN--YFIRIQQLDIVCLAMLGTP-KSA 529
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGNYQQ+N ++YDTKNS+LG+A C+ +
Sbjct: 530 LSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 139/440 (31%), Positives = 224/440 (50%), Gaps = 44/440 (10%)
Query: 67 ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIK--DVSN 119
++LEL ++ + + D+ +RL D+ V + ++I+ + G ++K D+
Sbjct: 82 LSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIRFAVEGIDRSDLKPVDIDE 141
Query: 120 TEI-------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
T P+ SG + Y + I +G + M V++DTGSD+ W+QC PC CY Q
Sbjct: 142 TRFQPEDLTTPVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQ 201
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
DP+FDP+ S ++K + C+ C +L+ + C S+ C Y VSYGDGS+T G
Sbjct: 202 SDPIFDPTSSSTFKSLTCSDPKCASLDVS-----ACRSNK---CLYQVSYGDGSFTVGNY 253
Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
+ + G++ VND GCG +N+GLF G +GL+GLG LS+ T++I FSYCL
Sbjct: 254 ATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSM---TNQIKAKSFSYCL 310
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---- 345
+D+ S SL ++T ++ N ++ TFY + L+G S+GG+Q+
Sbjct: 311 VD-RDSAKSSSLDFNSVQIGAGDAT----APLLRNSKMDTFYYVGLSGFSVGGQQVSIPS 365
Query: 346 ---QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLS 401
+ GG+++D GT +TRL Y++L+ F+K + F S+ DTC++ S
Sbjct: 366 SLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFS 425
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
+ V +P V F G + + + + DA C A A S IIGN QQ+
Sbjct: 426 SLSTVKVPTVTFHFTGGKSLNLPAKNYLIPID-DAGTFCFAFAPTS--SSLSIIGNVQQQ 482
Query: 462 NQRVIYDTKNSQLGFAGEDC 481
R+ YD N+ +G + C
Sbjct: 483 GTRITYDLANNLIGLSANKC 502
>gi|194690050|gb|ACF79109.1| unknown [Zea mays]
Length = 166
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 89/152 (58%), Positives = 123/152 (80%), Gaps = 1/152 (0%)
Query: 330 FYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
FY++NLTGI++GG++++++GF+ I +DSGTVIT L PS+Y+A++AEF+ Q + +P AP
Sbjct: 13 FYLVNLTGITVGGQEVESTGFSARAI-VDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAP 71
Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
GFSILDTCFN++ +EV +P + + F+G AE+ VD G++YFV SD+SQVCLA+ASL E
Sbjct: 72 GFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSE 131
Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
DET IIGNYQQKN RV++DT SQ+GFA E C
Sbjct: 132 DETSIIGNYQQKNLRVVFDTSASQVGFAQETC 163
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 147/421 (34%), Positives = 213/421 (50%), Gaps = 44/421 (10%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK---DVSNTEI-------PLTSGIRLQT 132
D+ +RL D+ V+ + R++ +S + + TEI P+ SG +
Sbjct: 93 DYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGS 152
Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y + + +G + +++DTGSD+ W+QCQPC CY Q DP+FDP S S+ + C S
Sbjct: 153 GEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGC 249
C ALE SG C +S C Y VSYGDGS+T GE E L G + +N+ GC
Sbjct: 213 QQCQALE----TSG-CRASK---CLYQVSYGDGSFTVGEFVIETLTFGNSGMINNVAVGC 264
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
G +N+GLF G GL +S TS++ FSYCL D +S S L NS+
Sbjct: 265 GHDNEGLF---VGSAGLLGLGGGSLSLTSQMKASSFSYCL---VDRDSSSSSDLEFNSAA 318
Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGFAKGGILIDSG 360
+S ++ + ++ TFY + LTG+S+GG+ Q+ SG+ GGI++DSG
Sbjct: 319 PSDS---VNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY--GGIIVDSG 373
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T ITRL Y+ L+ F+ + GF++ DTC++LS+ V IP V EF G
Sbjct: 374 TAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKS 433
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + + V S C A A + IIGN QQ+ RV YD NS +GF+
Sbjct: 434 LQLPPKNYLIPVDS-VGTFCFAFAPTT--SSLSIIGNVQQQGTRVHYDLANSVVGFSPHK 490
Query: 481 C 481
C
Sbjct: 491 C 491
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 135/356 (37%), Positives = 188/356 (52%), Gaps = 33/356 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ IVDTGSDL W QC+PC C++Q P+FDP S SY KV C+S C+AL + N
Sbjct: 121 SAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCN---- 176
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGL-FGGVSGLM 264
C Y +YGD S TRG L E + S++ FGCG N+G F SGL+
Sbjct: 177 --EDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLV 234
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST------PITY 318
GLGR LSL+SQ E FSYCL S +D+ AS SL +G +S N T +T
Sbjct: 235 GLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTK 291
Query: 319 T-NMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGILIDSGTVITRLPPSI 370
T +++ NP +FY L L GI++G K+L + S F GG++IDSGT IT L +
Sbjct: 292 TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETA 351
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
+ LK EF + S G + LD CF L A + + +P + F+G +++ G
Sbjct: 352 FKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKG---ADLELPGEN 408
Query: 430 YFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y V ++ V CLA+ S + I GN QQ+N V++D + + F +C +
Sbjct: 409 YMVADSSTGVLCLAMGS---SNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 194/363 (53%), Gaps = 30/363 (8%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+AT+ LG R +VIVDTGSDLTWVQC PC +CY+Q D +F P+ S S+ K+ C +
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIF 247
C+ L + +C+ ++ C Y+ SYGDGS + G+ + + + K V +F F
Sbjct: 63 CNGLPYP-----MCNQTT---CVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAF 114
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
GCG +N+G F G G++GLG+ LS SQ +F G FSYCL S +L G++
Sbjct: 115 GCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDA 174
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSG 360
+V + Y +++ NP++ T+Y + L GIS+GGK L S A + G + DSG
Sbjct: 175 AV-PTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSG 233
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCF-NLSAYQEVNIPLVKMEFEGN 418
T +T+L ++ + A +P + S LD C + Q +P + FEG
Sbjct: 234 TTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEG- 292
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
+M + + F++S S C ++ S + IIG+ QQ+N +V YDT ++GF
Sbjct: 293 GDMELPPSNYFIFLESSQS-YCFSMVS---SPDVTIIGSIQQQNFQVYYDTVGRKIGFVP 348
Query: 479 EDC 481
+ C
Sbjct: 349 KSC 351
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 146/422 (34%), Positives = 218/422 (51%), Gaps = 43/422 (10%)
Query: 83 DWNEQQQNRLILDNLHVQYLQSRIKNMISGNI---KDVSNTEI------PLTSG-IRLQT 132
D+N + RL D VQ+L ++ ++G + ++ + I P+ SG +
Sbjct: 86 DYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSG 145
Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFDPSISPSYKKVL 187
Y+A I +G + ++ DTGSD+TW+QCQPC S CY Q DP+FDP S SY +
Sbjct: 146 AEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLS 205
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFI 246
CNS C L+ A NS C Y V YGDGS+T GEL E L G + S+ +
Sbjct: 206 CNSQQCKLLDKANCNSDTCI--------YQVHYGDGSFTTGELATETLSFGNSNSIPNLP 257
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
GCG +N+GLF G +GL+GLG +SL SQ + FSYCL + D+ +S +L
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQ---LKASSFSYCLVNL-DSDSSSTLEFNS- 312
Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDS 359
+ S +T + ++ N + ++ + + GIS+GGK L S GGI++DS
Sbjct: 313 ---YMPSDSLT-SPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
GT+I+RLP +Y +L+ F+K S APG S+ DTC+N S V +P +
Sbjct: 369 GTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGT 428
Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
+ + + + + A CLA + + IIG++QQ+ RV YD NS +GF+
Sbjct: 429 SLRLPARNYLIMLDT-AGTYCLAF--IKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTN 485
Query: 480 DC 481
C
Sbjct: 486 KC 487
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 117/396 (29%), Positives = 202/396 (51%), Gaps = 35/396 (8%)
Query: 106 IKNMISGNI--KDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQC 161
+ ++I G++ D + P+ SG+ + Y A++ +G +++DTGSD+ W+QC
Sbjct: 68 VASLIIGSLTAHDDDHLHSPVISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQC 127
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
+PC CY Q P++DP S +Y + C+ C + G +G C Y + YG
Sbjct: 128 KPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQCRNPQTCDGTTG--------GCGYRIVYG 179
Query: 222 DGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
D S T G L + L SV + GCG +N+GLFG +GL+G+ R + S +Q ++
Sbjct: 180 DASSTSGNLATDRLVFSNDTSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADS 239
Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
+G F+YCL +G+S S ++ G ++ S+ +T + NP+ + Y +++ G S+
Sbjct: 240 YGRYFAYCLGDRTRSGSSSSYLVFGRTAPEPPSS--VFTPLRSNPRRPSLYYVDMVGFSV 297
Query: 341 GGKQLQASGFA-----------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF---P 386
GG+ + +GF+ +GG+++DSGT ITR Y AL+ F + +
Sbjct: 298 GGEPV--TGFSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRK 355
Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALAS 445
G S+ D C++L + P V + F G A++ + Y V ++ + C AL +
Sbjct: 356 VGRGISVFDACYDLRGVAVADAPGVVLHFAGGADVALPPEN--YLVPEESGRYHCFALEA 413
Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ D +IGN Q+ RV++D +N ++GF C
Sbjct: 414 AGH-DGLSVIGNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 130/375 (34%), Positives = 193/375 (51%), Gaps = 32/375 (8%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
T+ + Y+AT+ LG R +VIVDTGSDLTWVQC PC CY+Q D +F P+ S
Sbjct: 2 FTAPVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTST 61
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG--- 238
S+ K+ C S+ C+ L F +C+ ++ C Y+ SYGDGS T G+ + + +
Sbjct: 62 SFTKLACGSALCNGLPFP-----MCNQTT---CVYWYSYGDGSLTTGDFVYDTITMDGIN 113
Query: 239 --KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
K V +F FGCG +N+G F G G++GLG+ LS SQ ++ G FSYCL
Sbjct: 114 GQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPP 173
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-------G 349
S +L G+++V + Y ++ NP++ T+Y + L GIS+G L S
Sbjct: 174 TQTSPLLFGDAAV-PILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDS 232
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA-PGFSILDTCFN-LSAYQEVN 407
G + DSGT +T+L + Y + A + S LD C + Q
Sbjct: 233 VGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPT 292
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDETGIIGNYQQKNQRVI 466
+P + FEG +M + + YF+ ++SQ C A+ S + IIG+ QQ+N +V
Sbjct: 293 VPAMTFHFEG-GDMVLPPSN--YFIYLESSQSYCFAMTS---SPDVNIIGSVQQQNFQVY 346
Query: 467 YDTKNSQLGFAGEDC 481
YDT +LGF +DC
Sbjct: 347 YDTAGRKLGFVPKDC 361
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 135/356 (37%), Positives = 188/356 (52%), Gaps = 33/356 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ IVDTGSDL W QC+PC C++Q P+FDP S SY KV C+S C+AL + N
Sbjct: 13 SAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCN---- 68
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGL-FGGVSGLM 264
C Y +YGD S TRG L E + S++ FGCG N+G F SGL+
Sbjct: 69 --EDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLV 126
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST------PITY 318
GLGR LSL+SQ E FSYCL S +D+ AS SL +G +S N T +T
Sbjct: 127 GLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTK 183
Query: 319 T-NMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGILIDSGTVITRLPPSI 370
T +++ NP +FY L L GI++G K+L + S F GG++IDSGT IT L +
Sbjct: 184 TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETA 243
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
+ LK EF + S G + LD CF L A + + +P + F+G +++ G
Sbjct: 244 FKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKG---ADLELPGEN 300
Query: 430 YFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y V ++ V CLA+ S + I GN QQ+N V++D + + F +C +
Sbjct: 301 YMVADSSTGVLCLAMGS---SNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 353
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 132/339 (38%), Positives = 183/339 (53%), Gaps = 21/339 (6%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
I DTGSDLTW QC PC CY Q P+F+P S S+ V CN+ TCHA++ G+ GV
Sbjct: 108 IADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD--DGHCGVQGV 165
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
C+Y +YGD +Y++G+LG E + +G +SV I GCG + G FG SG++GLG
Sbjct: 166 -----CDYSYTYGDRTYSKGDLGFEKITIGSSSVKSVI-GCGHASSGGFGFASGVIGLGG 219
Query: 269 SDLSLVSQTSEIFG--GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
LSLVSQ S+ G FSYCLP T + A+G + G N+ V S P + + +
Sbjct: 220 GQLSLVSQMSQTSGISRRFSYCLP-TLLSHANGKINFGENAVV---SGPGVVSTPLISKN 275
Query: 327 LATFYILNLTGISIGGKQLQASGFAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
T+Y + L ISIG ++ A FAK G ++IDSGT +T LP +Y + + LK
Sbjct: 276 TVTYYYITLEAISIGNERHMA--FAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAK 333
Query: 386 PSAPGFSILDTCFN--LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
LD CF+ ++A + IP++ F G A V++ I F K + CL L
Sbjct: 334 RVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGAN--VNLLPINTFRKVADNVNCLTL 391
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S E GIIGN Q N + YD + +L F C+
Sbjct: 392 KAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 133/360 (36%), Positives = 189/360 (52%), Gaps = 33/360 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ + +G + + I+DTGSDL W QCQPC C+NQ P+F+P S S+ + C+S
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
C AL +S CS++ C Y YGDGS T+G +G E L G S+ + FGCG N
Sbjct: 155 CQAL-----SSPTCSNNF---CQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGEN 206
Query: 253 NKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
N+G G +GL+G+GR LSL SQ FSYC+ + S +L+LG ++
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPS-NLLLGSLANSVT 262
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------KGGILIDSGTVI 363
+P T +I + Q+ TFY + L G+S+G +L S FA GGI+IDSGT +
Sbjct: 263 AGSP--NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 320
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL-SAYQEVNIPLVKMEFEGNAEM 421
T + Y +++ EF+ Q + P G S D CF S + IP M F+G
Sbjct: 321 TYFVNNAYQSVRQEFISQIN-LPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG--- 376
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+++ YF+ +CLA+ S S I GN QQ+N V+YDT NS + FA C
Sbjct: 377 DLELPSENYFISPSNGLICLAMGSSS--QGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 136/389 (34%), Positives = 199/389 (51%), Gaps = 39/389 (10%)
Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
R++ M++G S E P+ +G Y+ + +G + + I+DTGSDL W QCQ
Sbjct: 73 RLEAMLNG----PSGVETPVYAGDG----EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ 124
Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
PC C+NQ P+F+P S S+ + C+S C AL+ T CS++S C Y YGD
Sbjct: 125 PCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPT-----CSNNS---CQYTYGYGD 176
Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
GS T+G +G E L G S+ + FGCG NN+G G +GL+G+GR LSL SQ
Sbjct: 177 GSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT- 235
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
FSYC+ + +S +L+LG ++ +P T +I + Q+ TFY + L G+S+G
Sbjct: 236 --KFSYCMTPIGSSNSS-TLLLGSLANSVTAGSP--NTTLIQSSQIPTFYYITLNGLSVG 290
Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
L S GGI+IDSGT +T + Y A++ F+ Q + S
Sbjct: 291 STPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSG 350
Query: 394 LDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
D CF + + Q + IP M F+G ++ + YF+ +CLA+ S S
Sbjct: 351 FDLCFQMPSDQSNLQIPTFVMHFDG-GDLVLPSEN--YFISPSNGLICLAMGSSS--QGM 405
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I GN QQ+N V+YDT NS + F C
Sbjct: 406 SIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 136/389 (34%), Positives = 199/389 (51%), Gaps = 39/389 (10%)
Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
R++ M++G S E P+ +G Y+ + +G + + I+DTGSDL W QCQ
Sbjct: 73 RLEAMLNG----PSGVETPVYAGDG----EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ 124
Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
PC C+NQ P+F+P S S+ + C+S C AL+ S CS++S C Y YGD
Sbjct: 125 PCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQ-----SPTCSNNS---CQYTYGYGD 176
Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
GS T+G +G E L G S+ + FGCG NN+G G +GL+G+GR LSL SQ
Sbjct: 177 GSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT- 235
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
FSYC+ + + S +L+LG ++ +P T +I + Q+ TFY + L G+S+G
Sbjct: 236 --KFSYCM-TPIGSSTSSTLLLGSLANSVTAGSP--NTTLIESSQIPTFYYITLNGLSVG 290
Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
L S GGI+IDSGT +T + Y A++ F+ Q + S
Sbjct: 291 STPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSG 350
Query: 394 LDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
D CF + + Q + IP M F+G ++ + YF+ +CLA+ S S
Sbjct: 351 FDLCFQMPSDQSNLQIPTFVMHFDG-GDLVLPSEN--YFISPSNGLICLAMGSSS--QGM 405
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I GN QQ+N V+YDT NS + F C
Sbjct: 406 SIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 123/358 (34%), Positives = 182/358 (50%), Gaps = 24/358 (6%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ I LG + + IVDTGSDL WVQC PC C+ Q DP+F P S SY C S
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSL 67
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
C AL T CS + C Y SYGDGS TRG+ E + L +++ FGCG N
Sbjct: 68 CDALPRPT-----CSMRN--TCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIGFGCGHN 120
Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
+G F G GL+GLG+ LSL SQ + F +FSYCL G + G + +
Sbjct: 121 QEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRA 180
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGF-----AKGGILIDSGTVITR 365
S +T ++ N ++Y + + IS+G +++ S F GG+++DSGT IT
Sbjct: 181 S----FTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITY 236
Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE-GNAEMTVD 424
+ + + AE +Q S + P L+ C+++S+ ++ L M N + +
Sbjct: 237 WRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVDFEIP 296
Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
V+ + V + VC A+++ D+ IIGN QQ+N ++ D NS++GF DCS
Sbjct: 297 VSNLWVLVDNFGETVCTAMST---SDQFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 120/328 (36%), Positives = 178/328 (54%), Gaps = 32/328 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNS 203
TVI+D+GSD++WVQC+PC C+ Q+DP+FDP++S +Y V C S+ C L + G
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRG-- 226
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFGGV 260
CS+++ C + ++YGDGS G + L LG V F FGC ++G V
Sbjct: 227 --CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDV 282
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK-----NSTP 315
+G + LG SLV QT+ +G +FSYCLP T A + G L+LG + STP
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPT--ASSLGFLVLGVPPERAQLIPSFVSTP 340
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSAL 374
+ ++M P TFY + L I + G+ L +IDS T+I+RLPP+ Y AL
Sbjct: 341 LLSSSMAP-----TFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQAL 395
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
+A F + + +AP SILDTC++ + + + +P + + F+G A + +D GI+
Sbjct: 396 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 451
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKN 462
CLA A + + G IGN QQK
Sbjct: 452 ---GSCLAFAPTASDRMPGFIGNVQQKT 476
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 135/283 (47%), Gaps = 49/283 (17%)
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
CS+++ C + ++YGDGS G + L LG V+
Sbjct: 480 CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVD---------------------- 515
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFKN--STPITYTN 320
R L L +T+ +G +FSYC+P + + G + LG +++ STP+ ++
Sbjct: 516 --RQGLPL--RTATQYGRVFSYCIPPSPSS--LGFITLGVPPQRAALVPTFVSTPLLSSS 569
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
+P TFY + L I + G+ L F+ + I S TVI+RLPP+ Y AL+A F
Sbjct: 570 SMP----PTFYRVLLRAIIVAGRPLPVPPTVFSTSSV-IASTTVISRLPPTAYQALRAAF 624
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
+ + + +AP SILDTC++ + + + +P + + F+G A + +D GI+ Q
Sbjct: 625 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-------Q 677
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + + G IGN QQ+ V+YD + F C
Sbjct: 678 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/328 (36%), Positives = 178/328 (54%), Gaps = 32/328 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNS 203
TVI+D+GSD++WVQC+PC C+ Q+DP+FDP++S +Y V C S+ C L + G
Sbjct: 78 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRG-- 135
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFGGV 260
CS+++ C + ++YGDGS G + L LG V F FGC ++G V
Sbjct: 136 --CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDV 191
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK-----NSTP 315
+G + LG SLV QT+ +G +FSYCLP T A + G L+LG + STP
Sbjct: 192 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPT--ASSLGFLVLGVPPERAQLIPSFVSTP 249
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSAL 374
+ ++M P TFY + L I + G+ L +IDS T+I+RLPP+ Y AL
Sbjct: 250 LLSSSMAP-----TFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQAL 304
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
+A F + + +AP SILDTC++ + + + +P + + F+G A + +D GI+
Sbjct: 305 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 360
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKN 462
CLA A + + G IGN QQK
Sbjct: 361 ---GSCLAFAPTASDRMPGFIGNVQQKT 385
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 135/283 (47%), Gaps = 49/283 (17%)
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
CS+++ C + ++YGDGS G + L LG V+
Sbjct: 389 CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVD---------------------- 424
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFKN--STPITYTN 320
R L L +T+ +G +FSYC+P + + G + LG +++ STP+ ++
Sbjct: 425 --RQGLPL--RTATQYGRVFSYCIPPSPSS--LGFITLGVPPQRAALVPTFVSTPLLSSS 478
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
+P TFY + L I + G+ L F+ + I S TVI+RLPP+ Y AL+A F
Sbjct: 479 SMP----PTFYRVLLRAIIVAGRPLPVPPTVFSTSSV-IASTTVISRLPPTAYQALRAAF 533
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
+ + + +AP SILDTC++ + + + +P + + F+G A + +D GI+ Q
Sbjct: 534 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-------Q 586
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + + G IGN QQ+ V+YD + F C
Sbjct: 587 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/349 (36%), Positives = 187/349 (53%), Gaps = 27/349 (7%)
Query: 140 ELGGRNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL- 196
+L G TV++D+ SD+ WVQC PC C+ Q D +DPS SPS C+S TC AL
Sbjct: 153 KLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALG 212
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKG 255
+A G C+++ C Y V Y DGS T G + L L +V+ F FGC +G
Sbjct: 213 PYANG----CANN---QCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQG 265
Query: 256 LFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
F +G+M LG SL+SQT+ +G FSYC+P+T A SG LG + S+
Sbjct: 266 SFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPAT--ASDSGFFTLGVPR---RASS 320
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYS 372
T M+ Q ATFY + L I++GG++L + FA G +L DS T ITRLPP+ Y
Sbjct: 321 RYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVL-DSRTAITRLPPTAYQ 379
Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
AL++ F + + SAP LDTC++ + + +P + + F+ NA + +D +GI++
Sbjct: 380 ALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF-- 437
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA S + + G++G+ QQ+ V+YD +GF C
Sbjct: 438 -----NDCLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 144/377 (38%), Positives = 188/377 (49%), Gaps = 49/377 (12%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ +G R + I+DTGSDL W QC PC C +Q P FDP+ SPSY K+ CNS
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPM 148
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFG 248
C+AL + VC YF YGD + T G L E G + +V FG
Sbjct: 149 CNALYYPLCYRNVCVY------QYF--YGDSANTAGVLSNETFTFGTNDTRVTVPRIAFG 200
Query: 249 CGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASGS 300
CG N G LF G SG++G GR LSLVSQ FSYCL PS GA +
Sbjct: 201 CGNLNAGSLFNG-SGMVGFGRGPLSLVSQLGS---PRFSYCLTSFMSPVPSRLYFGAYAT 256
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------K 352
L NS+ P+ T I NP L T Y LN+TGIS+GG+ L S FA
Sbjct: 257 L----NSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGT 312
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS---ILDTCFNLSAYQE--VN 407
GG++IDSG+ IT L + Y + F Q G P S +LDTCF V
Sbjct: 313 GGVIIDSGSTITYLARAAYDMVHQAFADQV-GLPLTNATSLADVLDTCFVWPPPPRKIVT 371
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+P + FEG A M + + + + D +CLA+A+ D+ IIG++Q +N V+Y
Sbjct: 372 MPELAFHFEG-ANMELPLENYM-LIDGDTGNLCLAIAA---SDDGSIIGSFQHQNFHVLY 426
Query: 468 DTKNSQLGFAGEDCSSM 484
D +NS L F C+ M
Sbjct: 427 DNENSLLSFTPATCNVM 443
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/349 (36%), Positives = 187/349 (53%), Gaps = 27/349 (7%)
Query: 140 ELGGRNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL- 196
+L G TV++D+ SD+ WVQC PC C+ Q D +DPS SP+ C+S TC AL
Sbjct: 23 KLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALG 82
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKG 255
+A G C+++ C Y V Y DGS T G + L L +V+ F FGC +G
Sbjct: 83 PYANG----CANN---QCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQG 135
Query: 256 LFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
F +G+M LG SL+SQT+ +G FSYC+P+T A SG LG + S+
Sbjct: 136 SFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPAT--ASDSGFFTLGVPR---RASS 190
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYS 372
T M+ Q ATFY + L I++GG++L + FA G +L DS T ITRLPP+ Y
Sbjct: 191 RYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVL-DSRTAITRLPPTAYQ 249
Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
AL+A F + + SAP LDTC++ + + +P + + F+ NA + +D +GI++
Sbjct: 250 ALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF-- 307
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA S + + G++G+ QQ+ V+YD +GF C
Sbjct: 308 -----NDCLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ IVDTGSDL W QC+PC C+ Q PVFDPS S +Y V C+S++C L + C
Sbjct: 119 SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSK-----C 173
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
+S+S C Y +YGD S T+G L E L K+ + +FGCG N+G F +GL+G
Sbjct: 174 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 231
Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
LGR LSLVSQ GL FSYCL S D S L+ L G S ++ + T +
Sbjct: 232 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 286
Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
I NP +FY ++L I++G + L +S FA GG+++DSGT IT L Y AL
Sbjct: 287 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 346
Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
K F Q + P+A G + LD CF A +V +P + F+G A++ D+ Y
Sbjct: 347 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 403
Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V S +CL + IIGN+QQ+N + +YD + L FA C+ +
Sbjct: 404 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 454
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ IVDTGSDL W QC+PC C+ Q PVFDPS S +Y V C+S++C L + C
Sbjct: 109 SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSK-----C 163
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
+S+S C Y +YGD S T+G L E L K+ + +FGCG N+G F +GL+G
Sbjct: 164 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 221
Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
LGR LSLVSQ GL FSYCL S D S L+ L G S ++ + T +
Sbjct: 222 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 276
Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
I NP +FY ++L I++G + L +S FA GG+++DSGT IT L Y AL
Sbjct: 277 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 336
Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
K F Q + P+A G + LD CF A +V +P + F+G A++ D+ Y
Sbjct: 337 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 393
Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V S +CL + IIGN+QQ+N + +YD + L FA C+ +
Sbjct: 394 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 444
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ IVDTGSDL W QC+PC C+ Q PVFDPS S +Y V C+S++C L + C
Sbjct: 88 SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP-----TSKC 142
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
+S+S C Y +YGD S T+G L E L K+ + +FGCG N+G F +GL+G
Sbjct: 143 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 200
Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
LGR LSLVSQ GL FSYCL S D S L+ L G S ++ + T +
Sbjct: 201 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 255
Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
I NP +FY ++L I++G + L +S FA GG+++DSGT IT L Y AL
Sbjct: 256 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 315
Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
K F Q + P+A G + LD CF A +V +P + F+G A++ D+ Y
Sbjct: 316 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 372
Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V S +CL + IIGN+QQ+N + +YD + L FA C+ +
Sbjct: 373 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 423
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/407 (28%), Positives = 199/407 (48%), Gaps = 31/407 (7%)
Query: 98 HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
H +++ + S D P+ SG+ + Y A I +G V++DTGSD
Sbjct: 51 HAAPFTAQVASFHSIAADDDDRLRSPVMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSD 110
Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 215
L W+QC PC+ CY Q P++DP S +++++ C S C + G C + + C
Sbjct: 111 LIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPRCRDVLRYPG----CDART-GGCV 165
Query: 216 YFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
Y V YGDGS + G+L + L V++ GCG +N GL +GL+G+GR LS
Sbjct: 166 YMVVYGDGSASSGDLATDRLVFPDDTHVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFP 225
Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
+Q + +G +FSYCL +GS L + ST +T + NP+ + Y ++
Sbjct: 226 TQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTPEPPST--AFTPLRTNPRRPSLYYVD 283
Query: 335 LTGISIGGKQLQASGFA-----------KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
+ G S+GG+++ +GF+ +GGI++DSGT I+R Y+A++ F +
Sbjct: 284 MVGFSVGGERV--TGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAA 341
Query: 384 GFPS----APGFSILDTCFNL----SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
+ A FS+ D C++L + V +P + + F G A+M + + V+
Sbjct: 342 AAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGG 401
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ L + +D ++GN QQ+ +++D + ++GF CS
Sbjct: 402 DRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVERGRIGFTPNGCS 448
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/277 (40%), Positives = 153/277 (55%), Gaps = 19/277 (6%)
Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
C Y V YGDGSYT G + L L ++ F FGCG N+GLFG +GL+GLGR S
Sbjct: 21 CLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTS 80
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL----A 328
L QT + +GG+F++C P+ +G L G S+P + P L
Sbjct: 81 LPVQTYDKYGGVFAHCFPARSS--GTGYLEFG------PGSSPAVSAKLSTTPMLIDTGP 132
Query: 329 TFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--G 384
TFY + +TGI +GGK L S FA G ++DSGTVITRLPP+ YS+L++ F + G
Sbjct: 133 TFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAASMAARG 192
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+ AP S+LDTC++L+ EV IP V + F+G + VD +GI+Y + SQ CL A
Sbjct: 193 YKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIY--AASVSQACLGFA 250
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
D+ I+GN Q K V+YD + +GF C
Sbjct: 251 GNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 145/436 (33%), Positives = 210/436 (48%), Gaps = 46/436 (10%)
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
G +++L H++ D ++ Q RL D SR+ G + + T +
Sbjct: 30 GGFSVDLIHRDSPHSPFFDPSKTQAERLT-DAFRRSV--SRV-----GRFRPTAMTSDGI 81
Query: 125 TSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
S I Y+ + +G + VI VDTGSDLTW QC+PC CY Q P+FDP S +
Sbjct: 82 QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSST 141
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----G 238
Y+ C +S C AL G CS C + SY DGS+T G L E L + G
Sbjct: 142 YRDSSCGTSFCLAL----GKDRSCSKEK--KCTFRYSYADGSFTGGNLASETLTVDSTAG 195
Query: 239 K-ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
K S F FGCG ++ G+F SG++GLG +LSL+SQ GLFSYC LP + D+
Sbjct: 196 KPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDS 255
Query: 296 GASGSLILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
S + G + V STP+ + P+ TFY L L GIS+G K+L G++K
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLVQKS--PD----TFYYLTLEGISVGKKRLPYKGYSK 309
Query: 353 ------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
G I++DSGT T LP YS L+ G I C+N +A E+
Sbjct: 310 KTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EI 367
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
N P++ F+ + V++ + F++ VC +A S + G++GN Q N V
Sbjct: 368 NAPIITAHFK---DANVELQPLNTFMRMQEDLVCFTVAPTS---DIGVLGNLAQVNFLVG 421
Query: 467 YDTKNSQLGFAGEDCS 482
+D + ++ F DC+
Sbjct: 422 FDLRKKRVSFKAADCT 437
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 185/354 (52%), Gaps = 38/354 (10%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
I+DTGSDL W QC+PC C+NQ PVFDPS S +Y + C+S+ C L S C+
Sbjct: 117 AIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCSSTLCSDLP-----SSKCT 171
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGL 266
S+ C Y +YGD S T+G L E L K + D FGCG N+G F +GL+GL
Sbjct: 172 SAK---CGYTYTYGDSSSTQGVLAAETFTLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGL 228
Query: 267 GRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGGNSSV---FKNSTPITYTNM 321
GR LSLVSQ GL FSYCL S D S L+LG +++ ++ + T +
Sbjct: 229 GRGPLSLVSQL-----GLNKFSYCLTSLDDTSKS-PLLLGSLATISESAAAASSVQTTPL 282
Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
I NP +FY +NL G+++G L +S FA GG+++DSGT IT L Y AL
Sbjct: 283 IRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYLELQGYRAL 342
Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
K F Q P+A G I LDTCF A +V +P + +G +D+ Y
Sbjct: 343 KKAFAAQMK-LPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHLDG---ADLDLPAENYM 398
Query: 432 V-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V S + +CL + IIGN+QQ+N + +YD + L FA C+ +
Sbjct: 399 VLDSGSGALCLTVMG---SRGLSIIGNFQQQNIQFVYDVGENTLSFAPVQCAKL 449
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 199/392 (50%), Gaps = 46/392 (11%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G ++ ++I+DTGSDL W+QC PC C+ Q P +DP S
Sbjct: 170 LESGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSS 229
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
SY+ + C+ S CH + SS PP C Y+ YGD S T G+ E
Sbjct: 230 SYRNIGCHDSRCH----------LVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALE 279
Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
+ GK V + +FGCG N+GLF G +GL+GLGR LS SQ ++G
Sbjct: 280 TFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHS 339
Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGISI 340
FSYCL DA S LI G + + + + +T ++ NP + TFY + + I +
Sbjct: 340 FSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVAGKENP-VDTFYYVQIKSIVV 397
Query: 341 GG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
GG ++ Q + GG +IDSGT ++ Y +K F+ + G+P F +
Sbjct: 398 GGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPV 457
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
L+ C+N++ ++ ++P + F A V YF++ + + VCLA+
Sbjct: 458 LEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVEN--YFIEIEPREVVCLAILGTP-PSAL 514
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGNYQQ+N ++YDTK S+LGFA C+ +
Sbjct: 515 SIIGNYQQQNFHILYDTKKSRLGFAPTKCADV 546
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 133/381 (34%), Positives = 196/381 (51%), Gaps = 27/381 (7%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG+ + + Y+ + +G R +I+DTGSDL W+QC PC C+ Q+ PVFDP+ S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSL 200
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL---- 237
SY+ V C C + T S P C Y+ YGD S T G+L E +
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDP-CPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 238 --GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
V+D +FGCG +N+GLF G +GL+GLGR LS SQ ++G FSYCL D
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL---VDH 316
Query: 296 GAS-GSLILGGNSSVFKNSTPITYTNMIPNPQLA--TFYILNLTGISIGGKQLQASGF-- 350
G+S GS I+ G+ + YT P+ A TFY + L G+ +GG++L S
Sbjct: 317 GSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTW 376
Query: 351 -----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQ 404
GG +IDSGT ++ Y ++ F+++ +P F +L C+N+S +
Sbjct: 377 DVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVE 436
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQ 463
V +P + F A D YFV+ D + CLA+ + IIGN+QQ+N
Sbjct: 437 RVEVPEFSLLFADGA--VWDFPAENYFVRLDPDGIMCLAVLG-TPRSAMSIIGNFQQQNF 493
Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
V+YD +N++LGFA C+ +
Sbjct: 494 HVLYDLQNNRLGFAPRRCAEV 514
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 133/381 (34%), Positives = 196/381 (51%), Gaps = 27/381 (7%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG+ + + Y+ + +G R +I+DTGSDL W+QC PC C+ Q+ PVFDP+ S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASL 200
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL---- 237
SY+ V C C + T S P C Y+ YGD S T G+L E +
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDP-CPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 238 --GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
V+D +FGCG +N+GLF G +GL+GLGR LS SQ ++G FSYCL D
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL---VDH 316
Query: 296 GAS-GSLILGGNSSVFKNSTPITYTNMIPNPQLA--TFYILNLTGISIGGKQLQASGF-- 350
G+S GS I+ G+ + YT P+ A TFY + L G+ +GG++L S
Sbjct: 317 GSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTW 376
Query: 351 -----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQ 404
GG +IDSGT ++ Y ++ F+++ +P F +L C+N+S +
Sbjct: 377 DVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVE 436
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQ 463
V +P + F A D YFV+ D + CLA+ + IIGN+QQ+N
Sbjct: 437 RVEVPEFSLLFADGA--VWDFPAENYFVRLDPDGIMCLAVLG-TPRSAMSIIGNFQQQNF 493
Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
V+YD +N++LGFA C+ +
Sbjct: 494 HVLYDLQNNRLGFAPRRCAEV 514
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ IVDTGSDL W QC+PC C+ Q PVFDPS S +Y V C+S++C L + C
Sbjct: 181 SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSK-----C 235
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
+S+S C Y +YGD S T+G L E L K+ + +FGCG N+G F +GL+G
Sbjct: 236 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 293
Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
LGR LSLVSQ GL FSYCL S D S L+ L G S ++ + T +
Sbjct: 294 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 348
Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
I NP +FY ++L I++G + L +S FA GG+++DSGT IT L Y AL
Sbjct: 349 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 408
Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
K F Q + P+A G + LD CF A +V +P + F+G A++ D+ Y
Sbjct: 409 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 465
Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V S +CL + IIGN+QQ+N + +YD + L FA C+ +
Sbjct: 466 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 516
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 141/408 (34%), Positives = 217/408 (53%), Gaps = 49/408 (12%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
D V ++ S+ SGN+K+ ++ + + + N++ + G + +I+DT
Sbjct: 92 DESRVSFINSKCNQYTSGNLKNHAHN-----NNLFDEDGNFLVDVAFGTPPQKFKLILDT 146
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
GS +TW QC+ C C FD S +Y C ST GN+
Sbjct: 147 GSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIPST-------VGNT--------- 190
Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFG-GVSGLMGLGRSD 270
Y ++YGD S + G G + + L + V F FGCGRNN+G FG G G++GLG+
Sbjct: 191 ---YNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDFGSGADGMLGLGQGQ 247
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP----- 325
LS VSQT+ F +FSYCLP + + GSL+ G ++ S+ + +T+++ P
Sbjct: 248 LSTVSQTASKFKKVFSYCLP---EENSIGSLLFGEKAT--SQSSSLKFTSLVNGPGTSGL 302
Query: 326 QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
+ + +Y + L IS+G K+L +S FA G +IDSGTVITRLP YSALKA F K +
Sbjct: 303 EESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVITRLPQRAYSALKAAFKKAMA 362
Query: 384 GFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+P + G +LDTC+NLS ++V +P + F A++ ++ +V+ +DAS++
Sbjct: 363 KYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGKRVVW--GNDASRL 420
Query: 440 CLALASLS---YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CLA A S E IIGN QQ + V+YD + ++GF G CS++
Sbjct: 421 CLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCSNL 468
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 145/401 (36%), Positives = 207/401 (51%), Gaps = 39/401 (9%)
Query: 99 VQYLQSRIKNMISGNIKDVSNTEI--PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
++ R++ + + + SN EI P+ SG ++ + +G + I+DTGS
Sbjct: 66 IKRANHRLERLNAMVLAASSNAEINSPVLSG----NGEFLMNLAIGTPPETYSAIMDTGS 121
Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
DL W QC+PC C++Q P+FDP S S+ K+ C+S C AL ++ CS S C
Sbjct: 122 DLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSS-----CSDS----C 172
Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSL 273
Y +YGD S T+G + E GK S+ + FGCG +N+G F SGL+GLGR LSL
Sbjct: 173 EYLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSL 232
Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
VSQ E FSYCL S D S +L++G +SV S I T +I NP +FY L
Sbjct: 233 VSQLKE---AKFSYCLTSIDDTKTS-TLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYL 288
Query: 334 NLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
+L GIS+GG +L Q GG++IDSGT IT L S + +K EF Q G P
Sbjct: 289 SLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQM-GLP 347
Query: 387 -SAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV-KSDASQVCLAL 443
G + L+ C+NL S E+ +P + + F G +++ G Y + S +CLA+
Sbjct: 348 VDNSGATGLELCYNLPSDTSELEVPKLVLHFTG---ADLELPGENYMIADSSMGVICLAM 404
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
S I GN QQ+N V +D + L F +C +
Sbjct: 405 GS---SGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCGQL 442
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 128/339 (37%), Positives = 181/339 (53%), Gaps = 21/339 (6%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
I DTGSDLTW QC PC CY Q P+F+P S S+ V CN+ TCHA++ G+ GV
Sbjct: 96 IADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD--DGHCGVQGV 153
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
C+Y +YGD +Y++G+LG E + +G +SV I GCG + G FG SG++GLG
Sbjct: 154 -----CDYSYTYGDRTYSKGDLGFEKITIGSSSVKSVI-GCGHASSGGFGFASGVIGLGG 207
Query: 269 SDLSLVSQTSEIFG--GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
LSLVSQ S+ G FSYCLP T + A+G + G N+ V S P + + +
Sbjct: 208 GQLSLVSQMSQTSGISRRFSYCLP-TLLSHANGKINFGQNAVV---SGPGVVSTPLISKN 263
Query: 327 LATFYILNLTGISIGGKQLQASGFAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
T+Y + L ISIG ++ A FAK G ++IDSGT ++ LP +Y + + LK
Sbjct: 264 TVTYYYITLEAISIGNERHMA--FAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAK 321
Query: 386 PSAPGFSILDTCFN--LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
+ D CF+ ++ IP++ +F G A V++ + F K + CL L
Sbjct: 322 RVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGAN--VNLLPVNTFQKVANNVNCLTL 379
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S DE GIIGN N + YD + +L F C+
Sbjct: 380 TPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 418
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 134/354 (37%), Positives = 185/354 (52%), Gaps = 26/354 (7%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
IVDTGSDL W QC+PC C+NQ PVFDP+ S +Y + C+S+ C L +T S S
Sbjct: 131 AIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSALCADLPTSTCASSSSS 190
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGL 266
SS+ C Y +YGD S T+G L E L + V FGCG N+G F +GL+GL
Sbjct: 191 SSASSPCGYTYTYGDASSTQGVLATETFTLARQKVPGVAFGCGDTNEGDGFTQGAGLVGL 250
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL--GGNSSVFKNSTPITYTNMIPN 324
GR LSLVSQ FSYCL S DA L+L S + P T ++ N
Sbjct: 251 GRGPLSLVSQLGI---DRFSYCLTSLDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKN 307
Query: 325 PQLATFYILNLTGISIGGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAE 377
P +FY ++LTG+++G +L +S FA GG+++DSGT IT L Y AL+
Sbjct: 308 PSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKA 367
Query: 378 FLKQFSGFPSAPGFSI-LDTCFNLSAYQ-----EVNIPLVKMEFEGNAEMTVDVTGIVYF 431
F+ S P+ I LD CF A +V +P + + F+G A++ D+ Y
Sbjct: 368 FVAHMS-LPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKLVLHFDGGADL--DLPAENYM 424
Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V AS +CL + + IIGN+QQ+N + +YD L FA +C+ +
Sbjct: 425 VLDSASGALCLTVMA---SRGLSIIGNFQQQNFQFVYDVAGDTLSFAPAECNKL 475
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 173/301 (57%), Gaps = 27/301 (8%)
Query: 92 LILDNLHVQYLQSRIKN---------MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG 142
L D+ V+ L SR+ + +I+ + +PL G + + NY + G
Sbjct: 66 LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFG 125
Query: 143 --GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
R ++IVDTGS L+W+QC+PC C+ Q DP+FDPS S +YK + C SS C +L A
Sbjct: 126 SPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDA 185
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFG 258
T N+ +C +SS C Y SYGD SY+ G L ++ L L + ++ F++GCG+++ GLFG
Sbjct: 186 TLNNPLCETSSN-VCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS---SVFKNSTP 315
+G++GLGR+ LS++ Q S FG FSYCLP+ G G L +G S S +K
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR---GGGGFLSIGKASLAGSAYK---- 297
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSAL 374
+T M +P + Y L LT I++GG+ L A+ + +IDSGTVITRLP S+Y+
Sbjct: 298 --FTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITRLPMSVYTPF 355
Query: 375 K 375
+
Sbjct: 356 Q 356
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 188/345 (54%), Gaps = 29/345 (8%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ I+DTGSDL W QC+PC C++Q P+FDP S S+ K+ C+S C AL ++ N+G
Sbjct: 111 SAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNG-- 168
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
C Y SYGD S T+G L E L GKASV + FGCG +N+G F +GL+G
Sbjct: 169 -------CEYLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVG 221
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LGR LSLVSQ E FSYCL + D S +L++G +SV +S+ I T +I +P
Sbjct: 222 LGRGPLSLVSQLKE---PKFSYCLTTVDDTKTS-TLLMGSLASVNASSSAIKTTPLIHSP 277
Query: 326 QLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEF 378
+FY L+L GIS+G +L + S F+ GG++IDSGT IT L S ++ + EF
Sbjct: 278 AHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEF 337
Query: 379 LKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV-KSDA 436
+ + + G + LD CF L S + +P + F+G +++ Y + S
Sbjct: 338 TAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDG---ADLELPAENYMIGDSSM 394
Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA+ S S I GN QQ+N V++D + L F C
Sbjct: 395 GVACLAMGSSS---GMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 145/399 (36%), Positives = 202/399 (50%), Gaps = 46/399 (11%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
D L +Y+Q ++ D++ +P T G L T+ Y+ T+ +G T+++DT
Sbjct: 92 DQLRAKYIQRKLSGTDGLQPLDLT---VPTTLGSALDTMEYVITVGIGSPAVTQTMMIDT 148
Query: 153 GSDLTWVQCQPCKSCYNQQD--PVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-CSSS 209
GSD++WV+C N D +FDPS S +Y C+S+ C L GN+G CS+S
Sbjct: 149 GSDVSWVRC-------NSTDGLTLFDPSKSTTYAPFSCSSAACAQL----GNNGDGCSNS 197
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFG-GVSGLMGLG 267
C Y V YGDGS T G + L L + +V DF FGC + + G + GLMGLG
Sbjct: 198 G---CQYRVQYGDGSNTTGTYSSDTLALSASDTVTDFHFGCSHHEEDFDGEKIDGLMGLG 254
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
SLVSQT+ +G FSYCLP T SG L G + S T M+ P+
Sbjct: 255 GDAQSLVSQTAATYGKSFSYCLPPTNR--TSGFLTFGAPNG---TSGGFVTTPMLRWPKA 309
Query: 328 ATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEF---LKQF 382
T Y + L IS+GG L Q S + G ++ DSGTVIT LP YSAL + F + +
Sbjct: 310 PTLYGVLLQDISVGGTPLGIQPSVLSNGSVM-DSGTVITWLPRRAYSALSSAFRSSMTRL 368
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+AP ILDTC++ + V+IP V + +G A + +D GI+ Q CLA
Sbjct: 369 RHQRAAP-LGILDTCYDFTGLVNVSIPAVSLVLDGGAVVDLDGNGIMI-------QDCLA 420
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A+ S + IIGN QQ+ V++D GF C
Sbjct: 421 FAATSGDS---IIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 187/349 (53%), Gaps = 31/349 (8%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ I+DTGSDL W QC+PC C++Q P+FDP S S+ K+ C+S C AL +T + G
Sbjct: 111 SAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCSDG-- 168
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
C Y YGD S T+G L E L GK SV + FGCG +N+G F SGL+G
Sbjct: 169 -------CEYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVG 221
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LGR LSLVSQ E FSYCL S D AS +L++G +SV + + I T +I N
Sbjct: 222 LGRGPLSLVSQLKE---PKFSYCLTSVDDTKAS-TLLMGSLASVKASDSEIKTTPLIQNS 277
Query: 326 QLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEF 378
+FY L+L GIS+G L + S F+ GG++IDSGT IT L S + + EF
Sbjct: 278 AQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEF 337
Query: 379 LKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
Q + G + L+ CF L S ++ +P + F+G A++ + ++ +DAS
Sbjct: 338 TSQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDG-ADLELPAEN---YMIADAS 393
Query: 438 Q--VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CLA+ S S I GN QQ+N V++D + L F C +
Sbjct: 394 MGVACLAMGSSS---GMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 173/340 (50%), Gaps = 26/340 (7%)
Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNSGVC 206
+DT DL W+QC PC CY QQ+ +FDP S + V C S+ C L + G C
Sbjct: 166 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAG----C 221
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGG-VSGLM 264
S++ C YFV YGDG T G + L L ++V +F FGC +G F SG M
Sbjct: 222 SNN---QCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTM 278
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
LG SL+SQT+ FG FSYC+P D +SG L LGG + T ++ N
Sbjct: 279 SLGGGRQSLLSQTAATFGNAFSYCVP---DPSSSGFLSLGGPADGGGAGR-FARTPLVRN 334
Query: 325 PQ-LATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
P + T Y++ L GI +GG++L GG ++DS +IT+LPP+ Y AL+ F
Sbjct: 335 PSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAM 394
Query: 383 SGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
+ +P A G + LDTC++ + V +P V + F+G A + +D G++ + CL
Sbjct: 395 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCL 447
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A + G IGN QQ+ V+YD +GF C
Sbjct: 448 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 118/344 (34%), Positives = 173/344 (50%), Gaps = 26/344 (7%)
Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
T+ +DT D+ W+QC PC CY Q+DP+FDP+ S + V C S C +L GN G
Sbjct: 149 TMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLG-PYGN-G 206
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGV-SG 262
+ S+ +C Y + Y D T G + L + G +V +F FGC +G F + +G
Sbjct: 207 CSNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGCSHAVRGRFSDLTAG 266
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG----NSSVFKNSTPITY 318
M LG SL++QT+ G FSYC+P A ASG L +GG NS+ +TP+
Sbjct: 267 TMSLGGGAQSLLAQTARSLGNAFSYCVP---QASASGFLSIGGPATTNSTTVFATTPLVR 323
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAE 377
+ + NP L Y++ L GI + G++L A G ++DS VIT+LPP+ Y AL+
Sbjct: 324 SAI--NPSL---YLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRALRRA 378
Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
F +P + LDTC++ V +P V + F G A + +D ++
Sbjct: 379 FRNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMI------- 431
Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA + S + G IGN QQ+ V+YD +GF C
Sbjct: 432 GGCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 123/340 (36%), Positives = 173/340 (50%), Gaps = 26/340 (7%)
Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
+DT DL W+QC PC CY QQ+ +FDP S + V C S+ C L G G C
Sbjct: 150 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 205
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGG-VSGLM 264
S++ C YFV YGDG T G + L L ++V +F FGC +G F SG M
Sbjct: 206 SNN---QCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTM 262
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
LG SL+SQT+ FG FSYC+P D +SG L LGG + T ++ N
Sbjct: 263 SLGGGRQSLLSQTAATFGNAFSYCVP---DPSSSGFLSLGGPADGGGAGR-FARTPLVRN 318
Query: 325 PQ-LATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
P + T Y++ L GI +GG++L GG ++DS +IT+LPP+ Y AL+ F
Sbjct: 319 PSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAM 378
Query: 383 SGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
+ +P A G + LDTC++ + V +P V + F+G A + +D G++ + CL
Sbjct: 379 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCL 431
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A + G IGN QQ+ V+YD +GF C
Sbjct: 432 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 195/385 (50%), Gaps = 39/385 (10%)
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SG+ + Y A I +G + V++DTGSDL W+QC PC+ CY Q P++DP S
Sbjct: 80 PVMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNS 139
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GK 239
+++++ C S C + G C + + C Y V YGDGS + G+L + L L
Sbjct: 140 KTHRRIPCASPQCRGVLRYPG----CDART-GGCVYMVVYGDGSASSGDLATDTLVLPDD 194
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGAS 298
V++ GCG +N+GL +GL+G GR LS +Q + +G +FSYCL A S
Sbjct: 195 TRVHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNS 254
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA------- 351
S ++ G + ++ +T + NP+ + Y +++ G S+GG+++ +GF+
Sbjct: 255 SSYLVFGRTPELPST---AFTPLRTNPRRPSLYYVDMVGFSVGGERV--AGFSNASLALN 309
Query: 352 ----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-------FSILDTCFNL 400
+GG+++DSGT I+R Y+A++ F+ +A G FS+ DTC+++
Sbjct: 310 PATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHA----AAAGMRRLRNKFSVFDTCYDV 365
Query: 401 SAYQE---VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
V +P + + F A+M + + V + L + +D ++GN
Sbjct: 366 HGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGN 425
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
QQ+ V++D + ++GF CS
Sbjct: 426 VQQQGFGVVFDVERGRIGFTPNGCS 450
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 208/425 (48%), Gaps = 58/425 (13%)
Query: 92 LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--------YIATIELGG 143
L D Y+Q R+ ++G ++ ++P+++ Q++ Y A +
Sbjct: 68 LWSDQHRADYIQWRLSGSVAGVLQPAD--DVPVSTNYEQQSIEGDLNYGTYYPAPAPMSS 125
Query: 144 RNM----------------TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKK 185
+ M T+++DT SD+TWVQC PC + CY Q+D ++DP+ S S
Sbjct: 126 KAMNPAATGGGGGGPGVTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGV 185
Query: 186 VLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVN 243
CNS TC L +A G C++++ C Y V Y DG+ T G + L + A +V
Sbjct: 186 FSCNSPTCTQLGPYANG----CTNNN--QCQYRVRYPDGTSTAGTYISDLLTITPATAVR 239
Query: 244 DFIFGCGRNNKGLFG---GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
F FGC +G F +G+M LG SLVSQT+ +G +FS+C P G
Sbjct: 240 SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTR---RGF 296
Query: 301 LILG-GNSSVFKNSTPITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG--FAKGGIL 356
LG + ++ T M+ NP + TFY++ L I++ G+++ FA G L
Sbjct: 297 FTLGVPRVAAWR----YVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAAL 352
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
DS T ITRLPP+ Y AL+ F + + + AP LDTC++++ + +P + + F+
Sbjct: 353 -DSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFD 411
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
NA + +D +G+++ Q CLA + + GIIGN Q + V+Y+ + +GF
Sbjct: 412 KNAAVELDPSGVLF-------QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGF 464
Query: 477 AGEDC 481
C
Sbjct: 465 RHAAC 469
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 208/425 (48%), Gaps = 58/425 (13%)
Query: 92 LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--------YIATIELGG 143
L D Y+Q R+ ++G ++ ++P+++ Q++ Y A +
Sbjct: 93 LWSDQHRADYIQWRLSGSVAGVLQPAD--DVPVSTNYEQQSIEGDLNYGTYYPAPAPMSS 150
Query: 144 RNM----------------TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKK 185
+ M T+++DT SD+TWVQC PC + CY Q+D ++DP+ S S
Sbjct: 151 KAMNPAATGGGGGGPGVTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGV 210
Query: 186 VLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVN 243
CNS TC L +A G C++++ C Y V Y DG+ T G + L + A +V
Sbjct: 211 FSCNSPTCTQLGPYANG----CTNNN--QCQYRVRYPDGTSTAGTYISDLLTITPATAVR 264
Query: 244 DFIFGCGRNNKGLFG---GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
F FGC +G F +G+M LG SLVSQT+ +G +FS+C P G
Sbjct: 265 SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTR---RGF 321
Query: 301 LILG-GNSSVFKNSTPITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG--FAKGGIL 356
LG + ++ T M+ NP + TFY++ L I++ G+++ FA G L
Sbjct: 322 FTLGVPRVAAWR----YVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAAL 377
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
DS T ITRLPP+ Y AL+ F + + + AP LDTC++++ + +P + + F+
Sbjct: 378 -DSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFD 436
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
NA + +D +G+++ Q CLA + + GIIGN Q + V+Y+ + +GF
Sbjct: 437 KNAAVELDPSGVLF-------QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGF 489
Query: 477 AGEDC 481
C
Sbjct: 490 RHAAC 494
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 184/369 (49%), Gaps = 26/369 (7%)
Query: 132 TLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
T Y+ + +G R + + +DTGSDL W QC PC+ C++Q PV DP+ S +Y + C
Sbjct: 81 TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-------SV 242
++ C AL F + GV + + C Y YGD S T GE+ + G +
Sbjct: 141 AARCRALPFTS--CGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHT 198
Query: 243 NDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
FGCG NKG+F +G+ G GR SL SQ + FSYC S ++ +S +
Sbjct: 199 RRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMFESKSS-LV 254
Query: 302 ILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILID 358
LGG+ + +S + T ++ NP + Y L+L GIS+G +L +ID
Sbjct: 255 TLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIID 314
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL---SAYQEVNIPLVKMEF 415
SG IT LP +Y A+KAEF Q PS S LD CF L + ++ +P + +
Sbjct: 315 SGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHL 374
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
EG A+ + + V F A +C+ L + E +IGN+QQ+N V+YD +N +L
Sbjct: 375 EG-ADWELPRSNYV-FEDLGARVMCIVLDAA--PGEQTVIGNFQQQNTHVVYDLENDRLS 430
Query: 476 FAGEDCSSM 484
FA C +
Sbjct: 431 FAPARCDRL 439
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 181/351 (51%), Gaps = 35/351 (9%)
Query: 145 NMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATG 201
+ TVI+D+GSD+ WVQCQPC C+ Q+DP+FDP+ S +Y V C+S+ C L + G
Sbjct: 80 SQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRG 139
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFG 258
C ++S C + ++Y +G+ G + L LG V F+FGC ++G
Sbjct: 140 ----CLANS--QCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQGSTFSY 193
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS------SVFKN 312
V+G + LG S V QT+ + +FSYC+P + + G ++ G F +
Sbjct: 194 DVAGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSS--FGFIMFGVPPQRAALVPTFVS 251
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSI 370
+ ++ + M P TFY + L I + G+ L F+ + IDS TVI+R+PP+
Sbjct: 252 TPLLSSSTMSP-----TFYRVLLRSIIVAGRPLPVPPTVFSASSV-IDSATVISRIPPTA 305
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
Y AL+A F + + AP SILDTC++ S + + +P + + F+G A + +D GI+
Sbjct: 306 YQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL 365
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
Q CLA A + + G IGN QQ+ V+YD + F C
Sbjct: 366 -------QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 150/422 (35%), Positives = 221/422 (52%), Gaps = 51/422 (12%)
Query: 77 CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD-VSNTEIPLTSGIRLQTLNY 135
CSG Q D V ++ S+ N+KD N ++ G N+
Sbjct: 109 CSGSGHSQPPSPQEIFGRDESRVSFINSKFNQYAPENLKDHTPNNKLFDEDG------NF 162
Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
+ + G + T+I+DTGS +TW QC+PC C FDPS S +Y C ST
Sbjct: 163 LVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIPST- 221
Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRN 252
GN+ Y ++YGD S + G G + + L + V F FGCGRN
Sbjct: 222 ------VGNT------------YNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFGCGRN 263
Query: 253 NKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
N+G FG G G++GLG+ LS VSQT+ F +FSYCLP + + GSL+ G ++
Sbjct: 264 NEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLP---EEDSIGSLLFGEKAT--S 318
Query: 312 NSTPITYTNMIPNP-----QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVIT 364
S+ + +T+++ P + + +Y + L IS+G K+L +S FA G +IDSGTVIT
Sbjct: 319 QSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVIT 378
Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
RLP YSALKA F K + +P + G ILDTC+NLS ++V +P + + F A+
Sbjct: 379 RLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGAD 438
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ ++ +++ +DAS++CLA A S E IIGN QQ + V+YD + ++GF G
Sbjct: 439 VRLNGKRVIW--GNDASRLCLAFAGNS---ELTIIGNRQQVSLTVLYDIQGGRIGFGGNG 493
Query: 481 CS 482
CS
Sbjct: 494 CS 495
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/349 (37%), Positives = 183/349 (52%), Gaps = 28/349 (8%)
Query: 143 GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFA 199
G +++DT SD+ WVQC PC + CY Q D ++DPS S S + C+S TC L +A
Sbjct: 179 GVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYA 238
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFG 258
G S SS+S C Y V Y DGS T G L + L L S V F FGC +G F
Sbjct: 239 NGCSS--SSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHAARGSFS 296
Query: 259 --GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
+G+M LGR SLVSQTS +G +FSYC P T A G +LG ++S+
Sbjct: 297 RSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPT--ASHKGFFVLGVPR---RSSSRY 351
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSAL 374
T M+ P L Y + L I++ G++L + FA G L DS TVITRLPP+ Y AL
Sbjct: 352 AVTPMLKTPML---YQVRLEAIAVAGQRLDVPPTVFAAGAAL-DSRTVITRLPPTAYQAL 407
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN-AEMTVDVTGIVYFVK 433
++ F + S + A LDTC++ + + +P + + F+ A + +D +G+++
Sbjct: 408 RSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLF--- 464
Query: 434 SDASQVCLALASLSYEDE-TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA AS + +D TGIIG Q + V+Y+ +GF C
Sbjct: 465 ----GSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 134/372 (36%), Positives = 193/372 (51%), Gaps = 33/372 (8%)
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDP 177
P+TSG Y A I +G ++ + DTGSD++W+QCQPC CY Q P+FDP
Sbjct: 172 PVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S SY + C+S CH L+ A C ++S C Y V YGDGS+T GEL E
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEA-----ACDANS---CIYEVEYGDGSFTVGELATETFSF 283
Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
+ S+ + GCG +N+GLF G +GL+GLG +SL SQ + FSYCL D+
Sbjct: 284 RHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQ---LEATSFSYCL-VDLDSE 339
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----- 351
+S +L + ++P ++ N + TF + + G+S+GGK L S +
Sbjct: 340 SSSTLDFNADQPSDSLTSP-----LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 394
Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
GGI++DSGT IT +P +Y L+ F+ P APG S DTC++LS+ V +P
Sbjct: 395 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 454
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ G + + ++ V S A CLA ++ IIGN QQ+ RV YD
Sbjct: 455 TIAFILPGENSLQLPAKNCLFQVDS-AGTFCLAFLPSTF--PLSIIGNVQQQGIRVSYDL 511
Query: 470 KNSQLGFAGEDC 481
NS +GF+ + C
Sbjct: 512 ANSLVGFSTDKC 523
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 156/450 (34%), Positives = 221/450 (49%), Gaps = 41/450 (9%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
SS +S+ ++ ++G T +L H++ + E RL + +H SR+ +
Sbjct: 16 SSPFLSNANAKSKLG-FTADLIHRDSPKSPFYNPTETSSQRL-RNAIHRSV--SRVFHFT 71
Query: 111 SGNIKDVSNT--EIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKS 166
+ KD S+ +I LTS + Y+ I LG + I DTGSDL W QC+PC
Sbjct: 72 DISQKDASDNAPQIDLTS----NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDD 127
Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYT 226
CY Q DP+FDP S +YK V C+SS C ALE N CS+ C+Y SYGD SYT
Sbjct: 128 CYTQVDPLFDPKASSTYKDVSCSSSQCTALE----NQASCSTED-NTCSYSTSYGDRSYT 182
Query: 227 RGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEI 280
+G + + L LG + + I GCG NN G F SG++GLG +SL++Q +
Sbjct: 183 KGNIAVDTLTLGSTDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDS 242
Query: 281 FGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
G FSYCL P T + + + G N+ V + T + T +I Q TFY L L IS
Sbjct: 243 IDGKFSYCLVPLTSENDRTSKINFGTNAVV--SGTGVVSTPLIAKSQ-ETFYYLTLKSIS 299
Query: 340 IGGKQLQ----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
+G K++Q SG +G I+IDSGT +T LP YS L+ + L
Sbjct: 300 VGSKEVQYPGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLS 359
Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGI 454
C+ SA ++ +P + M F+G V++ FV+ VC A S S+ I
Sbjct: 360 LCY--SATGDLKVPAITMHFDG---ADVNLKPSNCFVQISEDLVCFAFRGSPSF----SI 410
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
GN Q N V YDT + + F DC+ M
Sbjct: 411 YGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 440
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 194/389 (49%), Gaps = 43/389 (11%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G ++ ++I+DTGSDL W+QC PC +C+ Q P +DP S
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESS 240
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
S++ + C+ C + SS PP C YF YGD S T G+ E
Sbjct: 241 SFENITCHDPRC----------KLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALE 290
Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
+ GK+ V + +FGCG N+GLF G +GL+GLGR LS SQ I+G
Sbjct: 291 TFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHS 350
Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIG 341
FSYCL D S LI G + + + + +T+ + + + TFY + + I +
Sbjct: 351 FSYCLVDRNSDTSVSSKLIFGEDKELLSHPN-LNFTSFVGGEENSVDTFYYVGIKSIMVD 409
Query: 342 GKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
G+ L+ S GG +IDSGT +T Y +K F+K+ G+ GF L
Sbjct: 410 GEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPL 469
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
C+N+S +++ +P + F A V YF++ + VCLA+ + I
Sbjct: 470 KPCYNVSGIEKMELPDFGILFSDGAMWDFPVEN--YFIQIEPDLVCLAILGTP-KSALSI 526
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
IGNYQQ+N ++YD K S+LG+A C++
Sbjct: 527 IGNYQQQNFHILYDMKKSRLGYAPMKCTA 555
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 145/430 (33%), Positives = 214/430 (49%), Gaps = 44/430 (10%)
Query: 71 LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
L+H + SGK + E+ Q+ + +Q L + + + + + E P+ +G
Sbjct: 52 LRHVD--SGKNLTKLERVQHGIKRGKSRLQRLNAMV--LAASTLDSEDQLEAPIHAG--- 104
Query: 131 QTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
Y+ + +G ++ ++DTGSDL W QC+PC CY Q P+FDP S S+ KV C
Sbjct: 105 -NGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSC 163
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVND 244
SS C A+ +T + G C Y SYGD S T+G L E GK+ SV++
Sbjct: 164 GSSLCSAVPSSTCSDG---------CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214
Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGCG +N+G F SGL+GLGR LSLVSQ E FSYCL D S++L
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE---PRFSYCLTPMDDTKE--SILL 269
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGIL 356
G+ K++ + T ++ NP +FY L+L GIS+G +L + S F GG++
Sbjct: 270 LGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVI 329
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEF 415
IDSGT IT + + ALK EF+ Q + LD CF+L S +V IP + F
Sbjct: 330 IDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHF 389
Query: 416 EGNAEMTVDVTGIVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
+G +++ Y + S+ CLA+ + S I GN QQ+N V +D + +
Sbjct: 390 KGG---DLELPAENYMIGDSNLGVACLAMGASS---GMSIFGNVQQQNILVNHDLEKETI 443
Query: 475 GFAGEDCSSM 484
F C +
Sbjct: 444 SFVPTSCDQL 453
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 134/372 (36%), Positives = 191/372 (51%), Gaps = 33/372 (8%)
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDP 177
P+TSG Y A I +G ++ + DTGSD++W+QCQPC CY Q P+FDP
Sbjct: 172 PVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S SY + C+S CH L+ A C ++S C Y V YGDGS+T GEL E
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEA-----ACDANS---CIYEVEYGDGSFTVGELATETFSF 283
Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
+ S+ + GCG +N+GLF G GL+GLG +SL SQ + FSYCL D+
Sbjct: 284 RHSNSIPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQ---LEATSFSYCLVDL-DSE 339
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----- 351
+S +L + ++P ++ N + TF + + G+S+GGK L S +
Sbjct: 340 SSSTLDFNADQPSDSLTSP-----LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 394
Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
GGI++DSGT IT +P +Y L+ F+ P APG S DTC++LS+ V +P
Sbjct: 395 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 454
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ G + + + V S A CLA ++ IIGN QQ+ RV YD
Sbjct: 455 TIAFILPGENSLQLPAKNCLIQVDS-AGTFCLAFLPSTF--PLSIIGNVQQQGIRVSYDL 511
Query: 470 KNSQLGFAGEDC 481
NS +GF+ + C
Sbjct: 512 ANSLVGFSTDKC 523
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 129/392 (32%), Positives = 195/392 (49%), Gaps = 46/392 (11%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G ++ ++I+DTGSDL W+QC PC +C+ Q P +DP S
Sbjct: 184 LESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSS 243
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
S+K + C+ C + SS PP C YF YGD S T G+ E
Sbjct: 244 SFKNITCHDPRCQ----------LVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALE 293
Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
+ GK V + +FGCG N+GLF G +GL+GLGR LS +Q ++G
Sbjct: 294 TFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHS 353
Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI---PNPQLATFYILNLTGISI 340
FSYCL ++ S LI G + + + + +T+ + NP + TFY + + I +
Sbjct: 354 FSYCLVDRNSNSSVSSKLIFGEDKELLSHPN-LNFTSFVGGKENP-VDTFYYVLIKSIMV 411
Query: 341 GGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
GG+ L+ S GG +IDSGT +T Y +K F+++ GFP F
Sbjct: 412 GGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPP 471
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
L C+N+S +++ +P + F A V YF++ + VCLA+
Sbjct: 472 LKPCYNVSGVEKMELPEFAILFADGAMWDFPVEN--YFIQIEPEDVVCLAILGTP-RSAL 528
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGNYQQ+N ++YD K S+LG+A C+ +
Sbjct: 529 SIIGNYQQQNFHILYDLKKSRLGYAPMKCADV 560
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 188/368 (51%), Gaps = 28/368 (7%)
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ LG + + + +DT +D TW C PC +C + +F P+ S SY + C+SS
Sbjct: 80 SYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSS 137
Query: 192 TCHALEFAT------GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
C + G ++ P C + + D S+ + L + L LGK ++ ++
Sbjct: 138 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNY 196
Query: 246 IFGCGRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + G + GL+GLGR ++L+SQ ++ G+FSYCLPS + SGSL L
Sbjct: 197 TFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRL 256
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGIL 356
G ++ + YT M+ NP ++ Y +N+TG+S+G ++ A FA G +
Sbjct: 257 GAGGGQPRS---VRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTV 313
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+DSGTVITR +Y+AL+ EF +Q + DTCFN P V + +
Sbjct: 314 VDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMD 373
Query: 417 GNAEMTVDVTGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQ 473
G ++ + + + + S A+ + CLA+A + +I N QQ+N RV++D NS+
Sbjct: 374 GGVDLALPMENTL--IHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSR 431
Query: 474 LGFAGEDC 481
+GFA E C
Sbjct: 432 IGFAKESC 439
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 145/466 (31%), Positives = 227/466 (48%), Gaps = 46/466 (9%)
Query: 57 HQKSRIEMGAITLELKHK-NYCSGKIVDWNEQQQNRLILDNLHVQYLQSR------IKNM 109
H + +++ E+K + + +VD Q R+ LH ++ +S+ +K
Sbjct: 72 HTRESVKLHLRRREIKQETKRTTHSVVDLQIQDLTRI--QTLHARFKKSKKQRNEKVKKK 129
Query: 110 ISGNIKDVSNTEIP-------LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
I+ +I V E+ L SG+ L + Y + +G ++ ++I+DTGSDL W+Q
Sbjct: 130 ITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQ 189
Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
C PC C++Q + +DP S S+K + CN C + ++ V S C YF Y
Sbjct: 190 CLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLI--SSPEPPVQCKSDNQSCPYFYWY 247
Query: 221 GDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
GD S T G+ E + G++S V + +FGCG N+GLF G SGL+GLGR L
Sbjct: 248 GDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPL 307
Query: 272 SLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LA 328
S SQ ++G FSYCL D S LI G + + N T + +T+ + + +
Sbjct: 308 SFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL-NHTNLNFTSFVNGKENSVE 366
Query: 329 TFYILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
TFY + + I +GG+ L S GG +IDSGT ++ Y +K +F ++
Sbjct: 367 TFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEK 426
Query: 382 F-SGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
+ F +LD CFN+S +E NI P + + F A ++ D
Sbjct: 427 MKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSEDL-- 484
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
VCLA+ + + IIGNYQQ+N ++YDTK S+LGF C+ +
Sbjct: 485 VCLAILG-TPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKCADI 529
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 188/368 (51%), Gaps = 28/368 (7%)
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ LG + + + +DT +D TW C PC +C + +F P+ S SY + C+SS
Sbjct: 78 SYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSS 135
Query: 192 TCHALEFAT------GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
C + G ++ P C + + D S+ + L + L LGK ++ ++
Sbjct: 136 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNY 194
Query: 246 IFGCGRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + G + GL+GLGR ++L+SQ ++ G+FSYCLPS + SGSL L
Sbjct: 195 TFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRL 254
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGIL 356
G ++ + YT M+ NP ++ Y +N+TG+S+G ++ A FA G +
Sbjct: 255 GAGGGQPRS---VRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTV 311
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
+DSGTVITR +Y+AL+ EF +Q + DTCFN P V + +
Sbjct: 312 VDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMD 371
Query: 417 GNAEMTVDVTGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQ 473
G ++ + + + + S A+ + CLA+A + +I N QQ+N RV++D NS+
Sbjct: 372 GGVDLALPMENTL--IHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSR 429
Query: 474 LGFAGEDC 481
+GFA E C
Sbjct: 430 VGFAKESC 437
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 147/430 (34%), Positives = 212/430 (49%), Gaps = 43/430 (10%)
Query: 71 LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
L+H + SGK + E+ Q+ + +Q L + + S ++E L + I
Sbjct: 51 LRHVD--SGKNLTKLERVQHGIKRGKSRLQKLNAMVLAASS-----TPDSEDQLEAPIHA 103
Query: 131 QTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
Y+ + +G ++ ++DTGSDL W QC+PC CY Q P+FDP S S+ KV C
Sbjct: 104 GNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSC 163
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVND 244
SS C AL +T + G C Y SYGD S T+G L E GK+ SV++
Sbjct: 164 GSSLCSALPSSTCSDG---------CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214
Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGCG +N+G F SGL+GLGR LSLVSQ E FSYCL D S++L
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE---QRFSYCLTPIDDTKE--SVLL 269
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGIL 356
G+ K++ + T ++ NP +FY L+L IS+G +L + S F GG++
Sbjct: 270 LGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVI 329
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEF 415
IDSGT IT + Y ALK EF+ Q + LD CF+L S +V IP + F
Sbjct: 330 IDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHF 389
Query: 416 EGNAEMTVDVTGIVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
+G +++ Y + S+ CLA+ + S I GN QQ+N V +D + +
Sbjct: 390 KGG---DLELPAENYMIGDSNLGVACLAMGASS---GMSIFGNVQQQNILVNHDLEKETI 443
Query: 475 GFAGEDCSSM 484
F C +
Sbjct: 444 SFVPTSCDQL 453
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 205/408 (50%), Gaps = 37/408 (9%)
Query: 99 VQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYIATIELGGRNMTVIV--DTGSD 155
V L ++ K G+ + +T +P+ +G + L+T +Y+A LG T++V D +D
Sbjct: 66 VATLAAKPKPKPKGHSR---HTFVPIAAGRQILRTPSYVARARLGTPPQTLLVAIDPSND 122
Query: 156 LTWVQCQPCKSCY-NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
WV C C C P FDP+ S +Y+ V C + C + AT + C + C
Sbjct: 123 AAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQCAQVPPATPS---CPAGPGASC 179
Query: 215 NYFVSYGDGSYTRGELGREHLGL----GKASVND-FIFGCGRNNKGLFGGVS--GLMGLG 267
+ +SY S LG++ L L G A +D + FGC R G G V GL+G G
Sbjct: 180 AFNLSYAS-STLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTGSGGSVPPQGLVGFG 238
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
R LS +SQT +G +FSYCLPS + + SG+L LG + I T ++ NP
Sbjct: 239 RGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPAGQPRR----IKTTPLLSNPHR 294
Query: 328 ATFYILNLTGISIGGK--QLQASGFA------KGGILIDSGTVITRLPPSIYSALKAEFL 379
+ Y + + G+ + GK + AS A +GG ++D+GT+ TRL P Y+AL+ F
Sbjct: 295 PSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFR 354
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+ S P+AP DTC+ ++ + V P V F G A +T+ +V +
Sbjct: 355 RGVSA-PAAPALGGFDTCYYVNGTKSV--PAVAFVFAGGARVTLPEENVV-ISSTSGGVA 410
Query: 440 CLALASLSYEDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CLA+A+ + ++ + QQ+N RV++D N ++GF+ E C+++
Sbjct: 411 CLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELCTAV 458
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 136/376 (36%), Positives = 191/376 (50%), Gaps = 35/376 (9%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ T Y+ + +G + + + +DTGSDL W QCQPC +C++Q P FDPS S +
Sbjct: 30 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 89
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
C+S+ C L A+ S + C Y SYGD S T G L + ASV
Sbjct: 90 CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 147
Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGCG N G+F +G+ G GR LSL SQ G FS+C +T +++L
Sbjct: 148 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLLD 203
Query: 305 GNSSVFKN------STP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA---- 351
+ +F N +TP I Y NP T Y L+L GI++G +L S FA
Sbjct: 204 LPADLFSNGQGAVQTTPLIQYAKNEANP---TLYYLSLKGITVGSTRLPVPESAFALTNG 260
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPL 410
GG +IDSGT IT LPP +Y ++ EF Q P PG + TCF+ + + ++P
Sbjct: 261 TGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGHYTCFSAPSQAKPDVPK 319
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
+ + FEG A M + V+ V DA S +CLA ++ DET IIGN+QQ+N V+YD
Sbjct: 320 LVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---INKGDETTIIGNFQQQNMHVLYD 375
Query: 469 TKNSQLGFAGEDCSSM 484
+N+ L F C +
Sbjct: 376 LQNNMLSFVAAQCDKL 391
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 183/389 (47%), Gaps = 18/389 (4%)
Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQP 163
I +I+G + P+ SG L + Y LG + ++IVD+GSDL WVQC P
Sbjct: 35 ITAVIAGPPSHDYGFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSP 94
Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
C+ CY Q P++ PS S ++ V C SS C + G C P C Y Y D
Sbjct: 95 CRQCYAQDSPLYVPSNSSTFSPVPCLSSDCLLIPATEGFP--CDFRYPGACAYEYLYADT 152
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
S ++G E + ++ FGCG +N+G F G++GLG+ LS SQ +G
Sbjct: 153 SSSKGVFAYESATVDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGN 212
Query: 284 LFSYCLPSTQD-AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
F+YCL + D S SLI G + + YT ++ NP+ T Y + + +++GG
Sbjct: 213 KFAYCLVNYLDPTSVSSSLIFG--DELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGG 270
Query: 343 KQLQASGFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
K L S A GG + DSGT +T PS YS + A F +P A LD
Sbjct: 271 KSLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVH-YPRAESVQGLD 329
Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE-DETGI 454
C L+ + + P +EF+ A + YFV + CLA+A L+
Sbjct: 330 LCVELTGVDQPSFPSFTIEFDDGAVFQPEAEN--YFVDVAPNVRCLAMAGLASPLGGFNT 387
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
IGN Q+N V YD + + +GFA CSS
Sbjct: 388 IGNLLQQNFFVQYDREENLIGFAPAKCSS 416
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 133/365 (36%), Positives = 189/365 (51%), Gaps = 40/365 (10%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ I G + +VIVDTGSDL W QC PC++C +FDP S +Y V C S+
Sbjct: 80 YLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASNF 139
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
C +L F + C++S C Y YGDGS T G L E + +G ++ + FGCG
Sbjct: 140 CSSLPFQS-----CTTS----CKYDYMYGDGSSTSGALSTETVTVGTGTIPNVAFGCGHT 190
Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
N G F G +G++GLG+ LSL+SQ S I FSYCL S LI G+S+
Sbjct: 191 NLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLI--GDSAAAGG 248
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGFAKGGILIDSGTVI 363
+ YT ++ N TFY +LTGIS+ GK + ASG +GG ++DSGT +
Sbjct: 249 ---VAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASG--QGGFILDSGTTL 303
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
T L ++AL A LK FP A G LD CF+ + P + F+G A+
Sbjct: 304 TYLETGAFNALVAA-LKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKG-ADYE 361
Query: 423 VDVTGIVYFVKSD-ASQVCLALASLSYEDETG--IIGNYQQKNQRVIYDTKNSQLGFAGE 479
+ + FV D +CLA+A+ TG I+GN QQ+N +++D N ++GF
Sbjct: 362 LPPENV--FVALDTGGSICLAMAA-----STGFSIMGNIQQQNHLIVHDLVNQRVGFKEA 414
Query: 480 DCSSM 484
+C ++
Sbjct: 415 NCETI 419
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 132/372 (35%), Positives = 178/372 (47%), Gaps = 41/372 (11%)
Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y+ + +G + T +VDTGSDL W QC PC C +Q P F P+ S +Y+ V C S
Sbjct: 91 EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
C AL + C S C Y YGD + T G L E G A+ V+D
Sbjct: 151 LCAALPYPA-----CFQRS--VCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVA 203
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASG 299
FGCG N G SG++GLGR LSLVSQ FSYCL PS + G
Sbjct: 204 FGCGNINSGQLANSSGMVGLGRGPLSLVSQLGP---SRFSYCLTSFLSPEPSRLNFGVFA 260
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------K 352
+L G N+S + +P+ T ++ N L + Y ++L GIS+G K+L
Sbjct: 261 TLN-GTNAS--SSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGT 317
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQE--VNIP 409
GG+ IDSGT +T L Y A++ E + P I L+TCF V +P
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVP 377
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+++ F+G A MTV ++ D + L LA + D T IIGNYQQ+N ++YD
Sbjct: 378 DMELHFDGGANMTVPPEN---YMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILYDI 433
Query: 470 KNSQLGFAGEDC 481
NS L F C
Sbjct: 434 ANSLLSFVPAPC 445
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 185/364 (50%), Gaps = 24/364 (6%)
Query: 134 NYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ LG +++ DT +D TW C PC +C + +F P+ S SY + C+S+
Sbjct: 76 SYVVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGS-LFAPANSTSYAPLPCSST 134
Query: 192 TCHALE--FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
C L+ SS+ P C + + D S+ + L + L LGK ++ ++ FGC
Sbjct: 135 MCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKDAIPNYAFGC 193
Query: 250 GRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
G + GL+GLGR ++L+SQ ++ G+FSYCLPS + SGSL LG
Sbjct: 194 VSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAG 253
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSG 360
+ YT M+ NP ++ Y +N+TG+S+G ++ A FA G ++DSG
Sbjct: 254 QPRG----VRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSG 309
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
TVITR P +Y+AL+ EF + + DTCFN P V + +G +
Sbjct: 310 TVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLD 369
Query: 421 MTVDVTGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
+ + + + + S A+ + CLA+A + ++ N QQ+N RV++D NS++GFA
Sbjct: 370 LALPMENTL--IHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFA 427
Query: 478 GEDC 481
E C
Sbjct: 428 RESC 431
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 132/372 (35%), Positives = 178/372 (47%), Gaps = 41/372 (11%)
Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y+ + +G + T +VDTGSDL W QC PC C +Q P F P+ S +Y+ V C S
Sbjct: 91 EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
C AL + C S C Y YGD + T G L E G A+ V+D
Sbjct: 151 LCAALPYPA-----CFQRS--VCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVA 203
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASG 299
FGCG N G SG++GLGR LSLVSQ FSYCL PS + G
Sbjct: 204 FGCGNINSGQLANSSGMVGLGRGPLSLVSQLGP---SRFSYCLTSFLSPEPSRLNFGVFA 260
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------K 352
+L G N+S + +P+ T ++ N L + Y ++L GIS+G K+L
Sbjct: 261 TLN-GTNAS--SSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGT 317
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQE--VNIP 409
GG+ IDSGT +T L Y A++ E + P I L+TCF V +P
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVP 377
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+++ F+G A MTV ++ D + L LA + D T IIGNYQQ+N ++YD
Sbjct: 378 DMELHFDGGANMTVPPEN---YMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILYDI 433
Query: 470 KNSQLGFAGEDC 481
NS L F C
Sbjct: 434 ANSLLSFVPAPC 445
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 210/433 (48%), Gaps = 39/433 (9%)
Query: 87 QQQNRLILDNLHVQYLQSR------IKNMISGNIKDVSNTEIP-------LTSGIRLQTL 133
Q Q+ + LH ++ +S+ ++ I+ +I V E+ L SG+ L +
Sbjct: 99 QIQDLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLGSG 158
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y + +G ++ ++I+DTGSDL W+QC PC C++Q +DP S S+K + CN
Sbjct: 159 EYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDP 218
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL---------GLGKASV 242
C + ++ + V S C YF YGD S T G+ E G + V
Sbjct: 219 RCSLI--SSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKV 276
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSL 301
+ +FGCG N+GLF G SGL+GLGR LS SQ ++G FSYCL + S L
Sbjct: 277 GNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKL 336
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQLQ-------ASGFAK 352
I G + + N T + +T+ + + + TFY + + I +GGK L S
Sbjct: 337 IFGEDKDLL-NHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGD 395
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
GG +IDSGT ++ Y +K +F ++ +P F +LD CFN+S +E NI L
Sbjct: 396 GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLP 455
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
++ + F+ VCLA+ + IIGNYQQ+N ++YDTK
Sbjct: 456 ELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTP-KSTFSIIGNYQQQNFHILYDTKR 514
Query: 472 SQLGFAGEDCSSM 484
S+LGF C+ +
Sbjct: 515 SRLGFTPTKCADI 527
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 141/476 (29%), Positives = 228/476 (47%), Gaps = 73/476 (15%)
Query: 66 AITLELKHKNYCSG-----KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG-------- 112
++ L LKH++ G ++D + R+ NLH + +++R +N IS
Sbjct: 100 SVKLHLKHRSGSKGAEPKNSVIDSTVRDLTRI--QNLHRRVIENRNQNTISRLQRLQKEQ 157
Query: 113 ---NIKDV----SNTEIP--------LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSD 155
+ K V +++ P L SG+ L + Y + +G ++ ++I+DTGSD
Sbjct: 158 PKQSFKPVFAPAASSTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSD 217
Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-- 213
L W+QC PC +C+ Q P +DP S S++ + C+ C + SS PP+
Sbjct: 218 LNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQ----------LVSSPDPPNPC 267
Query: 214 ------CNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKGLFG 258
C YF YGDGS T G+ E + GK+ V + +FGCG N+GLF
Sbjct: 268 KAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFH 327
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPIT 317
G +GL+GLG+ LS SQ ++G FSYCL +A S LI G + + + +
Sbjct: 328 GAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPN-LN 386
Query: 318 YTNM--IPNPQLATFYILNLTGISIGGKQLQA-------SGFAKGGILIDSGTVITRLPP 368
+T+ + + TFY + + + + + L+ S GG +IDSGT +T
Sbjct: 387 FTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAE 446
Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
Y +K F+++ G+ G L C+N+S +++ +P + F A V
Sbjct: 447 PAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVEN- 505
Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
YF++ D VCLA+ + IIGNYQQ+N ++YD K S+LG+A C+ +
Sbjct: 506 -YFIQIDPDVVCLAILG-NPRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 559
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 183/348 (52%), Gaps = 24/348 (6%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+ DTGSDLTW QCQPCK C+ Q PV+DPS S ++ + C+S+TC L + N C+
Sbjct: 86 ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSATC--LPIWSRN---CT 140
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVNDFIFGCGRNNKGLFGGVSGL 263
SS C Y +YGDG+Y+ G LG E L LG + SV FGCG +N G +G
Sbjct: 141 PSS--LCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGT 198
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
+GLGR LSL++Q G FSYCL ++ +LG + + + + T ++
Sbjct: 199 VGLGRGTLSLLAQLGV---GKFSYCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQ 255
Query: 324 NPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKA 376
+PQ + Y ++L GIS+G +L G GG+++DSGT T L S + +
Sbjct: 256 SPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVG 315
Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
+ G P S+ CF A + +P + + F G A+M + + + + D+
Sbjct: 316 RVARVL-GQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDS 374
Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
S CL +A + E T ++GN+QQ+N ++++DT QL F DCS +
Sbjct: 375 S-FCLNIAGTTPE-STSVLGNFQQQNIQMLFDTTVGQLSFLPTDCSKL 420
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 180/348 (51%), Gaps = 39/348 (11%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
+ +I+DTGSD TW+QC C +C+N++ F+PS+S SY C ST
Sbjct: 140 QKFNLIIDTGSDTTWIQCNSCSLGNCHNKK--TFNPSLSSSYSNRSCIPST--------- 188
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
D NY + Y D SY++G + + L F FGCG + G FG S
Sbjct: 189 -----------DTNYTMKYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGEFGTAS 237
Query: 262 GLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
G++GL + + SL+SQT+ F FSYC P + GSL+ G S + +T
Sbjct: 238 GVLGLAKGEQYSLISQTASKFKKKFSYCFPPKEH--TLGSLLFG--EKAISASPSLKFTQ 293
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
++ NP Y + L GIS+ K+L S FA G +IDSGTVITRLP + Y AL+ F
Sbjct: 294 LL-NPPSGLGYFVELIGISVAKKRLNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAF 352
Query: 379 LKQFSGFPS---APGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
++ PS P +LDTC+NL + + +P + + F G ++++ +GI++
Sbjct: 353 QQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILW-AN 411
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
D +Q CLA A S IIGN QQ + +V+YD + +LGF G DC
Sbjct: 412 GDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGF-GNDC 458
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 134/420 (31%), Positives = 190/420 (45%), Gaps = 84/420 (20%)
Query: 65 GAITLELKHK-NYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKN---MISGNIKDV 117
G ++ L H+ CS + E++ + L D L Y++ + +G
Sbjct: 29 GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 88
Query: 118 SNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS---CYNQQD 172
S +P T G L TL Y+ ++ LG +T V++DTGSD++WVQC+PC + C+
Sbjct: 89 SKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG 148
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
+FDP+ S +Y C+++ C L +G + C + S C Y V YGDGS T G
Sbjct: 149 ALFDPAASSTYAAFNCSAAACAQLG-DSGEANGCDAKS--RCQYIVKYGDGSNTTG---- 201
Query: 233 EHLGLGKASVNDFIFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
F FGC G+ GL+GLG SLVSQT+
Sbjct: 202 ----------TGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTA------------ 239
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QAS 348
+ +P T+Y L I++GGK+L S
Sbjct: 240 --------------------------ARSKKVP-----TYYFAALEDIAVGGKKLGLSPS 268
Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI 408
FA G L+DSGTVITRLPP+ Y+AL + F + + A ILDTCFN + +V+I
Sbjct: 269 VFAAGS-LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSI 327
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
P V + F G A + +D GIV S CLA A + G IGN QQ+ V+YD
Sbjct: 328 PTVALVFAGGAVVDLDAHGIV-------SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 134/377 (35%), Positives = 192/377 (50%), Gaps = 38/377 (10%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ T Y+ + +G + + + +DTGSDL W QC+PC SC++Q P FD S S + +
Sbjct: 30 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLP 89
Query: 188 CNSSTCHALEFATGNSGVCS--SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVND 244
C S+ C T VC + + C Y+ SYGD S T G L + + S+
Sbjct: 90 CESTQCKLDPTVT----VCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAGTSLPG 145
Query: 245 FIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGCG NN G+F +G+ G GR LSL SQ G FS+C +T +++L
Sbjct: 146 VTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLL 201
Query: 304 GGNSSVFKN------STP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA--- 351
+ +F N +TP I Y NP T Y L+L GI++G +L S FA
Sbjct: 202 DLPADLFSNGQGAVQTTPLIQYAKNEANP---TLYYLSLKGITVGSTRLPVPESAFALTN 258
Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIP 409
GG +IDSGT IT LPP +Y ++ EF Q P PG + TCF+ + + ++P
Sbjct: 259 GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGHYTCFSAPSQAKPDVP 317
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+ + FEG A M + V+ V DA S +CLA ++ DET IIGN+QQ+N V+Y
Sbjct: 318 KLVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---INKGDETTIIGNFQQQNMHVLY 373
Query: 468 DTKNSQLGFAGEDCSSM 484
D +N+ L F C +
Sbjct: 374 DLQNNMLSFVAAQCDKL 390
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 136/440 (30%), Positives = 204/440 (46%), Gaps = 64/440 (14%)
Query: 94 LDNLHVQYLQSRIKNMIS----GNIKDVSNTEIPLTSGIRLQTLNYIATIELG------- 142
+ LH + L+ +N +S N K+V T P+ S + Q +AT+E G
Sbjct: 111 IQTLHKRVLEKNNQNTVSQKQKKNDKEVVTT-TPVASSVEEQAGQLVATLESGMTLGSGE 169
Query: 143 ----------GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
++ ++I+DTGSDL W+QC PC C+ Q +DP S SYK + CN
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQR 229
Query: 193 CHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGREHLGLGKAS--- 241
C+ + SS PP C Y+ YGD S T G+ E + +
Sbjct: 230 CN----------LVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGG 279
Query: 242 ------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQD 294
V + +FGCG N+GLF G +GL+GLGR LS SQ ++G FSYCL D
Sbjct: 280 SSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSD 339
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQL------- 345
S LI G + + + + +T+ + + + TFY + + I + G+ L
Sbjct: 340 TNVSSKLIFGEDKDLLSHPN-LNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETW 398
Query: 346 QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQ 404
S GG +IDSGT ++ Y +K + ++ G +P F ILD CFN+S
Sbjct: 399 NISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIH 458
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
V +P + + F A ++ D VCLA+ + IIGNYQQ+N
Sbjct: 459 NVQLPELGIAFADGAVWNFPTENSFIWLNEDL--VCLAMLGTP-KSAFSIIGNYQQQNFH 515
Query: 465 VIYDTKNSQLGFAGEDCSSM 484
++YDTK S+LG+A C+ +
Sbjct: 516 ILYDTKRSRLGYAPTKCADI 535
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 176/344 (51%), Gaps = 31/344 (9%)
Query: 149 IVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
++DTGSD+TW+QC PC CY Q P+FDP +S SY V C+S C L+ A N
Sbjct: 13 VLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDEAGCNVN- 71
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLM 264
C Y V YGDGS+T GEL E L + S+ + GCG +N+GLF G GL+
Sbjct: 72 -------SCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADGLI 124
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
GLG +S+ SQ + FSYCL D + L N+ +S + ++ N
Sbjct: 125 GLGGGAISISSQ---LKASSFSYCL---VDIDSPSFSTLDFNTDPPSDSL---ISPLVKN 175
Query: 325 PQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAE 377
+ +F + + G+S+GGK L S GGI++DSGT IT+LP +Y L+
Sbjct: 176 DRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREA 235
Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
FL + P AP S DTC++LS+ V +P + G + + + V S A
Sbjct: 236 FLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDS-AG 294
Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA S ++ IIGN+QQ+ RV YD NS +GF+ C
Sbjct: 295 TFCLAFVSATF--PLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 126/373 (33%), Positives = 183/373 (49%), Gaps = 36/373 (9%)
Query: 134 NYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y+A I +G + ++ DT SDLTW+QCQPC+ CY Q PVFDP S SY ++ ++
Sbjct: 133 EYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAP 192
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDG----SYTRGELGREHLGLGKASVNDFI- 246
C AL + G + C Y V YGDG S + G+L E L ++
Sbjct: 193 DCQALGRSGGGDAKRGT-----CIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLS 247
Query: 247 FGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPSTQDAGASGSLILG 304
GCG +NKGLFG +G++GLGR +S+ Q + + + FSYCL S S L
Sbjct: 248 IGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLT 307
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG--------KQLQASGF-AKGGI 355
+ S P ++T + N + TFY + L G+S+GG + LQ + +GG+
Sbjct: 308 FGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGV 367
Query: 356 LIDSGTVITRLPPSIY------SALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNI 408
++DSGT +TRL Y A L Q S G PS + DTC+ + V +
Sbjct: 368 ILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSG----LFDTCYTVGGRAGVKV 423
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
P V M F G E+++ + V S + VC A A + +IGN Q+ RV+YD
Sbjct: 424 PAVSMHFAGGVEVSLQPKNYLIPVDSRGT-VCFAFAGTG-DRSVSVIGNILQQGFRVVYD 481
Query: 469 TKNSQLGFAGEDC 481
++GFA +C
Sbjct: 482 LAGQRVGFAPNNC 494
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 194/384 (50%), Gaps = 34/384 (8%)
Query: 115 KDVSNTEIPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ 171
K+ +N +P+ G ++ ++ NYIA LG + + V +D +D WV C C C
Sbjct: 81 KNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-AS 139
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
P F P+ S +Y+ V C S C + + +GV SS C + ++Y ++ + LG
Sbjct: 140 SPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSS-----CGFNLTYAASTF-QAVLG 193
Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
++ L L V + FGC R G GL+G GR LS +SQT + +G +FSYCLP+
Sbjct: 194 QDSLALENNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPN 253
Query: 292 TQDAGASGSLILG--GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG 349
+ + SG+L LG G K +TP+ Y NP + Y +N+ GI +G K +Q
Sbjct: 254 YRSSNFSGTLKLGPIGQPKRIK-TTPLLY-----NPHRPSLYYVNMIGIRVGSKVVQVPQ 307
Query: 350 FAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
A G +ID+GT+ TRL +Y+A++ F + P AP DTC+N++
Sbjct: 308 SALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYNVT- 365
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGIIGNYQ 459
V++P V F G +T+ ++ S CLA+A S ++ + Q
Sbjct: 366 ---VSVPTVTFMFAGAVAVTLPEENVMIH-SSSGGVACLAMAAGPSDGVNAALNVLASMQ 421
Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
Q+NQRV++D N ++GF+ E C++
Sbjct: 422 QQNQRVLFDVANGRVGFSRELCTA 445
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 194/384 (50%), Gaps = 34/384 (8%)
Query: 115 KDVSNTEIPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ 171
K+ +N +P+ G ++ ++ NYIA LG + + V +D +D WV C C C
Sbjct: 62 KNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-AS 120
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
P F P+ S +Y+ V C S C + + +GV SS C + ++Y ++ + LG
Sbjct: 121 SPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSS-----CGFNLTYAASTF-QAVLG 174
Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
++ L L V + FGC R G GL+G GR LS +SQT + +G +FSYCLP+
Sbjct: 175 QDSLALENNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPN 234
Query: 292 TQDAGASGSLILG--GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG 349
+ + SG+L LG G K +TP+ Y NP + Y +N+ GI +G K +Q
Sbjct: 235 YRSSNFSGTLKLGPIGQPKRIK-TTPLLY-----NPHRPSLYYVNMIGIRVGSKVVQVPQ 288
Query: 350 FAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
A G +ID+GT+ TRL +Y+A++ F + P AP DTC+N++
Sbjct: 289 SALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYNVT- 346
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGIIGNYQ 459
V++P V F G +T+ ++ S CLA+A S ++ + Q
Sbjct: 347 ---VSVPTVTFMFAGAVAVTLPEENVMIH-SSSGGVACLAMAAGPSDGVNAALNVLASMQ 402
Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
Q+NQRV++D N ++GF+ E C++
Sbjct: 403 QQNQRVLFDVANGRVGFSRELCTA 426
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 140/435 (32%), Positives = 205/435 (47%), Gaps = 48/435 (11%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTS 126
T+EL H++ S K +N + H + ++ IS N V+NT E P+ +
Sbjct: 31 TVELIHRD--SPKSPMYNPLEN--------HYHRVADTLRRSISHNTGLVTNTVEAPIYN 80
Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
Y+ + +G +I DTGSD+ W QC+PC +CY Q P+F+PS S +Y+
Sbjct: 81 ----NRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYR 136
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
KV C+S C TG CS PDC Y +SYGD S+++G+ + L +G S
Sbjct: 137 KVSCSSPVCS----FTGEDNSCSFK--PDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190
Query: 245 FIF-----GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGA 297
F GCG +N G F VSG++GLG SL+ Q GG FSYCL P D G
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGG 250
Query: 298 SGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---- 350
S L G N++V + STPI ++ + +FY L L +S+G S
Sbjct: 251 SNKLNFGSNANVSGSGAVSTPIYISD-----KFKSFYSLKLKAVSVGRNNTFYSTANSIL 305
Query: 351 -AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
K I+IDSGT +T LP +Y + + L+ CF + + +P
Sbjct: 306 GGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTT-DDYKVP 364
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ M FEG A + + ++ V + +CLA A + +++ I GN Q N V YD
Sbjct: 365 FIAMHFEG-ANLRLQRENVLIRVSDNV--ICLAFAG-AQDNDISIYGNIAQINFLVGYDV 420
Query: 470 KNSQLGFAGEDCSSM 484
N L F +C +M
Sbjct: 421 TNMSLSFKPMNCVAM 435
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 144/403 (35%), Positives = 202/403 (50%), Gaps = 43/403 (10%)
Query: 105 RIKNMISGNIKDV------SNT---EIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTG 153
R++N I ++ V NT +I LTS + Y+ + +G + I DTG
Sbjct: 55 RLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSN----SGEYLMNVSIGTPPFPIMAIADTG 110
Query: 154 SDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD 213
SDL W QC PC CY Q DP+FDP S +YK V C+SS C ALE N CS++
Sbjct: 111 SDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALE----NQASCSTND-NT 165
Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFG-GVSGLMGLG 267
C+Y +SYGD SYT+G + + L LG + + + I GCG NN G F SG++GLG
Sbjct: 166 CSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLG 225
Query: 268 RSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
+SL+ Q + G FSYCL P T + + G N+ V + + + T +I
Sbjct: 226 GGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV--SGSGVVSTPLIAKAS 283
Query: 327 LATFYILNLTGISIGGKQLQ----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
TFY L L IS+G KQ+Q S ++G I+IDSGT +T LP YS L+
Sbjct: 284 QETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSI 343
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
S L C+ SA ++ +P++ M F+G A++ +D + FV+ VC A
Sbjct: 344 DAEKKQDPQSGLSLCY--SATGDLKVPVITMHFDG-ADVKLDSSNA--FVQVSEDLVCFA 398
Query: 443 L-ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
S S+ I GN Q N V YDT + + F DC+ M
Sbjct: 399 FRGSPSF----SIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 139/433 (32%), Positives = 195/433 (45%), Gaps = 56/433 (12%)
Query: 87 QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--R 144
QQQN L N V L+S K SGNI L SG L T Y + +G +
Sbjct: 132 QQQNNLA--NAFVASLESS-KGEFSGNIMAT------LESGASLGTGEYFLDMFVGTPPK 182
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
++ +I+DTGSDL+W+QC PC C+ Q + P S +Y+ + C C
Sbjct: 183 HVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQ---------- 232
Query: 205 VCSSSSP--------PDCNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIF 247
+ SSS P C YF Y DGS T G+ E + GK V D +F
Sbjct: 233 LVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMF 292
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGASGSLILGGN 306
GCG NKG F G SGL+GLGR +S SQ I+G FSYCL + S LI G +
Sbjct: 293 GCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLIFGED 352
Query: 307 SSVFKNSTPITYTNMIPNPQL--ATFYILNLTGISIGGKQLQAS------------GFAK 352
+ N + +T ++ + TFY L + I +GG+ L S A
Sbjct: 353 KELLNNHN-LNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADAG 411
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS-AYQEVNIPLV 411
GG +IDSG+ +T P S Y +K F K+ A ++ C+N+S A +V +P
Sbjct: 412 GGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPDF 471
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ F Y + D +CLA+ IIGN Q+N ++YD K
Sbjct: 472 GIHFADGGVWNFPAENYFYQYEPDEV-ICLAIMKTPNHSHLTIIGNLLQQNFHILYDVKR 530
Query: 472 SQLGFAGEDCSSM 484
S+LG++ C+ +
Sbjct: 531 SRLGYSPRRCAEV 543
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 186/364 (51%), Gaps = 30/364 (8%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ + +G + I DTGSDL W QC PC CY Q DP+FDP S +YK V C+SS
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIF 247
C ALE N CS++ C+Y +SYGD SYT+G + + L LG + + + I
Sbjct: 150 CTALE----NQASCSTND-NTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 248 GCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
GCG NN G F SG++GLG +SL+ Q + G FSYCL P T + + G
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGT 264
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAKGGILIDSGT 361
N+ V + + + T +I TFY L L IS+G KQ+Q S ++G I+IDSGT
Sbjct: 265 NAIV--SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGT 322
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
+T LP YS L+ S L C+ SA ++ +P++ M F+G A++
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY--SATGDLKVPVITMHFDG-ADV 379
Query: 422 TVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+D + FV+ VC A S S+ I GN Q N V YDT + + F D
Sbjct: 380 KLDSSNA--FVQVSEDLVCFAFRGSPSF----SIYGNVAQMNFLVGYDTVSKTVSFKPTD 433
Query: 481 CSSM 484
C+ M
Sbjct: 434 CAKM 437
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 188/384 (48%), Gaps = 29/384 (7%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+ SG+ + + Y+ + +G R +I+DTGSDL W+QC PC C++Q PVFDP+ S
Sbjct: 140 VESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASS 199
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL---- 237
SY+ V C C L C C Y+ YGD S T G+L E +
Sbjct: 200 SYRNVTCGDQRC-GLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 258
Query: 238 --GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
V+D +FGCG N+GLF G +GL+GLGR LS SQ ++G FSYCL
Sbjct: 259 PGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSD 318
Query: 296 GASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG---- 349
AS + ++ + P + YT P A TFY + L G+ +GG+ L S
Sbjct: 319 VASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWG 378
Query: 350 -----FAKGGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAY 403
GG +IDSGT ++ Y ++ F+ + +P P F +L C+N+S
Sbjct: 379 VGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGV 438
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETG--IIGNYQQ 460
+P + + F A D YF++ D + CLA+ TG IIGN+QQ
Sbjct: 439 DRPEVPELSLLFADGA--VWDFPAENYFIRLDPDGIMCLAVLGTP---RTGMSIIGNFQQ 493
Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
+N V+YD KN++LGFA C+ +
Sbjct: 494 QNFHVVYDLKNNRLGFAPRRCAEV 517
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 134/442 (30%), Positives = 203/442 (45%), Gaps = 69/442 (15%)
Query: 94 LDNLHVQYLQSRIKNMISGNIKDVSNTEI---PLTSGIRLQTLNYIATIELG-------- 142
+ LH + L + +N +S K N E+ P+ S + Q +AT+E G
Sbjct: 97 IQTLHKRVLAKKNQNTVSQKQKK-KNKEVVTTPVASSVEEQAGQLVATLESGMTLGSGEY 155
Query: 143 ---------GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
++ ++I+DTGSDL W+QC PC C+ Q +DP S SYK + CN C
Sbjct: 156 FMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPRC 215
Query: 194 HALEFATGNSGVCSSSSPPD-----------CNYFVSYGDGSYTRGELGREHLGLGKAS- 241
+ + SPPD C Y+ YGD S T G+ E + +
Sbjct: 216 NLV-------------SPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTS 262
Query: 242 --------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PST 292
V + +FGCG N+GLF G +GL+GLGR LS SQ ++G FSYCL
Sbjct: 263 GGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN 322
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQL----- 345
D S LI G + + + + +T+ + + + TFY + + I + G+ L
Sbjct: 323 SDTNVSSKLIFGEDKDLLSHPN-LNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEE 381
Query: 346 --QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSA 402
S GG +IDSGT ++ Y +K + ++ G +P F ILD CFN+S
Sbjct: 382 TWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSG 441
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
+ +P + + F A ++ D VCLA+ + IIGNYQQ+N
Sbjct: 442 IDSIQLPELGIAFADGAVWNFPTENSFIWLNEDL--VCLAILGTP-KSAFSIIGNYQQQN 498
Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
++YDTK S+LG+A C+ +
Sbjct: 499 FHILYDTKRSRLGYAPTKCADI 520
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 133/371 (35%), Positives = 191/371 (51%), Gaps = 46/371 (12%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ I +G + + I DTGSDL W QC PC+ CY Q P+FDP S +Y+KV C+SS
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIF 247
C ALE A+ CS+ C+Y ++YGD SYT+G++ + + +G + S+ + I
Sbjct: 146 CRALEDAS-----CSTDENT-CSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMII 199
Query: 248 GCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
GCG N G F SG++GLG SLVSQ + G FSYCL P T + G + + G
Sbjct: 200 GCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGT 259
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAKGGILIDSGT 361
N V + + T+M+ AT+Y LNL IS+G K++Q + G +G I+IDSGT
Sbjct: 260 NGIVSGDG--VVSTSMV-KKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGT 316
Query: 362 VITRLPPSIY--------SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
+T LP + Y S +KAE ++ G IL C+ S+ +P + +
Sbjct: 317 TLTLLPSNFYYELESVVASTIKAERVQDPDG--------ILSLCYRDSS--SFKVPDITV 366
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F+G V + + FV C A A+ ++ I GN Q N V YDT +
Sbjct: 367 HFKGG---DVKLGNLNTFVAVSEDVSCFAFAA---NEQLTIFGNLAQMNFLVGYDTVSGT 420
Query: 474 LGFAGEDCSSM 484
+ F DCS M
Sbjct: 421 VSFKKTDCSQM 431
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 128/390 (32%), Positives = 194/390 (49%), Gaps = 38/390 (9%)
Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
R I+ ++ S E P+ +G + Y+ + +G +++ I+DTGSDL W QC+
Sbjct: 70 RRMRSINAMLQSSSGIETPVYAG----SGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE 125
Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
PC C++Q P+F+P S S+ + C S C L S S DC Y YGD
Sbjct: 126 PCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLP---------SESCYNDCQYTYGYGD 176
Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
GS T+G + E +SV + FGCG +N+G G +GL+G+G LSL SQ +
Sbjct: 177 GSSTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ---LG 233
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G FSYC+ ++ + + +L LG +S +P T +I + T+Y + L GI++G
Sbjct: 234 VGQFSYCM-TSSGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTYYYITLQGITVG 290
Query: 342 GK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
G QLQ G GG++IDSGT +T LP Y+A+ F Q + P S
Sbjct: 291 GDNLGIPSSTFQLQDDG--TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSS 348
Query: 393 ILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
L TCF L S V +P + M+F+G +++ + +CLA+ S S +
Sbjct: 349 GLSTCFQLPSDGSTVQVPEISMQFDGGV---LNLGEENVLISPAEGVICLAMGS-SSQQG 404
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I GN QQ+ +V+YD +N + F C
Sbjct: 405 ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 140/435 (32%), Positives = 204/435 (46%), Gaps = 48/435 (11%)
Query: 68 TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTS 126
T+EL H++ S K +N + H + ++ IS N V+NT E P+ +
Sbjct: 31 TVELIHRD--SPKSPMYNPLEN--------HYHRVADTLRRSISHNTGLVTNTVEAPIYN 80
Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
Y+ + +G +I DTGSD+ W QC PC +CY Q P+F+PS S +Y+
Sbjct: 81 ----NRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYR 136
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
KV C+S C TG CS PDC Y +SYGD S+++G+ + L +G S
Sbjct: 137 KVSCSSPVCS----FTGEDNSCSFK--PDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190
Query: 245 FIF-----GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGA 297
F GCG +N G F VSG++GLG SL+ Q GG FSYCL P D G
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGG 250
Query: 298 SGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---- 350
S L G N++V + STPI ++ + +FY L L +S+G S
Sbjct: 251 SNKLNFGSNANVSGSGAVSTPIYISD-----KFKSFYSLKLKAVSVGRNNTFYSTANSIL 305
Query: 351 -AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
K I+IDSGT +T LP +Y + + L+ CF + + +P
Sbjct: 306 GGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTT-DDYKVP 364
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ M FEG A + + ++ V + +CLA A + +++ I GN Q N V YD
Sbjct: 365 FIAMHFEG-ANLRLQRENVLIRVSDNV--ICLAFAG-AQDNDISIYGNIAQINFLVGYDV 420
Query: 470 KNSQLGFAGEDCSSM 484
N L F +C +M
Sbjct: 421 TNMSLSFKPMNCVAM 435
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 139/443 (31%), Positives = 212/443 (47%), Gaps = 48/443 (10%)
Query: 56 SHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK 115
SH S I + A H + S ++D + D+ YL S +++G K
Sbjct: 37 SHDLSIIPINAKCSPFAHT-HVSASVIDTVLHMASS---DSHRFTYLSS----LVAGKSK 88
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
T +P+ SG +L NY+ LG + M +++DT +D W+ C C C N
Sbjct: 89 P---TSVPVASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 145
Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE 233
S S +Y V C+++ C A G + S+ P C++ SYG S L ++
Sbjct: 146 FNTNSSS-TYSTVSCSTTQCTQ---ARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQD 201
Query: 234 HLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
L L + +F FGC + G GLMGLGR +SLVSQT+ ++ G+FSYCLPS +
Sbjct: 202 TLTLSPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFR 261
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF--- 350
SGSL LG + I YT ++ NP+ + Y +NLTG+S+G Q+
Sbjct: 262 SFYFSGSLKLG----LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLT 317
Query: 351 ----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL---DTCF---NL 400
+ G +IDSGTVITR +Y A++ EF KQ +G FS L DTCF N
Sbjct: 318 FDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNG-----SFSTLGAFDTCFSADNE 372
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET--GIIGNY 458
+ ++ + + ++ + E T+ S + CL++A + +I N
Sbjct: 373 NVTPKITLHMTSLDLKLPMENTL-------IHSSAGTLTCLSMAGIRQNANAVLNVIANL 425
Query: 459 QQKNQRVIYDTKNSQLGFAGEDC 481
QQ+N R+++D NS++G A E C
Sbjct: 426 QQQNLRILFDVPNSRIGIAPEPC 448
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 131/358 (36%), Positives = 186/358 (51%), Gaps = 45/358 (12%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
N+ I DTGSDLTW QC PC+ C+NQ P+F+P S SY+KV C S TC +LE
Sbjct: 102 NVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLE------- 154
Query: 205 VCSSSSPPD---CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
S PD C+Y SYGD S+T G+L + + +G + + GCG N G FGGV+
Sbjct: 155 --SYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVT 212
Query: 262 -GLMGLGRSDLSLVSQTSEIFG--GLFSYCLPS-TQDAGASGSLILGGNSSVFKNSTPIT 317
G++GLG LSLVSQ I G FSYCLP+ +A +G++ G + V + +
Sbjct: 213 SGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVV--SGRQVV 270
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFA----KGGILIDSGTVITRLPPSIY- 371
T ++P TFY L L IS+G K+ +A+ G + G I+IDSGT +T LP S+Y
Sbjct: 271 STPLVPR-SPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYY 329
Query: 372 -------SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
+KA+ + SG IL+ C++ ++NIP++ F G A+ V
Sbjct: 330 GVFSTLARVIKAKRVDDPSG--------ILELCYSAGQVDDLNIPIITAHFAGGAD--VK 379
Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ + F + CL A + + I GN Q N V YD N +L F + C+
Sbjct: 380 LLPVNTFAPVADNVTCLTFAPAT---QVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 151/449 (33%), Positives = 213/449 (47%), Gaps = 44/449 (9%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
SS +S+ ++ ++G T +L H++ + E R I + +H + +R+ +
Sbjct: 16 SSHILSNVNAKPKLG-FTTDLIHRDSPKSPFYNPAETPSQR-IRNAIHRSF--NRVSHFT 71
Query: 111 SGNIKDVS----NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC 164
+ D S T+I G Y+ + LG + + DTGS+L W QC+PC
Sbjct: 72 DLSEMDASLNSPQTDITPCGG------EYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPC 125
Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
CY Q DP+FDP S +YK V C+SS C ALE N CS+ C+Y VSY DGS
Sbjct: 126 DDCYTQVDPLFDPKASSTYKDVSCSSSQCTALE----NQASCSTED-KTCSYLVSYADGS 180
Query: 225 YTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTS 278
YT G+ + L LG + + I GCG+NN F SG++GLG +SL+ Q
Sbjct: 181 YTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLG 240
Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
+ G FSYCL D + + G N+ V S P T + + TFY L L I
Sbjct: 241 DSIDGKFSYCLVPEND--QTSKINFGTNAVV---SGPGTVSTPLVVKSRDTFYYLTLKSI 295
Query: 339 SIGGKQLQA-SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
S+G K +Q KG ++IDSGT +T LP Y ++ + S C
Sbjct: 296 SVGSKNMQTPDSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLC 355
Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY--FVKSDASQVCLALASLSYEDETGII 455
+N +A ++NIP++ M FEG DV Y F K VCLA Y + GI
Sbjct: 356 YNATA--DLNIPVITMHFEG-----ADVKLYPYNSFFKVTEDLVCLAFGMSFYRN--GIY 406
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
GN QKN V YDT + + F DC+ M
Sbjct: 407 GNVAQKNFLVGYDTASKTMSFKPTDCAKM 435
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 178/362 (49%), Gaps = 32/362 (8%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + I DTGSDL W QC+PC+ CY Q DP+FDP S +Y+ C++
Sbjct: 95 YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQ 154
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
C L+ +T + + C Y SYGD SYT G + + + L S +
Sbjct: 155 CSLLDQSTCSGNI--------CQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVI 206
Query: 248 GCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
GCG N G F SG++GLG LSL+SQ GG FSYCL P + AG S L G
Sbjct: 207 GCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGS 266
Query: 306 NSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAKGGILIDSG 360
N+ V S P + T ++ + +++FY L L +S+G ++++ + G +G I+IDSG
Sbjct: 267 NAVV---SGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSG 323
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T +T +P +S L Q G + L C+ SA ++ +P + F G
Sbjct: 324 TTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCY--SATSDLKVPAITAHFTG--- 378
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
V + I FV+ VCLA AS + I GN Q N V Y+ + L F D
Sbjct: 379 ADVKLKPINTFVQVSDDVVCLAFASTT--SGISIYGNVAQMNFLVEYNIQGKSLSFKPTD 436
Query: 481 CS 482
C+
Sbjct: 437 CT 438
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 141/463 (30%), Positives = 212/463 (45%), Gaps = 50/463 (10%)
Query: 48 SGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQ---------NRLILDNLH 98
SG S + +SH S A + + W+E + N +D+
Sbjct: 65 SGGSWAPLSHLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSGNAAPMDDAG 124
Query: 99 VQYLQS-RIKNMISGNIK-DVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDL 156
+ QS ++ + + N+ S+T+ GI +L G +++VDT SD+
Sbjct: 125 EETPQSTQVTSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQSMVVDTASDV 184
Query: 157 TWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNSGVCSSSSPPD 213
WVQC PC CY Q D ++DP+ S C+S C +L +A G +G ++ +
Sbjct: 185 PWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAGNTGT--- 241
Query: 214 CNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGR--------NNKGLFGGVSG 262
C Y V Y DGS T G + L L K +V+ F FGC NNK +G
Sbjct: 242 CQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNK-----TAG 296
Query: 263 LMGLGRSDLSLVSQTSEIF--GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
M LGR SL SQT F G +FSYCLP T SL + +++ TP+ +
Sbjct: 297 FMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPMLKSK 356
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
M P Y++ L GI + G++L FA + DS T+ITRLPP+ Y AL+A F
Sbjct: 357 MAP-----MIYMVRLIGIDVAGQRLPVPPAVFAANAAM-DSRTIITRLPPTAYMALRAAF 410
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
Q + + LDTC++ + V +P V + F+ NA + +D +G++
Sbjct: 411 RAQMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML-------D 463
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + + GIIGN QQ+ V+Y+ + +GF C
Sbjct: 464 SCLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 140/460 (30%), Positives = 215/460 (46%), Gaps = 51/460 (11%)
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLT 125
++ L L H+ G+ + E + D + ++ + R G + S+ L+
Sbjct: 76 SLKLRLNHRAAEGGRTRE--ESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPRRALS 133
Query: 126 --------SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVF 175
SG+ + + Y+ + +G R +I+DTGSDL W+QC PC C+ Q+ PVF
Sbjct: 134 ERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVF 193
Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSG----VCSSSSPPDCNYFVSYGDGSYTRGELG 231
DP+ S SY+ V C C + C C Y+ YGD S T G+L
Sbjct: 194 DPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLA 253
Query: 232 REHLGL------GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
E + V+ +FGCG N+GLF G +GL+GLGR LS SQ ++G F
Sbjct: 254 LESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTF 313
Query: 286 SYCLPSTQDAGAS-GSLILGG---NSSVFKNSTPITYTNMIPNPQLA----TFYILNLTG 337
SYCL D G+ GS ++ G ++ + YT P + TFY + L G
Sbjct: 314 SYCL---VDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKG 370
Query: 338 ISIGGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAP 389
+ +GG+ L S GG +IDSGT ++ Y ++ F+ + S +P P
Sbjct: 371 VLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVP 430
Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD---ASQVCLALASL 446
F +L C+N+S + +P + + F A D YF++ D S +CLA+
Sbjct: 431 EFPVLSPCYNVSGVERPEVPELSLLFADGA--VWDFPAENYFIRLDPDGGSIMCLAVLGT 488
Query: 447 SYEDETG--IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
TG IIGN+QQ+N V+YD +N++LGFA C+ +
Sbjct: 489 P---RTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 525
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 139/398 (34%), Positives = 209/398 (52%), Gaps = 41/398 (10%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQ 160
Q R++ + ++ +V E P+ +G ++ + +G +++ I+DTGSDLTW Q
Sbjct: 88 QDRLEKL-QMSVDEVKAVEAPVYAG----NGEFLMKMAIGTPSLSFSAILDTGSDLTWTQ 142
Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
C+PC CY Q P++DPS S +Y KV C+SS C AL + CS + +C Y SY
Sbjct: 143 CKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQALPMYS-----CSGA---NCEYLYSY 194
Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK-GLFGGVSGLMGLGRSDLSLVSQTSE 279
GD S T+G L E L S+ FGCG+ N+ G F GL+G GR LSL+SQ +
Sbjct: 195 GDQSSTQGILSYESFTLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQ 254
Query: 280 IFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
G FSYCL S D+ + S L +G +S+ N+ ++ T ++ + TFY L+L GI
Sbjct: 255 SLGNKFSYCLVSITDSPSKTSPLFIGKTASL--NAKTVSSTPLVQSRSRPTFYYLSLEGI 312
Query: 339 SIGGK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
S+GG+ LQ G GG++IDSGT +T L S Y +K + + P
Sbjct: 313 SVGGQLLDIADGTFDLQLDG--TGGVIIDSGTTVTYLEQSGYDVVKKAVISSIN-LPQVD 369
Query: 390 GFSI-LDTCFN-LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASL 446
G +I LD CF S + P + FEG A+ + +Y +D+S + CLA+
Sbjct: 370 GSNIGLDLCFEPQSGSSTSHFPTITFHFEG-ADFNLPKENYIY---TDSSGIACLAMLP- 424
Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ I GN QQ+N +++YD + + L FA C ++
Sbjct: 425 --SNGMSIFGNIQQQNYQILYDNERNVLSFAPTVCDTL 460
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 135/364 (37%), Positives = 179/364 (49%), Gaps = 44/364 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
R + I+DTGSDL W QC PC C +Q P FDP+ S +Y+ + C S C+AL +
Sbjct: 101 RYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQ 160
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFGCGRNNKGLFGG 259
VC YF YGD + T G L E G + S+ FGCG N GL
Sbjct: 161 KVCVY------QYF--YGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLAN 212
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASGSLILGGNSSVFKN 312
SG++G GR LSLVSQ FSYCL PS G +L +S +
Sbjct: 213 GSGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATL-----NSTNAS 264
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------KGGILIDSGTVIT 364
S P+ T + NP L T Y LN+TGIS+GG L + FA GG +IDSGT IT
Sbjct: 265 SEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTIT 324
Query: 365 RLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNL--SAYQEVNIPLVKMEFEGNAE 420
L Y A++A F Q + P + S+LDTCF Q V +P + + F+G A+
Sbjct: 325 YLAEPAYDAVRAAFASQIT-LPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDG-AD 382
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + + S +CLA+AS S + IIG+YQ +N V+YD +NS + F
Sbjct: 383 WELPLQNYMLVDPSTGGGLCLAMASSS---DGSIIGSYQHQNFNVLYDLENSLMSFVPAP 439
Query: 481 CSSM 484
C M
Sbjct: 440 CHLM 443
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/342 (35%), Positives = 177/342 (51%), Gaps = 25/342 (7%)
Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
D GSD+TW+QC PC CY+Q PV++ S S V C + C AL G+SG C
Sbjct: 148 DMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCYAPACRAL----GSSGGCVQFL 203
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLF-GGVSGLMGLGR 268
+C Y V YGDGS + G+ G E L V GCG +N+GLF +G++GLGR
Sbjct: 204 -NECQYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDNQGLFPAPAAGILGLGR 262
Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG-GNSSVFKNSTPITYTNMIPNPQL 327
LS SQ + +G FSYCL G S +L G G S+ +TP ++T M+ N ++
Sbjct: 263 GSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRM 322
Query: 328 ATFYILNLTGISIGGKQLQA---------SGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
TFY + L GIS+GG +++ GG+++DSGT +TRL Y+A + F
Sbjct: 323 YTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF 382
Query: 379 ----LKQFSGFPSAPG-FSILDTCF-NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
+K+ G+PS G F+ DTC+ ++ +P V M F G E+ + + V
Sbjct: 383 RVAAVKEL-GWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPV 441
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
S+ +C A A S + IIGN Q + RV+YD ++
Sbjct: 442 DSNKGTMCFAFAG-SGDRGVSIIGNIQLQGFRVVYDVDGQRV 482
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/384 (32%), Positives = 187/384 (48%), Gaps = 35/384 (9%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ-QDPVFDPSIS 180
L +G + T Y+ + +G R + + +DTGSDL W QC PC C+ Q PV DP+ S
Sbjct: 79 LGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAAS 138
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLG 238
++ + C++ C AL F + C S D C Y YGD S T G+L + G
Sbjct: 139 STHAALPCDAPLCRALPFTS-----CGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFG 193
Query: 239 K------ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
+ FGCG NKG+F +G+ G GR SL SQ + FSYC S
Sbjct: 194 GDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTS---FSYCFTS 250
Query: 292 TQDAGASGSLILGGNSSVF------KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
D +S + LG ++ ++ + T +I NP + Y + L GIS+GG ++
Sbjct: 251 MFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARV 310
Query: 346 QA-SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFNL--- 400
+ +IDSG IT LP +Y A+KAEF+ Q G P +A G + LD CF L
Sbjct: 311 AVPESRLRSSTIIDSGASITTLPEDVYEAVKAEFVSQV-GLPAAAAGSAALDLCFALPVA 369
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
+ ++ +P + + +G A+ + G F A +C+ L + + E +IGNYQQ
Sbjct: 370 ALWRRPAVPALTLHLDGGADWELP-RGNYVFEDYAARVLCVVLDAAAGEQV--VIGNYQQ 426
Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
+N V+YD +N L FA C +
Sbjct: 427 QNTHVVYDLENDVLSFAPARCDKL 450
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 135/380 (35%), Positives = 190/380 (50%), Gaps = 49/380 (12%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y IELG + IVDTGSDL W+QC+PC CY+Q DP++DPS S ++ K C++S+
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIF 247
C +L A+G CSSS+ C Y YGD S T+G+ E L L + + +F F
Sbjct: 64 CQSLP-ASG----CSSSA-KTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQF 117
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-TQDAGASGSLILGGN 306
GCGR N G FGG +G++GLG+ +SL +Q FSYCL D+ + LI G +
Sbjct: 118 GCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSS 177
Query: 307 SSVFKN--STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA------------- 351
+S STPI IPN +T+Y + L GIS+GGKQL + A
Sbjct: 178 ASTGSGAISTPI-----IPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLR 232
Query: 352 -------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAY 403
GG + DSGT +T L ++YS +K+ F S P+ S D C+++S
Sbjct: 233 VRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVS-LPTVDASSSGFDLCYDVSKS 291
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ--VCLALASLSYEDETGIIGNYQQK 461
+ P + + F+G YFV D ++ CLA+ IIGN Q+
Sbjct: 292 KNFKFPALTLAFKGTKFSPPQKN---YFVIVDTAETVACLAMGGSGSLGLG-IIGNLMQQ 347
Query: 462 NQRVIYDTKNSQLGFAGEDC 481
N V+YD S + + C
Sbjct: 348 NYHVVYDRGTSTISMSPAQC 367
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 132/369 (35%), Positives = 190/369 (51%), Gaps = 27/369 (7%)
Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+ PLT G + Y+ ++ +G + I DTGSDL W QC PC CY Q P+FDP
Sbjct: 82 QAPLTPG----SGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPL 137
Query: 179 ISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S S+ V CNS C A++ + G GV C+Y +YGD +YT+G+LG E + +
Sbjct: 138 KSTSFSHVPCNSQNCKAIDDSHCGAQGV--------CDYSYTYGDQTYTKGDLGFEKITI 189
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG--GLFSYCLPSTQDA 295
G +SV I GCG + G FG SG++GLG LSLVSQ S+ G FSYCLP T +
Sbjct: 190 GSSSVKSVI-GCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLP-TLLS 247
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI 355
A+G + G N+ V S P + + + T+Y + L ISIG ++ AS +G +
Sbjct: 248 HANGKINFGQNAVV---SGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASA-KQGNV 303
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN--LSAYQEVNIPLVKM 413
+IDSGT ++ LP +Y + + LK + D CF+ ++ IP++
Sbjct: 304 IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITA 363
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
+F G A V++ + F K + CL L S DE GIIGN N + YD + +
Sbjct: 364 QFSGGAN--VNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKR 421
Query: 474 LGFAGEDCS 482
L F C+
Sbjct: 422 LSFKPTVCT 430
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 190/382 (49%), Gaps = 27/382 (7%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L SG+ L + Y + +G ++ ++I+DTGSDL W+QC PC +C+ Q P +DP S
Sbjct: 186 LESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSS 245
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S++ + C+ C + A C + + C YF YGDGS T G+ E + +
Sbjct: 246 SFRNISCHDPRCQLVS-APDPPKPCKAEN-QSCPYFYWYGDGSNTTGDFALETFTVNLTT 303
Query: 242 ---------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PS 291
V + +FGCG N+GLF G +GL+GLG+ LS SQ ++G FSYCL
Sbjct: 304 PNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDR 363
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNM--IPNPQLATFYILNLTGISIGGKQLQA-- 347
+A S LI G + + + + +T+ + + TFY + + + + + L+
Sbjct: 364 NSNASVSSKLIFGEDKELLSHPN-LNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPE 422
Query: 348 -----SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
S GG +IDSGT +T Y +K F+++ G+ G L C+N+S
Sbjct: 423 ETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSG 482
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
+++ +P + F A V YF+ D VCLA+ + IIGNYQQ+N
Sbjct: 483 IEKMELPDFGILFADEAVWNFPVEN--YFIWIDPEVVCLAILG-NPRSALSIIGNYQQQN 539
Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
++YD K S+LG+A C+ +
Sbjct: 540 FHILYDMKKSRLGYAPMKCADV 561
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 131/403 (32%), Positives = 200/403 (49%), Gaps = 41/403 (10%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
D+ + YL S +++G K T +P+ SG +L NY+ +LG + M +++DT
Sbjct: 71 DSHRLTYLSS----LVAGKPKP---TSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDT 123
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
+D W+ C C C N S S +Y V C+++ C A G + SS P
Sbjct: 124 SNDAVWLPCSGCSGCSNASTSFNTNSSS-TYSTVSCSTAQCTQ---ARGLTCPSSSPQPS 179
Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
C++ SYG S L ++ L L + +F FGC + G GLMGLGR +S
Sbjct: 180 VCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMS 239
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
LVSQT+ ++ G+FSYCLPS + SGSL LG + I YT ++ NP+ + Y
Sbjct: 240 LVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG----LLGQPKSIRYTPLLRNPRRPSLYY 295
Query: 333 LNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLKQ--FS 383
+NLTG+S+G Q+ + G +IDSGTVITR +Y A++ EF KQ S
Sbjct: 296 VNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVS 355
Query: 384 GFPSAPGFSILDTCF---NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
F + F DTCF N + ++ + + ++ + E T+ S + C
Sbjct: 356 SFSTLGAF---DTCFSADNENVAPKITLHMTSLDLKLPMENTL-------IHSSAGTLTC 405
Query: 441 LALASLSYEDET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
L++A + +I N QQ+N R+++D NS++G A E C
Sbjct: 406 LSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 133/433 (30%), Positives = 201/433 (46%), Gaps = 38/433 (8%)
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
G +++L H++ D ++ + RL D H SR+ G + + T +
Sbjct: 30 GGFSVDLIHRDSPHSPFFDPSKTRTERLT-DAFHRS--ASRV-----GRFRQSAMTSDGI 81
Query: 125 TSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
S + YI + +G + VI VDTGSDLTW QC+PC CY Q P FDP S +
Sbjct: 82 QSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSST 141
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK--- 239
Y+ C +S C AL GN C + C + SY DGS+T G L E L +
Sbjct: 142 YRDSSCGTSFCLAL----GNDRSCRNGK--KCTFMYSYADGSFTGGNLAVETLTVASTAG 195
Query: 240 --ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
S F FGC + G+F SG++GLG ++LS++SQ G FSYC LP D+
Sbjct: 196 KPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDS 255
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK--- 352
S S I G S + + ++ ++ P +Y++ L G S+G K+L GF+K
Sbjct: 256 SMS-SRINFGRSGIVSGAGTVSTPLVMKGPD-TYYYLITLEGFSVGKKRLSYKGFSKKAE 313
Query: 353 ---GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
G I++DSGT T LP Y L+ G I C+N + Q ++ P
Sbjct: 314 VEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQ-IDAP 372
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
++ F+ + V++ F++ VC + S + GI+GN Q N V +D
Sbjct: 373 IITAHFK---DANVELQPWNTFLRMQEDLVCFTVLPTS---DIGILGNLAQVNFLVGFDL 426
Query: 470 KNSQLGFAGEDCS 482
+ ++ F DC+
Sbjct: 427 RKKRVSFKAADCT 439
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 122/358 (34%), Positives = 178/358 (49%), Gaps = 36/358 (10%)
Query: 142 GGRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EF 198
GG T+++DT SD+ WVQC PC + C+ Q D ++DPS S S C+S C L +
Sbjct: 152 GGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPY 211
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVNDFIFGCGRN-- 252
A G C+ + C Y V Y DGS + G + L L A ++++F FGC
Sbjct: 212 ANG----CTPAGD-QCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALL 266
Query: 253 NKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
G F SG+M LGR SL +QT +G +FSYCLP T SG ILG
Sbjct: 267 QPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVH--SGFFILGVPRVAAS 324
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPS 369
T M+ + Y++ L I + GK+L FA G ++ DS T++TRLPP+
Sbjct: 325 R---YAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVM-DSRTIVTRLPPT 380
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLS-----AYQEVNIPLVKMEFEG-NAEMTV 423
Y AL+A F+ + + +A LDTC++ S V +P + + F+G N + +
Sbjct: 381 AYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVEL 440
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
D +G++ CLA A + + TGIIGN QQ+ V+Y+ + +GF C
Sbjct: 441 DPSGVLL-------DGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 143/423 (33%), Positives = 210/423 (49%), Gaps = 76/423 (17%)
Query: 77 CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD-VSNTEIPLTSGIRLQTLNY 135
CSG Q D V ++ S+ N+KD N ++ G N+
Sbjct: 75 CSGSGHSQPPSPQEIFGRDESRVSFINSKFNQYAPENLKDHTPNNKLFDEDG------NF 128
Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
+ + G +N T+I+DTGS +TW QC+ C
Sbjct: 129 LVDVAFGTPPQNFTLILDTGSSITWTQCKACTV--------------------------- 161
Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRN 252
+ NY ++YGD S + G G + + L + V F FG GRN
Sbjct: 162 -------------------ENNYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGRN 202
Query: 253 NKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
NKG FG GV G++GLG+ LS VSQT+ F +FSYCLP + + GSL+ G ++
Sbjct: 203 NKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLP---EEDSIGSLLFGEKAT--S 257
Query: 312 NSTPITYTNMIPNP---QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRL 366
S+ + +T+++ P Q + +Y +NL+ IS+G ++L +S FA G +IDS TVITRL
Sbjct: 258 QSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRL 317
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
P YSALKA F K + +P + G ILDTC+NLS ++V +P + + F G A++
Sbjct: 318 PQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVR 377
Query: 423 VDVTGIVYFVKSDASQVCLALASLS---YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
++ T IV+ SD S++CLA A S E IIGN QQ + V+YD + ++GF
Sbjct: 378 LNGTNIVW--GSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSN 435
Query: 480 DCS 482
CS
Sbjct: 436 GCS 438
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 125/353 (35%), Positives = 184/353 (52%), Gaps = 41/353 (11%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
I+DTGSD+ W+QC+PC+ CYNQ +FDPS S +YK + +S+TC ++E + CSS
Sbjct: 102 IIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTS-----CSS 156
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLF-GGVSG 262
+ C Y + YGDGSY++G+L E L LG + + F GCGRNN F G SG
Sbjct: 157 DNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSG 216
Query: 263 LMGLGRSDLSLVSQ---TSEIFGGLFSYCLPSTQDAGAS----GSLILGGNSSVFKNSTP 315
++GLG +SL++Q S G FSYCL S + + + ++ G+ +V STP
Sbjct: 217 IVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNFGDAAVVSGDGTV---STP 273
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF---AKGGILIDSGTVITRLPPSI 370
I + +P++ FY L L S+G +++ +S F KG I+IDSGT +T LP I
Sbjct: 274 I----VTHDPKV--FYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSGTTLTLLPNDI 327
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
YS L++ L C+ S + E+N P++ F G V + +
Sbjct: 328 YSKLESAVADLVELDRVKDPLKQLSLCYR-STFDELNAPVIMAHFSG---ADVKLNAVNT 383
Query: 431 FVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
F++ + CLA S + G I GN Q+N V YD + + F DCS
Sbjct: 384 FIEVEQGVTCLAFIS----SKIGPIFGNMAQQNFLVGYDLQKKIVSFKPTDCS 432
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 137/419 (32%), Positives = 196/419 (46%), Gaps = 34/419 (8%)
Query: 87 QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--R 144
QQQN L N V L+S K+ SGNI L SG L T Y + +G +
Sbjct: 131 QQQNNLA--NAVVASLKSS-KDEFSGNIMAT------LESGASLGTGEYFIDMFVGTPPK 181
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
++ +I+DTGSDL+W+QC PC C+ Q P ++P+ S SY+ + C C + ++ +
Sbjct: 182 HVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQLV--SSPDPL 239
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKG 255
+ C YF Y DGS T G+ E + GK V D +FGCG NKG
Sbjct: 240 QHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKG 299
Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGASGSLILGGNSSVFKNST 314
F G GL+GLGR LS SQ I+G FSYCL + S LI G + + N
Sbjct: 300 FFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELL-NHH 358
Query: 315 PITYTNMIPNPQLA--TFYILNLTGISIGG-------KQLQASGFAKGGILIDSGTVITR 365
+ +T ++ + TFY L + I +GG K S GG +IDSG+ +T
Sbjct: 359 NLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTF 418
Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
P S Y +K F K+ A I+ C+N+S +V +P + F A
Sbjct: 419 FPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPA 478
Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y + D +CLA+ IIGN Q+N ++YD K S+LG++ C+ +
Sbjct: 479 ENYFYQYEPDEV-ICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 536
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 187/343 (54%), Gaps = 43/343 (12%)
Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 215
+TW QC+PC C FDPS S +Y C ST GN+
Sbjct: 98 ITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPST-------VGNT------------ 138
Query: 216 YFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSL 273
Y ++YGD S + G G + + L + V F FGCGRNN+G FG G G++GLG+ LS
Sbjct: 139 YNMTYGDKSTSVGNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLST 198
Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP-----QLA 328
VSQT+ F +FSYCLP + + GSL+ G ++ + + + +T+++ P + +
Sbjct: 199 VSQTASKFKKVFSYCLP---EEDSIGSLLFGEKAT---SQSSLKFTSLVNGPGTSGLEES 252
Query: 329 TFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
+Y + L IS+G K+L +S FA G +IDSGTVIT LP YSAL A F K + +P
Sbjct: 253 GYYFVKLLDISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYP 312
Query: 387 SAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+ G ILDTC+NLS ++V +P + + F A++ ++ +++ +DAS++CLA
Sbjct: 313 LSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIW--GNDASRLCLA 370
Query: 443 LASLS---YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
A S E IIGN QQ + V+YD + ++GF G CS
Sbjct: 371 FAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCS 413
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 199/415 (47%), Gaps = 40/415 (9%)
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIAT 138
K W N D V YL S + + + T +P+ SG ++ + NY+
Sbjct: 51 KAGSWVNTVINMASKDPARVTYLSSLVASPKA--------TSVPIASGQQVLNIGNYVVR 102
Query: 139 IELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
++LG G+ M +++DT D WV C C C P F P+ S +Y + C+ C +
Sbjct: 103 VKLGTPGQLMFMVLDTSRDAAWVPCADCAGC---SSPTFSPNTSSTYASLQCSVPQCTQV 159
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
+ C ++ C + +YG S L ++ LGL ++ + FGC G
Sbjct: 160 RGLS-----CPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDTLPSYSFGCVNAVSGS 214
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GL+GLGR +SL+SQ+ ++ G+FSYC PS + SGSL LG I
Sbjct: 215 TLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGP----LGQPKNI 270
Query: 317 TYTNMIPNPQLATFYILNLTGISIG------GKQLQASGFAKG-GILIDSGTVITRLPPS 369
T ++ NP T Y +NLTG+S+G +L A G G +IDSGTVITR
Sbjct: 271 RTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEP 330
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGI 428
+Y+A++ EF KQ G P A DTCF +A E P V F G + ++ ++ T I
Sbjct: 331 VYAAIRDEFRKQVKG-PFA-TIGAFDTCF--AATNEDIAPPVTFHFTGMDLKLPLENTLI 386
Query: 429 VYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S S CLA+A+ + +I N QQ+N R+++D NS+LG A E C
Sbjct: 387 ---HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 438
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 127/360 (35%), Positives = 178/360 (49%), Gaps = 35/360 (9%)
Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
R + I+DTGSDL W QC PC C +Q P FDP+ S +Y+ + C++ C+AL +
Sbjct: 102 ARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPACNALYYP--- 158
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFGCGRNNKGLFG 258
+C + C Y YGD + T G L E G + ++ FGCG N G
Sbjct: 159 --LCYQKT---CVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNAGSLA 213
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV-FKNSTPIT 317
SG++G GR LSLVSQ + FSYCL S S L G +++ N++ +
Sbjct: 214 NGSGMVGFGRGSLSLVSQ---LGSPRFSYCLTSFLSPVRS-RLYFGAYATLNSTNASTVQ 269
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFA--------KGGILIDSGTVITRLPPS 369
T I NP L T Y LN+TGIS+GG +L GG +IDSGT IT L
Sbjct: 270 STPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEP 329
Query: 370 IYSALKAEFLKQF-SGFP--SAPGFSILDTCFNL--SAYQEVNIPLVKMEFEGNAEMTVD 424
Y A++ F+ S P S+LDTCF Q V +P + + F+G A+ +
Sbjct: 330 AYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDG-ADWELP 388
Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ + V +CLA+A+ S + IIG+YQ +N V+YD +NS L F C+ M
Sbjct: 389 LQNYM-LVDPSTGGLCLAMATSS---DGSIIGSYQHQNFNVLYDLENSLLSFVPAPCNLM 444
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 141/434 (32%), Positives = 213/434 (49%), Gaps = 42/434 (9%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
T+EL H++ + + +E +R I++ L +S +N + + + E P+ +
Sbjct: 27 FTVELIHRDSPKSPMYNSSETHFDR-IVNALR----RSSHRNTV---VLESDTAEAPIFN 78
Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
Y+ I +G +++ DTGSD+ W QC+PC +CY Q P+FDPS S +YK
Sbjct: 79 ----NGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYK 134
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
V C+S C +G+ CS S +C Y ++YGD S+++G L + + + S
Sbjct: 135 NVACSSPVCS----YSGDGSSCSDDS--ECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRP 188
Query: 245 FIF-----GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-P-STQDAG 296
F GCG +N G F VSG++GLGR SLV+Q GG FSYCL P T
Sbjct: 189 VAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTN 248
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGG- 354
S L G N++V + T T + + Q TFY L L +S+G + G +K G
Sbjct: 249 DSTKLNFGSNANVSGSGT--VSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGG 306
Query: 355 ---ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS-ILDTCFNLSAYQEVNIPL 410
I+IDSGT +T LP ++ ++ + + Q P A S LD CF + + +P
Sbjct: 307 ESNIIIDSGTTLTYLPSALLNSFGSA-ISQSMSLPHAQDPSEFLDYCFATTT-DDYEMPP 364
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V M FEG A++ + + FV+ +CLA S +D I GN Q N V YD K
Sbjct: 365 VTMHFEG-ADVPLQRENL--FVRLSDDTICLAFGSFP-DDNIFIYGNIAQSNFLVGYDIK 420
Query: 471 NSQLGFAGEDCSSM 484
N + F C ++
Sbjct: 421 NLAVSFQPAHCGAV 434
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 184/378 (48%), Gaps = 34/378 (8%)
Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
T Y+ + +G R + + +DTGSDL W QC PC+ C++Q P+ DP+ S +Y + C
Sbjct: 89 TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148
Query: 190 SSTCHALEFATGNSGVCSS--SSPPDCNYFVSYGDGSYTRGELGREHLGLG--------K 239
+ C AL F + G SS + C Y YGD S T GE+ + G +
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208
Query: 240 ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
FGCG NKG+F +G+ G GR SL SQ + FSYC S ++ +S
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVT---TFSYCFTSMFESKSS 265
Query: 299 GSLILGGN-------SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
+ LGG S S + T ++ NP + Y L+L GIS+G +L
Sbjct: 266 -LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAK 324
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNL---SAYQEV 406
+IDSG IT LP ++Y A+KAEF Q G P S LD CF L + ++
Sbjct: 325 LRSTIIDSGASITTLPEAVYEAVKAEFAAQV-GLPPTGVVEGSALDLCFALPVTALWRRP 383
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P + + +G A+ + V+ + A++V + + D+T +IGN+QQ+N V+
Sbjct: 384 PVPSLTLHLDG-ADWELPRGNYVF--EDLAARVMCVVLDAAPGDQT-VIGNFQQQNTHVV 439
Query: 467 YDTKNSQLGFAGEDCSSM 484
YD +N L FA C S+
Sbjct: 440 YDLENDWLSFAPARCDSL 457
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 125/350 (35%), Positives = 178/350 (50%), Gaps = 28/350 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+ +DTGSDL W QCQPC C+NQ P +D S S ++ C+S+ C T +C
Sbjct: 106 LTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT----MCV 161
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVNDFIFGCGRNNKGLF-GGVSGLMG 265
+ + C + SYGD S T G L E + + ASV +FGCG NN G+F +G+ G
Sbjct: 162 NQTVQTCAFSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAG 221
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPN 324
GR LSL SQ G FS+C + S +++ + ++KN + T +I N
Sbjct: 222 FGRGPLSLPSQLKV---GNFSHCFTAVSGRKPS-TVLFDLPADLYKNGRGTVQTTPLIKN 277
Query: 325 PQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
P TFY L+L GI++G +L S FA GG +IDSGT T LPP +Y + EF
Sbjct: 278 PAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEF 337
Query: 379 LK--QFSGFPSAPGFSILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
+ PS +L CF+ + ++P + + FEG A M + V+ K
Sbjct: 338 AAHVKLPVVPSNETGPLL--CFSAPPLGKAPHVPKLVLHFEG-ATMHLPRENYVFEAKDG 394
Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ +CLA+ E E IIGN+QQ+N V+YD KNS+L F C +
Sbjct: 395 GNCSICLAI----IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 128/392 (32%), Positives = 196/392 (50%), Gaps = 37/392 (9%)
Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQP 163
+ ++++G K T +P+ SG +L NY+ +LG + M +++DT +D W+ C
Sbjct: 4 LSSLVAGKPKP---TSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG 60
Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
C C N S S +Y V C+++ C A G + SS P C++ SYG
Sbjct: 61 CSGCSNASTSFNTNSSS-TYSTVSCSTAQCTQ---ARGLTCPSSSPQPSVCSFNQSYGGD 116
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
S L ++ L L + +F FGC + G GLMGLGR +SLVSQT+ ++ G
Sbjct: 117 SSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSG 176
Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
+FSYCLPS + SGSL LG + I YT ++ NP+ + Y +NLTG+S+G
Sbjct: 177 VFSYCLPSFRSFYFSGSLKLG----LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV 232
Query: 344 Q-------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ--FSGFPSAPGFSIL 394
Q L + G +IDSGTVITR +Y A++ EF KQ S F + F
Sbjct: 233 QVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF--- 289
Query: 395 DTCF---NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
DTCF N + ++ + + ++ + E T+ S + CL++A +
Sbjct: 290 DTCFSADNENVAPKITLHMTSLDLKLPMENTL-------IHSSAGTLTCLSMAGIRQNAN 342
Query: 452 T--GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+I N QQ+N R+++D NS++G A E C
Sbjct: 343 AVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 374
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 191/368 (51%), Gaps = 38/368 (10%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ T +G + I DTGSD+ W+QC+PC+ CYNQ P+F+PS S SYK + C+S
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKL 146
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
CH++ + CS + C Y +SYGD S+++G+L + L L S +
Sbjct: 147 CHSVRDTS-----CSDQN--SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVI 199
Query: 248 GCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG-G 305
GCG +N G FGG SG++GLG +SL++Q GG FSYCL + ++ S IL G
Sbjct: 200 GCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG-----ILIDSG 360
+++V ++ + +P FY L L S+G K+++ G ++GG I+IDSG
Sbjct: 260 DAAVVSGDGVVSTPLIKKDP---VFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 361 TVITRLPPSIYSALKA---EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
T +T +P +Y+ L++ + +K FS+ C++L + E + P++ + F+G
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSL---CYSLKS-NEYDFPIITVHFKG 372
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGF 476
V++ I FV VC A + G I GN Q+N V YD + + F
Sbjct: 373 ---ADVELHSISTFVPITDGIVCFAFQP---SPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426
Query: 477 AGEDCSSM 484
DC+ +
Sbjct: 427 KPTDCTKV 434
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 178/364 (48%), Gaps = 44/364 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
R + I+DTGSDL W QC PC C +Q P FDP+ S +Y+ + C S C+AL +
Sbjct: 101 RYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQ 160
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFGCGRNNKGLFGG 259
VC YF YGD + T G L E G + S+ FGCG N G
Sbjct: 161 KVCVY------QYF--YGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLAN 212
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASGSLILGGNSSVFKN 312
SG++G GR LSLVSQ FSYCL PS G +L +S +
Sbjct: 213 GSGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATL-----NSTNAS 264
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------KGGILIDSGTVIT 364
S P+ T + NP L T Y LN+TGIS+GG L + FA GG +IDSGT IT
Sbjct: 265 SEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTIT 324
Query: 365 RLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNL--SAYQEVNIPLVKMEFEGNAE 420
L Y A++A F Q + P + S+LDTCF Q V +P + + F+G A+
Sbjct: 325 YLAEPAYDAVRAAFASQIT-LPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDG-AD 382
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + + S +CLA+AS S + IIG+YQ +N V+YD +NS + F
Sbjct: 383 WELPLQNYMLVDPSTGGGLCLAMASSS---DGSIIGSYQHQNFNVLYDLENSLMSFVPAP 439
Query: 481 CSSM 484
C M
Sbjct: 440 CHLM 443
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 125/376 (33%), Positives = 174/376 (46%), Gaps = 22/376 (5%)
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
PL SG L + Y LG + +IVDTGSDL +VQC PC CY Q P++ PS S
Sbjct: 22 PLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNS 81
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSS---SPPD--CNYFVSYGDGSYTRGELGREHL 235
++ V C+S+ C + G CSSS SPP C+Y YGD S T G E
Sbjct: 82 STFTPVPCDSAECLLIPAPVG--APCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETA 139
Query: 236 GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-TQD 294
+G VN FGCG N+G F G++GLG+ LS SQ F F+YCL S
Sbjct: 140 TVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSP 199
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QA 347
SLI G + + + +T ++ NP + Y + + I GG+ L +
Sbjct: 200 TSVFSSLIFGDD--MMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKI 257
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
GG + DSGT +T P Y+ + A F K + P L C N+S
Sbjct: 258 DSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPI 317
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
P +EF+ A + YF++ + CLA+ S D +IGN Q+N V Y
Sbjct: 318 YPSFTIEFDQGATYRPNQGN--YFIEVSPNIDCLAMLESS-SDGFNVIGNIIQQNYLVQY 374
Query: 468 DTKNSQLGFAGEDCSS 483
D + ++GFA +C +
Sbjct: 375 DREEHRIGFAHANCDA 390
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 133/378 (35%), Positives = 187/378 (49%), Gaps = 44/378 (11%)
Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
+R Y+ + +G R + ++DTGSDL W QC PC C Q P F+P+ S SY
Sbjct: 81 LRFSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYAS 140
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KAS 241
+ C+S+ C+AL S +C ++ C Y YGD + + G L E G + +
Sbjct: 141 LPCSSAMCNALY-----SPLCFQNA---CVYQAFYGDSASSAGVLANETFTFGTNSTRVA 192
Query: 242 VNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
V FGCG N G LF G SG++G GR LSLVSQ FSYCL S A+
Sbjct: 193 VPRVSFGCGNMNAGTLFNG-SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSP-ATSR 247
Query: 301 LILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA--- 351
L G NS+ +S P+ T I NP L T Y LN+TGIS+ G L S FA
Sbjct: 248 LYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINE 307
Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNL--SAYQ 404
GG++IDSGT +T L Y+ ++ F+ + G P A DTCF +
Sbjct: 308 TDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRR 366
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQ 463
V +P + + F+G A+M + + Y V + +CLA+ D+ IIG++Q +N
Sbjct: 367 MVTLPEMVLHFDG-ADMELPLEN--YMVMDGGTGNLCLAMLP---SDDGSIIGSFQHQNF 420
Query: 464 RVIYDTKNSQLGFAGEDC 481
++YD +NS L F C
Sbjct: 421 HMLYDLENSLLSFVPAPC 438
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 133/378 (35%), Positives = 187/378 (49%), Gaps = 44/378 (11%)
Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
+R Y+ + +G R + ++DTGSDL W QC PC C Q P F+P+ S SY
Sbjct: 78 LRFSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYAS 137
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KAS 241
+ C+S+ C+AL S +C ++ C Y YGD + + G L E G + +
Sbjct: 138 LPCSSAMCNALY-----SPLCFQNA---CVYQAFYGDSASSAGVLANETFTFGTNSTRVA 189
Query: 242 VNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
V FGCG N G LF G SG++G GR LSLVSQ FSYCL S A+
Sbjct: 190 VPRVSFGCGNMNAGTLFNG-SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSP-ATSR 244
Query: 301 LILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA--- 351
L G NS+ +S P+ T I NP L T Y LN+TGIS+ G L S FA
Sbjct: 245 LYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINE 304
Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNL--SAYQ 404
GG++IDSGT +T L Y+ ++ F+ + G P A DTCF +
Sbjct: 305 TDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRR 363
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQ 463
V +P + + F+G A+M + + Y V + +CLA+ D+ IIG++Q +N
Sbjct: 364 MVTLPEMVLHFDG-ADMELPLEN--YMVMDGGTGNLCLAMLP---SDDGSIIGSFQHQNF 417
Query: 464 RVIYDTKNSQLGFAGEDC 481
++YD +NS L F C
Sbjct: 418 HMLYDLENSLLSFVPAPC 435
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 136/403 (33%), Positives = 196/403 (48%), Gaps = 41/403 (10%)
Query: 99 VQYLQSRIKNM---ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTG 153
+Q Q R++ + + N + + E P+T I + Y+ + +G +++ I+DTG
Sbjct: 5 IQRSQERLEKLQITSAVNTHQMKDIETPVTPDIG--SGEYLIQMAIGTPALSLSAIMDTG 62
Query: 154 SDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FATGNSGVCSSSSPP 212
SDL W +C PC C S S +Y KVLC SS C F+ N G
Sbjct: 63 SDLVWTKCNPCTDCSTSSIYDP--SSSSTYSKVLCQSSLCQPPSIFSCNNDG-------- 112
Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
DC Y YGD S T G L E + S+ + FGCG +N+G F V GL+G GR LS
Sbjct: 113 DCEYVYPYGDRSSTSGILSDETFSISSQSLPNITFGCGHDNQG-FDKVGGLVGFGRGSLS 171
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
LVSQ G FSYCL S D+ + L +G +S+ +T + T ++ + +Y
Sbjct: 172 LVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASL--EATTVGSTPLVQSSSTNHYY- 228
Query: 333 LNLTGISIGGKQL---------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
L+L GIS+GG+ L Q+ G GG++IDSGT +T L + Y A+K + +
Sbjct: 229 LSLEGISVGGQSLAIPTGTFDIQSDG--SGGLIIDSGTTLTFLQQTAYDAVKEAMVSSIN 286
Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLA 442
P A G LD CFN P + F+G DV Y F S + VCLA
Sbjct: 287 -LPQADG--QLDLCFNQQGSSNPGFPSMTFHFKG---ADYDVPKENYLFPDSTSDIVCLA 340
Query: 443 -LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ + S I GN QQ+N +++YD +N+ L FA C ++
Sbjct: 341 MMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTACDTL 383
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 199/402 (49%), Gaps = 34/402 (8%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVD 151
D+ + +L S+ + SG + T P+ SG QT +Y+ LG + + + +D
Sbjct: 48 DDARLLFLSSKAAS--SGGV-----TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALD 97
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
T +D TW C PC +C F P+ S SY + C S C E + +S+
Sbjct: 98 TSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPL 155
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS--GLMGLGRS 269
P C + + D S+ + LG + L LGK ++ + FGC G + GL+GLGR
Sbjct: 156 PACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRG 214
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
+SL+SQT + G+FSYCLPS + SGSL LG +N + YT ++ NP +
Sbjct: 215 PMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPS 270
Query: 330 FYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
Y +N+TG+S+G ++ A FA G +IDSGTVITR +Y+AL+ EF +Q
Sbjct: 271 LYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQV 330
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CL 441
+ DTCFN P V + +G ++T+ + + + S A+ + CL
Sbjct: 331 AAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACL 388
Query: 442 ALASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A+A ++ N QQ+N RV+ D S++GFA E C
Sbjct: 389 AMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 168/346 (48%), Gaps = 25/346 (7%)
Query: 147 TVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FATGNS 203
T+ +DT D+ W+QC PC CY Q++ FDP S + V C S C L +A G S
Sbjct: 160 TMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCS 219
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-DFIFGCGRNNKGLFGG-VS 261
S+ DC Y + Y D T G + L + ++ +F FGC +G F S
Sbjct: 220 KPNSTG---DCLYRIEYSDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQAS 276
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG--ASGSLI---LGGNSSVFKNSTPI 316
G M LG SL+SQT+ +G FSYC+P AG + G + GG S F +TP+
Sbjct: 277 GTMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFA-TTPL 335
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALK 375
+ + NP T Y++ L GI + G++L GG ++DS VIT+LPP+ Y AL+
Sbjct: 336 VRSANVINP---TIYVVRLQGIEVAGRRLNVPPVVFSGGTVMDSSAVITQLPPTAYRALR 392
Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
F + + LDTCF+ +V +P V + F+G A + + + ++
Sbjct: 393 LAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLL----- 447
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A ++ + G IGN QQ+ V+YD +GF C
Sbjct: 448 --DSCLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 191/377 (50%), Gaps = 43/377 (11%)
Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNS 190
Y+ T+ +G + + DTGSDL W QC PC + C+ Q P+++P+ S ++ + CNS
Sbjct: 113 EYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS 172
Query: 191 S--TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
S C C+ C Y+ +YG G +T G G E G +A V
Sbjct: 173 SLSMCAGALAGAAPPPGCA------CMYYQTYGTG-WTAGVQGSETFTFGSSAADQARVP 225
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + + G +GL+GLGR LSLVSQ + G FSYCL QD ++ +L+L
Sbjct: 226 GVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLL 282
Query: 304 GGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA-------KG 353
G ++++ N T + T + +P ++T+Y LNLTGIS+G K L S A G
Sbjct: 283 GPSAAL--NGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 340
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPG--FSILDTCFNLSAYQEVN--- 407
G++IDSGT IT L + Y ++A Q + P+ G + LD CF L A
Sbjct: 341 GLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAV 400
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+P + + F+G A+M + ++ S + CLA+ + + + GNYQQ+N ++Y
Sbjct: 401 LPSMTLHFDG-ADMVLPADS---YMISGSGVWCLAMRNQT-DGAMSTFGNYQQQNMHILY 455
Query: 468 DTKNSQLGFAGEDCSSM 484
D + L FA CS++
Sbjct: 456 DVREETLSFAPAKCSTL 472
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 131/402 (32%), Positives = 199/402 (49%), Gaps = 34/402 (8%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVD 151
D+ + +L S+ + SG I T P+ SG QT +Y+ LG + + + +D
Sbjct: 48 DDARLLFLSSKAAS--SGGI-----TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALD 97
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
T +D TW C PC +C F P+ S SY + C S C E + +S+
Sbjct: 98 TSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPL 155
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS--GLMGLGRS 269
P C + + D S+ + LG + L LGK ++ + FGC G + GL+GLGR
Sbjct: 156 PACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRG 214
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
+SL+SQT + G+FSYCLPS + SGSL LG +N + YT ++ NP +
Sbjct: 215 PMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPS 270
Query: 330 FYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
Y +N+TG+S+G ++ A FA G +IDSGTVITR +Y+AL+ EF +Q
Sbjct: 271 LYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQV 330
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CL 441
+ DTCFN P V + +G ++T+ + + + S A+ + CL
Sbjct: 331 AAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACL 388
Query: 442 ALASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A+A ++ N QQ+N RV+ D S++GFA E C
Sbjct: 389 AMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 124/379 (32%), Positives = 182/379 (48%), Gaps = 42/379 (11%)
Query: 134 NYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+YIA I +G + ++ DT SDLTW+QCQPC+ CY Q PVFDP S SY ++ ++
Sbjct: 140 DYIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAP 199
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDG------SYTRGELGREHLGLGKASVNDF 245
C AL + G + C Y V YGDG S + G+L E L +
Sbjct: 200 DCQALGRSGGGDAKRGT-----CIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAY 254
Query: 246 I-FGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPSTQDAGASGSLI 302
+ GCG +NKGLFG +G++GL R +S+ Q + + + FSYCL S S
Sbjct: 255 LSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSST 314
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG--------KQLQASGF-AKG 353
L + S P ++T + N + TFY + L G+S+GG + LQ + G
Sbjct: 315 LTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHG 374
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGF-------PSAPGFSILDTCFNLSAYQE- 405
G+++DSGT +TRL Y+A + F +G PS + DTC+ +
Sbjct: 375 GVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSG----LFDTCYTVGGRAGL 430
Query: 406 ---VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
V +P V M F G E+++ + V S + VC A A + +IGN Q+
Sbjct: 431 RHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGT-VCFAFAGTG-DRSVSVIGNILQQG 488
Query: 463 QRVIYDTKNSQLGFAGEDC 481
RV+YD ++GFA C
Sbjct: 489 FRVVYDIGGQRVGFAPNSC 507
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/350 (35%), Positives = 177/350 (50%), Gaps = 28/350 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+ +DTGS L W QCQPC C+NQ P +D S S ++ C+S+ C T +C
Sbjct: 106 LTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT----MCV 161
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVNDFIFGCGRNNKGLF-GGVSGLMG 265
+ + C Y SYGD S T G L E + + ASV +FGCG NN G+F +G+ G
Sbjct: 162 NQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAG 221
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPN 324
GR LSL SQ G FS+C + S +++ + ++KN + T +I N
Sbjct: 222 FGRGPLSLPSQLKV---GNFSHCFTAVSGRKPS-TVLFDLPADLYKNGRGTVQTTPLIKN 277
Query: 325 PQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
P TFY L+L GI++G +L S FA GG +IDSGT T LPP +Y + EF
Sbjct: 278 PAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEF 337
Query: 379 LK--QFSGFPSAPGFSILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
+ PS +L CF+ + ++P + + FEG A M + V+ K
Sbjct: 338 AAHVKLPVVPSNETGPLL--CFSAPPLGKAPHVPKLVLHFEG-ATMHLPRENYVFEAKDG 394
Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ +CLA+ E E IIGN+QQ+N V+YD KNS+L F C +
Sbjct: 395 GNCSICLAI----IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 133/436 (30%), Positives = 204/436 (46%), Gaps = 39/436 (8%)
Query: 58 QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
Q ++ I + K + K W D ++YL + + D
Sbjct: 29 QSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPERLKYLST---------LADQ 79
Query: 118 SNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV 174
T +P+ G + L+ NY+ ++LG G+ M +++DT +D WV C C C +
Sbjct: 80 KTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSST---T 136
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
F P+ S + + C+ + C + G S C ++ C + SYG S L ++
Sbjct: 137 FLPNASTTLGSLDCSGAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLTATLVQDA 191
Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
+ L + F FGC G GL+GLGR +SL+SQ ++ G+FSYCLPS +
Sbjct: 192 ITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKS 251
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------GKQLQA 347
SGSL LG I T ++ NP + Y +NLTG+S+G +QL
Sbjct: 252 YYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVF 307
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
G +IDSGTVITR +Y A++ EF KQ +G S+ G DTCF +A E
Sbjct: 308 DPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AATNEAE 363
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRV 465
P + + FEG + +++ S S CL++A+ + +I N QQ+N R+
Sbjct: 364 APAITLHFEGLNLVLPMENSLIH--SSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRI 421
Query: 466 IYDTKNSQLGFAGEDC 481
++DT NS+LG A E C
Sbjct: 422 MFDTTNSRLGIARELC 437
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/283 (39%), Positives = 160/283 (56%), Gaps = 19/283 (6%)
Query: 7 PLTILSLLLPLMVSLFLLAKGAHCFEGKKKL-----HLHKLQWQQKSGSSSSCVSHQKSR 61
P++ + LL L+ S L +K F+G+K LH + SS V +
Sbjct: 4 PISTIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSS---VCSPSPK 60
Query: 62 IEMGAITLELKHKNYCSGKIVDWNEQQQNR---LILDNLHVQYLQSRI-KNMISGNIKDV 117
+ +LE+ HK+ K+ + +R L D V ++SR+ KN G
Sbjct: 61 GDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKG 120
Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPV 174
S +P SG + T NY+ T+ LG R++T I DTGSDLTW QC+PC + CY+QQ+P+
Sbjct: 121 SKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPI 180
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
F+PS S SY + C+S TC L+ TGNS CS+S+ C Y + YGD SY+ G ++
Sbjct: 181 FNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAST---CVYGIQYGDQSYSVGFFAQDK 237
Query: 235 LGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
L L V N+F+FGCG+NN+GLF GV+GL+GLGR+ LSL+S+
Sbjct: 238 LALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 59/103 (57%), Gaps = 2/103 (1%)
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
L S +P A SILDTC++ S Y V++P + + F AEM +D +GI Y + + SQ
Sbjct: 275 LSLMSKYPKAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYIL--NISQ 332
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
VCLA A S + I+GN QQK V+YD ++GFA C
Sbjct: 333 VCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 131/430 (30%), Positives = 209/430 (48%), Gaps = 42/430 (9%)
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD-VSNTEIPL 124
+ + EL H++ + + QN+ HV R N + KD +SNT
Sbjct: 27 SFSFELIHRDSSKSPLY---KPAQNKF----QHVVNAARRSINRANRLFKDSLSNTP--- 76
Query: 125 TSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
S + + Y+ T +G V +VDTGSD+ W+QC+PC+ CY Q P+F+PS S S
Sbjct: 77 ESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSS 136
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---- 238
YK + C+S+ C ++ + + N C Y +++ D SY++GEL E L L
Sbjct: 137 YKNIPCSSNLCQSVRYTSCNKQ-------NSCEYTINFSDQSYSQGELSVETLTLDSTTG 189
Query: 239 -KASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
S + GCG NN+G+F G SG++GLG +SL +Q GG FSYC LP D+
Sbjct: 190 HSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDS 249
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AK 352
+ L G+++V ++ + +PQ FY L L S+G K+++ +
Sbjct: 250 NKTSKLNF-GDAAVVSGDGVVSTPFVKKDPQ--AFYYLTLEAFSVGNKRIEFEVLDDSEE 306
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
G I++DSGT +T LP +Y+ L++ + +L+ C+++++ Q + P++
Sbjct: 307 GNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQ-YDFPIIT 365
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKN 471
F+G + + I F VCLA S +TG I GN Q N V YD +
Sbjct: 366 AHFKG---ADIKLNPISTFAHVADGVVCLAFTS----SQTGPIFGNLAQLNLLVGYDLQQ 418
Query: 472 SQLGFAGEDC 481
+ + F DC
Sbjct: 419 NIVSFKPSDC 428
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 140/446 (31%), Positives = 207/446 (46%), Gaps = 57/446 (12%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE----- 121
+ LEL H + G + ++ R D H R N G I+ S+T
Sbjct: 25 VRLELTHADDRGGYV----GAERVRRAADRSH------RRVNGFLGAIEGPSSTARLGID 74
Query: 122 ----IPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
+ + T Y+ I +G + T ++DTGSDL W QC PC+ C+ Q P+
Sbjct: 75 GAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPL 134
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGR 232
+ P+ S +Y V C S C AL+ S SPPD C Y+ SYGDG+ T G L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQ------SPWSRCSPPDTGCAYYFSYGDGTSTDGVLAT 188
Query: 233 EHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
E LG +V FGCG N G SGL+G+GR LSLVSQ FSYC +
Sbjct: 189 ETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVT---RFSYCF-T 244
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP-----QLATFYILNLTGISIGGKQL- 345
+A A+ L LG ++ + S+ T +P+P + +++Y L+L GI++G L
Sbjct: 245 PFNATAASPLFLGSSA---RLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLP 301
Query: 346 ------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCF 398
+ + GG++IDSGT T L S + AL A L P A G + L CF
Sbjct: 302 IDPAVFRLTPMGDGGVIIDSGTTFTALEESAFVAL-ARALASRVRLPLASGAHLGLSLCF 360
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
++ + V +P + + F+G A+M + V +S A CL + S ++G+
Sbjct: 361 AAASPEAVEVPRLVLHFDG-ADMELRRESYVVEDRS-AGVACLGMVS---ARGMSVLGSM 415
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
QQ+N ++YD + L F C +
Sbjct: 416 QQQNTHILYDLERGILSFEPAKCGEL 441
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 174/350 (49%), Gaps = 28/350 (8%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
+ VI DTGSDLTWVQC PC CY Q+ P+FDPS S SY+ +LC S C+AL+ +
Sbjct: 107 VIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVS---EQA 163
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGV 260
C+ + C Y SYGD SYT G L E +G S ++ +FGCG N G F +
Sbjct: 164 CTMDT-NICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDEL 222
Query: 261 SGLMGLGRSD-LSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITY 318
+ LSLVSQ S I G FSYCL P ++ + + + G +S + S P
Sbjct: 223 GSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVI---SGPQVV 279
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQAS------GFAKGGILIDSGTVITRLPPSIYS 372
+ + + Q T+Y + L IS+G K+L + KG ++IDSGT +T L ++
Sbjct: 280 STPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFT 339
Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
L+ + + + CF + ++++P++ + F + V + + FV
Sbjct: 340 ELERVLEETVKAERVSDPRGLFSVCFRSAG--DIDLPVIAVHFN---DADVKLQPLNTFV 394
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
K+D +C + S ++ GI GN Q + V YD + + F DC+
Sbjct: 395 KADEDLLCFTMIS---SNQIGIFGNLAQMDFLVGYDLEKRTVSFKPTDCT 441
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 199/402 (49%), Gaps = 34/402 (8%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVD 151
D+ + +L S+ + SG + T P+ SG QT +Y+ LG + + + +D
Sbjct: 48 DDARLLFLSSKAAS--SGGV-----TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALD 97
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
T +D TW C PC +C F P+ S SY + C S C E + +S+
Sbjct: 98 TSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPL 155
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS--GLMGLGRS 269
P C + + D S+ + LG + L LGK ++ + FGC G + GL+GLGR
Sbjct: 156 PACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRG 214
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
+SL+SQT + G+FSYCLPS + SGSL LG +N + YT ++ NP +
Sbjct: 215 PMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPS 270
Query: 330 FYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
Y +N+TG+S+G ++ A FA G +IDSGTVITR +Y+AL+ EF +Q
Sbjct: 271 LYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQV 330
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CL 441
+ DTCFN P V + +G ++T+ + + + S A+ + CL
Sbjct: 331 AAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACL 388
Query: 442 ALASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A+A ++ N QQ+N RV+ D S++GFA E C
Sbjct: 389 AMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 121/354 (34%), Positives = 182/354 (51%), Gaps = 31/354 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+ DTGSDLTW QCQPCK C+ Q P++D ++S S+ V C S+TC + +S C+
Sbjct: 108 ALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASATCLPIW----SSRNCT 163
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHL---GLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
+SS P C Y +YGDG+Y+ G LG E L G SV FGCG +N GL +G +
Sbjct: 164 ASSSP-CRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAFGCGVDNGGLSYNSTGTV 222
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMI 322
GLGR LSLV+Q G FSYCL + ++ G + + ST + T ++
Sbjct: 223 GLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGSPVLFGALAELAAPSTGAAVQSTPLV 279
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSAL- 374
+P + T+Y ++L GIS+G +L GG+++DSGT T L S + +
Sbjct: 280 QSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTFLVESAFRVVV 339
Query: 375 --KAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
A L+Q P S+ CF + Q +P + + F G A+M + +
Sbjct: 340 DHVAGVLRQ----PVVNASSLDSPCFPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMS 395
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
F + ++S CL +A S + I+GN+QQ+N ++++D QL F DC +
Sbjct: 396 FNQEESS-FCLNIAG-SPSADVSILGNFQQQNIQMLFDITVGQLSFMPTDCGKL 447
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 124/417 (29%), Positives = 189/417 (45%), Gaps = 19/417 (4%)
Query: 80 KIVDWNEQQQNRLILDN---LHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYI 136
++ + +Q+ + DN H I +I G + + P+ SG L + Y
Sbjct: 7 RLASFRKQRGRHKLSDNDNGAHNSANPPVITAVIEGPPSHDHDFQSPVVSGSTLGSGQYF 66
Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
LG + ++IVD+GSDL WVQC PC CY Q P++ PS S ++ V C S C
Sbjct: 67 VDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPECL 126
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK 254
+ G C P C Y Y D S ++G E + ++ FGCGR+N+
Sbjct: 127 LIPATEGFP--CDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVRIDKVAFGCGRDNQ 184
Query: 255 GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
G F G++GLG+ LS SQ +G F+YCL + D + S ++ G+ +
Sbjct: 185 GSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGD-ELISTIH 243
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLP 367
+ +T ++ N + T Y + + + +GG+ L S A GG + DSGT +T
Sbjct: 244 DLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGTTVTYWL 303
Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
P Y + A F K +P A LD C +++ + + P + G A
Sbjct: 304 PPAYRNILAAFDKNVR-YPRAASVQGLDLCVDVTGVDQPSFPSFTIVLGGGAVFQPQQGN 362
Query: 428 IVYFVKSDASQVCLALASL-SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
YFV + CLA+A L S IGN Q+N V YD + +++GFA CSS
Sbjct: 363 --YFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKCSS 417
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 128/367 (34%), Positives = 187/367 (50%), Gaps = 46/367 (12%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
I DTGSDLTW+Q +PC CY Q+ P+FDPS S ++ K+ C ++ C+AL+ S
Sbjct: 95 AIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAPCNALD-----ESARS 149
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV--NDFIFGCGRNNKGLFGG-VSGLM 264
+ P C Y SYGD SYT G L + + +G ASV + FGCG N G F SG++
Sbjct: 150 CTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRNVAFGCGTRNGGNFDEQGSGIV 209
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCL----------PSTQDAGASGSLILGGNSSVFKNST 314
GLG +LS VSQ + G FSYCL PS D+ A+ ++ G N VF +S+
Sbjct: 210 GLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPS--DSPATSRIVFGDN-PVFSSSS 266
Query: 315 P---ITYTNMIPNPQLATFYILNLTGISIGGKQL---------------QASGFAKGGIL 356
+ T + N + +T+Y L + I++G K+L S +G I+
Sbjct: 267 TNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNII 326
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
IDSGT +T L Y AL+A +++ + S+ CF S +EV +PL+K+ F
Sbjct: 327 IDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFK-SGKEEVELPLMKVHF 385
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G A+ V++ + FV+++ VC + ++ GI GN Q N V YD +
Sbjct: 386 RGGAD--VELKPVNTFVRAEEGLVCFTMLP---TNDVGIYGNLAQMNFVVGYDLGKRTVS 440
Query: 476 FAGEDCS 482
F DCS
Sbjct: 441 FLPADCS 447
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 190/390 (48%), Gaps = 37/390 (9%)
Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
R I+ ++ S E P+ +G Y+ + +G + + I+DTGSDL W QC+
Sbjct: 70 RRMRSINAMLQSSSGIETPVYAGDG----EYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE 125
Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
PC C++Q P+F+P S S+ + C S C L T N+ +C Y YGD
Sbjct: 126 PCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN--------ECQYTYGYGD 177
Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
GS T+G + E +SV + FGCG +N+G G +GL+G+G LSL SQ +
Sbjct: 178 GSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ---LG 234
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G FSYC+ S + S +L LG +S +P T +I + T+Y + L GI++G
Sbjct: 235 VGQFSYCMTSYGSSSPS-TLALGSAASGVPEGSP--STTLIHSSLNPTYYYITLQGITVG 291
Query: 342 GK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
G QLQ G GG++IDSGT +T LP Y+A+ F Q + S
Sbjct: 292 GDNLGIPSSTFQLQDDG--TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSS 349
Query: 393 ILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
L TCF S V +P + M+F+G +++ + +CLA+ S S +
Sbjct: 350 GLSTCFQQPSDGSTVQVPEISMQFDGGV---LNLGEQNILISPAEGVICLAMGS-SSQLG 405
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I GN QQ+ +V+YD +N + F C
Sbjct: 406 ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 133/436 (30%), Positives = 203/436 (46%), Gaps = 39/436 (8%)
Query: 58 QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
Q ++ I + K + K W D ++YL + + D
Sbjct: 29 QSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPERLKYLST---------LADQ 79
Query: 118 SNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV 174
T +P+ G + L+ NY+ ++LG G+ M +++DT +D WV PC C
Sbjct: 80 KTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTT 136
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
F P+ S + + C+ + C + G S C ++ C + SYG S L ++
Sbjct: 137 FLPNASTTLGSLDCSGAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLTATLVQDA 191
Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
+ L + F FGC G GL+GLGR +SL+SQ ++ G+FSYCLPS +
Sbjct: 192 ITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKS 251
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------GKQLQA 347
SGSL LG I T ++ NP + Y +NLTG+S+G +QL
Sbjct: 252 YYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVF 307
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
G +IDSGTVITR +Y A++ EF KQ +G S+ G DTCF +A E
Sbjct: 308 DPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AATNEAE 363
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRV 465
P + + FEG + +++ S S CL++A+ + +I N QQ+N R+
Sbjct: 364 APAITLHFEGLNLVLPMENSLIH--SSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRI 421
Query: 466 IYDTKNSQLGFAGEDC 481
++DT NS+LG A E C
Sbjct: 422 MFDTTNSRLGIARELC 437
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 128/351 (36%), Positives = 178/351 (50%), Gaps = 30/351 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+ +DTGS L W QCQPC C+NQ P +D S S ++ C+S+ C T +C
Sbjct: 50 LTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT----MCV 105
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVNDFIFGCGRNNKGLF-GGVSGLMG 265
+ + C Y SYGD S T G L E + + ASV +FGCG NN G+F +G+ G
Sbjct: 106 NQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAG 165
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG-NSSVFKNST-PITYTNMIP 323
GR LSL SQ G FS+C T +G S +L + ++KN + T +I
Sbjct: 166 FGRGPLSLPSQLKV---GNFSHCF--TAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIK 220
Query: 324 NPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAE 377
NP TFY L+L GI++G +L S FA GG +IDSGT T LPP +Y + E
Sbjct: 221 NPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDE 280
Query: 378 FLK--QFSGFPSAPGFSILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
F + PS +L CF+ + ++P + + FEG A M + V+ K
Sbjct: 281 FAAHVKLPVVPSNETGPLL--CFSAPPLGKAPHVPKLVLHFEG-ATMHLPRENYVFEAKD 337
Query: 435 DAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ +CLA+ E E IIGN+QQ+N V+YD KNS+L F C +
Sbjct: 338 GGNCSICLAI----IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 384
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 139/432 (32%), Positives = 207/432 (47%), Gaps = 40/432 (9%)
Query: 63 EMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
++ I + K + + K W + D ++YL S +
Sbjct: 31 DLSVIPIYGKCSPFTAPKSESWMNTVIDMASKDPARIRYLSS---------LTAQKTVAA 81
Query: 123 PLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
P+ SG + L NY+ ++LG G+ M +++DT +D W C C C + F
Sbjct: 82 PIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTT--TFSAQN 139
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S ++ + C+ C A G S C ++ DC + +YG S L ++ L LG
Sbjct: 140 SSTFATLDCSKPECTQ---ARGLS--CPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGP 194
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+ +F FGC + G GLMGLGR LSL+SQ+ ++ GLFSYCLPS + SG
Sbjct: 195 NVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSG 254
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG------GKQLQASGFAKG 353
SL LG I T ++ NP + Y +NLTGIS+G +L A G
Sbjct: 255 SLKLG----PVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTG 310
Query: 354 -GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
G +IDSGTVITR P+IY+A++ EF KQ G S G DTCF + EV+ P +
Sbjct: 311 AGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSPLG--AFDTCF--ATNNEVSAPAIT 366
Query: 413 MEFEG-NAEMTVDVTGIVYFVKSDASQVCLALAS--LSYEDETGIIGNYQQKNQRVIYDT 469
+ G + ++ ++ + I S S CLA+A+ + +I N QQ+N R+++D
Sbjct: 367 LHLSGLDLKLPMENSLI---HSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDI 423
Query: 470 KNSQLGFAGEDC 481
NS+LG A E C
Sbjct: 424 NNSKLGIARELC 435
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 189/376 (50%), Gaps = 42/376 (11%)
Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNS 190
Y+ T+ +G + + DTGSDL W QC PC + C+ Q P+++P+ S ++ + CNS
Sbjct: 111 EYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS 170
Query: 191 S--TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
S C C+ C Y +YG G +T G G E G +A V
Sbjct: 171 SLSMCAGALAGAAPPPGCA------CMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVP 223
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + + G +GL+GLGR LSLVSQ + G FSYCL QD ++ +L+L
Sbjct: 224 GVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLL 280
Query: 304 GGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA-------KG 353
G ++++ N T + T + +P ++T+Y LNLTGIS+G K L S A G
Sbjct: 281 GPSAAL--NGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 338
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVN---I 408
G++IDSGT IT L + Y ++A + P+ G + LD CF L A +
Sbjct: 339 GLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVL 398
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
P + + F+G A+M + ++ S + CLA+ + + + GNYQQ+N ++YD
Sbjct: 399 PSMTLHFDG-ADMVLPADS---YMISGSGVWCLAMRNQT-DGAMSTFGNYQQQNMHILYD 453
Query: 469 TKNSQLGFAGEDCSSM 484
+ L FA CS++
Sbjct: 454 VREETLSFAPAKCSTL 469
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 194/377 (51%), Gaps = 45/377 (11%)
Query: 134 NYIATIELGGRNMT--VIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCN 189
Y+ T+ +G ++ I DTGSDL W QC PC C+ Q P+++P+ S ++ + CN
Sbjct: 91 EYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCN 150
Query: 190 SSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDGSYTRGELGREHLGLGKASVND-- 244
SS +GV + +PP C Y +YG G +T G G E G A+ +
Sbjct: 151 SSLSMC-------AGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQAR 202
Query: 245 ---FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
FGC + + G +GL+GLGR LSLVSQ + G FSYCL QD ++ +L
Sbjct: 203 VPGIAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTL 259
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA------- 351
+LG ++++ N T + T + +P ++T+Y LNLTGIS+G K L S A
Sbjct: 260 LLGPSAAL--NGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADG 317
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNLSAYQEV--N 407
GG++IDSGT IT L + Y ++A ++ P+ G + LD C+ L
Sbjct: 318 TGGLIIDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPA 376
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+P + + F+G A+M + ++ S + CLA+ + + + GNYQQ+N ++Y
Sbjct: 377 MPSMTLHFDG-ADMVLPADS---YMISGSGVWCLAMRNQT-DGAMSTFGNYQQQNMHILY 431
Query: 468 DTKNSQLGFAGEDCSSM 484
D +N L FA CS++
Sbjct: 432 DVRNEMLSFAPAKCSTL 448
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 139/446 (31%), Positives = 206/446 (46%), Gaps = 57/446 (12%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE----- 121
+ LEL H + G + ++ R D H R N G I+ S+T
Sbjct: 25 VRLELTHADDRGGYV----GAERVRRAADRSH------RRVNGFLGAIEGPSSTARLGSD 74
Query: 122 ----IPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
+ + T Y+ I +G + T ++DTGSDL W QC PC+ C+ Q P+
Sbjct: 75 GAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPL 134
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGR 232
+ P+ S +Y V C S C AL+ S SPPD C Y+ SYGDG+ T G L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQ------SPWSRCSPPDTGCAYYFSYGDGTSTDGVLAT 188
Query: 233 EHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
E LG +V FGCG N G SGL+G+GR LSLVSQ FSYC +
Sbjct: 189 ETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVT---RFSYCF-T 244
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP-----QLATFYILNLTGISIGGKQL- 345
+A A+ L LG ++ + S+ T +P+P + +++Y L+L GI++G L
Sbjct: 245 PFNATAASPLFLGSSA---RLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLP 301
Query: 346 ------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCF 398
+ + GG++IDSGT T L + AL A L P A G + L CF
Sbjct: 302 IDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVAL-ARALASRVRLPLASGAHLGLSLCF 360
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
++ + V +P + + F+G A+M + V +S A CL + S ++G+
Sbjct: 361 AAASPEAVEVPRLVLHFDG-ADMELRRESYVVEDRS-AGVACLGMVS---ARGMSVLGSM 415
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
QQ+N ++YD + L F C +
Sbjct: 416 QQQNTHILYDLERGILSFEPAKCGEL 441
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 189/368 (51%), Gaps = 38/368 (10%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ T +G + I DTGSD+ W+QC+PC+ CYNQ P+F+PS S SYK + C S
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKL 146
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
CH++ + CS + C Y +SYGD S+++G+L + L L S +
Sbjct: 147 CHSVRDTS-----CSDQN--SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVI 199
Query: 248 GCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG-G 305
GCG +N G FGG SG++GLG +SL++Q GG FSYCL + ++ S IL G
Sbjct: 200 GCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG-----ILIDSG 360
+++V ++ + +P FY L L S+G K+++ G ++GG I+IDSG
Sbjct: 260 DAAVVSGDGVVSTPLIKKDP---VFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 361 TVITRLPPSIYSALKA---EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
T +T +P +Y+ L++ + +K FS+ C++L + E + P++ F+G
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSL---CYSLKS-NEYDFPIITAHFKG 372
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGF 476
+++ I FV VC A + G I GN Q+N V YD + + F
Sbjct: 373 ---ADIELHSISTFVPITDGIVCFAFQP---SPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426
Query: 477 AGEDCSSM 484
DC+ +
Sbjct: 427 KPTDCTKV 434
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 134/373 (35%), Positives = 183/373 (49%), Gaps = 44/373 (11%)
Query: 135 YIATIELGGR--NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ I LG +M I DTGSDL W QC+PC SCY Q +P+FDP+ S +Y+ + C +
Sbjct: 95 YLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKS 154
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIF 247
C L G G CS + C Y SYGDGS+T G+L + L +G SV +F
Sbjct: 155 CSNL----GGQGGCSDDN--TCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVF 208
Query: 248 GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
GCG NN G F SGL+GLG LS++SQ + GG FSYCL P D S + G
Sbjct: 209 GCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGS 268
Query: 306 N---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK---------- 352
S STP+ + Q TFY L L +S+G K+L GF+K
Sbjct: 269 RGIVSGAGAVSTPLA------SRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADE 322
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF-NLSAYQEVNIPLV 411
G I+IDSGT +T LP Y L++ + G P ++ C+ NLS + IP +
Sbjct: 323 GNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYSNLSG---LRIPTI 379
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
F G +++ + FV+ C A+ +S + I GN Q N V YD K+
Sbjct: 380 TAHFVG---ADLELKPLNTFVQVQEDLFCFAMIPVS---DLAIFGNLAQMNFLVGYDLKS 433
Query: 472 SQLGFAGEDCSSM 484
+ F DC+ +
Sbjct: 434 RTVSFKPTDCTKI 446
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 123/390 (31%), Positives = 190/390 (48%), Gaps = 41/390 (10%)
Query: 123 PLTSGIRLQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
P+T+ + Y+ +G + + + +DTGSDL W QC PC C++Q P+FDPS+
Sbjct: 75 PVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSV 134
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL---- 235
S +++ V C C ++G S + C Y SYGD S T G + ++
Sbjct: 135 SSTFRAVACPDPICRP---SSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMS 191
Query: 236 ----GLGKASVNDFIFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
G +V+ FGCG N G+F SG+ G GR LSL SQ + G FSYCL
Sbjct: 192 PNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQ---LRVGRFSYCLT 248
Query: 291 S--TQDAGASGSLILGGNSSVFK--NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL- 345
S ++ + ++ LG + + +S P T +I +P TFY L+L GI++G +L
Sbjct: 249 SHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP 308
Query: 346 -QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
+S FA GG +IDSGT +T P +++ LK EF+ Q P + N
Sbjct: 309 VDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQL----PLPRYDNTSEVGN 364
Query: 400 LSAYQEVN----IPLVKMEFE-GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
L +Q +P+ K+ F +A+M + ++ D + L E + +
Sbjct: 365 LLCFQRPKGGKQVPVPKLIFHLASADMDLPREN---YIPEDTDSGVMCLMINGAEVDMVL 421
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IGN+QQ+N ++YD +NS+L FA C M
Sbjct: 422 IGNFQQQNMHIVYDVENSKLLFASAQCDKM 451
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 128/385 (33%), Positives = 181/385 (47%), Gaps = 42/385 (10%)
Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD-PVFDPSISPSYKKVLC 188
T Y+ + +G R + + +DTGSDL W QC PC +C++Q PV DP+ S ++ V C
Sbjct: 91 TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRC 150
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND---- 244
++ C AL F + G SS C Y YGD S T G+L + G D
Sbjct: 151 DAPVCRALPFTSCGRG-GSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGV 209
Query: 245 ----FIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
FGCG NKG+F +G+ G GR SL SQ FSYC S ++ S
Sbjct: 210 SERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTS---FSYCFTSMFES-TSS 265
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGFAKGGI 355
+ LG + + + T ++ +P + Y L+L I++G ++ + +
Sbjct: 266 LVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASA 325
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFNL-------SAY---- 403
+IDSG IT LP +Y A+KAEF+ Q G P SA S LD CF L SA+
Sbjct: 326 IIDSGASITTLPEDVYEAVKAEFVAQV-GLPVSAVEGSALDLCFALPSAAAPKSAFGWRW 384
Query: 404 ------QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGIIG 456
V +P + G A+ + V F A +CL L A+ D+T +IG
Sbjct: 385 RGRGRAMPVRVPRLVFHLGGGADWELPRENYV-FEDYGARVMCLVLDAATGGGDQTVVIG 443
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
NYQQ+N V+YD +N L FA C
Sbjct: 444 NYQQQNTHVVYDLENDVLSFAPARC 468
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 186/369 (50%), Gaps = 29/369 (7%)
Query: 133 LNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
+ Y+ + +G + + DTGSDLTW QCQPCK C+ Q PV+DPS S ++ V C+S
Sbjct: 75 VEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 134
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA------SVND 244
+TC S CS+ S C Y SY DG+Y+ G LG E L LG + SV+D
Sbjct: 135 ATC----LPVLRSRNCSTPS-SLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSD 189
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGCG +N G +G +GLGR LSL++Q G FSYCL ++ +LG
Sbjct: 190 VAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGV---GKFSYCLTDFFNSTLDSPFLLG 246
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------GKQLQASGFAKGGILI 357
+ + + T ++ +P + Y+++L GI++G K + GG+++
Sbjct: 247 TLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVV 306
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKMEF 415
DSGT + LP S + + + + Q G P S+ CF A Q +P + + F
Sbjct: 307 DSGTTFSILPESGFRVV-VDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPFMPDLVLHF 365
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G A+M + + + + D+S CL + + ++GN+QQ+N ++++D QL
Sbjct: 366 AGGADMRLHRDNYMSYNQEDSS-FCLNIVGTT--STWSMLGNFQQQNIQMLFDMTVGQLS 422
Query: 476 FAGEDCSSM 484
F DCS +
Sbjct: 423 FLPTDCSKL 431
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 121/352 (34%), Positives = 178/352 (50%), Gaps = 30/352 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+ DTGSDLTW QCQPCK C+ Q PV+DPS S ++ V C+S+TC T S CS
Sbjct: 81 ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATC----LPTWRSRNCS 136
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA------SVNDFIFGCGRNNKGLFGGVS 261
+ S P C Y SY DG+Y+ G LG E L +G + SV FGCG +N G +
Sbjct: 137 NPSSP-CRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNST 195
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
G +GLGR LSL++Q G FSYCL ++ LG + + + T +
Sbjct: 196 GTVGLGRGTLSLLAQLGV---GKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPL 252
Query: 322 IPNPQLATFYILNLTGISIGGKQ---------LQASGFAKGGILIDSGTVITRLPPSIYS 372
+ +P + Y +NL GIS+G + L+A G GG+++DSGT T L S +
Sbjct: 253 LQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADG--NGGMMVDSGTTFTILAKSGFR 310
Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
+ + + Q G P S+ CF S E +P + + F G A+M + + +
Sbjct: 311 EV-VDRVAQLLGQPPVNASSLDSPCFP-SPDGEPFMPDLVLHFAGGADMRLHRDNYMSYN 368
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ D+S CL + + +GN+QQ+N ++++D QL F DCS +
Sbjct: 369 EDDSS-FCLNI--VGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDCSKL 417
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 184/348 (52%), Gaps = 37/348 (10%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
+N+ +I+DTGSD TW++C C +C+N++ P F+PS+S SY C ST
Sbjct: 140 QNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSCIPST--------- 190
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
NY ++Y D SY++G + + L F FGCG + G FG S
Sbjct: 191 -----------KTNYTMNYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGDFGSAS 239
Query: 262 GLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
G++GL + + SL+SQT+ F FSYC P ++ GSL+ G S + +T
Sbjct: 240 GVLGLAQGEQYSLISQTASKFKKKFSYCFPHNENT--RGSLLFG--EKAISASPSLKFTR 295
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
++ NP + Y + L GIS+ K+L S FA G +IDSGTVIT LP + Y AL+ F
Sbjct: 296 LL-NPSSGSVYFVELIGISVAKKRLNVSSSLFASPGTIIDSGTVITHLPTAAYEALRTAF 354
Query: 379 LKQFSGFPSA---PGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
++ PS P LDTC+NL + + +P + + F G ++++ +GI++
Sbjct: 355 QQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILW-AN 413
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
D +Q CLA A S+ IIGN QQ + +V+YD + +LGF G DC
Sbjct: 414 GDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGF-GNDC 460
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 139/378 (36%), Positives = 188/378 (49%), Gaps = 42/378 (11%)
Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
E P+ SG Y+ I G + T IVDTGSDL WVQC PCKSCY FDPS
Sbjct: 80 ETPVASG----NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPS 135
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S SYK + C S+ C L F + C++S C Y YGDGS T G L + + +G
Sbjct: 136 KSASYKTLGCGSNFCQDLPFQS-----CAAS----CQYDYMYGDGSSTSGALSTDDVTIG 186
Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
+ + FGCG +N G F G GL+GLG+ LSLVSQ FSYCL S
Sbjct: 187 TGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTS 246
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----A 351
I G+S++ + + YT M+ N TFY L GIS+ GK + A+ F
Sbjct: 247 PLYI--GDSTL---AGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATG 301
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLSAYQEVNIPL 410
+GG+++DSGT +T L ++ + A LK +P A G F L+ CF+ + P
Sbjct: 302 RGGLILDSGTTLTYLDVDAFNPMVAA-LKAALPYPEADGSFYGLEYCFSTAGVANPTYPT 360
Query: 411 VKMEFEG-NAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETG--IIGNYQQKNQRVI 466
V F G + + D T F+ D CLA+AS TG I GN QQ N ++
Sbjct: 361 VVFHFNGADVALAPDNT----FIALDFEGTTCLAMAS-----STGFSIFGNIQQLNHVIV 411
Query: 467 YDTKNSQLGFAGEDCSSM 484
+D N ++GF +C ++
Sbjct: 412 HDLVNKRIGFKSANCETI 429
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 133/389 (34%), Positives = 191/389 (49%), Gaps = 57/389 (14%)
Query: 134 NYIATIELGGRNMT--VIVDTGSDLTWVQCQPC--------KSCYNQQDPVFDPSISPSY 183
YI T+ +G ++ I DTGSDL W QC PC C+ Q +++PS S ++
Sbjct: 86 EYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145
Query: 184 KKVLCNS--STCHALEFATGNSGVCSSSSPPDCN--YFVSYGDGSYTRGELGREHLGLGK 239
+ CNS S C A+ S PP C Y +YG G +T G E G
Sbjct: 146 GVLPCNSPLSMCAAMA---------GPSPPPGCACMYNQTYGTG-WTAGVQSVETFTFGS 195
Query: 240 AS------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
+S V + FGC + + G +GL+GLGR +SLVSQ + G FSYCL Q
Sbjct: 196 SSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQ---LGAGAFSYCLTPFQ 252
Query: 294 DAGASGSLILGGNSSV-FKNSTPITYTNMIPNPQ---LATFYILNLTGISIG-------- 341
DA ++ +L+LG +++ K + P+ T + P ++T+Y LNLTGIS+G
Sbjct: 253 DANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPP 312
Query: 342 -GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSA--PGFSI-LDT 396
L+A G GG++IDSGT IT L S Y ++A + P A P S LD
Sbjct: 313 DAFSLRADG--TGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDL 370
Query: 397 CFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGII 455
CF L A +P + + FEG A+M + V + + CLA+ + + ++
Sbjct: 371 CFALKASTPPPAMPSMTLHFEGGADMVLPVENYMIL---GSGVWCLAMRNQTV-GAMSMV 426
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
GNYQQ+N V+YD + L FA CSS+
Sbjct: 427 GNYQQQNIHVLYDVRKETLSFAPAVCSSL 455
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 181/379 (47%), Gaps = 68/379 (17%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
+V+ DTGS L W QC PC C + P F P+ S ++ K+ C SS C +F T
Sbjct: 102 TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLC---QFLTSPYL 158
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
C+++ C Y+ YG G +T G L E L +G AS FGC N G+ SG++
Sbjct: 159 TCNATG---CVYYYPYGMG-FTAGYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIV 213
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSL--ILGGNSSVFKNSTPITY 318
GLGRS LSLVSQ G FSYCL S DAG S GSL + GGN STP
Sbjct: 214 GLGRSPLSLVSQVGV---GRFSYCLRSDADAGDSPILFGSLAKVTGGN----VQSTP--- 263
Query: 319 TNMIPNPQL--ATFYILNLTGISIGGKQLQAS----GFAK-------GGILIDSGTVITR 365
++ NP++ +++Y +NLTGI++G L + GF + GG ++DSGT +T
Sbjct: 264 --LLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTY 321
Query: 366 LPPSIYSALKAEFLKQFS-------------GFPSAPGFSILDTCFNLSAY---QEVNIP 409
L Y+ +K FL Q + GF D CF+ +A V +P
Sbjct: 322 LVKEGYAMVKRAFLSQMATANLTTTVNGTRFGF---------DLCFDATAAGGGSGVPVP 372
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSD----ASQVCLALASLSYEDETGIIGNYQQKNQRV 465
+ + F G AE V V V D A+ CL + S + IIGN Q + V
Sbjct: 373 TLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHV 432
Query: 466 IYDTKNSQLGFAGEDCSSM 484
+YD FA DC+++
Sbjct: 433 LYDLDGGMFSFAPADCANV 451
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 130/414 (31%), Positives = 193/414 (46%), Gaps = 50/414 (12%)
Query: 82 VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIE 140
V W ++ L+ D +QYL S K +P+ SG + Q+ YI
Sbjct: 52 VSW----ESTLLKDKARLQYLSSLAKK-----------PSVPIASGRAIVQSPTYIVRAN 96
Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
+G + M V +DT +D WV C C C + +FDPS S S + + C++ C
Sbjct: 97 IGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV--LFDPSKSSSSRNLQCDAPQCKQAPN 154
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
T +G C + ++YG GS L ++ L L + + FGC G
Sbjct: 155 PTCTAGK-------SCGFNMTYG-GSTIEASLTQDTLTLANDVIKSYTFGCISKATGTSL 206
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
GLMGLGR LSL+SQT ++ FSYCLP+++ + SGSL LG + I
Sbjct: 207 PAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLRLGPKYQPVR----IKT 262
Query: 319 TNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIY 371
T ++ NP+ ++ Y +NL GI +G K + S A G + DSGTV TRL Y
Sbjct: 263 TPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTRLVEPAY 322
Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
A++ EF ++ +A DTC++ S V P V F G M V +
Sbjct: 323 VAVRNEFRRRIKNA-NATSLGGFDTCYSGS----VVYPSVTFMFAG---MNVTLPPDNLL 374
Query: 432 VKSDA-SQVCLALASLSYEDET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S + S CLA+A+ + +I + QQ+N RV+ D NS+LG + E C+
Sbjct: 375 IHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETCT 428
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 138/437 (31%), Positives = 216/437 (49%), Gaps = 51/437 (11%)
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
G ++E+ H++ E Q R + + +H ++ N ++ + ++ E +
Sbjct: 27 GGFSVEMIHRDSSRSPFFSPTETQFQR-VANAVHRSINRA---NHLNQSFVSPNSPETTV 82
Query: 125 TSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
S + Y+ + +G ++ V I+DTGSD+ W+QCQPCK CY Q P+FD S S +
Sbjct: 83 ISALG----EYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQT 138
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV 242
YK + C S+TC +++ CSS C Y + Y DGS + G+L E L LG +
Sbjct: 139 YKTLPCPSNTCQSVQ-----GTFCSSRK--HCLYSIHYVDGSQSLGDLSVETLTLGSTNG 191
Query: 243 NDFIF-----GCGRNNK-GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDA 295
+ F GCGR N G+ SG++GLGR +SL++Q S GG FSYCL P A
Sbjct: 192 SPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTA 251
Query: 296 GASGSLILGGNSSVFKN----STPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----A 347
+ + GN++V STP+ N + FY L L S+G +++
Sbjct: 252 SSKLNF---GNAAVVSGRGTVSTPLFSKNGL------VFYFLTLEAFSVGRNRIEFGSPG 302
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ-EV 406
SG KG I+IDSGT +T LP +YS L+A K +L C+ ++ + +
Sbjct: 303 SG-GKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDA 361
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRV 465
++P++ F G A++T++ I FV+ VC A ETG + GN Q+N V
Sbjct: 362 SVPVITAHFSG-ADVTLN--AINTFVQVADDVVCFAFQ----PTETGAVFGNLAQQNLLV 414
Query: 466 IYDTKNSQLGFAGEDCS 482
YD + + + F DC+
Sbjct: 415 GYDLQMNTVSFKHTDCT 431
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 174/368 (47%), Gaps = 38/368 (10%)
Query: 141 LGGRNM-----------TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
+GG NM +V+ DTGSDL W QC PC C+ Q P F P+ S ++ K+ C
Sbjct: 83 VGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCT 142
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
SS C +F + C+++ C Y YG G YT G L E L +G AS FGC
Sbjct: 143 SSFC---QFLPNSIRTCNATG---CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGC 195
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
N G+ SG+ GLGR LSL+ Q G FSYCL S AGAS IL G+ +
Sbjct: 196 STEN-GVGNSTSGIAGLGRGALSLIPQLGV---GRFSYCLRSGSAAGASP--ILFGSLAN 249
Query: 310 FKNSTPITYTNMIPNPQL-ATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSG 360
+ + T + NP + ++Y +NLTGI++G L + GF + GG ++DSG
Sbjct: 250 LTDGN-VQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSG 308
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGN 418
T +T L Y +K FL Q + + G LD CF + +P + + F+G
Sbjct: 309 TTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGG 368
Query: 419 AEMTVDV--TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
AE V G+ + + CL + + +IGN Q + ++YD F
Sbjct: 369 AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSF 428
Query: 477 AGEDCSSM 484
A DC+ +
Sbjct: 429 APADCAKV 436
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 181/379 (47%), Gaps = 33/379 (8%)
Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L Y ++LG + +I+DTGSD++W+QC PCK C P F+P S S+ K+ C S
Sbjct: 136 LEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCAS 195
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE-------HLGLGK-ASV 242
STC G CS S C + + YGDGS + G L E + G G+ +
Sbjct: 196 STC--TNVYQGVKPFCSPSG-RTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 252
Query: 243 NDFIFGCGR-NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
++ GC + +GL G SGL+G+ R +S SQ S + FS+C P S L
Sbjct: 253 SNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGL 312
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLAT----FYILNLTGISIGGKQLQASG-------- 349
+ G S + S + YT ++ NP + + +Y + L GIS+ +L S
Sbjct: 313 VFFGESDII--SPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 370
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL----SAYQE 405
GG +IDSGT T L + A++ EFL + S S C+N+ +A +
Sbjct: 371 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 430
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQR 464
+P + + F G ++ + I+ V S Q L LA +S + IIGNYQQ+N
Sbjct: 431 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGNYQQQNLW 490
Query: 465 VIYDTKNSQLGFAGEDCSS 483
V YD + +LG A C++
Sbjct: 491 VEYDLEKLRLGIAPAQCAT 509
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 180/379 (47%), Gaps = 33/379 (8%)
Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L Y +++G + +I+DTGSD++W+QC PCK C P F+P S S+ K+ C S
Sbjct: 137 LEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCAS 196
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE-------HLGLGK-ASV 242
STC G CS S C + + YGDGS + G L E + G G+ +
Sbjct: 197 STC--TNVYQGVKPFCSPSG-RTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 253
Query: 243 NDFIFGCGR-NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
++ GC + +GL G SGL+G+ R +S SQ S + FS+C P S L
Sbjct: 254 SNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGL 313
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLAT----FYILNLTGISIGGKQLQASG-------- 349
+ G S + S + YT ++ NP + + +Y + L GIS+ +L S
Sbjct: 314 VFFGESDII--SPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 371
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS----AYQE 405
GG +IDSGT T L + A++ EFL + S S C+N++ A +
Sbjct: 372 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 431
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED-ETGIIGNYQQKNQR 464
+P + + F G ++ + I+ V S Q L LA L D IIGNYQQ+N
Sbjct: 432 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQNLW 491
Query: 465 VIYDTKNSQLGFAGEDCSS 483
V YD + +LG A C++
Sbjct: 492 VEYDLEKLRLGIAPAQCAT 510
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 130/376 (34%), Positives = 194/376 (51%), Gaps = 38/376 (10%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNS 190
YI T+ +G ++ I DTGSDL W QC PC + C+ Q P+++PS SP+++ + C+S
Sbjct: 91 EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 191 STCHALEFATGNSGVCSSSSPPDC--NYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
AL + + ++ PP C Y +YG G +T G G E G + V
Sbjct: 151 ----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVP 205
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + + G +GL+GLGR LSLVSQ + G+FSYCL QD + +L+L
Sbjct: 206 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQ---LAAGMFSYCLTPFQDTKSKSTLLL 262
Query: 304 GGNSSVFK-NSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
G ++ N T + T +P+P ++T+Y LNLTGIS+G L FA
Sbjct: 263 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGT 322
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNI 408
GG++IDSGT IT L + Y ++A ++ P G + LD CF L S+ +
Sbjct: 323 GGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 381
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
P + + F G A+M + V + D CLA+ S + + E +GNYQQ+N ++YD
Sbjct: 382 PSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQT-DGELSTLGNYQQQNLHILYD 437
Query: 469 TKNSQLGFAGEDCSSM 484
+ L FA CS++
Sbjct: 438 VQKETLSFAPAKCSTL 453
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 124/370 (33%), Positives = 183/370 (49%), Gaps = 48/370 (12%)
Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ + LG V IVDT SD+ WVQCQ C++CYN P+FDPS S +YK + C+S+
Sbjct: 87 DYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSST 146
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND------- 244
TC +++ G S CSS C + V+Y DGS+++G+L E + LG S ND
Sbjct: 147 TCKSVQ---GTS--CSSDERKICEHTVNYKDGSHSQGDLIVETVTLG--SYNDPFVHFPR 199
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GS 300
+ GC RN F + G++GLG +SLV Q S FSYCL D + +
Sbjct: 200 TVIGCIRNTNVSFDSI-GIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSKLKFGDA 258
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-----ASGFAKGGI 355
++ G+ +V ST I + + FY L L S+G +++ + KG I
Sbjct: 259 AMVSGDGTV---STRIVFKDW------KKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNI 309
Query: 356 LIDSGTVITRLPPSIYSALK---AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
+IDSGT T LP +YS L+ A+ +K FS+ C+ S Y +V++P++
Sbjct: 310 IIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSL---CYK-STYDKVDVPVIT 365
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G V + + F+ + VCLA S I GN Q+N V YD +
Sbjct: 366 AHFSG---ADVKLNALNTFIVASHRVVCLAFLS---SQSGAIFGNLAQQNFLVGYDLQRK 419
Query: 473 QLGFAGEDCS 482
+ F DC+
Sbjct: 420 IVSFKPTDCT 429
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 115/344 (33%), Positives = 174/344 (50%), Gaps = 28/344 (8%)
Query: 146 MTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGN 202
+TV++DT D+ W++C PC C + +DP+ S +Y CNSS C L +A G
Sbjct: 163 VTVVLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANG- 216
Query: 203 SGVCSSSSPPDCNYFV-SYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGG- 259
C ++ C Y V + GD T G + L + V F FGC +N +G F
Sbjct: 217 ---CDANG--QCQYMVVTAGDSFTTSGTYSSDVLTINSGDRVEGFRFGCSQNEQGSFENQ 271
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
G+M LGR SL++QTS +G FSYCLP T+ + + +S +TP+
Sbjct: 272 ADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTETTKGFFQIGVPIGASYRFVTTPMLKE 331
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
+ AT Y L I++ GK+L A FA G ++ DS T+ITRLP + Y AL+A
Sbjct: 332 RGGASAAAATLYRALLLAITVDGKELNVPAEVFAAGTVM-DSRTIITRLPVTAYGALRAA 390
Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
F + + AP LDTC++L+ + +P + + F+GNA + +D +GI+
Sbjct: 391 FRNRMR-YRVAPPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILL------- 442
Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA AS + I+GN QQ+ +V++D ++GF C
Sbjct: 443 NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 128/370 (34%), Positives = 183/370 (49%), Gaps = 27/370 (7%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ T Y+ + +G + + + +DTGSDL W QCQPC +C++Q P FDPS S +
Sbjct: 77 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
C+S+ C L A+ S + C Y SYGD S T G L + ASV
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 194
Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGCG N G+F +G+ G GR LSL SQ G FS+C + S +++L
Sbjct: 195 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPS-TVLLD 250
Query: 305 GNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILI 357
+ ++K+ + T +I NP TFY L+L GI++G +L S FA GG +I
Sbjct: 251 LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTII 310
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEF 415
DSGT +T LP +Y ++ F Q P G + D F LSA +P + + F
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHF 368
Query: 416 EGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
EG A M + V+ V+ S + CLA+ E IGN+QQ+N V+YD +NS+L
Sbjct: 369 EG-ATMDLPRENYVFEVEDAGSSILCLAIIE---GGEVTTIGNFQQQNMHVLYDLQNSKL 424
Query: 475 GFAGEDCSSM 484
F C +
Sbjct: 425 SFVPAQCDKL 434
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 130/376 (34%), Positives = 194/376 (51%), Gaps = 38/376 (10%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNS 190
YI T+ +G ++ I DTGSDL W QC PC + C+ Q P+++PS SP+++ + C+S
Sbjct: 96 EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 155
Query: 191 STCHALEFATGNSGVCSSSSPP--DCNYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
AL + + ++ PP C Y +YG G +T G G E G + V
Sbjct: 156 ----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVP 210
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + + G +GL+GLGR LSLVSQ + G+FSYCL QD + +L+L
Sbjct: 211 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQ---LAAGMFSYCLTPFQDTKSKSTLLL 267
Query: 304 GGNSSVFK-NSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
G ++ N T + T +P+P ++T+Y LNLTGIS+G L FA
Sbjct: 268 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGT 327
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNI 408
GG++IDSGT IT L + Y ++A ++ P G + LD CF L S+ +
Sbjct: 328 GGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 386
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
P + + F G A+M + V + D CLA+ S + + E +GNYQQ+N ++YD
Sbjct: 387 PSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQT-DGELSTLGNYQQQNLHILYD 442
Query: 469 TKNSQLGFAGEDCSSM 484
+ L FA CS++
Sbjct: 443 VQKETLSFAPAKCSTL 458
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 137/406 (33%), Positives = 184/406 (45%), Gaps = 49/406 (12%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN---YIATIELGGRNM--TVIVDTGSDLT 157
++R+ + S + + P+T+ L T + Y+ + +G + T I+DTGSDL
Sbjct: 55 KARVAALQSAAVSPAPVAD-PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLI 113
Query: 158 WVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
W QC PC C Q P FD S +Y+ + C SS C AL +S C C Y
Sbjct: 114 WTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSRCAAL-----SSPSCFKKM---CVYQ 165
Query: 218 VSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
YGD + T G L E G AS + FGCG N G SG++G GR LS
Sbjct: 166 YYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLS 225
Query: 273 LVSQTSEIFGGLFSYCL-------PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LVSQ FSYCL PS G +L NS+ + +P+ T + NP
Sbjct: 226 LVSQLGP---SRFSYCLTSYLSPTPSRLYFGVFANL----NSTNTSSGSPVQSTPFVINP 278
Query: 326 QLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLPPSIYSALKAEF 378
L Y L++ GIS+G K+L GG++IDSGT IT L Y A++
Sbjct: 279 ALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVR-RG 337
Query: 379 LKQFSGFPSAPGFSI-LDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
L P+ I LDTCF V +P F+G A MT+ + + S
Sbjct: 338 LASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFDG-ANMTLPPENYM-LIAST 395
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+CLA+A S IIGNYQQ+N ++YD NS L F C
Sbjct: 396 TGYLCLAMAPTSVGT---IIGNYQQQNLHLLYDIANSFLSFVPAPC 438
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 129/375 (34%), Positives = 186/375 (49%), Gaps = 39/375 (10%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
Y+ + +G + I DTGSDL W QC PC S C+ Q P+++PS S ++ + CNSS
Sbjct: 92 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151
Query: 192 TCHALEFATGNSGVCSSSSPP--DCNYFVSYGDGSYTRGELGREHLGLGK-----ASVND 244
L ++ PP C Y V+YG G +T G E G A V
Sbjct: 152 ----LSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPG 206
Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + G SGL+GLGR LSLVSQ + FSYCL QD ++ +L+L
Sbjct: 207 IAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLL 263
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLA---TFYILNLTGISIGGKQLQASGFA-------KG 353
G ++S+ + ++ T + +P A TFY LNLTGIS+G L A G
Sbjct: 264 GPSASL-NGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTG 322
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIP 409
G++IDSGT IT L + Y ++A + + P+ G + LD CF L S +P
Sbjct: 323 GLIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGSADTGLDLCFMLPSSTSAPPAMP 381
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ + F G A+M + Y + D+ CLA+ + + + E I+GNYQQ+N ++YD
Sbjct: 382 SMTLHFNG-ADMVLPADS--YMMSDDSGLWCLAMQNQT-DGEVNILGNYQQQNMHILYDI 437
Query: 470 KNSQLGFAGEDCSSM 484
L FA CS++
Sbjct: 438 GQETLSFAPAKCSAL 452
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/360 (34%), Positives = 174/360 (48%), Gaps = 41/360 (11%)
Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
I+DTGSDLTW QC PC +C+ Q P++DP+ S ++ K+ C S C AL A C
Sbjct: 111 AIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASPLCQALPSAFR---AC 167
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGL--------GKASVNDFIFGCGRNNKGLFG 258
+++ C Y Y G +T G L + L + +S FGC N G
Sbjct: 168 NATG---CVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFGCSTANGGDMD 223
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
G SG++GLGRS LSL+SQ I G FSYCL S DAGAS ++ G ++V + +
Sbjct: 224 GASGIVGLGRSALSLLSQ---IGVGRFSYCLRSDADAGAS-PILFGALANVTGDK--VQS 277
Query: 319 TNMIPNP----QLATFYILNLTGISIGGKQLQAS----GF---AKGGILIDSGTVITRLP 367
T ++ NP + A +Y +NLTGI++G L + GF GG+++DSGT T L
Sbjct: 278 TALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLA 337
Query: 368 PSIYSALKAEFLKQFSGF---PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
+ Y+ L+ FL Q +G S F D CF A + +P + F G AE V
Sbjct: 338 EAGYTMLRQAFLSQTAGLLTRVSGAQFD-FDLCFEAGA-ADTPVPRLVFRFAGGAEYAVP 395
Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V CL + +IGN Q + V+YD + FA DC+S+
Sbjct: 396 RQSYFDAVDEGGRVACLLVLP---TRGVSVIGNVMQMDLHVLYDLDGATFSFAPADCASL 452
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 130/376 (34%), Positives = 194/376 (51%), Gaps = 38/376 (10%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNS 190
YI T+ +G ++ I DTGSDL W QC PC + C+ Q P+++PS SP+++ + C+S
Sbjct: 91 EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 191 STCHALEFATGNSGVCSSSSPPDC--NYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
AL + + ++ PP C Y +YG G +T G G E G + V
Sbjct: 151 ----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVP 205
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + + G +GL+GLGR LSLVSQ + G+FSYCL QD + +L+L
Sbjct: 206 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQ---LAAGMFSYCLTPFQDTKSKSTLLL 262
Query: 304 GGNSSVFK-NSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
G ++ N T + T +P+P ++T+Y LNLTGIS+G L FA
Sbjct: 263 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGT 322
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNI 408
GG++IDSGT IT L + Y ++A ++ P G + LD CF L S+ +
Sbjct: 323 GGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 381
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
P + + F G A+M + V + D CLA+ S + + E +GNYQQ+N ++YD
Sbjct: 382 PSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQT-DGELSTLGNYQQQNLHILYD 437
Query: 469 TKNSQLGFAGEDCSSM 484
+ L FA CS++
Sbjct: 438 VQKETLSFAPAKCSTL 453
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 128/375 (34%), Positives = 187/375 (49%), Gaps = 39/375 (10%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
Y+ + +G + I DTGSDL W QC PC S C+ Q P+++PS S ++ + CNSS
Sbjct: 90 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149
Query: 192 TCHALEFATGNSGVCSSSSPP--DCNYFVSYGDGSYTRGELGREHLGLG-----KASVND 244
L ++ PP C Y V+YG G +T G E G ++ V
Sbjct: 150 ----LSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRVPG 204
Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + G SGL+GLGR LSLVSQ + FSYCL QD ++ +L+L
Sbjct: 205 IAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLL 261
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLA---TFYILNLTGISIGGKQLQASGFA-------KG 353
G ++S+ + ++ T + +P A TFY LNLTGIS+G L A G
Sbjct: 262 GPSASL-NGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGTG 320
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIP 409
G++IDSGT IT L + Y ++A + + P+ G + LD CF L S +P
Sbjct: 321 GLIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGSAATGLDLCFMLPSSTSAPPAMP 379
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ + F G A+M + Y + D+ CLA+ + + + E I+GNYQQ+N ++YD
Sbjct: 380 SMTLHFNG-ADMVLPADS--YMMSDDSGLWCLAMQNQT-DGEVNILGNYQQQNMHILYDI 435
Query: 470 KNSQLGFAGEDCSSM 484
L FA CS++
Sbjct: 436 GQETLSFAPAKCSAL 450
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 137/446 (30%), Positives = 211/446 (47%), Gaps = 60/446 (13%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
+ ++L H + +G + E+ + + + + Y Q + + SG++ ++
Sbjct: 28 LRMKLTHVDDKAGYTTE--ERVRRAVAVSRERLAYTQQQQQLRASGDV----------SA 75
Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQC-QPC--KSCYNQQDPVFDPSISP 181
+ L T YIA +G + ++DTGS+L W QC C K+C Q P ++ S S
Sbjct: 76 PVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSS 135
Query: 182 SYKKVLCNSST--CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
++ V C S C A +GV C + SYG GS G LG E +
Sbjct: 136 TFAAVPCADSAKLCAA-------NGVHLCGLDGSCTFAASYGAGS-VFGSLGTEAFTF-Q 186
Query: 240 ASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDA 295
+ FGC R KG G SGL+GLGR LSLVSQT FSYCL P ++
Sbjct: 187 SGAAKLGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGAT---KFSYCLTPYLRNH 243
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ------ 346
GAS L +G ++S+ +T + +P+ +TFY L L GIS+G +L
Sbjct: 244 GASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAF 303
Query: 347 -----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNL 400
A+G+ GG++ID+G+ +T L + YSAL E +Q + P + LD C
Sbjct: 304 ELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCV-- 361
Query: 401 SAYQEVN--IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
A Q+V+ +P++ F G A+M V Y+ D S C+ + YE +IGN+
Sbjct: 362 -ARQDVDKVVPVLVFHFGGGADMAVSAGS--YWGPVDKSTACMLIEEGGYET---VIGNF 415
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
QQ++ ++YD +L F DCS +
Sbjct: 416 QQQDVHLLYDIGKGELSFQTADCSVL 441
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/414 (30%), Positives = 194/414 (46%), Gaps = 33/414 (7%)
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATI 139
K V +E + + + V+++ +R + ++ ++ E PL Y+ I
Sbjct: 4 KGVKRSEAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPL----HPDGGGYVMDI 59
Query: 140 ELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
+G G+ I DTGSDL WVQ +PC C +FDP S +++++ C+S C L
Sbjct: 60 SVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCAELP 117
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRN 252
G C S C+Y YG G T GE R+ + LG S F GCG
Sbjct: 118 ------GSCEPGSS-TCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMV 169
Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
N G F GV GL+GLG+ +SL SQ S FSYCL S L+ G ++++ +
Sbjct: 170 NSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAAL--H 226
Query: 313 STPITYTNMI-PNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIY 371
T I T + P+ T+Y+L + GI++ G+ + + G +IDSGT +T +P +Y
Sbjct: 227 GTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG----SPGTTIIDSGTTLTYVPSGVY 282
Query: 372 SALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
+ + ++ P G S+ LD C++ S+ + P + + G A MT +
Sbjct: 283 GRVLSR-MESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAG-ATMTPPSSNYFL 340
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V VCLA+ S S IIGN Q+ ++YD +S+L F C S+
Sbjct: 341 VVDDSGDTVCLAMGSASGL-PVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 126/429 (29%), Positives = 191/429 (44%), Gaps = 58/429 (13%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
+ L H+ + + +RL D + + +N+ P+ S
Sbjct: 78 VRFLLAHREAFAAPNATAAQLLAHRLARDAARAEAISVSARNVTRAG----GGFSAPVVS 133
Query: 127 GIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G+ + Y A++ +G +++DTGSD+ W+QC PC+ CY Q VFDP S SY
Sbjct: 134 GLAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYA 193
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVN 243
V C + C L+ G + C Y V+YGDGS T G+L E L + A V
Sbjct: 194 AVRCGAPPCRGLDAGGGGGCDRRRGT---CLYQVAYGDGSVTAGDLATETLWFARGARVP 250
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
GCG +N+GLF +GL+GLGR LSL +QT+ +G FSYC
Sbjct: 251 RVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYC--------------- 295
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---------FAKGG 354
F+ S L I+ +GG +++ G +GG
Sbjct: 296 ------FQGS------------DLDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGG 337
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEVNIPLVKM 413
+++DSGT +TRL +Y A++ F G AP GFS+ DTC++L + V +P V +
Sbjct: 338 VILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSV 397
Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
G AE+ + Y + D CLALA + I+GN QQ+ RV++D
Sbjct: 398 HLAGGAEVALPPEN--YLIPVDTRGTFCLALAGT--DGGVSIVGNIQQQGFRVVFDGDRQ 453
Query: 473 QLGFAGEDC 481
++ + C
Sbjct: 454 RVALVPKSC 462
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 35/394 (8%)
Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTW 158
LQ + + + ++ V + +P+ SG + Q+ YI +G + M V +DT +D W
Sbjct: 54 LQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAW 113
Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
+ C C C + +FDPS S S + + C + C A S S S C + +
Sbjct: 114 IPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQCKQ---APNPSCTVSKS----CGFNM 164
Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
+YG GS L ++ L L + ++ FGC G GLMGLGR LSL+SQ+
Sbjct: 165 TYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQ 223
Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
++ FSYCLP+++ + SGSL LG + + I T ++ NP+ ++ Y +NL GI
Sbjct: 224 NLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR----IKTTPLLKNPRRSSLYYVNLVGI 279
Query: 339 SIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+G K + S A G + DSGTV TRL Y A++ EF ++ +A
Sbjct: 280 RVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNA-NATSL 338
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALAS--LSY 448
DTC++ S V P V F G M V + + S A + CLA+A+ ++
Sbjct: 339 GGFDTCYSGS----VVFPSVTFMFAG---MNVTLPPDNLLIHSSAGNLSCLAMAAAPVNV 391
Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+I + QQ+N RV+ D NS+LG + E C+
Sbjct: 392 NSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 129/375 (34%), Positives = 186/375 (49%), Gaps = 39/375 (10%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
Y+ + +G + I DTGSDL W QC PC S C+ Q P+++PS S ++ + CNSS
Sbjct: 32 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91
Query: 192 TCHALEFATGNSGVCSSSSPPDC--NYFVSYGDGSYTRGELGREHLGLGK-----ASVND 244
L ++ PP C Y V+YG G +T G E G A V
Sbjct: 92 ----LSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPG 146
Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGC + G SGL+GLGR LSLVSQ + FSYCL QD ++ +L+L
Sbjct: 147 IAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLL 203
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLA---TFYILNLTGISIGGKQLQASGFA-------KG 353
G ++S+ + ++ T + +P A TFY LNLTGIS+G L A G
Sbjct: 204 GPSASL-NGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTG 262
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIP 409
G++IDSGT IT L + Y ++A + + P+ G + LD CF L S +P
Sbjct: 263 GLIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGSADTGLDLCFMLPSSTSAPPAMP 321
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ + F G A+M + Y + D+ CLA+ + + + E I+GNYQQ+N ++YD
Sbjct: 322 SMTLHFNG-ADMVLPADS--YMMSDDSGLWCLAMQNQT-DGEVNILGNYQQQNMHILYDI 377
Query: 470 KNSQLGFAGEDCSSM 484
L FA CS++
Sbjct: 378 GQETLSFAPAKCSAL 392
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 35/394 (8%)
Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTW 158
LQ + + + ++ V + +P+ SG + Q+ YI +G + M V +DT +D W
Sbjct: 54 LQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAW 113
Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
+ C C C + +FDPS S S + + C + C A S S S C + +
Sbjct: 114 IPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQCKQ---APNPSCTVSKS----CGFNM 164
Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
+YG GS L ++ L L + ++ FGC G GLMGLGR LSL+SQ+
Sbjct: 165 TYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQ 223
Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
++ FSYCLP+++ + SGSL LG + + I T ++ NP+ ++ Y +NL GI
Sbjct: 224 NLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR----IKTTPLLKNPRRSSLYYVNLVGI 279
Query: 339 SIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+G K + S A G + DSGTV TRL Y A++ EF ++ +A
Sbjct: 280 RVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNA-NATSL 338
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALAS--LSY 448
DTC++ S V P V F G M V + + S A + CLA+A+ ++
Sbjct: 339 GGFDTCYSGS----VVFPSVTFMFAG---MNVTLPPDNLLIHSSAGNLSCLAMAAAPVNV 391
Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+I + QQ+N RV+ D NS+LG + E C+
Sbjct: 392 NSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 170/349 (48%), Gaps = 26/349 (7%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
V+ DTGSDL W QC PC C+ Q P F P+ S ++ K+ C SS C +F + C+
Sbjct: 101 VVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFC---QFLPNSIRTCN 157
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLG 267
++ C Y YG G YT G L E L +G AS FGC N G+ SG+ GLG
Sbjct: 158 ATG---CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCSTEN-GVGNSTSGIAGLG 212
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
R LSL+ Q G FSYCL S AGAS ++ G +++ + + T + NP +
Sbjct: 213 RGALSLIPQLGV---GRFSYCLRSGSAAGAS-PILFGSLANLTDGN--VQSTPFVNNPAV 266
Query: 328 -ATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRLPPSIYSALKAEF 378
++Y +NLTGI++G L + GF + GG ++DSGT +T L Y +K F
Sbjct: 267 HPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF 326
Query: 379 LKQFSGFPSAPGFSILDTCF-NLSAYQEVNIPLVKMEFEGNAEMTVDV--TGIVYFVKSD 435
L Q + + G LD CF + + +P + + F+G AE V G+ +
Sbjct: 327 LSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGS 386
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ CL + + +IGN Q + ++YD F+ DC+ +
Sbjct: 387 VTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADCAKV 435
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 35/394 (8%)
Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTW 158
LQ + + + ++ V+ + +P+ SG + Q+ YI +G + M V +DT +D W
Sbjct: 54 LQDKARFLYLSSLAGVTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSNDAAW 113
Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
+ C C C + +FDPS S S + + C + C A S S S C + +
Sbjct: 114 IPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQCKQ---APNPSCTVSKS----CGFNM 164
Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
+YG GS L ++ L L + ++ FGC G GLMGLGR LSL+SQ+
Sbjct: 165 TYG-GSAIEAYLTQDTLTLATDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQ 223
Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
++ FSYCLP+++ + SGSL LG + + I T ++ NP+ ++ Y +NL GI
Sbjct: 224 NLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR----IKTTPLLKNPRRSSLYYVNLVGI 279
Query: 339 SIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+G K + S A G + DSGTV TRL Y A++ EF ++ +A
Sbjct: 280 RVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKN-ANATSL 338
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYED 450
DTC++ S V P V F G M V + + S A + CLA+A+
Sbjct: 339 GGFDTCYSGS----VVFPSVTFMFAG---MNVTLPPDNLLIHSSAGNLSCLAMAAAPTNV 391
Query: 451 ET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ +I + QQ+N RV+ D NS+LG + E C+
Sbjct: 392 NSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 127/370 (34%), Positives = 182/370 (49%), Gaps = 27/370 (7%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ T Y+ + +G + + + +DTGSDL W QCQPC +C++Q P FDPS S +
Sbjct: 77 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
C+S+ C L A+ S + C Y SYGD S T G L + ASV
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 194
Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGCG N G+F +G+ G GR LSL SQ G FS+C + S +++L
Sbjct: 195 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPS-TVLLD 250
Query: 305 GNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILI 357
+ ++K+ + T +I NP TFY L+L GI++G +L S F GG +I
Sbjct: 251 LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGTII 310
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEF 415
DSGT +T LP +Y ++ F Q P G + D F LSA +P + + F
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHF 368
Query: 416 EGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
EG A M + V+ V+ S + CLA+ E IGN+QQ+N V+YD +NS+L
Sbjct: 369 EG-ATMDLPRENYVFEVEDAGSSILCLAIIE---GGEVTTIGNFQQQNMHVLYDLQNSKL 424
Query: 475 GFAGEDCSSM 484
F C +
Sbjct: 425 SFVPAQCDKL 434
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 137/390 (35%), Positives = 197/390 (50%), Gaps = 49/390 (12%)
Query: 120 TEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
+ P TSG Y+A I +G + + +DTGSD+TW+QCQPC+ CY Q PVFDP
Sbjct: 125 SRAPTTSG------EYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDP 178
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG-DGSYTRGELGREHLG 236
S SY+++ ++ C AL + G + C Y V YG DGS T G+ E L
Sbjct: 179 RHSTSYREMGYDAPDCQALGRSGGGDAKRMT-----CVYAVGYGDDGSTTVGDFIEETLT 233
Query: 237 L-GKASVNDFIFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLP-- 290
G V GCG +NKGLF +G++GLGR +S SQ + + + FSYCL
Sbjct: 234 FAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADF 293
Query: 291 --STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG------ 342
S+ S +L +G ++ S P ++T + N +ATFY + L G+S+GG
Sbjct: 294 FLSSPGRSVSSTLTIGDGAAA--GSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGV 351
Query: 343 --KQLQASGF-AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFS-GFPSAPGFS 392
L+ + +GG+++DSGT +TRL Y A + F L Q S G PS GF
Sbjct: 352 TEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPS--GF- 408
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLSYEDE 451
DTC+ + + + +P V M F G E+T+ Y + D+ VC A A +
Sbjct: 409 -FDTCYTMGG-RAMKVPTVSMHFAGGVELTLPPKN--YLIPVDSMGTVCFAFAGTG-DRS 463
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IIGN QQ+ RV+Y+ ++GFA C
Sbjct: 464 VSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/391 (31%), Positives = 194/391 (49%), Gaps = 32/391 (8%)
Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
SR+ + S ++ + P+ SG +L QTL Y+ LG + + + VDT +D +W+
Sbjct: 80 SRLLYLDSLAVRGRARAYAPIASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWIP 139
Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP---DCNYF 217
C C C FDP+ S SY+ V C S C A + C PP C +
Sbjct: 140 CAGCAGCPTSSAAPFDPAASASYRTVPCGSPLC-----AQAPNAAC----PPGGKACGFS 190
Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
++Y D S + L ++ L + +V + FGC + G GL+GLGR LS +SQT
Sbjct: 191 LTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQT 249
Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
+++ FSYCLPS + SG+L LG N + I T ++ NP ++ Y +N+TG
Sbjct: 250 KDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQR----IKTTPLLANPHRSSLYYVNMTG 305
Query: 338 ISIGGKQLQASGF--AKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI- 393
+ +G K + F A G G ++DSGT+ TRL Y A++ E ++ AP S+
Sbjct: 306 VRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV----GAPVSSLG 361
Query: 394 -LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
DTCFN +A V P + + F+G + +++ S + +A A
Sbjct: 362 GFDTCFNTTA---VAWPPMTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVL 418
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+I + QQ+N RV++D N ++GFA E C++
Sbjct: 419 NVIASMQQQNHRVLFDVPNGRVGFARERCTA 449
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 173/356 (48%), Gaps = 37/356 (10%)
Query: 144 RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
+ V DT ++ ++C+PC C DP F+PS S S+ + C S C A+E
Sbjct: 187 QRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAAIPCGSPEC-AVE--- 238
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR--NNKGLF 257
C+ +S C + + +G+ + G L R+ L L A+ F FGC + F
Sbjct: 239 -----CTGAS---CPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTF 290
Query: 258 GGVSGLMGLGRSDLSLVSQT----SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
G GL+ L RS SL S+ + FSYCLPS+ + G L +G + +
Sbjct: 291 DGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGG 350
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIY 371
I Y M NP Y ++L GIS+GG+ L FA G L+++ T T L P+ Y
Sbjct: 351 D-IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAAY 409
Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
+AL+ F K + +P+AP F +LDTC+NL+ + +P V + F G E+ +DV ++YF
Sbjct: 410 AALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYF 469
Query: 432 VKSDASQVCLALA------SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+D S V ++A + +IG Q++ V+YD + ++GF C
Sbjct: 470 --ADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 523
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 173/356 (48%), Gaps = 37/356 (10%)
Query: 144 RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
+ V DT ++ ++C+PC C DP F+PS S S+ + C S C A+E
Sbjct: 99 QRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAAIPCGSPEC-AVE--- 150
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR--NNKGLF 257
C+ +S C + + +G+ + G L R+ L L A+ F FGC + F
Sbjct: 151 -----CTGAS---CPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTF 202
Query: 258 GGVSGLMGLGRSDLSLVSQT----SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
G GL+ L RS SL S+ + FSYCLPS+ + G L +G + +
Sbjct: 203 DGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGG 262
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIY 371
I Y M NP Y ++L GIS+GG+ L FA G L+++ T T L P+ Y
Sbjct: 263 D-IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAAY 321
Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
+AL+ F K + +P+AP F +LDTC+NL+ + +P V + F G E+ +DV ++YF
Sbjct: 322 AALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYF 381
Query: 432 VKSDASQVCLALA------SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+D S V ++A + +IG Q++ V+YD + ++GF C
Sbjct: 382 --ADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 126/381 (33%), Positives = 176/381 (46%), Gaps = 59/381 (15%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
T +DT SDL W QCQPC CY Q DPVF+P S SY V CNS TC L+ + C
Sbjct: 102 TAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELD-----THRC 156
Query: 207 SSSSPPD----CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVS 261
+ D C Y SYG + TRG L + L +G +FGC ++ G VS
Sbjct: 157 ARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFRGVVFGCSSSSVGGPPPQVS 216
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV-FKNSTPITYTN 320
G++GLGR LSLVSQ S F YCLP A G L+LG +++ +N++
Sbjct: 217 GVVGLGRGALSLVSQLSV---RRFMYCLPPPVSRSA-GRLVLGADAAATVRNASERVVVP 272
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQ----------ASGFAKG----------------- 353
M + ++Y LNL GISIG + + G A G
Sbjct: 273 MSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGT 332
Query: 354 -----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQ 404
G++ID + IT L S+Y + + ++ P G + LD CF L
Sbjct: 333 GPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIR-LPRGSGSDLGLDLCFILPEGVPMS 391
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQ 463
V P V + FEG + +D + FV+ AS +CL + D I+GNYQQ+N
Sbjct: 392 RVYAPPVSLAFEG-VWLRLDKEQM--FVEDRASGMMCLMVGK---TDGVSILGNYQQQNM 445
Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
+V+Y+ + ++ F C S+
Sbjct: 446 QVMYNLRRGRITFIKTACESV 466
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 129/422 (30%), Positives = 201/422 (47%), Gaps = 50/422 (11%)
Query: 78 SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYI 136
+G + D + +RL L++ L +R K + P+ SG +L QT Y+
Sbjct: 66 AGFLADQASRDASRL----LYLDSLAARGK----------ARAYAPIASGRQLLQTPTYV 111
Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
LG + + + VDT +D W+ C C C P FDP+ S SY+ V C S C
Sbjct: 112 VRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPLC- 170
Query: 195 ALEFATGNSGVCSSSSPP---DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
A + C PP C + ++Y D S + L ++ L + +V + FGC +
Sbjct: 171 ----AQAPNAAC----PPGGKACGFSLTYADSSL-QAALSQDSLAVAGDAVKTYTFGCLQ 221
Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
G GL+GLGR LS +SQT +++ G FSYCLPS + SG+L LG N +
Sbjct: 222 KATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPR 281
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVIT 364
I T ++ NP ++ Y +N+TGI +G K + A G ++DSGT+ T
Sbjct: 282 ----IKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFT 337
Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
RL Y A++ E ++ AP S+ DTCFN +A V P V + F+G
Sbjct: 338 RLVAPAYVAVRDEVRRRV----GAPVSSLGGFDTCFNTTA---VAWPPVTLLFDGMQVTL 390
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ +++ S + +A A +I + QQ+N RV++D N ++GFA E C+
Sbjct: 391 PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 450
Query: 483 SM 484
++
Sbjct: 451 AV 452
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 127/382 (33%), Positives = 181/382 (47%), Gaps = 43/382 (11%)
Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSY 183
+R TL Y+A +G + ++DTGSDL W QC C K C Q P ++ S S ++
Sbjct: 83 VRWATLQYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTF 142
Query: 184 KKVLCNSSTCHA----LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
V C + C A + F +G C+ YG G G LG E +
Sbjct: 143 APVPCAARICAANDDIIHFCDLAAG---------CSVIAGYGAG-VVAGTLGTEAFAF-Q 191
Query: 240 ASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDA 295
+ + FGC R +G G SGL+GLGR LSLVSQT FSYCL P +
Sbjct: 192 SGTAELAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGAT---KFSYCLTPYFHNN 248
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--------- 346
GA+G L +G ++S+ + +T T + P+ + FY L L G+++G +L
Sbjct: 249 GATGHLFVGASASLGGHGDVMT-TQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLR 307
Query: 347 --ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
A G GG++IDSG+ T L Y AL +E + +G AP D + A +
Sbjct: 308 EVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCV-ARR 366
Query: 405 EVN--IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
+V +P V F G A+M V Y+ D + C+A+AS +IGNYQQ+N
Sbjct: 367 DVGRVVPAVVFHFRGGADMAVPAES--YWAPVDKAAACMAIASAGPYRRQSVIGNYQQQN 424
Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
RV+YD N F DCS++
Sbjct: 425 MRVLYDLANGDFSFQPADCSAL 446
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 116/363 (31%), Positives = 179/363 (49%), Gaps = 33/363 (9%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQD--PVFDPSISPSYKKVLCNSSTCHALEFATGN 202
+ VIVDTGS+L W QC PC C+ + PV P+ S ++ ++ CN S C L ++
Sbjct: 103 DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTSS-R 161
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSG 262
C++++ C Y +YG G YT G L E L +G + FGC N SG
Sbjct: 162 PRTCNATA--ACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
++GLGR LSLVSQ + G FSYCL S G + ++ G + + + S + T ++
Sbjct: 217 IVGLGRGPLSLVSQLAV---GRFSYCLRSDMADGGASPILFGSLAKLTERSV-VQSTPLL 272
Query: 323 PNP--QLATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRLPPSIYS 372
NP Q +T Y +NLTGI++ +L + GF + GG ++DSGT +T L Y+
Sbjct: 273 KNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYA 332
Query: 373 ALKAEFLKQFSGF----PSAPGFSILDTCFNLSA---YQEVNIPLVKMEFEGNAEMTVDV 425
+K F Q + P++ LD C+ SA + V +P + + F G A+ V V
Sbjct: 333 MVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPV 392
Query: 426 TGIVYFVKSDA----SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
V++D+ + CL + + + IIGN Q + ++YD FA DC
Sbjct: 393 QNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
Query: 482 SSM 484
+ +
Sbjct: 453 AKL 455
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 123/412 (29%), Positives = 192/412 (46%), Gaps = 33/412 (8%)
Query: 82 VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIEL 141
V +E + + + V+++ +R + ++ ++ E PL Y+ I +
Sbjct: 6 VKRSEAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPL----HPDGGGYVMDISV 61
Query: 142 G--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
G G+ I DTGSDL WVQ +PC C +FDP S +++++ C+S C L
Sbjct: 62 GTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCTELP-- 117
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNK 254
G C S C+Y YG G T GE R+ + LG S F GCG N
Sbjct: 118 ----GSCEPGSSA-CSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNS 171
Query: 255 GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
G F GV GL+GLG+ +SL SQ S FSYCL S L+ G ++++ + T
Sbjct: 172 G-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAAL--HGT 228
Query: 315 PITYTNMI-PNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSA 373
I T + P+ T+Y+L + GI++ G+ + + G +IDSGT +T +P +Y
Sbjct: 229 GIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTT----IIDSGTTLTYVPSGVYGR 284
Query: 374 LKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
+ + ++ P G S+ LD C++ S+ + P + + G A MT + V
Sbjct: 285 VLSR-MESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAG-ATMTPPSSNYFLVV 342
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
VCLA+ S IIGN Q+ ++YD +S+L F C S+
Sbjct: 343 DDSGDTVCLAMGSAGGL-PVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 129/376 (34%), Positives = 191/376 (50%), Gaps = 47/376 (12%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
YI T+ +G ++ I DTGSDL W QC PC S C+ Q ++PS S ++ + CNSS
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSS 147
Query: 192 T--CHALEFATGNSGVCSSSSPPDCN--YFVSYGDGSYTRGELGREHLGLG-----KASV 242
C AL S PP C+ Y +YG G +T G E G + V
Sbjct: 148 VSMCAALA---------GPSPPPGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRV 197
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
FGC + + G +GL+GLGR +SLVSQ + G+FSYCL QDA ++ +L+
Sbjct: 198 PGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQ---LGAGMFSYCLTPFQDANSTSTLL 254
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
LG ++++ N T + T + +P ++T+Y LNLTGISIG L + FA
Sbjct: 255 LGPSAAL--NGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGT 312
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEV--NI 408
GG++IDSGT IT L + Y ++A ++ P A G + LD CF L++ ++
Sbjct: 313 GGLIIDSGTTITSLVDAAYQQVRAA-IESLVTLPVADGSDSTGLDLCFALTSETSTPPSM 371
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
P + F+G A+M + V + + CLA+ + + GNYQQ+N ++YD
Sbjct: 372 PSMTFHFDG-ADMVLPVDNYMIL---GSGVWCLAMRNQTV-GAMSTFGNYQQQNVHLLYD 426
Query: 469 TKNSQLGFAGEDCSSM 484
L FA CS++
Sbjct: 427 IHEETLSFAPAKCSTL 442
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 116/363 (31%), Positives = 179/363 (49%), Gaps = 33/363 (9%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQD--PVFDPSISPSYKKVLCNSSTCHALEFATGN 202
+ VIVDTGS+L W QC PC C+ + PV P+ S ++ ++ CN S C L ++
Sbjct: 103 DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTSS-R 161
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSG 262
C++++ C Y +YG G YT G L E L +G + FGC N SG
Sbjct: 162 PRTCNATA--ACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
++GLGR LSLVSQ + G FSYCL S G + ++ G + + + S + T ++
Sbjct: 217 IVGLGRGPLSLVSQLAV---GRFSYCLRSDMADGGASPILFGSLAKLTEGSV-VQSTPLL 272
Query: 323 PNP--QLATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRLPPSIYS 372
NP Q +T Y +NLTGI++ +L + GF + GG ++DSGT +T L Y+
Sbjct: 273 KNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYA 332
Query: 373 ALKAEFLKQFSGF----PSAPGFSILDTCFNLSA---YQEVNIPLVKMEFEGNAEMTVDV 425
+K F Q + P++ LD C+ SA + V +P + + F G A+ V V
Sbjct: 333 MVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPV 392
Query: 426 TGIVYFVKSDA----SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
V++D+ + CL + + + IIGN Q + ++YD FA DC
Sbjct: 393 QNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
Query: 482 SSM 484
+ +
Sbjct: 453 AKL 455
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 114/367 (31%), Positives = 178/367 (48%), Gaps = 31/367 (8%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ I +G + + I DTGSDL WVQCQPC+ CY Q P+FDP S SY+ VLC +
Sbjct: 93 YLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEF 152
Query: 193 CHALEFATGNSGVCSSSS-PPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-------- 243
C+ L+ G + C + C Y SYGD S++ G L E G+G + N
Sbjct: 153 CNKLD---GEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYF 209
Query: 244 -DFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGS 300
+ FGCG N G F SG++GLG +SLVSQ G FSYCL P+++ + +
Sbjct: 210 QEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSK 269
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-----AKGGI 355
+ G + ++ ++ + T ++P + T+Y L L IS+ K+L + KG I
Sbjct: 270 INFGNDINISGSNYNVVSTPLLPK-KPETYYYLTLEAISVENKRLPYTNLWNGEVEKGNI 328
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+IDSGT +T L ++ L + + G + + + CF + + +P++ F
Sbjct: 329 IIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICFKDE--KAIELPIITAHF 386
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G V++ + F K + +C + ++ I GN Q N V YD + +
Sbjct: 387 TG---ADVELQPVNTFAKVEEDLLCFTMIP---SNDIAIFGNLAQMNFLVGYDLEKKAVS 440
Query: 476 FAGEDCS 482
F DC+
Sbjct: 441 FLPTDCT 447
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 112/378 (29%), Positives = 175/378 (46%), Gaps = 53/378 (14%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
T +DT SDL W QCQPC CY+Q DP+F+P +S +Y + C+S TC L+
Sbjct: 101 KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHR---- 156
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG----V 260
C C Y +Y + T G L + L +G+ + FGC ++ G G
Sbjct: 157 -CGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTG--GAPPPQA 213
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
SG++GLGR LSLVSQ S F+YCLP + G L+LG ++ +N+T
Sbjct: 214 SGVVGLGRGPLSLVSQLSV---RRFAYCLPPPA-SRIPGKLVLGADADAARNATNRIAVP 269
Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------------------------QASGFAKG-- 353
M +P+ ++Y LNL G+ IG + + A+ A G
Sbjct: 270 MRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDA 329
Query: 354 ---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEV 406
G++ID + IT L S+Y L + + P G S+ LD CF L A+ V
Sbjct: 330 NRYGMIIDIASTITFLEASLYDELVNDLEVEIR-LPRGTGSSLGLDLCFILPDGVAFDRV 388
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P V + F+G + +D + F + S + + + I+GN+QQ+N +V+
Sbjct: 389 YVPAVALAFDGR-WLRLDKARL--FAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVL 445
Query: 467 YDTKNSQLGFAGEDCSSM 484
Y+ + ++ F C ++
Sbjct: 446 YNLRRGRVTFVQSPCGAL 463
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 118/393 (30%), Positives = 188/393 (47%), Gaps = 49/393 (12%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD---PVFDPS 178
L SG + + Y + +G + +IVDTGSDLTW+QC P + N P +D S
Sbjct: 48 LVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKS 107
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
S SY+++ C C L G+S CS +SP C+Y Y D S T G L E + +
Sbjct: 108 SSSSYREIPCTDDECQFLPAPIGSS--CSITSPSPCDYTYGYSDQSRTTGILAYETISMK 165
Query: 238 -----GKAS---------VNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEI-F 281
GK + + + GC R + G F G SG++GLG+ +SL +QT
Sbjct: 166 SRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 225
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
GG+FSYCL + S ++ G + K + +T ++ NP +FY +N+TG+++
Sbjct: 226 GGIFSYCLVDYLRGSNASSFLVMGRTHWRK----LAHTPIVRNPAAQSFYYVNVTGVAVD 281
Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYS----ALKAE-FLKQFSGFPSA 388
GK + G G + DSGT ++ L YS AL A +L + P
Sbjct: 282 GKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE- 340
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
GF + C+N++ E +P + +EF+G A M + + V + C+AL ++
Sbjct: 341 -GFEL---CYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQ--CVALQKVTT 393
Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ + I+GN Q++ + YD +++GF C
Sbjct: 394 TNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/378 (29%), Positives = 175/378 (46%), Gaps = 53/378 (14%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
T +DT SDL W QCQPC CY+Q DP+F+P +S +Y + C+S TC L+
Sbjct: 101 KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHR---- 156
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG----V 260
C C Y +Y + T G L + L +G+ + FGC ++ G G
Sbjct: 157 -CGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTG--GAPPPQA 213
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
SG++GLGR LSLVSQ S F+YCLP + G L+LG ++ +N+T
Sbjct: 214 SGVVGLGRGPLSLVSQLSV---RRFAYCLPPPA-SRIPGKLVLGADADAARNATNRIAVP 269
Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------------------------QASGFAKG-- 353
M +P+ ++Y LNL G+ IG + + A+ A G
Sbjct: 270 MRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDA 329
Query: 354 ---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEV 406
G++ID + IT L S+Y L + + P G S+ LD CF L A+ V
Sbjct: 330 NRYGMIIDIASTITFLEASLYDELVNDLEVEIR-LPRGTGSSLGLDLCFILPDGVAFDRV 388
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P V + F+G + +D + F + S + + + I+GN+QQ+N +V+
Sbjct: 389 YVPAVALAFDGR-WLRLDKARL--FAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVL 445
Query: 467 YDTKNSQLGFAGEDCSSM 484
Y+ + ++ F C ++
Sbjct: 446 YNLRRGRVTFVQSPCGAL 463
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/391 (31%), Positives = 193/391 (49%), Gaps = 32/391 (8%)
Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
SR+ + S ++ + P+ SG +L QT Y+ LG + + + VDT +D +W+
Sbjct: 80 SRLLYLDSLAVRGRARAYAPIASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIP 139
Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP---DCNYF 217
C C C FDP+ S SY+ V C S C A + C PP C +
Sbjct: 140 CAGCAGCPTSSAAPFDPASSASYRTVPCGSPLC-----AQAPNAAC----PPGGKACGFS 190
Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
++Y D S + L ++ L + +V + FGC + G GL+GLGR LS +SQT
Sbjct: 191 LTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQT 249
Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
+++ FSYCLPS + SG+L LG N + I T ++ NP ++ Y +N+TG
Sbjct: 250 KDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQR----IKTTPLLANPHRSSLYYVNMTG 305
Query: 338 ISIGGKQLQASGF--AKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI- 393
I +G K + F A G G ++DSGT+ TRL Y A++ E ++ AP S+
Sbjct: 306 IRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV----GAPVSSLG 361
Query: 394 -LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
DTCFN +A V P V + F+G + +++ S + +A A
Sbjct: 362 GFDTCFNTTA---VAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVL 418
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+I + QQ+N RV++D N ++GFA E C++
Sbjct: 419 NVIASMQQQNHRVLFDVPNGRVGFARERCTA 449
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/377 (32%), Positives = 193/377 (51%), Gaps = 44/377 (11%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
Y+ T+ +G ++ I DTGSDL W QC PC S C+ Q P+++PS S ++ + CNSS
Sbjct: 86 YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSS 145
Query: 192 TCHALEFATGNSGVCSSSSPPDCN--YFVSYGDGSYTRGELGREHLGLGKAS------VN 243
+ + + ++ PP C Y ++YG G +T G E G ++ V
Sbjct: 146 ------LSMCAAALAGTTPPPGCTCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVP 198
Query: 244 DFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
FGC + G SGL+GLGR LSLVSQ + FSYCL QD ++ +L+
Sbjct: 199 GIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLL 255
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA-------K 352
LG ++S+ ++ ++ T + +P ++T+Y LNLTGIS+G L A
Sbjct: 256 LGPSASL-NDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGT 314
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNL--SAYQEVN 407
GG +IDSGT IT L + Y ++A + + P+ G S LD CF L S
Sbjct: 315 GGFIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGGSAATGLDLCFELPSSTSAPPT 373
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+P + + F+G A+M + ++ D++ CLA+ + + + I+GNYQQ+N ++Y
Sbjct: 374 MPSMTLHFDG-ADMVLPADS---YMMLDSNLWCLAMQNQT-DGGVSILGNYQQQNMHILY 428
Query: 468 DTKNSQLGFAGEDCSSM 484
D L FA CS++
Sbjct: 429 DVGQETLTFAPAKCSTL 445
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 172/357 (48%), Gaps = 37/357 (10%)
Query: 143 GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
+ V DT ++ ++C+PC C DP F+PS S S+ + C S C A+E
Sbjct: 98 AQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAAIPCGSPEC-AVE-- 150
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR--NNKGL 256
C+ +S C + + +G+ + G L R+ L L A+ F FGC +
Sbjct: 151 ------CTGAS---CPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADT 201
Query: 257 FGGVSGLMGLGRSDLSLVSQT----SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
F G GL+ L RS SL S+ + FSYCLPS+ + G L +G + +
Sbjct: 202 FDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSG 261
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSI 370
I Y M NP Y + L GIS+GG+ L FA G L+++ T T L P+
Sbjct: 262 GD-IKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAA 320
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
Y+AL+ F + + +P+AP F +LDTC+NL+ + +P V + F G E+ +DV ++Y
Sbjct: 321 YAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMY 380
Query: 431 FVKSDASQVCLALA------SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
F +D S V ++A + +IG Q++ V+YD + ++GF C
Sbjct: 381 F--ADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 141/418 (33%), Positives = 209/418 (50%), Gaps = 46/418 (11%)
Query: 93 ILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IV 150
+ D L+ +L+S ++ NI +S T+ L SG+ + +I +G M V I
Sbjct: 47 VTDRLNAAFLRSISRSRRLNNI--LSQTD--LQSGLIGADGEFFMSITIGTPPMKVFAIA 102
Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
DTGSDLTWVQC+PC+ CY + P+FD S +YK C+S CHAL ++ G S +
Sbjct: 103 DTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCHAL--SSSERGCDESKN 160
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-----IFGCGRNNKGLFGGV-SGLM 264
C Y SYGD S+++G++ E + + AS + +FGCG NN G F SG++
Sbjct: 161 V--CKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGII 218
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI--LGGN---SSVFKNSTPITYT 319
GLG LSL+SQ FSYCL S + A +G+ + LG N SS+ K+S I+
Sbjct: 219 GLGGGHLSLISQLGSSISKKFSYCL-SHKSATTNGTSVINLGTNSIPSSLSKDSGVISTP 277
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFA------------KGGILIDSGTVITRLP 367
+ P+ T+Y L L IS+G K++ +G + G I+IDSGT +T L
Sbjct: 278 LVDKEPR--TYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLD 335
Query: 368 PSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
+ A + +G S P +L CF S E+ +P + + F G V +
Sbjct: 336 SGFFDKFGAAVEELVTGAKRVSDPQ-GLLSHCFK-SGSAEIGLPEITVHFTG---ADVRL 390
Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ I FVK VCL++ + E I GN+ Q + V YD + + F DCS+
Sbjct: 391 SPINAFVKVSEDMVCLSMVPTT---EVAIYGNFAQMDFLVGYDLETRTVSFQRMDCSA 445
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 132/422 (31%), Positives = 194/422 (45%), Gaps = 50/422 (11%)
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
G +++L H++ D ++ Q RL D SR+ G + + T +
Sbjct: 30 GGFSVDLIHRDSPHSPFFDPSKTQAERLT-DAFRRSV--SRV-----GRFRPTAMTSDGI 81
Query: 125 TSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
S I Y+ + +G + VI VDTGSDLTW QC+PC CY Q P+FDP S +
Sbjct: 82 QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSST 141
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----G 238
Y+ C +S C AL G CS C + SY DGS+T G L E L + G
Sbjct: 142 YRDSSCGTSFCLAL----GKDRSCSKEK--KCTFRYSYADGSFTGGNLASETLTVDSTAG 195
Query: 239 K-ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
K S F FGCG ++ G+F SG++GLG +LSL+SQ GLFSYC LP + D+
Sbjct: 196 KPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDS 255
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI 355
S + G + V T ++T L G S + + +G I
Sbjct: 256 SISSRINFGASGRVSGYGT------------VSTPLRLPYKGYS------KKTEVEEGNI 297
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
++DSGT T LP YS L+ G I C+N +A E+N P++ F
Sbjct: 298 IVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAPIITAHF 355
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
+ + V++ + F++ VC +A S + G++GN Q N V +D + + G
Sbjct: 356 K---DANVELQPLNTFMRMQEDLVCFTVAPTS---DIGVLGNLAQVNFLVGFDLRKKR-G 408
Query: 476 FA 477
F+
Sbjct: 409 FS 410
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 64/149 (42%), Gaps = 13/149 (8%)
Query: 340 IGGKQLQASGFAK------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
+G + GF+K G I++DSGT T LP Y L+ G I
Sbjct: 399 VGFDLRKKRGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGI 458
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
C+N + Q ++ P++ F+ + V++ F++ VC + S + G
Sbjct: 459 SSLCYNTTVDQ-IDAPIITAHFK---DANVELQPWNTFLRMQEDLVCFTVLPTS---DIG 511
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
I+GN Q N V +D + ++ F DC+
Sbjct: 512 ILGNLAQVNFLVGFDLRKKRVSFKAADCT 540
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 131/367 (35%), Positives = 178/367 (48%), Gaps = 36/367 (9%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ I LG + + I DTGSDL W QC PC +CY Q +P+FDP S +YK + C++
Sbjct: 94 YLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEF 153
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIF 247
C L G G C + C Y SYGD SYTRG+L + L +G AS F
Sbjct: 154 CQDL----GQQGSCDDDN--TCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAF 207
Query: 248 GCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
GCG +N G F GL+GLG LSLV Q S GG FSYCL P + D+ S S I G
Sbjct: 208 GCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVS-SKINFG 266
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----------KGGI 355
S V S ++ + P TFY L L G+S+G + + GF+ +G I
Sbjct: 267 KSGVVSGSGTVSTPLIKGTPD--TFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNI 324
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+IDSGT +T LP Y+ +++ G + I C+ S+ + IP + F
Sbjct: 325 IIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY--SSVNNLEIPTITAHF 382
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G V + + FV+ VC ++ S I GN Q N V YD KN+++
Sbjct: 383 TG---ADVQLPPLNTFVQVQEDLVCFSMIPSS---NLAIFGNLAQINFLVGYDLKNNKVS 436
Query: 476 FAGEDCS 482
F DC+
Sbjct: 437 FKQTDCT 443
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 176/386 (45%), Gaps = 35/386 (9%)
Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
S NI+++ I G L + YI T + +T +VDTGSDL W+QC PC CY Q
Sbjct: 50 SNNIQNIVQAPINAYIGQHLMEI-YIGTPPI---KITGLVDTGSDLIWIQCAPCLGCYKQ 105
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
P+FDP S +Y + C+S CH L+ +GVCS CNY YGD S T+G L
Sbjct: 106 IKPMFDPLKSSTYNNISCDSPLCHKLD-----TGVCSPEK--RCNYTYGYGDNSLTKGVL 158
Query: 231 GREHLGL----GK-ASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG- 283
++ GK S++ F+FGCG NN G F GL+GLG SL+SQ +FGG
Sbjct: 159 AQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGK 218
Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
FS CL P D S + G S V N + T ++P + T Y + L GIS+
Sbjct: 219 KFSQCLVPFLTDIKISSRMSFGKGSQVLGNG--VVTTPLVPR-EKDTSYFVTLLGISVED 275
Query: 343 KQLQA-SGFAKGGILIDSGTVITRLPPSIYSALKAEF-----LKQFSGFPSAPGFSILDT 396
S K +L+DSGT LP +Y + AE LK + PS L T
Sbjct: 276 TYFPMNSTIGKANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPS------LGT 329
Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
+ P + F G + + + CLA+ + + D G+ G
Sbjct: 330 QLCYRTQTNLKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDP-GVYG 388
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCS 482
N+ Q N + +D + F DC+
Sbjct: 389 NFAQSNYLIGFDLDRQVVSFKPTDCT 414
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 137/460 (29%), Positives = 205/460 (44%), Gaps = 42/460 (9%)
Query: 39 LHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLH 98
+H L + + S VS ++ +++L H++ + +R+I L
Sbjct: 1 MHPLVFLSLALYLLSTVSSREVSEGQRGFSIDLIHRDSPLSPFYKPSLTPSDRIINTALR 60
Query: 99 VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTW 158
Y +R + K + IP G L YI T + I DT SDL W
Sbjct: 61 SIYQLNRASHSDLNEKKTLERVRIP-NHGEYLMRF-YIGTPPV---ERLAIADTASDLIW 115
Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
VQC PC++C+ Q P+F+P S ++ + C+S C + N C C Y
Sbjct: 116 VQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQPC-----TSSNIYYCPLVG-NLCLYTN 169
Query: 219 SYGDGSYTRGELGREHLGLGKASVN--DFIFGCGRNNKGLF---GGVSGLMGLGRSDLSL 273
+YGDGS T+G L E + G +V IFGCG NN + V+G++GLG LSL
Sbjct: 170 TYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSL 229
Query: 274 VSQTSEIFGGLFSYC-LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
VSQ + G FSYC LP T + ++ L G ++++ N + T +I +P ++Y
Sbjct: 230 VSQLGDQIGHKFSYCLLPFT--STSTIKLKFGNDTTITGNG--VVSTPLIIDPHYPSYYF 285
Query: 333 LNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIY--------SALKAEFLKQF 382
L+L GI+IG K LQ + G I+ID GTV+T L + Y AL K
Sbjct: 286 LHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDD 345
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+P D CF + NI K+ F+ ++F D + +CLA
Sbjct: 346 IPYP-------FDFCFP----NQANITFPKIVFQFTGAKVFLSPKNLFFRFDDLNMICLA 394
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ Y + GN Q + +V YD K ++ FA DCS
Sbjct: 395 VLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCS 434
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 137/388 (35%), Positives = 181/388 (46%), Gaps = 40/388 (10%)
Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
T+ L SG+ Y +I +G I DTGSDLTWVQC+PC+ CY Q P+FD
Sbjct: 70 TKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDK 129
Query: 178 SISPSYKKVLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
S +YK C+S TC+AL E G C S C Y SYGD S+T+GE+ E +
Sbjct: 130 KKSSTYKTESCDSITCNALSEHEEG----CDESRNA-CKYRYSYGDESFTKGEVATETIS 184
Query: 237 LGKASVNDF-----IFGCGRNNKGLFGGVSGLMGLGRSD-LSLVSQTSEIFGGLFSYCLP 290
+ +S + FGCG NN G F + LSLVSQ G FSYCL
Sbjct: 185 IDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLS 244
Query: 291 STQDAGASGSLI-LGGNSSVFKNS--TPITYTNMI-PNPQLATFYILNLTGISIGGKQLQ 346
T S+I LG NS K S + I T +I +P+ T+Y L L I++G +L
Sbjct: 245 HTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPE--TYYFLTLEAITVGKTKLP 302
Query: 347 ASG----------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSIL 394
+G G I+IDSGT +T L Y A + +G S P IL
Sbjct: 303 YTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQ-GIL 361
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
CF S +E+ +P + M F G V ++ I FVK VCL++ + E I
Sbjct: 362 THCFK-SGDKEIGLPTITMHFTG---ADVKLSPINSFVKLSEDIVCLSMIPTT---EVAI 414
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
GN Q + V YD + + F DCS
Sbjct: 415 YGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 132/368 (35%), Positives = 179/368 (48%), Gaps = 36/368 (9%)
Query: 134 NYIATIELGGR--NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ I LG +M I DTGSDL W QC PC CY Q +P+FDP S +YK + CN+
Sbjct: 93 SYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNND 152
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFI 246
C L G G C + C SYGD SYTR +L E +G AS
Sbjct: 153 FCQDL----GQQGSCGDDN--TCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLA 206
Query: 247 FGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILG 304
FGCG +N G F GL+GLG LSLV Q S GG FSYCL P + D+ AS S I
Sbjct: 207 FGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTAS-SKINF 265
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG----------G 354
G S+V S ++ + P TFY L L G+S+G +++ GF+K
Sbjct: 266 GKSAVVSGSGTVSTPLIKGTPD--TFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESN 323
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
I+IDSGT +T LP Y+ +++ K G + C+ S +++ IP +
Sbjct: 324 IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAH 381
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F G V + + FV++ VC ++ S I GN Q N V YD KN+++
Sbjct: 382 FIG---ADVQLPPLNTFVQAQEDLVCFSMIPSS---NLAIFGNLSQMNFLVGYDLKNNKV 435
Query: 475 GFAGEDCS 482
F DC+
Sbjct: 436 SFKPTDCT 443
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 132/449 (29%), Positives = 211/449 (46%), Gaps = 58/449 (12%)
Query: 67 ITLELKHKNYCSGKIVDWNE---QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP 123
+ + LKH + +GK + +E + R + +++R + D T P
Sbjct: 32 VRVALKHVD--AGKQLSRSELIRRAMQRSKARAAALSAVRNRAASARFSGKNDDQRTTPP 89
Query: 124 LTSGIRLQ-TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
+R L Y+ + +G + ++ ++DTGSDL W QC PC SC Q DP+F P S
Sbjct: 90 TGVSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGES 149
Query: 181 PSYKKVLCNSSTC-----HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
SY+ + C C H E P C Y +YGDG+ T G E
Sbjct: 150 ASYEPMRCAGQLCSDILHHGCEM------------PDTCTYRYNYGDGTMTMGVYATERF 197
Query: 236 GLGKASVNDFI-----FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
+ + + FGCG N G SG++G GR+ LSLVSQ S FSYCL
Sbjct: 198 TFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSI---RRFSYCL- 253
Query: 291 STQDAGASGSLILGGNS-SVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ-- 346
++ +G +L+ G S V+ ++T P+ T ++ + Q TFY ++L G+++G ++L+
Sbjct: 254 TSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIP 313
Query: 347 ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNL 400
S FA GG+++DSGT +T LP ++ + + F +Q P A G + D CF +
Sbjct: 314 ESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLR-LPFANGGNPEDGVCFLV 372
Query: 401 -------SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDET 452
S+ +V +P + F+ + +D+ Y + ++CL LA D+
Sbjct: 373 PAAWRRSSSTSQVPVPRMVFHFQ---DADLDLPRRNYVLDDHRKGRLCLLLADSG--DDG 427
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IGN Q++ RV+YD + L FA C
Sbjct: 428 STIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 121/377 (32%), Positives = 180/377 (47%), Gaps = 47/377 (12%)
Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L Y+ + +G + +T ++DTGSDL W QC C +C Q DP+F P +S SY+ + C
Sbjct: 96 LEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
C G+ S P C Y SYGDG+ T G E G+
Sbjct: 156 QLC-------GDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG 208
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSLI 302
FGCG N G SG++G GR LSLVSQ S FSYCL + S GSL
Sbjct: 209 FGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSI---RRFSYCLTPYASSRKSTLQFGSL- 264
Query: 303 LGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGG 354
+ ++ ++T P+ T ++ + Q TFY + TG+++G ++L+ AS FA GG
Sbjct: 265 --ADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGG 322
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAY--------QE 405
++IDSGT +T P ++ + + F Q P A G S D CF A ++
Sbjct: 323 VIIDSGTALTLFPAAVLAEVVRAFRSQLR-LPFANGSSPDDGVCFAAPAVAAGGGRMARQ 381
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQR 464
V +P + F+G +D+ Y ++ +C+ L D+ IGN+ Q++ R
Sbjct: 382 VAVPRMVFHFQG---ADLDLPRENYVLEDHRRGHLCVLLGDSG--DDGATIGNFVQQDMR 436
Query: 465 VIYDTKNSQLGFAGEDC 481
V+YD + L FA +C
Sbjct: 437 VVYDLERETLSFAPVEC 453
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 121/353 (34%), Positives = 158/353 (44%), Gaps = 44/353 (12%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
DTGSDL W QC PC CY QQ+P+FDP S SY + C + +C+ L+ S +CS+
Sbjct: 77 ADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTESCNKLD-----SSLCSTD 131
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLM 264
CNY SY D S T+G L +E L L + IFGCG NN G GL+
Sbjct: 132 Q-KTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDREMGLI 190
Query: 265 GLGRSDLSLVSQTSEIFGG---LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPIT 317
GLGR LSL+SQ G +FS CL P D + + G S V N STP+
Sbjct: 191 GLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLI 250
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASG------FAKGGILIDSGTVITRLPPSIY 371
+ T Y L GIS+ L S KG ILIDSGT IT LP Y
Sbjct: 251 SKD-------GTGYFATLLGISVEDINLPFSNGSSLGTITKGNILIDSGTTITYLPEEFY 303
Query: 372 SALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
L + + + P F I + C+ +N P + + FEG V +T
Sbjct: 304 HRLIEQVRNKVALEP----FRIDGYELCYQTPT--NLNGPTLTIHFEGG---DVLLTPAQ 354
Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
F+ C A+ +E GNY Q N + +D + + F DC+
Sbjct: 355 MFIPVQDDNFCFAV--FDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDCT 405
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 119/364 (32%), Positives = 176/364 (48%), Gaps = 34/364 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ LG ++ I DTGSDL W QC+PC CY Q P+FDP S +Y+ + C++
Sbjct: 92 YLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTKQ 151
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIF 247
C L+ CS C+Y SYGD S+T G + + + LG S + I
Sbjct: 152 CDLLK----EGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAII 207
Query: 248 GCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
GCG NN G F SG++GLG +SL+SQ G FSYCL P + +A S L G
Sbjct: 208 GCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGS 267
Query: 306 NSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAKGGILID 358
N V STP+ + P+ TFY L L +S+G ++++ + G ++G I+ID
Sbjct: 268 NGIVSGGGVQSTPLISKD--PD----TFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIID 321
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
SGT +T P +S L + +G P IL C+++ A ++ P + F+G
Sbjct: 322 SGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDA--DLKFPSITAHFDG- 378
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
V + + FV+ + +C A + + I GN Q N V YD + + F
Sbjct: 379 --ADVKLNPLNTFVQVSDTVLCFAFNPI---NSGAIFGNLAQMNFLVGYDLEGKTVSFKP 433
Query: 479 EDCS 482
DC+
Sbjct: 434 TDCT 437
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 114/362 (31%), Positives = 179/362 (49%), Gaps = 37/362 (10%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+ DTGSDLTW QC+PCK C+ Q P++D + S S+ V C S+TC + ++ N C+
Sbjct: 110 ALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASATCLPIWRSSRN---CT 166
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA---------SVNDFIFGCGRNNKGLFG 258
+++ C Y +Y DG+Y+ G LG E L + SV FGCG +N GL
Sbjct: 167 ATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSY 226
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST---- 314
+G +GLGR LSLV+Q G FSYCL + ++ G + + ST
Sbjct: 227 NSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGSPVLFGSLAELAAPSTIGGA 283
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLP 367
+ T ++ P + Y ++L GIS+G +L GG+++DSGT+ T L
Sbjct: 284 AVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLV 343
Query: 368 PSIYSAL---KAEFLKQFSGFPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMT 422
S + + A L Q P S+ CF +A ++ ++P + + F G A+M
Sbjct: 344 ESAFRVVVNHVAGVLNQ----PVVNASSLDSPCFPATAGEQQLPDMPDMLLHFAGGADMR 399
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ + F ++S CL +A + I+GN+QQ+N ++++D QL F DCS
Sbjct: 400 LHRDNYMSF-NQESSSFCLNIAG-APSAYGSILGNFQQQNIQMLFDITVGQLSFVPTDCS 457
Query: 483 SM 484
+
Sbjct: 458 KL 459
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/348 (35%), Positives = 179/348 (51%), Gaps = 29/348 (8%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
IVDTGSD+ W+QCQPC+ CYNQ P+FDPS S +YK + C+S+ C +++ A CSS
Sbjct: 110 IVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNICQSVQSAAS----CSS 165
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLF-GGVSG 262
++ +C Y ++YGD S+++G+L E L LG + F GCG NNKG F SG
Sbjct: 166 NN-DECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGCGHNNKGTFQREGSG 224
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
++GLG +SL+SQ S GG FSYCL P + +S L G + V T T +
Sbjct: 225 IVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAVVSGRGT--VSTPI 282
Query: 322 IPNPQLATFYILNLTGISIGGKQL------QASGFAKGGILIDSGTVITRLPPSIYSALK 375
+P L FY L L S+G ++ S +G I+IDSGT +T LP Y L+
Sbjct: 283 VPKNGLG-FYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTTLTILPEDDYLNLE 341
Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
+ L C+ ++ E+N+P++ F+G V++ I F++ D
Sbjct: 342 SAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITAHFKG---ADVELNPISTFIEVD 398
Query: 436 ASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
VC A S + G I GN Q+N V YD + F DC+
Sbjct: 399 EGVVCFAFRS----SKIGPIFGNLAQQNLLVGYDLVKQTVSFKPTDCT 442
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 178/362 (49%), Gaps = 27/362 (7%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y I +G + V+V DTGSDL WVQCQPC+ CY Q+ P+F+P S +Y++VLC +
Sbjct: 94 YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRY 153
Query: 193 CHALEFATGNSGVCSSSS-PPDCNYFVSYGDGSYTRGELGREHLGLGKA--SVNDFIFGC 249
C+AL + CS+ C Y SYGD S+T G L E +G S+ + FGC
Sbjct: 154 CNAL---NSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSIQELAFGC 210
Query: 250 GRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGAS-GSLILGGN 306
G +N G F SG++GLG LSL+SQ FSYCL P + + S G ++ G N
Sbjct: 211 GNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDN 270
Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS------GFAKGGILIDSG 360
S + + T ++ + P+ TFY L L IS+G ++L KG I+IDSG
Sbjct: 271 SFISGSDTYVSTPLVSKEPE--TFYYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSG 328
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T +T L +Y+ L+ K G + I CF + +P++ + F +
Sbjct: 329 TTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKI--GIELPIITVHF---TD 383
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
V++ I F K++ +C + + I GN Q N V YD + + F D
Sbjct: 384 ADVELKPINTFAKAEEDLLCFTMIP---SNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTD 440
Query: 481 CS 482
CS
Sbjct: 441 CS 442
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 187/393 (47%), Gaps = 49/393 (12%)
Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD---PVFDPS 178
L SG + + Y + +G + +I+DTGSDLTW+QC P + N P +D S
Sbjct: 16 LVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKS 75
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
S SY+++ C C L G+S CS SP C+Y Y D S T G L E + +
Sbjct: 76 SSSSYREIPCTDDECLFLPAPIGSS--CSIKSPSPCDYTYGYSDQSRTTGILAYETISMK 133
Query: 238 -----GKAS---------VNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEI-F 281
GK + + + GC R + G F G SG++GLG+ +SL +QT
Sbjct: 134 SRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 193
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
GG+FSYCL + S ++ G + K + +T ++ NP +FY +N+TG+++
Sbjct: 194 GGIFSYCLVDYLRGSNASSFLVMGRTRWRK----LAHTPIVRNPAAQSFYYVNVTGVAVD 249
Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYS----ALKAE-FLKQFSGFPSA 388
GK + G G + DSGT ++ L YS AL A +L + P
Sbjct: 250 GKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE- 308
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
GF + C+N++ E +P + +EF+G A M + + V + C+AL ++
Sbjct: 309 -GFEL---CYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQ--CVALQKVTT 361
Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ + I+GN Q++ + YD +++GF C
Sbjct: 362 TNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 141/441 (31%), Positives = 206/441 (46%), Gaps = 52/441 (11%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
+T+EL H++ + + + + D L+ +L+S IS + + + T+ L S
Sbjct: 29 LTVELIHRDSPHSPLYN-----PHHTVSDRLNAAFLRS-----ISRSRRFTTKTD--LQS 76
Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G+ Y +I +G V I DTGSDLTWVQC+PC+ CY Q P+FD S +YK
Sbjct: 77 GLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYK 136
Query: 185 KVLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
C+S TC AL E G C S C Y SYGD S+T+G++ E + + +S +
Sbjct: 137 TESCDSKTCQALSEHEEG----CDESKDI-CKYRYSYGDNSFTKGDVATETISIDSSSGS 191
Query: 244 DF-----IFGCGRNNKGLFGGVSGLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDAGA 297
+FGCG NN G F + LSLVSQ G FSYCL T
Sbjct: 192 SVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTN 251
Query: 298 SGSLILGGNSSVFKNSTPITYTNMIP----NPQLATFYILNLTGISIGGKQLQASGFA-- 351
S+I G +S+ N + + T P +P+ T+Y L L +++G +L +G
Sbjct: 252 GTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE--TYYFLTLEAVTVGKTKLPYTGGGYG 309
Query: 352 --------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLS 401
G I+IDSGT +T L Y + +G S P +L CF S
Sbjct: 310 LNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQ-GLLTHCFK-S 367
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
+E+ +P + M F NA+ V ++ I FVK + VCL++ + E I GN Q
Sbjct: 368 GDKEIGLPAITMHFT-NAD--VKLSPINAFVKLNEDTVCLSMIPTT---EVAIYGNMVQM 421
Query: 462 NQRVIYDTKNSQLGFAGEDCS 482
+ V YD + + F DCS
Sbjct: 422 DFLVGYDLETKTVSFQRMDCS 442
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 172/354 (48%), Gaps = 38/354 (10%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
IVDTGSD+ W+QC+PC+ CYNQ P+F+PS S SYK + C S C ++E + N
Sbjct: 103 IVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNY-- 160
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIFGCGRNNKGLF-GGVSG 262
C Y YGD S++ G+L + L L S + + GCG NN + G SG
Sbjct: 161 -----CEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSG 215
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLP-----STQDAGASGSLILGGNSSVFKN---ST 314
++G G S ++Q GG FSYCL + + A+ L G ++V + +T
Sbjct: 216 IVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTT 275
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSI 370
PI + +P+ TFY L L S+G ++++ G +G I+IDSGT +T L
Sbjct: 276 PI----LKKDPE--TFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIIIDSGTTLTSLTKDD 329
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
YS L++ + L+ C+++ A + + P++ M F+G VD+ I
Sbjct: 330 YSFLESAVVDLVKLERVDDPTQTLNLCYSVKA-EGYDFPIITMHFKG---ADVDLHPIST 385
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
FV CLA S + I GN Q+N V YD + + F DC+ +
Sbjct: 386 FVSVADGVFCLAFES---SQDHAIFGNLAQQNLMVGYDLQQKIVSFKPSDCTKV 436
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 168/362 (46%), Gaps = 34/362 (9%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
L GI T N++ I +GG + +I D +D TW+QCQPC CY+Q D +FDPS S
Sbjct: 176 LNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSS 235
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
SY + C + C+ L NS S S C Y ++Y DG+ T G L E + +
Sbjct: 236 SYTLLSCETKHCNLLP----NS---SCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG 288
Query: 242 VNDFI-FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
D + GC N+G F G G GLGR LS S+ I SYCL ++D +S +
Sbjct: 289 WVDRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSR---INASSMSYCLVESKDGYSSST 345
Query: 301 LILGGNSSVFKNSTPIT---YTNMIPNPQLATFYILNLTGISIGGKQLQASG-------F 350
L NS P + ++ NP+ Y + L GI +GG+++ +
Sbjct: 346 LEF--------NSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPY 397
Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
GG+++ S ++IT L Y+ ++ F+ + F DTC+NLS+ V +P+
Sbjct: 398 GNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPI 457
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
++ E + +Y V + + C A A + I+G QQ RV +D
Sbjct: 458 LEFEVNDGKSWLLPKESYLYAVDKNGT-FCFAFAPS--KGSFSILGTLQQYGTRVTFDLV 514
Query: 471 NS 472
NS
Sbjct: 515 NS 516
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 121/377 (32%), Positives = 180/377 (47%), Gaps = 47/377 (12%)
Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L Y+ + +G + +T ++DTGSDL W QC C +C Q DP+F P +S SY+ + C
Sbjct: 96 LEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
C G+ S P C Y SYGDG+ T G E G+
Sbjct: 156 QLC-------GDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG 208
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSLI 302
FGCG N G SG++G GR LSLVSQ S FSYCL + S GSL
Sbjct: 209 FGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSI---RRFSYCLTPYASSRKSTLQFGSL- 264
Query: 303 LGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGG 354
+ ++ ++T P+ T ++ + Q TFY + TG+++G ++L+ AS FA GG
Sbjct: 265 --ADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGG 322
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAY--------QE 405
++IDSGT +T P ++ + + F Q P A G S D CF A ++
Sbjct: 323 VIIDSGTALTLFPVAVLAEVVRAFRSQLR-LPFANGSSPDDGVCFAAPAVAAGGGRMARQ 381
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQR 464
V +P + F+G +D+ Y ++ +C+ L D+ IGN+ Q++ R
Sbjct: 382 VAVPRMVFHFQG---ADLDLPRENYVLEDHRRGHLCVLLGDSG--DDGATIGNFVQQDMR 436
Query: 465 VIYDTKNSQLGFAGEDC 481
V+YD + L FA +C
Sbjct: 437 VVYDLERETLSFAPVEC 453
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 135/421 (32%), Positives = 210/421 (49%), Gaps = 62/421 (14%)
Query: 110 ISGNIKDVSNTEIPL-----TSGIR--LQTLNYIA--TIELG----GRNMTVIVDTGSDL 156
I ++D N + L TSG+R + L A +++LG +N++ I+DTGS+
Sbjct: 64 IQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLSAIIDTGSEA 123
Query: 157 TWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT--GNSGVCSSSSPPDC 214
VQC ++ PVFDP+ S SY++V C S C A++ T G+S C +SS C
Sbjct: 124 VLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSA-TC 176
Query: 215 NYFVSYGDGSYTRGELGREHLGL------GKA-SVNDFIFGCGRNNKGLFG--GVSGLMG 265
Y +SYGD + G+ ++ + L G+A D FGC + +G G G++G
Sbjct: 177 TYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVG 236
Query: 266 LGRSDLSLVSQTSEIFGG-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
R +LSL SQ + GG FSYC PS + +I G+S + K + + YT ++ N
Sbjct: 237 FNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSK--SKVGYTPLLDN 294
Query: 325 P---QLATFYILNLTGISIGGKQLQ--ASGF------AKGGILIDSGTVITRLPPSIYSA 373
P + Y + LT IS+ GK L S F GG ++DSGT TR+ Y+A
Sbjct: 295 PVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTA 354
Query: 374 LKAEF-------LKQFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVKMEFEGNAEMTVDV 425
+ F L++ G +A GF D C+N+SA + +P V++ + N + +
Sbjct: 355 FRNAFAASNRSGLRKKVG--AAAGF---DDCYNISAGSSLPGVPEVRLSLQNNVRLELRF 409
Query: 426 TGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ V + ++V + LA LS + + ++GNYQQ N V YD + S++GF DC
Sbjct: 410 EHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469
Query: 482 S 482
S
Sbjct: 470 S 470
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 169/369 (45%), Gaps = 41/369 (11%)
Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ + +G + I DTGSDLTW C PC CY Q++P+FDP S SY+ + C+S
Sbjct: 24 HYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSK 83
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
CH L+ +GVCS CNY +Y + T+G L +E + L + +
Sbjct: 84 LCHKLD-----TGVCSPQK--HCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIV 136
Query: 247 FGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLIL 303
FGCG NN G F G++GLG +S +SQ FGG FS CL P D S + L
Sbjct: 137 FGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSL 196
Query: 304 GGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG-----FAKGGI 355
G S V STP+ Q T Y + L GIS+G L +G KG +
Sbjct: 197 GKGSEVSGKGVVSTPLVAK------QDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGNV 250
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKM 413
+DSGT T LP +Y L A+ + + P + LD L + N+ P++
Sbjct: 251 FLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVT---NDLDLGPQLCYRTKNNLRGPVLTA 307
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
FEG V + FV CL + S + G+ GN+ Q N + +D
Sbjct: 308 HFEGG---DVKLLPTQTFVSPKDGVFCLGFTNTS--SDGGVYGNFAQSNYLIGFDLDRQV 362
Query: 474 LGFAGEDCS 482
+ F DC+
Sbjct: 363 VSFKPMDCT 371
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/375 (31%), Positives = 181/375 (48%), Gaps = 42/375 (11%)
Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L Y+ + +G + ++ ++DTGSDL W QC PC SC +Q DP+F P S SY+ + C
Sbjct: 94 LEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAG 153
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI---- 246
+ C + S P C Y +YGDG+ T G E +
Sbjct: 154 TLCSDILHH-------SCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTV 206
Query: 247 ---FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
FGCG N G SG++G GR+ LSLVSQ S FSYCL S S L
Sbjct: 207 PLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSI---RRFSYCLTSYASRRQSTLLFG 263
Query: 304 GGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGGI 355
+ V+ ++T + T ++ +PQ TFY ++ TG+++G ++L+ S FA GG+
Sbjct: 264 SLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGV 323
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNL-------SAYQEVN 407
++DSGT +T LP ++ + + F +Q P A G + D CF + S+ ++
Sbjct: 324 IVDSGTALTLLPAAVLAEVVRAFRQQLR-LPFANGGNPEDGVCFLVPAAWRRSSSTSQMP 382
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P + + F+G +D+ Y + ++CL LA D+ IGN Q++ RV+
Sbjct: 383 VPRMVLHFQG---ADLDLPRRNYVLDDHRRGRLCLLLADSG--DDGSTIGNLVQQDMRVL 437
Query: 467 YDTKNSQLGFAGEDC 481
YD + L A C
Sbjct: 438 YDLEAETLSIAPARC 452
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/384 (33%), Positives = 171/384 (44%), Gaps = 44/384 (11%)
Query: 123 PLTSGIRLQTLN---YIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
P+T+ L T + Y+ + +G + T I+DTGSDL W QC PC C +Q P FD
Sbjct: 74 PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDV 133
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S +Y+ + C SS C +L +S C C Y YGD + T G L E
Sbjct: 134 KKSATYRALPCRSSRCASL-----SSPSCFKKM---CVYQYYYGDTASTAGVLANETFTF 185
Query: 238 G-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
G K + FGCG N G SG++G GR LSLVSQ FSYCL S
Sbjct: 186 GAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGP---SRFSYCLTSY 242
Query: 293 QDAGASGSLILGGNSSVFKNST----PITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
A S L G +++ +T P+ T + NP L Y L+L IS+G K L
Sbjct: 243 LSATPS-RLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPID 301
Query: 349 GFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL 400
GG++IDSGT IT L Y A++ + P+ I LDTCF
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIP-LPAMNDTDIGLDTCFQW 360
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---IIGN 457
V + + + F ++ + + S +CL +A TG IIGN
Sbjct: 361 PPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLVMA------PTGVGTIIGN 414
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
YQQ+N ++YD NS L F C
Sbjct: 415 YQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 184/402 (45%), Gaps = 49/402 (12%)
Query: 109 MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKS 166
+I N V I + + + +Y+ + +G + VDTGSDL W+QC PC +
Sbjct: 33 LIPRNSSQVLFNRITAQTPVSVHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTN 92
Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDG 223
CY Q +P+FDP S +Y + S +C S + S+S PD CNY SY D
Sbjct: 93 CYKQLNPMFDPQSSSTYSNIAYGSESC---------SKLYSTSCSPDQNNCNYTYSYEDD 143
Query: 224 SYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQT 277
S T G L +E L L GK ++ IFGCG NN G+F G++GLGR LSLVSQ
Sbjct: 144 SITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQI 203
Query: 278 SEIFGG-LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYI 332
FGG +FS CL P + + + G S V N STP+ N FY
Sbjct: 204 GSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNT-----HQAFYF 258
Query: 333 LNLTGISIGGKQLQASG------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--G 384
+ L GIS+ L + KG ++IDSGT T LP Y L E + +
Sbjct: 259 VTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDP 318
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIP--LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
P P + L N+ + FEG A++ + T I F+ C A
Sbjct: 319 IPIDPTLG-----YQLCYRTPTNLKGTTLTAHFEG-ADVLLTPTQI--FIPVQDGIFCFA 370
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
S ++ +E GI GN+ Q N + +D + + F DC+++
Sbjct: 371 FTS-TFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDCTNL 411
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 170/370 (45%), Gaps = 49/370 (13%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ +DT SDL W+QCQPC SCY Q DPVF+P +S SY V C S TC L+ G+ C
Sbjct: 106 SAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLD---GHR--C 160
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVSGLMG 265
C Y Y T+G L + L +G + +FGC ++ G SGL+G
Sbjct: 161 HEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVG 220
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
LGR LSLVSQ S F YCLP + SG L+LG + +N + M +
Sbjct: 221 LGRGPLSLVSQLSV---HRFMYCLPPPM-SRTSGKLVLGAGADAVRNMSDRVTVTMSSST 276
Query: 326 QLATFYILNLTGISIGGKQLQASGFAKG--------------------------GILIDS 359
+ ++Y LNL G+++G + + A G+++D
Sbjct: 277 RYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDV 336
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEF 415
+ I+ L S+Y L + ++ + P + LD CF L V +P V + F
Sbjct: 337 ASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF 396
Query: 416 EGN-AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
+G E+ D FV +D +CL + S I+GN+Q +N RV+++ + ++
Sbjct: 397 DGRWLELDRD----RLFV-TDGRMMCLMIGRTS---GVSILGNFQLQNMRVLFNLRRGKI 448
Query: 475 GFAGEDCSSM 484
FA C S+
Sbjct: 449 TFAKASCDSL 458
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 125/418 (29%), Positives = 198/418 (47%), Gaps = 46/418 (11%)
Query: 78 SGKIVDWNEQQQNRLI-LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNY 135
+G + D + +RL+ LD+L V+ P+ SG +L QT Y
Sbjct: 65 AGFLADQAARDASRLLYLDSLAVK-----------------GRAYAPIASGRQLLQTPTY 107
Query: 136 IATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
+ LG + + + VDT +D W+ C C C F+P+ S SY+ V C S C
Sbjct: 108 VVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQC 165
Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
+ CS ++ C + +SY D S + L ++ L + V + FGC +
Sbjct: 166 -----VLAPNPSCSPNAK-SCGFSLSYADSSL-QAALSQDTLAVAGDVVKAYTFGCLQRA 218
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
G GL+GLGR LS +SQT +++G FSYCLPS + SG+L LG N +
Sbjct: 219 TGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRR-- 276
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRL 366
I T ++ NP ++ Y +N+TGI +G K + AS A G ++DSGT+ TRL
Sbjct: 277 --IKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 334
Query: 367 PPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
+Y AL+ E ++ +G + DTC+N + V P V + F+G +
Sbjct: 335 VAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTT----VAWPPVTLLFDGMQVTLPEE 390
Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+++ S + +A A +I + QQ+N RV++D N ++GFA E C++
Sbjct: 391 NVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCTA 448
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 122/376 (32%), Positives = 189/376 (50%), Gaps = 47/376 (12%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSS 191
++ T+ +G + I DTGSDL W QC PC + C+ Q P+++PS S ++ + CNSS
Sbjct: 85 FLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS 144
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI----- 246
G+C+ + C Y ++YG G +T G E G ++ D +
Sbjct: 145 L-----------GLCAPAC--ACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGI 190
Query: 247 -FGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGC + G SGL+GLGR LSLVSQ + FSYCL QD ++ +L+LG
Sbjct: 191 AFGCSNASSGFNASSASGLVGLGRGSLSLVSQ---LGAPKFSYCLTPYQDTNSTSTLLLG 247
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILI 357
++S+ ++ ++ T + +P + +Y LNLTGIS+G L A GG++I
Sbjct: 248 PSASL-NDTGVVSSTPFVASPS-SIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLII 305
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIPLVKM 413
DSGT IT L + Y ++A L + P+ G + LD CF L S ++P + +
Sbjct: 306 DSGTTITMLGNTAYQQVRAAVLSLVT-LPTTDGSAATGLDLCFELPSSTSAPPSMPSMTL 364
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQV---CLALASLSYED--ETGIIGNYQQKNQRVIYD 468
F+G A+M + + + S CLA+ + + D I+GNYQQ+N ++YD
Sbjct: 365 HFDG-ADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYD 423
Query: 469 TKNSQLGFAGEDCSSM 484
L FA CS++
Sbjct: 424 VGKETLSFAPAKCSTL 439
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 129/422 (30%), Positives = 191/422 (45%), Gaps = 51/422 (12%)
Query: 92 LILDNLHVQYLQSRIKNMISGN-----------IKDVSNTEIPLT--SGIRLQTLNYIAT 138
L+L LH+ + N+I N + ++S E LT S I +Y+
Sbjct: 16 LMLLPLHISATEGFSVNLIRKNSSHAHVLPLRRLMELSAMEKTLTPQSPIYAYLGHYLME 75
Query: 139 IELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
+ +G + I DTGSDLTW C PC +CY Q++P+FDP S +Y+ + C+S CH L
Sbjct: 76 LSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLCHKL 135
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKA-SVNDFIFGCGR 251
+ +GVCS CNY +Y + TRG L +E + L GK+ + +FGCG
Sbjct: 136 D-----TGVCSPQK--RCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGH 188
Query: 252 NNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLILGGNSS 308
NN G F G++GLG +SL+SQ FGG FS CL P D S + G S
Sbjct: 189 NNTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSK 248
Query: 309 VFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG----FAKGGILIDSGT 361
V STP+ Q T Y + L GIS+ L +G KG + +DSGT
Sbjct: 249 VSGKGVVSTPLVAK------QDKTPYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSGT 302
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T LP +Y + A+ + + P + C+ + P++ FEG
Sbjct: 303 PPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTK--NNLRGPVLTAHFEG--- 357
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
V ++ F+ CL + S + G+ GN+ Q N + +D + F +D
Sbjct: 358 ADVKLSPTQTFISPKDGVFCLGFTNTS--SDGGVYGNFAQSNYLIGFDLDRQVVSFKPKD 415
Query: 481 CS 482
C+
Sbjct: 416 CT 417
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 183/372 (49%), Gaps = 28/372 (7%)
Query: 123 PLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
P+ SG +L QT Y+ LG + + + VDT +D W+ C C C F+P+
Sbjct: 41 PIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAA 98
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S SY+ V C S C + CS ++ C + +SY D S + L ++ L +
Sbjct: 99 SASYRPVPCGSPQC-----VLAPNPSCSPNAK-SCGFSLSYADSSL-QAALSQDTLAVAG 151
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
V + FGC + G GL+GLGR LS +SQT +++G FSYCLPS + SG
Sbjct: 152 DVVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSG 211
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----K 352
+L LG N + I T ++ NP ++ Y +N+TGI +G K + AS A
Sbjct: 212 TLRLGRNGQPRR----IKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATG 267
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
G ++DSGT+ TRL +Y AL+ E ++ +G + DTC+N + V P V
Sbjct: 268 AGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTT----VAWPPV 323
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ F+G + +++ S + +A A +I + QQ+N RV++D N
Sbjct: 324 TLLFDGMQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPN 383
Query: 472 SQLGFAGEDCSS 483
++GFA E C++
Sbjct: 384 GRVGFARESCTA 395
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/291 (36%), Positives = 151/291 (51%), Gaps = 34/291 (11%)
Query: 206 CSSSSPP-----------DCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNN 253
C+ SSPP C + +SY DG+ T G ++ L L A V +F FGCG
Sbjct: 18 CARSSPPMRTAAAVTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGCGHGK 77
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
+ G G++GLGR SL ++ +GG+FSYCLPS G L LG KN
Sbjct: 78 HAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVSSK--PGFLALGAG----KNP 127
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIY 371
+ +T M P TF + L GI++GGK+L + S F+ GG+++DSGTVIT L + Y
Sbjct: 128 SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS-GGMIVDSGTVITGLQSTAY 186
Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVY 430
AL++ F K + P LDTC+NL+ Y+ V +P + + F G A + +DV GI+
Sbjct: 187 RALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILV 245
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA A + G++GN Q+ V++DT S+ GF + C
Sbjct: 246 -------NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 186/377 (49%), Gaps = 33/377 (8%)
Query: 122 IPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+P+ G +L ++ +Y+A LG + + V +D +D WV C + P FDP+
Sbjct: 93 VPIAPGRQLLSIPSYVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPT 150
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +Y+ V C + C + G+ SS C + +SY ++ + LG++ L L
Sbjct: 151 RSSTYRPVRCGAPQCSQAPAPSCPGGLGSS-----CAFNLSYAASTF-QALLGQDALALH 204
Query: 239 KA--SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
+V + FGC G GL+G GR LS SQT +++G +FSYCLPS + +
Sbjct: 205 DDVDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSN 264
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG--- 353
SG+L LG + I T ++ NP + Y +N+ GI +GG+ + A
Sbjct: 265 FSGTLRLGPAGQPKR----IKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDP 320
Query: 354 ----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
G ++D+GT+ TRL +Y+A++ F + P A DTC+N++ +++P
Sbjct: 321 TSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRA-PVAGPLGGFDTCYNVT----ISVP 375
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS---YEDETGIIGNYQQKNQRVI 466
V F+G +T+ +V S CLA+A+ + ++ + QQ+N RV+
Sbjct: 376 TVTFSFDGRVSVTLPEENVV-IRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVL 434
Query: 467 YDTKNSQLGFAGEDCSS 483
+D N ++GF+ E C++
Sbjct: 435 FDVANGRVGFSRELCTA 451
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 135/442 (30%), Positives = 202/442 (45%), Gaps = 52/442 (11%)
Query: 65 GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMIS--GNIKDVSNTEI 122
T EL H++ S K +N QQ H+Q ++ +S + + + T
Sbjct: 29 AGFTTELVHRD--SPKSPLYNSQQT--------HLQRWNKAMRRSVSRVHHFQRTAATVS 78
Query: 123 P--LTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
P + S I Y+ ++ LG + I DTGSDL W QC PC CY Q P+FDP
Sbjct: 79 PKEVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPK 138
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
S +Y+ + C++ C L G S CSS C Y YGD S+T G L + + L
Sbjct: 139 SSKTYRDLSCDTRQCQNL----GESSSCSSEQL--CQYSYYYGDRSFTNGNLAVDTVTLP 192
Query: 238 ----GKASVNDFIFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLP-- 290
G + GCGR N G F SG++GLG +SL+SQ GG FSYCL
Sbjct: 193 STNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPF 252
Query: 291 STQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQL-- 345
S++ AG S L G N+ V + STP+ + NP TFY L L +S+G K++
Sbjct: 253 SSESAGNSSKLHFGRNAVVSGSGVQSTPL----ISKNPD--TFYYLTLEAMSVGDKKIEF 306
Query: 346 --QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ-FSGFPSAPGFSILDTCFNLSA 402
+ G ++G I+IDSGT +T P + ++ +G + +L C+ +
Sbjct: 307 GGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTP 366
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
++ +P++ F G A++ + + D +CLA S I GN Q N
Sbjct: 367 --DLKVPVITAHFNG-ADVVLQTLNTFILISDDV--LCLAFNS---TQSGAIFGNVAQMN 418
Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
+ YD + + F DC+ +
Sbjct: 419 FLIGYDIQGKSVSFKPTDCTQL 440
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 178/359 (49%), Gaps = 57/359 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQ--PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
+ + + +DTGSD+TW QC+ P +C+NQ P+FDPS S S+ + C+S C G
Sbjct: 99 QEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACETTPPCGG 158
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL----GLGK---ASVNDFIFGCGRNNK 254
G ++S P CNY +SYGDGS +RGE+GRE G G+ A+V +FGCG N+
Sbjct: 159 --GNDATSRP--CNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANR 214
Query: 255 GLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
G+F +G+ G GR LSL SQ + G FS+C +T + +++LG ++
Sbjct: 215 GVFTSNETGIAGFGRGSLSLPSQ---LKVGNFSHCF-TTITGSKTSAVLLGLPGVAPPSA 270
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSA 373
+P+ G G + +++ + +SGT IT LPP Y A
Sbjct: 271 SPL--------------------GRRRGSYRCRSTPRSS-----NSGTSITSLPPRTYRA 305
Query: 374 LKAEFLKQFSGFPSAPGFSILD-TCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
++ EF Q P PG + TCF+ + ++P + + FEG A M + V+
Sbjct: 306 VREEFAAQVK-LPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEG-ATMRLPQENYVFE 363
Query: 432 VKSDASQ------VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V D +CLA+ E I+GN QQ+N V+YD +NS+L F C +
Sbjct: 364 VVDDDDAGNSSRIICLAV----IEGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQCDQL 418
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 134/448 (29%), Positives = 202/448 (45%), Gaps = 52/448 (11%)
Query: 55 VSH-QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
V+H + ++ G +++L H++ + + +E RL D +++ S + IS N
Sbjct: 22 VAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL--DRFFRRFM-SFSEASISPN 78
Query: 114 IKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
E P++S Y+ I +G V I DTGSDL W QC PC SCY Q+
Sbjct: 79 -----TPEPPVSS----NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQK 129
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGEL 230
+P+FDPS S S+K+V C S C L+ S S P C++ YGDGS +G +
Sbjct: 130 NPMFDPSKSTSFKEVSCESQQCRLLD-------TVSCSQPQKLCDFSYGYGDGSLAQGVI 182
Query: 231 GREHLGLG-----KASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG- 283
E L L S+ + +FGCG NN G F GL G G LSL SQ G
Sbjct: 183 ATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSG 242
Query: 284 -LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGI 338
FS CL P D + +I G + V + STP+ + +P T+Y + L GI
Sbjct: 243 RKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKD---DP---TYYFVTLDGI 296
Query: 339 SIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
S+G K S + KG + ID+GT T LP Y+ L + P
Sbjct: 297 SVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP 356
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
C+ + ++ P++ F+G V + + F+ C A+ + + +TGI
Sbjct: 357 QLCYRSATL--IDGPILTAHFDG---ADVQLKPLNTFISPKEGVYCFAMQPI--DGDTGI 409
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
GN+ Q N + +D ++ F DC+
Sbjct: 410 FGNFVQMNFLIGFDLDGKKVSFKAVDCT 437
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 120/373 (32%), Positives = 188/373 (50%), Gaps = 49/373 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT--G 201
+N++ I+DTGS+ VQC ++ PVFDP+ S SY++V C S C A++ T G
Sbjct: 10 KNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNG 63
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-------VNDFIFGCGRNNK 254
+S C +SS C Y +SYGD + G+ ++ + L + D FGC + +
Sbjct: 64 SSQPCVNSSAA-CTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQ 122
Query: 255 GLFG--GVSGLMGLGRSDLSLVSQTSEIFGG-LFSYCLPSTQDAGASGSLILGGNSSVFK 311
G G G++G R +LSL SQ + GG FSYC PS + +I G+S + K
Sbjct: 123 GFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSK 182
Query: 312 NSTPITYTNMIPN---PQLATFYILNLTGISIGGKQLQ--ASGF------AKGGILIDSG 360
+ ++YT ++ N P + Y + LT IS+ GK L S F GG ++DSG
Sbjct: 183 --SKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSG 240
Query: 361 TVITRLPPSIYSALKAEF-------LKQFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVK 412
T TR+ Y+A + F L++ G +A GF D C+N+SA + +P V+
Sbjct: 241 TTFTRVVDDAYTAFRNAFAASNRSGLRKKVG--AAAGF---DDCYNISAGSSLPGVPEVR 295
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYD 468
+ + N + + + V + ++V + LA LS + + ++GNYQQ N V YD
Sbjct: 296 LSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYD 355
Query: 469 TKNSQLGFAGEDC 481
+ S++GF DC
Sbjct: 356 NERSRVGFERADC 368
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 132/450 (29%), Positives = 210/450 (46%), Gaps = 43/450 (9%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
S S +S +++R + +++L H++ S + + R+I L R+ + +
Sbjct: 13 SLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQRVSHFL 72
Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCY 168
N K + IP Y+ +G + +VDTGS L W+QC PC +C+
Sbjct: 73 DEN-KLPESLLIP-------DKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCF 124
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
Q+ P+F+P S +YK C+S C L+ + + G C Y + YGD S++ G
Sbjct: 125 PQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLG-----QCIYGIMYGDKSFSVG 179
Query: 229 ELGREHLGLGK------ASVNDFIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE 279
LG E L G S + IFGCG +N V G+ GLG LSLVSQ
Sbjct: 180 ILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGA 239
Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
G FSYCL D+ ++ L G + + N + T +I P L T+Y LNL ++
Sbjct: 240 QIGHKFSYCL-LPYDSTSTSKLKFGSEAIITTNG--VVSTPLIIKPSLPTYYFLNLEAVT 296
Query: 340 IGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEF-----LKQFSGFPSAPGFSIL 394
IG K + ++G G I+IDSGT +T L + Y+ A +K PS L
Sbjct: 297 IGQK-VVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSP-----L 350
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
TCF A + IP + +F G A + + ++ + +D++ +CLA+ S +
Sbjct: 351 KTCFPNRA--NLAIPDIAFQFTG-ASVALRPKNVLIPL-TDSNILCLAVVP-SSGIGISL 405
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
G+ Q + +V YD + ++ FA DC+ +
Sbjct: 406 FGSIAQYDFQVEYDLEGKKVSFAPTDCAKV 435
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 134/448 (29%), Positives = 202/448 (45%), Gaps = 52/448 (11%)
Query: 55 VSH-QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
V+H + ++ G +++L H++ + + +E RL D +++ S + IS N
Sbjct: 22 VAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL--DRFFRRFM-SFSEASISPN 78
Query: 114 IKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
E P++S Y+ I +G V I DTGSDL W QC PC SCY Q+
Sbjct: 79 -----TPEPPVSS----NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQK 129
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGEL 230
+P+FDPS S S+K+V C S C L+ S S P C++ YGDGS +G +
Sbjct: 130 NPMFDPSKSTSFKEVSCESQQCRLLD-------TVSCSQPQKLCDFSYGYGDGSLAQGVI 182
Query: 231 GREHLGLG-----KASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG- 283
E L L S+ + +FGCG NN G F GL G G LSL SQ G
Sbjct: 183 ATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSG 242
Query: 284 -LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGI 338
FS CL P D + +I G + V + STP+ + +P T+Y + L GI
Sbjct: 243 RKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKD---DP---TYYFVTLDGI 296
Query: 339 SIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
S+G K S + KG + ID+GT T LP Y+ L + P
Sbjct: 297 SVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP 356
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
C+ + ++ P++ F+G V + + F+ C A+ + + +TGI
Sbjct: 357 QLCYRSATL--IDGPILTAHFDG---ADVQLKPLNTFISPKEGVYCFAMQPI--DGDTGI 409
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
GN+ Q N + +D ++ F DC+
Sbjct: 410 FGNFVQMNFLIGFDLDGKKVSFKAVDCT 437
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 165/349 (47%), Gaps = 33/349 (9%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
++DT +D W QC PCK C+N P+FDPS S +YK + C+S C +E + CSS
Sbjct: 105 VMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPKCKNVE-----NTHCSS 159
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIFGCGRNNKG-LFGGVSG 262
C Y +YG +Y++G+L + L L S + + GCG NKG L G VSG
Sbjct: 160 DDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSG 219
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVF---KNSTPITY 318
+GLGR LS +SQ + GG FSYCL P + G SG L G S V STPIT
Sbjct: 220 NIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITA 279
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-----GGILIDSGTVITRLPPSIYSA 373
+ Y L +S+G ++ G +IDSGT +T LP ++YS
Sbjct: 280 GEI--------GYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGTTLTILPENVYSR 331
Query: 374 LKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
L++ + C+ + + +++P++ F G V + + F
Sbjct: 332 LESIVTSMVKLERAKSPNQQFKLCYK-ATLKNLDVPIITAHFNG---ADVHLNSLNTFYP 387
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
D VC A S+ T IIGN Q+N V +D + + + F DC+
Sbjct: 388 IDHEVVCFAFVSVGNFPGT-IIGNIAQQNFLVGFDLQKNIISFKPTDCT 435
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/420 (28%), Positives = 192/420 (45%), Gaps = 47/420 (11%)
Query: 76 YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLN 134
+ K + W E D +Q+L S + + +P+ SG ++ Q+
Sbjct: 46 FWPSKPLKWEESVLQMQAKDQARLQFLSSLVAR----------KSVVPIASGRQIVQSPT 95
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
YI ++G + M + +DT +D W+ C C C + VF+ S ++K V C +
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSST---VFNNVKSTTFKTVGCEAPQ 152
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
C + + C S+ C + ++YG S L ++ + L S+ + FGC
Sbjct: 153 CKQVP-----NSKCGGSA---CAFNMTYGSSSIA-ANLSQDVVTLATDSIPSYTFGCLTE 203
Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
G GL+GLGR +SL+SQT ++ FSYCLPS + SGSL LG +
Sbjct: 204 ATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKR- 262
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITR 365
I T ++ NP+ ++ Y +NL I +G + + S A G + DSGTV TR
Sbjct: 263 ---IKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTR 319
Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
L Y+A++ F K+ G + DTC+ + P + F G M V +
Sbjct: 320 LVAPAYTAVRDAFRKRV-GNATVTSLGGFDTCYT----SPIVAPTITFMFSG---MNVTL 371
Query: 426 TGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S AS + CLA+A+ + +I N QQ+N R+++D NS+LG A E C+
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 177/388 (45%), Gaps = 54/388 (13%)
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNS 190
YIA +G + I+DTGS+L W QC C+ +C+ Q P +DPS S + + V CN
Sbjct: 70 QYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCND 129
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
+ C A G+ C S + C YG G+ G L E+L +V+ +FGC
Sbjct: 130 AAC-----ALGSETQCLSDNK-TCAVVTGYGAGNIA-GTLATENLTFQSETVS-LVFGCI 181
Query: 250 --GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---------PSTQDAGAS 298
+ + G G SG++GLGR LSL SQ + FSYCL PS GAS
Sbjct: 182 VVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDT---RFSYCLTPYFEDTIEPSHMVVGAS 238
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA---- 351
LI G SS TP+T + +P +TFY L LTGI+ G +L A
Sbjct: 239 AGLINGSASS-----TPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLR 293
Query: 352 ------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCFNLSAY 403
G IDSG +T L Y AL+AE +Q P G + D C L
Sbjct: 294 QVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALKDA 353
Query: 404 QEVNIPLVKMEFEGNAEMTVD--VTGIVYFVKSDASQVCLALASLSYE-----DETGIIG 456
+ + PLV + F G + D V Y+ D++ C+ + S +ET +IG
Sbjct: 354 ERLVPPLV-LHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIG 412
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
NY Q+N V+YD L F DCSS+
Sbjct: 413 NYMQQNMHVLYDLAGGVLSFQPADCSSI 440
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 123/368 (33%), Positives = 174/368 (47%), Gaps = 49/368 (13%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ T Y+ + +G + + + +DTGSDL W QCQPC +C++Q P FDPS S +
Sbjct: 84 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 143
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
C+S+ C L A S P + F G G ASV F
Sbjct: 144 CDSTLCQGLPVA----------SLPRSDKFTFVGAG----------------ASVPGVAF 177
Query: 248 GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
GCG N G+F +G+ G GR LSL SQ G FS+C +T +++L
Sbjct: 178 GCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLLDLP 233
Query: 307 SSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDS 359
+ +F N + T +I NP TFY L+L GI++G +L S FA GG +IDS
Sbjct: 234 ADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDS 293
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEFEG 417
GT +T LP +Y ++ F Q P G + D F LSA +P + + FEG
Sbjct: 294 GTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHFEG 351
Query: 418 NAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
A M + V+ V+ S + CLA+ E IGN+QQ+N V+YD +NS+L F
Sbjct: 352 -ATMDLPRENYVFEVEDAGSSILCLAIIE---GGEVTTIGNFQQQNMHVLYDLQNSKLSF 407
Query: 477 AGEDCSSM 484
C +
Sbjct: 408 VPAQCDKL 415
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 119/429 (27%), Positives = 198/429 (46%), Gaps = 44/429 (10%)
Query: 82 VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK---------DVSNTEIPLTSGIRLQT 132
V+ +R N+ LQ RI N+++ +IK +S+ ++P + I
Sbjct: 29 VELIHPDSSRSPFYNIRETQLQ-RISNVVTHSIKRAHYLNHVFSLSHNDLPKPTIIPYAG 87
Query: 133 LNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y+ + +G + +VDTGSD W QC+PCK C NQ P+F+PS S +YK + C+S
Sbjct: 88 SYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSS 147
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDF 245
C G CSS+ C Y ++Y D S ++G++ ++ L L S
Sbjct: 148 PIC-----KRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKI 202
Query: 246 IFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
+ GCG N G+ SG++G GR + S+VSQ GG FSYCL S S +
Sbjct: 203 VIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYF 262
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYI----LNLTGISIGGK--QLQASGFA---KGGI 355
G+ +V ++ ++ P + +FY+ NL S+G +L+ S +G
Sbjct: 263 GDMAV------VSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNA 316
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+IDSG+ IT+LP +YS L+ + L C+ + ++ +P++ F
Sbjct: 317 VIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYK-TTLKKYEVPIITAHF 375
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
G V + F++ + +C A S ++ + GN Q+N V YDT + +
Sbjct: 376 RG---ADVKLNAFNTFIQMNHEVMCFAFNSSAF--PWVVYGNIAQQNFLVGYDTLKNIIS 430
Query: 476 FAGEDCSSM 484
F +C+ +
Sbjct: 431 FKPTNCTKL 439
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 114/360 (31%), Positives = 171/360 (47%), Gaps = 47/360 (13%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
VDTGSD+ W QC+PC C+ Q P FD S S + VLC C AL G
Sbjct: 110 VDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPICRALRPHACFLG----- 164
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHL-----GLGKASVNDFIFGCGRNNKGLF-GGVSGL 263
C Y V+YGD S T G+L ++ G GK +V D +FGCG+ N G F +G+
Sbjct: 165 ---GCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNTGNFHSNETGI 221
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNM 321
G GR LSL Q FSYC + ++ ++ + G + + PI T
Sbjct: 222 AGFGRGPLSLPRQLGV---SSFSYCFTTIFESKSTPVFLGGAPADGLRAHATGPILSTPF 278
Query: 322 IPNPQLATFYILNLTGISIGGKQLQA--SGF-----AKGGILIDSGTVITRLPPSIYSAL 374
+PN +Y L+L GI++G +L S F GG +IDSGT IT P +++ +L
Sbjct: 279 LPN--HPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSL 336
Query: 375 KAEFLKQFSGFPSAPGFSILDT------CFNLSAYQE---VNIPLVKMEFEG-NAEMTVD 424
F+ Q P S DT CF+ + + V +P + + EG + E+ +
Sbjct: 337 WEAFVAQV----PLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLHLEGADWELPRE 392
Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y D+ Q+C+ + L+ +D+ +IGN+QQ+N +++D ++L C M
Sbjct: 393 NYMAEY---PDSDQLCVVV--LAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQCDKM 447
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 119/377 (31%), Positives = 185/377 (49%), Gaps = 43/377 (11%)
Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L++ T+ +G + T+I+DTGSDL W QC+ + +++ P++DP+ S S+ C+
Sbjct: 87 LHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDG 146
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIF 247
C F T N CS + C Y +YG + T+GEL E G + SV+ F
Sbjct: 147 RLCETGSFNTKN---CSRNK---CIYTYNYGSAT-TKGELASETFTFGEHRRVSVS-LDF 198
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
GCG+ G G SG++G+ LSLVSQ FSYCL D + + G +
Sbjct: 199 GCGKLTSGSLPGASGILGISPDRLSLVSQLQI---PRFSYCLTPFLDRNTTSHIFFGAMA 255
Query: 308 --SVFKNSTPITYTNMIPNPQLAT-FYILNLTGISIGGKQLQ--ASGFA-----KGGILI 357
S ++ + PI T+++ NP + +Y + L GIS+G K+L S FA GG +
Sbjct: 256 DLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFV 315
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFP----SAPGFSILDTCFNL------SAYQEVN 407
DSG LP + ALK E + + P + G+ + CF L + V
Sbjct: 316 DSGDTTGMLPSVVMEALK-EAMVEAVKLPVVNATDHGYE-YELCFQLPRNGGGAVETAVQ 373
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
+P + F+G A M + Y V+ A ++CL ++S + IIGNYQQ+N V++
Sbjct: 374 VPPLVYHFDGGAAMLLRRDS--YMVEVSAGRMCLVISSGA---RGAIIGNYQQQNMHVLF 428
Query: 468 DTKNSQLGFAGEDCSSM 484
D +N + FA C+ +
Sbjct: 429 DVENHEFSFAPTQCNQI 445
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 177/371 (47%), Gaps = 33/371 (8%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y +I+LG G+ +IVDTGS+LTW+QC PCK C D ++D + S SY+ V CN+S
Sbjct: 99 EYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNS 158
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDF 245
+ + G C+ S C + YGDGS++ G L + L G +V DF
Sbjct: 159 QLCS-NSSQGTYAYCARGS--QCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDF 215
Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGC + + L G SG++GL ++L Q + FG FS+C P S ++
Sbjct: 216 AFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVI 363
GN+ + T + + FY + L G+SI +L +G ++I DSG+
Sbjct: 276 GNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVF--LPRGSVVILDSGSSF 333
Query: 364 TRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLS--AYQEVN--IPLVKME 414
+ +S L+ FLK PS F L TCF +S E++ +P + +
Sbjct: 334 SSFVRPFHSQLREAFLKHRP--PSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLV 391
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYDTK 470
FE + + G++ V + V + A +ED +IGNYQQ+N V YD +
Sbjct: 392 FEDGVTIGIPSIGVLLPVARFQNHVKMCFA---FEDGGPNPVNVIGNYQQQNLWVEYDIQ 448
Query: 471 NSQLGFAGEDC 481
S++GFA C
Sbjct: 449 RSRVGFARASC 459
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 124/375 (33%), Positives = 173/375 (46%), Gaps = 56/375 (14%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQ--QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
T+ +DT D+ W+QC+PC ++ +FDP+ S S V C S C AL GN G
Sbjct: 166 TMAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRAL----GNYG 221
Query: 205 ---------------VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFG 248
S++S DCNY V+Y DG + G + L + S +F FG
Sbjct: 222 NGCSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFLNFRFG 281
Query: 249 CGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
C +G F G SG M LG SL+SQT+ +G FSYC+P ASG L LGG
Sbjct: 282 CSHGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYCVPKPS---ASGFLSLGGAI 338
Query: 308 SVFKN---------STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILI 357
+ + +TP+ I NP T+Y++ L GI + G++L GG L+
Sbjct: 339 NDGDSDSDSPSSFVTTPLMRNARIVNP---TYYVVRLQGIDVAGRRLNVPPVVFSGGTLM 395
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGF-----------PSAPGFSILDTCFNLSAYQEV 406
DS V+T+LPP+ Y AL+ F G+ A G ILDTC++ V
Sbjct: 396 DSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTSSTPAGGEMILDTCYDFEGLDNV 455
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+P V + F G A + +D T V + CLA + + G IGN QQ+ V+
Sbjct: 456 TVPTVSLVFFGGAVVDLDPTTAVMM------EGCLAFVPTPADFDLGFIGNVQQQTHEVL 509
Query: 467 YDTKNSQLGFAGEDC 481
YD +GF C
Sbjct: 510 YDVGARNVGFRRGAC 524
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 170/356 (47%), Gaps = 39/356 (10%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCY--NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ ++DTGSDL W++C C C + + +F S SYKK+ CNS+ C + A G
Sbjct: 18 IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNSTHCSGMSSA-GIG 76
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-----GLG---KASVNDFIFGCGRNNKG 255
C + C Y YGDGS T G++G + + G G ++ + F+FGCGR KG
Sbjct: 77 PRCEET----CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFGCGRKLKG 132
Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN--- 312
+ GL+GLG+ SL+ Q + G FSYCL S ++ S + G+S+ +
Sbjct: 133 DWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSSAALRGHDV 192
Query: 313 -STPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASG-------FAKGGILIDSG 360
STPI + + + T Y ++L I++GG + + SG F +IDSG
Sbjct: 193 VSTPILHGDHLDQ----TLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFLANKTVIDSG 248
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T T L P +Y A++ +Q P+ + LD CFN S P V F +
Sbjct: 249 TTYTLLTPPVYEAMRKSIEEQVI-LPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQ 307
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
+ + I D VCL++ S + IIGN QQ+N ++YD SQ+ F
Sbjct: 308 LVLPFENIFQVTSRDV--VCLSMDSSG--GDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 127/391 (32%), Positives = 175/391 (44%), Gaps = 26/391 (6%)
Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQC 161
L + ++ S NI+D+ I G L L YI T + ++ VDTGSDL WVQC
Sbjct: 37 LIRKSSHLSSNNIQDIVQAPINAYIGQYLMEL-YIGTPPI---KISGTVDTGSDLIWVQC 92
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
PC CYNQ +P+FDP S +Y + C+S C+ G CS C+Y Y
Sbjct: 93 VPCLGCYNQINPMFDPLKSSTYTNISCDSPLCYKPYI-----GECSPEK--RCDYTYGYA 145
Query: 222 DGSYTRGELGREHLGL----GKA-SVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVS 275
D S T+G L +E + L GK S+ +FGCG NN G F GL+GLG SLVS
Sbjct: 146 DSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVS 205
Query: 276 QTSEIFGG-LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
Q +FGG FS CL P D S + G S V + T ++ Q T Y +
Sbjct: 206 QIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEG--VVTTPLVQREQDMTSYYV 263
Query: 334 NLTGISIGGKQLQA-SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
L GIS+ L S KG +L+DSGT LP +Y + E + P S
Sbjct: 264 TLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPS 323
Query: 393 I-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
+ C+ + P + FEG + + + CLA+ + + D
Sbjct: 324 LGPQLCYRTQT--NLKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDP 381
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
GI GN+ Q N + +D + F DC+
Sbjct: 382 -GIYGNFAQTNYLIGFDLDRQIVSFKPTDCT 411
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 118/374 (31%), Positives = 175/374 (46%), Gaps = 41/374 (10%)
Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L Y+ + +G + ++ ++DTGSDL W QC PC SC Q DP+F P S SY+ + C
Sbjct: 102 LEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAG 161
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN------- 243
C+ + S P C Y SYGDG+ TRG E +S
Sbjct: 162 ELCNDILHH-------SCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLS 214
Query: 244 -DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
FGCG NKG SG++G GR+ LSLVSQ + FSYCL + +G +L+
Sbjct: 215 APLGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIR---RFSYCL-TPYASGRKSTLL 270
Query: 303 LGG-NSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KG 353
G V+ +T + T ++ + Q TFY + TG+++G ++L+ S FA G
Sbjct: 271 FGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSG 330
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAY---QEVNI 408
G ++DSGT +T P + + + F Q +A G S D CF +A + +
Sbjct: 331 GAIVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVV 390
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIY 467
P + +G +D+ Y + +CL LA D IGN+ Q++ RV+Y
Sbjct: 391 PRMVFHLQG---ADLDLPRRNYVLDDQRKGNLCLLLADSG--DSGTTIGNFVQQDMRVLY 445
Query: 468 DTKNSQLGFAGEDC 481
D + L FA C
Sbjct: 446 DLEADTLSFAPAQC 459
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 177/371 (47%), Gaps = 33/371 (8%)
Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y +I+LG G+ +IVDTGS+LTW++C PCK C D ++D + S SYK V CN+S
Sbjct: 99 EYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNS 158
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDF 245
+ + G C+ S C + YGDGS++ G L + L G +V DF
Sbjct: 159 QLCS-NSSQGTYAYCARGS--QCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDF 215
Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGC + + L G SG++GL ++L Q + FG FS+C P S ++
Sbjct: 216 AFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVI 363
GN+ + T + + FY + L G+SI +L +G ++I DSG+
Sbjct: 276 GNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVL--LPRGSVVILDSGSSF 333
Query: 364 TRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLS--AYQEVN--IPLVKME 414
+ +S L+ FLK PS F L TCF +S E++ +P + +
Sbjct: 334 SSFVRPFHSQLREAFLKHRP--PSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLV 391
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYDTK 470
FE + + G++ V + V + A +ED +IGNYQQ+N V YD +
Sbjct: 392 FEDGVTIGIPSIGVLLPVARYQNHVKMCFA---FEDGGPNPVNVIGNYQQQNLWVEYDIQ 448
Query: 471 NSQLGFAGEDC 481
S++GFA C
Sbjct: 449 RSRVGFARASC 459
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 167/368 (45%), Gaps = 54/368 (14%)
Query: 141 LGGRNM-----------TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
+GG NM +V+ DTGSDL W QC PC C+ Q P F P+ S ++ K+ C
Sbjct: 83 VGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCT 142
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
SS C +F + C+++ C Y YG G YT G L E L +G AS FGC
Sbjct: 143 SSFC---QFLPNSIRTCNATG---CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGC 195
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
N GLG+ DL + G FSYCL S AGAS IL G+ +
Sbjct: 196 STEN-----------GLGQLDLGV---------GRFSYCLRSGSAAGASP--ILFGSLAN 233
Query: 310 FKNSTPITYTNMIPNPQL-ATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSG 360
+ + T + NP + ++Y +NLTGI++G L + GF + GG ++DSG
Sbjct: 234 LTDGN-VQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSG 292
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGN 418
T +T L Y +K FL Q + + G LD CF + +P + + F+G
Sbjct: 293 TTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGG 352
Query: 419 AEMTVDV--TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
AE V G+ + + CL + + +IGN Q + ++YD F
Sbjct: 353 AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSF 412
Query: 477 AGEDCSSM 484
A DC+ +
Sbjct: 413 APADCAKV 420
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 128/218 (58%), Gaps = 14/218 (6%)
Query: 92 LILDNLHVQYLQSRIKN---------MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG 142
L D+ V+ L SR+ + +I+ + +PL G + + NY + G
Sbjct: 66 LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFG 125
Query: 143 --GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
R ++IVDTGS L+W+QC+PC C+ Q DP+FDPS S +YK + C SS C +L A
Sbjct: 126 SPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDA 185
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFG 258
T N+ +C +SS C Y SYGD SY+ G L ++ L L + ++ F++GCG+++ GLFG
Sbjct: 186 TLNNPLCETSSN-VCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
+G++GLGR+ LS++ Q S FG FSYCLP+ G
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGG 282
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 156/352 (44%), Gaps = 39/352 (11%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+DTGSDL W QC PC C +Q P FD S +Y+ + C SS C +L +S C
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASL-----SSPSCFKK 55
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVSGLM 264
C Y YGD + T G L E G K + FGCG N G SG++
Sbjct: 56 M---CVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMV 112
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST----PITYTN 320
G GR LSLVSQ FSYCL S A S L G +++ +T P+ T
Sbjct: 113 GFGRGPLSLVSQLGP---SRFSYCLTSYLSATPS-RLYFGVYANLSSTNTSSGSPVQSTP 168
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLPPSIYSA 373
+ NP L Y L+L IS+G K L GG++IDSGT IT L Y A
Sbjct: 169 FVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA 228
Query: 374 LKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
++ + P+ I LDTCF V + + + F ++ + +
Sbjct: 229 VRRGLVSAIP-LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLI 287
Query: 433 KSDASQVCLALASLSYEDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S +CL +A TG IIGNYQQ+N ++YD NS L F C
Sbjct: 288 ASTTGYLCLVMA------PTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 169/356 (47%), Gaps = 39/356 (10%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCY--NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ ++DTGSDL W++C C C + + +F S SYKK+ CNS+ C + A G
Sbjct: 18 IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNSTHCSGMSSA-GIG 76
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-----GLG---KASVNDFIFGCGRNNKG 255
C + C Y YGDGS T G++G + + G G ++ + F+FGC R KG
Sbjct: 77 PRCEET----CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFGCARKLKG 132
Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN--- 312
+ GL+GLG+ SL+ Q + G FSYCL S ++ S + G+S+ +
Sbjct: 133 DWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSSAALRGHDV 192
Query: 313 -STPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASG-------FAKGGILIDSG 360
STPI + + + T Y ++L I+IGG + + SG F +IDSG
Sbjct: 193 VSTPILHGDHLDQ----TLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFLANKTVIDSG 248
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T T L P +Y A++ +Q P+ + LD CFN S P V F +
Sbjct: 249 TTYTLLTPPVYEAMRKSIEEQVI-LPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQ 307
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
+ + I D VCL++ S + IIGN QQ+N ++YD SQ+ F
Sbjct: 308 LVLPFENIFQVTSRDV--VCLSMDSSG--GDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 121/377 (32%), Positives = 179/377 (47%), Gaps = 48/377 (12%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTW--VQCQP--CKSCYNQQDPVFD 176
PL SG+ T Y A + +G T +++DTGSD+ W V+ P ++
Sbjct: 110 PLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAA 169
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
P+ +P + C + C L+ A + S C Y V+YGDGS T G+ E L
Sbjct: 170 PAPTPRWN---CVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLT 220
Query: 237 LGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
+ A V GCG +N+GLF SGL+GLGR LS SQ + FG FSYCL +
Sbjct: 221 FARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSS 280
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-------- 347
+ G + P++ATFY ++L G S+GG +++
Sbjct: 281 RRARPSRRWGGT-----------------PRMATFYYVHLLGFSVGGARVKGVSQSDLRL 323
Query: 348 -SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQE 405
+GG+++DSGT +TRL +Y A++ F G +P GFS+ DTC+NLS +
Sbjct: 324 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 383
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQR 464
V +P V M G A + + Y + D S C A+A + IIGN QQ+ R
Sbjct: 384 VKVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFR 439
Query: 465 VIYDTKNSQLGFAGEDC 481
V++D ++GF + C
Sbjct: 440 VVFDGDAQRVGFVPKSC 456
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 149/319 (46%), Gaps = 35/319 (10%)
Query: 117 VSNTEIPLTSGIR---------LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK 165
+S+ E P+ + +R + T Y+ + +G R + + +DTGSDL W QC PC+
Sbjct: 59 LSSHERPVRARVRAGLVAAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCR 118
Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
C++Q P+ DP+ S +Y + C + C AL F + C S C Y YGD S
Sbjct: 119 DCFDQGIPLLDPAASSTYAALPCGAPRCRALPFTS-----CGGRS---CVYVYHYGDKSV 170
Query: 226 TRGELGREHLGLGK----------ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLV 274
T G++ + G + FGCG NKG+F +G+ G GR SL
Sbjct: 171 TVGKIATDRFTFGDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLP 230
Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNSTPITYTNMIPNPQLATFYI 332
SQ + FSYC S D+ +S + G ++++ +S + T + NP + Y
Sbjct: 231 SQLNATS---FSYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYF 287
Query: 333 LNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
L+L GIS+G +L +IDSG IT LP +Y A+KAEF Q PS S
Sbjct: 288 LSLKGISVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGS 347
Query: 393 ILDTCFNLSAYQEVNIPLV 411
LD CF L P V
Sbjct: 348 ALDVCFALPVSALWRRPAV 366
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 140/443 (31%), Positives = 216/443 (48%), Gaps = 49/443 (11%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
++EL H++ I +N Q + D L+ +L+S ++ + +S T+ L S
Sbjct: 26 FSVELIHRDSPLSPI--YNPQIT---VTDRLNAAFLRSVSRSRRFNH--QLSQTD--LQS 76
Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G+ + +I +G + V I DTGSDLTWVQC+PC+ CY + P+FD S +YK
Sbjct: 77 GLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYK 136
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
C+S C AL ++ G S++ C Y SYGD S+++G++ E + + AS +
Sbjct: 137 SEPCDSRNCQAL--SSTERGCDESNNI--CKYRYSYGDQSFSKGDVATETVSIDSASGSP 192
Query: 245 F-----IFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
+FGCG NN G F SG++GLG LSL+SQ FSYCL S + A +
Sbjct: 193 VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL-SHKSATTN 251
Query: 299 GSLI--LGGNS--SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA--- 351
G+ + LG NS S + + T ++ L T+Y L L IS+G K++ +G +
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAISVGKKKIPYTGSSYNP 310
Query: 352 ---------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNL 400
G I+IDSGT +T L + + + +G S P +L CF
Sbjct: 311 NDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQ-GLLSHCFK- 368
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
S E+ +P + + F G V ++ I FVK VCL++ + E I GN+ Q
Sbjct: 369 SGSAEIGLPEITVHFTG---ADVRLSPINAFVKLSEDMVCLSMVPTT---EVAIYGNFAQ 422
Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
+ V YD + + F DCS+
Sbjct: 423 MDFLVGYDLETRTVSFQHMDCSA 445
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 118/379 (31%), Positives = 175/379 (46%), Gaps = 33/379 (8%)
Query: 128 IRLQTLNYIATIELGGR--NMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYK 184
+ T Y+ +G ++ ++DTGSDL W QC PC+ C+ Q P++ P+ S +Y
Sbjct: 93 VHASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYA 152
Query: 185 KVLCNSSTCHALE-----FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
V C S C AL S + C Y+ SYGDGS T G L E G
Sbjct: 153 NVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGA 212
Query: 240 AS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
+ V+D FGCG +N G SGL+G+GR LSLVSQ FSYC D S
Sbjct: 213 GTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVT---KFSYCFTPFNDTTTS 269
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASG 349
L LG ++S+ + + P+ +++Y L+L GI++G +L ASG
Sbjct: 270 SPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASG 329
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQE 405
+GG++IDSGT T L + L + + P A G + L CF +
Sbjct: 330 --RGGLIIDSGTTFTALEERAFVVLARAVAARVA-LPLASGAHLGLSVCFAAPQGRGPEA 386
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
V++P + + F+G A+M + + V + A CL + S ++G+ QQ+N V
Sbjct: 387 VDVPRLVLHFDG-ADMELPRSSAVVEDRV-AGVACLGIVS---ARGMSVLGSMQQQNMHV 441
Query: 466 IYDTKNSQLGFAGEDCSSM 484
YD L F +C +
Sbjct: 442 RYDVGRDVLSFEPANCGEL 460
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 130/419 (31%), Positives = 205/419 (48%), Gaps = 52/419 (12%)
Query: 82 VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYIATIE 140
+ W + L D +QYL S +++G + +P+ SG + LQ+ YI +
Sbjct: 55 LSWEARVLQTLAQDQARLQYLSS----LVAGR------SVVPIASGRQMLQSTTYIVKVL 104
Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
+G + + + +DT SD+ W+ C C C + F P+ S S+K V C++ C +
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCSAPQCKQVP- 161
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
+ C + + C++ ++YG S L ++ + L + F FGC NK G
Sbjct: 162 ----NPACGARA---CSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCV--NKVAGG 211
Query: 259 GV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
G GL+GLGR LSL+SQ ++ FSYCLPS + SGSL LG S +
Sbjct: 212 GTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQR--- 268
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLP 367
+ YT ++ NP+ ++ Y +NL I +G K L + A G + DSGTV TRL
Sbjct: 269 -VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLA 327
Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
+Y A++ EF K+ P+A S+ DTC++ +V +P + F+G MT+
Sbjct: 328 KPVYEAVRNEFRKRVKP-PTAVVTSLGGFDTCYS----GQVKVPTITFMFKG-VNMTMPA 381
Query: 426 TGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++ + S CLA+AS + +I + QQ+N RV+ D N +LG A E CS
Sbjct: 382 DNLMLH-STAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 176/397 (44%), Gaps = 43/397 (10%)
Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ---------PCKSCY 168
E P+ SG L Y+ ++ G + + +I DTGSDL W+QC P K+C
Sbjct: 39 AESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKAC- 97
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
+ P F S S + V C+++ C + G+ CS ++P C Y Y DGS T G
Sbjct: 98 -SRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTG 156
Query: 229 ELGREHLGL-----GKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
L R+ + G A+V FGCG RN G F G G++GLG+ LS +Q+ +F
Sbjct: 157 FLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFA 216
Query: 283 GLFSYCLPSTQDA--GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
FSYCL + G S S + G + YT ++ NP TFY + + I +
Sbjct: 217 QTFSYCLLDLEGGRRGRSSSFLFLGRP---ERRAAFAYTPLVSNPLAPTFYYVGVVAIRV 273
Query: 341 GGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--- 390
G + L G GG +IDSG+ +T L Y L + F P P
Sbjct: 274 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH-LPRIPSSAT 332
Query: 391 -FSILDTCFNLSAYQEV-----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
F L+ C+N+S+ + P + ++F + + + V D CLA+
Sbjct: 333 FFQGLELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVK--CLAIR 390
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
++GN Q+ V +D ++++GFA +C
Sbjct: 391 PTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 427
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 177/358 (49%), Gaps = 34/358 (9%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+I+DTGSDL W QC+PC C+++ DPS S ++ + C+S C L +++
Sbjct: 430 LILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWG 489
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDFIFGCGRNNKGLF-GGV 260
+ + C Y +Y DGS T G L E G G+A+V D FGCG N G+F
Sbjct: 490 NQT---CVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNE 546
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITYT 319
+G+ G GR LSL SQ FS+C + + S S++LG ++++ ++ + T
Sbjct: 547 TGIAGFGRGALSLPSQLKV---DNFSHCFTAITGSEPS-SVLLGLPANLYSDADGAVQST 602
Query: 320 NMIPNPQLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYS 372
++ N Y L+L GI++G +L S FA GG +IDSGT +T LP Y
Sbjct: 603 PLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYK 662
Query: 373 ALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEV--NIPLVKMEFEGNAEMTVDVTGIV 429
+ F Q +A S+ CF+ S + ++P + + FEG T+D+
Sbjct: 663 LVHDAFTAQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEG---ATLDLPREN 719
Query: 430 Y---FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y F + S CLA+ + D+ IIGNYQQ+N V+YD + L F C+ +
Sbjct: 720 YMFEFEDAGGSVTCLAINA---GDDLTIIGNYQQQNLHVLYDLVRNMLSFVPAQCNRL 774
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/367 (30%), Positives = 176/367 (47%), Gaps = 45/367 (12%)
Query: 130 LQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
T Y+ +++G + ++DTGS+ W QC PC CYNQ P+FDPS S ++K++
Sbjct: 60 FDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIR 119
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-- 245
C++ H+ C Y + YG SYT+G L E + + S F
Sbjct: 120 CDTHD-HS------------------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVM 160
Query: 246 ---IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
I GCGRNN G G +G++GL R SL++Q + GL SYC AG S I
Sbjct: 161 PETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCF-----AGKGTSKI 215
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILID 358
G +++ ++ T + + FY LNL +S+G +++ G KG I+ID
Sbjct: 216 NFGANAIVAGDGVVSTTVFVKTAKPG-FYYLNLDAVSVGNTRIETVGTPFHALKGNIVID 274
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI-PLVKMEFEG 417
SG+ +T P S Y L + ++Q P IL C+ + ++I P++ M F G
Sbjct: 275 SGSTLTYFPES-YCNLVRKAVEQVVTAVRFPRSDIL--CY---YSKTIDIFPVITMHFSG 328
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A++ +D + Y + CLA+ S +E I GN Q N V YD+ + + F
Sbjct: 329 GADLVLDKYNM-YVASNTGGVFCLAIICNSPIEE-AIFGNRAQNNFLVGYDSSSLLVSFK 386
Query: 478 GEDCSSM 484
+CS++
Sbjct: 387 PTNCSAL 393
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/432 (29%), Positives = 190/432 (43%), Gaps = 84/432 (19%)
Query: 94 LDNLHVQYLQSRIKNMIS----GNIKDVSNTEIPLTSGIRLQTLNYIATIELG------- 142
+ LH + L+ +N +S N K+V T P+ S + Q +AT+E G
Sbjct: 111 IQTLHKRVLEKNNQNTVSQKQKKNDKEVVTT-TPVASSVEEQAGQLVATLESGMTLGSGE 169
Query: 143 ----------GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
++ ++I+DTGSDL W+QC PC C+ Q D N S
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND----------------NQS- 212
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---------VN 243
C Y+ YGD S T G+ E + + V
Sbjct: 213 ---------------------CPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVE 251
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLI 302
+ +FGCG N+GLF G +GL+GLGR LS SQ ++G FSYCL D S LI
Sbjct: 252 NMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI 311
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQL-------QASGFAKG 353
G + + + + +T+ + + + TFY + + I + G+ L S G
Sbjct: 312 FGEDKDLLSHPN-LNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAG 370
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVK 412
G +IDSGT ++ Y +K + ++ G +P F ILD CFN+S V +P +
Sbjct: 371 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELG 430
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ F A ++ D VCLA+ + IIGNYQQ+N ++YDTK S
Sbjct: 431 IAFADGAVWNFPTENSFIWLNEDL--VCLAMLGTP-KSAFSIIGNYQQQNFHILYDTKRS 487
Query: 473 QLGFAGEDCSSM 484
+LG+A C+ +
Sbjct: 488 RLGYAPTKCADI 499
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/367 (30%), Positives = 176/367 (47%), Gaps = 45/367 (12%)
Query: 130 LQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
T Y+ +++G + ++DTGS+ W QC PC CYNQ P+FDPS S ++K++
Sbjct: 54 FDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIR 113
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-- 245
C++ H+ C Y + YG SYT+G L E + + S F
Sbjct: 114 CDTHD-HS------------------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVM 154
Query: 246 ---IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
I GCGRNN G G +G++GL R SL++Q + GL SYC AG S I
Sbjct: 155 PETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCF-----AGKGTSKI 209
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILID 358
G +++ ++ T + + FY LNL +S+G +++ G KG I+ID
Sbjct: 210 NFGANAIVAGDGVVSTTVFVKTAKPG-FYYLNLDAVSVGNTRIETVGTPFHALKGNIVID 268
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI-PLVKMEFEG 417
SG+ +T P S Y L + ++Q P IL C+ + ++I P++ M F G
Sbjct: 269 SGSTLTYFPES-YCNLVRKAVEQVVTAVRFPRSDIL--CY---YSKTIDIFPVITMHFSG 322
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A++ +D + Y + CLA+ S +E I GN Q N V YD+ + + F
Sbjct: 323 GADLVLDKYNM-YVASNTGGVFCLAIICNSPIEE-AIFGNRAQNNFLVGYDSSSLLVSFK 380
Query: 478 GEDCSSM 484
+CS++
Sbjct: 381 PTNCSAL 387
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 135/483 (27%), Positives = 202/483 (41%), Gaps = 77/483 (15%)
Query: 28 AHCFEGKKKLHL-HKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNE 86
A F G +LHL H +Q S + Q+S+ A+++ GK E
Sbjct: 27 ADAFAGDVRLHLTHVDAGKQMSRRELIRRAMQRSKARAAALSVARSGSGRVPGKSAQQGE 86
Query: 87 QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
Q Q Q + SG+ L Y+ + +G +
Sbjct: 87 QHQ-------------QPGVPVRPSGD-------------------LEYLIDLAIGTPPQ 114
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
++ ++DTGSDL W QC PC SC Q DP+F P+ S SY + C+ C+ +
Sbjct: 115 PVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHH----- 169
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI----FGCGRNNKGLFGGV 260
S P C Y +YGDG+ T G E +S FGCG N G
Sbjct: 170 --SCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMNVGSLNNG 227
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS-VFKNSTPIT-- 317
SG++G GR LSLVSQ S FSYCL + + +L+ G S VF+ T
Sbjct: 228 SGIVGFGRDPLSLVSQLSI---RRFSYCL-TPYTSTRKSTLMFGSLSDGVFEGDDAATGQ 283
Query: 318 --YTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA-----KGGILIDSGTVITRLPP 368
T ++ + Q TFY + TG+++G ++L+ S FA GG+++DSGT +T P
Sbjct: 284 VQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFPA 343
Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQE---------VNIPLVKMEFEGN 418
++ + + F Q P S D CF V++P + F+G
Sbjct: 344 AVLTEVLRAFRAQLR-LPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAFHFQG- 401
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
A++ + +V D + L + D IGN+ Q++ RV+YD + L FA
Sbjct: 402 ADLELPRRN---YVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLEAETLSFAP 458
Query: 479 EDC 481
C
Sbjct: 459 AQC 461
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 124/405 (30%), Positives = 183/405 (45%), Gaps = 46/405 (11%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
Q + + +++G DVS + + T YIA+ +G + ++DTGSDL W Q
Sbjct: 61 QQQQQRLMAGAEDDVS-------AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQ 113
Query: 161 CQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
C KSC Q P ++ S S ++ V C F N GV C +
Sbjct: 114 CATTCLPKSCAKQGLPYYNLSQSSTFVPVPCADKA----GFCAAN-GVHLCGLDGSCTFI 168
Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLV 274
SYG G G LG E ++ FGC R G SGL+GLGR LSLV
Sbjct: 169 ASYGAGRVI-GSLGTESFAF-ESGTTSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLV 226
Query: 275 SQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
SQ I FSYCL P +GAS L +G ++S+ + + + +TFY L
Sbjct: 227 SQ---IGATRFSYCLTPYFHSSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYL 283
Query: 334 NLTGISIGGKQLQA------------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
L GI++G +L A G+ GG++ID+G+ +T+L Y ALK E Q
Sbjct: 284 PLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQ 343
Query: 382 F--SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
AP S L+ C +Q+V +P + F G A+M V Y+ D +
Sbjct: 344 LGNGSLVPAPEDSGLELCVAREGFQKV-VPALVFHFGGGADMAVPAAS--YWAPVDKAAA 400
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
C+ + Y+ IIGN+QQ++ ++YD + + F DC+ +
Sbjct: 401 CMMILEGGYDS---IIGNFQQQDMHLLYDLRRGRFSFQTADCTML 442
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 175/397 (44%), Gaps = 43/397 (10%)
Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ---------PCKSCY 168
E P+ SG L Y+ ++ G + + +I DTGSDL W+QC P K+C
Sbjct: 38 AESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKAC- 96
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
+ P F S S + V C+++ C + G+ CS ++P C Y Y DGS T G
Sbjct: 97 -SRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTG 155
Query: 229 ELGREHLGL-----GKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
L R+ + G A+V FGCG RN G F G G++GLG+ LS +Q+ +F
Sbjct: 156 FLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFA 215
Query: 283 GLFSYCLPSTQDA--GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
FSYCL + G S S + G + YT ++ NP TFY + + I +
Sbjct: 216 QTFSYCLLDLEGGRRGRSSSFLFLGRP---ERRAAFAYTPLVSNPLAPTFYYVGVVAIRV 272
Query: 341 GGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--- 390
G + L G GG +IDSG+ +T L Y L + F P P
Sbjct: 273 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH-LPRIPSSAT 331
Query: 391 -FSILDTCFNL-----SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
F L+ C+N+ SA P + ++F + + + V D CLA+
Sbjct: 332 FFQGLELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVK--CLAIR 389
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
++GN Q+ V +D ++++GFA +C
Sbjct: 390 PTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 426
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 126/411 (30%), Positives = 187/411 (45%), Gaps = 41/411 (9%)
Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWV 159
L +R + +S K V + P+ SG + Y + +G +++ +I DTGSDL WV
Sbjct: 50 LDTRRLHFLSLRRKPVPFVKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWV 109
Query: 160 QCQPCKSC-YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS-PPDCNYF 217
+C C++C ++ VF P S ++ C C + G + C+ + C Y
Sbjct: 110 KCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVP-KPGRAPRCNHTRIHSTCPYE 168
Query: 218 VSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL------FGGVSGLMGL 266
Y DGS T G RE L GK A + FGCG G F G +G+MGL
Sbjct: 169 YGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGL 228
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPS-TQDAGASGSLILG-GNSSVFKNSTPITYTNMIPN 324
GR +S SQ FG FSYCL T + LI+G G +V K + +T ++ N
Sbjct: 229 GRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSK----LFFTPLLTN 284
Query: 325 PQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
P TFY + L + + G +L + GG ++DSGT + L Y + A
Sbjct: 285 PLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAA 344
Query: 378 FLKQFSGFPSA----PGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
+KQ P+A PGF D C N+S E +P +K EF G A YF
Sbjct: 345 -VKQRIKLPNADELTPGF---DLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRN--YF 398
Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++++ CLA+ S+ + +IGN Q+ +D S+LGF+ C+
Sbjct: 399 IETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 131/424 (30%), Positives = 202/424 (47%), Gaps = 54/424 (12%)
Query: 78 SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYI 136
S + W + L D +QYL S +++G + +P+ SG + LQ+ YI
Sbjct: 51 SSSPLSWEARVLQTLAQDQARLQYLSS----LVAGR------SVVPIASGRQMLQSTTYI 100
Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
+G + + + +DT SD+ W+ C C C + F P+ S S+K V C++ C
Sbjct: 101 VKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCSAPQCK 158
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK 254
+ T + CS + ++YG S L ++ + L + F FGC NK
Sbjct: 159 QVPNPTCGARACS--------FNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCV--NK 207
Query: 255 GLFGGV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
GG GL+GLGR LSL+SQ I+ FSYCLPS + SGSL LG S
Sbjct: 208 VAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQ 267
Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVI 363
+ + YT ++ NP+ ++ Y +NL I +G K L + A G + DSGTV
Sbjct: 268 R----VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVY 323
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSIL---DTCFNLSAYQEVNIPLVKMEFEGNAE 420
TRL +Y A++ EF K+ P+ + L DTC++ +V +P + F+G
Sbjct: 324 TRLAKPVYEAVRNEFRKRVK--PTTAVVTSLGGFDTCYS----GQVKVPTITFMFKG-VN 376
Query: 421 MTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
MT+ ++ + S CLA+A+ + +I + QQ+N RV+ D N +LG A
Sbjct: 377 MTMPADNLMLH-STAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLAR 435
Query: 479 EDCS 482
E CS
Sbjct: 436 ERCS 439
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 131/424 (30%), Positives = 202/424 (47%), Gaps = 54/424 (12%)
Query: 78 SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYI 136
S + W + L D +QYL S +++G + +P+ SG + LQ+ YI
Sbjct: 67 SSSPLSWEARVLQTLAQDQARLQYLSS----LVAGR------SVVPIASGRQMLQSTTYI 116
Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
+G + + + +DT SD+ W+ C C C + F P+ S S+K V C++ C
Sbjct: 117 VKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCSAPQCK 174
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK 254
+ T + CS + ++YG S L ++ + L + F FGC NK
Sbjct: 175 QVPNPTCGARACS--------FNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCV--NK 223
Query: 255 GLFGGV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
GG GL+GLGR LSL+SQ I+ FSYCLPS + SGSL LG S
Sbjct: 224 VAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQ 283
Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVI 363
+ + YT ++ NP+ ++ Y +NL I +G K L + A G + DSGTV
Sbjct: 284 R----VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVY 339
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSIL---DTCFNLSAYQEVNIPLVKMEFEGNAE 420
TRL +Y A++ EF K+ P+ + L DTC++ +V +P + F+G
Sbjct: 340 TRLAKPVYEAVRNEFRKRVK--PTTAVVTSLGGFDTCYS----GQVKVPTITFMFKG-VN 392
Query: 421 MTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
MT+ ++ + S CLA+A+ + +I + QQ+N RV+ D N +LG A
Sbjct: 393 MTMPADNLMLH-STAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLAR 451
Query: 479 EDCS 482
E CS
Sbjct: 452 ERCS 455
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/373 (30%), Positives = 172/373 (46%), Gaps = 43/373 (11%)
Query: 134 NYIATIELGG-RNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y+ + +G R+ V++ DTGSD+ W QC+PC C+ Q P FD + S + + V C+
Sbjct: 91 EYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSD 150
Query: 191 STCHALE----FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKA 240
C+A F G C Y YGDGS + G R+ G GK
Sbjct: 151 PLCNAHSEHGCFLHG------------CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKV 198
Query: 241 SVNDFIFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+V D FGCG N G F +G+ G GR LSL SQ FSYC +T+ S
Sbjct: 199 TVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKV---RQFSYCF-TTRFEAKSS 254
Query: 300 SLILGGNSSVFKNST-PITYTNMI---PNPQLATFYILNLTGISIGGKQL---QASGFAK 352
+ LGG + ++T PI T + P + Y+L+ G+++G +L +
Sbjct: 255 PVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGS 314
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
G IDSGT IT P +++ LK+ F+ Q + P D CF+ + +P +
Sbjct: 315 GATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADEDDICFSWDGKKTAAMPKLV 373
Query: 413 MEFEGNAEMTVDVTGIVYFVKS-DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
EG D+ Y + ++ QVC+A+++ D T +IGN+QQ+N ++YD
Sbjct: 374 FHLEG---ADWDLPRENYVTEDRESGQVCVAVSTSGQMDRT-LIGNFQQQNTHIVYDLAA 429
Query: 472 SQLGFAGEDCSSM 484
+L C +
Sbjct: 430 GKLLLVPAQCDKL 442
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 136/438 (31%), Positives = 212/438 (48%), Gaps = 57/438 (13%)
Query: 68 TLELKHK-NYCSG----KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
TLE+ H + CS K + W E D +Q+L S M++G + +
Sbjct: 35 TLEVFHVFSPCSPFRPPKPLSWAESVLQLQAKDQARLQFLAS----MVAGR------SVV 84
Query: 123 PLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
P+ SG ++ Q+ YI ++G T+++ DT +D W+ C C C + +F P
Sbjct: 85 PIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTST---LFAPEK 141
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S ++K V C S C+ + + C +S+ C + ++YG S + ++ + L
Sbjct: 142 STTFKNVSCGSPQCNQVP-----NPSCGTSA---CTFNLTYGSSSIA-ANVVQDTVTLAT 192
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+ D+ FGC G GL+GLGR LSL+SQT ++ FSYCLPS + SG
Sbjct: 193 DPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSG 252
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGF--AKG 353
SL LG + + I YT ++ NP+ ++ Y +NL I +G K + +A F A G
Sbjct: 253 SLRLGPVAQPIR----IKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATG 308
Query: 354 -GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-----LDTCFNLSAYQEVN 407
G + DSGTV TRL Y+A++ EF ++ + A ++ DTC+ + +
Sbjct: 309 AGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKA-NLTVTSLGGFDTCYTV----PIV 363
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQR 464
P + F G M V + + S A S CLA+AS + +I N QQ+N R
Sbjct: 364 APTITFMFSG---MNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 420
Query: 465 VIYDTKNSQLGFAGEDCS 482
V+YD NS+LG A E C+
Sbjct: 421 VLYDVPNSRLGVARELCT 438
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 123/384 (32%), Positives = 171/384 (44%), Gaps = 42/384 (10%)
Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SG+ + Y I +G +++DTGSD+ W+QC PC+ CY+Q +FDP S
Sbjct: 135 PVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
SY V C + C L+ SG C C Y V+YGDGS T G+ E L
Sbjct: 195 HSYGAVDCAAPLCRRLD-----SGGCDLRR-KACLYQVAYGDGSVTAGDFATETLTFASG 248
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-------- 291
A V GCG +N+GLF +GL+GLGR LS SQ S FG FSYCL
Sbjct: 249 ARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASA 308
Query: 292 -------TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
T +GA G+L G + + +++ +
Sbjct: 309 TSRSSTVTFGSGARGAL---GRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRP 365
Query: 345 LQASGFAKGGILIDSG------TVITRLPP-SIYSALKAEFLKQFSGFPSAPGFSILDTC 397
+GG+++DSG R PP + S A L+ G GFS+ DTC
Sbjct: 366 PPDPSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPG-----GFSLFDTC 420
Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
++LS + V +P V M F G AE + + V S + C A A + IIGN
Sbjct: 421 YDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGN 477
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
QQ+ RV++D +LGF + C
Sbjct: 478 IQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 151/340 (44%), Gaps = 45/340 (13%)
Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+DT DL W+QC PC CY QQ+ +FDP S + V C S+ C L
Sbjct: 166 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----------- 214
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG---CGRNNKGLFGGVSGLM 264
G Y R L + L + C SG M
Sbjct: 215 ---------------GRYGRWLLQQPVPVLRRLRRRQGQPRGRTCHAVRGNFSASTSGTM 259
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
LG SL+SQT+ FG FSYC+P D +SG L LGG + T ++ N
Sbjct: 260 SLGGGRQSLLSQTAATFGNAFSYCVP---DPSSSGFLSLGGPADGGGAGR-FARTPLVRN 315
Query: 325 PQ-LATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
P + T Y++ L GI +GG++L GG ++DS +IT+LPP+ Y AL+ F
Sbjct: 316 PSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAM 375
Query: 383 SGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
+ +P A G + LDTC++ + V +P V + F+G A + +D G++ + CL
Sbjct: 376 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCL 428
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A + G IGN QQ+ V+YD +GF C
Sbjct: 429 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 206/455 (45%), Gaps = 57/455 (12%)
Query: 60 SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
+R + A+ L H D R +L + + ++R ++SG
Sbjct: 47 ARCDAAALRLHATH--------ADAGRGLSTRELLRRMAARS-KARSARLLSGRAASARM 97
Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
T G+ Y+ + +G + + +I+DTGSDLTW QC PC SC+ Q P F+P
Sbjct: 98 DPGSYTDGV--PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 155
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHL 235
S S ++ + C+ C L +++ C S + C Y +Y D S T G L +
Sbjct: 156 SRSMTFSVLPCDLRICRDLTWSS-----CGEQSWGNGICVYAYAYADHSITTGHLDSDTF 210
Query: 236 -------GLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
+G ASV D FGCG N G+F +G+ G R LS+ +Q FSY
Sbjct: 211 SFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV---DNFSY 267
Query: 288 CL-------PSTQDAGASGSL---ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
C PS G +L GG V +++ I Y + QL +YI +L G
Sbjct: 268 CFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYH----SSQLKAYYI-SLKG 322
Query: 338 ISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
+++G +L S FA GG ++DSGT +T LP ++Y+ + F+ Q
Sbjct: 323 VTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNST 382
Query: 391 FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLALASLSYE 449
S+ CF++ + ++P + + FEG T+D+ Y F +A + L +++
Sbjct: 383 SSLSQLCFSVPPGAKPDVPALVLHFEG---ATLDLPRENYMFEIEEAGGIRLTCLAINAG 439
Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ +IGN+QQ+N V+YD N L F C+ +
Sbjct: 440 EDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 206/455 (45%), Gaps = 57/455 (12%)
Query: 60 SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
+R + A+ L H D R +L + + ++R ++SG
Sbjct: 21 ARCDAAALRLHATH--------ADAGRGLSTRELLRRMAARS-KARSARLLSGRAASARM 71
Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
T G+ Y+ + +G + + +I+DTGSDLTW QC PC SC+ Q P F+P
Sbjct: 72 DPGSYTDGV--PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 129
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHL 235
S S ++ + C+ C L +++ C S + C Y +Y D S T G L +
Sbjct: 130 SRSMTFSVLPCDLRICRDLTWSS-----CGEQSWGNGICVYAYAYADHSITTGHLDSDTF 184
Query: 236 -------GLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
+G ASV D FGCG N G+F +G+ G R LS+ +Q FSY
Sbjct: 185 SFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV---DNFSY 241
Query: 288 CL-------PSTQDAGASGSL---ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
C PS G +L GG V +++ I Y + QL +YI +L G
Sbjct: 242 CFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSS----QLKAYYI-SLKG 296
Query: 338 ISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
+++G +L S FA GG ++DSGT +T LP ++Y+ + F+ Q
Sbjct: 297 VTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNST 356
Query: 391 FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLALASLSYE 449
S+ CF++ + ++P + + FEG T+D+ Y F +A + L +++
Sbjct: 357 SSLSQLCFSVPPGAKPDVPALVLHFEG---ATLDLPRENYMFEIEEAGGIRLTCLAINAG 413
Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ +IGN+QQ+N V+YD N L F C+ +
Sbjct: 414 EDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 448
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 124/445 (27%), Positives = 217/445 (48%), Gaps = 40/445 (8%)
Query: 54 CVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
C S S+ ++EL H++ Q + + ++D +H R N ++ +
Sbjct: 15 CFSISFSQAVSNGFSIELIHRDSSKSPFYKPT-QNKYQHVVDAVH------RSINRVNHS 67
Query: 114 IKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
K+ S P ++ I + +YI + +G + IVDTGSD+ W+QC+PC+ CYNQ
Sbjct: 68 NKN-SLASTPESTVISYEG-DYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQT 125
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
P F+PS S SYK + C+S C ++ + N +C Y ++YG+ S+++G+L
Sbjct: 126 TPKFNPSKSSSYKNISCSSKLCQSVRDTSCN-------DKKNCEYSINYGNQSHSQGDLS 178
Query: 232 REHLGLGK-----ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLF 285
E L L S + GCG NN G F SG++GLG SL++Q GG F
Sbjct: 179 LETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKF 238
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL----ATFYILNLTGISIG 341
SYCL + + ++ +G + F + ++ N++ P + + FY L + S+G
Sbjct: 239 SYCL--VRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVG 296
Query: 342 GKQLQASGFAK----GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
K+++ +G +K G I+IDS T++T +P +Y+ L + + + C
Sbjct: 297 DKRVEFAGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLC 356
Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
+N+S+ +E + P + F+G A++ + T FV+ +C A A + I G+
Sbjct: 357 YNVSSDEEYDFPYMTAHFKG-ADILLYATNT--FVEVARDVLCFAFAP---SNGGAIFGS 410
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
+ Q++ V YD + + F DC+
Sbjct: 411 FSQQDFMVGYDLQQKTVSFKSVDCT 435
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 120/354 (33%), Positives = 172/354 (48%), Gaps = 45/354 (12%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
++ G++L W P C+ Q P F+P ++ + L +S C + +F +
Sbjct: 12 LENGNELIWNHSNPSPECFEQAFPYFEPL---TFSRGLPFAS-CGSPKFWPNQT------ 61
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDFIFGCGRNNKGLF-GGVSGLMGL 266
C Y SYGD S T G L + ASV FGCG N G+F +G+ G
Sbjct: 62 ----CVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETGIAGF 117
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN------STP-ITYT 319
GR LSL SQ G FS+C +T +++L + +F N +TP I Y
Sbjct: 118 GRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYA 173
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSA 373
NP T Y L+L GI++G +L S FA GG +IDSGT IT LPP +Y
Sbjct: 174 KNEANP---TLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQV 230
Query: 374 LKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
++ EF Q P PG + TCF+ + + ++P + + FEG A M + V+ V
Sbjct: 231 VRDEFAAQIK-LPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEG-ATMDLPRENYVFEV 288
Query: 433 KSDA--SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
DA S +CLA+ + DET IIGN+QQ+N V+YD +N+ L F C +
Sbjct: 289 PDDAGNSIICLAI---NKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 339
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 206/455 (45%), Gaps = 57/455 (12%)
Query: 60 SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
+R + A+ L H D R +L + + ++R ++SG
Sbjct: 47 ARSDAAALRLHATH--------ADAGRGLSTRELLHRMAARS-KARSARLLSGRAASARV 97
Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
T G+ Y+ + +G + + +I+DTGSDLTW QC PC SC+ Q P F+P
Sbjct: 98 DPGSYTDGV--PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 155
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHL 235
S S ++ + C+ C L +++ C S + C Y +Y D S T G L +
Sbjct: 156 SRSMTFSVLPCDLRICRDLTWSS-----CGEQSWGNGICVYAYAYADHSITTGHLDSDTF 210
Query: 236 -------GLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
+G ASV D FGCG N G+F +G+ G R LS+ +Q FSY
Sbjct: 211 SFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV---DNFSY 267
Query: 288 CL-------PSTQDAGASGSL---ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
C PS G +L GG V +++ I Y + QL +YI +L G
Sbjct: 268 CFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYH----SSQLKAYYI-SLKG 322
Query: 338 ISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
+++G +L S FA GG ++DSGT +T LP ++Y+ + F+ Q
Sbjct: 323 VTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNST 382
Query: 391 FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLALASLSYE 449
S+ CF++ + ++P + + FEG T+D+ Y F +A + L +++
Sbjct: 383 SSLSQLCFSVPPGAKPDVPALVLHFEG---ATLDLPRENYMFEIEEAGGIRLTCLAINAG 439
Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ +IGN+QQ+N V+YD N L F C+ +
Sbjct: 440 EDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 126/421 (29%), Positives = 200/421 (47%), Gaps = 54/421 (12%)
Query: 78 SGKIVDWNEQQQNRLI-LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNY 135
+G + D + + +RL+ LD+L V P+ SG +L QT Y
Sbjct: 66 AGFLADQSSRDASRLLYLDSLAV-----------------AGRAYAPIASGRQLLQTPTY 108
Query: 136 IATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
+ LG + + + VDT +D W+ C C C F+P+ S SY+ V C S C
Sbjct: 109 VVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPAC 166
Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
+ + CS ++ C + ++Y D S L ++ L + V + FGC +
Sbjct: 167 -----SRAPNPSCSLNTK-SCGFSLTYADSSL-EAALSQDSLAVANDVVKSYTFGCLQKA 219
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
G GL+GLGR LS +SQT +++ G FSYCLPS + SG+L LG +
Sbjct: 220 TGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLR-- 277
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRL 366
I T ++ NP ++ Y +++TGI +G K + A G ++DSGT+ TRL
Sbjct: 278 --IKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRL 335
Query: 367 PPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
Y A++ E ++ G P S GF DTC+N + V P V F G ++T+
Sbjct: 336 VAPAYVAVRDEVRRRIRGAPLSSLGGF---DTCYNTT----VKWPPVTFMFTG-MQVTLP 387
Query: 425 VTGIVYFVKSDASQVCLALASLSYEDET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+V + + CLA+A+ T +I + QQ+N R+++D N ++GFA E C+
Sbjct: 388 ADNLVIH-STYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQCT 446
Query: 483 S 483
+
Sbjct: 447 A 447
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 131/444 (29%), Positives = 207/444 (46%), Gaps = 34/444 (7%)
Query: 55 VSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNI 114
+S S++ ++E+ H++ + E R+ ++ I N
Sbjct: 23 ISFSNSKVLNSGFSVEMIHRDSSRSPLYRHTETPFQRV------ANAMRRSINRANHFNK 76
Query: 115 KDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQD 172
K + S ++ Y+ + +G + +VDTGS +TW+QCQ C+ CY Q
Sbjct: 77 KSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTT 136
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
P+FDPS S +YK + C+S+ C ++ S SS C Y + YGDGS+++G+L
Sbjct: 137 PIFDPSKSKTYKTLPCSSNMCQSVI-----STPSCSSDKIGCKYTIKYGDGSHSQGDLSV 191
Query: 233 EHLGLGK---ASVN--DFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
E L LG +SV + + GCG NNKG F G SG++GLG +SL+SQ S GG FS
Sbjct: 192 ETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFS 251
Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
YCL S S + G+++V ++ T ++ FY L L S+G K+++
Sbjct: 252 YCLAPMFSQSNSSSKLNFGDAAVVSGLGAVS-TPLVSKTGSEVFYYLTLEAFSVGDKRIE 310
Query: 347 --------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF 398
S +G I+IDSGT +T LP YS L++ + + L C+
Sbjct: 311 FVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCY 370
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
+ ++++P++ F+G V++ I FV+ VC A S + I GN
Sbjct: 371 QTTPSGQLDVPVITAHFKG---ADVELNPISTFVQVAEGVVCFAFHS---SEVVSIFGNL 424
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
Q N V YD + F DC+
Sbjct: 425 AQLNLLVGYDLMEQTVSFKPTDCT 448
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 176/348 (50%), Gaps = 31/348 (8%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
IVDTGSD+ W+QC+PC+ CY Q P+FDPS S +YK + C+S+TC +L + CSS
Sbjct: 107 IVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLR-----NTACSS 161
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLF-GGVSG 262
+ C Y + YGDGS++ G+L E L LG + F GCG NN G F SG
Sbjct: 162 DNV--CEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEEGSG 219
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
++GLG +SL+SQ S GG FSYCL P ++ +S L G + V T T +
Sbjct: 220 IVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDP 279
Query: 322 IPNPQLATFYILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSAL 374
+ N Q+ FY L L S+G +++ SG G I+IDSGT +T LP Y L
Sbjct: 280 L-NGQV--FYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNL 336
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
++ + +L C+ ++ E+++P++ F+G V++ I FV
Sbjct: 337 ESAVSDVIKLERARDPSKLLSLCYKTTS-DELDLPVITAHFKG---ADVELNPISTFVPV 392
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ VC A S I GN Q+N V YD + F DC+
Sbjct: 393 EKGVVCFAFISSKI---GAIFGNLAQQNLLVGYDLVKKTVSFKPTDCT 437
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 118/349 (33%), Positives = 163/349 (46%), Gaps = 31/349 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL----EFATGNS 203
I DTGSDL WVQC PC+ C Q P+FDP S ++K V C+S C L G S
Sbjct: 107 AIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKS 166
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS----VNDFIFGCGRNNKGLFGG 259
G C Y YGD + G LG E + G + FGC +N
Sbjct: 167 G--------QCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDE 218
Query: 260 VS---GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GL+GLG LSL+SQ G FSYC P + S S + GN ++ K +
Sbjct: 219 SKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPL--SSNSTSKMRFGNDAIVKQIKGV 276
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSALK 375
T +I ++Y LNL G+SIG K+++ S G ILIDSGT T L S Y+
Sbjct: 277 VSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFV 336
Query: 376 AEFLKQFSGFPSA--PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
A +K+ G + P + + CF ++ P V F G A++ VD + + F
Sbjct: 337 A-LVKEVYGVEAVKIPPL-VYNFCFENKGKRK-RFPDVVFLFTG-AKVRVDASNL--FEA 390
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
D + +C+ S ED++ I GN+ Q +V YD + + FA DC+
Sbjct: 391 EDNNLLCMVALPTSDEDDS-IFGNHAQIGYQVEYDLQGGMVSFAPADCA 438
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 190/403 (47%), Gaps = 60/403 (14%)
Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQC 161
SR+ N+ + I D+ + P+ + ++A I +G + +++DTGSDLTW+QC
Sbjct: 62 SRLDNLWTTEIADIVSHVTPIPN-----PAAFLANISIGDPPVPQLLLIDTGSDLTWIQC 116
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE--FATGNSGVCSSSSPPDCNYFVS 219
PCK CY Q P F PS S +Y+ C S+ HA+ F +G +C Y +
Sbjct: 117 LPCK-CYPQTIPFFHPSRSSTYRNASCESAP-HAMPQIFRDEKTG--------NCRYHLR 166
Query: 220 YGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
Y D S TRG L +E L G S + +FGCG++N G F SG++GLG S+V
Sbjct: 167 YRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNSG-FTQYSGVLGLGPGTFSIV 225
Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
++ FG FSYC S D + LILG + + + TP+ Y L
Sbjct: 226 TRN---FGSKFSYCFGSLIDPTYPHNFLILGNGARIEGDPTPLQI--------FQDRYYL 274
Query: 334 NLTGISIGGKQLQASG------FAKGGILIDSGTVITRLPPSIYSALKAEF-------LK 380
+L IS+G K L +KGG +ID+G T L Y L E L+
Sbjct: 275 DLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLR 334
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQ 438
+ + + C+ + ++ P+V F G AE+ +DV + FV S++
Sbjct: 335 RVKDWE-----QYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVESL--FVSSESGDS 387
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CLA+ +++D + +IG Q+N V Y+ + ++ F DC
Sbjct: 388 FCLAMTMNTFDDMS-VIGAMAQQNYNVGYNLRTMKVYFQRTDC 429
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 180/370 (48%), Gaps = 40/370 (10%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC-HALEFATGN 202
+ + + +DTGSDL W QC C C++Q PVF S+S ++ +V C+ C HA+
Sbjct: 106 QRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSG 164
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-------ASVNDFIFGCGRNNKG 255
S C Y Y D S T G++ + A+V + FGCG N G
Sbjct: 165 CAARDRS----CFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYG 220
Query: 256 LF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
LF SG+ G G LSL SQ FSYC + +++ S ++ G ++ ++T
Sbjct: 221 LFTPNQSGIAGFGTGPLSLPSQLKV---RRFSYCFTAMEESRVSPVILGGEPENIEAHAT 277
Query: 315 -PITYTNMIPNPQLAT-----FYILNLTGISIGGKQL--QASGFA-----KGGILIDSGT 361
PI T P P A FY L+L G+++G +L AS FA GG IDSGT
Sbjct: 278 GPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGT 337
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT--CFNLSAYQEV-NIPLVKMEFEGN 418
IT P +++ +L+ F+ Q P A G++ D CF++ A ++ +P + + EG
Sbjct: 338 AITFFPQAVFRSLREAFVAQVP-LPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLEG- 395
Query: 419 AEMTVDVTGIVYFVKSDAS----QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
A+ + V D S ++C+ + S + T IIGN+QQ+N ++YD +++++
Sbjct: 396 ADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGT-IIGNFQQQNMHIVYDLESNKM 454
Query: 475 GFAGEDCSSM 484
FA C +
Sbjct: 455 VFAPARCDKL 464
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 184/409 (44%), Gaps = 37/409 (9%)
Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWV 159
L +R + +S K + + P+ SG + Y + +G +++ +I DTGSDL WV
Sbjct: 51 LDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWV 110
Query: 160 QCQPCKSC-YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS-PPDCNYF 217
+C C++C ++ VF P S ++ C C + + +C+ + C+Y
Sbjct: 111 KCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVP-KPDRAPICNHTRIHSTCHYE 169
Query: 218 VSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL------FGGVSGLMGL 266
Y DGS T G RE L GK A + FGCG G F G +G+MGL
Sbjct: 170 YGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGL 229
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
GR +S SQ FG FSYCL + S ++ GN + + +T ++ NP
Sbjct: 230 GRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGG--DGISKLFFTPLLTNPL 287
Query: 327 LATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
TFY + L + + G +L + GG ++DSGT + L Y ++ A
Sbjct: 288 SPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVR 347
Query: 380 KQFSGFPSA----PGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
++ P A PGF D C N+S E +P +K EF G A YF++
Sbjct: 348 RRVK-LPIADALTPGF---DLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRN--YFIE 401
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++ CLA+ S+ + +IGN Q+ +D S+LGF+ C+
Sbjct: 402 TEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 206/438 (47%), Gaps = 57/438 (13%)
Query: 68 TLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
TLE+ H + K + W E D +Q+L S M++G + +
Sbjct: 34 TLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLAS----MVAGR------SIV 83
Query: 123 PLTSGIRL-QTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
P+ SG ++ Q+ YI ++G T++ +DT +D W+ C C C + +F P
Sbjct: 84 PIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTST---LFAPEK 140
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S ++K V C S C+ + S C +S+ C + ++YG S + ++ + L
Sbjct: 141 STTFKNVSCGSPECNKVP-----SPSCGTSA---CTFNLTYGSSSIA-ANVVQDTVTLAT 191
Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
+ + FGC G GL+GLGR LSL+SQT ++ FSYCLPS + SG
Sbjct: 192 DPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSG 251
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK-------QLQASGFAK 352
SL LG + + I YT ++ NP+ ++ Y +NL I +G K L +
Sbjct: 252 SLRLGPVAQPIR----IKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATG 307
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-----LDTCFNLSAYQEVN 407
G + DSGTV TRL +Y+A++ EF ++ + A ++ DTC+ + +
Sbjct: 308 AGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKA-NLTVTSLGGFDTCYTV----PIV 362
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQR 464
P + F G M V + + S A S CLA+AS + +I N QQ+N R
Sbjct: 363 APTITFMFSG---MNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHR 419
Query: 465 VIYDTKNSQLGFAGEDCS 482
V+YD NS+LG A E C+
Sbjct: 420 VLYDVPNSRLGVARELCT 437
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 132/419 (31%), Positives = 194/419 (46%), Gaps = 86/419 (20%)
Query: 77 CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYI 136
CSG Q D V ++ S+ SGN+K+ ++ + + + N++
Sbjct: 75 CSGSGHSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHN-----NNLFDEDGNFL 129
Query: 137 ATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
+ G +N +I+DTGS +TW QC+ C +C F+ S S +Y C T
Sbjct: 130 VDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSSGSCIPGTVE 189
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNN 253
NY ++YGD S + G G + + L + V F FGCGRNN
Sbjct: 190 N-------------------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNN 230
Query: 254 KGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
KG FG GV G++GLG+ LS VSQT+ F +FSYCLP + + GSL+ G ++
Sbjct: 231 KGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLP---EEDSIGSLLFGEKAT--SQ 285
Query: 313 STPITYTNMIPNP---QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLP 367
S+ + +T+++ P Q + +Y +NL+ IS+G ++L +S FA G +IDS TVITRLP
Sbjct: 286 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLP 345
Query: 368 PSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
YSALKA F K + +P + G ILDTC+N E+T
Sbjct: 346 QRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNXXXXXX-------------PELT- 391
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
IIGN QQ + V+YD + ++GF CS
Sbjct: 392 ------------------------------IIGNRQQLSLTVLYDIQGGRIGFRSNGCS 420
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 125/382 (32%), Positives = 177/382 (46%), Gaps = 44/382 (11%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNS 190
YIA +G + I+DTGS+L W QC C+ C++Q +DPS S + + V CN
Sbjct: 71 YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACND 130
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-DFIFGC 249
+ C A G+ C+ + C +YG G G LG E S N FGC
Sbjct: 131 TAC-----ALGSETRCARDNK-ACAVLTAYGAG-VIGGVLGTEAFTFQPQSENVSLAFGC 183
Query: 250 ---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
R G G SG++GLGR +LSLVSQ + FSYCL P + + L +G
Sbjct: 184 IAATRLTPGSLDGASGIIGLGRGNLSLVSQLGD---NKFSYCLTPYFSQSTNTSRLFVGA 240
Query: 306 NSSVFKNSTPITYTNMIPNPQL---ATFYILNLTGISIGGKQL----------QASGFAK 352
++ + P T + NP + +TFY L LTGI++G +L Q +
Sbjct: 241 SAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLW 300
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVN--I 408
G LIDSG+ T L Y AL+ E ++Q P G LD C + A+ +V +
Sbjct: 301 AGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAV-AHGDVGKLV 359
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED------ETGIIGNYQQKN 462
P + + F G+ V V Y+ D S C+ + S + ET IIGNY Q++
Sbjct: 360 PPLVLHF-GSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQD 418
Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
++YD + L F DCSSM
Sbjct: 419 MHLLYDLEKGMLSFQPADCSSM 440
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 129/415 (31%), Positives = 196/415 (47%), Gaps = 41/415 (9%)
Query: 86 EQQQNRLILDNLHV--QYLQSRIKNMISGNIKD---VSNTEIPLTSGIRL-QTLNYIATI 139
+ Q N L +HV LQ + K+ D + +P+ SG ++ Q+ YI
Sbjct: 23 DVQDNGSTLQVIHVFKSVLQMQAKDTTRLQFLDSLVARKSVVPIASGRQIIQSPTYIVRA 82
Query: 140 ELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
++G T+++ DT +D W+ C C C + +F P S ++K V C + C +
Sbjct: 83 KIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPEKSTTFKNVSCAAPECKQVP 139
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF 257
N G C SS CN+ ++YG S L ++ + L V + FGC G
Sbjct: 140 ----NPG-CGVSS---CNFNLTYGSSSIA-ANLVQDTITLATDPVPSYTFGCVSKTTGTS 190
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
GL+GLGR LSL+SQT ++ FSYCLPS + SGSL LG + + I
Sbjct: 191 APPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKR----IK 246
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLPPSI 370
YT ++ NP+ ++ Y +NL I +G K + A G + DSGTV TRL +
Sbjct: 247 YTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPV 306
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
Y A++ EF ++ + DTC+N+ + +P + F G M V +
Sbjct: 307 YVAVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIFTG---MNVTLPQDNI 359
Query: 431 FVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S A S CLA+A + +I N QQ+N RV+YD NS++G A E C+
Sbjct: 360 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/315 (33%), Positives = 155/315 (49%), Gaps = 26/315 (8%)
Query: 113 NIKDVSNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
+ D T +P+ G + L+ NY+ ++LG G+ M +++DT +D WV C C C +
Sbjct: 22 TLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS 81
Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
F P+ S + + C+ + C + G S C ++ C + SYG S
Sbjct: 82 T---TFLPNASTTLGSLDCSEAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLAAT 133
Query: 230 LGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
L ++ + L + F FGC G GL+GLGR +SL+SQ ++ G+FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------G 342
PS + SGSL LG I T ++ NP + Y +NLTG+S+G
Sbjct: 194 PSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249
Query: 343 KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
+QL G +IDSGTVITR +Y A++ EF KQ +G S+ G DTCF +A
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AA 305
Query: 403 YQEVNIPLVKMEFEG 417
E P V + FEG
Sbjct: 306 TNEAEAPAVTLHFEG 320
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 134/428 (31%), Positives = 211/428 (49%), Gaps = 51/428 (11%)
Query: 74 KNYCSGK---IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
KNY + I+D + R++ YL S + +++ + P+ SG
Sbjct: 56 KNYSTSWENIIIDMASKDPERVV-------YLSS-----LDASLRRKPISAAPIASGQAF 103
Query: 131 QTLNYIATIELGGRN--MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
+Y+ ++LG N +++DT +D WV C C C + + P S +Y +
Sbjct: 104 GIGSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTGCTGC-SSSSTYYSPQASTTYGGAV- 161
Query: 189 NSSTCHALEFATGNSGV-CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
C+A A + C + C + SY GS L ++ L LG ++ + F
Sbjct: 162 ---ACYAPRCAQARGALPCPYTGSKACTFNQSYA-GSTFSATLVQDSLRLGIDTLPSYAF 217
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
GC + G GL+GLGR LSL SQ+S+++ G+FSYCLPS Q + SGSL LG
Sbjct: 218 GCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPSFQSSYFSGSLKLGPTG 277
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ------ASGFAKG-GILIDSG 360
+ I T ++ NP+ + Y +NLTG+++G ++ A KG G ++DSG
Sbjct: 278 QPRR----IRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLAFDPNKGSGTILDSG 333
Query: 361 TVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
TVITR +YSA++ EF Q G F S GF DTCF + Y+ + PL+K+ F G
Sbjct: 334 TVITRFVGPVYSAIRDEFRNQVKGPFFSRGGF---DTCF-VKTYENLT-PLIKLRFTG-- 386
Query: 420 EMTVDVT-----GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
+DVT +++ + + +A A + +I NYQQ+N RV++DT N+++
Sbjct: 387 ---LDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRVLFDTVNNRV 443
Query: 475 GFAGEDCS 482
G A E C+
Sbjct: 444 GIARELCN 451
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 148/339 (43%), Gaps = 71/339 (20%)
Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
+DT DL W+QC PC CY QQ+ +FDP S + V C S+ C L G G C
Sbjct: 150 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 205
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMG 265
S++ C YFV YGDG T G + L L ++V +F FGC
Sbjct: 206 SNNQ---CQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGC---------------- 246
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
S + GN S + T T ++ NP
Sbjct: 247 ----------------------------------SHAVRGNFSASTSGTMFARTPLVRNP 272
Query: 326 QL-ATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
+ T Y++ L GI +GG++L GG ++DS +IT+LPP+ Y AL+ F +
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 332
Query: 384 GFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+P A G + LDTC++ + V +P V + F+G A + +D G++ + CLA
Sbjct: 333 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCLA 385
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ G IGN QQ+ V+YD +GF C
Sbjct: 386 FVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 148/339 (43%), Gaps = 71/339 (20%)
Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
+DT DL W+QC PC CY QQ+ +FDP S + V C S+ C L G G C
Sbjct: 150 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 205
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMG 265
S++ C YFV YGDG T G + L L ++V +F FGC
Sbjct: 206 SNNQ---CQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGC---------------- 246
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
S + GN S + T T ++ NP
Sbjct: 247 ----------------------------------SHAVRGNFSASTSGTMFARTPLVRNP 272
Query: 326 QL-ATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
+ T Y++ L GI +GG++L GG ++DS +IT+LPP+ Y AL+ F +
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 332
Query: 384 GFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+P A G + LDTC++ + V +P V + F+G A + +D G++ + CLA
Sbjct: 333 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCLA 385
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ G IGN QQ+ V+YD +GF C
Sbjct: 386 FVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 147/339 (43%), Gaps = 71/339 (20%)
Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
+DT DL W+QC PC CY QQ+ +FDP S + V C S+ C L G G C
Sbjct: 168 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 223
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMG 265
S++ C YFV YGDG T G + L L ++V +F FGC +
Sbjct: 224 SNNQ---CQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVR----------- 269
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
GN S + T T ++ NP
Sbjct: 270 ---------------------------------------GNFSASTSGTMFARTPLVRNP 290
Query: 326 QL-ATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
+ T Y++ L GI +GG++L GG ++DS +IT+LPP+ Y AL+ F +
Sbjct: 291 SIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 350
Query: 384 GFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+P A G + LDTC++ + V +P V + F+G A + +D G++ + CLA
Sbjct: 351 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCLA 403
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ G IGN QQ+ V+YD +GF C
Sbjct: 404 FVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/391 (30%), Positives = 180/391 (46%), Gaps = 42/391 (10%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-YNQQDPVFDPSI 179
PL SG + Y I LG +++ ++ DTGSDL WV+C C++C ++ F P
Sbjct: 76 PLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRH 135
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSS---SPPDCNYFVSYGDGSYTRGELGREHLG 236
S S+ C C L A + +C+ + SP C + SY DGS + G +E
Sbjct: 136 SSSFSPFHCFDPHCRLLPHAPHH--LCNHTRLHSP--CRFLYSYADGSLSSGFFSKETTT 191
Query: 237 LGKAS-----VNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
L S + FGCG G F G G+MGLGR +S SQ FG F
Sbjct: 192 LKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKF 251
Query: 286 SYCLPS-TQDAGASGSLILGG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
SYCL T + L++GG +S N+T I+YT + NP TFY + + I+I G
Sbjct: 252 SYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDG 311
Query: 343 KQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA----PGF 391
+L + GG ++DSGT +T L + Y + ++ P+A PGF
Sbjct: 312 VKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK-LPNAAELTPGF 370
Query: 392 SILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED 450
D C N S + ++P ++ G A YF++++ +CLA+ ++ +
Sbjct: 371 ---DLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRN--YFLETEEGVMCLAIRAVESGN 425
Query: 451 ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+IGN Q+ + +D + S+LGF C
Sbjct: 426 GFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
Length = 507
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 122/419 (29%), Positives = 193/419 (46%), Gaps = 52/419 (12%)
Query: 88 QQNRLILDNLHVQYLQS-----RIKNMISGNIKDVSNTE---------IPLTSGIRLQTL 133
+Q++L +D++H++ L S R+ G +K+ +E I +T G +
Sbjct: 45 RQDQLRVDHIHMRLLSSSSQGVRVSKQKQGPVKEPVRSEVIHLHDQPVIQVTIGSERKGA 104
Query: 134 NYIATIEL-------GGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYK 184
+ + G TV++DT SD+ WVQC P S +DP+ S +Y
Sbjct: 105 SGGSGGSGDQQQSQAAGVVQTVVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYY 164
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR---GELGREHLGLGKAS 241
+ CNS+ C E G C ++ C Y V + G G + L L
Sbjct: 165 ALACNSAAC--TELGRLYRGACVNN---QCQYRVPIPSSPASSSSSGTYGSDLLKLTADP 219
Query: 242 VN----DFIFGC--GRNNKGLFGGV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
+ F FGC G +G G + +G+M LG SLVSQ + ++G FSYC+P+
Sbjct: 220 ADGASMSFKFGCSHGEAKQGGEGSIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPA 279
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SG 349
T+ ++ GG + T M+ ++ T Y + L I++ G+QL S
Sbjct: 280 TESRRPGFFVLGGGVGD-LSGAGGYAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSV 338
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
FA G +L DS T ITRLPP+ Y AL+ F + + + AP LDTC++ + V +P
Sbjct: 339 FASGSVL-DSRTAITRLPPTAYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVP 397
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
V + +GNA + +D GI++ CL S + + GI+GN QQ+ V+Y+
Sbjct: 398 RVALLLDGNAVVALDRQGILF-------HDCLVFTSNTDDRMPGILGNVQQQTMEVLYN 449
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 130/432 (30%), Positives = 197/432 (45%), Gaps = 37/432 (8%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
T++L H++ + + R+I L +R+ N++ N K + + + L +
Sbjct: 29 FTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSISRLNRVSNLLDQNNK-LPQSVLILHN 87
Query: 127 GIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKV 186
G L YI T + DTGSDL WVQC PC SC+ Q P+F P S ++
Sbjct: 88 GEYLMRF-YIGTPPV---ERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPT 143
Query: 187 LCNSSTCHAL---EFATGNSGVCSSSSPPDCNYFVSYGDG-SYTRGELGREHL------G 236
C S C L + G SG +C Y YGD S++ G L E L G
Sbjct: 144 TCRSQPCTLLLPEQKGCGKSG--------ECIYTYKYGDQYSFSEGLLSTETLRFDSQGG 195
Query: 237 LGKASVNDFIFGCG-RNNKGLFGG--VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
+ + + FGCG NN +F ++G+MGLG LSLVSQ + G FSYCL
Sbjct: 196 VQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCL--LP 253
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG 353
S S + GN S+ + T MI P L T+Y LNL +++ K + +G G
Sbjct: 254 LGSTSTSKLKFGNESIITGEG-VVSTPMIIKPWLPTYYFLNLEAVTVAQKTV-PTGSTDG 311
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVK 412
++IDSGT++T L S Y A + + S L CF Y++ P +
Sbjct: 312 NVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCF---PYRDNFVFPEIA 368
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+F G A +++ + + + D + VCL +A S I G++ Q + +V YD +
Sbjct: 369 FQFTG-ARVSLKPANL-FVMTEDRNTVCLMIAPSSVSG-ISIFGSFSQIDFQVEYDLEGK 425
Query: 473 QLGFAGEDCSSM 484
++ F DCS +
Sbjct: 426 KVSFQPTDCSKV 437
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 189/405 (46%), Gaps = 68/405 (16%)
Query: 134 NYIATIELGG---RNMTVIVDTGSDLTWVQCQP-----CKSCYNQQDPVFDPSISPSYKK 185
+Y + LG +++T+ +DTGSDL W C P C+ +N P+ +I+ S++
Sbjct: 18 DYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPL---NITRSHR- 73
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGR 232
V C S C + + +C+ + P DC+ ++ +YGDGS+ L R
Sbjct: 74 VSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFI-AHLHR 132
Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI---FGGLFSYCL 289
+ L + + + +F FGC +G+ G GR LSL +Q + + G FSYCL
Sbjct: 133 DTLSMSQLFLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCL 189
Query: 290 PS----TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
S + LILG YT+M+ NP+ + FY + LTGIS+G + +
Sbjct: 190 VSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTI 249
Query: 346 QASGFAK-------GGILIDSGTVITRLPPSIYSALKAEF-------LKQFSGFPSAPGF 391
A + GG+++DSGT T LP S+Y+++ AEF K+ S G
Sbjct: 250 LAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTG- 308
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-----SDASQVCLALASL 446
L C+ L EV P V F GN V + + YF + +A + L +
Sbjct: 309 --LGPCYFLEGLVEV--PTVTWHFLGNNS-NVMLPRMNYFYEFLDGEDEARRKVGCLMLM 363
Query: 447 SYEDET-------GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ D+T I+GNYQQ+ V+YD +N ++GFA C+S+
Sbjct: 364 NGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCASL 408
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 167/372 (44%), Gaps = 55/372 (14%)
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+ +DT SDL W+QCQPC SCY Q DP+F+P +S SY V C+S TC L+ G+ C
Sbjct: 102 SAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLD---GHR--C 156
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVSGLMG 265
C Y Y + T G L + L +G + + GC ++ G SGL+G
Sbjct: 157 DEDDDQACRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVLGCSDSSVGGPPPQASGLVG 216
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFKNSTPITYTNMI 322
L R LSL+SQ S F YCLP + G L+LG G +V S +T T M
Sbjct: 217 LARGPLSLLSQLSV---RRFMYCLPPPM-SRTPGKLVLGAGAGADAVRNVSDRVTVT-MS 271
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAKG-------------------------GILI 357
+ + ++Y LN G+++G Q G + G+++
Sbjct: 272 SSTRYPSYYYLNFDGLAVGD---QTPGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIV 328
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKM 413
D + I+ L S+Y L + ++ + P + LD CF L V +P V M
Sbjct: 329 DVASTISFLEASLYDELADDLEEEIRLPRATPSTRLGLDLCFILPEGVGIDRVYVPTVSM 388
Query: 414 EFEGN-AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F+G E+ D D +CL + S I+GNYQQ+N V+Y+ +
Sbjct: 389 SFDGRWLELERD-----RLFLEDGRMMCLMIGRTS---GVSILGNYQQQNMHVLYNLRRG 440
Query: 473 QLGFAGEDCSSM 484
++ FA C S+
Sbjct: 441 KITFAKASCDSL 452
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 138/421 (32%), Positives = 203/421 (48%), Gaps = 43/421 (10%)
Query: 77 CSGKIVDWNEQQQNRLI----LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQT 132
CS I E N +I D ++YL S M T +P+ G ++
Sbjct: 43 CSPFIPPKQEPLVNTVIDMASKDPARLKYLSSLAAQM---------TTAVPIAPGQQVLN 93
Query: 133 L-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
+ NY+ ++LG G+ M +++DT +D WV C C C + + S +Y + C+
Sbjct: 94 IGNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFST---NTSSTYGSLDCS 150
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
+ C + G S C ++ C + SYG S L + L L + +F FGC
Sbjct: 151 MAQCTQVR---GFS--CPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVNDVIPNFAFGC 205
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
+ G GL+GLGR LSL++Q+ ++ GLFSYCLPS + SGSL LG
Sbjct: 206 INSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAG-- 263
Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIG------GKQLQASGFAKG-GILIDSGTV 362
I YT ++ NP + Y +NLTG+S+G +L A G G +IDSGTV
Sbjct: 264 --QPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTV 321
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
ITR IY+A++ EF KQ +G S+ G DTCF +A E P V + F G +
Sbjct: 322 ITRFVQPIYTAIRDEFRKQVAGPFSSLG--AFDTCF--AATNEAVAPAVTLHFTGLNLVL 377
Query: 423 VDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+++ S S CLA+A+ + +I N QQ+N R+++D NS+LG A E
Sbjct: 378 PMENSLIH--SSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIAREL 435
Query: 481 C 481
C
Sbjct: 436 C 436
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 118/365 (32%), Positives = 168/365 (46%), Gaps = 34/365 (9%)
Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ + LG + + +VDTGSDL W QC PC CY Q+ P+F+P S +Y + C S
Sbjct: 81 DYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESE 140
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
C ++ +C+ Y SY D S T+G L RE + V D I
Sbjct: 141 QCSFFGYSCSPQKMCA--------YSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDII 192
Query: 247 FGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLIL 303
FGCG +N G F G++G+G LSLVSQ ++G FS CL P DA SG++
Sbjct: 193 FGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINF 252
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSG 360
G S V S T + + + T Y++ L GIS+G ++ + +KG I+IDSG
Sbjct: 253 GEESDV---SGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMIDSG 309
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEGN 418
T T +P Y L E Q S P D L E N+ P++ FEG
Sbjct: 310 TPATYIPQEFYERLVEELKVQSSLLPIE---DDPDLGTQLCYRSETNLEGPILTAHFEG- 365
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
V + I F+ C A+A + D I GN+ Q N + +D + F
Sbjct: 366 --ADVQLLPIQTFIPPKDGVFCFAMAGST--DGDYIFGNFAQSNILMGFDLDRKTISFKP 421
Query: 479 EDCSS 483
DC++
Sbjct: 422 TDCTN 426
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 154/315 (48%), Gaps = 26/315 (8%)
Query: 113 NIKDVSNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
+ D T +P+ G + L+ NY+ ++LG G+ M +++DT +D WV C C C +
Sbjct: 22 TLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS 81
Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
F P+ S + + C+ + C + G S C ++ C + SYG S
Sbjct: 82 T---TFLPNASTTLGSLDCSEAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLAAT 133
Query: 230 LGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
L ++ + L + F FGC G GL+GLGR +SL+SQ ++ G+FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------G 342
PS + SGSL LG I T ++ NP + Y +NLTG+S+G
Sbjct: 194 PSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249
Query: 343 KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
+QL G +IDSGTVITR +Y A++ EF KQ +G S+ G DTCF +
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AE 305
Query: 403 YQEVNIPLVKMEFEG 417
E P V + FEG
Sbjct: 306 TNEAEAPAVTLHFEG 320
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 128/418 (30%), Positives = 196/418 (46%), Gaps = 51/418 (12%)
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
K V W + L D +Q+L S + + +P+ SG ++ Q+ YI
Sbjct: 44 KPVSWEDSVLQMLAEDQARLQFLSSLVGR----------KSWVPIASGRQIVQSPTYIVK 93
Query: 139 IELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
+G T + +DT +D W+ C C C + VF+ S ++K + C++ C +
Sbjct: 94 ANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSVTSTTFKTLGCDAPQCKQV 150
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
T C S+ C + +YG GS L R+ + L V + FGC + G
Sbjct: 151 PNPT-----CGGST---CTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGS 201
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GL+GLGR LS +SQT +++ FSYCLPS + SG+L LG + I
Sbjct: 202 SVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLR----I 257
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPS 369
T ++ NP+ ++ Y +NL GI +G K + AS A G + DSGTV TRL
Sbjct: 258 KTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAP 317
Query: 370 IYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
+Y+A++ EF K+ S GF DTC+ + P + F G M V +
Sbjct: 318 VYTAVRDEFRKRVGNAIVSSLGGF---DTCYT----GPIVAPTMTFMFSG---MNVTLPP 367
Query: 428 IVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++S A S CLA+A+ + +I N QQ+N R+++D NS++G A E CS
Sbjct: 368 DNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 128/418 (30%), Positives = 196/418 (46%), Gaps = 51/418 (12%)
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
K V W + L D +Q+L S + + +P+ SG ++ Q+ YI
Sbjct: 44 KPVSWEDSVLQMLAEDQARLQFLSSLVGR----------KSWVPIASGRQIVQSPTYIVK 93
Query: 139 IELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
+G T + +DT +D W+ C C C + VF+ S ++K + C++ C +
Sbjct: 94 ANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSVTSTTFKTLGCDAPQCKQV 150
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
T C S+ C + +YG GS L R+ + L V + FGC + G
Sbjct: 151 PNPT-----CGGST---CTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGS 201
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GL+GLGR LS +SQT +++ FSYCLPS + SG+L LG + I
Sbjct: 202 SVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLR----I 257
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPS 369
T ++ NP+ ++ Y +NL GI +G K + AS A G + DSGTV TRL
Sbjct: 258 KTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAP 317
Query: 370 IYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
+Y+A++ EF K+ S GF DTC+ + P + F G M V +
Sbjct: 318 VYTAVRDEFRKRVGNAIVSSLGGF---DTCYT----GPIVAPTMTFMFSG---MNVTLPT 367
Query: 428 IVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++S A S CLA+A+ + +I N QQ+N R+++D NS++G A E CS
Sbjct: 368 DNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 132/439 (30%), Positives = 188/439 (42%), Gaps = 78/439 (17%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN------T 120
TLEL H++ S K + + QN+ RI N + +I V++ T
Sbjct: 29 FTLELIHRD--SSK-SPFYQPTQNKY-----------ERIANAVRRSINRVNHFYKYSLT 74
Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
P S + Y+ + +G V VDTGSDL W+QC+PCK CY Q P+FDPS
Sbjct: 75 STP-QSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPS 133
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
+S SY+ + C S TCH++ + C+ RG L E L L
Sbjct: 134 LSSSYQNIPCLSDTCHSMRTTS-------------CD----------VRGYLSVETLTLD 170
Query: 239 -----KASVNDFIFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCL--- 289
S + GCG N G F G SG++GLG +SL SQ GG FSYCL
Sbjct: 171 STTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPW 230
Query: 290 --PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
ST + I+ G+ ++ +TPI + + Y L L S+G K ++
Sbjct: 231 LPNSTSKLNFGDAAIVYGDGAM---TTPIVKKDA------QSGYYLTLEAFSVGNKLIEF 281
Query: 348 SGFAKGG----ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
G GG ILIDSGT T LP +Y ++ + + C+N+ AY
Sbjct: 282 GGPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNV-AY 340
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
PL+ F+G + + I F+K CLA +T I GN Q+N
Sbjct: 341 HGFEAPLITAHFKG---ADIKLYYISTFIKVSDGIACLAFI----PSQTAIFGNVAQQNL 393
Query: 464 RVIYDTKNSQLGFAGEDCS 482
V Y+ + + F DC+
Sbjct: 394 LVGYNLVQNTVTFKPVDCT 412
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 119/201 (59%), Gaps = 14/201 (6%)
Query: 92 LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
L D V+ + S++ I+ + +T++P +GI L + NYI TI +G +++++
Sbjct: 91 LRRDEARVESIHSKLSKNIADEVSKAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISLM 150
Query: 150 VDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
DTGSDLTW QC+PC SCY+Q++P F+PS S SY V C+S C GN CS+
Sbjct: 151 FDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSSPMC-------GNPESCSA 203
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLG 267
S +C Y + YGDGS T G L +E L + V +D FGCG NNKG+F G +G++GLG
Sbjct: 204 S---NCLYGIGYGDGSVTVGFLAKEKFTLTNSDVLDDIYFGCGENNKGVFIGSAGILGLG 260
Query: 268 RSDLSLVSQTSEIFGGLFSYC 288
S QT+ + +FSYC
Sbjct: 261 PGKFSFPLQTTTTYNNIFSYC 281
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/407 (28%), Positives = 188/407 (46%), Gaps = 42/407 (10%)
Query: 101 YLQSRIKNM------ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
Y+ +R+++ ++ + S +P++SG T Y + +G + T++ DT
Sbjct: 76 YICARLRSRQGGSRRVAAEVASSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADT 135
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH-ALEFATGNSGVCSSSSP 211
GSDLTWV+C + + VF P S S+ + C+S TC + F N CSS +
Sbjct: 136 GSDLTWVKC----AGASPPGRVFRPKTSRSWAPIPCSSDTCKLDVPFTLAN---CSSPAS 188
Query: 212 PDCNYFVSYGDGSY-TRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL-FGGVSGLM 264
P C Y Y +GS RG +G E + GK A + D + GC ++ G F G++
Sbjct: 189 P-CTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVL 247
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
LG + +S +Q + FGG FSYCL A+G L G TP T T +
Sbjct: 248 SLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQV---PRTPATQTKLFL 304
Query: 324 NPQLATFYILNLTGISIGGKQL----QASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
+P++ FY + + I + GK L + GG+++DSG +T L Y A+ A
Sbjct: 305 DPEM-PFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGNTLTVLAAPAYKAVVAALS 363
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
K G P F + C+N +A + E+ IP + ++F G+A + V VK
Sbjct: 364 KHLDGVPKV-SFPPFEHCYNWTARRPGAPEI-IPKLAVQFAGSARLEPPAKSYVIDVKPG 421
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
C+ + + + +IGN Q+ +D KN Q+ F +C+
Sbjct: 422 VK--CIGVQEGEWPGLS-VIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 136/463 (29%), Positives = 218/463 (47%), Gaps = 44/463 (9%)
Query: 33 GKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRL 92
G + + + W + + C S Q ++ I + K + K W+ + N
Sbjct: 6 GTTLIVIFSVMWLMRVNAIDPCAS-QPDNSDLNVIPIYSKCSPFKPPKADTWDNRIINMA 64
Query: 93 ILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIV 150
D + V+YL + + K VS P+ SG NY+ ++LG G+ + +++
Sbjct: 65 SKDPVRVKYLSTLVSQ------KTVSTA--PIASGQAFNIGNYVVRVKLGTPGQLLFMVL 116
Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
DT +D +V C C C D F P S SY + C+ C + + C ++
Sbjct: 117 DTSTDEAFVPCSGCTGC---SDTTFSPKASTSYGPLDCSVPQCGQVRGLS-----CPATG 168
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSD 270
C++ SY S++ L ++ L L + + FGC G GL+GLGR
Sbjct: 169 TGACSFNQSYAGSSFS-ATLVQDALRLATDVIPYYSFGCVNAITGASVPAQGLLGLGRGP 227
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
LSL+SQ+ + G+FSYCLPS + SGSL LG I T ++ +P +
Sbjct: 228 LSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRSPHRPSL 283
Query: 331 YILNLTGISIG-------GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
Y +N TGIS+G + L + G +IDSGTVITR +Y+A++ EF KQ
Sbjct: 284 YYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVG 343
Query: 384 G--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVYFVKSDASQVC 440
G F S F DTCF + Y+ + P + + FEG + ++ ++ + I S S C
Sbjct: 344 GTTFTSIGAF---DTCF-VKTYETL-APPITLHFEGLDLKLPLENSLI---HSSAGSLAC 395
Query: 441 LALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
LA+A+ + +I N+QQ+N R+++D N+++G A E C
Sbjct: 396 LAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAREVC 438
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 164/353 (46%), Gaps = 46/353 (13%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+DTGSDL WVQC+PC C+ Q P+FDPS S +Y + +S C NS +
Sbjct: 108 IDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC-------PNSPQKKYN 160
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGG-VSGL 263
C Y SY DGS + G L E + G +V+ +FGCG +N+G F G SG+
Sbjct: 161 HLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGI 220
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTNMI 322
+GL D S+VS+ G FSYC+ D + L+LG + +STP N
Sbjct: 221 LGLSAGDQSIVSR----LGSRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFN-- 274
Query: 323 PNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALK 375
FY + L GIS+G +L Q + +GG+++DSGT T L + L
Sbjct: 275 ------GFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLS 328
Query: 376 AEFLKQFSG------FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVTGI 428
E + G + + PG+ C+ +++ P + F A++ +D +
Sbjct: 329 NEIQRLVRGHFQQVIYRTIPGW----LCYKGRVNEDLRGFPELAFHFAEGADLVLDANSL 384
Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
FV+ + CLA+ + ++ +IG Q++ V YD ++ F DC
Sbjct: 385 --FVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 178/391 (45%), Gaps = 42/391 (10%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-YNQQDPVFDPSI 179
P+ SG + Y ++ +G + + ++ DTGSDL WV+C PC++C + F
Sbjct: 74 PVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARH 133
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSS---SPPDCNYFVSYGDGSYTRGELGREHLG 236
S +Y + C S C + N C+ + SP C Y +Y D S T G +E L
Sbjct: 134 STTYSAIHCYSPQCQLVPHPHPNP--CNRTRLHSP--CRYQYTYADSSTTTGFFSKEALT 189
Query: 237 LGKAS-----VNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
L ++ +N FGCG G F G G+MGLGR+ +S SQ FG F
Sbjct: 190 LNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKF 249
Query: 286 SYCLPS-TQDAGASGSLILGG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
SYCL T + L +GG N +V K +++T ++ NP TFY + + G+ + G
Sbjct: 250 SYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGI-MSFTPLLINPLSPTFYYIAIKGVYVNG 308
Query: 343 KQLQAS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA----PGF 391
+L + GG +IDSGT +T + Y+ + F K+ PS PGF
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVK-LPSPAEPTPGF 367
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
D C N+S +P + G + + YF+++ CLA+ +S +
Sbjct: 368 ---DLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRN--YFIETGDQIKCLAVQPVSQDGG 422
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++GN Q+ + +D S+LGF C+
Sbjct: 423 FSVLGNLMQQGFLLEFDRDKSRLGFTRRGCA 453
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 109/362 (30%), Positives = 171/362 (47%), Gaps = 31/362 (8%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ LG ++ I DTGSDL+W+QC PCK+CY Q+ P+FDP+ S +Y V C S
Sbjct: 88 YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQP 147
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH-------LGLGKASVNDF 245
C N C SS C Y YG S+T G LG + +G G A+
Sbjct: 148 CTLFP---QNQRECGSSK--QCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKS 202
Query: 246 IFGCGRNNKGLFG---GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
+FGC + F +G +GLG LSL SQ + G FSYC+ + ++G L
Sbjct: 203 VFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCM-VPFSSTSTGKLK 261
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTV 362
G + + + T + NP ++Y+LNL GI++G K++ +G G I+IDS +
Sbjct: 262 FGSMAP----TNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV-LTGQIGGNIIIDSVPI 316
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
+T L IY+ + + + + + + C + +N P F G A++
Sbjct: 317 LTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHFTG-ADVV 373
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ + F+ D + VC+ + I GN+ Q N +V YD ++ FA +CS
Sbjct: 374 LGPKNM--FIALDNNLVCMTVVP---SKGISIFGNWAQVNFQVEYDLGEKKVSFAPTNCS 428
Query: 483 SM 484
++
Sbjct: 429 TI 430
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 126/444 (28%), Positives = 205/444 (46%), Gaps = 32/444 (7%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
S S +S +++ + +++L H++ D + R+ +R+ + +
Sbjct: 16 SPSSISTREAGEGLRGFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRSSSRLNRVSHFL 75
Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
N ++ + + +G L TL YI T + I DTGSDL WVQC PC++C+ Q
Sbjct: 76 DEN--NLPESLLIPENGEYLMTL-YIGTPPV---ERLAIADTGSDLIWVQCSPCQNCFPQ 129
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
P+F+P S ++K C+S C ++ + G C Y SYGD S+T G +
Sbjct: 130 DTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVG-----QCIYSYSYGDKSFTVGVV 184
Query: 231 GREHLGLGK------ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI---F 281
G E L G S IFGCG N F + GL +S S++
Sbjct: 185 GTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQI 244
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G FSYCL + ++ L G + V N + T +I P +FY LNL ++IG
Sbjct: 245 GYKFSYCL-LPFSSNSTSKLKFGSEAIVTTNG--VVSTPLIIKPLFPSFYFLNLEAVTIG 301
Query: 342 GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL 400
K + +G G I+IDSGTV+T L + Y+ A L++ SA CF
Sbjct: 302 QK-VVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVAS-LQEVLSVESAQDLPFPFKFCF-- 357
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
Y+++ IP++ +F G A + + ++ ++ D + +CLA+ S I GN Q
Sbjct: 358 -PYRDMTIPVIAFQFTG-ASVALQPKNLLIKLQ-DRNMLCLAVVPSSLSG-ISIFGNVAQ 413
Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
+ +V+YD + ++ FA DC+ +
Sbjct: 414 FDFQVVYDLEGKKVSFAPTDCTKV 437
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 132/444 (29%), Positives = 204/444 (45%), Gaps = 54/444 (12%)
Query: 59 KSRIEMGAITLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
K I+ TL++ H + K + W E N D +QY S +
Sbjct: 25 KCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVAR----- 79
Query: 114 IKDVSNTEIPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQ 170
+ +P+ S ++ Q+ YI + G T+++ DT SD W+ C C C
Sbjct: 80 -----KSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTS 134
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
+ F P S S++ V C S C + T C S+ C + +YG S +
Sbjct: 135 KP--FAPIKSTSFRNVSCGSPHCKQVPNPT-----CGGSA---CAFNFTYGSSSIA-ASV 183
Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
++ L L + + FGC G GL+GLGR LSL+SQ+ ++ FSYCLP
Sbjct: 184 VQDTLTLATDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLP 243
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
S + SGSL LG V++ I YT ++ NP+ ++ Y +NL I +G K +
Sbjct: 244 SFKSINFSGSLRLG---PVYQPKR-IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPA 299
Query: 351 A-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLS 401
A G + DSGTV TRL +Y+A++ EF ++ P P ++ DTC+N+
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG--PKLPVTTLGGFDTCYNV- 356
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNY 458
+ +P + F G +T+ IV + S A S CLA+A + +I N
Sbjct: 357 ---PIVVPTITFLFSG-MNVTLPPDNIV--IHSTAGSTTCLAMAGAPDNVNSVLNVIANM 410
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
QQ+N RV++D NS++G A E C+
Sbjct: 411 QQQNHRVLFDVPNSRIGIARELCT 434
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 131/444 (29%), Positives = 202/444 (45%), Gaps = 54/444 (12%)
Query: 59 KSRIEMGAITLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
K I+ TL++ H + K + W E N D +QY S +
Sbjct: 25 KCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVAR----- 79
Query: 114 IKDVSNTEIPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQ 170
+ +P+ S ++ Q+ YI + G T+++ DT SD W+ C C C
Sbjct: 80 -----KSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTS 134
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
+ F P S S++ V C S C + T C S+ C + +YG S +
Sbjct: 135 KP--FAPIKSTSFRNVSCGSPHCKQVPNPT-----CGGSA---CAFNFTYGSSSIA-ASV 183
Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
++ L L + + FGC G GL+GLGR LSL+SQ+ ++ FSYCLP
Sbjct: 184 VQDTLTLAADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLP 243
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
S + SGSL LG V++ I YT ++ NP+ ++ Y +NL I +G K +
Sbjct: 244 SFKSINFSGSLRLG---PVYQPKR-IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPA 299
Query: 351 A-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLS 401
A G + DSGTV TRL +Y+A++ EF ++ P P ++ DTC+N+
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG--PKLPVTTLGGFDTCYNV- 356
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNY 458
+ +P + F G M V + + S A S CLA+A + +I N
Sbjct: 357 ---PIVVPTITFLFSG---MNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 410
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
QQ+N RV++D NS++G A E C+
Sbjct: 411 QQQNHRVLFDVPNSRIGIARELCT 434
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 165/355 (46%), Gaps = 46/355 (12%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
V +DTGSDL WVQC+PC C+ Q P+FDPS S +Y + +S C NS
Sbjct: 74 VGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC-------PNSPQKK 126
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGG-VS 261
+ C Y SY DGS + G L E + G +V+ +FGCG +N+G F G S
Sbjct: 127 YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQS 186
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTN 320
G++GL D S+VS+ G FSYC+ D + L+LG + +STP N
Sbjct: 187 GILGLSAGDQSIVSR----LGSRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFN 242
Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSA 373
FY + L GIS+G +L Q + +GG+++DSGT T L +
Sbjct: 243 --------GFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294
Query: 374 LKAEFLKQFSG------FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVT 426
L E + G + + PG+ C+ +++ P + F A++ +D
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGW----LCYKGRVNEDLRGFPELAFHFAEGADLVLDAN 350
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ FV+ + CLA+ + ++ +IG Q++ V YD ++ F DC
Sbjct: 351 SL--FVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 191/387 (49%), Gaps = 40/387 (10%)
Query: 117 VSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPV 174
+S + P + +R Y+ + +G + I DTGSDLTW QC+PCK C+ Q P+
Sbjct: 65 LSTSSDPGPARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPI 124
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
+D + S S+ + C+S+TC + S CS+ S C Y +Y DG+Y+ E
Sbjct: 125 YDTTTSSSFSPLPCSSATCLPIW-----SSRCSTPS-ATCRYRYAYDDGAYS-----PEC 173
Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
G+ SV FGCG +N GL +G +GLGR LSLV+Q G FSYCL +
Sbjct: 174 AGI---SVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFN 227
Query: 295 AGASGSLILGGNSSVFKNSTP-----ITYTNMIPNPQLATFYILNLTGISIGGKQLQASG 349
S + G + + +S + T ++ +P + Y ++L GIS+G +L
Sbjct: 228 TSLSSPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPN 287
Query: 350 --------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
GG+++DSGT+ T L + + + + + G P S+ CF
Sbjct: 288 GTFDLNDDDGSGGMIVDSGTIFTILVETGFRVV-VDHVAGVLGQPVVNASSLDRPCFPAP 346
Query: 402 A--YQEV-NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGN 457
A QE+ ++P + + F G A+M + + F + ++S CL + + E +G ++GN
Sbjct: 347 AAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESS-FCLNI--VGTESASGSVLGN 403
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+QQ+N ++++D QL F DCS +
Sbjct: 404 FQQQNIQMLFDITVGQLSFMPTDCSKL 430
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 165/355 (46%), Gaps = 46/355 (12%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
V +DTGSDL WVQC+PC C+ Q P+FDPS S +Y + +S C NS
Sbjct: 74 VGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC-------PNSPQKK 126
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGG-VS 261
+ C Y SY DGS + G L E + G +V+ +FGCG +N+G F G S
Sbjct: 127 YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQS 186
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTN 320
G++GL D S+VS+ G FSYC+ D + L+LG + +STP N
Sbjct: 187 GILGLSAGDQSIVSR----LGSRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFN 242
Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSA 373
FY + L GIS+G +L Q + +GG+++DSGT T L +
Sbjct: 243 --------GFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294
Query: 374 LKAEFLKQFSG------FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVT 426
L E + G + + PG+ C+ +++ P + F A++ +D
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGW----LCYKGRVNEDLRGFPELAFHFAEGADLVLDAN 350
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ FV+ + CLA+ + ++ +IG Q++ V YD ++ F DC
Sbjct: 351 SL--FVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 111/347 (31%), Positives = 168/347 (48%), Gaps = 32/347 (9%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
+DTGS++ W+QCQPC +C+NQ P+F+PS S SYK + C SSTC T ++ + S
Sbjct: 105 FMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSSTCK----DTNDTHISCS 160
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLFGG-VSG 262
+ C Y ++YG + ++G+L + L L S + +F GCG N SG
Sbjct: 161 NGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSG 220
Query: 263 LMGLGRSDLSLVSQT-SEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPIT 317
++G+GR +SL+ Q S G FSYCL P D+ +S LI G + V STP+
Sbjct: 221 VVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMV 280
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQL---QASGFAKGGILIDSGTVITRLPPSIYSAL 374
N N +Y L L S+G ++ + S + ILIDSGT +T LP S L
Sbjct: 281 KVNGQEN-----YYFLTLEAFSVGNNRIEYGERSNASTQNILIDSGTPLTMLPNLFLSKL 335
Query: 375 KAEFLKQFSGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
++ Q P P L C+N + +++N+P + F G A++ ++ G F
Sbjct: 336 -VSYVAQEVKLPRIEPPDHHLSLCYNTTG-KQLNVPDITAHFNG-ADVKLNSNGT--FFP 390
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ +C S + I GN Q N + YD + + F D
Sbjct: 391 FEDGIMCFGFIS---SNGLEIFGNIAQNNLLIDYDLEKEIISFKPTD 434
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 168/367 (45%), Gaps = 57/367 (15%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSS 191
Y +TI LG ++ ++++DTGSDLTWV+C PC C + FD S +YK + C
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSS----TFDRLASNTYKALTCAD- 57
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND------F 245
+Y YGDGS+T+G+L + L + A+ ++ F
Sbjct: 58 -----------------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGF 94
Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL--PSTQDAGASGSLIL 303
+FGCG KGL G G++ L LS SQ E +G FSYCL + Q++ ++
Sbjct: 95 VFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVF 154
Query: 304 GGNSSVFKN--STPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG---GIL 356
G + K S + P + + +Y + L GIS+G ++L S F G +
Sbjct: 155 GEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKPTI 214
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
DSGT +T LPP + ++K SG F + G LD CF + +P +
Sbjct: 215 FDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKG---LDACFRVPPSSGQGLPDITFH 271
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F G A+ VT +V S CL +E I GN QQ++ V++D N ++
Sbjct: 272 FNGGADF---VTRPSNYVIDLGSLQCLIFVP---TNEVSIFGNLQQQDFFVLHDMDNRRI 325
Query: 475 GFAGEDC 481
GF DC
Sbjct: 326 GFKETDC 332
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 140/460 (30%), Positives = 199/460 (43%), Gaps = 74/460 (16%)
Query: 67 ITLELKH---KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP 123
+ LEL H K C+ K ++ R + H R+ +M G +
Sbjct: 33 LRLELTHVDAKQNCTTK-------ERMRRATERTH-----RRLASMAGGGGE-------- 72
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSI 179
++ I YIA +G + I+DTGS+L W QC C++ C+ Q +DPS
Sbjct: 73 ASAPIHWNETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSR 132
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE--HLGL 237
S + K V CN + C G+ C+ C +YG G+ G LG E G
Sbjct: 133 SRTAKPVACNDTAC-----LLGSETRCARDGK-ACAVLTAYGAGAIG-GFLGTEVFTFGH 185
Query: 238 GKASVND--FIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PS 291
G++S N+ FGC R G G SG++GLGR LSL SQ + FSYCL P
Sbjct: 186 GQSSENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGD---NKFSYCLTPY 242
Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQAS 348
DA + +L +G ++ + P T + NP +FY L LTGI++G +L
Sbjct: 243 FSDAANTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVP 302
Query: 349 GFA----------KGGILIDSGTVITRLPPSIYSALKAEFLKQF--SGFPSAPGFSILDT 396
A GG LIDSG+ T L Y AL+ E ++Q S P G LD
Sbjct: 303 AAAFDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDL 362
Query: 397 CFNLSAYQEVN--IPLVKMEFEGNAEMTVDVTGIV----YFVKSDASQVCLALASLSYED 450
C A + +P + + F DV +V Y+ D S C+ + S +
Sbjct: 363 CVGGVAPGDAGKLVPPLVLHFGSGGGGGGDV--VVPPENYWGPVDDSTACMVVFSSGGPN 420
Query: 451 ------ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
ET IIGNY Q++ ++YD L F DCSS+
Sbjct: 421 STLPLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADCSSV 460
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 171/388 (44%), Gaps = 36/388 (9%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP--VFDPS 178
P+ SG + Y + LG + + ++ DTGSDL WV+C C++C + P F
Sbjct: 77 PVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNC-TRHTPGSAFLAR 135
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S ++ C S C + + + P C Y SYGDGS T G +E L
Sbjct: 136 HSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSP-CRYEYSYGDGSKTSGFFSKETTTLN 194
Query: 239 -----KASVNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
+A + FGC G F G G+MGLGR +SL SQ FG FSY
Sbjct: 195 TSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSY 254
Query: 288 CLPSTQDAGASGSLILGGNS--SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
CL + + S +L G++ V + +T + NP TFY + + +S+ G +L
Sbjct: 255 CLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKL 314
Query: 346 QASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA----PGFSIL 394
+ GG ++DSGT +T LP Y + +K+ PS PGF
Sbjct: 315 PINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQI-LTVIKRRVRLPSPAEPTPGF--- 370
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
D C N+S + +P K+ F+ + YFV +D CLAL ++ +
Sbjct: 371 DLCVNVSEIEHPRLP--KLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSV 428
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
IGN Q+ + +D ++LGF+ C+
Sbjct: 429 IGNLMQQGFLLEFDKDRTRLGFSRHGCA 456
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 114/325 (35%), Positives = 161/325 (49%), Gaps = 24/325 (7%)
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFAT-GNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
P FD S S + C+S+ C L A+ GN+ + + C Y Y D S T G L
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQT---CVYTYYYNDKSVTTGLLE 231
Query: 232 REHLGLGK-ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
+ G ASV FGCG N G+F +G+ G GR LSL SQ G FS+C
Sbjct: 232 VDKFTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCF 288
Query: 290 PSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA- 347
+ S +++L + ++KN + T +I N T Y L+L GI++G +L
Sbjct: 289 TAVNGLKQS-TVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVP 347
Query: 348 -SGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLS 401
S FA GG +IDSGT IT LPP +Y ++ EF Q P PG + TCF+
Sbjct: 348 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGPYTCFSAP 406
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNYQ 459
+ + ++P + + FEG A M + V+ V DA S +CLA+ L DE IGN+Q
Sbjct: 407 SQAKPDVPKLVLHFEG-ATMDLPRENYVFEVPDDAGNSMICLAINELG--DERATIGNFQ 463
Query: 460 QKNQRVIYDTKNSQLGFAGEDCSSM 484
Q+N V+YD +N+ L F C +
Sbjct: 464 QQNMHVLYDLQNNMLSFVAAQCDKL 488
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 14/139 (10%)
Query: 337 GISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
GI++G +L S FA GG +IDSGT IT LPP +Y ++ EF Q P PG
Sbjct: 41 GITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPG 99
Query: 391 FSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLS 447
+ TCF+ + + ++P + + FEG A M + V+ V DA S +CLA ++
Sbjct: 100 NATGPYTCFSAPSQAKPDVPKLVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---IN 155
Query: 448 YEDETGIIGNYQQKNQRVI 466
DET IIGN+QQ+N +
Sbjct: 156 KGDETTIIGNFQQQNMHAL 174
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 128/373 (34%), Positives = 183/373 (49%), Gaps = 53/373 (14%)
Query: 134 NYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCN 189
NY+ I +G ++ I DTGSDLTWVQC PC + C+ Q P++DP S ++ + C+
Sbjct: 95 NYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCD 154
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE--HLGLGKASVNDFI- 246
S C L ++ VCS DC Y +YGD SY+ G L + L L + N I
Sbjct: 155 SQPCTQLPYS---QYVCSDYG--DCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKIC 209
Query: 247 FGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDAGAS---- 298
FGCG NK G +G++GLG LSLVSQ + G FSYC LP + ++ +
Sbjct: 210 FGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFG 269
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILID 358
+ I+ GN V STP +I P L FY LNL GI++G K ++ +G G I+ID
Sbjct: 270 EAAIVQGNGVV---STP-----LIIKPDLP-FYYLNLEGITVGAKTVK-TGQTDGNIIID 319
Query: 359 SGTVITRLPPSIYS---ALKAEFL----KQFSGFPSAPGFSILDTCFNLSAYQE--VNIP 409
SG+ +T L S Y+ +L E + Q+ +P D CF Y+E P
Sbjct: 320 SGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYP-------FDFCF---TYKEGMSTPP 369
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
V F G + + +V + + +C + S+ D I GN Q + V YD
Sbjct: 370 DVVFHFTGGDVVLKPMNTLVLI---EDNLICSTVVP-SHFDGIAIFGNLGQIDFHVGYDI 425
Query: 470 KNSQLGFAGEDCS 482
+ ++ FA DCS
Sbjct: 426 QGGKVSFAPTDCS 438
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 117/397 (29%), Positives = 180/397 (45%), Gaps = 38/397 (9%)
Query: 118 SNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDP-- 173
++++ PL SG + Y +I LG + + ++ DTGSDLTWV+C CK+ + P
Sbjct: 66 TSSKSPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGS 125
Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS-PPDCNYFVSYGDGSYTRGELGR 232
F S ++ C SS C + N C+ + C Y Y DGS T G +
Sbjct: 126 TFLARHSTTFSPTHCFSSLCQLV--PQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSK 183
Query: 233 EHLGLGKAS-----VNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIF 281
E L +S + FGCG + G F G SG+MGLGR +S SQ F
Sbjct: 184 ETTTLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRF 243
Query: 282 GGLFSYC-LPSTQDAGASGSLILGGNSSVFK-NSTPITYTNMIPNPQLATFYILNLTGIS 339
G FSYC L T + L++G S K N + +++T ++ NP+ TFY +++ G+
Sbjct: 244 GRSFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVF 303
Query: 340 IGGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-- 390
+ G +L GG +IDSGT +T L Y + + F ++ PG
Sbjct: 304 VDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGA 363
Query: 391 --FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
S D C N++ P + +E G + + YF+ CLA+ +
Sbjct: 364 STRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRN--YFIDISEGIKCLAIQPV-- 419
Query: 449 EDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
E E+G +IGN Q+ + +D S+LGF+ C+
Sbjct: 420 EAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGCA 456
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 160/358 (44%), Gaps = 49/358 (13%)
Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
+QCQPC SCY Q DPVF+P +S SY V C S TC L+ G+ C C Y
Sbjct: 1 MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLD---GHR--CHEDDDGACQYTY 55
Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVSGLMGLGRSDLSLVSQT 277
Y T+G L + L +G + +FGC ++ G SGL+GLGR LSLVSQ
Sbjct: 56 KYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQL 115
Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
S F YCLP + SG L+LG + +N + M + + ++Y LNL G
Sbjct: 116 SV---HRFMYCLPPPM-SRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDG 171
Query: 338 ISIGGKQLQASGFAKG--------------------------GILIDSGTVITRLPPSIY 371
+++G + + A G+++D + I+ L S+Y
Sbjct: 172 LAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLY 231
Query: 372 SALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEFEGN-AEMTVDVT 426
L + ++ + P + LD CF L V +P V + F+G E+ D
Sbjct: 232 DELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRD-- 289
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+D +CL + S I+GN+Q +N RV+++ + ++ FA C S+
Sbjct: 290 ---RLFVTDGRMMCLMIGRTS---GVSILGNFQLQNMRVLFNLRRGKITFAKASCDSL 341
>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
gi|194703714|gb|ACF85941.1| unknown [Zea mays]
Length = 208
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 121/220 (55%), Gaps = 14/220 (6%)
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
MGLG SLVSQT+ G FSYCLP T + SG L L ++ ++ T M+
Sbjct: 1 MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSS--SGFLTL--GAAGGSGTSGFVKTPMLR 56
Query: 324 NPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
+ Q+ TFY + L I +GG+QL AS F+ G ++ DSGTVITRLPP+ YSAL + F
Sbjct: 57 SSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM-DSGTVITRLPPTAYSALSSAFKAG 115
Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
+P A ILDTCF+ S V+IP V + F G A +++D +GI+ CL
Sbjct: 116 MKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-------SNCL 168
Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
A A S + GIIGN QQ+ V+YD +GF C
Sbjct: 169 AFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 117/401 (29%), Positives = 183/401 (45%), Gaps = 47/401 (11%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI-RLQTLNYIATIELGGRNMTVI--VD 151
D +Q+L S + + +P+ SG +Q+ +YI ++G T++ +D
Sbjct: 4 DQARLQFLSSLVAK----------KSVVPIASGRGVIQSPSYIVKAKVGTPPQTLLMALD 53
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
D W+ PCK C VF+ S ++K + C + C + + +C S+
Sbjct: 54 NSYDAAWI---PCKGCVGCSSTVFNTVKSTTFKTLGCGAPQCKQVP-----NPICGGST- 104
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
C + +YG S L R+ + L V + FGC + G GL+G GR L
Sbjct: 105 --CTWNTTYGS-STILSNLTRDTIALSMDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPL 161
Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
S +SQT ++ FSYCLPS + SGSL LG + I T ++ NP+ ++ Y
Sbjct: 162 SFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPR----IKTTPLLKNPRRSSLY 217
Query: 332 ILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+ L GI +G K + S A G + DSGTV TRL Y A++ EF K+ G
Sbjct: 218 YVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRV-G 276
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLAL 443
+ DTC+++ + P + F G M V + + S A CLA+
Sbjct: 277 NATVSSLGGFDTCYSV----PIVPPTITFMFSG---MNVTMPPENLLIHSTAGVTSCLAM 329
Query: 444 ASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
A+ + +I + QQ+N R+++D NS+LG A E CS
Sbjct: 330 AAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 159/348 (45%), Gaps = 46/348 (13%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
I DTGSD+ W+QC+PCK CYNQ P F PS S +YK + C+S C + +
Sbjct: 103 IADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDLCKSGQ----------- 151
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLG 267
G+ + L E S + GCG +N F G SG++GLG
Sbjct: 152 -------------QGNLSVDTLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLG 198
Query: 268 RSDLSLVSQTSEIFGGLFSYC-LPSTQDAGASGSLILGGNSSVFKN---STPITYTNMIP 323
SL++Q FSYC LP+ ++ + L G + V + STPI + I
Sbjct: 199 GGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPI- 257
Query: 324 NPQLATFYILNLTGISIGGKQLQASGFAKGG----ILIDSGTVITRLPPSIYSALKAEFL 379
FY L L S+G K+++ G + GG I+IDSGT +T +P +Y+ L++ L
Sbjct: 258 -----VFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVL 312
Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+ + + C+++++ + P++ F+G V + I FV V
Sbjct: 313 ELVKLKRVNDPTRLFNLCYSVTS-DGYDFPIITTHFKG---ADVKLHPISTFVDVADGIV 368
Query: 440 CLALASLSY---EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CLA A+ S D I GN Q+N V YD + + F DCS +
Sbjct: 369 CLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCSKV 416
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 163/371 (43%), Gaps = 57/371 (15%)
Query: 128 IRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYK 184
+ T Y+ I +G + T ++DTGSDL W QC PC+ C+ Q P++ P+ S +Y
Sbjct: 85 VHASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYA 144
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLG-KAS 241
V C S C AL+ S SPPD C Y+ SYGDG+ T G L E LG +
Sbjct: 145 NVSCRSPMCQALQ------SPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTA 198
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V FGCG N G SGL+G+GR LSLVSQ L T+ + +
Sbjct: 199 VRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQ------------LGVTRPRRSCRAR 246
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGG 354
++P L GI++G L + + GG
Sbjct: 247 AAARGGGAPTTTSP-------------------LEGITVGDTLLPIDPAVFRLTPMGDGG 287
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKM 413
++IDSGT T L + AL A L P A G + L CF ++ + V +P + +
Sbjct: 288 VIIDSGTTFTALEERAFVAL-ARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVL 346
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F+G A+M + V +S A CL + S ++G+ QQ+N ++YD +
Sbjct: 347 HFDG-ADMELRRESYVVEDRS-AGVACLGMVS---ARGMSVLGSMQQQNTHILYDLERGI 401
Query: 474 LGFAGEDCSSM 484
L F C +
Sbjct: 402 LSFEPAKCGEL 412
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 193/400 (48%), Gaps = 56/400 (14%)
Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++N ++PL R ++ Y I+LG + V VDTGSD+ WV C PC C + D
Sbjct: 58 LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 117
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
++D S + K V C + C + S C + P C+Y V YGDGS + G
Sbjct: 118 GIPLSLYDSKASSTSKNVGCEDAFCSFIM----QSETCGAKKP--CSYHVVYGDGSTSDG 171
Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ ++++ L + + N + +FGCG+N G G V G+MG G+S+ S++SQ
Sbjct: 172 DFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQ 231
Query: 277 TSEIFGG----LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
+ GG +FS+CL + G +G S +TP ++PN Y
Sbjct: 232 LAA--GGSVKRIFSHCL---DNMNGGGIFAIGEVESPVVKTTP-----LVPN---QVHYN 278
Query: 333 LNLTGISIGGKQLQ-----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
+ L G+ + G+ + AS GG +IDSGT + LP ++Y++L +++ +
Sbjct: 279 VILKGMDVDGEPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSL----IEKITAKQQ 334
Query: 388 APGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
+ +T CF+ ++ + P+V + FE + +++V ++ ++ D
Sbjct: 335 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG 394
Query: 446 LSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ +D +I G+ N+ V+YD +N +G+A +CSS
Sbjct: 395 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 434
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 129/399 (32%), Positives = 186/399 (46%), Gaps = 63/399 (15%)
Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDP---V 174
IPL+ G QTL +I+DTGSDL W C C++C ++ +P +
Sbjct: 92 IPLSFGTPPQTL-------------PLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNI 138
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP--PDCN-----YFVSYGDGSYTR 227
F P S S K + C + C + + S C P P+C Y V YG G T
Sbjct: 139 FIPKSSSSSKVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLVFYGSG-ITG 196
Query: 228 GELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--F 285
G + E L L V +FI GC + G+SG GR SL SQ GL F
Sbjct: 197 GIMLSETLDLPGKGVPNFIVGCSVLSTSQPAGISGF---GRGPPSLPSQL-----GLKKF 248
Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA------TFYILNLTG 337
SYCL S + D S SL+L G S + + ++YT + NP++A +Y L L
Sbjct: 249 SYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRH 308
Query: 338 ISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSA 388
I++GGK ++ GG +IDSGT T + I+ + AEF KQ
Sbjct: 309 ITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEV 368
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS--L 446
G + L CFN+S + P + ++F G AEM + + V F+ D VCL + +
Sbjct: 369 EGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGD-DVVCLTIVTDGA 427
Query: 447 SYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ ++ +G I+GN+QQ+N V YD +N +LGF + C
Sbjct: 428 AGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 191/400 (47%), Gaps = 56/400 (14%)
Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++N ++PL R ++ Y I+LG + V VDTGSD+ WV C PC C + D
Sbjct: 55 LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 114
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
++D S + K V C C + S C + P C+Y V YGDGS + G
Sbjct: 115 GIPLSLYDSKTSSTSKNVGCEDDFCSFIM----QSETCGAKKP--CSYHVVYGDGSTSDG 168
Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ ++++ L + + N + +FGCG+N G G V G+MG G+S+ S++SQ
Sbjct: 169 DFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ 228
Query: 277 TSEIFGG----LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
+ GG +FS+CL + G +G S +TPI +PN Y
Sbjct: 229 LAA--GGSTKRIFSHCL---DNMNGGGIFAVGEVESPVVKTTPI-----VPN---QVHYN 275
Query: 333 LNLTGISIGGKQLQ-----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
+ L G+ + G + AS GG +IDSGT + LP ++Y++L +++ +
Sbjct: 276 VILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSL----IEKITAKQQ 331
Query: 388 APGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
+ +T CF+ ++ + P+V + FE + +++V ++ ++ D
Sbjct: 332 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG 391
Query: 446 LSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ +D +I G+ N+ V+YD +N +G+A +CSS
Sbjct: 392 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 431
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 172/357 (48%), Gaps = 38/357 (10%)
Query: 7 PLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQ----WQQKSGSSSSCVSHQKSRI 62
PL + LL + + LFL + + + H L ++ ++ ++++
Sbjct: 10 PLLPFTFLLCVGMLLFLQSAQSRPISVPEVPAYHALDVASSLRETDTAAGGAEYKRETKP 69
Query: 63 EMGAITLELKHKNY-----CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
++E+ H++ + + + + +L + + V+ L+ +I+ ++ N V
Sbjct: 70 RRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPV 129
Query: 118 SNTEI----------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK 165
+ E + SG+ + Y I +G R +++DTGSD+ W+QC+PC+
Sbjct: 130 NRYENVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCR 189
Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
CY+Q DP+F+PS S S+ V C+S+ C L+ +SG C Y SYGDGSY
Sbjct: 190 ECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSG--------GCLYEASYGDGSY 241
Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
+ G E L G SV + GCG N GLF G +GL+GLG LS +Q G F
Sbjct: 242 STGSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTF 301
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISI 340
SYCL +++ +SG L G S P+ +T + NP L TFY L++T ISI
Sbjct: 302 SYCL-VDRESDSSGPLQFG------PKSVPVGSIFTPLEKNPHLPTFYYLSVTAISI 351
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 191/400 (47%), Gaps = 56/400 (14%)
Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++N ++PL R ++ Y I+LG + V VDTGSD+ WV C PC C + D
Sbjct: 59 LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 118
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
++D S + K V C C + S C + P C+Y V YGDGS + G
Sbjct: 119 GIPLSLYDSKTSSTSKNVGCEDDFCSFIM----QSETCGAKKP--CSYHVVYGDGSTSDG 172
Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ ++++ L + + N + +FGCG+N G G V G+MG G+S+ S++SQ
Sbjct: 173 DFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ 232
Query: 277 TSEIFGG----LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
+ GG +FS+CL + G +G S +TPI +PN Y
Sbjct: 233 LAA--GGSTKRIFSHCL---DNMNGGGIFAVGEVESPVVKTTPI-----VPN---QVHYN 279
Query: 333 LNLTGISIGGKQLQ-----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
+ L G+ + G + AS GG +IDSGT + LP ++Y++L +++ +
Sbjct: 280 VILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSL----IEKITAKQQ 335
Query: 388 APGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
+ +T CF+ ++ + P+V + FE + +++V ++ ++ D
Sbjct: 336 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG 395
Query: 446 LSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ +D +I G+ N+ V+YD +N +G+A +CSS
Sbjct: 396 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 435
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 129/434 (29%), Positives = 195/434 (44%), Gaps = 74/434 (17%)
Query: 113 NIKDVSNTEIPLTSGIRLQTLNYIATIELGG---RNMTVIVDTGSDLTWVQCQP--CKSC 167
++++ +PL+ G +Y + L +++++ +DTGSDL W C+P C C
Sbjct: 65 HLRNRHQVSLPLSPGS-----DYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKPFECILC 119
Query: 168 Y----NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN- 215
N P +S + + V C SS C A S +C+ + P DC+
Sbjct: 120 EGKAENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCHS 179
Query: 216 -----YFVSYGDGSYTRGELGREHLGLGKA----SVNDFIFGCGRNNKGLFGGVSGLMGL 266
++ +YGDGS L + + L A S+++F FGC G+ G
Sbjct: 180 FSCPSFYYAYGDGSLV-ARLYHDSIKLPLATPSLSLHNFTFGCAHT---ALAEPVGVAGF 235
Query: 267 GRSDLSLVSQTSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSS----VFKNSTP 315
GR LSL +Q + G FSYCL S + LILG + V K+
Sbjct: 236 GRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQ 295
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPP 368
YT+M+ NP+ FY + L GISIG K++ A F K GG+++DSGT T LP
Sbjct: 296 FVYTSMLDNPKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPA 355
Query: 369 SIYSALKAEFLKQ----FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
S+Y+++ AEF + + + L C+ VNIP + + F GN E +V
Sbjct: 356 SLYNSVVAEFDNRVGRVYERAKEVEDKTGLGPCYYYDTV--VNIPSLVLHFVGN-ESSVV 412
Query: 425 VTGIVYF---------VKSDASQVCLALASLSYEDE-TG----IIGNYQQKNQRVIYDTK 470
+ YF V+ CL L + E E TG +GNYQQ V+YD +
Sbjct: 413 LPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLE 472
Query: 471 NSQLGFAGEDCSSM 484
++GFA C+S+
Sbjct: 473 QRRVGFARRKCASL 486
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 167/362 (46%), Gaps = 44/362 (12%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ +++G + ++DTGS++TW QC PC CY Q P+FDPS S ++K+ C+ +
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEKRCHDHS 439
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-----IF 247
C Y V Y D +YT+G L + + + S F I
Sbjct: 440 CP---------------------YEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETII 478
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
GCGRNN G +GL LSL++Q + GL SYC AG S I G +
Sbjct: 479 GCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCF-----AGNGTSKINFGTN 533
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVI 363
++ ++ T M FY LNL +S+G +++ G +G I+IDSGT +
Sbjct: 534 AIVGGGGVVS-TTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTL 592
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
T P S + ++ P+A C+ S E+ P++ M F G A++ +
Sbjct: 593 TYFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCY-YSNTTEI-FPVITMHFSGGADLVL 650
Query: 424 DVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
D + F++S + + CLA+ + E I GN Q N V YD+ + + F +CS
Sbjct: 651 DKYNM--FMESYSGGLFCLAIICNNPTQE-AIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
Query: 483 SM 484
++
Sbjct: 708 AL 709
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 107/369 (28%), Positives = 154/369 (41%), Gaps = 73/369 (19%)
Query: 117 VSNTEI--PLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQD 172
VSNT+ P + T Y+ +++G V ++DTGS+L W QC PC CY+Q+
Sbjct: 46 VSNTQAGSPYADTV-FDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKA 104
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGEL 230
P+FDPS S ++K+ CN+ PD C Y + Y D SYT+G L
Sbjct: 105 PIFDPSKSSTFKETRCNT---------------------PDHSCPYKLVYDDKSYTQGTL 143
Query: 231 GREHLGLGKASVNDF-----IFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
E + + S F I GC RNN G SG++GL R LSL+SQ + G
Sbjct: 144 ATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGGAYPG 203
Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
D S T M Y LNL +S+G
Sbjct: 204 ----------DGVVS--------------------TTMFAKTAKRGQYYLNLDAVSVGDT 233
Query: 344 QLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
+++ G G I+IDSGT +T P S Y L + +++ S D
Sbjct: 234 RIETVGTPFHALNGNIVIDSGTPLTYFPVS-YCNLVRKAVERVVTADRVVDPSRNDMLCY 292
Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQ 459
S E+ P++ + F G A++ +D + Y + CLA+ + + I GN
Sbjct: 293 YSNTIEI-FPVITVHFSGGADLVLDKYNM-YMELNRGGVFCLAII-CNNPTQVAIFGNRA 349
Query: 460 QKNQRVIYD 468
Q N V YD
Sbjct: 350 QNNFLVGYD 358
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 129/412 (31%), Positives = 202/412 (49%), Gaps = 43/412 (10%)
Query: 84 WNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG- 142
W+ + N D L +YL + + K VS P+ SG NY+ ++LG
Sbjct: 57 WDNRIINMASKDPLRFKYLSTLVGQ------KTVSTA--PIASGQTFNIGNYVVRVKLGT 108
Query: 143 -GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
G+ + +++DT +D +V C C C D F P S SY + C+ C + +
Sbjct: 109 PGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPKASTSYGPLDCSVPQCGQVRGLS- 164
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
C ++ C++ SY S++ L ++ L L + ++ FGC G
Sbjct: 165 ----CPATGTGACSFNQSYAGSSFS-ATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQ 219
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
GL+GLGR LSL+SQ+ + G+FSYCLPS + SGSL LG I T +
Sbjct: 220 GLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPL 275
Query: 322 IPNPQLATFYILNLTGISIG-------GKQLQASGFAKGGILIDSGTVITRLPPSIYSAL 374
+ +P + Y +N TGIS+G + L + G +IDSGTVITR +Y+A+
Sbjct: 276 LRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAV 335
Query: 375 KAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVYF 431
+ EF KQ G F S F DTCF + Y+ + P + + FEG + ++ ++ + I
Sbjct: 336 REEFRKQVGGTTFTSIGAF---DTCF-VKTYETL-APPITLHFEGLDLKLPLENSLI--- 387
Query: 432 VKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S S CLA+A+ + +I N+QQ+N R+++DT N+++G A E C
Sbjct: 388 HSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVC 439
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 164/362 (45%), Gaps = 44/362 (12%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ +++G + I+DTGS++TW QC PC CY Q P+FDPS S ++K+ C+
Sbjct: 65 YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD--- 121
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-----IF 247
G+S C Y V Y D +YT G L E + L S F I
Sbjct: 122 --------GHS----------CPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETII 163
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
GCG NN SG++GL SL++Q + GL SYC + + G N+
Sbjct: 164 GCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCF----SGQGTSKINFGANA 219
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVI 363
V + + T M FY LNL +S+G +++ G +G I+IDSGT +
Sbjct: 220 IVAGDG--VVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTL 277
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI-PLVKMEFEGNAEMT 422
T P S + ++ + +A C+N ++I P++ M F G ++
Sbjct: 278 TYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN---SDTIDIFPVITMHFSGGVDLV 334
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+D + Y ++ CLA+ S E I GN Q N V YD+ + + F+ +CS
Sbjct: 335 LDKYNM-YMESNNGGVFCLAIICNSPTQE-AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392
Query: 483 SM 484
++
Sbjct: 393 AL 394
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 121/417 (29%), Positives = 197/417 (47%), Gaps = 45/417 (10%)
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
K + W + D +Q+L S + + +P+ S +L Q+ ++
Sbjct: 57 KPLSWADNVLQMQAKDQARLQFLSSLVAR----------RSFVPIASARQLIQSPTFVVR 106
Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
++G T+++ DT +D W+ C C C + VF S S++ + C S C+ +
Sbjct: 107 AKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSDKSSSFRPLPCQSPQCNQV 164
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
+ CS S+ C + ++YG S +L +++L L SV + FGC R G
Sbjct: 165 PNPS-----CSGSA---CGFNLTYG-SSTVAADLVQDNLTLATDSVPSYTFGCIRKATGS 215
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GL+GLGR LSL+ Q+ ++ FSYCLPS + SGSL LG + + I
Sbjct: 216 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIR----I 271
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPS 369
YT ++ NP+ ++ Y +NL I +G K + S A G +IDSGT TRL
Sbjct: 272 KYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAP 331
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
Y+A++ EF ++ + DTC+ + + P + F G M V +
Sbjct: 332 AYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMFAG---MNVTLPPDN 384
Query: 430 YFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + S A S CLA+A+ + +I + QQ+N R+++D NS++G A E CSS
Sbjct: 385 FLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCSS 441
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 126/418 (30%), Positives = 196/418 (46%), Gaps = 47/418 (11%)
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
K + W E D +QYL S + + +P+ SG ++ Q+ YI
Sbjct: 52 KPMSWEESVLKLQAKDQARMQYLSSLVAR----------RSIVPIASGRQITQSPTYIVK 101
Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
++G T+++ DT +D +WV C C C F P+ S ++KKV C +S C +
Sbjct: 102 AKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTP--FAPAKSTTFKKVGCGASQCKQV 159
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
T C S+ C + +YG S L ++ + L V + FGC + G
Sbjct: 160 RNPT-----CDGSA---CAFNFTYGTSSVA-ASLVQDTVTLATDPVPAYAFGCIQKVTGS 210
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GL+GLGR LSL++QT +++ FSYCLPS + SGSL LG + + I
Sbjct: 211 SVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGSLRLGPVAQPKR----I 266
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQL----QASGF---AKGGILIDSGTVITRLPPS 369
+T ++ NP+ ++ Y +NL I +G + + +A F G + DSGTV TRL
Sbjct: 267 KFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTVFTRLVEP 326
Query: 370 IYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
Y+A++ EF ++ + S+ DTC+ + P + F G M V +
Sbjct: 327 AYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYT----APIVAPTITFMFSG---MNVTLPP 379
Query: 428 IVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S A V CLA+A + +I N QQ+N RV++D NS+LG A E C+
Sbjct: 380 DNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELCT 437
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 166/346 (47%), Gaps = 27/346 (7%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +T + DTGSDL W +C + P+ S ++ ++ C+ C AL + +
Sbjct: 111 QKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALR--SYSL 168
Query: 204 GVCSSSSPPDCNYFVSYG---DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGV 260
C++ +C+Y +YG D +T+G LG E LG +V FGC +G +G
Sbjct: 169 ARCAAGGA-ECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGFGCTTALEGDYGEG 227
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+GL+GLGR LSLVSQ + G F YCL T DA + L+ G +++ + T
Sbjct: 228 AGLVGLGRGPLSLVSQ---LDAGTFMYCL--TADASKASPLLFGALATMTGAGAGVQSTG 282
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
++ + TFY +NL I+I G A GG++ DSGT +T L Y+ KA FL
Sbjct: 283 LLAS---TTFYAVNLRSITI-GSATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLS 338
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
Q + G + C+ + IP + + F+G A+M + V Y V+ D VC
Sbjct: 339 QTTSLTPVEGRYGFEACYEKPDSARL-IPAMVLHFDGGADMALPVAN--YVVEVDDGVVC 395
Query: 441 LAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ SLS IIGN Q N V++D + S L F +C S
Sbjct: 396 WVVQRSPSLS------IIGNIMQMNYLVLHDVRKSVLSFQPANCDS 435
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 114/378 (30%), Positives = 176/378 (46%), Gaps = 35/378 (9%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD--PVFDPSI 179
+ S I ++ Y+ + +G M I DTGSDL WV C D VF PS
Sbjct: 89 VESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSR 148
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG- 238
S +Y + C S+ C AL A+ C + S +C Y +YGDGS T G L E
Sbjct: 149 STTYSLLSCQSAACQALSQAS-----CDADS--ECQYQYAYGDGSRTIGVLSTETFSFAA 201
Query: 239 -------KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCL 289
+ V FGC + G F GL+GLG LSLVSQ + FSYCL
Sbjct: 202 AGGGGEGQVRVPRVSFGCSTGSAGSFRS-DGLVGLGAGALSLVSQLGAAARIARRFSYCL 260
Query: 290 -PSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
P A +S +L G + V S P T ++P+ ++ ++Y + L +++ G+ + +
Sbjct: 261 VPPYAAANSSSTLSFGARAVV---SDPGAASTPLVPS-EVDSYYTVALESVAVAGQDVAS 316
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL---SAYQ 404
+ ++ I++DSGT +T L P++ L AE ++ + P +L C+++ S +
Sbjct: 317 ANSSR--IIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAE 374
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
+ IP V + F G A +T+ ++ +CL L +S I+GN Q+N
Sbjct: 375 DFGIPDVTLRFGGGASVTLRPENTFSLLEE--GTLCLVLVPVSESQPVSILGNIAQQNFH 432
Query: 465 VIYDTKNSQLGFAGEDCS 482
V YD + FA DC+
Sbjct: 433 VGYDLDARTVTFAAVDCT 450
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 124/393 (31%), Positives = 189/393 (48%), Gaps = 48/393 (12%)
Query: 123 PLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
PL SG +L T Y+ LG + + + VDT +D WV C C C P F+P+
Sbjct: 81 PLASGRQLLHTPTYLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCPTTA-PSFNPAS 139
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S +++ V C + C + S S +S C + +SYGD S L +++L +
Sbjct: 140 SATFRPVPCGAPPCSQAPNPSCTSLAKSKNS---CGFSLSYGDSSLD-ATLSQDNLAVTA 195
Query: 240 --ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
+ + FGC + G GL+GLGR L V+QT I+ G FSYCLPS + A
Sbjct: 196 NGGVIKGYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAA 255
Query: 298 --SGSLILG--GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFA 351
SGSL LG G + K T T ++ +P + Y + +TG+ IG K + S A
Sbjct: 256 NFSGSLTLGRKGQPAPEKMKT----TPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALA 311
Query: 352 -----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-------------FPSAPGFSI 393
G ++DSGT+ RL Y+A++ E ++ +G S GF
Sbjct: 312 FDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGF-- 369
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
DTC+N+S V P V + F G E+ + +V + S CLA+A+ +
Sbjct: 370 -DTCYNVS---TVAWPAVTLVFGGGMEVRLPEENVV-IRSTYGSTSCLAMAASPADGVNA 424
Query: 454 ---IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+IG+ QQ+N RV++D N+++GFA E C++
Sbjct: 425 ALNVIGSLQQQNHRVLFDVPNARVGFARERCTA 457
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 116/378 (30%), Positives = 175/378 (46%), Gaps = 37/378 (9%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDP----VFDP 177
+ S I ++ Y+ + +G + I DTGSDL WV C D VF P
Sbjct: 92 VESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQP 151
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-- 235
+ S +Y ++ C S+ C AL A+ C + S +C Y SYGDGS T G L E
Sbjct: 152 TRSSTYSQLSCQSNACQALSQAS-----CDADS--ECQYQYSYGDGSRTIGVLSTETFSF 204
Query: 236 ----GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ---TSEIFGGLFSYC 288
G G+ V FGC + G F GL+GLG SLVSQ T+ I L SYC
Sbjct: 205 VDGGGKGQVRVPRVNFGCSTASAGTFRS-DGLVGLGAGAFSLVSQLGATTHIDRKL-SYC 262
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
L + DA +S +L G + V S P T ++P+ + ++Y + L +++GG+++
Sbjct: 263 LIPSYDANSSSTLNFGSRAVV---SEPGAASTPLVPS-DVDSYYTVALESVAVGGQEVAT 318
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
I++DSGT +T L P++ L E ++ P +L C+++ E +
Sbjct: 319 H---DSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETD 375
Query: 408 ---IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
IP V + F G A +T+ ++ +CL L +S I+GN Q+N
Sbjct: 376 NFGIPDVTLRFGGGAAVTLRPENTFSLLQE--GTLCLVLVPVSESQPVSILGNIAQQNFH 433
Query: 465 VIYDTKNSQLGFAGEDCS 482
V YD + FA DC+
Sbjct: 434 VGYDLDARTVTFAAADCA 451
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 125/435 (28%), Positives = 196/435 (45%), Gaps = 51/435 (11%)
Query: 57 HQKSRIEMGAITLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMIS 111
+ K ++ TL++ H + K + W E D +Q+L S +
Sbjct: 19 NPKCDVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVAR--- 75
Query: 112 GNIKDVSNTEIPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCY 168
+ +P+ SG ++ Q+ YI ++G T+++ DT +D W+ C C C
Sbjct: 76 -------KSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCA 128
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
+ +F P S ++K V C + C + N G SS N+ ++YG S
Sbjct: 129 ST---LFAPEKSTTFKNVSCAAPECKQVP----NPGCGVSSR----NFNLTYGSSSIA-A 176
Query: 229 ELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
L ++ + L V + FGC G GL+GLGR LSL+SQT ++ FSYC
Sbjct: 177 NLVQDTITLATDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYC 236
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
LPS + SGSL LG + + I YT ++ NP+ ++ Y +NL I +G K +
Sbjct: 237 LPSFKSLNFSGSLRLGPVAQPKR----IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIP 292
Query: 349 GFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
A G + DSGTV TRL +Y A++ EF ++ + DTC+N+
Sbjct: 293 PAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNV- 351
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNY 458
+ +P + F G M V + + S A S CLA+A + +I N
Sbjct: 352 ---PIVVPTITFIFTG---MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 405
Query: 459 QQKNQRVIYDTKNSQ 473
QQ+N RV+YD NS+
Sbjct: 406 QQQNHRVLYDVPNSR 420
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 168/389 (43%), Gaps = 48/389 (12%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSI 179
+++ + T YIA +G + ++DTGS L W QC C K C Q P F+ S
Sbjct: 75 VSAPVHWATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASS 134
Query: 180 SPSYKKVLCNSSTCHA--LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S S+ V C C L F + C + V+YG G G LG +
Sbjct: 135 SGSFAPVPCQDKACAGNYLHFCALDG---------TCTFRVTYGAGGII-GFLGTDAFTF 184
Query: 238 GKASVNDFIFGCGRNNK----GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PST 292
FGC + + G SGL+GLGR LSL SQT FSYCL P
Sbjct: 185 QSGGAT-LAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTG---AKRFSYCLTPYF 240
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--- 346
+ GAS L +G +S+ + + +P+ +TFY L L GI++G +L
Sbjct: 241 HNNGASSHLFVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPS 300
Query: 347 --------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF---PSAPGFSILD 395
GF +GG++IDSG+ T L Y L E +Q +G P +
Sbjct: 301 TAFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMA 360
Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGII 455
C V +P + + F G A+M + Y+ + S C+A+ + II
Sbjct: 361 LCVARGDLDRV-VPTLVLHFSGGADMALPPEN--YWAPLEKSTACMAIVRGYLQS---II 414
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
GN+QQ+N +++D +L F DCS++
Sbjct: 415 GNFQQQNMHILFDVGGGRLSFQNADCSTI 443
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/289 (34%), Positives = 147/289 (50%), Gaps = 23/289 (7%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
VDTGSDL WV+C PC C P++DP+ S S K+ C+S C AL S C S
Sbjct: 104 VDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQC-SD 162
Query: 210 SPPDCNYFVSYG-DGSY-TRGELGREHLGLGKASV-NDFIFGCGRNNKG-LFGGVSGLMG 265
PP C Y +YG G + T+G LG E G V N+ FG G FGG +GL+G
Sbjct: 163 DPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVG 222
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI--P 323
LGR LSLVSQ + G F+YCL + D +++ G +++ ++ ++ T ++ P
Sbjct: 223 LGRGHLSLVSQ---LGAGRFAYCLAA--DPNVYSTILFGSLAALDTSAGDVSSTPLVTNP 277
Query: 324 NPQLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKA 376
P T Y +NL GIS+GG +L + FA GG+ DSG + T L + Y ++
Sbjct: 278 KPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQ 337
Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVKMEFEGNAEMTVD 424
+ G DTCF + Q V +P + + F+ A+M+++
Sbjct: 338 AITSEIQRLGYDAGD---DTCFVAANQQAVAQMPPLVLHFDDGADMSLN 383
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 136/450 (30%), Positives = 215/450 (47%), Gaps = 40/450 (8%)
Query: 44 WQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQ 103
+ S ++ C S Q ++ I + K + K W+ + N D + YL
Sbjct: 16 FMSMSNATDPCAS-QPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74
Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
S + K VS+ P+ SG NYI +++G G+ + +++DT +D ++
Sbjct: 75 SLVAQ------KTVSSA--PIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFI-- 124
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
P C F P+ S SY + C+ C + + C ++ C++ SY
Sbjct: 125 -PSSGCIGCSATTFSPNASTSYVPLECSVPQCSQVRGLS-----CPATGSGACSFNKSYA 178
Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
+Y+ L ++ L L + + FG G GL+GLGR LSL+SQT ++
Sbjct: 179 GSTYS-ATLVQDSLRLATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLY 237
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G+FSYCLPS + SGSL LG I T ++ NP+ + Y +NLTGI++G
Sbjct: 238 SGVFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPRRPSLYFVNLTGITVG 293
Query: 342 G------KQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
K+L A G G +IDSGTVITR +Y+A++ EF KQ +G S+ G
Sbjct: 294 KVNVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFSSLG--AF 351
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE---DE 451
DTCF + Y+ + P + + F + ++ + + + S S CLA+AS
Sbjct: 352 DTCF-VKNYETL-APAITLHFT-DLDLKLPLENSLIH-SSSGSLACLAMASTPKNVNYTV 407
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+I NYQQ+N RV++DT N+++G A E C
Sbjct: 408 LNVIANYQQQNLRVLFDTVNNKVGIARELC 437
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/393 (29%), Positives = 178/393 (45%), Gaps = 32/393 (8%)
Query: 107 KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
+ ++ + S +P++SG T Y + +G + T++ DTGS+LTWV+C
Sbjct: 63 RQRVAAEVASSSAVSLPMSSGAYAGTGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAGG 122
Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCH-ALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
S VF P S S+ V C+S TC + F+ N CSSS+ P C+Y Y +G
Sbjct: 123 ASPPGL---VFRPEASKSWAPVPCSSDTCKLDVPFSLAN---CSSSASP-CSYDYRYKEG 175
Query: 224 SY-TRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQ 276
S G +G + + GK A + D + GC + G F V G++ LG + +S S+
Sbjct: 176 SAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASR 235
Query: 277 TSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
+ FGG FSYCL A+G L G TP T T + +P + FY + +
Sbjct: 236 AAARFGGSFSYCLVDHLAPRNATGYLAFGPGQV---PRTPATQTKLFLDPAM-PFYGVKV 291
Query: 336 TGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+ + G+ L GG+++DSGT +T L Y A+ A K +G P F
Sbjct: 292 DAVHVAGQALDIPAEVWDPKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKV-DF 350
Query: 392 SILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
+ C+N +A + IP + ++F G A + V VK C+ L +
Sbjct: 351 PPFEHCYNWTAPRPGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPGVK--CIGLQEGEWP 408
Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+IGN Q+ +D KN ++ F C+
Sbjct: 409 G-VSVIGNIMQQEHLWEFDLKNMEVRFMPSTCT 440
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 171/366 (46%), Gaps = 35/366 (9%)
Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ + LG + V +VDTGSDL W QC PC+ CY Q+ P+F+P S +Y + C+S
Sbjct: 49 DYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSE 108
Query: 192 TCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDF 245
C++L S SP C Y +Y D S T+G L RE + V D
Sbjct: 109 ECNSL--------FGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDI 160
Query: 246 IFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLI 302
+FGCG +N G F G++GLG LSLVSQ ++G FS CL P D G++
Sbjct: 161 VFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTIS 220
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---QASGFAKGGILIDS 359
G S V + + T ++ + + T Y++ L GIS+G + + +KG I+IDS
Sbjct: 221 FGDASDV--SGEGVAATPLV-SEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDS 277
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEG 417
GT T LP Y L E Q + P D L E N+ P++ FEG
Sbjct: 278 GTPATYLPQEFYDRLVKELKVQSNMLPID---DDPDLGTQLCYRSETNLEGPILIAHFEG 334
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
V + I F+ C A+A + D I GN+ Q N + +D + F
Sbjct: 335 ---ADVQLMPIQTFIPPKDGVFCFAMAGTT--DGEYIFGNFAQSNVLIGFDLDRKTVSFK 389
Query: 478 GEDCSS 483
DCS+
Sbjct: 390 ATDCSN 395
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/374 (31%), Positives = 175/374 (46%), Gaps = 59/374 (15%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
++A I +G + +++DTGSDLTW+ C PCK CY Q P F PS S +Y+ C S+
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCK-CYPQTIPFFHPSRSSTYRNASCVSAP 136
Query: 193 CHALE--FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDF 245
HA+ F +G +C Y + Y D S TRG L E L G S +
Sbjct: 137 -HAMPQIFRDEKTG--------NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNI 187
Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-TQDAGASGSLILG 304
+FGCG++N G F SG++GLG S+V++ FG FSYC S T LILG
Sbjct: 188 VFGCGQDNSG-FTKYSGVLGLGPGTFSIVTRN---FGSKFSYCFGSLTNPTYPHNILILG 243
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGF----AKGGILID 358
+ + + TP+ Y L+L IS G K L + F ++GG +ID
Sbjct: 244 NGAKIEGDPTPLQI--------FQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVID 295
Query: 359 SGTVITRLPPSIYSALKAEF----------LKQFSGFPSAPGFSILDTCFNLSAYQEVNI 408
+G T L Y L E +K + + + P + + L Y
Sbjct: 296 TGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQY-TTPCY---EGNLKLDLY---GF 348
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
P+V F G AE+ +DV + FV S++ CLA+ +++D + +IG Q+N V Y
Sbjct: 349 PVVTFHFAGGAELALDVESL--FVSSESGDSFCLAMTMNTFDDMS-VIGAMAQQNYNVGY 405
Query: 468 DTKNSQLGFAGEDC 481
+ + ++ F DC
Sbjct: 406 NLRTMKVYFQRTDC 419
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/357 (33%), Positives = 167/357 (46%), Gaps = 69/357 (19%)
Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYT 226
C + P F P+ S ++ K+ C SS C +F T C+++ C Y+ YG G +T
Sbjct: 88 CAARPAPPFQPASSSTFSKLPCASSLC---QFLTSPYLTCNATG---CVYYYPYGMG-FT 140
Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
G L E L +G AS FGC N G+ SG++GLGRS LSLVSQ G FS
Sbjct: 141 AGYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGV---GRFS 196
Query: 287 YCLPSTQDAGAS----GSL--ILGGNSSVFKNSTPITYTNMIPNPQL--ATFYILNLTGI 338
YCL S DAG S GSL + GG SS ++ NP++ +++Y +NLTGI
Sbjct: 197 YCLRSDADAGDSPILFGSLAKVTGGKSS----------PAILENPEMPSSSYYYVNLTGI 246
Query: 339 SIGGKQLQAS----GFAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFS---- 383
++G L + GF +G G ++DSGT +T L Y+ +K FL Q +
Sbjct: 247 TVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANL 306
Query: 384 ---------GFPSAPGFSILDTCFNLSAY---QEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
GF D CF+ +A V +P + + F G AE V V
Sbjct: 307 TTTVNGTRFGF---------DLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGV 357
Query: 432 VKSD----ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V+ D A+ CL + S + IIGN Q + V+YD FA DC+++
Sbjct: 358 VEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 414
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 115/317 (36%), Positives = 162/317 (51%), Gaps = 27/317 (8%)
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFAT-GNSGVCSSSSPPDCNYFVSYGDGSYTRG--E 229
P FD S S + C+S+ C L A+ GN+ + + C Y Y D S T G E
Sbjct: 23 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQT---CVYTYYYNDKSVTTGLIE 79
Query: 230 LGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
+ + G G ASV FGCG N G+F +G+ G GR LSL SQ G FS+C
Sbjct: 80 VDKFTFGAG-ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHC 135
Query: 289 LPSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
+ S +++L + ++KN + T +I N TFY L+L GI++G +L
Sbjct: 136 FTAVNGLKQS-TVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPV 194
Query: 348 --SGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNL 400
S FA GG +IDSGT IT LPP +Y ++ EF Q P PG + TCF+
Sbjct: 195 PESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGPYTCFSA 253
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNY 458
+ + ++P + + FEG A M + V+ V DA S +CLA ++ DET IIGN+
Sbjct: 254 PSQAKPDVPKLVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---INKGDETTIIGNF 309
Query: 459 QQKNQRVIYDTKNSQLG 475
QQ+N V+YD +N G
Sbjct: 310 QQQNMHVLYDLQNMHRG 326
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 176/383 (45%), Gaps = 54/383 (14%)
Query: 115 KDVSNTEIPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ 171
K+ +N +P+ G ++ ++ NYIA LG + + V +D +D WV C C C
Sbjct: 81 KNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-AS 139
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
P F P+ S +Y+ V C S C + + +GV SS C + ++Y ++ + LG
Sbjct: 140 SPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSS-----CGFNLTYAASTF-QAVLG 193
Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGL-GRSDLSLVSQTSEIFGGLFSYCLP 290
++ L L V + FGC R G +G L R+ L LV+
Sbjct: 194 QDSLALENNVVVSYTFGCLRVVNGNSRAAAGAHRLRPRAALLLVA--------------- 238
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
D G G + G K +TP+ Y NP + Y +N+ GI +G K +Q
Sbjct: 239 ---DQGHLGPI---GQPKRIK-TTPLLY-----NPHRPSLYYVNMIGIRVGSKVVQVPQS 286
Query: 351 AKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
A G +ID+GT+ TRL +Y+A++ F + P AP DTC+N++
Sbjct: 287 ALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYNVT-- 343
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGIIGNYQQ 460
V++P V F G +T+ ++ S CLA+A S ++ + QQ
Sbjct: 344 --VSVPTVTFMFAGAVAVTLPEENVMIH-SSSGGVACLAMAAGPSDGVNAALNVLASMQQ 400
Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
+NQRV++D N ++GF+ E C++
Sbjct: 401 QNQRVLFDVANGRVGFSRELCTA 423
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 171/367 (46%), Gaps = 33/367 (8%)
Query: 131 QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
Q +NY+A +G + + ++D +L W QC+ C C+ Q P+FDP+ S +Y+ C
Sbjct: 47 QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPC 106
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
+ C ++ + N CS + C Y S G T G++G + +G A + FG
Sbjct: 107 GTPLCESIPSDSRN---CSGNV---CAYQASTNAGD-TGGKVGTDTFAVGTAKAS-LAFG 158
Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
C ++ GG SG++GLGR+ SLV+QT FSYCL + DAG + +L LG ++
Sbjct: 159 CVVASDIDTMGGPSGIVGLGRTPWSLVTQTGV---AAFSYCL-APHDAGRNSALFLGSSA 214
Query: 308 SVF----KNSTPITYTNMIPN-PQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSG 360
+ STP + N+ N L+ +Y + L G+ G L SG +L+D+
Sbjct: 215 KLAGGGKAASTP--FVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSG---STVLLDTF 269
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
+ I+ L Y A+K P A D CF S LV F G A
Sbjct: 270 SPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLV-FTFRGGAA 328
Query: 421 MTVDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
MTV T Y + VCLA+ A L+ E ++G+ QQ+N ++D L F
Sbjct: 329 MTVPATN--YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFE 386
Query: 478 GEDCSSM 484
DC+ +
Sbjct: 387 PADCTKL 393
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 113/365 (30%), Positives = 175/365 (47%), Gaps = 40/365 (10%)
Query: 147 TVIVDTGSDLTWVQC-------QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
T+IVDTGSDL W QC + S Q++P+++P S S+ + C+ C +F+
Sbjct: 98 TLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDRLCQEGQFS 157
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVN-DFIFGCGRNNKGLF 257
N C+ ++ C Y YG G L E G A V+ FGCG + G
Sbjct: 158 YKN---CARNN--RCMYDELYGSAE-AGGVLASETFTFGVNAKVSLPLGFGCGALSAGDL 211
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV--FKNSTP 315
G SGLMGL +SLVSQ S FSYCL + S L+ G + + ++ +
Sbjct: 212 VGASGLMGLSPGIMSLVSQLSV---PRFSYCLTPFAERKTS-PLLFGAMADLRRYRTTGT 267
Query: 316 ITYTNMIPNPQLAT-FYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRL 366
+ T+++ NP + T +Y + L G+S+G K+L G K GG ++DSG+ ++ L
Sbjct: 268 VQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYL 327
Query: 367 PPSIYSALKAEFLKQFSGFPSAPG----FSILDTCFNLS---AYQEVNIPLVKMEFEGNA 419
+ + A+K ++ P A G + + CF L A + V P + + F+G A
Sbjct: 328 EETAFRAVKKAVVEAVR-LPVANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGGA 386
Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
MT+ YF + A +CLA+ + IIGN QQ+N V++D +N + FA
Sbjct: 387 AMTLPRDN--YFQEPRAGLMCLAVGTSPDGFGVSIIGNVQQQNMHVLFDVRNQKFSFAPT 444
Query: 480 DCSSM 484
C +
Sbjct: 445 KCDDI 449
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 173/369 (46%), Gaps = 40/369 (10%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+A +G + ++ +VD +L W QC PC+ C+ Q P+FDP+ S +++ + C S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 193 CHALEFATGN--SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
C ++ ++ N S VC +P GD T G+ G + +G A FGC
Sbjct: 117 CESIPESSRNCTSDVCIYEAP------TKAGD---TGGKAGTDTFAIGAAK-ETLGFGCV 166
Query: 250 GRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA----GASGSLIL 303
+K L GG SG++GLGR+ SLV+Q + FSYCL GA+ +
Sbjct: 167 VMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLA 223
Query: 304 GG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
GG +S+ F T ++ NP +Y++ L GI GG LQA+ + +L+D+ +
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNP----YYMVKLAGIKTGGAPLQAASSSGSTVLLDTVS 279
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
+ L Y ALK P A D CF + + P + F+G A +
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA--PELVFTFDGGAAL 337
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVIYDTKNSQLG 475
TV Y + S VCL + S + + TG I+G+ QQ+N V++D K L
Sbjct: 338 TVPPAN--YLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLS 395
Query: 476 FAGEDCSSM 484
F DCSS+
Sbjct: 396 FKPADCSSL 404
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 173/369 (46%), Gaps = 40/369 (10%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+A +G + ++ +VD +L W QC PC+ C+ Q P+FDP+ S +++ + C S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 193 CHALEFATGN--SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
C ++ ++ N S VC +P GD T G G + +G A FGC
Sbjct: 117 CESIPESSRNCTSDVCIYEAP------TKAGD---TGGMAGTDTFAIGAAK-ETLGFGCV 166
Query: 250 GRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA----GASGSLIL 303
+K L GG SG++GLGR+ SLV+Q + FSYCL GA+ +
Sbjct: 167 VMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLA 223
Query: 304 GG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
GG +S+ F T ++ NP +Y++ L GI GG LQA+ + +L+D+ +
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNP----YYMVKLAGIKAGGAPLQAASSSGSTVLLDTVS 279
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
+ L Y ALK P A D CF+ + + P + F+G A +
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDA--PELVFTFDGGAAL 337
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVIYDTKNSQLG 475
TV Y + S VCL + S + + TG I+G+ QQ+N V++D K L
Sbjct: 338 TVPPAN--YLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLS 395
Query: 476 FAGEDCSSM 484
F DCSS+
Sbjct: 396 FKPADCSSL 404
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 149/292 (51%), Gaps = 26/292 (8%)
Query: 214 CNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLM 264
C Y+ YGD S T G+ E + GK V + +FGCG N+GLF G +GL+
Sbjct: 74 CPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLL 133
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
GLGR LS SQ ++G FSYCL DA S LI G + + + + +T ++
Sbjct: 134 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVA 192
Query: 324 ---NPQLATFYILNLTGISIGG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSA 373
NP + TFY + + I +GG ++ Q + GG +IDSGT ++ Y
Sbjct: 193 GKENP-VDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQV 251
Query: 374 LKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
+K F+ + G+P F +L+ C+N++ ++ ++P + F A V YF++
Sbjct: 252 IKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVEN--YFIE 309
Query: 434 SDASQ-VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ + VCLA+ IIGNYQQ+N ++YDTK S+LGFA C+ +
Sbjct: 310 IEPREVVCLAILGTP-PSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCADV 360
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 117/408 (28%), Positives = 189/408 (46%), Gaps = 40/408 (9%)
Query: 101 YLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTW 158
+L +++ ++S VS ++ L+ L + T+ +G + +IVDTGSDL W
Sbjct: 60 WLTAKLAGVLSNRRGGVSPADVRLSP---LSDQGHSLTVGIGTPPQPRKLIVDTGSDLIW 116
Query: 159 VQCQPCKS----CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
QC+ S + PV+DP S ++ + C+ C +F+ N C+S + C
Sbjct: 117 TQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKN---CTSKN--RC 171
Query: 215 NYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
Y YG + G L E G +A FGCG + G G +G++GL LS
Sbjct: 172 VYEDVYGSAAAV-GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLS 230
Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMIPNPQLATF 330
L++Q FSYCL D S L+ G + + ++ T PI T ++ NP +
Sbjct: 231 LITQLKI---QRFSYCLTPFADKKTS-PLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVY 286
Query: 331 YILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
Y + L GIS+G K+L A+ A GG ++DSG+ + L + + A+K E +
Sbjct: 287 YYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVK-EAVMDVV 345
Query: 384 GFPSA-PGFSILDTCFNL------SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
P A + CF L +A + V +P + + F+G A M + YF + A
Sbjct: 346 RLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDN--YFQEPRA 403
Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+CLA+ + IIGN QQ+N V++D ++ + FA C +
Sbjct: 404 GLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 451
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 118/368 (32%), Positives = 177/368 (48%), Gaps = 46/368 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+N+T+++DTGS+L+W+ C+ + +N +F+P S +Y K+ C+S TC E T +
Sbjct: 78 QNITMVLDTGSELSWLHCKK-EPNFNS---IFNPLASKTYTKIPCSSPTC---ETRTRDL 130
Query: 204 GVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFG 258
+ S P C++ +SY D S G L E +G + +FGC +N
Sbjct: 131 PLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDA 190
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
+GLMG+ R LS V+Q FSYC+ D +SG L+LG S F P+ Y
Sbjct: 191 KTTGLMGMNRGSLSFVNQMG---FRKFSYCI---SDRDSSGVLLLGEAS--FSWLKPLNY 242
Query: 319 TNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFAK-----GGILIDSGTVITRL 366
T ++ P P Y + L GI + K L S F G ++DSGT T L
Sbjct: 243 TPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFL 302
Query: 367 PPSIYSALKAEFLKQFSGF------PSAPGFSILDTCFNLSAYQEV--NIPLVKMEFEGN 418
+YSALK EFL Q G P +D C+ + + N+P+V + F G
Sbjct: 303 LGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFRG- 361
Query: 419 AEMTVDVTGIVYFVKSDA----SQVCLALA-SLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
AEM+V ++Y V + S C S S E+ +IG++QQ+N + YD + S+
Sbjct: 362 AEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWMEYDLEKSR 421
Query: 474 LGFAGEDC 481
+GFA C
Sbjct: 422 IGFAEVRC 429
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 127/476 (26%), Positives = 200/476 (42%), Gaps = 63/476 (13%)
Query: 42 LQWQQ--KSGSSSSCVSHQKSRIEMGAITLELKHKNY----CSGKIVDWNEQQQNRLILD 95
+QW K+ + + + ++ LEL H+++ G VD E + + D
Sbjct: 6 MQWNTITKASILVTITLLLILPVAVNSMRLELVHRHHERFAGGGGDVDRVEAVKGFVKRD 65
Query: 96 NLHVQYLQSR---IKNMISGN-----IKDVSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
L Q + R + N S + E+P+ SG Y A +++G G+
Sbjct: 66 KLRRQRMNQRWGVVSNYDSRRKGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKVGSPGQR 125
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
++VDTGS+ TW+ C S S++ V C S C + V
Sbjct: 126 FWLVVDTGSEFTWLNC------------------SKSFEAVTCASRKCKVDLSELFSLSV 167
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGV 260
C S P C Y +SY DGS +G G + + +G + +N+ GC K + GV
Sbjct: 168 CPKPSDP-CLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGC---TKSMLNGV 223
Query: 261 S------GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNS 313
+ G++GLG + S + + + +G FSYCL S +L +GG+ + K
Sbjct: 224 NFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIGGHHNA-KLL 282
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGF-AKGGILIDSGTVITRLPP 368
I T +I P FY +N+ GISIGG+ L Q F A+GG LIDSGT +T L
Sbjct: 283 GEIRRTELILFPP---FYGVNVVGISIGGQMLKIPPQVWDFNAEGGTLIDSGTTLTSLLL 339
Query: 369 SIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
Y A+ K + G F L+ CF+ + + +P + F G A V
Sbjct: 340 PAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVK 399
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
Y + C+ + + +IGN Q+N +D + +GFA C+
Sbjct: 400 S--YIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 453
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 108/398 (27%), Positives = 175/398 (43%), Gaps = 37/398 (9%)
Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCY 168
+ + + S +PLTSG T Y +G + ++ DTGSDLTWV+C+ ++
Sbjct: 86 TAPMPEASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASS 145
Query: 169 NQQDP-----VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSS--SSPPDCNYFVSY 220
P VF P+ S S+ + C+S TC + + F+ N CS+ + P C Y Y
Sbjct: 146 PDASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLAN---CSAGTTPPAPCGYDYRY 202
Query: 221 GDGSYTRGELGREHLGLG--------KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDL 271
D S RG +G + + KA + + + GC + G F G++ LG S++
Sbjct: 203 KDKSSARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNI 262
Query: 272 SLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
S S+ + FGG FSYCL A+ L G + S T ++ + Q+A F
Sbjct: 263 SFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSR----TPLLLDAQVAPF 318
Query: 331 YILNLTGISIGGKQLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
Y + + +S+ GK L GG ++DSGT +T L Y A+ A KQ +
Sbjct: 319 YAVTVDAVSVAGKALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARV 378
Query: 386 PSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
P + C+N +A + +P +++ F G+A + Y + + C+ L
Sbjct: 379 PRV-TMDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKS--YVIDAAPGVKCIGLQ 435
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ +IGN Q+ +D N L F C+
Sbjct: 436 EGVWPG-VSVIGNILQQEHLWEFDLANRWLRFQESRCA 472
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 176/390 (45%), Gaps = 42/390 (10%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ-----DPV 174
+PLTSG T Y + +G + ++ DTGSDLTWV+C S + V
Sbjct: 91 MPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRV 150
Query: 175 FDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGELGR 232
F P+ S S+ + C+S TC + + F+ N SSPPD C+Y Y D S RG +G
Sbjct: 151 FRPAGSKSWSPLPCDSDTCKSYVPFSLANC-----SSPPDPCSYDYRYKDNSSARGVVGL 205
Query: 233 EHL--------GLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGG 283
+ G KA + + + GC + G F G++ LG S++S S+ + FGG
Sbjct: 206 DSATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGG 265
Query: 284 LFSYCLPSTQDAGASGSLILGGN------SSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
FSYCL + S + GN TP+ ++ + + FY +++
Sbjct: 266 RFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLV---LLEDARTRPFYFVSVDA 322
Query: 338 ISIGGKQLQ----ASGFAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
+++ G++L+ F K GG ++DSGT +T L Y A+ KQF+G P
Sbjct: 323 VTVAGERLEILPDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRV-NMD 381
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
+ C+N + IP +++ F G A T+ G Y + + C+ + ++
Sbjct: 382 PFEYCYNWTGVS-AEIPRMELRFAGAA--TLAPPGKSYVIDTAPGVKCIGVVEGAWPG-V 437
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+IGN Q+ +D N L F C+
Sbjct: 438 SVIGNILQQEHLWEFDLANRWLRFKQSRCA 467
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/302 (34%), Positives = 149/302 (49%), Gaps = 22/302 (7%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ T Y+ + +G + + + +DTGSDL W QCQPC +C++Q P FDPS S +
Sbjct: 77 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
C+S+ C L A+ S + C Y SYGD S T G L + ASV
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 194
Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGCG N G+F +G+ G GR LSL SQ G FS+C + S +++L
Sbjct: 195 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPS-TVLLD 250
Query: 305 GNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILI 357
+ ++K+ + T +I NP TFY L+L GI++G +L S FA GG +I
Sbjct: 251 LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTII 310
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEF 415
DSGT +T LP +Y ++ F Q P G + D F LSA +P + + F
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHF 368
Query: 416 EG 417
EG
Sbjct: 369 EG 370
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 159/350 (45%), Gaps = 52/350 (14%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+DTGSDL W QC PC +CY+Q P+FDPS S ++K+ CN ++CH
Sbjct: 78 IDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNSCH--------------- 122
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLFGGVSGLM 264
Y + Y D +Y++G L E + + S F+ GCG N+ SG++
Sbjct: 123 ------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMV 176
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPS--TQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
GL SL++Q + GL SYC S T + I+ G+ V ST + T
Sbjct: 177 GLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVV---STTMFLTTAK 233
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
P Y LNL +S+G ++ G +G I+IDSGT +T P S Y L E
Sbjct: 234 PG-----LYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-YCNLVREA 287
Query: 379 LKQFSGFPSAPGFSILDTCFN--LSAYQE-VNI-PLVKMEFEGNAEMTVDVTGIVYFVKS 434
+ + D N L Y + ++I P++ M F G A++ +D + Y
Sbjct: 288 VDHY-----VTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNM-YIETI 341
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CLA+ + + I GN Q N V YD+ + + F+ +CS++
Sbjct: 342 TRGTFCLAIIC-NNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCSAL 390
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 174/383 (45%), Gaps = 25/383 (6%)
Query: 113 NIKDVSNTEIP--LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC-QPCKSC 167
+ D + T P +T + Y+ + +G + ++ I+D G +L W QC Q C+ C
Sbjct: 27 ELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRC 86
Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
+ Q P+FD + S +++ C ++ C ++ + + Y S G T
Sbjct: 87 FKQDLPLFDTNASSTFRPEPCGAAVCESIPTRSCAGDGGGACG-----YEASTSFG-RTV 140
Query: 228 GELGREHLGLGKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
G +G + + +G A+ FGC + G SG +GLGR++LSL +Q + FS
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFS 197
Query: 287 YCLPSTQDAGASGSLILGGNSSVF-----KNSTPITYTNMIPNPQLATFYILNLTGISIG 341
YCL + D G S +L LG ++ + +TP T+ P+ L+ Y+L L I G
Sbjct: 198 YCL-APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAG 256
Query: 342 GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
+ A + I++ + T +T L S+Y L+ P P D CF
Sbjct: 257 NATI-AMPQSGNTIMVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFP-K 314
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
A P + + F+G AEMTV V+ ++ +D + C+A+ I+G+ QQ
Sbjct: 315 ASASGGAPDLVLAFQGGAEMTVPVSSYLFDAGNDTA--CVAILGSPALGGVSILGSLQQV 372
Query: 462 NQRVIYDTKNSQLGFAGEDCSSM 484
N +++D L F DCS++
Sbjct: 373 NIHLLFDLDKETLSFEPADCSAL 395
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 174/376 (46%), Gaps = 57/376 (15%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS- 191
++ I +G +T ++ DT SDL W+QC+PC +CY Q P+FDPS S +++ C +S
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144
Query: 192 -TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KASVN 243
+ +L F ++ C Y + Y DG+ ++G L +E L A+++
Sbjct: 145 YSMPSLRF---------NAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALH 195
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
D +FGCG +N G +G++GLG + SLV + FG FSYC S D ++++
Sbjct: 196 DVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGTKFSYCFGSLDDPSYPHNVLV 251
Query: 304 GGN--SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA--------KG 353
G+ +++ ++TP+ N FY + + IS+ G L + G
Sbjct: 252 LGDDGANILGDTTPLEIYN--------GFYYVTIEAISVDGIILPIDPWVFNRNHQTGLG 303
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ--------E 405
G +ID+G +T L Y LK + F G +A + D F + Y E
Sbjct: 304 GTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVN-QDDMFKVECYNGNLERDLVE 362
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
P+V F AE+++DV + F+K + CLA+ IG Q++ +
Sbjct: 363 SGFPIVTFHFSDGAELSLDVKSV--FMKLSPNVFCLAVTP----GNMNSIGATAQQSYNI 416
Query: 466 IYDTKNSQLGFAGEDC 481
YD + ++ F DC
Sbjct: 417 GYDLEAKKISFERIDC 432
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 171/367 (46%), Gaps = 33/367 (8%)
Query: 131 QTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
Q +NY+A +G + + ++D +L W QC+ C C+ Q P+FDP+ S +Y+ C
Sbjct: 47 QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPC 106
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
+ C ++ + N CS + C Y S G T G++G + +G A + FG
Sbjct: 107 GTPLCESIPSDSRN---CSGNV---CAYQASTNAGD-TGGKVGTDTFAVGTAKAS-LAFG 158
Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
C ++ GG SG++GLGR+ SLV+QT FSYCL + DAG + +L LG ++
Sbjct: 159 CVVASDIDTMGGPSGIVGLGRTPWSLVTQTGV---AAFSYCL-APHDAGKNSALFLGSSA 214
Query: 308 SVF----KNSTPITYTNMIPNP-QLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSG 360
+ STP + N+ N L+ +Y + L G+ G L SG +L+D+
Sbjct: 215 KLAGGGKAASTP--FVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSG---STVLLDTF 269
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
+ I+ L Y A+K P A D CF S LV F G A
Sbjct: 270 SPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLV-FTFRGGAA 328
Query: 421 MTVDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
MTV + Y + VCLA+ A L+ E ++G+ QQ+N ++D L F
Sbjct: 329 MTVAASN--YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFE 386
Query: 478 GEDCSSM 484
DC+ +
Sbjct: 387 PADCTKL 393
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 114/375 (30%), Positives = 185/375 (49%), Gaps = 35/375 (9%)
Query: 122 IPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
+P+ S +L Q+ ++ ++G T+++ DT +D W+ C C C + VF
Sbjct: 12 VPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSD 69
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S S++ + C S C+ + + CS S+ C + ++YG S +L +++L L
Sbjct: 70 KSSSFRPLPCQSPQCNQVPNPS-----CSGSA---CGFNLTYG-SSTVAADLVQDNLTLA 120
Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
SV + FGC R G GL+GLGR LSL+ Q+ ++ FSYCLPS + S
Sbjct: 121 TDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFS 180
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA----- 351
GSL LG + + I YT ++ NP+ ++ Y +NL I +G K + S A
Sbjct: 181 GSLRLGPVAQPIR----IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSAT 236
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
G +IDSGT TRL Y+A++ EF ++ + DTC+ + + P +
Sbjct: 237 GAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTI 292
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYD 468
F G M V + + + S + S CLA+A+ + +I + QQ+N R+++D
Sbjct: 293 TFMFAG---MNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 349
Query: 469 TKNSQLGFAGEDCSS 483
NS++G A E CSS
Sbjct: 350 IPNSRVGVARESCSS 364
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 173/383 (45%), Gaps = 25/383 (6%)
Query: 113 NIKDVSNTEIP--LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC-QPCKSC 167
+ D + T P +T + Y+ + +G + ++ I+D G +L W QC Q C+ C
Sbjct: 27 ELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRC 86
Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
+ Q P+FD + S +++ C ++ C ++ + + Y S G T
Sbjct: 87 FKQDLPLFDTNASSTFRPEPCGAAVCESIPTRSCAGDGGGACG-----YEASTSFG-RTV 140
Query: 228 GELGREHLGLGKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
G +G + + +G A+ FGC + G SG +GLGR++LSL +Q + FS
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFS 197
Query: 287 YCLPSTQDAGASGSLILGGNSSVF-----KNSTPITYTNMIPNPQLATFYILNLTGISIG 341
YCL + D G S +L LG ++ + +TP T+ PN L+ Y+L L I G
Sbjct: 198 YCL-APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAG 256
Query: 342 GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
+ A + I + + T +T L S+Y L+ P P D CF
Sbjct: 257 NATI-AMPQSGNTITVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFP-K 314
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
A P + + F+G AEMTV V+ ++ +D + C+A+ I+G+ QQ
Sbjct: 315 ASASGGAPDLVLAFQGGAEMTVPVSSYLFDAGNDTA--CVAILGSPALGGVSILGSLQQV 372
Query: 462 NQRVIYDTKNSQLGFAGEDCSSM 484
N +++D L F DCS++
Sbjct: 373 NIHLLFDLDKETLSFEPADCSAL 395
>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
Length = 183
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 96/156 (61%), Gaps = 4/156 (2%)
Query: 329 TFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
+FY LN+ I++GG++L ++ F+ G LIDSGTVITRLPP Y+AL++ F + S +P
Sbjct: 30 SFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYP 89
Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
+ G SILDTCF+LS ++ V IP V F G A + + GI Y K SQVCLA A
Sbjct: 90 TTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFAGN 147
Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S + I GN QQ+ V+YD ++GFA CS
Sbjct: 148 SDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 183
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 170/367 (46%), Gaps = 33/367 (8%)
Query: 131 QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
Q +NY+A +G + + ++D +L W QC+ C C+ Q P+FDP+ S +Y+ C
Sbjct: 47 QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPC 106
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
+ C ++ N CS + C Y S G T G++G + +G A + FG
Sbjct: 107 GTPLCESIPSDVRN---CSGNV---CAYEASTNAGD-TGGKVGTDTFAVGTAKAS-LAFG 158
Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
C ++ GG SG++GLGR+ SLV+QT FSYCL + DAG + +L LG ++
Sbjct: 159 CVVASDIDTMGGPSGIVGLGRTPWSLVTQTGV---AAFSYCL-APHDAGKNSALFLGSSA 214
Query: 308 SVF----KNSTPITYTNMIPNP-QLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSG 360
+ STP + N+ N L+ +Y + L G+ G L SG +L+D+
Sbjct: 215 KLAGGGKAASTP--FVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSG---STVLLDTF 269
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
+ I+ L Y A+K P A D CF S LV F G A
Sbjct: 270 SPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLV-FTFRGGAA 328
Query: 421 MTVDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
MTV T Y + VCLA+ A L+ E ++G+ QQ+N ++D L F
Sbjct: 329 MTVPATN--YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFE 386
Query: 478 GEDCSSM 484
DC+ +
Sbjct: 387 PADCTKL 393
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 168/359 (46%), Gaps = 35/359 (9%)
Query: 148 VIVDTGSDLTWVQCQ----PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+IVDTGSDL W QC+ + + PV+DP S ++ + C+ C +F+ N
Sbjct: 28 LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKN- 86
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGLFGGVS 261
C+S + C Y YG + G L E G +A FGCG + G G +
Sbjct: 87 --CTSKN--RCVYEDVYGSAAAV-GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGAT 141
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYT 319
G++GL LSL++Q FSYCL D S L+ G + + ++ T PI T
Sbjct: 142 GILGLSPESLSLITQLKI---QRFSYCLTPFADKKTS-PLLFGAMADLSRHKTTRPIQTT 197
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTVITRLPPSIYS 372
++ NP +Y + L GIS+G K+L A+ A GG ++DSG+ + L + +
Sbjct: 198 AIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFE 257
Query: 373 ALKAEFLKQFSGFPSA-PGFSILDTCFNL------SAYQEVNIPLVKMEFEGNAEMTVDV 425
A+K E + P A + CF L +A + V +P + + F+G A M +
Sbjct: 258 AVK-EAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPR 316
Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
YF + A +CLA+ + IIGN QQ+N V++D ++ + FA C +
Sbjct: 317 DN--YFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 373
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 163/362 (45%), Gaps = 51/362 (14%)
Query: 150 VDTGSDLTWVQCQPCKS----CYNQQDPVFDPSISPSYKKVLCNS-STCHALEFATGNSG 204
+DTG++L+W+QC+ C++ C+ +DP + S S SYK V CN S C + G
Sbjct: 105 IDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCNQHSFCEPNQCKEG--- 161
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGLF-- 257
C Y V+YG GSYT G L E GK ++ FGC +++ +
Sbjct: 162 --------LCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYA 213
Query: 258 -----GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
VSG++G+G S ++Q I G FSYC+ A + + L V K+
Sbjct: 214 FLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCI----TANNTHNTYLRFGKHVVKS 269
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITR 365
T M P A Y +NL GIS+ G +L + G +ID+GT+ T
Sbjct: 270 KNLQTTKIMQVKPSAA--YHVNLLGISVNGVKLNITKTDLAVRKDGSRGCIIDAGTLATL 327
Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSIL----DTCFN-LSAYQEVNIPLVKMEFEGNAE 420
L I+ L S + + I D C+ LS N+P+V E NA+
Sbjct: 328 LVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLE-NAD 386
Query: 421 MTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
+ V I F + + V CL++ S +D IIG YQQ Q+ +YDTK L F E
Sbjct: 387 LEVKPEAIFLFREFEGKNVFCLSMLS---DDSKTIIGAYQQMKQKFVYDTKARVLSFGPE 443
Query: 480 DC 481
DC
Sbjct: 444 DC 445
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/251 (37%), Positives = 134/251 (53%), Gaps = 17/251 (6%)
Query: 91 RLILDNLHVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
RL D+L V+ + S +N + + SG+ + Y + +G
Sbjct: 86 RLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPA 145
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
N+ +++DTGSD+ W+QC PCK+CYNQ D +FDP S ++ V C S C L+ +S
Sbjct: 146 TNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLD----DS 201
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
C + C Y VSYGDGS+T G+ E L A V+ GCG +N+GLF G +GL
Sbjct: 202 SECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGL 261
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+GLGR LS SQT + G FSYCL S+ + S I+ GN++V K S +T
Sbjct: 262 LGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS---VFTP 318
Query: 321 MIPNPQLATFY 331
++ NP+L TFY
Sbjct: 319 LLTNPKLDTFY 329
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 161/354 (45%), Gaps = 33/354 (9%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
VI+D GSDL W QC Q +PVFD + S S+ + C+S C A F + C+
Sbjct: 122 VILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGTF---TNKTCT 178
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGK---ASVNDFIFGCGRNNKGLFGGVSGLM 264
C Y YG + T G L E G S N FGCG+ G SG++
Sbjct: 179 DRK---CAYENDYGIMTAT-GVLATETFTFGAHHGVSAN-LTFGCGKLANGTIAEASGIL 233
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV--FKNSTPITYTNMI 322
GL LS++ Q + FSYCL D S ++ G + + +K + + ++
Sbjct: 234 GLSPGPLSMLKQLAIT---KFSYCLTPFADRKTS-PVMFGAMADLGKYKTTGKVQTIPLL 289
Query: 323 PNPQLATFYILNLTGISIGGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALK 375
NP +Y + + G+S+G K+L GG ++DS T + L ++ LK
Sbjct: 290 KNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELK 349
Query: 376 AEFLKQFSGFPSAPGFSILD--TCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
++ P A S+ D CF L + + V +P + + F+G+AEM++ Y
Sbjct: 350 KAVMEGIK-LPVA-NRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDN--Y 405
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
F + +CLA+ +E +IGN QQ+N V+YD N + +A C S+
Sbjct: 406 FQEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKCDSI 459
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 121/455 (26%), Positives = 196/455 (43%), Gaps = 64/455 (14%)
Query: 69 LELKHK-NYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
L + H+ N CS + + + + + + L+S + SG+ + + G
Sbjct: 68 LPVLHRLNPCSPLNAGGKQSTTSSVDVSHRAGRRLRSLFAAVQSGDDAAPAPAPAAASGG 127
Query: 128 IRLQTLNYIATIELGGRNMTVIV-------------DTGSDLTWVQCQPCKS---CYNQQ 171
+ + T G + TV+V DTG ++ V+C C+ C
Sbjct: 128 VTIPTTGTPEPGAPGFHDYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLA 187
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
FDPS S ++ V C S C SG CSS S P C S+ + G +
Sbjct: 188 S--FDPSRSSTFAPVPCGSPDC--------RSG-CSSGSTPSCP-LTSF---PFLSGAVA 232
Query: 232 REHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
++ L L ASV+DF FGC + G G +GL+ L R S+ S+ + GG FSYCLP
Sbjct: 233 QDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLP 292
Query: 291 STQDAGASGSLILGGNSSVFKNST-------PITYTNMIPNPQLATFYILNLTGISIGGK 343
+ +S + G + V N T P+ Y PN Y+++L G+S+GG+
Sbjct: 293 LSTT--SSHGFLAIGEADVPHNRTARVTAVAPLVYDPAFPN-----HYVIDLAGVSLGGR 345
Query: 344 QL---QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
+ + A +++D+ T + PS+Y+ L+ F + + +P AP LDTC+N
Sbjct: 346 DIPIPPHAATASAAMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYNF 405
Query: 401 SAYQ-EVNIPLVKMEFEGNAEMTVDVTGIV----YFVKSDA----SQVCLALASLSYEDE 451
+ + EV IPLV + F G + F S+ S CLA A+L + +
Sbjct: 406 TGVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGD 465
Query: 452 TG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
++G Q + V++D ++GF C
Sbjct: 466 AEAPLAMVMGTLAQSSMEVVHDVPGGKIGFIPGSC 500
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 184/405 (45%), Gaps = 64/405 (15%)
Query: 134 NYIATIELGGRN---MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPSYKKVLC 188
+Y + LG +T+ +DTGSDL W C P C C + +I+ V C
Sbjct: 74 DYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSC 133
Query: 189 NSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHL 235
S C A + +S +C+ S P DC+ ++ +YGDGS+ L ++ L
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFV-ANLYQQTL 192
Query: 236 GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI---FGGLFSYCLPST 292
L + +F FGC +G+ G GR LSL +Q S + G FSYCL S
Sbjct: 193 SLSSLHLQNFTFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSH 249
Query: 293 QDAG----ASGSLILGGNSSVFK-----NSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
G LILG ++ S YT+M+ NP+ +Y + L GIS+G +
Sbjct: 250 SFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKR 309
Query: 344 QLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF----S 392
+ A K GG+++DSGT T LP S Y+A+ EF K+ + F +
Sbjct: 310 TVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKT 369
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF--------VKSDASQVCLALA 444
L C+ L+ + IP++K+ F GN V ++ ++ C+ L
Sbjct: 370 GLGPCYYLNGLSQ--IPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMML- 426
Query: 445 SLSYEDETGI-------IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++ EDET + +GNYQQ+ V+YD + ++GFA ++C+
Sbjct: 427 -MNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKECA 470
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 160/355 (45%), Gaps = 67/355 (18%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
+V+ DTGS L W QC PC C + P F P+ S ++ K+ C SS C +F T
Sbjct: 102 TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLC---QFLTSPYR 158
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
C+++ C Y+ YG G +T G L E L +G AS FGC N G+ SG++
Sbjct: 159 TCNATG---CVYYYPYGMG-FTAGYLATETLHVGGASFPGVTFGCSTEN-GVGNSSSGIV 213
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSL--ILGGNSSVFKNSTPITY 318
GLGRS LSLVSQ FSYCL S DAG S GSL + GGN STP
Sbjct: 214 GLGRSPLSLVSQVGV---ARFSYCLRSNADAGDSPILFGSLAKVTGGN----VQSTP--- 263
Query: 319 TNMIPNPQL--ATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKA 376
++ NP++ +++Y +NLTGI++G L A + +GT
Sbjct: 264 --LLENPEMPSSSYYYVNLTGITVGATDLP---MAMANLTTVNGTRF------------- 305
Query: 377 EFLKQFSGFPSAPGFSILDTCFN---LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
GF D CF+ V +P + + F G AE V V+
Sbjct: 306 -------GF---------DLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVE 349
Query: 434 SD----ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
D A+ CL + S + IIGN Q + V+YD FA DC+++
Sbjct: 350 VDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 404
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 159/350 (45%), Gaps = 52/350 (14%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+DTGSDL W QC PC +CY+Q P+FDPS S ++K+ CN ++CH
Sbjct: 78 IDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNSCH--------------- 122
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLFGGVSGLM 264
Y + Y D +Y++G L E + + S F+ GCG N+ SG++
Sbjct: 123 ------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMV 176
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPS--TQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
GL SL++Q + GL SYC S T + I+ G+ V ST + T
Sbjct: 177 GLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVV---STTMFLTTAK 233
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
P Y LNL +S+G ++ G +G I+IDSGT +T P S Y L E
Sbjct: 234 PG-----LYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-YCNLVREA 287
Query: 379 LKQFSGFPSAPGFSILDTCFN--LSAYQE-VNI-PLVKMEFEGNAEMTVDVTGIVYFVKS 434
+ + D N L Y + ++I P++ M F G A++ +D + Y
Sbjct: 288 VDHY-----VTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNM-YIETI 341
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CLA+ + + I GN Q N V YD+ + + F+ +CS++
Sbjct: 342 TRGTFCLAIIC-NNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCSAL 390
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 118/387 (30%), Positives = 182/387 (47%), Gaps = 60/387 (15%)
Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP----VFDPSISPSYKKVL 187
TL TI +N+T+++DTGS+L+W++C +++P +F+P S +Y K+
Sbjct: 66 TLTASLTIGTPPQNITMVLDTGSELSWLRC--------KKEPNFTSIFNPLASKTYTKIP 117
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
C+S TC T + + + P C++ +SY D S G L E G + +
Sbjct: 118 CSSQTCKT---RTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATV 174
Query: 247 FGC----GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
FGC +N +GLMG+ R LS V+Q FSYC+ ++G L+
Sbjct: 175 FGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMG---FRKFSYCI---SGLDSTGFLL 228
Query: 303 LGGNSSVFKNSTPITYTNMI----PNPQL-ATFYILNLTGISIGGK--QLQASGFA---- 351
LG + + P+ YT ++ P P Y + L GI + K L S F
Sbjct: 229 LG--EARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHT 286
Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF------PSAPGFSILDTCFNLSAYQ 404
G ++DSGT T L +YSAL+ EFL Q +G P +D C+ + +
Sbjct: 287 GAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTS 346
Query: 405 EV--NIPLVKMEFEGNAEMTVDVTGIVYFVKSDA----SQVCLALASLSYEDETGI---- 454
N+P+VK+ F G AEM+V ++Y V + S C + DE GI
Sbjct: 347 STLPNLPVVKLMFRG-AEMSVSGQRLLYRVPGEVRGKDSVWCFTFGN---SDELGISSFL 402
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
IG++QQ+N + YD +NS++GFA C
Sbjct: 403 IGHHQQQNVWMEYDLENSRIGFAELRC 429
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 171/374 (45%), Gaps = 51/374 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPVFDPSISPSYKKVLCNSSTC---HAL 196
+ ++ ++DTGS W C C +C + + F P S S K + C + C H
Sbjct: 88 QTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQT 147
Query: 197 EF----ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
+ NS CS PP Y + YG G+ T G E L L V +F+ GC
Sbjct: 148 DLRCTDCDNNSRNCSQICPP---YLILYGSGT-TGGVALSETLHLHGLIVPNFLVGCS-- 201
Query: 253 NKGLFGG--VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQ--DAGASGSLILGGN 306
+F +G+ G GR SL SQ GL FSYCL S + D S SL+L
Sbjct: 202 ---VFSSRQPAGIAGFGRGPSSLPSQL-----GLTKFSYCLLSHKFDDTQESSSLVLDSQ 253
Query: 307 SSVFKNSTPITYTNMIPNPQL------ATFYILNLTGISIGG-------KQLQASGFAKG 353
S K + + YT ++ NP++ + +Y ++L ISIGG K L G
Sbjct: 254 SDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNG 313
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSILDTCFNLSAYQEVNIPL 410
G +IDSGT T + + L EF+ Q + A S L CFN+S +E+ +P
Sbjct: 314 GTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKELELPQ 373
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---IIGNYQQKNQRVIY 467
+++ F+G A++ + + F+ S C + + E +G I+GN+Q +N V Y
Sbjct: 374 LRLHFKGGADVELPLENYFAFLGS-REVACFTVVTDGAEKASGPGMILGNFQMQNFYVEY 432
Query: 468 DTKNSQLGFAGEDC 481
D +N +LGF E C
Sbjct: 433 DLQNERLGFKKESC 446
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 166/359 (46%), Gaps = 26/359 (7%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
TI + + I+D +L W QC C C+ Q P+F P+ S +++ C + C ++
Sbjct: 72 TIGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIP 131
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
+ +S +C+ ++ G +T G + + +G A+ + FGC +
Sbjct: 132 TSNCSSNMCTYEG------TINSKLGGHTLGIVATDTFAIGTATAS-LGFGCVVASGIDT 184
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
GG SGL+GLGR+ SLVSQ + FSYCL + D+G + L+LG ++ + NST
Sbjct: 185 MGGPSGLIGLGRAPSSLVSQMNIT---KFSYCL-TPHDSGKNSRLLLGSSAKLAGGGNST 240
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAKGGILIDSGTVITRLPPSIYS 372
+ P ++ +Y + L GI G L SG +L+ + ++ L S Y
Sbjct: 241 TTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSG---NTVLVQTLAPMSFLVDSAYQ 297
Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF-EGNAEMTVDVTGIVYF 431
ALK E K P+A D CF + + P + F +G A +TV +
Sbjct: 298 ALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLID 357
Query: 432 VKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V + VC+A+ S S+ + T I+G+ QQ+N + D + L F DCSS+
Sbjct: 358 VGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCSSL 416
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 102/262 (38%), Positives = 140/262 (53%), Gaps = 23/262 (8%)
Query: 91 RLILDNLHV--QYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELGG--RN 145
RL L H L + ++ +IK ++ E PL SG + Y + + +G ++
Sbjct: 6 RLTLMVFHCCKSILATYFHVILLFSIKTIAEALETPLVSGASQGSGEYFSRVGIGSPPKH 65
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
+ ++VDTGSD+ WVQC PC CY Q DP+F+PS S SY + C + C +L+ +
Sbjct: 66 VYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVSE----- 120
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLM 264
C + S C Y VSYGDGSYT G+ E + L G AS+N+ GCG +N+GLF G +GL+
Sbjct: 121 CRNDS---CLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLL 177
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
GLG LS SQ I FSYCL + AS L NS + +S ++ N
Sbjct: 178 GLGGGSLSFPSQ---INASSFSYCLVNRDTDSAS---TLEFNSPIPSHSVT---APLLRN 228
Query: 325 PQLATFYILNLTGISIGGKQLQ 346
QL TFY L +TGI K LQ
Sbjct: 229 NQLDTFYYLGMTGIGESYKILQ 250
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 177/382 (46%), Gaps = 43/382 (11%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK---SCYNQQDPVFDPS 178
+ S + ++ Y+ T+ LG R+M I DTGSDL WV+C+ S FDPS
Sbjct: 90 VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--- 235
S +Y +V C + C AL AT + G +C Y +YGDGS T G L E
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDG-------SNCAYLYAYGDGSNTTGVLSTETFTFD 202
Query: 236 --GLGKA----SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT--SEIFGGLFSY 287
G G++ V FGC G F + +SLV+Q + G FSY
Sbjct: 203 DGGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLG-GGAVSLVTQLGGATSLGRRFSY 261
Query: 288 CLPSTQDAGASGSLILGGNSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
CL AS +L G + V + STP+ + + T+Y + L + +G K
Sbjct: 262 CL-VPHSVNASSALNFGALADVTEPGAASTPLVAGD------VDTYYTVVLDSVKVGNKT 314
Query: 345 LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
+ ++ ++ I++DSGT +T L PS+ + E ++ + P +L C+N+ A +
Sbjct: 315 VASAASSR--IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNV-AGR 371
Query: 405 EV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
EV +IP + +EF G A + + FV +CLA+ + + + I+GN Q
Sbjct: 372 EVEAGESIPDLTLEFGGGAAVALKPENA--FVAVQEGTLCLAIVATTEQQPVSILGNLAQ 429
Query: 461 KNQRVIYDTKNSQLGFAGEDCS 482
+N V YD + FAG DC+
Sbjct: 430 QNIHVGYDLDAGTVTFAGADCA 451
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 92/289 (31%), Positives = 133/289 (46%), Gaps = 43/289 (14%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
T +DT SDL W QCQPC CY+Q DP+F+P +S +Y + C+S TC L+
Sbjct: 101 KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHR---- 156
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG--VSG 262
C C Y +Y + T G L + L +G+ + FGC ++ G SG
Sbjct: 157 -CGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASG 215
Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
++GLGR LSLVSQ S F+YCLP + G L+LG ++ +N+T M
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPA-SRIPGKLVLGADADAARNATNRIAVPMR 271
Query: 323 PNPQLATFYILNLTGISIGGKQL-------------------------QASGFAKG---- 353
+P+ ++Y LNL G+ IG + + A+ A G
Sbjct: 272 RDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANR 331
Query: 354 -GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL 400
G++ID + IT L S+Y L + + P G S+ LD CF L
Sbjct: 332 YGMIIDIASTITFLEASLYDELVNDLEVEIR-LPRGTGSSLGLDLCFIL 379
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 173/389 (44%), Gaps = 35/389 (8%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKS--CYNQQ---- 171
E+P+ Y ++G + ++ DTGSDLTW+ C+ C+S C N++
Sbjct: 69 EVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 128
Query: 172 --DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
VF ++S S+K + C + C + C + P C Y Y DGS G
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTP-CGYDYRYSDGSTALGF 187
Query: 230 LGREHLGL-----GKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGG 283
E + + K +++ + GC + +G F G+MGLG S S + +E FGG
Sbjct: 188 FANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG 247
Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
FSYCL S L G + S +TYT ++ + +FY +N+ GISIGG
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGG 306
Query: 343 KQLQASGFA-----KGGILIDSGTVITRLPPSIY----SALKAEFLKQFSGFPSAPGFSI 393
L+ GG ++DSG+ +T L Y +AL+ LK F G
Sbjct: 307 AMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLK-FRKVEMDIG--P 363
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
L+ CFN + ++E +P + F AE V Y + + CL S+++ T
Sbjct: 364 LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKS--YVISAADGVRCLGFVSVAWPG-TS 420
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++GN Q+N +D +LGFA C+
Sbjct: 421 VVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
Length = 337
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 165/352 (46%), Gaps = 39/352 (11%)
Query: 151 DTGSDLTWVQCQPCKS---CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
DTG ++ +C C+ C FDPS S ++ V C S C SG CS
Sbjct: 4 DTGLGISLARCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDC--------RSG-CS 52
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
S S P C S+ + G + ++ L L ASV+DF FGC + G G +GL+ L
Sbjct: 53 SGSTPSCP-LTSF---PFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDL 108
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT-YTNMIPNP 325
R SL S+ + GG FSYCLP + + + G L++G S +T ++ +P
Sbjct: 109 SRDSRSLASRLAAGAGGTFSYCLPLSTTS-SHGFLVIGEADVPHNRSARVTAVAPLVYDP 167
Query: 326 QLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
Y+++L G+S+GG+ + A +++D+ T + PS+Y+ L+ F + + +
Sbjct: 168 AFPNHYVIDLAGVSLGGRDIPIPPHAA--MVLDTALPYTYMKPSMYAPLRDAFRRAMARY 225
Query: 386 PSAPGFSILDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTG--------IVYFVKSDA 436
P AP LDTC+N + + EV IPLV + F G + ++Y +
Sbjct: 226 PRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPGN 285
Query: 437 --SQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S CLA A+L + + ++G Q + V++D + ++GF C
Sbjct: 286 FFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPGSC 337
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 118/410 (28%), Positives = 177/410 (43%), Gaps = 44/410 (10%)
Query: 104 SRIKNMISGNIKDVS----------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVD 151
SRI+++I + K S ++ L SGI T Y I +G + V+VD
Sbjct: 65 SRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVD 124
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
TGS+LTWV C+ ++ VF S S+K V C + TC + C + S
Sbjct: 125 TGSELTWVNCR-YRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPST 183
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGL-FGGVSGLMG 265
P C+Y Y DGS +G +E + +G A + + GC + G F G G++G
Sbjct: 184 P-CSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLG 242
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSS---VFKNSTPITYTNM 321
L SD S S + ++G FSYCL + S LI G + S F+ +TP+ T +
Sbjct: 243 LAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRI 302
Query: 322 IPNPQLATFYILNLTGISIGGKQLQA-----SGFAKGGILIDSGTVITRLPPSIYSALK- 375
P FY +N+ GIS+G L + GG ++DSGT +T L + Y +
Sbjct: 303 PP------FYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVT 356
Query: 376 --AEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
A +L + P ++ CF+ S + +P + +G A Y V
Sbjct: 357 GLARYLVELKRV--KPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKS--YLV 412
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ CL S T +IGN Q+N +D S L FA C+
Sbjct: 413 DAAPGVKCLGFVSAG-TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 125/419 (29%), Positives = 184/419 (43%), Gaps = 68/419 (16%)
Query: 122 IPLTSGIRLQTLNYIATIELGGRN--MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDP 177
+PL+ G +Y + LG + +T+ +DTGSDL W C P C C + DP
Sbjct: 67 LPLSPGS-----DYTLSFNLGPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDP 121
Query: 178 S----ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSY 220
S IS S + CNS C +T +S +C+ + P DC ++ +Y
Sbjct: 122 SPPTNISHS-TPISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAY 180
Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ---T 277
GDGS L R+ L L + +F FGC F +G+ G GR LSL +Q
Sbjct: 181 GDGSLI-ASLYRDTLSLSTLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATH 236
Query: 278 SEIFGGLFSYCLPS----TQDAGASGSLILGG-NSSVFKNSTPIT---YTNMIPNPQLAT 329
S G FSYCL S ++ LILG N N + YT+M+ NP+ +
Sbjct: 237 SPQLGNRFSYCLVSHSFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSY 296
Query: 330 FYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEF---- 378
FY + L GIS+G K + A + GG+++DSGT T LP Y+++ F
Sbjct: 297 FYTVGLKGISVGKKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRA 356
Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVYF------ 431
K P + L C+ L+ +P V + F G N+ + + Y
Sbjct: 357 RKSNRRAPEIEQKTGLSPCYYLNT--AAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGD 414
Query: 432 -VKSDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V+ CL + E E G++GNYQQ+ V YD + ++GFA C+S+
Sbjct: 415 GVRRKERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKCASL 473
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 172/372 (46%), Gaps = 29/372 (7%)
Query: 128 IRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
IR Y+A +G + + IVD +L W QC C+ C+ Q PVF P+ S ++K
Sbjct: 38 IRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKP 97
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
C ++ C ++ + + VCS PP + G+ T G + +G A+V
Sbjct: 98 EPCGTAVCESIPTRSCSGDVCSYKGPP------TQLRGN-TSGFAATDTFAIGTATVR-L 149
Query: 246 IFGC-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGC ++ G SG +GLGR+ SLV+Q FSYCL S ++ G S L LG
Sbjct: 150 AFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-SPRNTGKSSRLFLG 205
Query: 305 GNSSVF--KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGT 361
++ + ++++ + P+ + +Y+L+L I G + + GGIL+ + +
Sbjct: 206 SSAKLAGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIATA--QSGGILVMHTVS 263
Query: 362 VITRLPPSIYSALKAEFLKQFSG---FPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEG 417
+ L S Y A K + G P A D CF +A + P + F+G
Sbjct: 264 PFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 323
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNS 472
A +TV + V + C A+ S+++ + TG ++G+ QQ++ +YD K
Sbjct: 324 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 383
Query: 473 QLGFAGEDCSSM 484
L F DCSS+
Sbjct: 384 TLSFEPADCSSL 395
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 118/410 (28%), Positives = 177/410 (43%), Gaps = 44/410 (10%)
Query: 104 SRIKNMISGNIKDVS----------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVD 151
SRI+++I + K S ++ L SGI T Y I +G + V+VD
Sbjct: 43 SRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVD 102
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
TGS+LTWV C+ ++ VF S S+K V C + TC + C + S
Sbjct: 103 TGSELTWVNCR-YRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPST 161
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGL-FGGVSGLMG 265
P C+Y Y DGS +G +E + +G A + + GC + G F G G++G
Sbjct: 162 P-CSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLG 220
Query: 266 LGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSS---VFKNSTPITYTNM 321
L SD S S + ++G FSYCL + S LI G + S F+ +TP+ T +
Sbjct: 221 LAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRI 280
Query: 322 IPNPQLATFYILNLTGISIGGKQLQA-----SGFAKGGILIDSGTVITRLPPSIYSALK- 375
P FY +N+ GIS+G L + GG ++DSGT +T L + Y +
Sbjct: 281 PP------FYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVT 334
Query: 376 --AEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
A +L + P ++ CF+ S + +P + +G A Y V
Sbjct: 335 GLARYLVELKRV--KPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKS--YLV 390
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ CL S T +IGN Q+N +D S L FA C+
Sbjct: 391 DAAPGVKCLGFVSAG-TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439
>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
Length = 340
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/270 (32%), Positives = 131/270 (48%), Gaps = 23/270 (8%)
Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
D FDPS S S+ + C S C A+E C+ +S C + + +G+ + G L
Sbjct: 30 DVAFDPSRSSSFAAIPCGSPEC-AVE--------CTGAS---CPFTIQFGNVTVANGTLV 77
Query: 232 REHLGLGK-ASVNDFIFGCGR--NNKGLFGGVSGLMGLGRSDLSLVSQT-----SEIFGG 283
R+ L L A+ F FGC + F G GL+ L RS SL S+ +
Sbjct: 78 RDTLTLSPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTTTTA 137
Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
FSYCLPS + G L +G + + I Y M NP Y ++L GIS+GG+
Sbjct: 138 AFSYCLPSLSSTRSRGFLSIGASRPEYSGGD-IKYAPMSSNPNHPNSYFVDLVGISVGGE 196
Query: 344 QLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
L A G L+++ T T L P+ Y+AL+ F + +P+AP F +LDTC+NL+
Sbjct: 197 DLPVPPAVLAAHGTLLEAATEFTFLAPAAYAALRDAFRNDMAQYPAAPPFRVLDTCYNLT 256
Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
+ +P V + F G E+ +DV +YF
Sbjct: 257 GLASLAVPAVALRFAGGTELELDVRQTMYF 286
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 175/362 (48%), Gaps = 33/362 (9%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
LQT Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV
Sbjct: 77 LQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVS 134
Query: 188 CNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDF 245
C +S C G+ C S PDC + VSY DGS + G L ++ L + F
Sbjct: 135 CGTSMC----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF 190
Query: 246 IFGCGRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGAS 298
FGC ++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +
Sbjct: 191 TFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTT 249
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGIL 356
G LG ++ T + YT M+ + + ++L IS+ G++ L S F++ G++
Sbjct: 250 GYFSLGKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVV 305
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
DSG+ ++ +P S L ++ +++ A C+++ + E ++P + + F+
Sbjct: 306 FDSGSELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFD 364
Query: 417 GNAEMTVDVTGIVYFVKSDASQ---VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
A + G+ FV+ + CLA A + IIG+ Q ++ V+YD K
Sbjct: 365 DGARFDLGSHGV--FVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQL 419
Query: 474 LG 475
+G
Sbjct: 420 IG 421
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 122/425 (28%), Positives = 183/425 (43%), Gaps = 77/425 (18%)
Query: 122 IPLTSGIRLQTLNYIATIELGG---RNMTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFD 176
+PL+ G +Y + LG + +++ +DTGSDL W C P C C + D
Sbjct: 65 LPLSPGS-----DYTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAAT 119
Query: 177 PSISP----SYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVS 219
+SP S V C S C A + +S +C+ + P DC+ ++ +
Sbjct: 120 GGLSPPNITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYA 179
Query: 220 YGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
YGDGS L R+ L + +S +++F FGC G G+ G GR LSL +Q
Sbjct: 180 YGDGSLV-ARLYRDSLSMPASSPLVLHNFTFGCAHTA---LGEPVGVAGFGRGVLSLPAQ 235
Query: 277 TSEI---FGGLFSYCLPS----TQDAGASGSLILGGNS-------SVFKNSTPITYTNMI 322
+ G FSYCL S LILG S V + YT M+
Sbjct: 236 LASFSPHLGNQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAML 295
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALK 375
NP+ FY + L GI++G +++ K GG+++DSGT T LP +Y +L
Sbjct: 296 DNPKHPYFYCVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLV 355
Query: 376 AEF-------LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
EF K+ + G L C+ S +P V + F GN+ + +
Sbjct: 356 TEFNHRMGRVYKRATQIEERTG---LGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNY 411
Query: 429 VY--FVKSDASQV-----CLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFA 477
Y F D + CL L + E E+G +GNYQQ+ V+YD + ++GFA
Sbjct: 412 YYEFFDGRDGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFA 471
Query: 478 GEDCS 482
C+
Sbjct: 472 RRKCA 476
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 135/483 (27%), Positives = 198/483 (40%), Gaps = 87/483 (18%)
Query: 66 AITLELKHKNYCSGKIVDWNE----QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE 121
A+ LEL H VD NE +++ R + H + L + +
Sbjct: 22 ALRLELAH--------VDANEHCTMEERVRRATERTHHRRL------LHASTAAAAGGVA 67
Query: 122 IPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK----------SCYN 169
PL + Q YIA+ +G + +VDTGSDL W QC C+ C+
Sbjct: 68 APLRWSGKTQ---YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFP 124
Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTR 227
Q P ++ S+S + + V C+ A +G D C SYG G
Sbjct: 125 QNLPYYNFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VAL 183
Query: 228 GELGREHLGLGKASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
G LG + +S FGC R + G G SG++GLGR LSLVSQ +
Sbjct: 184 GVLGTDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNAT---E 240
Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNST---------PITYTNMIPNPQ---LATFY 331
FSYCL P +D + L +G ++ P+T NP+ +TFY
Sbjct: 241 FSYCLTPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFY 300
Query: 332 ILNLTGISIGGK--QLQASGFA---------KGGILIDSGTVITRLPPSIYSALKAEFLK 380
L L G++ G L A F GG LIDSG+ TRL + AL E +
Sbjct: 301 YLPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELAR 360
Query: 381 QFSGF-----PSAPGFSILDTCFNL----SAYQEVNIPLVKMEFE----GNAEMTVDVTG 427
Q G P A L+ C + +P + + F+ G E+ +
Sbjct: 361 QLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEK 420
Query: 428 IVYFVKSDASQVCLALASLSY------EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
Y+ + +AS C+A+ S + +ET IIGN+ Q++ RV+YD N L F +C
Sbjct: 421 --YWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478
Query: 482 SSM 484
S++
Sbjct: 479 SAV 481
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 114/367 (31%), Positives = 167/367 (45%), Gaps = 43/367 (11%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+N+T+++DTGS+L+W+ C ++ D F P S ++ V C S+ C + + S
Sbjct: 72 QNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCSSRDLPAPPS 130
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC---GRNNKGLFGGV 260
C ++S C +SY DGS + G L + +G A FGC ++
Sbjct: 131 --CDAASR-RCRVSLSYADGSASDGALATDVFAVGDAPPLRSAFGCMSAAYDSSPDAVAT 187
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
+GL+G+ R LS V+Q S FSYC+ DAG L+LG + F P+ YT
Sbjct: 188 AGLLGMNRGALSFVTQAST---RRFSYCISDRDDAGV---LLLGHSDLPF---LPLNYTP 238
Query: 321 MI-PNPQLATF----YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPP 368
+ P P L F Y + L GI +GGK L G ++DSGT T L
Sbjct: 239 LYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLG 298
Query: 369 SIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLS---AYQEVNIPLVKMEFEGNA 419
YSA+KAEFLKQ A P F+ DTCF + +P V + F G A
Sbjct: 299 DAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFNG-A 357
Query: 420 EMTVDVTGIVYFVKSDASQV----CLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQL 474
+M+V ++Y V + CL + T +IG++ Q N V YD + ++
Sbjct: 358 QMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYDLERGRV 417
Query: 475 GFAGEDC 481
G A C
Sbjct: 418 GLAPVKC 424
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 173/389 (44%), Gaps = 35/389 (8%)
Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKS--CYNQQ---- 171
E+P+ Y ++G + ++ DTGSDLTW+ C+ C+S C N++
Sbjct: 69 EVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 128
Query: 172 --DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
VF ++S S+K + C + C + C + P C Y Y DGS G
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTP-CGYDYRYSDGSTALGF 187
Query: 230 LGREHLGL-----GKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGG 283
E + + K +++ + GC + +G F G+MGLG S S + +E FGG
Sbjct: 188 FANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG 247
Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
FSYCL S L G + S +TYT ++ + +FY +N+ GISIGG
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGG 306
Query: 343 KQLQASGFA-----KGGILIDSGTVITRLPPSIY----SALKAEFLKQFSGFPSAPGFSI 393
L+ GG ++DSG+ +T L Y +AL+ LK F G
Sbjct: 307 AMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLK-FRKVEMDIG--P 363
Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
L+ CFN + ++E +P + F AE V Y + + CL S+++ T
Sbjct: 364 LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKS--YVISAADGVRCLGFVSVAWPG-TS 420
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++GN Q+N +D +LGFA C+
Sbjct: 421 VVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 171/372 (45%), Gaps = 29/372 (7%)
Query: 128 IRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
IR Y+A +G + + IVD +L W QC C+ C+ Q PVF P+ S ++K
Sbjct: 55 IRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKP 114
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
C ++ C ++ + + VCS PP + G+ T G + +G A+V
Sbjct: 115 EPCGTAVCESIPTRSCSGDVCSYKGPP------TQLRGN-TSGFAATDTFAIGTATVR-L 166
Query: 246 IFGC-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
FGC ++ G SG +GLGR+ SLV+Q FSYCL S ++ G S L LG
Sbjct: 167 AFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-SPRNTGKSSRLFLG 222
Query: 305 GNSSVF--KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGT 361
++ + ++++ + P+ +Y+L+L I G + + GGIL+ + +
Sbjct: 223 SSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA--QSGGILVMHTVS 280
Query: 362 VITRLPPSIYSALKAEFLKQFSG---FPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEG 417
+ L S Y A K + G P A D CF +A + P + F+G
Sbjct: 281 PFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 340
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNS 472
A +TV + V + C A+ S+++ + TG ++G+ QQ++ +YD K
Sbjct: 341 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 400
Query: 473 QLGFAGEDCSSM 484
L F DCSS+
Sbjct: 401 TLSFEPADCSSL 412
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 122/416 (29%), Positives = 188/416 (45%), Gaps = 50/416 (12%)
Query: 82 VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIE 140
+ W E D +Q+L S + + +P+ SG ++ Q YI +
Sbjct: 57 LSWEESVLQMQAKDKARLQFLSSLVAR----------KSVVPIASGRQIVQNPTYIVRAK 106
Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
+G + M + +DT SD+ W+ C C C + +F+ S +YK + C ++ C +
Sbjct: 107 IGTPAQTMLMAMDTSSDVAWIPCNGCLGCSST---LFNSPASTTYKSLGCQAAQCKQVPK 163
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
T GVCS + ++YG GS L ++ + L +V + FGC + G
Sbjct: 164 PTCGGGVCS--------FNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSL 214
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
GL+GLGR LSL+SQT ++ FSYCLPS + SGSL LG + I Y
Sbjct: 215 PAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR----IKY 270
Query: 319 TNMIPNPQLATFYILNLTGISI---------GGKQLQASGFAKGGILIDSGTVITRLPPS 369
T ++ NP+ + Y +NL + + G S A G + DSGTV TRL
Sbjct: 271 TPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVFTRLVTP 328
Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
Y A++ F + + DTC+ + + P + F G M V +
Sbjct: 329 AYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNVTLPPDN 381
Query: 430 YFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S A S CLA+A+ + +I N QQ+N R++YD NS+LG A E C+
Sbjct: 382 LLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 437
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 176/366 (48%), Gaps = 33/366 (9%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
R + ++VDT S+LTWVQ C +C + P F+P +S S+ C SS C G
Sbjct: 10 REVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRS-KLGFQ 68
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKAS-VNDFIFGCGRNN-KGLF 257
C+ S+ C++ V+Y DGS G + RE L G AS + D IFGC + +
Sbjct: 69 SACNRST-GSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQRPV 127
Query: 258 GGVSGLMGLGRSDLSLVSQT-SEIFGGL---FSYCLPSTQDAGASGSLILGGNSSVFKNS 313
SG +GL R S +Q S GL FSYC P+ + S +I+ G+S + +
Sbjct: 128 DFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDSGIPAHH 187
Query: 314 TPITYTNMIPNPQLAT---FYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVI 363
Y ++ P +A+ FY + L GIS+GG+ L S F GG DSGT +
Sbjct: 188 --FQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFDSGTTV 245
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSIL-DTCFNLSA--YQEVNIPLVKMEFEGNAE 420
+ L ++AL F ++ G + C++++A + PLV + F+ N +
Sbjct: 246 SFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLHFKNNVD 305
Query: 421 MTVDVTGIVYFVKSDASQV---CLALASLSYEDETG--IIGNYQQKNQRVIYDTKNSQLG 475
M + V+ + QV CLA + + G +IGNYQQ++ + +D + S++G
Sbjct: 306 MELREAS-VWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLERSRIG 364
Query: 476 FAGEDC 481
FA +C
Sbjct: 365 FAPANC 370
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 162/371 (43%), Gaps = 58/371 (15%)
Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ +++G ++ +DTGSD+ W QC PC +CY+Q P+FDPS S ++++ CN ++
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNGNS 480
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF----- 247
CH Y + Y D +Y++G L E + + S F+
Sbjct: 481 CH---------------------YEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKI 519
Query: 248 GCGRNN-----KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
GCG +N G SG++GL LSL+SQ + GL SYC +G S I
Sbjct: 520 GCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF-----SGQGTSKI 574
Query: 303 LGGNSSVFKNSTPITYTNMIP--NPQLATFYILNLTGISIGGKQLQASGFA----KGGIL 356
G +++ + I NP FY LNL +S+ + G G I
Sbjct: 575 NFGTNAIVAGDGTVAADMFIKKDNP----FYYLNLDAVSVEDNLIATLGTPFHAEDGNIF 630
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI---PLVKM 413
IDSGT +T P S Y L E ++Q P NL Y I P++ M
Sbjct: 631 IDSGTTLTYFPMS-YCNLVREAVEQVVTAVKVPDMG----SDNLLCYYSDTIDIFPVITM 685
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F G A++ +D + Y CLA+ + + GN Q N V YD ++
Sbjct: 686 HFSGGADLVLDKYNM-YLETITGGIFCLAIG-CNDPSMPAVFGNRAQNNFLVGYDPSSNV 743
Query: 474 LGFAGEDCSSM 484
+ F+ +CS++
Sbjct: 744 ISFSPTNCSAL 754
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 149/338 (44%), Gaps = 50/338 (14%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+DTGSDL W QC PC CY+Q DP+FDPS S ++ + C+ +CH
Sbjct: 99 IDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHGKSCH--------------- 143
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCG-----RNNKGLFGG 259
Y + Y D +Y++G L E + + S F+ GCG +N G
Sbjct: 144 ------YEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASS 197
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
SG++GL SL+SQ + GL SYC +G S I G +++ +
Sbjct: 198 SSGIVGLNMGPRSLISQMDLPYPGLISYCF-----SGQGTSKINFGTNAIVAGDGTVAAD 252
Query: 320 NMIP--NPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSA 373
I NP FY LNL +S+ +++ G G I+IDSG+ +T P S Y
Sbjct: 253 MFIKKDNP----FYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFPVS-YCN 307
Query: 374 LKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
L + ++Q P S D S ++ P++ M F G A++ +D + Y
Sbjct: 308 LVRKAVEQVVTAVRVPDPSGNDMLCYFSETIDI-FPVITMHFSGGADLVLDKYNM-YMES 365
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ CLA+ S E I GN Q N V YD+ +
Sbjct: 366 NSGGLFCLAIICNSPTQE-AIFGNRAQNNFLVGYDSSS 402
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 132/442 (29%), Positives = 210/442 (47%), Gaps = 40/442 (9%)
Query: 44 WQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQ 103
+ S ++ C S Q ++ I + K + K W+ + N D + YL
Sbjct: 16 FMSMSNATDPCAS-QPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74
Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
S + K VS+ P+ SG NYI +++G G+ + +++DT +D ++
Sbjct: 75 SLVAQ------KTVSSA--PIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPS 126
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
C C F P+ S SY + C+ C + + C ++ C++ SY
Sbjct: 127 SGCIGC---SATTFSPNASTSYVPLECSVPQCSQVRGLS-----CPATGSGACSFNKSYA 178
Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
+Y+ L ++ L L + + FG G GL+GLGR LSL+SQT ++
Sbjct: 179 GSTYS-ATLVQDSLRLATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLY 237
Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
G+FSYCLPS + SGSL LG I T ++ NP+ + Y +NLTGI++G
Sbjct: 238 SGVFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPRRPSLYFVNLTGITVG 293
Query: 342 G------KQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
K+L A G G +IDSGTVITR +Y+A++ EF KQ +G S+ G
Sbjct: 294 KVNVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFSSLG--AF 351
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE---DE 451
DTCF + Y+ + P + + F + ++ + + + S S CLA+AS
Sbjct: 352 DTCF-VKNYETL-APAITLHFT-DLDLKLPLENSLIH-SSSGSLACLAMASTPKNVNYTV 407
Query: 452 TGIIGNYQQKNQRVIYDTKNSQ 473
+I NYQQ+N RV++DT N++
Sbjct: 408 LNVIANYQQQNLRVLFDTVNNK 429
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 183/395 (46%), Gaps = 49/395 (12%)
Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
+++ ++PL R+ ++ Y I+LG + V VDTGSD+ WV C+PC C ++ +
Sbjct: 55 LASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNL 114
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
+FD + S + KKV C+ C + S C + C+Y + Y D S + G
Sbjct: 115 NFHLSLFDVNASSTSKKVGCDDDFCSFIS----QSDSCQPAV--GCSYHIVYADESTSEG 168
Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
R+ L L + + + + +FGCG + G G V G+MG G+S+ S++SQ
Sbjct: 169 NFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQ 228
Query: 277 TSEIFGG--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
+ +FS+CL + + G ++ +S + T M+PN Y +
Sbjct: 229 LAATGDAKRVFSHCLDNVKGGGIFAVGVV--------DSPKVKTTPMVPN---QMHYNVM 277
Query: 335 LTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
L G+ + G L S GG ++DSGT + P +Y +L L +
Sbjct: 278 LMGMDVDGTALDLPPSIMRNGGTIVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHI 333
Query: 393 ILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED 450
+ DT CF+ S +V P V EFE + ++TV ++ ++ + L+ +
Sbjct: 334 VEDTFQCFSFSENVDVAFPPVSFEFEDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGE 393
Query: 451 ETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
T +I G+ N+ V+YD +N +G+A +CSS
Sbjct: 394 RTEVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 428
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 177/390 (45%), Gaps = 41/390 (10%)
Query: 98 HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLT 157
+ L +R+ + SG+ + T + L SG + + +I + ++ + DTGSDL
Sbjct: 53 RLSMLAARLDDAASGSAQ----TPLQLDSGGGAYDMTF--SIGTPPQELSALADTGSDLI 106
Query: 158 WVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
W +C C C Q P + P+ S S+ K+ C+ S C L S CS+ +C+Y
Sbjct: 107 WAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLP-----SSQCSAGGA-ECDYK 160
Query: 218 VSYGDGS----YTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSL 273
SYG S YT+G LG E LG +V FGC ++G +G SGL+GLGR LSL
Sbjct: 161 YSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSL 220
Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS--SVFKNSTPITYTNMIPNPQLATFY 331
VSQ + G FSYCL T DA + L+ G + STP+ T+ +Y
Sbjct: 221 VSQLNV---GAFSYCL--TSDAAKTSPLLFGSGALTGAGVQSTPLLRTSTY-------YY 268
Query: 332 ILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+NL ISIG +G GI+ DSGT + L Y+ K L Q + A G
Sbjct: 269 TVNLESISIGAATTAGTG--SSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGR 326
Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
+ CF S P + + F+G +D+ YF D S C +
Sbjct: 327 DGYEVCFQTSG---AVFPSMVLHFDGG---DMDLPTENYFGAVDDSVSCWIVQK---SPS 377
Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I+GN Q N + YD + S L F +C
Sbjct: 378 LSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 175/362 (48%), Gaps = 33/362 (9%)
Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
LQT Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV
Sbjct: 77 LQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVS 134
Query: 188 CNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDF 245
C +S C G+ C S PDC + VSY DGS + G L ++ L + F
Sbjct: 135 CGTSMC----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGF 190
Query: 246 IFGCGRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGAS 298
FGC ++ G FG V GL+G+G +S++ Q+S F FSYCLP + + +
Sbjct: 191 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTT 249
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGIL 356
G LG ++ T + YT M+ + + ++LT IS+ G++ L S F++ G++
Sbjct: 250 GYFSLGKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVV 305
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
DSG+ ++ +P S L ++ +++ A C+++ + E ++P + + F+
Sbjct: 306 FDSGSELSYIPDRALSVL-SQRIRELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFD 364
Query: 417 GNAEMTVDVTGIVYFVKSDASQ---VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
A + G+ FV+ + CLA A + IIG+ Q ++ V+YD K
Sbjct: 365 DGARFDLGSHGV--FVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQL 419
Query: 474 LG 475
+G
Sbjct: 420 IG 421
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 169/358 (47%), Gaps = 28/358 (7%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y T +G + ++ + DTGSDL W +C CK C + + P+ S S+ K+ C+S+
Sbjct: 81 YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGS----YTRGELGREHLGLGKASVNDFIFG 248
C LE + + + + C+Y SYG S YT+G +G E LG +V FG
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFG 200
Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS 308
C ++G +G SGL+GLGR LSLV Q + G FSYCL T D S L+ G +
Sbjct: 201 CTTMSEGGYGSGSGLVGLGRGKLSLVRQ---LKVGAFSYCL--TSDPSTSSPLLFGAGAL 255
Query: 309 VFK--NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRL 366
STP+ N + +TFY +NL ISIG + +G + GI+ DSGT +T L
Sbjct: 256 TGPGVQSTPLV------NLKTSTFYTVNLDSISIGAAKTPGTG--RHGIIFDSGTTLTFL 307
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
Y+ +A L Q + PG + CF S P + + F+G +M +
Sbjct: 308 AEPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSG--GAVFPSMVLHFDGG-DMALKTE 364
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
YF + S C + E I+GN Q + + YD S L F +C S+
Sbjct: 365 N--YFGAVNDSVSCWLVQ--KSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNCDSV 418
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 159/367 (43%), Gaps = 49/367 (13%)
Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++LG ++ +DTGSDL W QC PC +CY Q P+FDPS S ++K+ C+
Sbjct: 61 YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEKRCH--- 117
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF----- 247
GNS C Y + Y D SY+ G L E + + S F+
Sbjct: 118 --------GNS----------CPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSI 159
Query: 248 GCGRNNKGLF-----GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
GCG NN L SG++GL SL+SQ GL SYC S + +
Sbjct: 160 GCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQ----GTSKIN 215
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILID 358
G N+ V + T + I Q FY LNL +S+G K+++ G G I ID
Sbjct: 216 FGTNAVVAGDGT-VAADMFIKKDQ--PFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFID 272
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEG 417
SGT T LP S + ++ P S + C+N + P++ + F G
Sbjct: 273 SGTTYTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTMEI--FPVITLHFAG 330
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
A++ +D + Y CLA+ + I GN N V YD+ + F+
Sbjct: 331 GADLVLDKYNM-YVETITGGTFCLAIGCVD-PSMPAIFGNRAHNNLLVGYDSSTLVISFS 388
Query: 478 GEDCSSM 484
+CS++
Sbjct: 389 PTNCSAL 395
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 167/384 (43%), Gaps = 64/384 (16%)
Query: 120 TEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFD 176
T P+ SG QT +Y+ LG + + + +DT +D TW C PC +C F
Sbjct: 66 TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FI 120
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
P+ S SY + C S C GE GR
Sbjct: 121 PASSSSYASLPCASDWCPLFRRPA-------------------------VPGEPGR---- 151
Query: 237 LGKASVNDFIFGCGRNNK-GLFGGVSGLMGLGRSD--------LSLVSQTSEIFGGLFSY 287
+G A+ + R + G+ G R+ +SL+SQT + G+FSY
Sbjct: 152 VGAAADVRLLQAASRTPRSGVLAATR--CGWARTPSPATRSGPMSLLSQTGSRYNGVFSY 209
Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
CLPS + SGSL LG +N + YT ++ NP + Y +N+TG+S+G ++A
Sbjct: 210 CLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPSLYYVNVTGLSVGRALVKA 265
Query: 348 SG--FA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
FA G +IDSGTVITR +Y+AL+ EF +Q + DTCFN
Sbjct: 266 PAGSFAFDPSTGAGTVIDSGTVITRWTAPVYAALRDEFRRQVAAPSGYTSLGAFDTCFNT 325
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLS--YEDETGIIGN 457
P V + G ++T+ + + S A+ + CLA+A ++ N
Sbjct: 326 DEVAAGGAPPVTLHMGGGVDLTLPMENT--LIHSSATPLACLAMAEAPQNVNSVVNVVAN 383
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
QQ+N RV+ D S++GFA E C
Sbjct: 384 LQQQNVRVVVDVAGSRVGFAREPC 407
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 110/415 (26%), Positives = 185/415 (44%), Gaps = 52/415 (12%)
Query: 98 HVQYL----QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVD 151
H+Q++ +R K + + +K++ +++ + ++T + +G + I+D
Sbjct: 27 HIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQAIKTSLFFVNFSVGQPPVPQFTIMD 86
Query: 152 TGSDLTWVQCQPCKSCYNQQ--DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
TGS L W+QC PCK C + PVF+P++S ++ + C+ C +G CSS+
Sbjct: 87 TGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDRFCRY-----APNGHCSSN 141
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-----FGCGRNN-KGLFGGVSGL 263
C Y Y G+ ++G L +E L + N + FGCG N + L +G+
Sbjct: 142 K---CVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHENGEQLESEFTGI 198
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG-ASGSLILGGNSSVFKNSTPITYTNMI 322
+GLG SL Q G FSYC+ + L+LG ++ + + TPI +
Sbjct: 199 LGLGAKPTSLAVQ----LGSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETE- 253
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGF------AKGGILIDSGTVITRLPPSIYSALKA 376
Y +NL GIS+G KQL ++ G+++D+GT+ T L Y L
Sbjct: 254 -----NGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWLADIAYRELYN 308
Query: 377 EFLKQFSGFPSAPGFSILD-TCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVY-FVK 433
E P F D C++ +E + P+V F G AE+ ++ T + Y +
Sbjct: 309 EIKSILD--PKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFYPMTE 366
Query: 434 SDASQ--VCLALASLS-----YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
SD C+++ + Y+D T IG Q+ + YD K + DC
Sbjct: 367 SDTYHNVFCMSVRPTTEHGGEYKDFTA-IGLMAQQYYNIAYDLKERNIYLQRIDC 420
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/440 (29%), Positives = 205/440 (46%), Gaps = 44/440 (10%)
Query: 56 SHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK 115
+ Q ++ I + K + K W+ + N D + YL + + +
Sbjct: 27 ASQPDDSDLNVIPMYGKCSPFNPPKADSWDNRVINMASKDPARMSYLSTLVAQKTA---- 82
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
T P+ SG NY+ +++G G+ + +++DT +D +V C C
Sbjct: 83 ----TSAPIASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGC---SAT 135
Query: 174 VFDPSISPSYKKVLCNSSTC---HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
F P++S S+ + C+ C L SG CS + SY GS L
Sbjct: 136 TFYPNVSTSFVPLDCSVPQCGQVRGLSCPATGSGACS--------FNQSYA-GSTFSATL 186
Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
++ L L + + FG G GL+GLGR LSL+SQ+ I+ G+FSYCLP
Sbjct: 187 VQDSLRLATDVIPSYSFGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLP 246
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG------GKQ 344
S + SGSL LG I T ++ NP + Y +NLT IS+G +
Sbjct: 247 SFKSYYFSGSLKLGP----VGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSE 302
Query: 345 LQASGFAKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
L A + G G +IDSGTVITR IY+A++ EF KQ +G S+ G DTCF + Y
Sbjct: 303 LLAFNPSTGAGTIIDSGTVITRFVEPIYNAVRDEFRKQVTGPFSSLG--AFDTCF-VKNY 359
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET--GIIGNYQQK 461
+ + P + + F + ++ + + + S S CLA+A+ + +I N+QQ+
Sbjct: 360 ETL-APAITLHFT-DLDLKLPLENSLIH-SSSGSLACLAMAAAPSNVNSVLNVIANFQQQ 416
Query: 462 NQRVIYDTKNSQLGFAGEDC 481
N RV++DT N+++G A E C
Sbjct: 417 NLRVLFDTVNNKVGIARELC 436
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 170/386 (44%), Gaps = 49/386 (12%)
Query: 131 QTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDP---VFDPSISPSYKK 185
+ Y+ IE+G + V I DTGSDL WV+C+ + N P F PS S +Y +
Sbjct: 106 RQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGR 165
Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLG----- 238
V C++ C AL A +S PD C Y SYGDGS G+L E
Sbjct: 166 VGCDTKACRALSSA--------ASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADS 217
Query: 239 -----------------KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSE 279
+ + FGC G F GL+GLG +SL SQ +
Sbjct: 218 SKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRA-DGLVGLGGGPVSLASQLGATT 276
Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
G FSYCL + AS +L G + V S P + + ++ T+Y + L I+
Sbjct: 277 SLGRKFSYCLAPYANTNASSALNFGSRAVV---SEPGAASTPLITGEVETYYTIALDSIN 333
Query: 340 IGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
+ G + + A+ I++DSGT +T L ++ + L + ++ + ILD C++
Sbjct: 334 VAGTKRPTTA-AQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYD 392
Query: 400 LSAYQ---EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
+S + + IP V + G E+T+ FV +CLAL + S I+G
Sbjct: 393 ISGVRGEDALGIPDVTLVLGGGGEVTLKPDNT--FVVVQEGVLCLALVATSERQSVSILG 450
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCS 482
N Q+N V YD + + FA DC+
Sbjct: 451 NIAQQNLHVGYDLEKGTVTFAAADCA 476
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 127/435 (29%), Positives = 186/435 (42%), Gaps = 73/435 (16%)
Query: 113 NIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQ 170
N + +PL+ G +Y + L + + + +DTGSDL W CQP C C +
Sbjct: 65 NTHNHRQVSLPLSPGS-----DYTLSFTLDSQPIFLYLDTGSDLVWFPCQPFECILCEGK 119
Query: 171 QD-----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN--- 215
+ P +S + V C SS C A +S +C+ S+ P DC
Sbjct: 120 AENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHS 179
Query: 216 ---YFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLG 267
++ +YGDGS L R+ + L ++ VN+F FGC G+ G G
Sbjct: 180 CPQFYYAYGDGSLI-ARLYRDSISLPLSNPTNLIVNNFTFGCAHT---ALAEPIGVAGFG 235
Query: 268 RSDLSLVSQTSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSSVFK-------NS 313
R LSL +Q + + G FSYCL S + LILG K N
Sbjct: 236 RGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNK 295
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRL 366
YT+M+ N + FY + L GISIG K++ A GF + GG+++DSGT T L
Sbjct: 296 PRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTML 355
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK---MEFEGNAEMTV 423
P S+Y ++ AEF + DT + Y + N+ V + F GN V
Sbjct: 356 PASLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVV 415
Query: 424 DVTGIVYFVK---------SDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDT 469
+ YF + CL L + E E +GNYQQ+ V+YD
Sbjct: 416 -LPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDL 474
Query: 470 KNSQLGFAGEDCSSM 484
+N ++GFA C+S+
Sbjct: 475 ENKRVGFARRQCASL 489
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/445 (25%), Positives = 181/445 (40%), Gaps = 54/445 (12%)
Query: 81 IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
+ D + R+ H + S + +PLTSG Y
Sbjct: 43 LADLARSDRQRMAFIASHGRRRTRETAAGSSSASSAAAAFAMPLTSGAYTGIGQYFVRFR 102
Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV---------FDPSISPSYKKVLCN 189
+G + ++ DTGSDLTWV+C+ S + P F P S ++ + C
Sbjct: 103 VGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAPISCA 162
Query: 190 SSTC-HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KAS 241
S TC +L F+ C + P C Y Y DGS RG +G E + KA
Sbjct: 163 SDTCTKSLPFSLAT---CPTPGSP-CAYDYRYKDGSAARGTVGTESATIALSGREERKAK 218
Query: 242 VNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASG 299
+ + GC + G F G++ LG S +S S + FGG FSYCL A+
Sbjct: 219 LKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNATS 278
Query: 300 SLILGGNSSVFKNSTP-------------ITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
L G N +V S+P T ++ + ++ FY ++L IS+ G+ L+
Sbjct: 279 YLTFGPNPAV---SSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLK 335
Query: 347 ASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
A GG+++DSGT +T L Y A+ A K +G P + C+N +
Sbjct: 336 IPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRV-TMDPFEYCYNWT 394
Query: 402 AYQ----EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
+ +V +P + + F G A + + G Y + + C+ L + +IGN
Sbjct: 395 SPSGKDADVAVPKMAVHFAGAARL--EPPGKSYVIDAAPGVKCIGLQEGPWPG-ISVIGN 451
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
Q+ +D KN +L F C+
Sbjct: 452 ILQQEHLWEFDIKNRRLKFQRSRCT 476
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 144 RNMTVIVDTGSD-LTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
+ TV DT + T +QC+PC + C++ FDPS S S V C S C F
Sbjct: 156 QQFTVGFDTTTTGATQLQCKPCAADEPCHH----AFDPSASSSIAHVPCGSPDC---PFN 208
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFG 258
G CS S C VS + + L L + V+DF F C
Sbjct: 209 KG----CSGHS---CTLSVSINNTLLGNATFFTDKLTLTPWNIVDDFRFVCLEAGFRPDD 261
Query: 259 GVSGLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
+G++ L R+ SL S+ S FSYCLPS G L LG +
Sbjct: 262 DSTGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYPSD--VGFLSLGATKPELLGRK-V 318
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSAL 374
+YT + N Y++ L G+ +GG L + A GG +++ T T L P +Y+AL
Sbjct: 319 SYTPLRSNRHNGNLYVVELVGLGLGGVDLPVPRAAIAGGGTILELHTTFTYLKPKVYAAL 378
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
+ EF K S +P AP LDTC+N +A ++P V ++F+G AE + + ++YF +
Sbjct: 379 RDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEP 438
Query: 435 DA--SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ S CLA + +D +IG+ Q + V+YD + ++GF C
Sbjct: 439 GSYFSVGCLAFVA---QDGGAVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 120/373 (32%), Positives = 172/373 (46%), Gaps = 48/373 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCK------SCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
+N+T+++DTGS+L+W+ C + F P S ++ V C S+ C + +
Sbjct: 74 QNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRD 133
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRN 252
S C +S C+ +SY DGS + G L + +G+A FGC +
Sbjct: 134 LPAPPS--CDGASR-QCHVSLSYADGSASDGALATDVFAVGEAPPLRSAFGCMSTAYDSS 190
Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK- 311
G+ +GL+G+ R LS V+Q S FSYC+ DAG L+LG + F
Sbjct: 191 PDGV--ATAGLLGMNRGTLSFVTQAST---RRFSYCISDRDDAGV---LLLGHSDLPFLP 242
Query: 312 -NSTPITYTNMIPNPQL-ATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTV 362
N TP+ Y +P P Y + L GI +GGK L AS A G ++DSGT
Sbjct: 243 LNYTPL-YQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQ 301
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE---VNIPLVKM 413
T L YSALKAEFLKQ A P F+ LDTCF + A + +P V +
Sbjct: 302 FTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPPVTL 361
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQV----CLALASLSYEDETG-IIGNYQQKNQRVIYD 468
F G AEM+V ++Y V + CL + T +IG++ Q N V YD
Sbjct: 362 LFNG-AEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYD 420
Query: 469 TKNSQLGFAGEDC 481
+ ++G A C
Sbjct: 421 LERGRVGLAPVKC 433
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 171/370 (46%), Gaps = 55/370 (14%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS- 191
++ I +G +T ++ DT SDL W+QC PC +CY Q P+FDPS S +++ C +S
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144
Query: 192 -TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KASVN 243
+ +L+F C Y + Y D + ++G L RE L A+++
Sbjct: 145 YSMPSLKFNANTRS---------CEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALH 195
Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
D +FGCG +N G +G++GLG + SLV + FG FSYC S D ++++
Sbjct: 196 DVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGKKFSYCFGSLDDPSYPHNVLV 251
Query: 304 GGN--SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAK------G 353
G+ +++ ++TP+ N FY + + IS+ G L F + G
Sbjct: 252 LGDDGANILGDTTPLEIHN--------GFYYVTIEAISVDGIILPIDPRVFNRNHQTGLG 303
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT----CFNLSAYQ---EV 406
G +ID+G +T L Y LK F G +A S D C+N + + E
Sbjct: 304 GTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVES 363
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
P+V F AE+++DV + F+K + CLA+ IG Q++ +
Sbjct: 364 GFPIVTFHFSEGAELSLDVKSL--FMKLSPNVFCLAVTP----GNLNSIGATAQQSYNIG 417
Query: 467 YDTKNSQLGF 476
YD + ++ F
Sbjct: 418 YDLEAMEVSF 427
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 176/380 (46%), Gaps = 56/380 (14%)
Query: 150 VDTGSDLTWVQCQPCKSCYN-----QQDPVFDPSISPSYKKVLCNSSTCHAL-------- 196
+DTGSDL WV C SC N + VF P +S S V C S C L
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 197 -EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL------GKASVNDFIFGC 249
+ G+ CS + PP Y + YG GS T G L E L L G ++ F GC
Sbjct: 61 CQSCAGSLKNCSETCPP---YGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGC 116
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG-GLFSYCLPSTQ-DAGASGSLILGGNS 307
+ SG+ G GR LS+ SQ E G F+YCL S + D SL++ G+
Sbjct: 117 SIVSSQQ---PSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDK 173
Query: 308 SVFKNSTPITYTNMI------PNPQLATFYILNLTGISIGGKQLQA--------SGFAKG 353
++ N+ P+ YT + P+ Q +Y + L G+SIGGK+L+ G
Sbjct: 174 AL-PNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNG 232
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSILDTCFNLSAYQEVNIPL 410
G +IDSGT T I+ + A F Q G+ A + + C++++ + + +P
Sbjct: 233 GTIIDSGTTFTVFSDEIFKHIAAGFASQI-GYRRAGEVEDKTGMGLCYDVTGLENIVLPE 291
Query: 411 VKMEFEGNAEMTVDVTGIV-YFVKSDASQVCLALASLS--YEDETG---IIGNYQQKNQR 464
F+G ++M + V YF D+ +CL + S E ++G I+GN QQ++
Sbjct: 292 FAFHFKGGSDMVLPVANYFSYFSSFDS--ICLTMISSRGLLEVDSGPAVILGNDQQQDFY 349
Query: 465 VIYDTKNSQLGFAGEDCSSM 484
++YD + ++LGF + C +
Sbjct: 350 LLYDREKNRLGFTQQTCKTF 369
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 127/435 (29%), Positives = 186/435 (42%), Gaps = 73/435 (16%)
Query: 113 NIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQ 170
N + +PL+ G +Y + L + + + +DTGSDL W CQP C C +
Sbjct: 65 NTHNHRQVSLPLSPGS-----DYTLSFTLDSQPIFLYLDTGSDLVWFPCQPFECILCEGK 119
Query: 171 QD-----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN--- 215
+ P +S + V C SS C A +S +C+ S+ P DC
Sbjct: 120 AENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHS 179
Query: 216 ---YFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLG 267
++ +YGDGS L R+ + L ++ VN+F FGC G+ G G
Sbjct: 180 CPQFYYAYGDGSLI-ARLYRDSISLPLSNPTNLIVNNFTFGCAHT---ALAEPIGVAGFG 235
Query: 268 RSDLSLVSQTSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSSVFK-------NS 313
R LSL +Q + + G FSYCL S + LILG K N
Sbjct: 236 RGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNK 295
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRL 366
YT+M+ N + FY + L GISIG K++ A GF + GG+++DSGT T L
Sbjct: 296 PRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTML 355
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK---MEFEGNAEMTV 423
P S+Y ++ AEF + DT + Y + N+ V + F GN V
Sbjct: 356 PASLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVV 415
Query: 424 DVTGIVYFVK---------SDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDT 469
+ YF + CL L + E E +GNYQQ+ V+YD
Sbjct: 416 -LPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDL 474
Query: 470 KNSQLGFAGEDCSSM 484
+N ++GFA C+S+
Sbjct: 475 ENKRVGFARRQCASL 489
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 116/394 (29%), Positives = 166/394 (42%), Gaps = 64/394 (16%)
Query: 149 IVDTGSDLTWVQCQPCK----------SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
+VDTGSDL W QC C+ C+ Q P ++ S+S + + V C+
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 199 ATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC---GRNN 253
A +G D C SYG G G LG + +S FGC R +
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVTLAFGCVSQTRIS 195
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKN 312
G G SG++GLGR LSLVSQ + FSYCL P +D + L +G
Sbjct: 196 PGALNGASGIIGLGRGALSLVSQLNAT---EFSYCLTPYFRDTVSPSHLFVGDGELAGLR 252
Query: 313 ST---------PITYTNMIPNPQ---LATFYILNLTGISIGGK--QLQASGFA------- 351
+ P+T NP+ +TFY L L G++ G L A F
Sbjct: 253 AAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLREAAPK 312
Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF-----PSAPGFSILDTCFNL---- 400
GG LIDSG+ TRL + AL E +Q G P A L+ C
Sbjct: 313 VWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDG 372
Query: 401 SAYQEVNIPLVKMEFE----GNAEMTVDVTGIVYFVKSDASQVCLALASLSY------ED 450
+ +P + + F+ G E+ + Y+ + +AS C+A+ S + +
Sbjct: 373 DSLAAAAVPPLVLRFDDGVGGGRELVIPAEK--YWARVEASTWCMAVVSSASGNATLPTN 430
Query: 451 ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
ET IIGN+ Q++ RV+YD N L F +CS++
Sbjct: 431 ETTIIGNFMQQDMRVLYDLANGLLSFQPANCSAV 464
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 135/307 (43%), Gaps = 33/307 (10%)
Query: 123 PLTSGIRLQTLN---YIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
P+T+ L T + Y+ + +G + T I+DTGSDL W QC PC C +Q P FD
Sbjct: 74 PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDV 133
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S +Y+ + C SS C +L +S C C Y YGD + T G L E
Sbjct: 134 KKSATYRALPCRSSRCASL-----SSPSCFKKM---CVYQYYYGDTASTAGVLANETFTF 185
Query: 238 G-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
G K + FGCG N G SG++G GR LSLVSQ FSYCL S
Sbjct: 186 GAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGP---SRFSYCLTSY 242
Query: 293 QDAGASGSLILGGNSSVFKNST----PITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
A S L G +++ +T P+ T + NP L Y L+L IS+G K L
Sbjct: 243 LSATPS-RLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPID 301
Query: 349 GFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
GG++IDSGT IT L Y A++ + LDTCF
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLDTCFQWP 361
Query: 402 AYQEVNI 408
V +
Sbjct: 362 PPPNVTV 368
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 165/375 (44%), Gaps = 50/375 (13%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ + +G G + ++ DTGS L W QC+PC + Q P+F+ + S +Y+ + C
Sbjct: 91 YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQH-- 148
Query: 193 CHALEFATGNSGV--CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
+F T N V C C Y ++Y GS T G ++ L + F FGC
Sbjct: 149 ----QFCTNNQNVFQCRDDK---CVYRIAYAGGSATAGVAAQDILQSAENDRIPFYFGCS 201
Query: 251 RNNKGL-----FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP--STQDAGASGSLIL 303
R+N+ G G++GL S +SL+ Q + I FSYCL + SL+
Sbjct: 202 RDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLR 261
Query: 304 GGN----SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----K 352
GN S STP +PN Y LNL +S+ G ++Q FA
Sbjct: 262 FGNDIRKSRRKYLSTPFVSPRGMPN------YFLNLIDVSVAGNRMQIPPGTFALKPDGT 315
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD------TCFNLSAYQEV 406
GG +IDSGT +T + + Y + F F GF ++ C+ +
Sbjct: 316 GGTIIDSGTAVTYISQTAYFPVITAFKNYFDQH----GFQRVNIQLSGYICYKQQGHTFH 371
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
N P + F+G A+ V+ VY D C+AL +S + T IIG Q N + I
Sbjct: 372 NYPSMAFHFQG-ADFFVE-PEYVYLTVQDRGAFCVALQPISPQQRT-IIGALNQANTQFI 428
Query: 467 YDTKNSQLGFAGEDC 481
YD N QL F E+C
Sbjct: 429 YDAANRQLLFTPENC 443
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 118/417 (28%), Positives = 183/417 (43%), Gaps = 65/417 (15%)
Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+PL+ G TL++ + + +T+ +DTGSDL W C P K + P +P+ SP
Sbjct: 62 LPLSPGSDY-TLSFNLGPQAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPN-EPNASP 119
Query: 182 SYK-----KVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDG 223
V C S C A S +C+++ P DC ++ +YGDG
Sbjct: 120 PTNITQSVAVSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDG 179
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI--- 280
S L R+ L L + +F FGC +G+ G GR LSL +Q + +
Sbjct: 180 SLI-ARLYRDTLSLSSLFLRNFTFGCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQ 235
Query: 281 FGGLFSYCLPS----TQDAGASGSLILGGNSSVFKNS-----TPITYTNMIPNPQLATFY 331
G FSYCL S ++ LILG K YT+M+ NP+ FY
Sbjct: 236 LGNRFSYCLVSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFY 295
Query: 332 ILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
++L GI++G + + A + GG+++DSGT T LP Y+++ EF ++ G
Sbjct: 296 TVSLIGIAVGKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRV-G 354
Query: 385 FPSAPGFSI-----LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQ 438
+ I L C+ L++ +V P + + F G +V + YF + SD S
Sbjct: 355 RDNKRARKIEEKTGLAPCYYLNSVADV--PALTLRFAGGKNSSVVLPRKNYFYEFSDGSD 412
Query: 439 --------VCLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
CL L + E + +GNYQQ+ V YD + ++GFA C+
Sbjct: 413 GAKGKRKVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 161/357 (45%), Gaps = 38/357 (10%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C+ C QDP FDP S +YK + CN ++ +
Sbjct: 94 QQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN------IDCICDSD 147
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
GV C Y Y + S + G LG + + G S +FGC G LF
Sbjct: 148 GV-------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQ 200
Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
G+MGLG DLSLV Q E FS C G G+++LGG S +
Sbjct: 201 RADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGG--GAMVLGGISP--PSDMIF 256
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKG--GILIDSGTVITRLPPSIYSA 373
TY++ + +P +Y ++L I + GK+L +SG G G ++DSGT LP +SA
Sbjct: 257 TYSDPVRSP----YYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLPAEAFSA 312
Query: 374 LKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI----PLVKMEFEGNAEMTVDVTG 427
K + + P + D CF+ + + P V M FE ++++
Sbjct: 313 FKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSLTPEN 372
Query: 428 IVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ CL + + D+T ++G +N V+YD NS++GF +CS +
Sbjct: 373 YFFRHSKVHGAYCLGIFE-NGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNCSEL 428
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 161/357 (45%), Gaps = 38/357 (10%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C+ C QDP FDP S +YK + CN ++ +
Sbjct: 94 QQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN------IDCICDSD 147
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
GV C Y Y + S + G LG + + G S +FGC G LF
Sbjct: 148 GV-------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQ 200
Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
G+MGLG DLSLV Q E FS C G G+++LGG S +
Sbjct: 201 RADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGG--GAMVLGGISP--PSDMIF 256
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKG--GILIDSGTVITRLPPSIYSA 373
TY++ + +P +Y ++L I + GK+L +SG G G ++DSGT LP +SA
Sbjct: 257 TYSDPVRSP----YYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLPAEAFSA 312
Query: 374 LKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI----PLVKMEFEGNAEMTVDVTG 427
K + + P + D CF+ + + P V M FE ++++
Sbjct: 313 FKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSLTPEN 372
Query: 428 IVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ CL + + D+T ++G +N V+YD NS++GF +CS +
Sbjct: 373 YFFRHSKVHGAYCLGIFE-NGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNCSEL 428
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 165/364 (45%), Gaps = 33/364 (9%)
Query: 144 RNMTVIVDTGSDLTWVQCQ-PCKS--CYNQQ------DPVFDPSISPSYKKVLCNSSTCH 194
+ ++ DTGSDLTW+ C+ C+S C N++ VF ++S S+K + C + C
Sbjct: 23 QKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCK 82
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGC 249
+ C + P C Y Y DGS G E + + K +++ + GC
Sbjct: 83 IELMDLFSLTNCPTPLTP-CGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGC 141
Query: 250 GRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNS 307
+ +G F G+MGLG S S + +E FGG FSYCL S L G +
Sbjct: 142 SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSR 201
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-----KGGILIDSGTV 362
S +TYT ++ + +FY +N+ GISIGG L+ GG ++DSG+
Sbjct: 202 SKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSS 260
Query: 363 ITRLPPSIY----SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
+T L Y +AL+ LK F G L+ CFN + ++E +P + F
Sbjct: 261 LTFLTEPAYQPVMAALRVSLLK-FRKVEMDIG--PLEYCFNSTGFEESLVPRLVFHFADG 317
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
AE V Y + + CL S+++ T ++GN Q+N +D +LGFA
Sbjct: 318 AEFEPPVKS--YVISAADGVRCLGFVSVAWPG-TSVVGNIMQQNHLWEFDLGLKKLGFAP 374
Query: 479 EDCS 482
C+
Sbjct: 375 SSCT 378
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 118/378 (31%), Positives = 180/378 (47%), Gaps = 47/378 (12%)
Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
TL T+ +N+T+++DTGS+L+W+ C+ + + F+P +S SY CNSS
Sbjct: 59 TLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSS 114
Query: 192 TCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
C T + + +S P + C+ VSY D S G L E L A+ +FGC
Sbjct: 115 ICTT---RTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGC 171
Query: 250 GRNNKGLFGGV------SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
++ G + +GLMG+ R LSLV+Q S FSYC+ S +D A G L+L
Sbjct: 172 -MDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSL---PKFSYCI-SGED--ALGVLLL 224
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATF-----YILNLTGISIGGK--QLQASGFA----- 351
G + +P+ YT ++ + + Y + L GI + K QL S F
Sbjct: 225 GDGTDA---PSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 281
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE 405
G ++DSGT T L S+YS+LK EFL+Q G + P F +D C++ A
Sbjct: 282 AGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA-SF 340
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYED-ETGIIGNYQQKNQ 463
+P V + F G AEM V ++Y V + V C + E +IG++ Q+N
Sbjct: 341 AAVPAVTLVFSG-AEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNV 399
Query: 464 RVIYDTKNSQLGFAGEDC 481
+ +D S++GF C
Sbjct: 400 WMEFDLLKSRVGFTQTTC 417
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/426 (28%), Positives = 179/426 (42%), Gaps = 73/426 (17%)
Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQQD-----PV 174
+PL+ G +Y + + + +++ +DTGSDL W CQP C C + +
Sbjct: 74 LPLSPGS-----DYTLSFTINSQPISLYLDTGSDLVWFPCQPFECILCEGKAENASLAST 128
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYG 221
P +S + V C SS C A+ +S +C+ S+ P DC ++ +YG
Sbjct: 129 PPPKLSKTATPVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYG 188
Query: 222 DGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
DGS L R+ + L ++ N+F FGC G+ G GR LSL +Q
Sbjct: 189 DGSLI-ARLYRDSIRLPLSNQTNLIFNNFTFGCAHTT---LAEPIGVAGFGRGVLSLPAQ 244
Query: 277 TSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSSVFKN-------STPITYTNMI 322
+ + G FSYCL S + LILG K YT+M+
Sbjct: 245 LATLSPQLGNQFSYCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSML 304
Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALK 375
NP+ FY + L GISIG K++ A F + GG+++DSGT T LP S+Y +
Sbjct: 305 DNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVV 364
Query: 376 AEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
AEF + + L C+ V + F GN V ++
Sbjct: 365 AEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVP-RVVLHFVGNGSSVVLPRRNYFY 423
Query: 432 VKSDASQV--------CLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAG 478
D CL L + E E +GNYQQ+ V+YD +N ++GFA
Sbjct: 424 EFLDGGHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFAR 483
Query: 479 EDCSSM 484
C+S+
Sbjct: 484 RQCASL 489
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 121/419 (28%), Positives = 184/419 (43%), Gaps = 68/419 (16%)
Query: 122 IPLTSGIRLQTLNYIATIELGGRN----MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
+PL+ G +Y + LG R +T+ +DTGSDL W C P K + P P
Sbjct: 40 LPLSPGS-----DYTLSFNLGPRAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNASP 94
Query: 178 SISPSYK-KVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDG 223
++ + V C S C A S +C+++ P DC ++ +YGDG
Sbjct: 95 PVNTTRSVAVSCKSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDG 154
Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI--- 280
S L R+ L L + +F FGC +G+ G GR LSL +Q + +
Sbjct: 155 SLI-ARLYRDTLSLSSLFLRNFTFGCAYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQ 210
Query: 281 FGGLFSYCLPS----TQDAGASGSLILG------GNSSVFKNSTPITYTNMIPNPQLATF 330
G FSYCL S ++ LILG V YT M+ NP+ F
Sbjct: 211 LGNRFSYCLVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYF 270
Query: 331 YILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
Y + L GIS+G + + A + GG+++DSGT T LP Y+++ EF +
Sbjct: 271 YTVGLIGISVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGV- 329
Query: 384 GFPSAPGFSI-----LDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVY-FVK-SD 435
G + I L C+ L++ EV P++ + F G N+ + + Y F+ D
Sbjct: 330 GRVNERARKIEEKTGLAPCYYLNSVAEV--PVLTLRFAGGNSSVVLPRKNYFYEFLDGRD 387
Query: 436 ASQV-----CLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
A++ CL L + E E +GNYQQ+ V YD + ++GFA C+S+
Sbjct: 388 AAKGKRRVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCASL 446
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/403 (29%), Positives = 184/403 (45%), Gaps = 50/403 (12%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVD 151
D +Q+L S + + +P+ SG ++ Q YI ++G + M + +D
Sbjct: 5 DKARLQFLSSLVAR----------KSVVPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMD 54
Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
T SD+ W+ PC C +F+ S +YK + C ++ C + T GVCS
Sbjct: 55 TSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCS---- 107
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
+ ++YG GS L ++ + L +V + FGC + G GL+GLGR L
Sbjct: 108 ----FNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPL 162
Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
SL+SQT ++ FSYCLPS + SGSL LG + I YT ++ NP+ + Y
Sbjct: 163 SLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR----IKYTPLLKNPRRPSLY 218
Query: 332 ILNLTGISI---------GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
+NL + + G S A G + DSGTV TRL Y A++ F +
Sbjct: 219 FVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVFTRLVTPAYIAVRDAFRNRV 276
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCL 441
+ DTC+ + + P + F G M V + + S A S CL
Sbjct: 277 GRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNVTLPPDNLLIHSTAGSTTCL 329
Query: 442 ALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
A+A+ + +I N QQ+N R++YD NS+LG A E C+
Sbjct: 330 AMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 372
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 158/350 (45%), Gaps = 36/350 (10%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
I+DTGS++ WV+C PCK C Q P+ DPS S +Y + C ++ CH S C+
Sbjct: 114 AIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMCH-----YAPSAYCN 168
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFG-GVS 261
+ C Y +SY G + G L E L G +V +FGC N +
Sbjct: 169 RLN--QCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHENGDYKDRRFT 226
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN-STPITYTN 320
G+ GLG+ S V++ G FSYCL + D + ++ G + F+ STP+ N
Sbjct: 227 GVFGLGKGITSFVTR----MGSKFSYCLGNIADPHYGYNQLVFGEKANFEGYSTPLKVVN 282
Query: 321 MIPNPQLATFYILNLTGISIGGKQL--QASGFAKGG----ILIDSGTVITRLPPSIYSAL 374
Y + L GIS+G K+L ++ F+ G LIDSGT +T L S + AL
Sbjct: 283 --------GHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTALTWLAESAFRAL 334
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
E + G P + C+ + Q+ + P+V F G A++ +D + Y
Sbjct: 335 DNEVRQLLDGV-LMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQAT 393
Query: 434 SDASQVCLALASLSYED--ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
D + + AS D +IG Q+ + YD +++L F DC
Sbjct: 394 PDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRIDC 443
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 119/403 (29%), Positives = 189/403 (46%), Gaps = 48/403 (11%)
Query: 110 ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYN 169
I N S ++P I +L T+ +N+++++DTGS+L+W+ C + +
Sbjct: 11 IPSNSFPRSPNKLPFRHNI---SLTVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTS 67
Query: 170 QQDPVFDPSISPSYKKVLCNSSTC--HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
F+ + S SY+ + C+SSTC +F+ S C S+S C+ +SY D S +
Sbjct: 68 YPT-TFNQTRSISYRPIPCSSSTCTNQTRDFSIPAS--CDSNS--LCHATLSYADASSSE 122
Query: 228 GELGREHLGLGKASVNDFIFGCG----RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
G L + +G + + +FGC +N +GLMG+ R LS VSQ
Sbjct: 123 GNLASDTFHMGASDIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMG---FP 179
Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI----PNPQLATF-YILNLTGI 338
FSYC+ T SG L+LG S F + P+ YT ++ P P Y + L GI
Sbjct: 180 KFSYCISGTD---FSGMLLLG--ESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGI 234
Query: 339 SIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA--- 388
+ + L + G ++DSGT T L Y+AL++EFL Q +GF
Sbjct: 235 KVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLED 294
Query: 389 PGFSI---LDTCFNLSAYQEV--NIPLVKMEFEGNAEMTVDVTGIVYFV----KSDASQV 439
P F +D C+ + Q V +P V + F G AEMTV ++Y V + + S
Sbjct: 295 PDFVFQGAMDLCYRVPISQRVLPRLPTVSLVFNG-AEMTVADERVLYRVPGEIRGNDSVH 353
Query: 440 CLALASLSYED-ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
CL+ + E +IG++ Q+N + +D + S++G A C
Sbjct: 354 CLSFGNSDLLGVEAYVIGHHHQQNVWMEFDLERSRIGLAQVRC 396
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 119/445 (26%), Positives = 197/445 (44%), Gaps = 55/445 (12%)
Query: 67 ITLELKHKNYCSGKIVDWNEQQQNRL--ILDNL-----HVQYLQSRIKNMISGNIKDVSN 119
+T +L H++ + N+ ++R +L N +VQ + R ++ + D S
Sbjct: 35 VTTKLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSA 94
Query: 120 TEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
+ + + + ++ +G + ++DTGS LTW+QC+PC +C+ Q+ P+++P
Sbjct: 95 ADDAYEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNP 154
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S S S+ +F ++ ++ DCNY +Y D + TRG RE L
Sbjct: 155 SSS---------STYVSCSDFDRTDTTFTATHG-SDCNYSQTYADKTTTRGTYAREQLLF 204
Query: 238 -----GKASVNDFIFGCGRNNK---GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
G ++D IFGCG NN G G SG+ GLG S S++S+ G FSYC+
Sbjct: 205 ETPDDGITIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISK----LGFGFSYCI 260
Query: 290 PSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
+ D L LG + STP ++P Y + L GISIG ++L
Sbjct: 261 GNIGDPLYGFHRLTLGNKLKIEGYSTP-----LVPR----GLYYITLVGISIGQERLDID 311
Query: 349 GFA---------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTC 397
I+IDSG ++ +P Y+ ++ + SGF S + L C
Sbjct: 312 PIVFQRVDLNGISSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLC 371
Query: 398 FNLSAYQEVN-IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
+ Q++ P A++ V G+ F + + +CLAL ++ET +IG
Sbjct: 372 YIGKLNQDLQGFPDATFHLADGADLVFQVEGL--FFQYTDNVLCLALVPTESDEETCLIG 429
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
Q+ V YD K +L F +C
Sbjct: 430 LLAQQYYNVAYDLKQQKLYFQRIEC 454
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 168/361 (46%), Gaps = 46/361 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C+ C QDP F P +S +Y+ V C + C+ G++
Sbjct: 100 QRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC-TPDCN----CDGDT 154
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
C Y Y + S + G LG + + G S +FGC + G L+
Sbjct: 155 N--------QCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCENDETGDLYSQ 206
Query: 259 GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKN 312
G+MGLGR DLS++ Q ++ FS C D G G++ILGG S VF +
Sbjct: 207 RADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY-GGMDVGG-GAMILGGISPPEDMVFTH 264
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
S +P + +Y +NL + + GK+LQ + K G ++DSGT LP +
Sbjct: 265 S----------DPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLPET 314
Query: 370 IYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTV 423
+ A K +K+ + + P + D CF + + P+V M FE ++++
Sbjct: 315 AFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLSL 374
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ CL + S + D T ++G +N V+YD +NS++GF +CS
Sbjct: 375 SPENYLFRHSKVRGAYCLGVFS-NGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNCSE 433
Query: 484 M 484
+
Sbjct: 434 L 434
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 170/368 (46%), Gaps = 43/368 (11%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
+N+T+++DTGS+L+W+ C P F P S ++ V C+S+ C + + +
Sbjct: 77 QNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCRSRDLPSP 136
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRNNKGL 256
+ C +S C +SY DGS + G L E +G+ FGC + G+
Sbjct: 137 PA--CDGAS-KQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGV 193
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
+GL+G+ R LS VSQ S FSYC+ DAG L+LG + F N T
Sbjct: 194 --ATAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV---LLLGHSDLPFLPLNYT 245
Query: 315 PITYTNMIPNPQL-ATFYILNLTGISIGGKQL--QASGFAK-----GGILIDSGTVITRL 366
P+ Y +P P Y + L GI +GGK L AS A G ++DSGT T L
Sbjct: 246 PL-YQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFL 304
Query: 367 PPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQ--EVNIPLVKMEFEGN 418
YSALKAEF +Q + A P F+ DTCF + + +P V + F G
Sbjct: 305 LGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNG- 363
Query: 419 AEMTVDVTGIVYFVKSDAS----QVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQ 473
A+MTV ++Y V + CL + T +IG++ Q N V YD + +
Sbjct: 364 AQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGR 423
Query: 474 LGFAGEDC 481
+G A C
Sbjct: 424 VGLAPIRC 431
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 110/370 (29%), Positives = 177/370 (47%), Gaps = 45/370 (12%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
++A + +G N+ V++DTGSDL W+QC+PC CY Q+DP+++ + S SY ++LCN
Sbjct: 106 FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPP 165
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND-----FIF 247
C +L G G CS S C Y SY DGS T G L E + ++ F
Sbjct: 166 CLSL----GREGQCSDSG--SCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGF 219
Query: 248 GCGRNNKGLFGGVSG--LMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLIL 303
GCG N ++GLG +SLVSQ S I F+YC + + A G L+
Sbjct: 220 GCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVF 279
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ----LQASGFAK-----GG 354
G + + + TP+ +A FY +NL GI +G ++ + +S F + GG
Sbjct: 280 GDATYLNGDMTPMV---------IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGG 330
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNI-PLVK 412
++IDSG+ ++ PP +Y ++ + + G+ +P S D CF +++ + P +
Sbjct: 331 VIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-CFEGKIGRDLPLFPTLV 389
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
+ E + D I F++ CL S + IIG Q++ + Y+ + S
Sbjct: 390 LYLESTGILN-DRWSI--FLQRYDELFCLGFTS---GEGLSIIGTLAQQSYKFGYNLELS 443
Query: 473 QLGF-AGEDC 481
L + DC
Sbjct: 444 TLSIESNPDC 453
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 174/374 (46%), Gaps = 51/374 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
+N+++++DTGS+L+W++C + +PV FDP+ S SY + C+S TC
Sbjct: 84 QNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND--FIFGCGRNNKG---- 255
C S C+ +SY D S + G L E G S ND IFGC + G
Sbjct: 140 IPASCDSDK--LCHATLSYADASSSEGNLAAEIFHFGN-STNDSNLIFGCMGSVSGSDPE 196
Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
+GL+G+ R LS +SQ FSYC+ T D G L+LG S F TP
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMGF---PKFSYCISGTDD--FPGFLLLG--DSNFTWLTP 249
Query: 316 ITYTNMI----PNPQLATF-YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVI 363
+ YT +I P P Y + LTGI + GK L G ++DSGT
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQF 309
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQEV-----NIPLVK 412
T L +Y+AL+++FL Q +G + P F +D C+ +S ++ +P V
Sbjct: 310 TFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVS 369
Query: 413 MEFEGNAEMTVDVTGIVYFVKS----DASQVCLALASLSYED-ETGIIGNYQQKNQRVIY 467
+ FEG AE+ V ++Y V + S C + E +IG++ Q+N + +
Sbjct: 370 LVFEG-AEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428
Query: 468 DTKNSQLGFAGEDC 481
D + S++G A C
Sbjct: 429 DLQRSRIGLAPVQC 442
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 152/361 (42%), Gaps = 66/361 (18%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ I +G V I DTGSDL W QC PC SCY Q++P+FDPS S S+K+V C S
Sbjct: 24 YLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQ 83
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
C L+ T S+ + +FGCG N
Sbjct: 84 CRLLDTPT----------------------------------------SILNIVFGCGHN 103
Query: 253 NKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG--LFSYCL-PSTQDAGASGSLILGGNSS 308
N G F GL G G LSL SQ G FS CL P D + +I G +
Sbjct: 104 NSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAE 163
Query: 309 VFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGT 361
V + STP+ + +P T+Y + L GIS+G K S + KG + ID+GT
Sbjct: 164 VSGSDVVSTPLVTKD---DP---TYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGT 217
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
T LP Y+ L + P C+ + ++ P++ F+G
Sbjct: 218 PPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATL--IDGPILTAHFDG---A 272
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
V + + F+ C A+ + + +TGI GN+ Q N + +D ++ F DC
Sbjct: 273 DVQLKPLNTFISPKEGVYCFAMQPI--DGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 330
Query: 482 S 482
+
Sbjct: 331 T 331
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 123/418 (29%), Positives = 188/418 (44%), Gaps = 48/418 (11%)
Query: 80 KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
K + W E D +QYL N+++ + +P+ SG ++ Q+ YI
Sbjct: 60 KPMSWEESVLQLQAKDQARMQYL----SNLVA------RRSIVPIASGRQITQSPTYIVR 109
Query: 139 IELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
+ G T++ +DT +D WV C C C F P S ++KKV C +S C +
Sbjct: 110 AKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTP--FAPPKSTTFKKVGCGASQCKQV 167
Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
T C S+ C + +YG S L ++ + L V + FGC + G
Sbjct: 168 RNPT-----CDGSA---CAFNFTYGTSSVA-ASLVQDTVTLATDPVPAYTFGCIQKATGS 218
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GL+GLGR LSL++QT +++ FSYCLPS + SG L + P
Sbjct: 219 SLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGHXDLXPVAQPRDQVYP- 277
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQL----QASGFAK---GGILIDSGTVITRLPPS 369
NP+ ++ Y +NL I +G + + +A F G + DSGTV TRL
Sbjct: 278 ----SFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVFTRLVEP 333
Query: 370 IYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
Y+A++ EF ++ S S+ DTC+ + + P + F G M V +
Sbjct: 334 AYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTV----PIVAPTITFMFSG---MNVTLPP 386
Query: 428 IVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S A V CLA+A + +I N QQ+N RV++D NS+LG A E C+
Sbjct: 387 DNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELCT 444
>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
Length = 484
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 120/428 (28%), Positives = 188/428 (43%), Gaps = 51/428 (11%)
Query: 85 NEQQQNRLILDNLHVQYLQSRIKNMISGNIKD----------VSNTEIPLTSGIRLQTL- 133
N+Q + R + D LH L R+++++ + IP + G ++ L
Sbjct: 77 NQQPERRSVADVLHRDAL--RLRSLLHREEDNHRTPAPAAPPGGGVSIP-SRGEPIEELP 133
Query: 134 -----NYIATIELGGRNMTVIVDTGSD-LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
+ +A + + V DT + T +QC PC S D FDPS S S +V
Sbjct: 134 GAFEYHVVAGFGTPMQKLPVGFDTTTTGATLLQCTPCGS---GADHAFDPSASSSVSQVP 190
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD---GSYTRGELGREHLGLGKASVND 244
C S C F G SG P C VS+ + G+ T A+V+
Sbjct: 191 CGSPDC---PF-HGCSGR------PSCTLSVSFNNTLLGNATFFTDTLTLTPSSSATVDK 240
Query: 245 FIFGC--GRNNKGLFGGVSGLMGLGRSDLSLVSQ---TSEIFGGLFSYCLP-STQDAGAS 298
F F C G G +G++ L R+ SL S+ +S FSYCLP ST D G
Sbjct: 241 FRFACLEGIAPGPAEDGSAGILDLSRNSHSLPSRLVASSPPHAVAFSYCLPASTADVGF- 299
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--IL 356
L LG ++YT + +P Y+++L G+ +GG L A G +
Sbjct: 300 --LSLGATKPELLGRK-VSYTPLRGSPSNGNLYVVDLVGLGLGGPDLPIPPAAIAGDDTI 356
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
++ T T L P +Y L+ F K S +P+AP LDTC+N + ++P V ++F
Sbjct: 357 LELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSLDTCYNFTGLDAFSVPAVTLKFA 416
Query: 417 GNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQ 473
G A++ + + ++YF D S CLA + + + G +IG+ Q + V+YD + +
Sbjct: 417 GGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSMAQMSTEVVYDVRGGK 476
Query: 474 LGFAGEDC 481
+GF C
Sbjct: 477 VGFVPYRC 484
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 166/387 (42%), Gaps = 62/387 (16%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
LT+G L YI T + +IVD+GS +T+V C C+ C N QDP F P +S SY
Sbjct: 83 LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSY 138
Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
V CN TC S C Y Y + S + G LG + + G+ S
Sbjct: 139 SPVKCNVDCTC--------------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE 184
Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
IFGC + G LF G+MGLGR LS++ Q E + FS C
Sbjct: 185 LKPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIG 244
Query: 296 GASGSLILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
G G+++LGG +F NS P+ + +Y + L I + GK L+
Sbjct: 245 G--GAMVLGGMLAPPDMIFSNSDPLR----------SPYYNIELKEIHVAGKALRVESRI 292
Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFSILDTCF---- 398
+K G ++DSGT LP + A K LK+ G P S D CF
Sbjct: 293 FNSKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRG----PDPSYKDICFAGAG 348
Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
N+S EV P V M F ++++ ++ CL + + +D T ++G
Sbjct: 349 RNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQ-NGKDPTTLLGG 406
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+N V YD N ++GF +CS +
Sbjct: 407 IIVRNTLVTYDRHNEKIGFWKTNCSEL 433
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 162/360 (45%), Gaps = 25/360 (6%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
TI + + I+D +L W QC C C+ Q P+F P+ S +++ C + C +
Sbjct: 48 TIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTP 107
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
+ + VC+ S + D T G +G E +G A+ + FGC ++
Sbjct: 108 TSNCSGDVCTYESTTNIRL-----DRHTTLGIVGTETFAIGTATAS-LAFGCVVASDIDT 161
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
G SG +GLGR+ SLV+Q FSYCL S + G S L LG ++ + ++++
Sbjct: 162 MDGTSGFIGLGRTPRSLVAQMKLT---KFSYCL-SPRGTGKSSRLFLGSSAKLAGGESTS 217
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVITRLPPSIYSA 373
+ P+ +Y+L+L I G + + GGIL+ + + + L S Y A
Sbjct: 218 TAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA--QSGGILVMHTVSPFSLLVDSAYRA 275
Query: 374 LKAEFLKQFSGF---PSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTGIV 429
K + G P A D CF +A + P + F+G A +TV +
Sbjct: 276 FKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYL 335
Query: 430 YFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
V + C A+ S+++ + TG ++G+ QQ++ +YD K L F DCSS+
Sbjct: 336 IDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCSSL 395
>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
oleracea]
Length = 165
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 95/156 (60%), Gaps = 4/156 (2%)
Query: 329 TFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
+FY L++ GIS+GG++L + F+ G LIDSGTVI+RLPP Y+AL+ F + S +
Sbjct: 12 SFYGLDIVGISVGGQKLAIPQTVFSTPGALIDSGTVISRLPPKAYAALRGAFKAKMSQYK 71
Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
+ SILDTCF+L+ ++ V IP V F G A + + G++Y K SQVCLA A
Sbjct: 72 NTSAVSILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLYAFK--MSQVCLAFAGN 129
Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S ++ I GN QQ+ V+YD ++GFA CS
Sbjct: 130 SDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGCS 165
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 104/396 (26%), Positives = 182/396 (45%), Gaps = 51/396 (12%)
Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----Y 168
+++ ++PL R+ ++ Y I+LG + V VDTGSD+ W+ C+PC C
Sbjct: 55 LASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNL 114
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
N + +FD + S + KKV C+ C + S C + C+Y + Y D S + G
Sbjct: 115 NFRLSLFDMNASSTSKKVGCDDDFCSFIS----QSDSCQPAL--GCSYHIVYADESTSDG 168
Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ R+ L L + + + + +FGCG + G G V G+MG G+S+ S++SQ
Sbjct: 169 KFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQ 228
Query: 277 TSEIFGG--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
+ +FS+CL + + G ++ +S + T M+PN Y +
Sbjct: 229 LAATGDAKRVFSHCLDNVKGGGIFAVGVV--------DSPKVKTTPMVPN---QMHYNVM 277
Query: 335 LTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
L G+ + G L S GG ++DSGT + P +Y +L L +
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILAR-----QPVKLH 332
Query: 393 ILDT---CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
I++ CF+ S + P V EFE + ++TV ++ ++ + L+ +
Sbjct: 333 IVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTD 392
Query: 450 DETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + +I G+ N+ V+YD N +G+A +CSS
Sbjct: 393 ERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSS 428
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 155/350 (44%), Gaps = 37/350 (10%)
Query: 154 SDLTWVQCQPCKS------CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
S ++ ++C+PC S D FDPS+S S++ VLC S C + G S
Sbjct: 158 SGMSNMRCKPCFSGSSGGETTTTCDVAFDPSMSSSFRSVLCGSPDCGGHSCSAGGS---- 213
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGL 266
C + + + G + + L L A+ +F GC + + LF + + +
Sbjct: 214 ------CTFTLQNSTFVFGNGTIVMDTLTLSPSATFENFAVGCMQLDNDLF---TDGVAV 264
Query: 267 GRSDLSL--------VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
G DLSL V +S FSYCLP+ D G L + S + + + Y
Sbjct: 265 GNIDLSLSRHSLATRVLNSSPPGMAAFSYCLPA--DTDTHGFLTIAPALSDYSDHAGVKY 322
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKA 376
++ NP FY ++L I+I G+ L + F G +IDS + T L P IY+AL+
Sbjct: 323 VPLVTNPTGPNFYYVDLVAIAINGEDLPIPPALFTGNGTMIDSQSAFTYLNPPIYAALRD 382
Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
EF K + P F LDTC+N + + + +P + + F M +D +YF +
Sbjct: 383 EFRKAMLQYQPVPAFGGLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHL 442
Query: 437 SQ----VCLALASLSYED-ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ CLA A+ ++ +G+ Q+ + ++YD + + F C
Sbjct: 443 TDGFPFGCLAFAAAPDQNFPWNYLGSQVQRTKEIVYDVRGGMVAFVPSRC 492
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 106/422 (25%), Positives = 180/422 (42%), Gaps = 47/422 (11%)
Query: 95 DNLHVQ-YLQSRIKNMISGNIKD---VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTV 148
D+LH Y++S++ + G S +PL+SG T Y +G + +
Sbjct: 57 DDLHRHAYIRSQLASSRRGRRAAEVGASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFVL 116
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDP----VFDPSISPSYKKVLCNSSTCHA-LEFATGNS 203
+ DTGSDLTWV+C+ + VF + S S+ + C+S TC + + F+ N
Sbjct: 117 VADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIACSSDTCTSYVPFSLAN- 175
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----------------KASVNDFIF 247
CSS + P C Y Y DGS RG +G + + +A + +
Sbjct: 176 --CSSPASP-CAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRRAKLQGVVL 232
Query: 248 GCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
GC G F G++ LG S++S S+ + FGG FSYCL A+ L G
Sbjct: 233 GCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGP 292
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-----KGGILIDSG 360
++ P T ++ + ++ FY + + + + G+ L GG ++DSG
Sbjct: 293 GATA-----PAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDRNGGAILDSG 347
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
T +T L Y A+ K +G P + C+N + + IP +++ F G+A
Sbjct: 348 TSLTILATPAYRAVVTALSKHLAGLPRV-TMDPFEYCYNWTDAGALEIPKMEVHFAGSAR 406
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + Y + + C+ + S+ +IGN Q+ +D ++ L F
Sbjct: 407 L--EPPAKSYVIDAAPGVKCIGVQEGSWPG-VSVIGNILQQEHLWEFDLRDRWLRFKHTR 463
Query: 481 CS 482
C+
Sbjct: 464 CA 465
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 169/368 (45%), Gaps = 43/368 (11%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
+N+T+++DTGS+L+W+ C P F P S ++ V C S+ C + + +
Sbjct: 76 QNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCRSRDLPSP 135
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRNNKGL 256
+ C +S C +SY DGS + G L E +G+ FGC + G+
Sbjct: 136 PA--CDGAS-KQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGV 192
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
+GL+G+ R LS VSQ S FSYC+ DAG L+LG + F N T
Sbjct: 193 --ATAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV---LLLGHSDLPFLPLNYT 244
Query: 315 PITYTNMIPNPQL-ATFYILNLTGISIGGKQL--QASGFAK-----GGILIDSGTVITRL 366
P+ Y +P P Y + L GI +GGK L AS A G ++DSGT T L
Sbjct: 245 PL-YQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFL 303
Query: 367 PPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQ--EVNIPLVKMEFEGN 418
YSALKAEF +Q + A P F+ DTCF + + +P V + F G
Sbjct: 304 LGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNG- 362
Query: 419 AEMTVDVTGIVYFVKSDAS----QVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQ 473
A+MTV ++Y V + CL + T +IG++ Q N V YD + +
Sbjct: 363 AQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGR 422
Query: 474 LGFAGEDC 481
+G A C
Sbjct: 423 VGLAPIRC 430
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 164/360 (45%), Gaps = 54/360 (15%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
++DTGS LTWV C PC SC Q P+FDPS S +Y + C S C+ + G
Sbjct: 109 VMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSC--SECNKCDVVNG------- 159
Query: 209 SSPPDCNYFVSY-GDGS----YTRGELGREHLGLGKASVNDFIFGCGR-----NNKGLFG 258
+C Y V Y G GS Y R +L E + V IFGCGR +N +
Sbjct: 160 ----ECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQ 215
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPIT 317
G++G+ GLG SL+ FG FSYC+ + ++ L+LG +++ +ST +
Sbjct: 216 GINGVFGLGSGRFSLLPS----FGKKFSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLN 271
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAK------GGILIDSGTVITRLPPS 369
N + Y +NL ISIGG++L + F + G++IDSG T L
Sbjct: 272 VINGL--------YYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKY 323
Query: 370 IYSALKAEFLKQFSG---FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDV 425
+ L E G + C++ Q+++ PLV F A + +DV
Sbjct: 324 GFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDV 383
Query: 426 TGIVYFVKSDASQVCLALASLSY--EDETGI--IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
T + F+++ ++ C+A+ +Y +D IG Q+N V YD ++ F DC
Sbjct: 384 TSM--FIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 117/378 (30%), Positives = 178/378 (47%), Gaps = 47/378 (12%)
Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
TL TI +N+T+++DTGS+L+W+ C+ + + F+P +S SY CNSS
Sbjct: 58 TLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSS 113
Query: 192 TCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
C T + + +S P + C+ VSY D S G L E L A+ +FGC
Sbjct: 114 VCMT---RTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGC 170
Query: 250 GRNNKGLFGGV------SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
++ G + +GLMG+ R LSLV+Q + FSYC+ S +D A G L+L
Sbjct: 171 -MDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQ---MVLPKFSYCI-SGED--AFGVLLL 223
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATF-----YILNLTGISIGGK--QLQASGFAK---- 352
G S +P+ YT ++ + + Y + L GI + K QL S F
Sbjct: 224 GDGPSA---PSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 280
Query: 353 -GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE 405
G ++DSGT T L +Y++LK EFL+Q G + P F +D C++ A
Sbjct: 281 AGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA-SL 339
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYED-ETGIIGNYQQKNQ 463
+P V + F G AEM V ++Y V V C + E +IG++ Q+N
Sbjct: 340 AAVPAVTLVFSG-AEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNV 398
Query: 464 RVIYDTKNSQLGFAGEDC 481
+ +D S++GF C
Sbjct: 399 WMEFDLVKSRVGFTETTC 416
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 122/433 (28%), Positives = 188/433 (43%), Gaps = 38/433 (8%)
Query: 63 EMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
E + T EL H++ + + + +E RL +R ++IS +I + E
Sbjct: 33 EKLSFTTELIHRDSPNSPLFNASETTDIRLANAVERSADRVNRFNDLISNSI---TAAEF 89
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD-PVFDPSI 179
P L +++ I +G + V V TGSDL W+ C K C + D FDP
Sbjct: 90 PSI----LDNGDFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPME 145
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S +YK V C+S C AT C S P S G+L + L L
Sbjct: 146 SSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPR-------HQDSCPDGDLAMDTLTLNS 198
Query: 240 ASVNDFI-----FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
+ F+ F CG G + GV G++GLG LSL+++ S + G FS+C+
Sbjct: 199 TTGKSFMLPNTGFICGNRIGGDYPGV-GILGLGHGSLSLLNRISHLIDGKFSHCI-VPYS 256
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG----F 350
+ + L G + V ++ T +M P Y L+ GIS+G K + A G +
Sbjct: 257 SNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYS---YTLSFYGISVGNKSISAGGIGSDY 313
Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS-ILDTCFNLSAYQEVNIP 409
G+ +DSGT+ T P YS L+ + P P + L C+ S + + P
Sbjct: 314 YMNGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRYSP--DFSPP 371
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
+ M FEG + V+++ F++ VCLA A+ S E + + G +QQ N + YD
Sbjct: 372 TITMHFEGGS---VELSSSNSFIRMTEDIVCLAFATSSSEQD-AVFGYWQQTNLLIGYDL 427
Query: 470 KNSQLGFAGEDCS 482
L F DC+
Sbjct: 428 DAGFLSFLKTDCT 440
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 182/385 (47%), Gaps = 54/385 (14%)
Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQ--DPVFDPSISPSYKKVLCN 189
TL T+ +++T+++DTGS+L+W+ C+ QQ + VF+P +S SY + C
Sbjct: 69 TLTVSLTVGTPPQSVTMVLDTGSELSWLHCK------KQQNINSVFNPHLSSSYTPIPCM 122
Query: 190 SSTC--HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
S C +F S C S++ C+ VSY D + G L + + + IF
Sbjct: 123 SPICKTRTRDFLIPVS--CDSNN--LCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIF 178
Query: 248 GCG----RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
G +N +GLMG+ R LS V+Q FSYC+ S +D ASG L+
Sbjct: 179 GSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGF---PKFSYCI-SGKD--ASGVLLF 232
Query: 304 GGNSSVFKNSTPITYTNMIP-NPQLATF----YILNLTGISIGGKQLQASG--FA----- 351
G + FK P+ YT ++ N L F Y + L GI +G K LQ FA
Sbjct: 233 G--DATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTG 290
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE 405
G ++DSGT T L S+Y+AL+ EF+ Q G + P F +D CF +
Sbjct: 291 AGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGV 350
Query: 406 V-NIPLVKMEFEGNAEMTVDVTGIVYFV-------KSDASQVCLALASLSYED-ETGIIG 456
V +P V M FEG AEM+V ++Y V K + CL + E +IG
Sbjct: 351 VPAVPAVTMVFEG-AEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIG 409
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
++ Q+N + +D NS++GFA C
Sbjct: 410 HHHQQNVWMEFDLVNSRVGFADTKC 434
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 171/384 (44%), Gaps = 58/384 (15%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
Y A I LG ++ V VDTGSD+ WV C C C + D ++DP S S ++
Sbjct: 82 YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIY 141
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
C+ C A G C+ P C Y V YGDGS T G +++L + + N
Sbjct: 142 CDDDFCAAT--YNGVLQGCTKDLP--CQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTS 197
Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
IFGCG G G + G++G G+++ S++SQ + +F++CL + +
Sbjct: 198 SANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVK 257
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA 351
G +G S N+TP M+PN Y + + I +GG +L F
Sbjct: 258 GGGI---FAIGEVVSPKVNTTP-----MVPN---QPHYNVVMKEIEVGGNVLELPTDIFD 306
Query: 352 KG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-----TCFNLSAY 403
G G +IDSGT + LP +Y ++ + + + PG + TCF +
Sbjct: 307 TGDRRGTIIDSGTTLAYLPEVVYESMMTKIVSE------QPGLKLHTVEEQFTCFQYTGN 360
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQ 459
P+VK F G+ +TV+ ++ + + C + + + G ++G+
Sbjct: 361 VNEGFPVVKFHFNGSLSLTVNPHDYLFQIHEEVW--CFGWQNSGMQSKDGRDMTLLGDLV 418
Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
N+ V+YD +N +G+ +CSS
Sbjct: 419 LSNKLVLYDLENQAIGWTDYNCSS 442
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 176/374 (47%), Gaps = 53/374 (14%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
++A + +G N+ V++DTGSDL W+QC+PC CY Q+DP+++ + S SY ++LCN
Sbjct: 93 FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPP 152
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
C +L G G CS S C Y +Y DG+ T G L E + + F
Sbjct: 153 CVSL----GREGQCSDSG--SCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGF 206
Query: 248 GCGRNNKGLFGGVSG--LMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLIL 303
GCG N ++GLG +SLVSQ S I F+YC + + A G L+
Sbjct: 207 GCGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVF 266
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ----LQASGFAK-----GG 354
G + + + TP+ +A FY +NL GI +G + + +S F + GG
Sbjct: 267 GDATYLNGDMTPMV---------IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGG 317
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
++IDSG+ ++ PP +Y ++ + + G+ +P S D CF E ++PL
Sbjct: 318 VIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-CFE--GKIERDLPLFP- 373
Query: 414 EFEGNAEMTVDVTGIV-----YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
+ ++ TGI+ F++ CL S + IIG Q++ + Y+
Sbjct: 374 ----TLVLYLESTGILNDRWSIFLQRYDELFCLGFTS---GEGLSIIGTLAQQSYKFGYN 426
Query: 469 TKNSQLGF-AGEDC 481
+ S L + DC
Sbjct: 427 LELSTLSIESNPDC 440
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 172/391 (43%), Gaps = 58/391 (14%)
Query: 145 NMTVIVDTGSDLTWVQCQP--CKSCYNQQDP------VFDPSISPSYKKVLCNSSTCHAL 196
++++ +DTGSDL W C P C C + P P I +++ C S C A
Sbjct: 102 SVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID--SRRISCASPLCSAA 159
Query: 197 EFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHLGLGKA-SV 242
+ S +C+++ P C + +YGDGS L R +GL + +V
Sbjct: 160 HSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLV-ANLRRGRVGLAASMAV 218
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----AS 298
+F F C G+ G GR LSL +Q + G FSYCL + S
Sbjct: 219 ENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAHSFRADRLIRS 275
Query: 299 GSLILGGN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF----- 350
LILG + +++ + T YT ++ NP+ FY + L +S+GGK++QA
Sbjct: 276 SPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVD 335
Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLSAY 403
GG+++DSGT T LP ++ + EF + + A + L C++ S
Sbjct: 336 RDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPS 395
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDE--------TG 453
+P V + F GNA + + KS+ + CL L ++ ++ G
Sbjct: 396 DRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAG 454
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+GN+QQ+ V+YD ++GFA C+ +
Sbjct: 455 TLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 485
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 108/446 (24%), Positives = 186/446 (41%), Gaps = 70/446 (15%)
Query: 94 LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVD 151
+D + ++ SR + + + S +PL+SG T Y +G + ++ D
Sbjct: 49 MDRERMAFISSRGRRRAA---ETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVAD 105
Query: 152 TGSDLTWVQCQ----------------PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH- 194
TGSDLTWV+C P + + + F P S ++ + C+S+TC
Sbjct: 106 TGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRR-TFRPDKSRTWAPIPCSSATCRE 164
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KASVNDFIF 247
+L F+ C++ + P C Y Y DGS RG +G + + KA + +
Sbjct: 165 SLPFSLA---ACATPANP-CAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVL 220
Query: 248 GCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
GC + G F G++ LG S++S S+ + FGG FSYCL + S + G
Sbjct: 221 GCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGP 280
Query: 307 SSVFKNSTP----------------------ITYTNMIPNPQLATFYILNLTGISIGGKQ 344
+ F + P T ++ + + FY + + G+S+ G+
Sbjct: 281 NPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGEL 340
Query: 345 LQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
L+ GG ++DSGT +T L Y A+ A K+ +G P D C+N
Sbjct: 341 LKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRV-TMDPFDYCYN 399
Query: 400 LSAYQEVNI----PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGII 455
++ ++ P++ + F G+A + + Y + + C+ L + + +I
Sbjct: 400 WTSPSGSDVAAPLPMLAVHFAGSARL--EPPAKSYVIDAAPGVKCIGLQEGPWPGLS-VI 456
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDC 481
GN Q+ YD KN +L F C
Sbjct: 457 GNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 157/353 (44%), Gaps = 21/353 (5%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
TI + + +D +L W QC C C+ Q PVF P+ S ++K C + C ++
Sbjct: 59 TIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIP 118
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
S VC+ Y G G +T G + + +G A+ FGC ++
Sbjct: 119 TPKCASDVCA--------YDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDT 170
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GG SG +GLGR+ SLV+Q FSYCL + D G + L LG ++ +
Sbjct: 171 MGGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-APHDTGKNSRLFLGASAKLAGGGAWT 226
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTV-ITRLPPSIYSALK 375
+ PN ++ +Y + L I G + + +L+ + V ++ L S+Y K
Sbjct: 227 PFVKTSPNDGMSQYYPIELEEIKAGDATITMPR-GRNTVLVQTAVVRVSLLVDSVYQEFK 285
Query: 376 AEFLKQFSGFPSA-PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
+ P+A P + + CF + P + F+ A +TV ++ V +
Sbjct: 286 KAVMASVGAAPTATPVGAPFEVCFPKAGVS--GAPDLVFTFQAGAALTVPPANYLFDVGN 343
Query: 435 DA---SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
D S + +AL +++ D I+G++QQ+N +++D L F DCSS+
Sbjct: 344 DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 396
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 176/380 (46%), Gaps = 39/380 (10%)
Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
T IP+ L+Y + G + + +DT ++ V C+PC DP FD
Sbjct: 134 TIIPIDGSPDAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDT 193
Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
S S ++ V C+S C + T N CS+ S N F + G ++ L +
Sbjct: 194 SQSTTFTHVPCDSPDCPS----TAN---CSAGSVCPFNLF-------FVEGTFSQDVLTV 239
Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
+ +V DF F C + G G+ G + L R SL S+ + FSYC+P D
Sbjct: 240 APSVAVQDFTFVC--LDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPD 297
Query: 295 AGASGSLILGGNSSVFKNS----TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
+ G L LG +++V ++ P+ ++ +P LA Y +++ G+S+G L
Sbjct: 298 S--PGFLSLGDDATVRGDNCTAHAPLLSSD---DPDLANMYFIDVVGMSLGDVDLPIPSG 352
Query: 351 AKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGF-PSAPGFSILDTCFNLSAYQEV 406
G ++++GT T L P Y+ L+ F + + + S PGF DTC+N + QE+
Sbjct: 353 TFGNNASTIVEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQEL 412
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYF-VKSDA--SQVCLALASL--SYEDETGIIGNYQQK 461
+PLV+ +F + +D ++Y+ + S+ + CLA ++L +D + +IG Y
Sbjct: 413 TVPLVEFKFGNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLA 472
Query: 462 NQRVIYDTKNSQLGFAGEDC 481
V+YD +GF E C
Sbjct: 473 TTEVVYDVAGGTVGFIPESC 492
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 175/405 (43%), Gaps = 52/405 (12%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP------ 173
+PL+SG T Y +G + +I DTGSDLTWV+C+ S +
Sbjct: 97 MPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAA 156
Query: 174 ---------VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSSPPDCNYFVSYGDG 223
VF P S ++ + C+S TC + + F+ N CSSS+ C+Y Y D
Sbjct: 157 PSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLAN---CSSSTA-ACSYDYRYNDN 212
Query: 224 SYTRGELGREHLGLG-------------KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRS 269
S RG +G + + KA + + GC + G F G++ LG S
Sbjct: 213 SAARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYS 272
Query: 270 DLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQ 326
++S S+ + FGG FSYCL A+ L G +S P + T ++ + +
Sbjct: 273 NISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDAR 332
Query: 327 LATFYILNLTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
+ FY + + +S+ G L + GG +IDSGT +T L Y A+ A +Q
Sbjct: 333 VRPFYAVAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQ 392
Query: 382 FSGFPSAPGFSILDTCFNLSAY----QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
+G P D C+N +A ++ +P + ++F G+A + + Y + +
Sbjct: 393 LAGLPRV-AMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARL--EPPAKSYVIDAAPG 449
Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
C+ + ++ + +IGN Q+ +D N L F C+
Sbjct: 450 VKCIGVQEGAWPGVS-VIGNILQQEHLWEFDLNNRWLRFRQTSCT 493
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 172/391 (43%), Gaps = 58/391 (14%)
Query: 145 NMTVIVDTGSDLTWVQCQP--CKSCYNQQDP------VFDPSISPSYKKVLCNSSTCHAL 196
++++ +DTGSDL W C P C C + P P I +++ C S C A
Sbjct: 102 SVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID--SRRISCASPLCSAA 159
Query: 197 EFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHLGLGKA-SV 242
+ S +C+++ P C + +YGDGS L R +GL + +V
Sbjct: 160 HSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLV-ANLRRGRVGLAASMAV 218
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----AS 298
+F F C G+ G GR LSL +Q + G FSYCL + S
Sbjct: 219 ENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAHSFRADRLIRS 275
Query: 299 GSLILGGN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF----- 350
LILG + +++ + T YT ++ NP+ FY + L +S+GGK++QA
Sbjct: 276 SPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVD 335
Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLSAY 403
GG+++DSGT T LP ++ + EF + + A + L C++ S
Sbjct: 336 RDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPS 395
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDE--------TG 453
+P V + F GNA + + KS+ + CL L ++ ++ G
Sbjct: 396 DRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAG 454
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+GN+QQ+ V+YD ++GFA C+ +
Sbjct: 455 TLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 485
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 87/146 (59%), Gaps = 6/146 (4%)
Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
P+ SGI ++ Y A + +G +++DTGSDL W+QC PC+ CY Q+ VFDP S
Sbjct: 74 PVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
+Y++V C+S C AL F +SG + C Y V+YGDGS + G+L + L
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRYMVAYGDGSSSTGDLATDKLAFAND 190
Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMG 265
+ VN+ GCGR+N+GLF +GL+G
Sbjct: 191 TYVNNVTLGCGRDNEGLFDSAAGLLG 216
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 10/134 (7%)
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDTCFNLSAYQEVNIPLVKME 414
DSGT I+R Y+AL+ F + S+ D C++L + PL+ +
Sbjct: 316 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 375
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLA-----LASLSYEDETGIIGNYQQKNQRVIYDT 469
F G A+M + YF+ D + A L + +D +IGN QQ+ RV++D
Sbjct: 376 FAGGADMALPPEN--YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDV 433
Query: 470 KNSQLGFAGEDCSS 483
+ ++GFA + C+S
Sbjct: 434 EKERIGFAPKGCTS 447
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 166/356 (46%), Gaps = 39/356 (10%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
+IVD+GS +T+V C C+ C QDP F P +S +Y+ V CN C+
Sbjct: 106 FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMD-CN----------- 153
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
C C Y Y + S ++G LG + + G S +FGC G L+
Sbjct: 154 CDDDR-EQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRA 212
Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
G++GLG+ DLSLV Q + + F C D G GS+ILGG F + + +
Sbjct: 213 DGIIGLGQGDLSLVDQLVDKGLISNSFGLCY-GGMDVGG-GSMILGG----FDYPSDMVF 266
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYSALK 375
T+ +P + +Y ++LTGI + GKQL + G ++DSGT LP + ++A +
Sbjct: 267 TDS--DPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFE 324
Query: 376 AEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEMTVDVTGI 428
+++ S P + DTCF ++A V+ P V+M F+ +
Sbjct: 325 EAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENY 384
Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ CL + + +D T ++G +N V+YD +NS++GF +CS +
Sbjct: 385 MFRHSKVHGAYCLGVFP-NGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSEL 439
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/374 (30%), Positives = 174/374 (46%), Gaps = 51/374 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
+N+++++DTGS+L+W++C + +PV FDP+ S SY + C+S TC
Sbjct: 84 QNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND--FIFGCGRNNKG---- 255
C S C+ +SY D S + G L E G S ND IFGC + G
Sbjct: 140 IPASCDSDK--LCHATLSYADASSSEGNLAAEIFHFGN-STNDSNLIFGCMGSVSGSDPE 196
Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
+GL+G+ R LS +SQ FSYC+ T D G L+LG S F TP
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMGF---PKFSYCISGTDD--FPGFLLLG--DSNFTWLTP 249
Query: 316 ITYTNMI----PNPQLATF-YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVI 363
+ YT +I P P Y + LTGI + GK L G ++DSGT
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQF 309
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQEVN-----IPLVK 412
T L +Y+AL++ FL + +G + P F +D C+ +S + + +P V
Sbjct: 310 TFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVS 369
Query: 413 MEFEGNAEMTVDVTGIVYFVKS----DASQVCLALASLSYED-ETGIIGNYQQKNQRVIY 467
+ FEG AE+ V ++Y V + S C + E +IG++ Q+N + +
Sbjct: 370 LVFEG-AEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428
Query: 468 DTKNSQLGFAGEDC 481
D + S++G A +C
Sbjct: 429 DLQRSRIGLAPVEC 442
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 115/369 (31%), Positives = 166/369 (44%), Gaps = 45/369 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+N+T+++DTGS+L+W+ C P + F P S ++ V C S+ C + + + +
Sbjct: 96 QNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPA 155
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRNNKGLFG 258
+SS C+ +SY DGS + G L + +G FGC + G+
Sbjct: 156 CDGASSR---CSVSLSYADGSSSDGALATDVFAVGSGPPLRAAFGCMSSAFDSSPDGV-- 210
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
+GL+G+ R LS VSQ S FSYC+ DAG L+LG S P+ Y
Sbjct: 211 ASAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV---LLLG--HSDLPTFLPLNY 262
Query: 319 TNM----IPNPQL-ATFYILNLTGISIGGKQL--QASGFAK-----GGILIDSGTVITRL 366
T M +P P Y + L GI +GGK L AS A G ++DSGT T L
Sbjct: 263 TPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFL 322
Query: 367 PPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLS---AYQEVNIPLVKMEFEG 417
YSALKAEF +Q A P F+ DTCF + + +P V + F G
Sbjct: 323 LGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFNG 382
Query: 418 NAEMTVDVTGIVYFVKSDASQ----VCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNS 472
AEM V ++Y V + CL + +IG++ Q N V YD +
Sbjct: 383 -AEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIGHHHQMNVWVEYDLERG 441
Query: 473 QLGFAGEDC 481
++G A C
Sbjct: 442 RVGLAPVRC 450
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 117/397 (29%), Positives = 178/397 (44%), Gaps = 50/397 (12%)
Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQC 161
+R++ ++G++ +PL R+ Y TI +G T+I DT SDLTW QC
Sbjct: 69 ARLEARLTGDM------SVPLA---RISDEGYTVTIGIGTPPQLHTLIADTASDLTWTQC 119
Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV--CSSSSPPDCNYFVS 219
Q +P+FDP+ S S+ V C+S C N G CS+ + C Y
Sbjct: 120 NLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLC-----TEDNPGTKRCSNKT---CRYVYP 171
Query: 220 YGDGSYTRGELGREHLGLGKASVN---DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
Y G L E L + + F FGCG G G SG++G+ + LS+VSQ
Sbjct: 172 YVSVE-AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQ 230
Query: 277 TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV--FKNSTPITYTNMIPNPQLATFYILN 334
+ FSYCL D +S L G + + +K + PI L +Y +
Sbjct: 231 LAI---PKFSYCLTPYTDRKSS-PLFFGAWADLGRYKTTGPI-------QKSLTFYYYVP 279
Query: 335 LTGISIGGKQLQ--ASGFA--KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
L G+S+G ++L A+ FA +GG ++D G + +L ++ALK L + +
Sbjct: 280 LVGLSLGTRRLDVPAATFALKQGGTVVDLGCTVGQLAEPAFTALKEAVLHTLNLPLTNRT 339
Query: 391 FSILDTCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
CF L A V P + + F+G A+M + YF + A +CLAL
Sbjct: 340 VKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDN--YFQEPTAGLMCLALVP-- 395
Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGN QQ+N +++D +S+ FA C +
Sbjct: 396 -GGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTICDDI 431
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 168/351 (47%), Gaps = 34/351 (9%)
Query: 144 RNMTVIVDTGSDLTWVQC--QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
+ +T + DTGSDL W +C SC Q P + P+ S ++ K+ C+ C L
Sbjct: 102 QKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSDRLCSLLR---S 158
Query: 202 NSGVCSSSSPPDCNYFVSYG----DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF 257
+S +++ +C+Y SYG D YT+G L RE LG +V FGC ++G +
Sbjct: 159 DSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAVPSVRFGCTTASEGGY 218
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
G SGL+GLGR LSLVSQ + F YCL T DA + L+ G +S+ +
Sbjct: 219 GSGSGLVGLGRGPLSLVSQ---LNASTFMYCL--TSDASKASPLLFGSLASL--TGAQVQ 271
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
T ++ + TFY +NL ISIG G + G++ DSGT +T L YS KA
Sbjct: 272 STGLLAS---TTFYAVNLRSISIGSATTPGVGEPE-GVVFDSGTTLTYLAEPAYSEAKAA 327
Query: 378 FLKQFS--GFPSAPGFSILDTCFNLSAYQEVN---IPLVKMEFEGNAEMTVDVTGIVYFV 432
FL Q S GF + CF A ++ +P + + F+G A+M + V Y V
Sbjct: 328 FLSQTSLDQVEDTDGF---EACFQKPANGRLSNAAVPTMVLHFDG-ADMALPVAN--YVV 381
Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + VC + IIGN Q N V++D S L F +C +
Sbjct: 382 EVEDGVVCWIVQR---SPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANCDT 429
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 114/419 (27%), Positives = 179/419 (42%), Gaps = 68/419 (16%)
Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQQDP-VFDPS 178
+PL+ G +Y T + + ++V +DTGSD+ W C P C C + +P P
Sbjct: 86 LPLSPGT-----DYTLTFSINSQTLSVYMDTGSDIVWFPCSPFECILCEGKFEPGTLTPL 140
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDGSY 225
+ C S C + S +C+ + P DC+ ++ +YGDGS
Sbjct: 141 NVSKSSLISCKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSL 200
Query: 226 TRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
+L + +L + S + DF FGC + G G+ G G LSL +Q + +
Sbjct: 201 I-AKLHKHNLIMPSTSNKPFSLKDFTFGCAHS---ALGEPIGVAGFGFGSLSLPAQLANL 256
Query: 281 ---FGGLFSYCLPS----TQDAGASGSLILGG-NSSVFKNSTPITYTNMIPNPQLATFYI 332
G FSYCL S + LILG F T YT M+ NP+ FY
Sbjct: 257 SPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYS 316
Query: 333 LNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAEF------- 378
+++ IS+G +++A GG+++DSGT T LP Y+++ E
Sbjct: 317 VSMEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRV 376
Query: 379 LKQFSGFPSAPGFSILDTCFNLSA----YQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVK 433
K+ S S G L C+ L + +P + F GN + + Y F+
Sbjct: 377 FKRASETESKTG---LSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLD 433
Query: 434 SDASQV-----CLALASLSYEDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ + CL L E E G +GNYQQ+ +V+YD + ++GFA C+S+
Sbjct: 434 GEDEKKGRKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKCASL 492
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 157/376 (41%), Gaps = 58/376 (15%)
Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
++PL+S Y+ + +G + ++DTG+D W QC+PCK C NQ P+F PS
Sbjct: 79 DVPLSS---FMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPS 135
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
S +YK + C S C DG Y LG + L L
Sbjct: 136 KSSTYKTIPCTSPICKN-------------------------ADGHY----LGVDTLTLN 166
Query: 239 K-----ASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PS 291
S + + GCG N+G L G VSG +GL R LS +SQ + GG FSYCL P
Sbjct: 167 SNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPL 226
Query: 292 TQDAGASGSLILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQ 346
S L G S+V STPI N Y ++L S+G +L+
Sbjct: 227 FSKENVSSKLHFGDKSTVSGLGTVSTPIKEENG---------YFVSLEAFSVGDHIIKLE 277
Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
S +G +IDSGT +T LP +YS L++ L + C+ ++ +
Sbjct: 278 NSD-NRGNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLL 336
Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
L+ +E+ ++ Y + + +C A S I GN Q+N V
Sbjct: 337 TKVLIITAHFSGSEVHLNALNTFYPITDEV--ICFAFVSGGNFSSLAIFGNVVQQNFLVG 394
Query: 467 YDTKNSQLGFAGEDCS 482
+D + F DC+
Sbjct: 395 FDLNKKTISFKPTDCT 410
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 168/356 (47%), Gaps = 39/356 (10%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
+IVD+GS +T+V C C+ C QDP F P +S +Y+ V CN C+
Sbjct: 107 FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCNMD-CN----------- 154
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
C C Y Y + S ++G LG + + G S +FGC G L+
Sbjct: 155 CDDDK-EQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRA 213
Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
G++GLG+ DLSLV Q + + F C D G GS+ILGG F + + +
Sbjct: 214 DGIIGLGQGDLSLVDQLVDKGLISNSFGLCY-GGMDVGG-GSMILGG----FDYPSDMIF 267
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYSALK 375
T+ +P + +Y ++LTGI + GK+L + + G ++DSGT LP + ++A +
Sbjct: 268 TDS--DPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGTTYAYLPDAAFAAFE 325
Query: 376 AEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEMTVDVTGI 428
+++ S P + DTCF ++A +V+ P V+M F+ +
Sbjct: 326 EAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENY 385
Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ CL + + +D T ++G +N V+YD +NS++GF +CS +
Sbjct: 386 MFRHSKVHGAYCLGVFP-NGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSEL 440
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 119/430 (27%), Positives = 188/430 (43%), Gaps = 72/430 (16%)
Query: 85 NEQQQNRLILDNLH----VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
NE ++R+ LD H Y+Q+RI+ + VSN E L +A I
Sbjct: 53 NETAKDRMELDIQHSAARFAYIQARIEGSL------VSNNEYKARVSPSLTGRTIMANIS 106
Query: 141 LGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
+G + V++DTGSD+ WV C PC +C N +FDPS+S ++ LC + +F
Sbjct: 107 IGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSP-LCKT----PCDF 161
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRN- 252
CS P + V+Y D S G GR+ + G + + D +FGCG N
Sbjct: 162 KG-----CSRCDP--IPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNI 214
Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFK 311
+ G +G++GL SL ++ G FSYC+ D + LILG + +
Sbjct: 215 GQDTDPGHNGILGLNNGPDSLATK----IGQKFSYCIGDLADPYYNYHQLILGEGADLEG 270
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVIT 364
STP N FY + + GIS+G K+L + GG++ID+G+ IT
Sbjct: 271 YSTPFEVHN--------GFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTIT 322
Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE----------VNIPLVKME 414
L S++ L E G+S T S + + V P+V
Sbjct: 323 FLVDSVHRLLSKEVRNLL-------GWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFH 375
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLA---LASLSYEDETGIIGNYQQKNQRVIYDTKN 471
F A++ +D +F + + + C+ ++SL+ + + +IG Q++ V YD N
Sbjct: 376 FADGADLALDSGS--FFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVN 433
Query: 472 SQLGFAGEDC 481
+ F DC
Sbjct: 434 QFVYFQRIDC 443
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 118/379 (31%), Positives = 169/379 (44%), Gaps = 44/379 (11%)
Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-----PCKSCYNQQDPV-----FDPSIS 180
Y+ + +G M I DTGSDL W+ C P + D FDPS S
Sbjct: 98 FEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKS 157
Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
+++ V C+S C L A+ C + S C Y SYGDGS+T G L E A
Sbjct: 158 TTFRLVDCDSVACSELPEAS-----CGADS--KCRYSYSYGDGSHTSGVLSTETFTFADA 210
Query: 241 S----------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYC 288
V + FGC G GL+GLG DLSLVSQ G FSYC
Sbjct: 211 PGARGDGTTTRVANVNFGCSTTFVG-SSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYC 269
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
L AS +L G ++V T +IP+ Q+ +YI+ L + +G K +A
Sbjct: 270 L-VPYSVKASSALNFGPRAAVTDPGA--VTTPLIPS-QVKAYYIVELRSVKVGNKTFEAP 325
Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE--- 405
+ +++DSGT +T LP ++ L E + P+ +L CF++S +E
Sbjct: 326 D--RSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQV 383
Query: 406 -VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
IP V + G A +T+ FV+ +CLA++++S + IIGN Q+N
Sbjct: 384 AAMIPDVTVGLGGGAAVTLKAENT--FVEVQEGTLCLAVSAMSEQFPASIIGNIAQQNMH 441
Query: 465 VIYDTKNSQLGFAGEDCSS 483
V YD + FA C+S
Sbjct: 442 VGYDLDKGTVTFAPAACAS 460
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 96/349 (27%), Positives = 157/349 (44%), Gaps = 28/349 (8%)
Query: 148 VIVDTGSDLTWVQCQPC-KSCYNQQD---PVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
V +DTGS ++WVQCQ C CY Q P F+ S S +Y++V C++ CH + +
Sbjct: 38 VTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIP 97
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSG 262
C C Y + Y G Y+ G L ++ L L + S+ FIFGCG +N+ G +G
Sbjct: 98 SGCVEEE-DSCIYSLRYASGEYSAGYLSQDRLTLANSYSIQKFIFGCGSDNR-YNGHSAG 155
Query: 263 LMGLGRSDLSLVSQTSEIFG-GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
++G G S +Q +++ FSYC PS Q+ G L +G ++S + T +
Sbjct: 156 IIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQE--NEGFLSIG---PYVRDSNKLILTQL 210
Query: 322 IPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFL 379
Y L + + G +LQ + ++DSGTV T + ++ AL
Sbjct: 211 FDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALT 270
Query: 380 KQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
K G + CF N + +P+V+++F + + + + Y+ SD S
Sbjct: 271 KAMVAEGYVRGSDSKEICFHSNGDSVDWSKLPVVEIKFS-RSILKLPAENVFYYETSDGS 329
Query: 438 QVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ S D+ G I+GN ++ RV++D + GF C
Sbjct: 330 -----ICSTFQPDDAGVPGVQILGNRATRSFRVVFDIQQRNFGFEAGAC 373
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 178/377 (47%), Gaps = 58/377 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVF---DPSISPSYKK--------VLCNSST 192
+ +++++DTGS L W C + Y Q+ F DP+ P Y + + C S
Sbjct: 85 QKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPK 144
Query: 193 CHALEFATGNSGVCSSSSPPDCNYF-VSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCG 250
C+ + G+ CS++ C Y+ + YG GS T G+L + LGL K + + DF+FGC
Sbjct: 145 CN---WVFGSDLNCSTTK--RCPYYGLEYGLGS-TTGQLVSDVLGLSKLNRIPDFLFGCS 198
Query: 251 R-NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQ--DAGASGSLILG- 304
+N+ G+ G GR S+ +Q GL FSYCL S + D SG L+L
Sbjct: 199 LVSNRQ----PEGIAGFGRGLASIPAQL-----GLTKFSYCLVSHRFDDTPQSGDLVLHR 249
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATF---YILNLTGISIGGKQ-------LQASGFAKGG 354
G + + Y +P L+ + Y ++L+ I +GGK L S GG
Sbjct: 250 GRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGG 309
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLV 411
+++DSG+ T + I+ + E K + + A S L C+N++ EV++P +
Sbjct: 310 MIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKL 369
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-------IIGNYQQKNQR 464
F+G A M + +T YF VC+ + L+ DE G I+GNYQQ+N
Sbjct: 370 TFSFKGGANMDLPLTD--YFSLVTDGVVCMTV--LTDPDEPGSTTGPAIILGNYQQQNFY 425
Query: 465 VIYDTKNSQLGFAGEDC 481
+ YD K + GF + C
Sbjct: 426 IEYDLKKQRFGFKPQQC 442
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 62/387 (16%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
LT+G L YI T + +IVD+GS +T+V C C+ C N QDP F P +S SY
Sbjct: 84 LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSY 139
Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
V CN TC S C Y Y + S + G LG + + G+ S
Sbjct: 140 SPVKCNVDCTC--------------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE 185
Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
+FGC + G LF G+MGLGR LS++ Q E + FS C
Sbjct: 186 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIG 245
Query: 296 GASGSLILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
G G+++LGG + VF +S P+ + +Y + L I + GK L+
Sbjct: 246 G--GAMVLGGVPAPSDMVFSHSDPLR----------SPYYNIELKEIHVAGKALRVDSRV 293
Query: 351 --AKGGILIDSGTVITRLPPSIYSAL------KAEFLKQFSGFPSAPGFSILDTCF---- 398
+K G ++DSGT LP + A K LK+ G P + D CF
Sbjct: 294 FNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRG----PDPNYKDICFAGAG 349
Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
N+S EV P V M F ++++ ++ CL + + +D T ++G
Sbjct: 350 RNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQ-NGKDPTTLLGG 407
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+N V YD N ++GF +CS +
Sbjct: 408 IIVRNTLVTYDRHNEKIGFWKTNCSEL 434
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 116/399 (29%), Positives = 175/399 (43%), Gaps = 76/399 (19%)
Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDP---V 174
IPL+ G QTL +I+DTGSDL W C C++C ++ +P +
Sbjct: 92 IPLSFGTPPQTL-------------PLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNI 138
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGV---------CSSSSPPDCNYFVSYGDGSY 225
F P S S K + C + C + + S C+ PP N F+ + D +
Sbjct: 139 FIPKSSSSSKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLN-FLRFWD--H 195
Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
R + R L S I G GR L +GL + F
Sbjct: 196 RRSQFHRRMLCPLHQSTRREISGFGRGPPSL----PSQLGLKK----------------F 235
Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA------TFYILNLTG 337
SYCL S + D S SL+L G S + + ++YT + NP++A +Y L L
Sbjct: 236 SYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRH 295
Query: 338 ISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSA 388
I++GGK ++ GG +IDSGT T + I+ + AEF KQ
Sbjct: 296 ITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEV 355
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS--L 446
G + L CFN+S + P + ++F G AEM + + V F+ D VCL + +
Sbjct: 356 EGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGD-DVVCLTIVTDGA 414
Query: 447 SYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ ++ +G I+GN+QQ+N V YD +N +LGF + C
Sbjct: 415 AGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 453
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 121/422 (28%), Positives = 189/422 (44%), Gaps = 65/422 (15%)
Query: 90 NRLILDNLH-VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNM 146
+R +LD H +++LQ+ +K S N + + ++ LT+G Y + +G +
Sbjct: 51 HRRVLDRDHRLRHLQNLVKPH-SSNARMRLHDDL-LTNGY------YTTRLWIGSPPQEF 102
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+IVDTGS +T+V C C C N QDP F P +S +Y+ V CN+ C+ E +GV
Sbjct: 103 ALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNAD-CNCDE-----NGV- 155
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLF--GGVS 261
C Y Y + S + G L + + GK S +FGC G
Sbjct: 156 ------QCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRAD 209
Query: 262 GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKNSTP 315
G+MGLGR LS++ Q + FS C D G G+++LGG SS VF +S
Sbjct: 210 GIMGLGRGTLSVMDQLVGKGVVSNSFSLCY-GGMDVGG-GAMVLGGISSPPGMVFSHS-- 265
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYS 372
+P + +Y + L I + GK L+ + K G ++DSGT P Y
Sbjct: 266 --------DPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYY 317
Query: 373 AL------KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL----VKMEFEGNAEMT 422
A K FLKQ SG P + D CF+ + +P V M F +++
Sbjct: 318 AFKDAIMKKISFLKQISG----PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ ++ + CL + + D+T ++G +N V Y+ +NS +GF +CS
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFK-NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432
Query: 483 SM 484
+
Sbjct: 433 EL 434
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 117/421 (27%), Positives = 190/421 (45%), Gaps = 64/421 (15%)
Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--YIATIELGG--RNMTVIVDTGSDLT 157
L + +++ + N + + ++PL G+ L T Y IE+G + V VDTGSD+
Sbjct: 51 LAALLRHDMGRNGRLLGAVDLPL-GGVGLPTATGLYYTRIEIGSPPKGYYVQVDTGSDIL 109
Query: 158 WVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
WV C C + + +DP+ S + V C C A A+G C S++ P
Sbjct: 110 WVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVGCEQEFCVANSAASGVPPACPSAASP 167
Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GV 260
C + ++YGDGS T G + + + S N FGCG G G +
Sbjct: 168 -CQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPSNVSITFGCGAQLGGDLGSSSQAL 226
Query: 261 SGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPI 316
G++G G+SD S++SQ + +F++CL + + G A G+++ PI
Sbjct: 227 DGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGGGIFAIGNVV----------QPPI 276
Query: 317 TYTN-MIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSI 370
T ++PN AT Y +NL GIS+GG LQ S F G G +IDSGT + LP +
Sbjct: 277 VKTTPLVPN---ATHYNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREV 333
Query: 371 YSALKAEFLKQFSGFPSAPGFSILD----TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
Y L F P ++ + CF S + P++ FEG ++T++V
Sbjct: 334 YRTLLTAV------FDKHPDLAVRNYEDFICFQFSGSLDEEFPVITFSFEG--DLTLNVY 385
Query: 427 GIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
Y ++ C+ + + G ++G+ N+ V+YD + +G+ +CS
Sbjct: 386 PHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWTDYNCS 445
Query: 483 S 483
S
Sbjct: 446 S 446
>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 404
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 81/242 (33%), Positives = 119/242 (49%), Gaps = 21/242 (8%)
Query: 247 FGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG--ASGSLIL 303
FGC + +G F G SG M LG SL SQT+ +G FSYC+P +G + G I
Sbjct: 177 FGCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTASAYGDAFSYCVPQPSASGFLSLGGAIG 236
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-SGFAKGGILIDSGTV 362
S STP+ T NP TFY++ L GI + G++L G L+DS V
Sbjct: 237 SSGSGSGFASTPLVATA---NP---TFYVVRLQGIDVAGRRLNVPPAVFSAGTLMDSSAV 290
Query: 363 ITRLPPSIYSALKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
+T+LPP+ Y AL+ F ++++ P+ G ILDTC++ V +P V + F G A
Sbjct: 291 VTQLPPTAYRALRRAFRNAMRRYRRVPAG-GKQILDTCYDFEGLGNVTVPAVSLVFSGGA 349
Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
+ ++ ++ + CLA + + G IGN QQ+ V+YD +GF
Sbjct: 350 VVRLEPMAVMM-------EGCLAFVPTPADSDLGFIGNVQQQTHEVLYDVGARNVGFRRG 402
Query: 480 DC 481
C
Sbjct: 403 AC 404
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 155/353 (43%), Gaps = 21/353 (5%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
TI + + +D +L W QC C C+ Q PVF P+ S ++K C + C ++
Sbjct: 29 TIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIP 88
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
S VC+ G G +T G + + +G A+ FGC ++
Sbjct: 89 TPKCASDVCAFDG--------VTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDT 140
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
GG SG +GLGR+ SLV+Q FSYCL + D G + L LG ++ +
Sbjct: 141 MGGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-APHDTGKNSRLFLGASAKLAGGGAWT 196
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTV-ITRLPPSIYSALK 375
+ PN ++ +Y + L I G + + +L+ + V ++ L S+Y K
Sbjct: 197 PFVKTSPNDGMSQYYPIELEEIKAGDATITMPR-GRNTVLVQTAVVRVSLLVDSVYQEFK 255
Query: 376 AEFLKQFSGFPSA-PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
+ P+A P + CF + P + F+ A +TV ++ V +
Sbjct: 256 KAVMASVGAAPTATPVGEPFEVCFPKAGVS--GAPDLVFTFQAGAALTVPPANYLFDVGN 313
Query: 435 DA---SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
D S + +AL +++ D I+G++QQ+N +++D L F DCSS+
Sbjct: 314 DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 366
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 120/422 (28%), Positives = 187/422 (44%), Gaps = 48/422 (11%)
Query: 82 VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIE 140
+ W E D +Q+L S + + +P+ SG ++ Q YI +
Sbjct: 57 LSWEESVLQMQAKDKARLQFLSSLVAR----------KSVVPIASGRQIVQNPTYIVRAK 106
Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
+G + M + +DT SD+ W+ C C C + +F+ S +YK + C ++ C +
Sbjct: 107 IGTPAQTMLMAMDTSSDVAWIPCNGCLGCSST---LFNSPASTTYKSLGCQAAQCKQVLH 163
Query: 199 ATGNSGVCSSSSPPD------CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
S P C++ ++YG GS L ++ + L +V + FGC +
Sbjct: 164 LLSPLLTSPSVVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQK 222
Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
G GL+GLGR LSL+SQT ++ FSYCLPS + SGSL LG +
Sbjct: 223 ATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR- 281
Query: 313 STPITYTNMIPNPQLATFYILNLTGISI---------GGKQLQASGFAKGGILIDSGTVI 363
I YT ++ NP+ + Y +NL + + G S A G + DSGTV
Sbjct: 282 ---IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVF 336
Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
TRL Y A++ F + + DTC+ + + P + F G M V
Sbjct: 337 TRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNV 389
Query: 424 DVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ + S A S CLA+A+ + +I N QQ+N R++YD NS+LG A E
Sbjct: 390 TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVAREL 449
Query: 481 CS 482
C+
Sbjct: 450 CT 451
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 120/422 (28%), Positives = 188/422 (44%), Gaps = 65/422 (15%)
Query: 90 NRLILDNLH-VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNM 146
+R +LD H +++LQ+ +K S N + + ++ LT+G Y + +G +
Sbjct: 51 HRRVLDRDHRLRHLQNLVKPH-SSNARMRLHDDL-LTNGY------YTTRLWIGSPPQEF 102
Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
+IVDTGS +T+V C C C N QDP F P +S +Y+ V CN+ C+ E +GV
Sbjct: 103 ALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNAD-CNCDE-----NGV- 155
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLF--GGVS 261
C Y Y + S + G L + + GK S +FGC G
Sbjct: 156 ------QCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRAD 209
Query: 262 GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKNSTP 315
G+MGLGR LS++ Q + FS C G G+++LGG SS VF +S
Sbjct: 210 GIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGG--GAMVLGGISSPPGMVFSHS-- 265
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYS 372
+P + +Y + L I + GK L+ + K G ++DSGT P Y
Sbjct: 266 --------DPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYY 317
Query: 373 AL------KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL----VKMEFEGNAEMT 422
A K FLKQ SG P + D CF+ + +P V M F +++
Sbjct: 318 AFKDAIMKKISFLKQISG----PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ ++ + CL + + D+T ++G +N V Y+ +NS +GF +CS
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFK-NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432
Query: 483 SM 484
+
Sbjct: 433 EL 434
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 166/373 (44%), Gaps = 50/373 (13%)
Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFD----------PSISPSYKKV 186
I++G N++ +V D GSDL WV C C C +D PS+S + K +
Sbjct: 107 IDIGTPNVSFLVALDAGSDLLWVPCD-CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPL 165
Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY-GDGSYTRGELGREHLGLGKASVN-- 243
CN C E + C SS P C Y SY + + + G L + L L S +
Sbjct: 166 SCNDQLC---ELGSD----CKSSKDP-CPYLASYYSENTSSSGLLIEDRLHLAPFSEHAS 217
Query: 244 ------DFIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPST 292
I GCGR G F GLMGLG DLS+ S ++ + FS C
Sbjct: 218 RSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICF--- 274
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
D SG+++ G V + ST + +P Y++ + G +G L+ +GF
Sbjct: 275 -DDNHSGTILFGDQGLVTQKST-----SFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 328
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
L+DSGT T LP IY + EF KQ + S+ S C+N S+ + +NIP V
Sbjct: 329 ---LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVT 385
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ F N V I +++ V CL + + +E GIIG R+++D +N
Sbjct: 386 LVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPI--HEEFGIIGQNFMWGYRMVFDREN 443
Query: 472 SQLGFAGEDCSSM 484
+LG++ +C +
Sbjct: 444 LKLGWSTSNCQDI 456
>gi|242086414|ref|XP_002443632.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
gi|241944325|gb|EES17470.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
Length = 556
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 116/381 (30%), Positives = 175/381 (45%), Gaps = 49/381 (12%)
Query: 132 TLNYIATIELGG--RNMTVIVDTGS-DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
TL+Y + G + V +DT S + ++C+PC S DP FD S+S ++ VLC
Sbjct: 194 TLDYSVLVSYGTPEQQFPVFLDTSSVGASMIRCKPCASGSVDCDPAFDTSLSSTFNHVLC 253
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYT--RGELGREHLGLGKAS-VNDF 245
S C CS D + F DG+Y+ G + L L ++ +NDF
Sbjct: 254 GSPDCPT---------NCSGDG--DGDSFCPL-DGTYSVINGTFVEDVLTLAPSTAINDF 301
Query: 246 IFGCGRNNK-GLFGGVSGLMGLGRS--------DLSLVSQTSEIFGGLFSYCLPSTQDAG 296
F C +K + G + L R S S FSYCLP + +
Sbjct: 302 KFVCLDVHKPDVLQTAVGTLDLSRDRNSLPSQLSSSSSSSGQASAAAAFSYCLP--KSSS 359
Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIP--NPQLATFYILNLTGISIGGKQLQ--ASGFAK 352
+ G L LG N++V K+ + ++ NP+LA+ Y ++L GIS+G + L A F
Sbjct: 360 SQGFLSLGINATV-KDDNATAHATLVSSGNPELASMYFIDLVGISLGDEDLSIPAGTFGN 418
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAP-----GFSILDTCFNLSAYQE 405
+D GT T L P Y+AL+ F +Q S F S+P GF DTCFN + +
Sbjct: 419 RSTNLDVGTTFTILAPDAYTALRESFKRQMSQYNFSSSPTDIAGGF---DTCFNFTDLND 475
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYF-VKSDA---SQVCLALASLSYEDE-TGIIGNYQQ 460
+ IP V+++F + +D ++Y+ +DA + CLA +SL D +IG+Y
Sbjct: 476 LVIPNVQLKFSNGDMLVIDADQMLYYDDDTDAAPFTMACLAFSSLDAGDSFAAVIGSYTL 535
Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
V+YD Q+GF C
Sbjct: 536 ATTEVVYDVAGGQVGFIPWSC 556
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/402 (29%), Positives = 179/402 (44%), Gaps = 48/402 (11%)
Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQC- 161
RI+ + S I SN+ S I + Y+ +G + I DTGS++ W+QC
Sbjct: 81 RIRKIRSSGI---SNSRKYPVSRISIIDKVYVMKFNIGSPPVETYAIPDTGSNIVWIQCG 137
Query: 162 QP-CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
P C +CY Q+ P+F+P+ S +Y LC C + G C SS C Y +SY
Sbjct: 138 SPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKSSVQV-CRYHISY 196
Query: 221 GDGSYTRGELGR------EHLG-LGKASVNDFIFGCGRNNKGLFG------GVSGLMGLG 267
D S++ G + EH+ G S+ F FGCG NN G G++GLG
Sbjct: 197 EDHSFSEGTISTDIITFPEHIAEFGNYSLRMF-FGCGYNNSETPGQDPNSFTAPGVVGLG 255
Query: 268 RSDLSLVSQTSEIFGGLFSYCL--PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
SLV Q + G FSYC+ P Q + + G +S+ +ST +
Sbjct: 256 NEMASLVGQLTL---GQFSYCISTPDVQKPNGTIEIRFGLAASISGHSTALA-------N 305
Query: 326 QLATFYIL-NLTGISIGGKQLQASG-----FAKGGI---LIDSGTVITRLPPSIYSALKA 376
L +YI N+ GI + +++ FA+GGI ++DSGT T L S AL
Sbjct: 306 NLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGLIMDSGTTYTELYFSALDALIG 365
Query: 377 EFLKQFSGFPSAPGF--SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
E +Q P S C+N + + +P ++++F N E T ++ +
Sbjct: 366 ELKEQIELAPDTQDHSNSNYSLCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDN 425
Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
Q CLA+ S IIG YQ ++ ++ YD K + + F
Sbjct: 426 GNDQYCLAMFGTS---GISIIGIYQHRDIKIGYDLKYNLVSF 464
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 166/373 (44%), Gaps = 50/373 (13%)
Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFD----------PSISPSYKKV 186
I++G N++ +V D GSDL WV C C C +D PS+S + K +
Sbjct: 97 IDIGTPNVSFLVALDAGSDLLWVPCD-CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPL 155
Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY-GDGSYTRGELGREHLGLGKASVN-- 243
CN C E + C SS P C Y SY + + + G L + L L S +
Sbjct: 156 SCNDQLC---ELGSD----CKSSKDP-CPYLASYYSENTSSSGLLIEDRLHLAPFSEHAS 207
Query: 244 ------DFIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPST 292
I GCGR G F GLMGLG DLS+ S ++ + FS C
Sbjct: 208 RSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICF--- 264
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
D SG+++ G V + ST + +P Y++ + G +G L+ +GF
Sbjct: 265 -DDNHSGTILFGDQGLVTQKST-----SFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 318
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
L+DSGT T LP IY + EF KQ + S+ S C+N S+ + +NIP V
Sbjct: 319 ---LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVT 375
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
+ F N V I +++ V CL + + +E GIIG R+++D +N
Sbjct: 376 LVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPI--HEEFGIIGQNFMWGYRMVFDREN 433
Query: 472 SQLGFAGEDCSSM 484
+LG++ +C +
Sbjct: 434 LKLGWSTSNCQDI 446
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/393 (30%), Positives = 177/393 (45%), Gaps = 59/393 (15%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSC-YNQQDPVFDPSISP----SYK 184
Y ++ G + T+ + DTGS L W+ C C C ++ DP P P S K
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 185 KVLCNSSTCHAL-------EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
+ C S C L N+ C+ PP Y + YG GS T G L E L
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPP---YILQYGLGS-TAGVLITEKLDF 205
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DA 295
+V DF+ GC + +G+ G GR +SL SQ + FS+CL S + D
Sbjct: 206 PDLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNL---KRFSHCLVSRRFDDT 259
Query: 296 GASGSLIL----GGNSSVFKNSTP-ITYTNMIPNPQLAT-----FYILNLTGISIGGKQL 345
+ L L G NS + TP +TYT NP ++ +Y LNL I +G K +
Sbjct: 260 NVTTDLDLDTGSGHNSG---SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316
Query: 346 Q------ASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LD 395
+ A G GG ++DSG+ T + ++ + EF Q S + L
Sbjct: 317 KIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG 376
Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-- 453
CFN+S +V +P + EF+G A++ + ++ FV + VCL + S + +G
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFV-GNTDTVCLTVVSDKTVNPSGGT 435
Query: 454 ----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
I+G++QQ+N V YD +N + GFA + CS
Sbjct: 436 GPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 159/360 (44%), Gaps = 44/360 (12%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQ--DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
I+DTGS L W+QCQPCK C + PVF+P++S ++ + C+ C +G C
Sbjct: 112 IMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRY-----APNGHC 166
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-----FGCG-RNNKGLFGGV 260
SS+ C Y Y G+ ++G L +E L + N + FGCG N + L
Sbjct: 167 GSSN--KCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHF 224
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG-ASGSLILGGNSSVFKNSTPITYT 319
+G++GLG SL Q G FSYC+ + L+LG ++ + + TPI +
Sbjct: 225 TGILGLGAKPTSLAVQ----LGSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFE 280
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFA------KGGILIDSGTVITRLPPSIYSA 373
+ Y +NL GIS+G QL + G+++DSGT+ T L Y
Sbjct: 281 TE------NSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYRE 334
Query: 374 LKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYF 431
L E P F D C++ +E + P+V F G AE+ ++ T + Y
Sbjct: 335 LYNEIKSILD--PKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYP 392
Query: 432 VKSDAS--QVCLALASL-----SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ + C+++ Y++ T IG Q+ + YD K + DC +
Sbjct: 393 LSEPNTFNVFCMSVKPTKEHGGEYKEFTA-IGLMAQQYYNIGYDLKEKNIYLQRIDCVQL 451
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 171/374 (45%), Gaps = 58/374 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +T+++DTGS+L+W+ C+ + ++ VFDP S SY + C S TC
Sbjct: 67 QTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTRTRDFSIP 122
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFGG 259
C C+ +SY D S G L + +G +++ IFGC +N
Sbjct: 123 VSCDKKK--LCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSK 180
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGGNS---------- 307
+GL+G+ R LS V+Q GL FSYC+ S QD +SG L+ G +S
Sbjct: 181 TTGLIGMNRGSLSFVTQM-----GLQKFSYCI-SGQD--SSGILLFGESSFSWLKALKYT 232
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSG 360
+ + STP+ Y + + Y + L GI + LQ S +A G ++DSG
Sbjct: 233 PLVQISTPLPYFDRVA-------YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSG 285
Query: 361 TVITRLPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEV--NIPLVK 412
T T L +Y+ALK EF++Q P F +D C+ + + +P V
Sbjct: 286 TQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVT 345
Query: 413 MEFEGNAEMTVDVTGIVY----FVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQRVIY 467
+ F G AEM+V ++Y ++ S C S E+ IIG++ Q+N + +
Sbjct: 346 LMFRG-AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEF 404
Query: 468 DTKNSQLGFAGEDC 481
D S++GFA C
Sbjct: 405 DLAKSRVGFAEVRC 418
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 49/387 (12%)
Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSI 179
G+ T Y IE+G + V VDTGSD+ WV C C C + + +DP+
Sbjct: 76 GLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAG 135
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S + V C C A A G C S+S P C + ++YGDGS T G + + +
Sbjct: 136 SGT--TVGCEQEFCVA-NSAGGVPPTCPSTSSP-CQFRITYGDGSTTTGFYVTDFVQYNQ 191
Query: 240 ASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ--TSEIFGGLF 285
S N FGCG G G + G++G G+SD S++SQ + +F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
++CL + + G + GN K T T ++PN T Y +NL GIS+GG L
Sbjct: 252 AHCLDTVR----GGGIFAIGNVVQPKVKT----TPLVPN---VTHYNVNLQGISVGGATL 300
Query: 346 Q--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
Q S F G G +IDSGT + LP +Y L A ++ P + CF
Sbjct: 301 QLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFV--CFQF 358
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIG 456
S + P++ FEG ++T++V Y ++ C+ + + G ++G
Sbjct: 359 SGSIDDGFPVITFSFEG--DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLG 416
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ N+ V+YD + +G+ +CSS
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSS 443
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 171/374 (45%), Gaps = 58/374 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +T+++DTGS+L+W+ C+ + ++ VFDP S SY + C S TC
Sbjct: 74 QTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTRTRDFSIP 129
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFGG 259
C C+ +SY D S G L + +G +++ IFGC +N
Sbjct: 130 VSCDKKK--LCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSK 187
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGGNS---------- 307
+GL+G+ R LS V+Q GL FSYC+ S QD +SG L+ G +S
Sbjct: 188 TTGLIGMNRGSLSFVTQM-----GLQKFSYCI-SGQD--SSGILLFGESSFSWLKALKYT 239
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSG 360
+ + STP+ Y + + Y + L GI + LQ S +A G ++DSG
Sbjct: 240 PLVQISTPLPYFDRVA-------YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSG 292
Query: 361 TVITRLPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEV--NIPLVK 412
T T L +Y+ALK EF++Q P F +D C+ + + +P V
Sbjct: 293 TQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVT 352
Query: 413 MEFEGNAEMTVDVTGIVY----FVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQRVIY 467
+ F G AEM+V ++Y ++ S C S E+ IIG++ Q+N + +
Sbjct: 353 LMFRG-AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEF 411
Query: 468 DTKNSQLGFAGEDC 481
D S++GFA C
Sbjct: 412 DLAKSRVGFAEVRC 425
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 179/375 (47%), Gaps = 48/375 (12%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC--HA 195
T+ +N+++++DTGS+L+W+ C S FDP+ S SY+ + C+S TC
Sbjct: 36 TVGTPPQNVSMVIDTGSELSWLHCNKTLSYPT----TFDPTRSTSYQTIPCSSPTCTNRT 91
Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----R 251
+F S C S++ C+ +SY D S + G L + +G + ++ +FGC
Sbjct: 92 QDFPIPAS--CDSNN--LCHATLSYADASSSDGNLASDVFHIGSSDISGLVFGCMDSVFS 147
Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
+N +GLMG+ R LS VSQ + FSYC+ T SG L+LG ++ +
Sbjct: 148 SNSDEDSKSTGLMGMNRGSLSFVSQ---LGFPKFSYCISGTD---FSGLLLLGESNLTW- 200
Query: 312 NSTPITYTNMI----PNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDS 359
S P+ YT +I P P Y + L GI + K L + G ++DS
Sbjct: 201 -SVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDS 259
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQEV--NIPLV 411
GT T L +Y+AL++ FL Q S P F +D C+ + Q V +P V
Sbjct: 260 GTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTV 319
Query: 412 KMEFEGNAEMTVDVTGIVYFV----KSDASQVCLALASLSYED-ETGIIGNYQQKNQRVI 466
+ F G AEMTV ++Y V + + S CL+ + E +IG++ Q+N +
Sbjct: 320 TLVFRG-AEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWME 378
Query: 467 YDTKNSQLGFAGEDC 481
+D + S++G A C
Sbjct: 379 FDLEKSRIGLAQVRC 393
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 170/373 (45%), Gaps = 44/373 (11%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
T+ +N+++++DTGS+L+W++C ++ FDP+ S SY V C+S TC
Sbjct: 90 TVGTPPQNVSMVLDTGSELSWLRCNKTQTFQT----TFDPNRSSSYSPVPCSSLTCTDRT 145
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN----N 253
C S+ C+ +SY D S + G L + +G + + IFGC + N
Sbjct: 146 RDFPIPASCDSNQ--LCHAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFSTN 203
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
+GLMG+ R LS VSQ FSYC+ D+ SG L+LG + F
Sbjct: 204 TEEDSKNTGLMGMNRGSLSFVSQMD---FPKFSYCI---SDSDFSGVLLLGDAN--FSWL 255
Query: 314 TPITYTNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFAK-----GGILIDSGT 361
P+ YT +I P P Y + L GI + K L S F G ++DSGT
Sbjct: 256 MPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGT 315
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFN--LSAYQEVNIPLVKM 413
T L +YSAL+ EFL Q S P + +D C+ LS +P V +
Sbjct: 316 QFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSL 375
Query: 414 EFEGNAEMTVDVTGIVYFVKSDA----SQVCLALA-SLSYEDETGIIGNYQQKNQRVIYD 468
F G AEM V ++Y V + S C S E +IG++ Q+N + +D
Sbjct: 376 MFRG-AEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFD 434
Query: 469 TKNSQLGFAGEDC 481
+ S++GFA C
Sbjct: 435 LEKSRIGFAQVQC 447
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 116/424 (27%), Positives = 189/424 (44%), Gaps = 66/424 (15%)
Query: 100 QYLQSRIKNMISGNIKDV-------SNTEIPLTSGIRLQTLN-YIATIELG--GRNMTVI 149
++ R+K++ + DV S +IPL + +++ Y A I LG R+ V
Sbjct: 42 KFAGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQ 101
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPV----FDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
VDTGSD+ WV C C C + D V +D S + K V C+ + C +
Sbjct: 102 VDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVSCSDNFCSYVN----QRSE 157
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCGRNNKGLF 257
C S S C Y + YGDGS T G L ++ + L + N IFGCG G
Sbjct: 158 CHSGST--CQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQL 215
Query: 258 G----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAG--ASGSLILGGNSSV 309
G V G+MG G+S+ S +SQ + F++CL + G A G ++
Sbjct: 216 GESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVV------- 268
Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAKG---GILIDSGTVIT 364
S + T M+ + Y +NL I +G +L ++ F G G++IDSGT +
Sbjct: 269 ---SPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLV 322
Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAYQEVNIPLVKMEFEGNAEMT 422
LP ++Y+ L E L + P ++ + TCF+ + + P V +F+ + +
Sbjct: 323 YLPDAVYNPLLNEIL---ASHPELTLHTVQESFTCFHYTDKLD-RFPTVTFQFDKSVSLA 378
Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAG 478
V ++ V+ D C + + + G I+G+ N+ V+YD +N +G+
Sbjct: 379 VYPREYLFQVREDT--WCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTN 436
Query: 479 EDCS 482
+CS
Sbjct: 437 HNCS 440
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 163/332 (49%), Gaps = 43/332 (12%)
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDGSYTRGELGREHL 235
+S ++K V C C +SGV S+ + C Y SYGD S T G + ++
Sbjct: 1 MSSTFKAVACPDPICRP------SSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTF 54
Query: 236 GLG-----KASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
+V++ FGCG N GLF SG+ G GR SL SQ G FSYCL
Sbjct: 55 TFMSPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKV---GRFSYCL 111
Query: 290 PSTQDAGASGSLILGGNSSV----FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
++ +S +ILG + P T +I NP + TFY L+L GI++G +L
Sbjct: 112 TLVTESKSS-VVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRL 170
Query: 346 --QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDT 396
S FA GG +IDSGT +T LP +++ L+ E + QF + + P + D
Sbjct: 171 PFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTP--EVGDR 228
Query: 397 -CFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS-DASQVCLALASLSYEDETG 453
CF ++V +P + + G A+M D+ YFV+ D+ +CL + ED T
Sbjct: 229 LCFRRPKGGKQVPVPKLILHLAG-ADM--DLPRDNYFVEEPDSGVMCLQINGA--EDTTM 283
Query: 454 I-IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ IGN+QQ+N V+YD +N++L FA C +
Sbjct: 284 VLIGNFQQQNMHVVYDVENNKLLFAPAQCDKL 315
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 167/380 (43%), Gaps = 54/380 (14%)
Query: 144 RNMTVIVDTGSDLTWVQCQP---CKSCYNQQDPV------FDPSISPSYKKVLCNSSTCH 194
+ ++ I+DTGSD+ W C CK C F P S S K + C + C
Sbjct: 78 QTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLGCKNPKCS 137
Query: 195 ALEFATGNSGV-CSSSS------PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
+ + N CS S PP Y + YG G+ T G E L L S +F+
Sbjct: 138 WIHHSNINCDQDCSIKSCLNQTCPP---YMIFYGSGT-TGGVALSETLHLHSLSKPNFLV 193
Query: 248 GCGRNNKGLFGG--VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ---DAGASGSLI 302
GC +F +G+ G GR SL SQ G FSYCL S + D S SL+
Sbjct: 194 GCS-----VFSSHQPAGIAGFGRGLSSLPSQLGL---GKFSYCLLSHRFDDDTKKSSSLV 245
Query: 303 LGGNS-SVFKNSTPITYTNMIPNPQL------ATFYILNLTGISIGG-------KQLQAS 348
L K + + YT + NP++ + +Y L L I++GG K L
Sbjct: 246 LDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYKYLSPG 305
Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQE 405
GG++IDSGT T + + L EF++Q + L CFN+S +
Sbjct: 306 EDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNVSDAKT 365
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQK 461
V+ P +++ F+G A++ + V FV + + + + ++ + G I+GN+Q +
Sbjct: 366 VSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGGPGMILGNFQMQ 425
Query: 462 NQRVIYDTKNSQLGFAGEDC 481
N V YD +N +LGF E C
Sbjct: 426 NFYVEYDLRNERLGFKQEKC 445
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 163/374 (43%), Gaps = 52/374 (13%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
Y I +G + +IVDTGS LT+V C C+ C QDP F P S +Y+ + C+
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMEC 151
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
TC S C Y Y + S + G LG + + GK S +FG
Sbjct: 152 TC--------------DSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFG 197
Query: 249 CGRNNKGLFGG--VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
C G G+MGLGR DLS+V Q E + G FS C D G G+++LG
Sbjct: 198 CENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY-GGMDVGG-GAMVLG 255
Query: 305 GNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILI 357
G S VF +S +P + +Y ++L I I GKQL + K G ++
Sbjct: 256 GISPPAGMVFTHS----------DPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTIL 305
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCF-----NLSAYQEVNIPL 410
DSGT LP + A K +K+ + P + D CF ++S + P
Sbjct: 306 DSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSK-TFPA 364
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V + F +++ ++ CL + + D+T ++G +N V+YD +
Sbjct: 365 VDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQ-NENDQTTLLGGIIVRNTLVMYDRE 423
Query: 471 NSQLGFAGEDCSSM 484
+ ++GF +CS +
Sbjct: 424 HLKIGFWKTNCSEI 437
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 163/374 (43%), Gaps = 52/374 (13%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
Y I +G + +IVDTGS LT+V C C+ C QDP F P S +Y+ + C+
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMEC 151
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
TC S C Y Y + S + G LG + + GK S +FG
Sbjct: 152 TC--------------DSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFG 197
Query: 249 CGRNNKGLFGG--VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
C G G+MGLGR DLS+V Q E + G FS C D G G+++LG
Sbjct: 198 CENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY-GGMDVGG-GAMVLG 255
Query: 305 GNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILI 357
G S VF +S +P + +Y ++L I I GKQL + K G ++
Sbjct: 256 GISPPAGMVFTHS----------DPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTIL 305
Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCF-----NLSAYQEVNIPL 410
DSGT LP + A K +K+ + P + D CF ++S + P
Sbjct: 306 DSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSK-TFPA 364
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
V + F +++ ++ CL + + D+T ++G +N V+YD +
Sbjct: 365 VDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQ-NENDQTTLLGGIIVRNTLVMYDRE 423
Query: 471 NSQLGFAGEDCSSM 484
+ ++GF +CS +
Sbjct: 424 HLKIGFWKTNCSEI 437
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 62/387 (16%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
LT+G L YI T + +IVD+GS +T+V C C+ C N QDP F P +S +Y
Sbjct: 86 LTNGYYTTRL-YIGT---PSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTY 141
Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
V CN TC + C Y Y + S + G LG + + GK S
Sbjct: 142 SPVKCNVDCTC--------------DNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESE 187
Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
+FGC G LF G+MGLGR LS++ Q E + FS C D
Sbjct: 188 LKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY-GGMDV 246
Query: 296 GASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
G G+++LGG + VF +S P+ + +Y + L I + GK L+
Sbjct: 247 GG-GTMVLGGMPAPPDMVFSHSNPVR----------SPYYNIELKEIHVAGKALRLDPKI 295
Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFSILDTCF---- 398
+K G ++DSGT LP + A K LK+ G P + D CF
Sbjct: 296 FNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRG----PDPNYKDICFAGAG 351
Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
N+S EV P V M F ++++ ++ CL + + +D T ++G
Sbjct: 352 RNVSQLSEV-FPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGG 409
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+N V YD N ++GF +CS +
Sbjct: 410 IVVRNTLVTYDRHNEKIGFWKTNCSEL 436
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 168/365 (46%), Gaps = 35/365 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+A + +G + + I+ + W QC PC+ C+ Q P+F+ S S +Y+ C ++
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87
Query: 193 CHALEFAT-GNSGVCSSSSPPDCNYFVS--YGDGSYTRGELGREHLGLGKASVNDFIFGC 249
C ++ +T GVCS Y V +GD T G G + +G A+ + FGC
Sbjct: 88 CESVPASTCSGDGVCS--------YEVETMFGD---TSGIGGTDTFAIGTATAS-LAFGC 135
Query: 250 G--RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
N K L G SG++GLGR+ SLV Q + FSYCL AG +L+LG ++
Sbjct: 136 AMDSNIKQLLGA-SGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASA 191
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLP 367
+ + T T ++ ++ Y+++L GI G + A +L+D+ ++ L
Sbjct: 192 KLAGGKSAAT-TPLVNTSDDSSDYMIHLEGIKF-GDVIIAPPPNGSVVLVDTIFGVSFLV 249
Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSILDTCF-----NLSAYQEVNIPLVKMEFEGNAEMT 422
+ + A+K P A D CF A + +P V + F+G A +T
Sbjct: 250 DAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALT 309
Query: 423 VDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
V + +Y + VCLA+ A L+ E I+G Q+N ++D L F
Sbjct: 310 VPPSKYMY--DAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPA 367
Query: 480 DCSSM 484
DCSS+
Sbjct: 368 DCSSL 372
>gi|345292859|gb|AEN82921.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292861|gb|AEN82922.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292863|gb|AEN82923.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292865|gb|AEN82924.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292867|gb|AEN82925.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292869|gb|AEN82926.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292871|gb|AEN82927.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292873|gb|AEN82928.1| AT5G10770-like protein, partial [Capsella rubella]
Length = 161
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 100/165 (60%), Gaps = 8/165 (4%)
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
LS SQT+ + +FSYCLPS+ A +G L G S+ S T + I + +F
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTISDGN--SF 54
Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
Y LN+ GI++GG++L ++ F+ G LIDSGTVITRLPP Y+AL++ F Q S +P+A
Sbjct: 55 YGLNIVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
G SILDTCF+LS ++ V IP V F G A + + GI Y K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK 159
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 179/385 (46%), Gaps = 57/385 (14%)
Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
TL T+ + +T+++DTGS+L+W+ C+ + + VF+P S SY + C+S
Sbjct: 39 TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSP 94
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG- 250
C N C C+ VSY D S G L ++ +G +++ +FGC
Sbjct: 95 VCRTRTRDLPNPVTCDPKK--LCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMD 152
Query: 251 ---RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILG- 304
+N +GLMG+ R LS V+Q GL FSYC+ S +D +SG L+ G
Sbjct: 153 SGFSSNSEEDAKTTGLMGMNRGSLSFVTQL-----GLPKFSYCI-SGRD--SSGVLLFGD 204
Query: 305 ------GN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAK- 352
GN + + + STP+ Y + + Y + L GI +G K L S FA
Sbjct: 205 SHLSWLGNLTYTPLVQISTPLPYFDRVA-------YTVQLDGIRVGNKILPLPKSIFAPD 257
Query: 353 ----GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSA 402
G ++DSGT T L +Y+AL+ EFL+Q G + P F +D C+ + A
Sbjct: 258 HTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPA 317
Query: 403 YQEV-NIPLVKMEFEGNAEMTVDVTGIVY----FVKSDASQVCLALASLSYED-ETGIIG 456
++ +P V + F G AEM V ++Y +K CL + E +IG
Sbjct: 318 GGKLPELPAVSLMFRG-AEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIG 376
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
++ Q+N + +D S++GF C
Sbjct: 377 HHHQQNVWMEFDLVKSRVGFVETRC 401
>gi|242086416|ref|XP_002443633.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
gi|241944326|gb|EES17471.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
Length = 503
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 173/370 (46%), Gaps = 33/370 (8%)
Query: 132 TLNYIATIELGG--RNMTVIVDTGS-DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
TL Y + G + V++DT S ++ ++C+PC S + FD S S ++ VLC
Sbjct: 147 TLQYSVLVSYGTPEQQFPVLLDTSSIGMSLLRCKPCASGSDDCHLAFDTSRSSTFAHVLC 206
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
S C G+ S P D Y S DG++ L L ++ +F F
Sbjct: 207 GSPDCPTNCSGDGDG---DSFCPLDSTY--SIIDGAFAEDVLT---LAPSSKAIENFRFV 258
Query: 249 C---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG---GLFSYCLPSTQDAGASGSLI 302
C + L V+G + L R SL SQ S G FSYCLP + + G L
Sbjct: 259 CLDVDEPDDDL--PVAGTLDLSRDRNSLPSQLSSSPGQATAAFSYCLP--KSPSSQGYLS 314
Query: 303 LGGNSSVFKNSTPITYTNMIPN---PQLATFYILNLTGISIGGKQLQ---ASGFAKGGIL 356
L +++V ++ + ++ N P+LA+ Y ++L G+S+G + A F G+
Sbjct: 315 LAVDATV-RHDKVTAHAPLVSNGGDPELASMYFIDLVGMSLGVDDIPIPPAGSFGNNGVN 373
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
+D GT T+L P +Y L+ F KQ S S GF DTCFNL+ +++ +PL+ +F
Sbjct: 374 LDLGTTFTKLTPEVYMTLRDSFRKQMSQNNHSLLGFDGFDTCFNLTGVRDLAMPLLWFKF 433
Query: 416 EGNAEMTVDVTGIVYFVKSDA---SQVCLALASLSYEDE-TGIIGNYQQKNQRVIYDTKN 471
+ +D+ ++Y+ A + CLA +SL D + +IG + + VIYD
Sbjct: 434 SNGERLLIDLDQMLYYDDPAAAPFTMACLAFSSLDAGDSFSAVIGTHTLASTEVIYDVAG 493
Query: 472 SQLGFAGEDC 481
++GF C
Sbjct: 494 GKVGFIPRSC 503
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 164/364 (45%), Gaps = 52/364 (14%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C CK C + QDP F P S +Y+ V C + C+
Sbjct: 104 QRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC-TWQCN--------- 153
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGRNNKGLFGG- 259
C C Y Y + S + G LG + + G + S IFGC + G
Sbjct: 154 --CDDDR-KQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDETGDIYNQ 210
Query: 260 -VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKN 312
G+MGLGR DLS++ Q E + FS C G+++LGG S VF +
Sbjct: 211 RADGIMGLGRGDLSIMDQLVEKKVISDAFSLCY--GGMGVGGGAMVLGGISPPADMVFTH 268
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
S P+ + +Y ++L I + GK+L + K G ++DSGT LP S
Sbjct: 269 SDPVR----------SPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPES 318
Query: 370 IYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI-------PLVKMEFEGNAE 420
+ A K +K+ S P D CF+ + E+N+ P+V+M F +
Sbjct: 319 AFLAFKHAIMKETHSLKRISGPDPHYNDICFSGA---EINVSQLSKSFPVVEMVFGNGHK 375
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+++ ++ CL + S + D T ++G +N V+YD ++S++GF +
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFS-NGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTN 434
Query: 481 CSSM 484
CS +
Sbjct: 435 CSEL 438
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 118/397 (29%), Positives = 177/397 (44%), Gaps = 61/397 (15%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSC-YNQQD----PVFDPSISPSYK 184
Y ++ LG + TV I+DTGS L W C C SC + D P F P +S S K
Sbjct: 84 YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCN-------YFVSYGDGSYTRGELGREHLGL 237
+ C + C A F + C + +P N Y + YG GS T G L E +
Sbjct: 144 LIGCKNPKC-AWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINF 201
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQ-- 293
+++DF+ GC + G+ G GRS SL Q GL FSYCL S +
Sbjct: 202 PNKTISDFLAGCSLLSTR---QPEGIAGFGRSQESLPLQL-----GLKKFSYCLVSRRFD 253
Query: 294 DAGASGSLILG-GNSSVFKNSTPITYTNMIPN------PQLATFYILNLTGISIGGKQLQ 346
D+ S LIL G S+ +T ++YT N P +Y + L I +G ++
Sbjct: 254 DSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVK 313
Query: 347 AS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDT 396
GG ++DSG+ T + ++ L EF KQ + + A + L
Sbjct: 314 VPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRP 373
Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--- 453
CF++S + V IP + +F+G A+M + ++ YF D VCL + S + G
Sbjct: 374 CFDISGEKSVVIPDLTFQFKGGAKMQLPLSN--YFAFVDMGVVCLTIVSDNAAALGGDGG 431
Query: 454 --------IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
I+GN+QQ+N + YD +N + GF + C+
Sbjct: 432 VRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 137/268 (51%), Gaps = 26/268 (9%)
Query: 230 LGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
LG++ L L V + FGC R G GL+G G LS SQ +++G +FSY
Sbjct: 343 LGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSY 402
Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-- 345
CLPS + + S +L LG + I T ++ NP + Y +N+ GI +GG+ +
Sbjct: 403 CLPSYKSSNFSSTLRLGPAGQPKR----IKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLV 458
Query: 346 QASGFAKG-----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCF 398
AS A G ++D+GT+ TRL +Y+A++ F + + P GF DTC+
Sbjct: 459 PASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVTGPLGGF---DTCY 515
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGII 455
N++ +++P V F+G +T+ +V SD CLA+A S + ++
Sbjct: 516 NVT----ISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIA-CLAMAAGPSDGVDAVLNVL 570
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ QQ+N RV++D N ++GF+ E C++
Sbjct: 571 ASMQQQNHRVLFDVANGRVGFSRELCTT 598
>gi|295830679|gb|ADG39008.1| AT5G10770-like protein [Capsella grandiflora]
Length = 159
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 99/165 (60%), Gaps = 8/165 (4%)
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
LS SQT+ + +FSYCLPS+ A +G L G S+ S T I + +F
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPIXTISDGN--SF 54
Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
Y LN+ GI++GG++L ++ F+ G LIDSGTVITRLPP Y+AL++ F Q S +P+A
Sbjct: 55 YGLNIVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
G SILDTCF+LS ++ V IP V F G A + + GI Y K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK 159
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 175/390 (44%), Gaps = 53/390 (13%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSCY-----NQQDPVFDPSISPSYK 184
Y ++ G + T+ + DTGS L W C C C Q P F P S S +
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSSSR 149
Query: 185 KVLCNSSTCHALEFAT-------GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
+ C + C L A N+ C+ PP Y + YG GS T G L E L
Sbjct: 150 VIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPP---YILQYGLGS-TAGILISEKLDF 205
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
+V DF+ GC + +G+ G GR SL SQ FS+CL S +
Sbjct: 206 PDLTVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKL---KSFSHCLVSRRFDDT 259
Query: 298 SGSLILG---GNSSVFKNSTP-ITYTNMIPNPQLAT-----FYILNLTGISIGGKQLQ-- 346
+ + LG G+ + TP ++YT NP ++ +Y LNL I +G K ++
Sbjct: 260 NVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVKIP 319
Query: 347 ----ASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCF 398
A G GG ++DSG+ T + ++ + EF Q S + S + CF
Sbjct: 320 YKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSGIAPCF 379
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----- 453
N+S +V +P + EF+G A+M + ++ FV +A VCL + S + + G
Sbjct: 380 NISGKGDVTVPELIFEFKGGAKMELPLSNYFSFV-GNADTVCLTVVSDNTVNPGGGTGPA 438
Query: 454 -IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
I+G++QQ+N V YD +N + GFA + CS
Sbjct: 439 IILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|295830681|gb|ADG39009.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830683|gb|ADG39010.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830685|gb|ADG39011.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830687|gb|ADG39012.1| AT5G10770-like protein [Capsella grandiflora]
Length = 159
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 99/165 (60%), Gaps = 8/165 (4%)
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
LS SQT+ + +FSYCLPS+ A +G L G S+ S T I + +F
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPIATISDGN--SF 54
Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
Y LN+ GI++GG++L ++ F+ G LIDSGTVITRLPP Y+AL++ F Q S +P+A
Sbjct: 55 YGLNIVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
G SILDTCF+LS ++ V IP V F G A + + GI Y K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK 159
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 170/367 (46%), Gaps = 44/367 (11%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+N+T+++DTGS+L+W+ C+ + + VF+P S +Y KV C S TC
Sbjct: 80 QNVTMVLDTGSELSWLHCKKTQFL----NSVFNPLSSKTYSKVPCLSPTCKTRTRDLTIP 135
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFGG 259
C ++ C+ VSY D + G L E LG + IFGC +N
Sbjct: 136 VSCDATKL--CHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSK 193
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
+GL+G+ R LS V+Q FSYC+ AG ++L GN+S F P++YT
Sbjct: 194 TTGLIGMNRGSLSFVNQMGY---PKFSYCISGFDSAG----VLLLGNAS-FPWLKPLSYT 245
Query: 320 NMI----PNPQLATF-YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLP 367
++ P P Y + L GI + K L S F G ++DSGT T L
Sbjct: 246 PLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLL 305
Query: 368 PSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE--VNIPLVKMEFEGNA 419
+Y+ALK EFL Q G F +D C+ L + + N+P+V + F+G A
Sbjct: 306 GPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQG-A 364
Query: 420 EMTVDVTGIVYFVKSDA----SQVCLALASLSYED-ETGIIGNYQQKNQRVIYDTKNSQL 474
EM+V ++Y V + S C + E +IG++ Q+N + +D + S++
Sbjct: 365 EMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIGHHHQQNVWMEFDLEKSRI 424
Query: 475 GFAGEDC 481
G A C
Sbjct: 425 GLADVRC 431
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 162/361 (44%), Gaps = 44/361 (12%)
Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
+Y+ + LG + V +VDT SDL W QC PC+ CY Q++P+FDP CNS
Sbjct: 30 DYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKE-------CNSF 82
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
H+ CS C+Y +Y D S T+G L +E GK V IF
Sbjct: 83 FDHS----------CSPEKA--CDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIF 130
Query: 248 GCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLILG 304
GCG NN G+F GL+GLG LSLVSQ ++G FS CL P D SG++ LG
Sbjct: 131 GCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLG 190
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---QASGFAKGGILIDSGT 361
S V S T + + + T Y++ L GIS+G + + +KG I+IDSGT
Sbjct: 191 EASDV---SGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGT 247
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEGNA 419
T LP Y L E Q + P D L E N+ P++ FEG
Sbjct: 248 PETYLPQEFYDRLVEELKVQIN---LPPIHVDPDLGTQLCYKSETNLEGPILTAHFEG-- 302
Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
V + + F+ C A+ + D I GN+ Q N + +D + F
Sbjct: 303 -ADVKLLPLQTFIPPKDGVFCFAMTGTT--DGLYIFGNFAQSNVLIGFDLDKRIVFFKPT 359
Query: 480 D 480
D
Sbjct: 360 D 360
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 179/393 (45%), Gaps = 51/393 (12%)
Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----Y 168
+++ ++PL R+ ++ Y I+LG + V VDTGSD+ W+ C+PC C
Sbjct: 55 LASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNL 114
Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
N + +FD + S + KKV C+ C + S C + C+Y + Y D S + G
Sbjct: 115 NFRLSLFDMNASSTSKKVGCDDDFCSFI----SQSDSCQPAL--GCSYHIVYADESTSDG 168
Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ R+ L L + + + + +FGCG + G G V G+MG G+S+ S++SQ
Sbjct: 169 KFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQ 228
Query: 277 TSEIFGG--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
+ +FS+CL + + G ++ +S + T M+PN Y +
Sbjct: 229 LAATGDAKRVFSHCLDNVKGGGIFAVGVV--------DSPKVKTTPMVPN---QMHYNVM 277
Query: 335 LTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
L G+ + G L S GG ++DSGT + P +Y +L L +
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILAR-----QPVKLH 332
Query: 393 ILDT---CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
I++ CF+ S + P V EFE + ++TV ++ ++ + L+ +
Sbjct: 333 IVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTD 392
Query: 450 DETGII--GNYQQKNQRVIYDTKNSQLGFAGED 480
+ + +I G+ N+ V+YD N +G+A +
Sbjct: 393 ERSEVILLGDLVLSNKLVVYDLDNEVIGWADHN 425
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 173/387 (44%), Gaps = 49/387 (12%)
Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSI 179
G+ T Y IE+G + V VDTGSD+ WV C C C + + +DP+
Sbjct: 76 GLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAG 135
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
S + V C C A A G C S+S P C + ++YGDGS T G + + +
Sbjct: 136 SGT--TVGCEQEFCVA-NSAGGVPPTCPSTSSP-CQFRITYGDGSTTTGFYVTDFVQYNQ 191
Query: 240 ASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ--TSEIFGGLF 285
S N FGCG G G + G++G G+SD S++SQ + +F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251
Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
++CL + + G + GN K T T ++PN T Y +NL GIS+GG L
Sbjct: 252 AHCLDTVR----GGGIFAIGNVVQPKVKT----TPLVPN---VTHYNVNLQGISVGGATL 300
Query: 346 Q--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
Q S F G G +IDSGT + LP +Y L A ++ P + CF
Sbjct: 301 QLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFV--CFQF 358
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIG 456
S + P++ F+G ++T++V Y ++ C+ + + G ++G
Sbjct: 359 SGSIDDGFPVITFSFKG--DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLG 416
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ N+ V+YD + +G+ +CSS
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSS 443
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 117/422 (27%), Positives = 187/422 (44%), Gaps = 57/422 (13%)
Query: 85 NEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
NE ++R+ LD H + I+ I G++ VSN + L +A I +G
Sbjct: 53 NETAKDRMELDIQHSAARLANIQARIEGSL--VSNNDYKARVSPSLTGRTIMANISIGQP 110
Query: 145 NMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK---KVLCNSSTCHALEFA 199
+ V++DTGSD+ WV C PC +C N +FDPS S ++ K C+ C
Sbjct: 111 PIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCDFEGCRCDPIP 170
Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRN-N 253
+ V+Y D S G GR+ + G + ++D +FGCG N
Sbjct: 171 ----------------FTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIG 214
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKN 312
G +G++GL SLV++ G FSYC+ + D + LILG + +
Sbjct: 215 HDTDPGHNGILGLNNGPDSLVTK----LGQKFSYCIGNLADPYYNYHQLILGEGADLEGY 270
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITR 365
STP N FY + + GIS+G K+L + GG++ID+G+ IT
Sbjct: 271 STPFEVYN--------GFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITF 322
Query: 366 LPPSIYSALKAEF--LKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMT 422
L S++ L E L +S + S CF S ++ V P+V F A++
Sbjct: 323 LVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLA 382
Query: 423 VDVTGIVYFVKSDASQVCLA---LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
+D +F + + + C+ ++SL+ + + +IG Q++ V YD N + F
Sbjct: 383 LDSGS--FFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRI 440
Query: 480 DC 481
DC
Sbjct: 441 DC 442
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 159/326 (48%), Gaps = 28/326 (8%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T IV DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S + PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
GG + T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GGK--IAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 231
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 232 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 290
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 291 DLGRHGV--FVERSVQEQDVWCLAFA 314
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 82/268 (30%), Positives = 134/268 (50%), Gaps = 26/268 (9%)
Query: 230 LGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
LG++ L L V + FGC R G GL+G G LS SQ +++G +FSY
Sbjct: 282 LGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSY 341
Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
CLPS + + S +L LG I T ++ NP + Y +N+ GI +GG+ +
Sbjct: 342 CLPSYKSSNFSSTLRLGPAG----QPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLV 397
Query: 348 SGFAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCF 398
A G ++D+GT+ TRL +Y+A++ F + + P GF DTC+
Sbjct: 398 PASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVTGPLGGF---DTCY 454
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGII 455
N++ +++P V F+G +T+ +V SD CLA+A S + ++
Sbjct: 455 NVT----ISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIA-CLAMAAGPSDGVDAVLNVL 509
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ QQ+N RV++D N ++GF+ E C++
Sbjct: 510 ASMQQQNHRVLFDVANGRVGFSRELCTT 537
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 108/395 (27%), Positives = 173/395 (43%), Gaps = 63/395 (15%)
Query: 146 MTVIVDTGSDLTWVQCQP--CKSCY---------NQQDPVFDPSISPSYKKVLCNSSTCH 194
+++ +DTGSDL W C P C C N +P+ P+ S +++ C S C
Sbjct: 98 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDS---RRIPCASPFCS 154
Query: 195 ALEFATGNSGVCSSSSPP-------DC-------NYFVSYGDGSYTRGELGREHLGLGKA 240
A + + +C+++ P C + +YGDGS L R +G+ +
Sbjct: 155 AAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLV-ARLRRGRVGIAAS 213
Query: 241 -SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPS----TQD 294
+V +F F C G G+ G GR LSL +Q + G FSYCL +
Sbjct: 214 VAVENFTFACAHT---ALGEPVGVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRADR 270
Query: 295 AGASGSLILGGNSSVFKNS-TPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----- 348
LILG + S T I YT ++ NP+ FY + L +S+GG ++ A
Sbjct: 271 PIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGR 330
Query: 349 -GFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-----TCF--- 398
G A GG+++DSGT T LP Y+ + EF + + + D C+
Sbjct: 331 VGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYD 390
Query: 399 -NLSAYQEVN---IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYED-- 450
+ SA +E + +P + M F G A + + +S+ + CL L + +D
Sbjct: 391 HDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGG 450
Query: 451 -ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
G +GN+QQ+ V+YD ++GFA C+ +
Sbjct: 451 GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 485
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 28/326 (8%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S + PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
GG + T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GGK--IAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 231
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 232 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 290
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 291 DLGSHGV--FVERSVQEQDVWCLAFA 314
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 108/389 (27%), Positives = 167/389 (42%), Gaps = 64/389 (16%)
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
D++ T + T+G Y ++I LG ++ ++++DTGSDLTWV+C PC C +
Sbjct: 110 DLAQTPVSFTNGG-----VYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSS--- 161
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
FD S +YK + C + + G R
Sbjct: 162 -TFDRLASNTYKALTCADDL--------------------RLPVLLRLWRRLFHSGRSLR 200
Query: 233 EHLGLGKASVND------FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
+ L + A+ ++ F+FGCG KGL G G++ L LS SQ E +G FS
Sbjct: 201 DTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFS 260
Query: 287 YCL--PSTQDAGASGSLILGGNSSVFKN-----STPITYTNMIPNPQLATFYILNLTGIS 339
YCL + Q++ ++ G + K + YT P + + +Y + L GIS
Sbjct: 261 YCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYT---PIGESSIYYTVRLDGIS 317
Query: 340 IGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFS 392
+G ++L S F G + DSGT +T LP + ++K SG F + G
Sbjct: 318 VGNQRLDLSPSTFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKG-- 375
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
LD CF + +P + F G A+ VT +V S CL +E
Sbjct: 376 -LDACFRVPPSSGQGLPDITFHFNGGADF---VTRPSNYVIDLGSLQCLIFVP---TNEV 428
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I GN QQ++ V++D N ++GF DC
Sbjct: 429 SIFGNLQQQDFFVLHDMDNRRIGFKETDC 457
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 154/366 (42%), Gaps = 57/366 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C CK C QDP F P +S SYK + CN
Sbjct: 91 QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN-------------- 136
Query: 204 GVCSSSSPPDCN---------YFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGR 251
PDCN Y Y + S + G L + + G S +FGC
Sbjct: 137 --------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCEN 188
Query: 252 NNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNS 307
G LF G+MGLGR LS+V Q + + +FS C + G G+++LG
Sbjct: 189 VETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG--GAMVLG--- 243
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVIT 364
K S P +P + +Y ++L + + GK L+ + K G ++DSGT
Sbjct: 244 ---KISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYA 300
Query: 365 RLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKMEFEGN 418
P + A+K +K+ P + D CF+ + I P + MEF GN
Sbjct: 301 YFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEF-GN 359
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
+ + ++ Y + + L D T ++G +N V YD +N +LGF
Sbjct: 360 GQKLI-LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLK 418
Query: 479 EDCSSM 484
+CS +
Sbjct: 419 TNCSDL 424
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 80/250 (32%), Positives = 112/250 (44%), Gaps = 64/250 (25%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
+TSG+ + Y + +G + + +++DTGSD+ W+QC PC+ CY+Q DPVFDP S
Sbjct: 163 VTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSG 222
Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
S+ + C S C L+ S C+S C Y V+YGDGS+T GE E L
Sbjct: 223 SFSSISCRSPLCLRLD-----SPGCNSRQ--SCLYQVAYGDGSFTFGEFSTETLTFRGTR 275
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
V GCG +N+GLF G +GL+GLGR P G+
Sbjct: 276 VPKVALGCGHDNEGLFVGAAGLLGLGRQ--------------------PRLNRPPVGGAR 315
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
+ G +S+FK L +G GG++IDSGT
Sbjct: 316 VAGITASLFK---------------------------------LDTAG--NGGVIIDSGT 340
Query: 362 VITRLPPSIY 371
+TRL Y
Sbjct: 341 SVTRLTRRAY 350
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 115/239 (48%), Gaps = 25/239 (10%)
Query: 115 KDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQD 172
KD N + S + +Y+ + +G + + DTGSDL W+QC PC +CY Q +
Sbjct: 40 KDFFNRNT-IQSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLN 98
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDGSYTRGE 229
P+FD S ++ + C S +C L S+S PD C Y SY DGS T+G
Sbjct: 99 PMFDSQSSSTFSNIACGSESCSKLY---------STSCSPDQINCKYNYSYVDGSETQGV 149
Query: 230 LGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG 283
L +E L L + IFGCG NN G F G++GLGR LSLVSQ GG
Sbjct: 150 LAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGG 209
Query: 284 -LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
+FS CL P + S + G S V N + T ++ +FY + L GIS+
Sbjct: 210 NMFSQCLVPFNTNPSISSPMSFGKGSEVLGNG--VVSTPLVSKTTYQSFYFVTLLGISV 266
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/407 (26%), Positives = 166/407 (40%), Gaps = 82/407 (20%)
Query: 146 MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPSY--------KKVLCNSSTCHA 195
+++ +DTGSDL W C P C C + P S S ++V C S C A
Sbjct: 109 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLCSA 168
Query: 196 LEFATGNSGVCSSSSPP-------DCN--------YFVSYGDGSYTRGELGREHLGLGKA 240
+ S +C+++ P C + +YGDGS L R +GLG +
Sbjct: 169 AHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGSLV-AHLRRGRVGLGAS 227
Query: 241 -SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG--- 296
+V++F F C G G+ G GR LSL Q + G FSYCL S
Sbjct: 228 VAVDNFTFACAHT---ALGEPVGVAGFGRGPLSLPGQLAPQLSGRFSYCLVSHSFRADRL 284
Query: 297 -ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG------ 349
LILG + + YT ++ NP+ FY + L +S+G ++QA
Sbjct: 285 IRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVD 344
Query: 350 -FAKGGILIDSGTVITRLPPSIYSALKA--------------EFLKQFSGFPSAPGFSIL 394
GG+++DSGT T LP Y+ + E ++ +G L
Sbjct: 345 RAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTG---------L 395
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV---------CLAL-- 443
C++ +A + +P + + F GNA + + KS+ CL L
Sbjct: 396 TPCYHYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMN 454
Query: 444 -ASLSYED-----ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+S ED G +GN+QQ+ V+YD ++GFA C+ +
Sbjct: 455 GGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTEL 501
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 117/433 (27%), Positives = 198/433 (45%), Gaps = 65/433 (15%)
Query: 86 EQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG-- 143
+++Q L H + RI + + N+ +P +G+ Y I LG
Sbjct: 29 QRRQASLTGIKAHDSSRRGRILSAVDFNL---GGNGLPTVTGL------YFTKIGLGSPS 79
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVLCNSSTCHALEF 198
++ V VDTGSD+ WV C C C + D ++DP S + + V C + C +
Sbjct: 80 KDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSST-- 137
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCG 250
G C + +P C Y +SYGDGS T G +++L + + N IFGCG
Sbjct: 138 YEGRILGCKAENP--CPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCG 195
Query: 251 RNNKGLFG-----GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLIL 303
G F + G++G G+++ S++SQ S +FS+CL + +
Sbjct: 196 AAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTN---------VG 246
Query: 304 GGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAK---GGILI 357
GG S+ + P + T ++PN +A + ++ L I + G QL + F G +I
Sbjct: 247 GGIFSIGEVVEPKVKTTPLVPN--MAHYNVI-LKNIEVDGDILQLPSDTFDSENGKGTVI 303
Query: 358 DSGTVITRLPPSIYSALKAEFL-KQFSGFPSAPGFSILD--TCFNLSAYQEVNIPLVKME 414
DSGT + LP +Y L ++ L KQ P + + + +CF + + P+VK+
Sbjct: 304 DSGTTLAYLPRIVYDQLMSKVLAKQ----PRLKVYLVEEQYSCFQYTGNVDSGFPIVKLH 359
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTK 470
FE + +TV ++ K D S C+ + E + G ++G++ N+ V+YD +
Sbjct: 360 FEDSLSLTVYPHDYLFNYKGD-SYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLE 418
Query: 471 NSQLGFAGEDCSS 483
N +G+ +CSS
Sbjct: 419 NMTIGWTDYNCSS 431
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 61/401 (15%)
Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
+S ++PL + +++ Y A I LG R+ V VDTGSD+ WV C C C + D
Sbjct: 66 LSAIDLPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL 125
Query: 174 V----FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
V +D S + K V C+ + C + C S S C Y + YGDGS T G
Sbjct: 126 VELTPYDADASSTAKSVSCSDNFCSYVN----QRSECHSGST--CQYVILYGDGSSTNGY 179
Query: 230 LGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQT 277
L R+ + L + N IFGCG G G V G+MG G+S+ S +SQ
Sbjct: 180 LVRDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQL 239
Query: 278 SE--IFGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
+ F++CL + G A G ++ S + T M+ + Y +
Sbjct: 240 ASQGKVKRSFAHCLDNNNGGGIFAIGEVV----------SPKVKTTPMLSK---SAHYSV 286
Query: 334 NLTGISIGGK--QLQASGFAKG---GILIDSGTVITRLPPSIYSALKAEFL---KQFSGF 385
NL I +G QL + F G G++IDSGT + LP ++Y+ L + L ++ +
Sbjct: 287 NLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLH 346
Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
F TCF+ + P V +F+ + + V ++ V+ D C +
Sbjct: 347 TVQDSF----TCFHYIDRLD-RFPTVTFQFDKSVSLAVYPQEYLFQVREDT--WCFGWQN 399
Query: 446 LSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ + G I+G+ N+ V+YD +N +G+ +CS
Sbjct: 400 GGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/366 (29%), Positives = 160/366 (43%), Gaps = 56/366 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C+ C QDP F P S +YK + CN S C+
Sbjct: 99 QEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCNPS-CN--------- 148
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
C C Y Y + S + G L + L G S IFGC G LF
Sbjct: 149 --CDDEG-KQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVETGELFSQ 205
Query: 259 GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG----NSSVFKN 312
G+MGLGR LS+V Q E+ G FS C G G+++LG VF +
Sbjct: 206 RADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVG--GAMVLGNIPPPPDMVFAH 263
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
S +P + +Y + L + + GK+L+ + K G ++DSGT LP
Sbjct: 264 S----------DPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYLPEE 313
Query: 370 IYSALK------AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGN 418
+ A K +FLKQ G P S D CF+ A ++V+ P V M F
Sbjct: 314 AFVAFKDAIIKEIKFLKQIHG----PDPSYNDICFS-GAGRDVSQLSKIFPEVNMVFGNG 368
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
++++ ++ + CL + + +D T ++G +N V YD N ++GF
Sbjct: 369 QKLSLSPENYLFRHTKVSGAYCLGIFQ-NGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWK 427
Query: 479 EDCSSM 484
+CS +
Sbjct: 428 TNCSEL 433
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 168/373 (45%), Gaps = 41/373 (10%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
T+ +NM++++DTGS+L+W+ C + P F+P+IS SY + C+S TC
Sbjct: 71 TVGTPPQNMSMVIDTGSELSWLHCN-TNTTATIPYPFFNPNISSSYTPISCSSPTCTTRT 129
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN----N 253
C S++ C+ +SY D S + G L + G G + +FGC + N
Sbjct: 130 RDFPIPASCDSNN--LCHATLSYADASSSEGNLASDTFGFGSSFNPGIVFGCMNSSYSTN 187
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
+GLMG+ LSLVSQ FSYC+ + SG L+LG S F
Sbjct: 188 SESDSNTTGLMGMNLGSLSLVSQLKI---PKFSYCI---SGSDFSGILLLG--ESNFSWG 239
Query: 314 TPITYTNMI----PNPQL-ATFYILNLTGISIGGKQLQASG-------FAKGGILIDSGT 361
+ YT ++ P P + Y + L GI I K L SG G + D GT
Sbjct: 240 GSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGT 299
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE--VNIPLVKM 413
+ L +Y+AL+ EFL Q +G A P F +D C+ + Q +P V +
Sbjct: 300 QFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVSL 359
Query: 414 EFEGNAEMTVDVTGIVY----FVKSDASQVCLALASLSYED-ETGIIGNYQQKNQRVIYD 468
FEG AEM V ++Y FV + S C + E IIG++ Q++ + +D
Sbjct: 360 VFEG-AEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIGHHHQQSMWMEFD 418
Query: 469 TKNSQLGFAGEDC 481
++G A C
Sbjct: 419 LVEHRVGLAHARC 431
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 173/368 (47%), Gaps = 49/368 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+N+++++DTGS+L+W+ C+ + + VF+P S +Y V C+S C T +
Sbjct: 76 QNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPICRT---RTRDL 128
Query: 204 GVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLF 257
+ +S P C+ +SY D + G L E +G + +FGC +N
Sbjct: 129 PIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEED 188
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
+GLMG+ R LS V+Q + FSYC+ + +SG L+LG S + PI
Sbjct: 189 AKSTGLMGMNRGSLSFVNQ---LGFSKFSYCISGSD---SSGFLLLGDASYSWLG--PIQ 240
Query: 318 YTNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITR 365
YT ++ P P Y + L GI +G K L S F G ++DSGT T
Sbjct: 241 YTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTF 300
Query: 366 LPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEVN---IPLVKMEFE 416
L +Y+ALK EF+ Q P F +D C+ + + N +P+V + F
Sbjct: 301 LMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR 360
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYED------ETGIIGNYQQKNQRVIYDTK 470
G AEM+V ++Y V S+ + ++ + E +IG++ Q+N + +D
Sbjct: 361 G-AEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLA 419
Query: 471 NSQLGFAG 478
S++GFAG
Sbjct: 420 KSRVGFAG 427
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 163/355 (45%), Gaps = 38/355 (10%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
+IVDTGS +T+V C C+ C QDP F P S +Y+ V C + C+
Sbjct: 97 FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TIDCN----------- 144
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
C S C Y Y + S + G LG + + G S +FGC G L+
Sbjct: 145 CDSDR-MQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENVETGDLYSQHA 203
Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
G+MGLGR DLS++ Q + + FS C D G G+++LGG S + Y
Sbjct: 204 DGIMGLGRGDLSIMDQLVDKNVISDSFSLCY-GGMDVGG-GAMVLGGISP--PSDMAFAY 259
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQ--LQASGF-AKGGILIDSGTVITRLPPSIYSALK 375
++ + +P +Y ++L I + GK+ L A+ F K G ++DSGT LP + + A K
Sbjct: 260 SDPVRSP----YYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEAAFLAFK 315
Query: 376 AEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTVDVTGIV 429
+K+ S P + D CF+ + + P+V M FE + T+ +
Sbjct: 316 DAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYM 375
Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ CL + + D+T ++G +N V+YD + +++GF +C+ +
Sbjct: 376 FRHSKVRGAYCLGVFQ-NGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNCAEL 429
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 161/361 (44%), Gaps = 46/361 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C+ C QDP F P +S +Y+ V CN S C+
Sbjct: 88 QEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCNPS-CN--------- 137
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
C C Y Y + S + G + + + G S +FGC G L+
Sbjct: 138 --CDDEG-KQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVETGDLYSQ 194
Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
G+MGLGR LS+V Q + + G FS C G + +GG + V +P
Sbjct: 195 RADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCY---------GGMDVGGGAMVLGQISPP 245
Query: 317 TYTNMI---PNPQLATFYILNLTGISIGGK--QLQASGF-AKGGILIDSGTVITRLPPSI 370
NM+ NP + +Y + L + + GK +L+ F K G ++DSGT P +
Sbjct: 246 --PNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYFPEAA 303
Query: 371 YSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEMTV 423
+ ALK +K+ PG + D CF+ A +EV+ P V M F ++++
Sbjct: 304 FHALKDAIMKEIRHLKQIPGPDPNYHDICFS-GAGREVSHLSKVFPEVNMVFGSGQKLSL 362
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ + CL + + D T ++G +N V YD +N ++GF +CS
Sbjct: 363 SPENYLFRHTKVSGAYCLGIFQ-NGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCSE 421
Query: 484 M 484
+
Sbjct: 422 L 422
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 135/267 (50%), Gaps = 24/267 (8%)
Query: 230 LGREHLGLGKA--SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
LG++ L L ++ + FGC G GL+G R LS SQ ++G +FSY
Sbjct: 310 LGQDALALHDDVDAIAAYTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSY 369
Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
CLPS + + SG+L LG + T T ++ NP + Y +N+ GI +GG+ +
Sbjct: 370 CLPSYKSSNFSGTLRLGPAGQPKRIKT----TPLLSNPHRPSLYYVNMVGIRVGGRPVAV 425
Query: 348 SGFAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
A G ++D+GT+ TRL +Y+A+ F + P A DTC+N+
Sbjct: 426 PASALAFDPASGHGTIVDAGTMFTRLSAPVYAAVCDVFRSRVRA-PVAGPLGGFDTCYNV 484
Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALA---SLSYEDETGIIG 456
+ +++P V F+G +T+ +V ++S + CLA+A S S + ++
Sbjct: 485 T----ISVPTVTFLFDGRVSVTLPEENVV--IRSSLDGIACLAMAAGPSDSVDAVLNVMA 538
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ QQ+N RV++D N ++GF+ E C++
Sbjct: 539 SMQQQNHRVLFDVANGRVGFSRELCTA 565
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 121/401 (30%), Positives = 175/401 (43%), Gaps = 68/401 (16%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPV---- 174
PL+ G QTL+ +I DTGS L W C C C + + DP
Sbjct: 84 PLSFGTPQQTLH-------------LIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPR 130
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN-------YFVSYGDGSYTR 227
F P +S S K V C + C + F C S +P N Y V YG GS T
Sbjct: 131 FVPKLSSSSKLVGCQNPKCSWI-FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TA 188
Query: 228 GELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--F 285
G L E L + +F+ GC + SG+ G GR SL SQ GL F
Sbjct: 189 GLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQM-----GLKKF 240
Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT-----FYILNLTGI 338
+YCL S + D+ SG LIL +S+ K+S +TYT NP ++ +Y LN+ I
Sbjct: 241 AYCLASRKFDDSPHSGQLIL--DSTGVKSSG-LTYTPFRQNPSVSNNAYKEYYYLNIRKI 297
Query: 339 SIGGKQLQAS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+G + ++ GG +IDSG+ T + + + EF KQ + + A
Sbjct: 298 IVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDV 357
Query: 392 SILD---TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
L CF++S + V P + +F+G A+ + + V S + CL + +
Sbjct: 358 ETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALV-SSSGVACLTVVTHQM 416
Query: 449 ED-------ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
ED + I+G +QQ+N V YD N +LGF + CS
Sbjct: 417 EDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/398 (27%), Positives = 176/398 (44%), Gaps = 50/398 (12%)
Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++ ++PL G+ T Y IE+G + V VDTGSD+ WV C C C + D
Sbjct: 64 LAAADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDL 123
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
++DP S S V C+ C A G C+ + P C Y V YGDGS T G
Sbjct: 124 GIDLRLYDPKGSSSGSTVSCDQKFCAAT--YGGKLPGCAKNIP--CEYSVMYGDGSSTTG 179
Query: 229 ELGREHL--------GLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ L G + + IFGCG G G + G++G G+S+ S++SQ
Sbjct: 180 YFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQ 239
Query: 277 TSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
+ +FS+CL + + G +G STP ++P+ Y +N
Sbjct: 240 LAAAGEVKKIFSHCLDTIK---GGGIFAIGDVVQPKVKSTP-----LVPD---MPHYNVN 288
Query: 335 LTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
L I++GG LQ K G +IDSGT +T LP +Y + A F+ P
Sbjct: 289 LESINVGGTTLQLPSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAV---FAKHPDTT 345
Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
S+ D + +Q V+ K+ F ++ ++V YF ++ + C + +
Sbjct: 346 FHSVQD-FLCIQYFQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQ 404
Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ G ++G+ N+ V+YD +N +G+ +CSS
Sbjct: 405 SKDGKDMVLLGDLVLSNKVVVYDLENQVVGWTDYNCSS 442
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 166/372 (44%), Gaps = 48/372 (12%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y A + +G + +IVDTGS +T+V C C+ C + QDP F P S +Y+ V C +
Sbjct: 93 YTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC-TWQ 151
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGC 249
C+ C + C Y Y + S + G LG + + G + S IFGC
Sbjct: 152 CN-----------CDNDR-KQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGC 199
Query: 250 GRNNKGLFGG--VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGG 305
+ G G+MGLGR DLS++ Q E + FS C G+++LGG
Sbjct: 200 ENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCY--GGMGVGGGAMVLGG 257
Query: 306 NSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILID 358
S VF S P+ + +Y ++L I + GK+L + K G ++D
Sbjct: 258 ISPPADMVFTRSDPVR----------SPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLD 307
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI----PLVK 412
SGT LP S + A K +K+ S P D CF+ + I P+V+
Sbjct: 308 SGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVE 367
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
M F ++++ ++ CL + S + D T ++G +N V+YD +++
Sbjct: 368 MVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFS-NGNDPTTLLGGIVVRNTLVMYDREHT 426
Query: 473 QLGFAGEDCSSM 484
++GF +CS +
Sbjct: 427 KIGFWKTNCSEL 438
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/404 (25%), Positives = 168/404 (41%), Gaps = 51/404 (12%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ--------- 170
+PLTS Y +G + ++ DTGSDLTWV+C+P K+
Sbjct: 82 MPLTSAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASA 141
Query: 171 --QDPVFDPSISPSYKKVLCNSSTC-HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
F P S ++ + C S TC +L F+ C + P C Y Y DGS R
Sbjct: 142 SSPRRAFRPEKSKTWAPIPCASDTCSKSLPFSLST---CPTPGSP-CAYDYRYKDGSAAR 197
Query: 228 GELGREHLGLG-------------KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSL 273
G +G E + KA + + GC + G F G++ LG S++S
Sbjct: 198 GTVGTESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSF 257
Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY------TNMIPNPQL 327
S + FGG FSYCL + S + G +S P T ++ + ++
Sbjct: 258 ASHAASRFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRM 317
Query: 328 ATFYILNLTGISIGGKQLQASG-----FAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
FY +++ IS+ G+ L+ GG+++DSGT +T L Y A+ A K+
Sbjct: 318 RPFYDVSIKAISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL 377
Query: 383 SGFPSAPGFSILDTCFNLSAYQEV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
+ FP + C+N ++ ++P + + F G+A + + Y + +
Sbjct: 378 ARFPRV-AMDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARL--EPPSKSYVIDAAPGV 434
Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
C+ + + +IGN Q+ +D KN +L F C+
Sbjct: 435 KCIGVQEGPWPG-ISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 163/367 (44%), Gaps = 58/367 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C C N QDP F P +S +Y V CN
Sbjct: 7 QEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPD------------ 54
Query: 204 GVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG 258
C+ + D C Y Y + S + G LG + + G S +FGC G LF
Sbjct: 55 --CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFS 112
Query: 259 -GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFK 311
G+MGLGR DLS+V Q E + FS C + G G+++LG S VF
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGG--GAMVLGQISPPSDMVFS 170
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSGTVITRLPP 368
+S +P + +Y + L G+ + GK+L + K G ++DSGT LP
Sbjct: 171 HS----------DPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPE 220
Query: 369 SIY----SALKAEF--LKQFSGFPSAPGFSILDTCFNLSAYQEV-----NIPLVKMEFEG 417
+ + A+ +E LKQ G P + D CF+ A E+ P V M F+
Sbjct: 221 AAFLPFIQAITSELHGLKQIRG----PDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDN 275
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
+ ++ ++ CL + + +D T ++G +N V YD ++S++GF
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDREHSKVGFW 334
Query: 478 GEDCSSM 484
+CS +
Sbjct: 335 KTNCSVL 341
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 163/367 (44%), Gaps = 58/367 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C C N QDP F P +S +Y V CN
Sbjct: 7 QEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPD------------ 54
Query: 204 GVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG 258
C+ + D C Y Y + S + G LG + + G S +FGC G LF
Sbjct: 55 --CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFS 112
Query: 259 -GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFK 311
G+MGLGR DLS+V Q E + FS C + G G+++LG S VF
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGG--GAMVLGQISPPSDMVFS 170
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSGTVITRLPP 368
+S +P + +Y + L G+ + GK+L + K G ++DSGT LP
Sbjct: 171 HS----------DPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPE 220
Query: 369 SIY----SALKAEF--LKQFSGFPSAPGFSILDTCFNLSAYQEV-----NIPLVKMEFEG 417
+ + A+ +E LKQ G P + D CF+ A E+ P V M F+
Sbjct: 221 AAFLPFIQAITSELHGLKQIRG----PDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDN 275
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
+ ++ ++ CL + + +D T ++G +N V YD ++S++GF
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDREHSKVGFW 334
Query: 478 GEDCSSM 484
+CS +
Sbjct: 335 KTNCSVL 341
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 123/401 (30%), Positives = 174/401 (43%), Gaps = 68/401 (16%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPV---- 174
PL+ G QTL+ +I DTGS L W C C C + + DP
Sbjct: 84 PLSFGTPQQTLH-------------LIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPR 130
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN-------YFVSYGDGSYTR 227
F P +S S K V C + C + F C S +P N Y V YG GS T
Sbjct: 131 FVPKLSSSSKLVGCQNPKCSWI-FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TA 188
Query: 228 GELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--F 285
G L E L + +F+ GC + SG+ G GR SL SQ GL F
Sbjct: 189 GLLLSETLDFPDKXIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQM-----GLKKF 240
Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT-----FYILNLTGI 338
+YCL S + D+ SG LIL +S+ K+S +TYT NP ++ +Y LN+ I
Sbjct: 241 AYCLASRKFDDSPHSGQLIL--DSTGVKSSG-LTYTPFRQNPSVSNNAYKEYYYLNIRKI 297
Query: 339 SIGG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+G K L GG +IDSG+ T + + + EF KQ + + A
Sbjct: 298 IVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDV 357
Query: 392 SILD---TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
L CF++S + V P + +F+G A+ + + V S + CL + +
Sbjct: 358 ETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALV-SSSGVACLTVVTHQM 416
Query: 449 ED-------ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
ED + I+G +QQ+N V YD N +LGF + CS
Sbjct: 417 EDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 124/404 (30%), Positives = 186/404 (46%), Gaps = 36/404 (8%)
Query: 99 VQYLQSRIKNMISGNIKDV-----SNTEIPLTSGIRLQTLNY-IATIELGGRNMTVIVDT 152
VQ +SR+ + + + + + + PL G +++ I T G ++ DT
Sbjct: 55 VQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATG---LSGEADT 111
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
GSDL W +C C C + P + P+ S S V C TC L ++ S
Sbjct: 112 GSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171
Query: 213 DCNYFVSYGDGS----YTRGELGREHLGLGK--ASVNDFIFGCGRNNKGLFGGVSGLMGL 266
+C+Y +YG+ YT G L E G A+ FGC ++G FG SGL+GL
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGL 231
Query: 267 GRSDLSLVSQTS-EIFGGLFSYCLPSTQDAGASGSL--ILGGNSSVFKNSTPITYTNMIP 323
GR LSLV+Q + E FG S L S + GSL + GGN F STP+ TN P
Sbjct: 232 GRGKLSLVTQLNVEAFGYRLSSDL-SAPSPISFGSLADVTGGNGDSFM-STPL-LTN--P 286
Query: 324 NPQLATFYILNLTGISIGGK--QLQASGFA------KGGILIDSGTVITRLPPSIYSALK 375
Q FY + LTGIS+GGK Q+ + F+ GG++ DSGT +T LP Y+ ++
Sbjct: 287 VVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346
Query: 376 AEFLKQFSGFPSAPGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
E L Q GF P + D CF P + + F+G A+M + + ++
Sbjct: 347 DELLSQM-GFQKPPPAANDDDLICFT-GGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD-TKNSQLGF 476
+ + + IIGN Q + V++D + N+++ F
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLF 448
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/376 (29%), Positives = 168/376 (44%), Gaps = 57/376 (15%)
Query: 149 IVDTGSDLTWVQCQP---CKSC-----YNQQDPVFDPSISPSYKKVLCNSSTCHAL---- 196
++DTGS L W C C C P F P +S S K + C + C +
Sbjct: 99 VMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPE 158
Query: 197 -----EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCG 250
+ + C+ + PP Y + YG GS T G L E L K ++ DF+ GC
Sbjct: 159 IQSKCQECDSTAQNCTQTCPP---YVIQYGSGS-TAGLLLSETLDFPNKKTIPDFLVGCS 214
Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPST--QDAGASGSLIL--G 304
+ G+ G GRS SL SQ GL FSYCL S D S L+L G
Sbjct: 215 IFS---IKQPEGIAGFGRSPESLPSQL-----GLKKFSYCLVSHAFDDTPTSSDLVLDTG 266
Query: 305 GNSSVFKNSTPITYTNMIPNPQLA--TFYILNLTGISIGG-------KQLQASGFAKGGI 355
S V K + +++T + NP A +Y + L I IG K L GG
Sbjct: 267 SGSGVTKTAG-LSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGT 325
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP---GFSILDTCFNLSAYQEVNIPLVK 412
++DSGT T + +Y + EF KQ + + A + L C+N+S + +++P +
Sbjct: 326 IVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLI 385
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVI 466
+F+G A+M + ++ YF D+ +CL + S + I+GNYQQ+N V
Sbjct: 386 FQFKGGAKMALPLSN--YFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVE 443
Query: 467 YDTKNSQLGFAGEDCS 482
+D +N + GF + C+
Sbjct: 444 FDLENEKFGFKQQSCA 459
>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
Length = 157
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/157 (42%), Positives = 93/157 (59%), Gaps = 4/157 (2%)
Query: 327 LATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG- 384
L T Y L+LT I++GGK L A+ K +IDSGTVITRLP +Y+ALK F++ S
Sbjct: 2 LPTLYGLDLTAITVGGKPLGLAASSYKVPTIIDSGTVITRLPMPVYTALKNSFVRIMSKK 61
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
+ APG SILDTCF + + +P ++M F G A++ + ++ D CLA+A
Sbjct: 62 YAQAPGISILDTCFKGNVKEMSEVPEIQMIFGGGADLPLKAHNT--LIELDKGVTCLAIA 119
Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
S + IIGNYQQ+ +V YD NS++GFA C
Sbjct: 120 GSSENNPIAIIGNYQQQTFKVAYDVANSKIGFAAGGC 156
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 156/370 (42%), Gaps = 65/370 (17%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C CK C QDP F P +S SY+ + CN
Sbjct: 87 QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-------------- 132
Query: 204 GVCSSSSPPDCN---------YFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGR 251
PDCN Y Y + S + G L + + G + S +FGC
Sbjct: 133 --------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184
Query: 252 NNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNS 307
G LF G+MGLGR LS+V Q + + +FS C + G G+++LG S
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG--GAMVLGKIS 242
Query: 308 S----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSG 360
VF +S P + +Y ++L + + GK L+ + K G ++DSG
Sbjct: 243 PPPGMVFSHSDPFR----------SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSG 292
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKME 414
T P + A+K +K+ P + D CF+ + I P + ME
Sbjct: 293 TTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAME 352
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F GN + + ++ Y + + L D T ++G +N V YD +N +L
Sbjct: 353 F-GNGQKLI-LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKL 410
Query: 475 GFAGEDCSSM 484
GF +CS +
Sbjct: 411 GFLKTNCSDI 420
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 155/366 (42%), Gaps = 57/366 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C CK C QDP F P +S SY+ + CN
Sbjct: 87 QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-------------- 132
Query: 204 GVCSSSSPPDCN---------YFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGR 251
PDCN Y Y + S + G L + + G + S +FGC
Sbjct: 133 --------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184
Query: 252 NNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNS 307
G LF G+MGLGR LS+V Q + + +FS C + G G+++LG
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG--GAMVLG--- 239
Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVIT 364
K S P +P + +Y ++L + + GK L+ + K G ++DSGT
Sbjct: 240 ---KISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYA 296
Query: 365 RLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKMEFEGN 418
P + A+K +K+ P + D CF+ + I P + MEF GN
Sbjct: 297 YFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEF-GN 355
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
+ + ++ Y + + L D T ++G +N V YD +N +LGF
Sbjct: 356 GQKLI-LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLK 414
Query: 479 EDCSSM 484
+CS +
Sbjct: 415 TNCSDI 420
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 115/444 (25%), Positives = 186/444 (41%), Gaps = 55/444 (12%)
Query: 81 IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
+ D + R+ H + R + +G+ + E+PLTSG Y
Sbjct: 45 LADLARSDRQRMAFIASHGR---RRARETAAGS--SAAAFEMPLTSGAYTGIGQYFVRFR 99
Query: 141 LG--GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDP---VFDPSISPSYKKVLCNSSTC- 193
+G + ++ DTGSDLTWV+C+ P + F P S ++ + C S TC
Sbjct: 100 VGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCT 159
Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---------KASVND 244
+L F+ C + P C Y Y DGS RG +G E + KA +
Sbjct: 160 KSLPFSLAT---CPTPGSP-CAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKG 215
Query: 245 FIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLI 302
+ GC + G VS G++ LG SD+S S + F G FSYCL A+ L
Sbjct: 216 LVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLT 275
Query: 303 LGGNSSVFKNSTPITY------------------TNMIPNPQLATFYILNLTGISIGGKQ 344
G N +V +S+P + T ++ + ++ FY + + +S+ G+
Sbjct: 276 FGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQF 335
Query: 345 LQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
L+ A GG+++DSGT +T L Y A+ A + +G P + C+N
Sbjct: 336 LKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRV-TMDPFEYCYN 394
Query: 400 L-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
S +V +P + + F G A + + G Y + + C+ L + +IGN
Sbjct: 395 WTSPSGDVTLPKMAVHFAGAARL--EPPGKSYVIDAAPGVKCIGLQEGPWPG-ISVIGNI 451
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
Q+ +D KN +L F C+
Sbjct: 452 LQQEHLWEFDIKNRRLKFQRSRCT 475
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 124/404 (30%), Positives = 186/404 (46%), Gaps = 36/404 (8%)
Query: 99 VQYLQSRIKNMISGNIKDV-----SNTEIPLTSGIRLQTLNY-IATIELGGRNMTVIVDT 152
VQ +SR+ + + + + + + PL G +++ I T G ++ DT
Sbjct: 55 VQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATG---LSGEADT 111
Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
GSDL W +C C C + P + P+ S S V C TC L ++ S
Sbjct: 112 GSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171
Query: 213 DCNYFVSYGDGS----YTRGELGREHLGLGK--ASVNDFIFGCGRNNKGLFGGVSGLMGL 266
+C+Y +YG+ YT G L E G A+ FGC ++G FG SGL+GL
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGL 231
Query: 267 GRSDLSLVSQTS-EIFGGLFSYCLPSTQDAGASGSL--ILGGNSSVFKNSTPITYTNMIP 323
GR LSLV+Q + E FG S L S + GSL + GGN F STP+ TN P
Sbjct: 232 GRGKLSLVTQLNVEAFGYRLSSDL-SAPSPISFGSLADVTGGNGDSFM-STPL-LTN--P 286
Query: 324 NPQLATFYILNLTGISIGGK--QLQASGFA------KGGILIDSGTVITRLPPSIYSALK 375
Q FY + LTGIS+GGK Q+ + F+ GG++ DSGT +T LP Y+ ++
Sbjct: 287 VVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346
Query: 376 AEFLKQFSGFPSAPGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
E L Q GF P + D CF P + + F+G A+M + + ++
Sbjct: 347 DELLSQM-GFQKPPPAANDDDLICFT-GGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404
Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD-TKNSQLGF 476
+ + + IIGN Q + V++D + N+++ F
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLF 448
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 131/466 (28%), Positives = 214/466 (45%), Gaps = 76/466 (16%)
Query: 60 SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
S++ G+ L+LKH+ ++ + +Q + + H + L + + +V
Sbjct: 21 SKVTCGSGVLKLKHRF----SELEGSSKQSGKRGMSEEHFRQLMDHTRARSRRFLLEVD- 75
Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ------- 170
+ +G Y A I +G + + IVDTGSD+ W +C+ C+ C ++
Sbjct: 76 ---LMLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCS 132
Query: 171 ----QDPV--FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
Q P+ +DP +S + C+ C GN+ C+ Y +SY D S
Sbjct: 133 SIIMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNNSCA--------YDISYEDTS 184
Query: 225 YTRGELGREHLGLG-KASVNDFIF-GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
+ G R+ + LG KAS+N +F GC + GL+ V G+MG GRS +S+ +Q + G
Sbjct: 185 SSTGIYFRDVVHLGHKASLNTTMFLGCATSISGLW-PVDGIMGFGRSKVSVPNQLAAQAG 243
Query: 283 G--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT--FYILNLTGI 338
+F +CL ++ G G L+LG N + M+ P LA Y + L +
Sbjct: 244 SYNIFYHCLSGEKEGG--GILVLGKNDE---------FPEMVYTPMLANDIVYNVKLVSL 292
Query: 339 SIGGKQL--QASGF------AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF----P 386
S+ K L +AS F GG +IDSGT P S A F+K S F P
Sbjct: 293 SVNSKALPIEASEFEYNATVGNGGTIIDSGTSSATFP----SKALALFVKAVSKFTTAIP 348
Query: 387 SAPGFSILDTCF-NLSAYQ--EVNIPLVKMEFEGNAEMTVD----VTGIVYFVKSDASQ- 438
+AP S CF ++S EV+ P V ++F+G A M + + +V S+++
Sbjct: 349 TAPLESSGSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHF 408
Query: 439 --VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
V L S S + T I+G+ K++ V+YD + S++G+ +D S
Sbjct: 409 QGVRLVCISWSVGNST-ILGDAILKDKVVVYDMEKSRIGWVKQDLS 453
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/326 (29%), Positives = 158/326 (48%), Gaps = 28/326 (8%)
Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T I +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
GG + T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GGK--IAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 231
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 232 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 290
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 291 DLGSHGV--FVERSVQEQDVWCLAFA 314
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 166/384 (43%), Gaps = 71/384 (18%)
Query: 149 IVDTGSDLTWVQCQP---CKSCYNQQD-----PVFDPSISPSYKKVLCNS---------- 190
++DTGS L W C C C P F P S S + C +
Sbjct: 108 VMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPK 167
Query: 191 --STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIF 247
S C + T N C+ S PP Y + YG GS T G L E L K ++ F+
Sbjct: 168 VQSKCQECDPTTQN---CTQSCPP---YVIQYGLGS-TAGLLLSETLDFPHKKTIPGFLV 220
Query: 248 GCGRNNKGLFG--GVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPST--QDAGASGSL 301
GC LF G+ G GRS SL SQ GL FSYCL S D AS L
Sbjct: 221 GCS-----LFSIRQPEGIAGFGRSPESLPSQL-----GLKKFSYCLVSHAFDDTPASSDL 270
Query: 302 ILGGNSSVFKNSTP-ITYTNMIPNPQLA--TFYILNLTGISIGG-------KQLQASGFA 351
+L S TP ++YT NP A +Y + L I IG K L
Sbjct: 271 VLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDG 330
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNI 408
GG ++DSGT T + +Y + EF KQ + + A + L CFN+S + V++
Sbjct: 331 NGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSV 390
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---------IIGNYQ 459
P F+G A+M + + FV D+ +CL + S ++ +G I+GNYQ
Sbjct: 391 PEFIFHFKGGAKMALPLANYFSFV--DSGVICLTIVS---DNMSGSGIGGGPAIILGNYQ 445
Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
Q+N V +D KN + GF ++C S
Sbjct: 446 QRNFHVEFDLKNERFGFKQQNCVS 469
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 183/388 (47%), Gaps = 61/388 (15%)
Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
TL T+ + +T+++DTGS+L+W+ C+ + + VF+P S SY + C+S
Sbjct: 999 TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSP 1054
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG- 250
C N C C+ VSY D S G L ++ +G +++ +FGC
Sbjct: 1055 ICRTRTRDLPNPVTCDPKKL--CHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMD 1112
Query: 251 ---RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILG- 304
+N +GLMG+ R LS V+Q GL FSYC+ S +D +SG L+ G
Sbjct: 1113 SGFSSNSEEDAKTTGLMGMNRGSLSFVTQL-----GLPKFSYCI-SGRD--SSGVLLFGD 1164
Query: 305 ------GN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-- 351
GN + + + STP+ Y + + Y + L GI +G K L S FA
Sbjct: 1165 LHLSWLGNLTYTPLVQISTPLPYFDRVA-------YTVQLDGIRVGNKILPLPKSIFAPD 1217
Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSA 402
G ++DSGT T L +Y+AL+ EFL+Q G + P F +D C++++A
Sbjct: 1218 HTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAA 1277
Query: 403 YQEV-NIPLVKMEFEGNAEMTVDVTGIVYFV----KSDASQVCLALASLSYED-ETGIIG 456
++ +P V + F G AEM V ++Y V K + CL + E +IG
Sbjct: 1278 GGKLPTLPSVSLMFRG-AEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIG 1336
Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ Q+N + +D + FA + C S+
Sbjct: 1337 HHHQQNVWMEFDL----VAFAADLCGSI 1360
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/225 (35%), Positives = 118/225 (52%), Gaps = 19/225 (8%)
Query: 68 TLELK-HKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
TL L+ H D+ +RL D+ V+Y+ +++ N +S P+ S
Sbjct: 69 TLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLNQNF--NTDKLSG---PIIS 123
Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
G + Y + I +G +++DTGSD++WVQC PC CY Q DP+F+P+ S SY
Sbjct: 124 GTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASASYA 183
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
+ C ++ C L+ + +G +C Y VSYGDGSYT G+ E + +G V +
Sbjct: 184 PLSCEAAQCRYLDQSQCRNG--------NCLYQVSYGDGSYTVGDFVTETVTIGVNKVKN 235
Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
GCG NN+GLF G +GL+GLG LS +Q + FSYCL
Sbjct: 236 VALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNST---SFSYCL 277
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 161/361 (44%), Gaps = 26/361 (7%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
TI + + I+D +L W QC C C+ Q P+F P+ S +++ C + C +
Sbjct: 48 TIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTP 107
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
+ + VC+ S + D T G +G E +G A+ + FGC ++
Sbjct: 108 TSNCSGDVCTYESTTNIRL-----DRHTTLGIVGTETFAIGTATAS-LAFGCVVASDIDT 161
Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
G SG +GLGR+ SLV+Q FSYCL S + G S L LG ++ + ++++
Sbjct: 162 MDGTSGFIGLGRTPRSLVAQMKLT---KFSYCL-SPRGTGKSSRLFLGSSAKLAGGESTS 217
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVITRLPPSIYSA 373
+ P+ +Y+L+L I G + + GGIL+ + + + L S Y A
Sbjct: 218 TAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA--QSGGILVMHTVSPFSLLVDSAYRA 275
Query: 374 LKAEFLKQFSG---FPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFE-GNAEMTVDVTGI 428
K + G P A D CF +A + P + F+ G A +TV
Sbjct: 276 FKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKY 335
Query: 429 VYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ V + C A+ S++ + TG ++G+ QQ+N +YD K L F DCSS
Sbjct: 336 LIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADCSS 395
Query: 484 M 484
+
Sbjct: 396 L 396
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 114/378 (30%), Positives = 169/378 (44%), Gaps = 58/378 (15%)
Query: 149 IVDTGSDLTWVQC------QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC--------- 193
++DTGS L W+ C C S N P F P S S K V C + C
Sbjct: 232 VLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVT 291
Query: 194 -HALEFATG---NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
H + A N+ CS + P Y V YG GS T G L E+L +V+DF+ GC
Sbjct: 292 SHCCKLAKAAFSNNNNCSQTCP---AYTVQYGLGS-TAGFLLSENLNFPAKNVSDFLVGC 347
Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLIL-GGN 306
+ GG++G GR + SL +Q + FSYCL S Q ++ + L++ N
Sbjct: 348 SVVSVYQPGGIAGF---GRGEESLPAQMNLT---RFSYCLLSHQFDESPENSDLVMEATN 401
Query: 307 SSVFKNSTPITYTNMIPNPQ-----LATFYILNLTGISIGGKQ-------LQASGFAKGG 354
S K + ++YT + NP +Y + L I +G K+ L+ GG
Sbjct: 402 SGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGG 461
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQE-VNIPL 410
++DSG+ +T + I+ + EF+KQ + + A L CF L+ E + P
Sbjct: 462 FIVDSGSTLTFMERPIFDLVAEEFVKQVN-YTRARELEKQFGLSPCFVLAGGAETASFPE 520
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETG------IIGNYQQKNQ 463
++ EF G A+M + V YF + V CL + S + G I+GNYQQ+N
Sbjct: 521 MRFEFRGGAKMRLPVAN--YFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQNF 578
Query: 464 RVIYDTKNSQLGFAGEDC 481
V D +N + GF + C
Sbjct: 579 YVECDLENERFGFRSQSC 596
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 124/439 (28%), Positives = 180/439 (41%), Gaps = 50/439 (11%)
Query: 82 VDWNEQQQNRLIL---DNLHVQYLQSRIKNMISGNIKDVS----------NTEIPLTSGI 128
D E RL L D L L SRI+++I + K S ++ L SGI
Sbjct: 23 ADSTEDTAVRLKLAHRDTLWPNPL-SRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGI 81
Query: 129 RLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN-------QQDPVFDPSI 179
T Y + +G + V+VDTGS+LTWV C+ Y + VF
Sbjct: 82 DYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCR-----YRGRGKGKVKNRRVFRAEE 136
Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG- 238
S S+K V C + TC + C + S P C+Y Y DGS +G +E + +G
Sbjct: 137 SKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTP-CSYDYRYADGSAAQGVFAKETITVGL 195
Query: 239 ----KASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PST 292
KA + + GC + G + G++GL SD S S + +FG SYCL
Sbjct: 196 TNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHL 255
Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA----- 347
+ S LI G +SS T T + + FY +N+ GISIG L
Sbjct: 256 SNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVW 315
Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALK---AEFLKQFSGFPSAPGFSILDTCF-NLSAY 403
GG ++DSGT +T L + Y + A +L + P ++ CF + S +
Sbjct: 316 DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRV--KPEGIPIEYCFSSTSGF 373
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
E +P + +G A Y V + CL S T ++GN Q+N
Sbjct: 374 NESKLPQLTFHLKGGARFEPHRKS--YLVDAAPGVKCLGFMSAG-TPATNVVGNIMQQNY 430
Query: 464 RVIYDTKNSQLGFAGEDCS 482
+D S L FA C+
Sbjct: 431 LWEFDLMASTLSFAPSTCT 449
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 170/383 (44%), Gaps = 57/383 (14%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+N+T+++DTGS+L+W+ C ++ D FD S S SY V C+S C L
Sbjct: 74 QNVTMVLDTGSELSWLLCN-----GSRHDAPFDASASSSYAPVPCSSPACTWLGRDLPVR 128
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC----GRNNKGLFGG 259
C SS+ C +SY D S G L + LG + + +FGC +
Sbjct: 129 PFCDSSA---CRVSLSYADASSADGLLAADTFLLGSSPMPA-LFGCITSYSSSTDPSETP 184
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP---- 315
+GL+G+ R LS V+QT+ F+YC+ + Q G L+LGGN + ++P
Sbjct: 185 PTGLLGMNRGGLSFVTQTATR---RFAYCIAAGQ---GPGILLLGGNDTETPLTSPPQQQ 238
Query: 316 ITYTNMI----PNPQL-ATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVI 363
+ YT ++ P P Y + L GI +G L G ++DSGT
Sbjct: 239 LNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQTMVDSGTRF 298
Query: 364 TRLPPSIYSALKAEFLKQFS-----GFPS--APGF---SILDTCF-----NLSAYQEVN- 407
T L P Y+ALKAEF Q + G PGF D CF +SA
Sbjct: 299 TFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGTEARVSAAAAGGL 358
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY--EDETG----IIGNYQQK 461
+P V + G + ++Y V + + L++ D G +IG++ Q+
Sbjct: 359 LPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMAGVSAYVIGHHHQQ 418
Query: 462 NQRVIYDTKNSQLGFAGEDCSSM 484
+ V YD +N++LGFA C+ +
Sbjct: 419 DVWVEYDLRNARLGFAAARCADL 441
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 160/345 (46%), Gaps = 37/345 (10%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+DT SD+ W+ PC C +F+ S +YK + C ++ C + T GVCS
Sbjct: 1 MDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCS-- 55
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
+ ++YG GS L ++ + L +V + FGC + G GL+GLGR
Sbjct: 56 ------FNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRG 108
Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
LSL+SQT ++ FSYCLPS + SGSL LG + I YT ++ NP+ +
Sbjct: 109 PLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR----IKYTPLLKNPRRPS 164
Query: 330 FYILNLTGISI---------GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
Y +NL + + G S A G + DSGTV TRL Y A++ F
Sbjct: 165 LYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVFTRLVTPAYIAVRDAFRN 222
Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQV 439
+ + DTC+ + + P + F G M V + + S A S
Sbjct: 223 RVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNVTLPPDNLLIHSTAGSTT 275
Query: 440 CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
CLA+A+ + +I N QQ+N R++YD NS+LG A E C+
Sbjct: 276 CLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 320
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/375 (28%), Positives = 159/375 (42%), Gaps = 48/375 (12%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ---DPVFDPSISPSYKKVLCN 189
Y + + +G + +IVDTGS +T+V C C C + Q DP F P S SY+ V CN
Sbjct: 99 YTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSCN 158
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFI 246
S C + +C + C Y Y + S ++G LG++ LG G S + +
Sbjct: 159 SPDC--------ITKMCDARV-HQCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLL 209
Query: 247 FGCGRNNKG--LFGGVSGLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLI 302
FGC G G+MGLGR LS+V Q + FS C + G GS++
Sbjct: 210 FGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGG--GSMV 267
Query: 303 LGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AKGGI 355
LG + VF S +P + +Y L L+ I + G L + G
Sbjct: 268 LGAIPPPPAMVFAKS----------DPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGT 317
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCF----NLSAYQEVNIP 409
++DSGT LP + A K +Q + PG S D CF + S + P
Sbjct: 318 VLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFP 377
Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
V F GN ++ + ++ CL +D T ++G +N V YD
Sbjct: 378 PVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGF--FKNQDATTLLGGIVVRNTLVTYDR 435
Query: 470 KNSQLGFAGEDCSSM 484
N Q+GF +C+++
Sbjct: 436 ANHQIGFFKTNCTNL 450
>gi|115448347|ref|NP_001047953.1| Os02g0720500 [Oryza sativa Japonica Group]
gi|113537484|dbj|BAF09867.1| Os02g0720500, partial [Oryza sativa Japonica Group]
Length = 172
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 103/183 (56%), Gaps = 18/183 (9%)
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
LGG SS ST T ++ T+YI+ L GIS+GG+ L AS FA G + +D+G
Sbjct: 4 LGGPSSTAGFST----TPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTG 58
Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
TV+TRLPP+ YSAL++ F + G+PSAP ILDTC++ + Y V +P + + F G
Sbjct: 59 TVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGG 118
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
A M + +GI+ + CLA A + + I+GN QQ++ V +D S +GF
Sbjct: 119 AAMDLGTSGIL-------TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMP 169
Query: 479 EDC 481
C
Sbjct: 170 ASC 172
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 161/372 (43%), Gaps = 48/372 (12%)
Query: 149 IVDTGSDLTWVQCQP---CKSCY-----NQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
++DTGS L W C C C + P F P S + K + C + C + F +
Sbjct: 108 VLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYI-FGS 166
Query: 201 GNSGVCSSSSPPDCN-------YFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
C P N Y + YG GS T G L ++L +V F+ GC +
Sbjct: 167 DVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFPGKTVPQFLVGCSILS 225
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLILGGNSSVFK 311
SG+ G GR SL SQ + FSYCL S + D S L+L +S+
Sbjct: 226 ---IRQPSGIAGFGRGQESLPSQMNL---KRFSYCLVSHRFDDTPQSSDLVLQISSTGDT 279
Query: 312 NSTPITYTNM-----IPNPQLATFYILNLTGISIGGKQ-------LQASGFAKGGILIDS 359
+ ++YT NP +Y L L + +GGK L+ GG ++DS
Sbjct: 280 KTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDS 339
Query: 360 GTVITRLPPSIYSALKAEFLKQ----FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
G+ T + +Y+ + EF+KQ +S A S L CFN+S + V P + +F
Sbjct: 340 GSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKF 399
Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALAS---LSYEDETG---IIGNYQQKNQRVIYDT 469
+G A+MT + V DA VCL + S TG I+GNYQQ+N + YD
Sbjct: 400 KGGAKMTQPLQNYFSLV-GDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDL 458
Query: 470 KNSQLGFAGEDC 481
+N + GF C
Sbjct: 459 ENERFGFGPRSC 470
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 172/391 (43%), Gaps = 70/391 (17%)
Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVL 187
++ + T+ +G + +++DTGS L+W+QC +N+ P FDPS+S S+ +
Sbjct: 85 SMALVVTLPIGTPPQPQQMVLDTGSQLSWIQC------HNKTPPTASFDPSLSSSFYVLP 138
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFI 246
C C C + C+Y Y DG+Y G L RE L + + I
Sbjct: 139 CTHPLCKPRVPDFTLPTTCDQNR--LCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLI 196
Query: 247 FGCG---RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----ASG 299
GC R+ +G+ G M LGR ++ ++ FSYC+P+ Q A +G
Sbjct: 197 LGCSSESRDARGILG-----MNLGRLSFPFQAKVTK-----FSYCVPTRQPANNNNFPTG 246
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATF-------YILNLTGISIGGKQL------- 345
S LG N NS Y +M+ PQ Y + + GI IGG++L
Sbjct: 247 SFYLGNN----PNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVF 302
Query: 346 QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSA 402
+ + G ++DSG+ T L Y ++ E ++ G G+ + D CF+ +A
Sbjct: 303 RPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVL-GPRVKKGYVYGGVADMCFDGNA 361
Query: 403 YQEVNIPL--VKMEFEGNAEMTV-------DVTGIVYFVKSDASQVCLALASLSYEDETG 453
E+ L V EFE E+ V DV G V+ V S+ L AS
Sbjct: 362 -MEIGRLLGDVAFEFEKGVEIVVPKERVLADVGGGVHCVGIGRSER-LGAAS-------N 412
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGN+ Q+N V +D N ++GF DCS +
Sbjct: 413 IIGNFHQQNLWVEFDLANRRIGFGVADCSRL 443
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 109/431 (25%), Positives = 172/431 (39%), Gaps = 84/431 (19%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----------Y 168
+PL+SG T Y +G R ++ DTGSDLTWV+C+ + Y
Sbjct: 42 MPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNY 101
Query: 169 NQQDP-----------------VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSS 210
P VF P S ++ + C+S TC A L F+ C +
Sbjct: 102 GYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLA---ACPTPG 158
Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLG-----------KASVNDFIFGCGRNNKGL-FG 258
P C Y Y DGS RG +G + + +A + + GC + G F
Sbjct: 159 SP-CAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL 217
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL------------------PSTQDAGASGS 300
G++ LG S++S S+ + FGG FSYCL P+ A AS +
Sbjct: 218 ASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRT 277
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-----KGGI 355
G ++ TP+ + ++ FY + + G+S+ G+ L+ GG
Sbjct: 278 ACAGSAAAPGARQTPLLLDH-----RMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGA 332
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY-----QEVNIPL 410
++DSGT +T L Y A+ A K+ G P D C+N ++ V +P
Sbjct: 333 ILDSGTSLTVLVSPAYRAVVAALGKKLVGLPRV-AMDPFDYCYNWTSPLTGEDLAVAVPA 391
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
+ + F G+A + Y + + C+ L + + +IGN Q+ +D K
Sbjct: 392 LAVHFAGSARLQPPPKS--YVIDAAPGVKCIGLQEGDWPGVS-VIGNILQQEHLWEFDLK 448
Query: 471 NSQLGFAGEDC 481
N +L F C
Sbjct: 449 NRRLRFKRSRC 459
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ + G+ FV+ + CLA A
Sbjct: 289 DLGIHGV--FVERSVQEQDVWCLAFA 312
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 171/368 (46%), Gaps = 49/368 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+N+++++DTGS+L+W+ C+ + + VF+P S +Y V C+S C T +
Sbjct: 76 QNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPICRT---RTRDL 128
Query: 204 GVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLF 257
+ +S P C+ +SY D + G L E +G + +FGC +N
Sbjct: 129 PIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEED 188
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
+GLMG+ R LS V+Q + FSYC+ +G+ S+ L + + PI
Sbjct: 189 AKSTGLMGMNRGSLSFVNQ---LGFSKFSYCI-----SGSDSSVFLLLGDASYSWLGPIQ 240
Query: 318 YTNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITR 365
YT ++ P P Y + L GI +G K L S F G ++DSGT T
Sbjct: 241 YTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTF 300
Query: 366 LPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEVN---IPLVKMEFE 416
L +Y+ALK EF+ Q P F +D C+ + + N +P+V + F
Sbjct: 301 LMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR 360
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYED------ETGIIGNYQQKNQRVIYDTK 470
G AEM+V ++Y V S+ + ++ + E +IG++ Q+N + +D
Sbjct: 361 G-AEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLA 419
Query: 471 NSQLGFAG 478
S++GFAG
Sbjct: 420 KSRVGFAG 427
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 159/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS ++WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ +G+ FV+ + CLA A
Sbjct: 289 DLGSSGV--FVERSVQEQDVWCLAFA 312
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 113/417 (27%), Positives = 184/417 (44%), Gaps = 52/417 (12%)
Query: 99 VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--YIATIELGG--RNMTVIVDTGS 154
++L + ++ + + + + ++PL G+ L T Y IE+G + V VDTGS
Sbjct: 48 AEHLAALRRHDVGRHGRLLGAVDLPL-GGVGLPTATGLYYTQIEIGSPSKGYYVQVDTGS 106
Query: 155 DLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
D+ WV C C C + +DP+ S + V C+ C A G C S+
Sbjct: 107 DILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVGCDQEFCVA-NSPNGLPPACPST 163
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCGRNNKGLFG--- 258
S P C + ++YGDGS T G + + + S N FGCG G G
Sbjct: 164 SSP-CQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPSNASITFGCGAQLGGDLGSSS 222
Query: 259 -GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
+ G++G G++D S++SQ + +F++CL + G +G +TP
Sbjct: 223 QALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGGGI---FAIGNVVQPKVKTTP 279
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSI 370
+ Q T Y +NL GIS+GG LQ +S F G G +IDSGT + LP +
Sbjct: 280 LV--------QNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDSGTTLAYLPREV 331
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
Y L ++ + CF S + P+V FEG E+T++V Y
Sbjct: 332 YRTLLTAVFDKYQDLALHNYQDFV--CFQFSGSIDDGFPVVTFSFEG--EITLNVYPHDY 387
Query: 431 FVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+++ C+ + + G ++G+ N+ V+YD + +G+A +CSS
Sbjct: 388 LFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWADYNCSS 444
>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
Length = 216
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 77/222 (34%), Positives = 117/222 (52%), Gaps = 16/222 (7%)
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
+SL+SQT + G+FSYCLPS + SGSL LG +N + YT ++ NP +
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPSL 56
Query: 331 YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
Y +N+TG+S+G ++ A FA G +IDSGTVITR +Y+AL+ EF +Q +
Sbjct: 57 YYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVA 116
Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLA 442
DTCFN P V + +G ++T+ + + + S A+ + CLA
Sbjct: 117 APSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACLA 174
Query: 443 LASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+A ++ N QQ+N RV+ D S++GFA E C+
Sbjct: 175 MAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 176/373 (47%), Gaps = 36/373 (9%)
Query: 130 LQTLNYIATIELGG-RNMTVIVDTGSDLTWVQCQPCKSC-YNQQDPVFDPSISPSYKKVL 187
L+T +A+ EL G + +IVDTGS T++ C+ C SC ++ +D S + +V
Sbjct: 30 LETGVLVASFELAGAQTFELIVDTGSSRTYLPCKGCASCGAHEAGRYYDYDASADFSRVE 89
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-DFI 246
C S C + G SGV C Y V Y +GS + G L R+ + LG + N +
Sbjct: 90 C--SACAGIGGKCGTSGV--------CRYDVHYLEGSGSEGYLVRDVVSLGGSVGNATVV 139
Query: 247 FGCGRNNKGLFG--GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGAS--GS 300
FGC G GL G GR +L +Q ++ + LFS C+ + G
Sbjct: 140 FGCEERELGSIKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGG 199
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI-LIDS 359
L+ GN ++ + YT M+ + A +Y + T ++G ++ S +G + +IDS
Sbjct: 200 LLTLGNFDFGADAPALVYTPMVSS---AMYYQVTTTSWTLGNSVVEGS---RGVLTIIDS 253
Query: 360 GTVITRLPPSIYSAL--KAEFLKQFSGFPS-APGFSILDTCFNLS---AYQEVN--IPLV 411
GT T +P ++++ AE + SG AP D CF S + V+ P +
Sbjct: 254 GTSYTYVPGNMHARFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGLGWSTVSEYFPAL 313
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
K+E+ G+A +T+ +Y+ + +AS C+ + L ++D ++G +N +D
Sbjct: 314 KIEYHGSARLTLSPETYLYWHQKNASAFCVGI--LEHDDNRILLGQITMRNTFTEFDVAR 371
Query: 472 SQLGFAGEDCSSM 484
SQ+G A +C +
Sbjct: 372 SQVGMASANCEML 384
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L+ + +A S + C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVLRQRIRELLLKRGAAEEESERN-CYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 163/368 (44%), Gaps = 41/368 (11%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y + +++G ++IVDTGS +T+V C C C N QDP F P++S SYK + C S
Sbjct: 35 YTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGSEC 94
Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV---NDFIFGC 249
++G C S Y Y + S + G LG++ +G +S +FGC
Sbjct: 95 ---------STGFCDGSR----KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGC 141
Query: 250 GRNNKG-LFGGVS-GLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGG 305
G L+ + G++GLGR LS++ Q E +FS C + G G++ILGG
Sbjct: 142 ETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGG--GAMILGG 199
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGF-AKGGILIDSGTV 362
F+ + +T +P + +Y L L GI +GG +L+ F K G ++DSGT
Sbjct: 200 ----FQPPKDMVFT--ASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTT 253
Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVNI----PLVKMEFE 416
P + + A K+ +Q PG D C+ + N+ P V F
Sbjct: 254 YAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFG 313
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
+T+ ++ + CL + D T ++G +N V Y+ + +GF
Sbjct: 314 DGQSVTLSPENYLFRHTKISGAYCLGV--FENGDPTTLLGGIIVRNMLVTYNRGKASIGF 371
Query: 477 AGEDCSSM 484
C+ +
Sbjct: 372 LKTKCNDL 379
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 120/416 (28%), Positives = 176/416 (42%), Gaps = 83/416 (19%)
Query: 134 NYIATIELGGRN--MTVIVDTGSDLTWVQCQP-----CKSCYNQQDPVFDPSISPSYKKV 186
+Y + LG + +++ +DTGSDL W C P C+ Q P+ P I+ + K V
Sbjct: 75 DYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPL--PKIA-NNKSV 131
Query: 187 -------------------LCNSSTC--HALEFATGNSGVCSS-SSPPDCNYFVSYGDGS 224
LC S C ++E + CSS S PP ++ +YGDGS
Sbjct: 132 SCSAAACSAAHGGSLSASHLCAISRCPLESIEISE-----CSSFSCPP---FYYAYGDGS 183
Query: 225 YTRGELGREHLGLGKAS------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
L R+ L L + V +F FGC G G+ G GR LS+ SQ +
Sbjct: 184 LV-ARLYRDSLSLPTPAPSPPINVRNFTFGCAHTT---LGEPVGVAGFGRGVLSMPSQLA 239
Query: 279 EI---FGGLFSYCLPSTQDAG----ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
G FSYCL S A LILG + T YT+++ NP+ FY
Sbjct: 240 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILG---RYYTGETEFIYTSLLENPKHPYFY 296
Query: 332 ILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
+ L GIS+G ++ A F GG+++DSGT T LP +Y ++ AEF +
Sbjct: 297 SVGLAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGK 356
Query: 385 FPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYF---------VK 433
+ +T + Y E V +P V + F G V ++ V
Sbjct: 357 VANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVG 416
Query: 434 SDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
CL L + E E +GNYQQ+ V+YD + +++GFA CS++
Sbjct: 417 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTL 472
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 169/380 (44%), Gaps = 49/380 (12%)
Query: 135 YIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVLCN 189
Y I LG ++ V VDTGSD WV C C +C + ++DP++S + K V C+
Sbjct: 76 YYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCD 135
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVND 244
C +T + + + C Y ++YGDGS T G ++ L + +V D
Sbjct: 136 DEFCT----STYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 191
Query: 245 ---FIFGCGRNNKGLFGGVS-----GLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQD 294
IFGCG G + G++G G+++ S++SQ + +FS+CL S
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSI-- 249
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-----ASG 349
SG GG ++ + P T P Q Y + L I + G +Q
Sbjct: 250 ---SG----GGIFAIGEVVQPKVKTT--PLLQGMAHYNVVLKDIEVAGDPIQLPSDILDS 300
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAYQEVN 407
+ G +IDSGT + LP SIY L + L Q SG + + D TCF+ S + V+
Sbjct: 301 SSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKL---YLVEDQFTCFHYSDEESVD 357
Query: 408 --IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED--ETGIIGNYQQKNQ 463
P VK FE +T ++ K D V + +D E ++G+ N+
Sbjct: 358 DLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANK 417
Query: 464 RVIYDTKNSQLGFAGEDCSS 483
V+YD N +G+A +CSS
Sbjct: 418 LVVYDLDNMAIGWADYNCSS 437
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 112/437 (25%), Positives = 178/437 (40%), Gaps = 88/437 (20%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP------ 173
+PL+SG T Y +G R ++ DTGSDLTWV+C + ++ P
Sbjct: 94 MPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCH--RHDHDAPAPGYGYAA 151
Query: 174 -----------------------VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSS 209
VF P S ++ + C+S TC A L F+ C +
Sbjct: 152 PASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSL---AACPTP 208
Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLG-----------KASVNDFIFGCGRNNKG-LF 257
P C Y Y DGS RG +G + + +A + + GC + G F
Sbjct: 209 GSP-CAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSF 267
Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPI 316
G++ LG S++S S+ + FGG FSYCL A+ L G N +V +S+P
Sbjct: 268 LASDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAV--SSSPP 325
Query: 317 TYTN---------------------MIPNPQLATFYILNLTGISIGGKQLQASGF----A 351
+ T ++ + ++ FY + + GIS+ G+ L+ A
Sbjct: 326 SKTACAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVA 385
Query: 352 K-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ-----E 405
K GG ++DSGT +T L Y A+ A K+ +G P D C+N ++
Sbjct: 386 KGGGAILDSGTSLTVLVSPAYRAVVAALNKKLAGLPRV-TMDPFDYCYNWTSPSTGEDLT 444
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
V +P + + F G+A + Y + + C+ L + +IGN Q+
Sbjct: 445 VAMPELAVHFAGSARLQPPAKS--YVIDAAPGVKCIGLQEGEWPG-VSVIGNILQQEHLW 501
Query: 466 IYDTKNSQLGFAGEDCS 482
+D KN +L F C+
Sbjct: 502 EFDLKNRRLRFKRSRCT 518
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 154/337 (45%), Gaps = 28/337 (8%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+D SDL W C F+P S + V C C +FA G + +
Sbjct: 117 LDISSDLVWTACGATAP--------FNPVRSTTVADVPCTDDACQ--QFAPQTCGAGAGA 166
Query: 210 SPPDCNYFVSYGDGSY-TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
+C Y YG G+ T G LG E G ++ +FGCG N G F GVSG++GLGR
Sbjct: 167 GSSECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLQNVGDFSGVSGVIGLGR 226
Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
+LSLVSQ FSY + D+ + S IL G+ + + S ++ T ++ +
Sbjct: 227 GNLSLVSQLQV---DRFSYHF-APDDSVDTQSFILFGDDATPQTSHTLS-TRLLASDANP 281
Query: 329 TFYILNLTGISIGGKQLQ-ASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLK 380
+ Y + L GI + GK L SG GG+ + ++T L + Y L+
Sbjct: 282 SLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVAS 341
Query: 381 QFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+ G P+ G ++ LD C+ + + +P + + F G A M +++ G +++ S
Sbjct: 342 KI-GLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELEL-GNYFYMDSTTGLA 399
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
CL + S D + ++G+ Q ++YD S+L F
Sbjct: 400 CLTILPSSAGDGS-VLGSLIQVGTHMMYDINGSKLVF 435
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 176/393 (44%), Gaps = 59/393 (15%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSC-YNQQDPVFDPSISP----SYK 184
Y ++ G + T+ + DTGS L + C C C ++ DP P P S K
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 185 KVLCNSSTCHAL-------EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
+ C S C L N+ C+ PP Y + YG GS T G L E L
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPP---YILQYGLGS-TAGVLITEKLDF 205
Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DA 295
+V DF+ GC + +G+ G GR +SL SQ + FS+CL S + D
Sbjct: 206 PDLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNL---KRFSHCLVSRRFDDT 259
Query: 296 GASGSLIL----GGNSSVFKNSTP-ITYTNMIPNPQLAT-----FYILNLTGISIGGKQL 345
+ L L G NS + TP +TYT NP ++ +Y LNL I +G K +
Sbjct: 260 NVTTDLDLDTGSGHNSG---SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316
Query: 346 Q------ASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LD 395
+ A G GG ++DSG+ T + ++ + EF Q S + L
Sbjct: 317 KIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG 376
Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-- 453
CFN+S +V +P + EF+G A++ + ++ FV + VCL + S + +G
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFV-GNTDTVCLTVVSDKTVNPSGGT 435
Query: 454 ----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
I+G++QQ+N V YD +N + GFA + CS
Sbjct: 436 GPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/362 (28%), Positives = 168/362 (46%), Gaps = 37/362 (10%)
Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
N T+ VD+G +WV C + +F P +S S+ K+ C S +C A + + G
Sbjct: 13 NFTLAVDSG--FSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCSAFSAVSTSCG 70
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIFGCGRNNKGLFG-- 258
SS C+Y SYG + G+L + + + + GCGR++ GL
Sbjct: 71 PSSS-----CSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDSGGLLELL 125
Query: 259 GVSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPSTQDAGASGSLILGG----NSSVFKNS 313
SG +G + ++S + Q S + + F YCLPS G L++G N+S+ S
Sbjct: 126 DTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDT---FRGKLVIGNYKLRNASI---S 179
Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGF---AKGGILIDSGTVITRLPP 368
+ + YT MI NPQ A Y +NL+ ISI + Q GF GG +ID+ T ++ L
Sbjct: 180 SSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGGTVIDTTTFLSYLTS 239
Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDT-----CFNLSAYQEVNIP-LVKMEFEGNAEMT 422
Y+ L + +K ++ S+ D C+N+SA + P + F G A +
Sbjct: 240 DFYTQL-VQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLTYHFLGGAGVE 298
Query: 423 VDVTGIVYFVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
V ++ S + +C+A+ S S +IG YQQ + V YD + + GF + C
Sbjct: 299 VSTWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358
Query: 482 SS 483
++
Sbjct: 359 NT 360
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 118/406 (29%), Positives = 174/406 (42%), Gaps = 64/406 (15%)
Query: 117 VSNTEIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++ ++PL +GI T Y I +G + V VDTGSD+ WV C C SC +
Sbjct: 70 LTAVDLPLGGNGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGL 129
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGV---CSSSSPPDCNYFVSYGDGSY 225
++DP+ S S K V C C N GV C+++SP C Y ++YGDGS
Sbjct: 130 GIDLTLYDPTASASSKTVTCGQEFCATAT----NGGVPPSCAANSP--CQYSITYGDGSS 183
Query: 226 TRGEL-----------GREHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSD 270
T G G L ASV FGCG G G + G++G G+++
Sbjct: 184 TTGFFVADFLQYDQVSGDGQTNLANASVT---FGCGAKIGGALGSSNVALDGILGFGQAN 240
Query: 271 LSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
S++SQ + +FS+CL D G + GN K T T ++P
Sbjct: 241 SSMLSQLTSAGKVTKIFSHCL----DTVNGGGIFAIGNVVQPKVKT----TPLVPG---M 289
Query: 329 TFYILNLTGISIGGKQLQAS------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
Y + L I +GG LQ G G +IDSGT + LP +Y KA F
Sbjct: 290 PHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIIDSGTTLAYLPEVVY---KAVLSAVF 346
Query: 383 SGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
S P ++ D CF S + P V F+G+ + V Y ++ C+
Sbjct: 347 SNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVVYPHD--YLFQNTEDVYCV 404
Query: 442 ALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
S + + G ++G+ N+ V+YD +N +G+ +CSS
Sbjct: 405 GFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTNYNCSS 450
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS TWV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSRGV--FVERSVQEQDVWCLAFA 312
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 124/516 (24%), Positives = 197/516 (38%), Gaps = 89/516 (17%)
Query: 42 LQWQQKSGSSS--SCVSHQKSRIEMGAITLELKHKNY----CSGKIVDWNEQQQNRLILD 95
+QW + +S + H + + ++ LEL H+++ G VD E + + D
Sbjct: 6 MQWNTITKASILITITLHLILPVAVNSMRLELVHRHHERFSGGGGDVDQVEAVKGFVNRD 65
Query: 96 NLHVQYLQSRI------KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMT 147
L Q + R + + E+P+ +G Y +++G G+
Sbjct: 66 GLRRQRMNQRWGVSNYDRRRKGLETTTTTEVEMPMRAGRDDALGEYFTEVKVGSPGQRFW 125
Query: 148 VIVDTGSDLTWVQC---------------------------------------------Q 162
+ DTGS+ TW C
Sbjct: 126 LAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSN 185
Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
PCK VF P S S++ V C S C + +C S P C Y +SY D
Sbjct: 186 PCKG-------VFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDP-CLYDISYAD 237
Query: 223 GSYTRGELGREHLGLG-----KASVNDFIFGCGR---NNKGLFGGVSGLMGLGRSDLSLV 274
GS +G G + + + + +N+ GC + N G++GLG + S +
Sbjct: 238 GSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFI 297
Query: 275 SQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
+ + +G FSYCL S L +GG+ + K I T +I P FY +
Sbjct: 298 DKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNA-KLLGEIKRTELILFP---PFYGV 353
Query: 334 NLTGISIGGKQL----QASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
N+ GISIGG+ L Q F ++GG LIDSGT +T L Y + +K +
Sbjct: 354 NVVGISIGGQMLKIPPQVWDFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRV 413
Query: 389 PG--FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
G F LD CF+ + + +P + F G A V Y + C+ + +
Sbjct: 414 TGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKS--YIIDVAPLVKCIGIVPI 471
Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+IGN Q+N +D + +GFA C+
Sbjct: 472 DGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSICT 507
>gi|194702702|gb|ACF85435.1| unknown [Zea mays]
gi|414885969|tpg|DAA61983.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 163
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 92/160 (57%), Gaps = 4/160 (2%)
Query: 326 QLATFYILNLTGISIGGKQLQA--SGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
Q +FY LNLTGI++ G+ ++ S FA G +IDSGT + LPPS Y+AL++
Sbjct: 5 QHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSGTAFSCLPPSAYAALRSSVRSAM 64
Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
+ AP +I DTC++L+ ++ V IP V + F A + + +G++Y S+ SQ CLA
Sbjct: 65 GRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLY-TWSNVSQTCLA 123
Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ G++GN QQ+ VIYD N ++GF C+
Sbjct: 124 FLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGCA 163
>gi|295830689|gb|ADG39013.1| AT5G10770-like protein [Neslia paniculata]
Length = 159
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 70/165 (42%), Positives = 100/165 (60%), Gaps = 8/165 (4%)
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
LS SQT+ + +FSYCLPS+ A +G L G S+ S T + I + +F
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDGT--SF 54
Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
Y L++ I++GG++L ++ F+ G LIDSGTVITRLPP Y+AL++EF + S +P+
Sbjct: 55 YGLSIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSEFKAKMSKYPTT 114
Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
G SILDTCF+LS ++ V IP V F G A + + GI+Y K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGILYAFK 159
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 158/370 (42%), Gaps = 44/370 (11%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
Y + +G + +IVD+GS +T+V C C+ C N QDP F P +S +Y V CN
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
TC S C Y Y + S + G LG + + G S +FG
Sbjct: 148 TC--------------DSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFG 193
Query: 249 CGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
C + G LF G+MGLGR LS++ Q + + G FS C G G+++LG
Sbjct: 194 CENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG--GAMVLG 251
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AKGGILIDSGT 361
+ T++N + +P +Y + L + + GK L+ K G ++DSGT
Sbjct: 252 AMPA--PPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCF-----NLSAYQEVNIPLVKME 414
LP + A K Q P + D CF N+S EV P V M
Sbjct: 306 TYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEV-FPKVDMV 364
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F ++++ ++ CL + + +D T ++G +N V YD N ++
Sbjct: 365 FGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDRHNEKI 423
Query: 475 GFAGEDCSSM 484
GF +CS +
Sbjct: 424 GFWKTNCSEL 433
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/406 (26%), Positives = 165/406 (40%), Gaps = 80/406 (19%)
Query: 146 MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPS--YKKVLCNSSTCHALEFATG 201
+++ +DTGSDL W C P C C + P + P +++ C S C A +
Sbjct: 105 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRRIPCASPLCSAAHASAP 164
Query: 202 NSGVCSSSSPP-------DCN-------YFVSYGDGSYTRG-ELGREHLGLGK-----AS 241
S +C+++ P C + +YGDGS GR LG G +
Sbjct: 165 PSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVA 224
Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----A 297
V++F F C G G+ G GR LSL Q S G FSYCL S
Sbjct: 225 VDNFTFACAHTA---LGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIR 281
Query: 298 SGSLILGGNSSVFKNSTPIT----YTNMIPNPQLATFYILNLTGISIGGKQLQASG---- 349
LILG + + T YT ++ NP+ FY + L +S+G ++QA
Sbjct: 282 PSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELAR 341
Query: 350 ---FAKGGILIDSGTVITRLPPSIYSALKA--------------EFLKQFSGFPSAPGFS 392
GG+++DSGT T LP +Y+ + E ++ +G
Sbjct: 342 VDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTG-------- 393
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--------CLAL- 443
L C+ +A + +P + + F GNA + + KS+ + CL L
Sbjct: 394 -LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLM 451
Query: 444 ----ASLSYED-ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
AS D G +GN+QQ+ V+YD ++GFA C+ +
Sbjct: 452 NGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 497
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 86/238 (36%), Positives = 121/238 (50%), Gaps = 21/238 (8%)
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYT 319
SGLMGLGR LSLVSQT FSYCL P + GA+G L +G ++S+ + +T T
Sbjct: 152 SGLMGLGRGRLSLVSQTGAT---KFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMT-T 207
Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQ-----------ASGFAKGGILIDSGTVITRLPP 368
+ P+ + FY L L G+++G +L A G GG++IDSG+ T L
Sbjct: 208 QFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVH 267
Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEFEGNAEMTVDVT 426
Y AL +E + +G AP D + A ++V +P V F G A+M V
Sbjct: 268 DAYDALASELAARLNGSLVAPPPDADDGALCV-ARRDVGRVVPAVVFHFRGGADMAVPAE 326
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
Y+ D + C+A+AS +IGNYQQ+N RV+YD N F DCS++
Sbjct: 327 S--YWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCSAL 382
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 159/359 (44%), Gaps = 46/359 (12%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
+IVDTGS +T+V C C+ C QDP F P +S +Y+ V C L+ N +
Sbjct: 94 FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC------TLDCNCDNDRM 147
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
C Y Y + S + G LG + + G S +FGC G L+
Sbjct: 148 -------QCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENVETGDLYSQHA 200
Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKNST 314
G+MGLGR DLS++ Q + + FS C D G G+++LGG S VF S
Sbjct: 201 DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY-GGMDVGG-GAMVLGGISPPSDMVFAQSD 258
Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGF-AKGGILIDSGTVITRLPPSIY 371
P+ + +Y ++L I + GK+ L S F K G ++DSGT LP +
Sbjct: 259 PVR----------SPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYLPEEAF 308
Query: 372 SALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTVDV 425
A K +K+ F S P + D CF+ + P+V M F + ++
Sbjct: 309 LAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSP 368
Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ CL + + +D T ++G +N V+YD + +++GF +C+ +
Sbjct: 369 ENYMFRHSKVRGAYCLGIFQ-NGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNCAEL 426
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T IV DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSRGV--FVERSVQEQDVWCLAFA 312
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 158/370 (42%), Gaps = 44/370 (11%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
Y + +G + +IVD+GS +T+V C C+ C N QDP F P +S +Y V CN
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
TC S C Y Y + S + G LG + + G S +FG
Sbjct: 148 TC--------------DSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFG 193
Query: 249 CGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
C + G LF G+MGLGR LS++ Q + + G FS C G G+++LG
Sbjct: 194 CENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG--GAMVLG 251
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AKGGILIDSGT 361
+ T++N + +P +Y + L + + GK L+ K G ++DSGT
Sbjct: 252 AMPA--PPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCF-----NLSAYQEVNIPLVKME 414
LP + A K Q P + D CF N+S EV P V M
Sbjct: 306 TYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPKVDMV 364
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F ++++ ++ CL + + +D T ++G +N V YD N ++
Sbjct: 365 FGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDRHNEKI 423
Query: 475 GFAGEDCSSM 484
GF +CS +
Sbjct: 424 GFWKTNCSEL 433
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/398 (27%), Positives = 186/398 (46%), Gaps = 58/398 (14%)
Query: 121 EIPLTSGIRL---QTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVF 175
++P +S +L + T+ +G +N+++++DTGS+L+W+ C+ + + VF
Sbjct: 44 KLPRSSSDKLSFRHNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGS----VF 99
Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGRE 233
+P S +Y V C+S C T + + +S P C+ +SY D + G L +
Sbjct: 100 NPVSSSTYSPVPCSSPICRT---RTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHD 156
Query: 234 HLGLGKASVNDFIFGCGRNNKGLF------GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
+G + +FGC + GL +GLMG+ R LS V+Q + FSY
Sbjct: 157 TFVIGSVTRPGTLFGC--MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQ---LGFSKFSY 211
Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI----PNPQLATF-YILNLTGISIGG 342
C+ + +SG L+LG S + PI YT ++ P P Y + L GI +G
Sbjct: 212 CISGSD---SSGILLLGDAS--YSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGS 266
Query: 343 K--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG---FPSAPGFS 392
K L S F G ++DSGT T L +Y+ALK EF+ Q P F
Sbjct: 267 KILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFV 326
Query: 393 I---LDTCFNLSAYQEVN---IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
+D C+ + + N +P++ + F G AEM+V ++Y V S+ +
Sbjct: 327 FQGTMDLCYRVGSSTRPNFTGLPVISLMFRG-AEMSVSGQKLLYRVNGAGSEGKEEVYCF 385
Query: 447 SYED------ETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
++ + E +IG++ Q+N + +D S++GFAG
Sbjct: 386 TFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAG 423
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 151/355 (42%), Gaps = 29/355 (8%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK---VLCNSSTCHALEFATGNSG 204
+++DTGS L+W+QC K+ +Q P + CN C
Sbjct: 97 MVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLCKPRVPDFSLPT 156
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGL 263
C ++S C+Y Y DG+Y G L RE + + + I GC + G+
Sbjct: 157 DCDANS--LCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCATQSDD----ARGI 210
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN--SSVFKNSTPITYTNM 321
+G+ L SQ FSYC+P+ Q ASGS LG N SS F+ +T+
Sbjct: 211 LGMNLGRLGFPSQAKIT---KFSYCVPTKQAQPASGSFYLGNNPASSSFRYVNLLTFGQS 267
Query: 322 IPNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSA 373
P L Y L L GISIGGK+L + + G +IDSG+ T L Y+
Sbjct: 268 QRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNV 327
Query: 374 LKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
++ E +K+ G G+ + D CF+ A E+ + M FE + + +
Sbjct: 328 IREELVKKV-GPKIKKGYMYGGVADICFDGDAI-EIGRLVGDMVFEFEKGVQIVIPKERV 385
Query: 431 FVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
D CL + G IIGN+ Q+N V +D N ++GF DCS +
Sbjct: 386 LATVDGGVHCLGMGRSERLGAGGNIIGNFHQQNLWVEFDLANRRVGFGEADCSKL 440
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 113/411 (27%), Positives = 178/411 (43%), Gaps = 34/411 (8%)
Query: 95 DNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELG---GRNMTVIV 150
DN Q + S +++ +VS+T +IP+ SG Y +I +G + ++
Sbjct: 79 DNARRQMISS-LRHGTRRKAFEVSHTAQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVT 137
Query: 151 DTGSDLTWVQCQ-PCKSCYNQQDP----VFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
DTGSDLTW+ C+ CKSC + +P VF + S S++ + C+S C +
Sbjct: 138 DTGSDLTWMNCEYWCKSC-PKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTE 196
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGV 260
C + + P C + Y +G G E + +G K + D + GC + G
Sbjct: 197 CPNPNAP-CLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDVLIGCTESFNETNGFP 255
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
G+MGLG SL + +EIFG FSYCL + + + G+ K + +T
Sbjct: 256 DGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGDIPEMKLPK-MQHTE 314
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG-----FAKGGILIDSGTVITRLPPSIY---- 371
++ + FY +N++GIS+GG L S GG+++DSGT +T L Y
Sbjct: 315 LLLG-YINAFYPVNVSGISVGGSMLSISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVV 373
Query: 372 SALKAEFLKQFSGFP-SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
ALK F K P P + + CF + +P + + F A V Y
Sbjct: 374 DALKPIFDKHKKVVPIELPELN--NFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKS--Y 429
Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ CL + + + I+GN Q+N YD +LGF C
Sbjct: 430 IIDVAEGIKCLGIIKADFPGSS-ILGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 108/398 (27%), Positives = 176/398 (44%), Gaps = 59/398 (14%)
Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QD 172
++PL +G+ +T Y I +G ++ V VDTGSD+ WV C C +C + +
Sbjct: 66 DLPLGGNGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIEL 125
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELG 231
++DPS S S V C C A + GV S P C Y +SYGDGS T G
Sbjct: 126 TLYDPSGSSSGTGVTCGQDFCVAT-----HGGVIPSCVPAAPCQYSISYGDGSSTTGFFV 180
Query: 232 REHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE 279
+ L + S N FGCG G G + G++G G+S+ S++SQ +
Sbjct: 181 TDFLQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAA 240
Query: 280 I--FGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
+F++CL + G A G ++ ++ T ++P Y +NL
Sbjct: 241 AGKVRKVFAHCLDTINGGGIFAIGDVV----------QPKVSTTPLVPG---MPHYNVNL 287
Query: 336 TGISIGGKQLQAS------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
I +GG +LQ G +KG I IDSGT + LP +Y+A+ ++ Q+ P
Sbjct: 288 EAIDVGGVKLQLPTNIFDIGESKGTI-IDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKN 346
Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
CF S + P++ FEG + + ++ + C+ + +
Sbjct: 347 DQDF--QCFRYSGSVDDGFPIITFHFEGGLPLNIHPHDYLF---QNGELYCMGFQTGGLQ 401
Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ G ++G+ N+ V+YD +N +G+ +CSS
Sbjct: 402 TKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNCSS 439
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 173/381 (45%), Gaps = 36/381 (9%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCK-SCYN---QQDPVFD 176
P+ + + I LG + V VDTGS L+WV CQ C+ SC+ + VFD
Sbjct: 63 PVVGNHEIHEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFD 122
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG---DGSYTRGELGRE 233
P S +Y+ V C+S C ++ + C + C Y + YG G Y+ G LG +
Sbjct: 123 PDKSTTYELVGCSSRDCADVQRSLVAPFGCIEET-DTCLYSLRYGSGPSGQYSAGRLGTD 181
Query: 234 HLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS-EIFGGLFSYCLP 290
L L +S ++ FIFGC ++ G SG++G G ++ S +Q + + FSYC P
Sbjct: 182 KLTLASSSSIIDGFIFGCSGDDS-FKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFP 240
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--S 348
D A G L +G + YTN+IP+ + Y L + + G +LQ S
Sbjct: 241 G--DHTAEGFLSIGAYP-----KDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQS 293
Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI----LDTCFNLSAYQ 404
+ K +++DSGTV T L ++ A F K + A GF +TCF +
Sbjct: 294 EYTKRMMVVDSGTVDTFLLGPVFDA----FSKAMASAMQAKGFLSDTVGTETCFRPNGGD 349
Query: 405 EV---NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGIIGNYQQ 460
V ++P V+M F G + + + + + ++CLA ++ I+GN
Sbjct: 350 SVDSGDLPTVEMRFIGTT-LKLPPENVFHDLLPSHDKICLAFKPDVAGVRNVQILGNKAT 408
Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
+ RV+YD + GF C
Sbjct: 409 XSFRVVYDLQAMYFGFQAGAC 429
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGRHGV--FVERSVQEQDVWCLAFA 312
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T I +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFSFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
Length = 216
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/222 (34%), Positives = 117/222 (52%), Gaps = 16/222 (7%)
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
+SL+SQT + G+FSYCLPS + SGSL LG +N + +T ++ NP +
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRHTPLLTNPHRPSL 56
Query: 331 YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
Y +N+TG+S+G ++ A FA G +IDSGTVITR +Y+AL+ EF +Q +
Sbjct: 57 YYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVA 116
Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLA 442
DTCFN P V + +G ++T+ + + + S A+ + CLA
Sbjct: 117 APSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACLA 174
Query: 443 LASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+A ++ N QQ+N RV+ D S++GFA E C+
Sbjct: 175 MAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/353 (28%), Positives = 155/353 (43%), Gaps = 48/353 (13%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+IVDTGSDL W QC+ S P + + + TC A A G V +
Sbjct: 55 LIVDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPARTGAFTRTCTASAAAVG---VLA 111
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLG 267
S + +T G L LG FGCG + G G +G++GL
Sbjct: 112 SET--------------FTFGARRAVSLRLG--------FGCGALSAGSLIGATGILGLS 149
Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMIPNP 325
LSL++Q FSYCL D S L+ G + + ++ T PI T ++ NP
Sbjct: 150 PESLSLITQLKI---QRFSYCLTPFADKKTS-PLLFGAMADLSRHKTTRPIQTTAIVSNP 205
Query: 326 QLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTVITRLPPSIYSALKAEF 378
+Y + L GIS+G K+L A+ A GG ++DSG+ + L + + A+K E
Sbjct: 206 VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVK-EA 264
Query: 379 LKQFSGFPSA-PGFSILDTCFNL------SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
+ P A + CF L +A + V +P + + F+G A M + YF
Sbjct: 265 VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDN--YF 322
Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+ A +CLA+ + IIGN QQ+N V++D ++ + FA C +
Sbjct: 323 QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 375
>gi|242086418|ref|XP_002443634.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
gi|241944327|gb|EES17472.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
Length = 486
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 162/358 (45%), Gaps = 45/358 (12%)
Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
L+YI + G + V + T + ++C+PC S + +P FD S ++ V C+S
Sbjct: 149 LDYIVLVSYGSPEQQFPVFLGTNVGTSLLRCKPCASGSDDCNPAFDTLQSSTFAHVPCSS 208
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFG 248
C CSSS C ++ YG G + L L +S V+DF F
Sbjct: 209 PDCPV---------NCSSSV---CPFYDLYG---TVGGTFATDVLTLAPSSMAVHDFRFV 253
Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG-----GLFSYCLPSTQDAGASGSLI 302
C + +G + L R SL SQ S G FSYCLP Q + G L
Sbjct: 254 CMDVESPSPDLPEAGSIDLSRHRNSLPSQLSSSSGIAPTAASFSYCLP--QSRNSQGFLS 311
Query: 303 LGGNSSVFKNS------TPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG 354
LGG+++V + P+ + N +P LA+ Y ++L G+S+GG+ L + F
Sbjct: 312 LGGDATVVGDDDNLTVHAPMVWNN---DPDLASMYFIDLVGMSLGGEDLPIPSGTFGNAS 368
Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGF---PSAPGFSILDTCFNLSAYQEVNIPLV 411
+D G T L P Y+ L+ F K+ S + S GF DTCFN + E+ +PLV
Sbjct: 369 TNLDVGATFTMLAPEAYTTLRDAFRKEMSQYNNRSSPAGFDGFDTCFNFTGLNELVVPLV 428
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDA---SQVCLALASLSYEDE-TGIIGNYQQKNQRV 465
+++F + +D ++Y+ A + CLA +SL D + +IG Y + V
Sbjct: 429 QLKFSNGESLMIDGDQMLYYHDPAAGPFTMACLAFSSLDVGDSFSAVIGTYTLASTEV 486
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 159/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T IV DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S + PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 165/374 (44%), Gaps = 49/374 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYN------QQDPVFDPSISPSYKKVLCNSSTCH--- 194
+ ++ +VDTGS + W C +C N ++ P+F+P +S S K + C C
Sbjct: 98 QKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTS 157
Query: 195 ------ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
GNS CS + P Y + YG G+ G E+L +++ F+ G
Sbjct: 158 SPDVHLGCPRCNGNSKKCSHACP---QYTLQYGTGA-ASGFFLLENLDFPGKTIHKFLVG 213
Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLILGGN 306
C + L G GR+ SL Q F+YCL S D SG LIL +
Sbjct: 214 C-TTSADREPSSDALAGFGRTMFSLPMQMGV---KKFAYCLNSHDYDDTRNSGKLILDYS 269
Query: 307 SSVFKNSTPITYTNMIPNPQLATFYI-LNLTGISIGGKQLQASGF-------AKGGILID 358
+ ++Y + NP FY L + + IG K L+ G ++GG++ID
Sbjct: 270 DGETQG---LSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMID 326
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFP---SAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
SG + ++ + E KQ S + A S L C+N + ++ + IP + +F
Sbjct: 327 SGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQF 386
Query: 416 EGNAEMTVDVTGIVYFVK-SDASQVCLALASLSYEDE-------TGIIGNYQQKNQRVIY 467
G A M V G+ YF+ S+AS C + + S + + I+GNYQQ + V +
Sbjct: 387 TGGANMVVP--GMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEF 444
Query: 468 DTKNSQLGFAGEDC 481
D KN +LGF + C
Sbjct: 445 DLKNERLGFRQQTC 458
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S + PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S + PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 167/352 (47%), Gaps = 38/352 (10%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDP---VFDPSISPSYKKVLCNSSTCHAL--EFATGNS 203
+VD S W QC PC + P F P+ S ++ + C+S C + E
Sbjct: 105 LVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAG 164
Query: 204 GVCSSSSPPDCN-YFVSYG-DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
++++ C+ Y ++YG + T G L + G +V +FGC + G F G S
Sbjct: 165 AAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGAS 224
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCL--PSTQDAGASGSLILGGNSSVFK----NSTP 315
G++G+GR +LSL+SQ G FSY L P D G++ S+I G+ +V K STP
Sbjct: 225 GVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTP 281
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAKGGILIDSGTVITRLP 367
+ + + P+ FY +NLTG+ + G +L A GG+++ S T +T L
Sbjct: 282 LLSSTLYPD-----FYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLE 336
Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
+ Y ++A + G P+ G + LD C+N S+ +V +P + + F+G A+M D+
Sbjct: 337 QAAYDVVRAAVASRI-GLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADM--DL 393
Query: 426 TGIVYF-VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
+ YF + +D CL + ++G Q +IYD +L F
Sbjct: 394 SAANYFYIDNDTGLECLTMLP---SQGGSVLGTLLQTGTNMIYDVDAGRLTF 442
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 156/323 (48%), Gaps = 35/323 (10%)
Query: 175 FDPSISPSYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPDCN-YFVSYG-DGSYTRGEL 230
F P+ S ++ + C+S C + E ++++ C+ Y ++YG + T G L
Sbjct: 134 FRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYL 193
Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL- 289
+ G +V +FGC + G F G SG++G+GR +LSL+SQ G FSY L
Sbjct: 194 ATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLL 250
Query: 290 -PSTQDAGASGSLILGGNSSVFKN----STPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
P D G++ S+I G+ +V K STP+ + + P+ FY +NLTG+ + G +
Sbjct: 251 APEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPD-----FYYVNLTGVRVDGNR 305
Query: 345 LQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--L 394
L A GG+++ S T +T L + Y ++A + G P+ G + L
Sbjct: 306 LDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLPAVNGSAALEL 364
Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF-VKSDASQVCLALASLSYEDETG 453
D C+N S+ +V +P + + F+G A+M D++ YF + +D CL +
Sbjct: 365 DLCYNASSMAKVKVPKLTLVFDGGADM--DLSAANYFYIDNDTGLECLTMLP---SQGGS 419
Query: 454 IIGNYQQKNQRVIYDTKNSQLGF 476
++G Q +IYD +L F
Sbjct: 420 VLGTLLQTGTNMIYDVDAGRLTF 442
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGRRGV--FVERSVQEQDVWCLAFA 312
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 62/387 (16%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
Y ++LG + V +DTGSD+ WV C PC C N Q F+P S + ++
Sbjct: 89 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148
Query: 188 CNSSTCHALEFATGNSGVC----SSSSPPDCNYFVSYGDGSYTRGE-----------LGR 232
C+ C A F TG + +C S SSP C Y +YGDGS T G +G
Sbjct: 149 CSDDRCTA-GFQTGEA-ICQTSNSQSSP--CGYTFTYGDGSGTSGYYVSDTMFFETVMGN 204
Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFS 286
E AS+ +FGC + G V G+ G G+ LS++SQ + + +FS
Sbjct: 205 EQTANSSASI---VFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 261
Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL 345
+CL + + G G L+LG + P + YT ++P+ Y LNL I++ G++L
Sbjct: 262 HCLKGSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKL 310
Query: 346 --QASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFN 399
+S F G ++DSGT + L Y + S PS S CF
Sbjct: 311 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKGSQCFI 368
Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----II 455
S+ + + P V + F G M+V ++ AS L + ++ G I+
Sbjct: 369 TSSSVDSSFPTVTLYFMGGVAMSVKPEN---YLLQQASVDNSVLWCIGWQRNQGQEITIL 425
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCS 482
G+ K++ +YD N ++G+A DCS
Sbjct: 426 GDLVLKDKIFVYDLANMRMGWADYDCS 452
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T IV DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 62/387 (16%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
Y ++LG + V +DTGSD+ WV C PC C N Q F+P S + ++
Sbjct: 91 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150
Query: 188 CNSSTCHALEFATGNSGVC----SSSSPPDCNYFVSYGDGSYTRGE-----------LGR 232
C+ C A F TG + +C S SSP C Y +YGDGS T G +G
Sbjct: 151 CSDDRCTA-GFQTGEA-ICQTSNSQSSP--CGYTFTYGDGSGTSGYYVSDTMFFETVMGN 206
Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFS 286
E AS+ +FGC + G V G+ G G+ LS++SQ + + +FS
Sbjct: 207 EQTANSSASI---VFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 263
Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL 345
+CL + + G G L+LG + P + YT ++P+ Y LNL I++ G++L
Sbjct: 264 HCLKGSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKL 312
Query: 346 --QASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFN 399
+S F G ++DSGT + L Y + S PS S CF
Sbjct: 313 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKGSQCFI 370
Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----II 455
S+ + + P V + F G M+V ++ AS L + ++ G I+
Sbjct: 371 TSSSVDSSFPTVTLYFMGGVAMSVKPEN---YLLQQASVDNSVLWCIGWQRNQGQEITIL 427
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCS 482
G+ K++ +YD N ++G+A DCS
Sbjct: 428 GDLVLKDKIFVYDLANMRMGWADYDCS 454
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 126/462 (27%), Positives = 200/462 (43%), Gaps = 68/462 (14%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNL----HVQYLQSRI 106
SS+ ++ + SR+ +L H+N + D NE ++R + +L+S+I
Sbjct: 27 SSTLITTKPSRLAT-----KLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKI 81
Query: 107 KNMIS-GNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQP 163
K + S GN + ++ IP G ++ + +G +T V+VDTGS L WVQC P
Sbjct: 82 KELKSVGN--EARSSLIPFNRGS-----GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
C +C+ Q FDP S S+K + C + + N C+ + + Y + Y G
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYI-----NGYKCNRFNQAE--YKLRYLGG 187
Query: 224 SYTRGELGREHLGL-----GKASVNDFIFGCGR-----NNKGLFGGVSGLMGLGRSDLSL 273
++G L +E L GK ++ FGCG NN + GV GL +++
Sbjct: 188 DSSQGILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAY--PHITM 245
Query: 274 VSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
+Q G FSYC+ + L+LG S + +STP+ +Y+
Sbjct: 246 ATQ----LGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQI-------HFGHYYV 294
Query: 333 LNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
L IS+G K L + S GG+LIDSG T+L + L E + G
Sbjct: 295 -TLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL 353
Query: 386 ----PSAPGFSILDTCFN-LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
P+ F L CF + + V P V F G A++ ++ + F + + C
Sbjct: 354 LERIPTQRKFEGL--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSL--FRQHGGDRFC 409
Query: 441 LA-LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
LA L S S +IG Q+N V +D + ++ F DC
Sbjct: 410 LAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 115/442 (26%), Positives = 192/442 (43%), Gaps = 46/442 (10%)
Query: 66 AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLT 125
+ T EL H + + + +E +RL + LQ R N ++ + +SN++ +
Sbjct: 37 SFTAELIHIDSPNSPFFNASETTTHRL------AKALQ-RSANRVA-RLNPLSNSDEGVH 88
Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
+ I NY+ + +G + +DTGS++ W+ C CK C+NQ +F+P S +Y
Sbjct: 89 ASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTY 148
Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
+ C+S C + + VC S C+ + G + + + L +
Sbjct: 149 QDAPCDSYQCETTSSSCQSDNVCLYS----CD---EKHQLNCPNGRIAVDTMTLTSSDGR 201
Query: 244 DFI-----FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
F F CG + F GV G++GLGR LSL S+ + G FSYCL S
Sbjct: 202 PFPLPYSDFVCGNSIYKTFAGV-GVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQPS 260
Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAK-- 352
+ G S + + + T + + +Y+ L GIS+G K+ FA
Sbjct: 261 -KINFGLQSFISDDDLEVVSTTLGHHRHSGNYYV-TLEGISVGEKRQDLYYVDDPFAPPV 318
Query: 353 GGILIDSGTVITRLPPSIY----SALKAEFLKQFSGFPSAPGFSI-LDTCFNLSA----Y 403
G +LIDSGT+ T LP Y S + + P F +D LS Y
Sbjct: 319 GNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFWYY 378
Query: 404 QEVNIPLVKMEF-EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
E+ P + + F + + E++ D + F++ VC A A+ + ++ + G++QQ N
Sbjct: 379 PELKFPKITIHFTDADVELSDDNS----FIRVAEDVVCFAFAA-TQPGQSTVYGSWQQMN 433
Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
+ YD K + F DCS +
Sbjct: 434 FILGYDLKRGTVSFKRTDCSKL 455
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 154/350 (44%), Gaps = 51/350 (14%)
Query: 149 IVDTGSDLTWVQCQPCKSCYNQ-QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
I+DTGS L W+QC PCKSC Q P+FDPSIS +Y + C + C SG C
Sbjct: 118 IMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNIICRY-----APSGECD 172
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCG-RNNKGLFGGVS 261
SSS C Y +Y +G + G + E L G+ +VN+ +FGC RN +
Sbjct: 173 SSS--QCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRNGNYKDRRFT 230
Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTN 320
G+ GLG S+V+Q G FSYC+ + D S L+L ++ STP+ +
Sbjct: 231 GVFGLGSGITSVVNQ----MGSKFSYCIGNIADPDYSYNQLVLSEGVNMEGYSTPLDVVD 286
Query: 321 MIPNPQLATFYILNLTGISIGGKQL--QASGFAKG----GILIDSGTVITRLPPSIYSAL 374
Y + L GIS+G +L S F + ++IDSGT T L + Y AL
Sbjct: 287 --------GHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLAENEYRAL 338
Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
+ E F + P C+ Q+ V P V F A++ VD
Sbjct: 339 EREVRNLLDRFLT-PFMRESFLCYKGKVGQDLVGFPAVTFHFAEGADLVVDTE------- 390
Query: 434 SDASQVCLALASLSYED--ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ AS+ +D + +IG Q+ V YD +L F DC
Sbjct: 391 -------MRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 165/387 (42%), Gaps = 62/387 (16%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
LT+G L YI T + +IVD+GS +T+V C C+ C N QDP F P +S +Y
Sbjct: 80 LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTY 135
Query: 184 KKVLCNSS-TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
V C++ TC S C Y Y + S + G LG + + G S
Sbjct: 136 SPVKCSADCTC--------------DSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESE 181
Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
+FGC + G LF G+MGLGR LS++ Q + + G FS C
Sbjct: 182 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG 241
Query: 296 GASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
G G+++LG + VF S P+ + +Y + L I + GK L+
Sbjct: 242 G--GAMVLGAMPAPPDMVFSRSDPVR----------SPYYNIELKEIHVAGKALRLDPRI 289
Query: 351 --AKGGILIDSGTVITRLPPSIYSAL------KAEFLKQFSGFPSAPGFSILDTCF---- 398
+K G ++DSGT LP + A K LK+ G P + D CF
Sbjct: 290 FDSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRG----PDPNYKDICFAGAG 345
Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
N+S + P V M F ++++ ++ CL + + +D T ++G
Sbjct: 346 RNVSQLSQA-FPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGG 403
Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+N V YD N ++GF +CS +
Sbjct: 404 IVVRNTLVTYDRHNEKIGFWKTNCSEL 430
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 62/387 (16%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
Y ++LG + V +DTGSD+ WV C PC C N Q F+P S + ++
Sbjct: 5 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64
Query: 188 CNSSTCHALEFATGNSGVC----SSSSPPDCNYFVSYGDGSYTRGE-----------LGR 232
C+ C A F TG + +C S SSP C Y +YGDGS T G +G
Sbjct: 65 CSDDRCTA-GFQTGEA-ICQTSNSQSSP--CGYTFTYGDGSGTSGYYVSDTMFFETVMGN 120
Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFS 286
E AS+ +FGC + G V G+ G G+ LS++SQ + + +FS
Sbjct: 121 EQTANSSASI---VFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 177
Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL 345
+CL + + G G L+LG + P + YT ++P+ Y LNL I++ G++L
Sbjct: 178 HCLKGSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKL 226
Query: 346 --QASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFN 399
+S F G ++DSGT + L Y + S PS S CF
Sbjct: 227 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKGSQCFI 284
Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----II 455
S+ + + P V + F G M+V ++ AS L + ++ G I+
Sbjct: 285 TSSSVDSSFPTVTLYFMGGVAMSVKPEN---YLLQQASVDNSVLWCIGWQRNQGQEITIL 341
Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCS 482
G+ K++ +YD N ++G+A DCS
Sbjct: 342 GDLVLKDKIFVYDLANMRMGWADYDCS 368
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 162/363 (44%), Gaps = 44/363 (12%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
+++DT S L W++C C Q+ PVFDPS S SY+ + S C A CS
Sbjct: 91 LVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPVLPAGDKCS 150
Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGL--FGGVSGL 263
P + + G +G + + LG + ++ FGC ++ +G G +G
Sbjct: 151 FHLPGEAH------------GYVGTDTIILGNPTLPIHSVAFGCAQSTEGFDTKGTFAGT 198
Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGASGSLILGGNSS-----VFKNSTPIT 317
+G+G+ SL+ Q + G FSYCL G +G + G + V +
Sbjct: 199 LGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILP 258
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQ---LQASGFAK-----GGILIDSGTVITRLPPS 369
+P+ + Y + L GIS+ G ++ + F + GG +D+GT +T L P+
Sbjct: 259 TPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQVTHLVPA 318
Query: 370 IYSALK---AEFLKQFSGFPSA--PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
Y+ ++ A ++Q+ G+ P FS+ CF +IP + ++FEG A TV
Sbjct: 319 AYAVVEEAVAHMVQQW-GYKRVRDPNFSL---CFREHPGIWSHIPKLTLDFEGPASRTVA 374
Query: 425 VTGIV---YFVKSDASQ-VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
IV F+K D VC + S T ++G QQ + R I+D + + F E
Sbjct: 375 HLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPT-VVGAMQQVDTRFIFDLHANTITFHRES 433
Query: 481 CSS 483
C +
Sbjct: 434 CEA 436
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSKGV--FVERSVQEQDVWCLAFA 312
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 167/391 (42%), Gaps = 54/391 (13%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSCYN-QQDPVFDPSISPSYKKVLC 188
Y +E G + T ++DTGS L W+ C C C + P F P S S K V C
Sbjct: 86 YSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGC 145
Query: 189 NSSTC----------HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
+ C H CS + P Y V YG GS T G L E+L
Sbjct: 146 TNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCP---AYTVQYGLGS-TAGFLLSENLNFP 201
Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ---DA 295
+DF+ GC + +G+ G GR + SL SQ + FSYCL S Q A
Sbjct: 202 TKKYSDFLLGCSVVS---VYQPAGIAGFGRGEESLPSQMNLT---RFSYCLLSHQFDDSA 255
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNP------QLATFYILNLTGISIGGKQ----- 344
+ +L+L SS + ++YT + NP +Y + L I +G K+
Sbjct: 256 TITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPR 315
Query: 345 --LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ--FSGFPSAPGFSILDTCFNL 400
L+ + GG ++DSG+ T + I+ + EF KQ ++ A L CF L
Sbjct: 316 RLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFVL 375
Query: 401 SAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFV-KSDASQVCLALASLSYEDETG----- 453
+ E + P ++ EF G A+M + V V K D + CL + S G
Sbjct: 376 AGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVA--CLTIVSDDVAGSGGTVGPA 433
Query: 454 -IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
I+GNYQQ+N V YD +N + GF + C +
Sbjct: 434 VILGNYQQQNFYVEYDLENERFGFRSQSCQT 464
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 158/370 (42%), Gaps = 54/370 (14%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQ-----------DPVFDPSISPSYKKVLCNSSTCH 194
+IVDTGS +T+V C C C + Q DP F P S SY+K+ C SS C
Sbjct: 53 FALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQKIGCRSSDC- 111
Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV---NDFIFGCGR 251
+G+C S+S C Y Y + S ++G LG++ L G AS FGC
Sbjct: 112 -------ITGLCDSNS-HQCKYERMYAEMSTSKGVLGKDLLDFGPASRLQSQLLSFGCET 163
Query: 252 NNKG-LFGGVS-GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGG-- 305
G L+ V+ G+MGLGR LS+V Q + FS C + G GS++LG
Sbjct: 164 AESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGG--GSMVLGAIP 221
Query: 306 --NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSG 360
+ VF S +P+ + +Y L LT I + G L+ K G ++DSG
Sbjct: 222 APSGMVFAKS----------DPRRSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSG 271
Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKME 414
T LP + A + Q + P + D C+ + + PLV
Sbjct: 272 TTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPLVDFV 331
Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
F N ++++ ++ CL +D T ++G +N V YD N Q+
Sbjct: 332 FAENQKVSLAPENYLFKHTKVPGAYCLGF--FKNQDATTLLGGIIVRNMLVTYDRYNHQI 389
Query: 475 GFAGEDCSSM 484
GF +C+ +
Sbjct: 390 GFLKTNCTEL 399
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/390 (26%), Positives = 177/390 (45%), Gaps = 38/390 (9%)
Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPS 178
+PL+SG T Y +G + ++ DTGSDLTWV+C + VF +
Sbjct: 99 MPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAA 158
Query: 179 ISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-- 235
S S+ + C+S TC + + F+ N CSS + P C Y Y DGS RG +G +
Sbjct: 159 ASRSWAPIACSSDTCTSYVPFSLAN---CSSPASP-CAYDYRYNDGSAARGVVGTDSATI 214
Query: 236 ----------GLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
G +A + + GC + G F G++ LG S++S S+ + FGG
Sbjct: 215 ALSGSESRDGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR 274
Query: 285 FSYCLPSTQDAGASGSLIL-------GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
FSYCL + S + GG ++ +S+ T ++ + +++ FY + +
Sbjct: 275 FSYCLVDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDA 334
Query: 338 ISIGGKQLQASG----FAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
+ + G+ L A+ GG ++DSGT +T L Y A+ A ++ +G P
Sbjct: 335 VHVAGEALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRV-SMD 393
Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
+ C+N +A + IP +++ F G+A + Y V + C+ + ++ +
Sbjct: 394 PFEYCYNWTA-AALEIPGLEVRFAGSARLQPPAKS--YVVDAAPGVKCIGVQEGAWPGVS 450
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+IGN Q++ +D ++ L F C+
Sbjct: 451 -VIGNILQQDHLWEFDLRDRWLRFKHTRCA 479
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 156/356 (43%), Gaps = 40/356 (11%)
Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SSTCHALEFATGNSG 204
+IVDTGS +T+V C C+ C QDP F P S +Y+ V C C
Sbjct: 125 FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCTIDCNCDGDRM------ 178
Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-G 259
C Y Y + S + G LG + + G S +FGC G L+
Sbjct: 179 --------QCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETGDLYSQH 230
Query: 260 VSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
G+MGLGR DLS++ Q ++ FS C D G G+++LGG S P
Sbjct: 231 ADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY-GGMDVGG-GAMVLGGISP------PSD 282
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQ--LQASGF-AKGGILIDSGTVITRLPPSIYSAL 374
T +P + +Y ++L + + GK+ L A+ F K G ++DSGT LP + + A
Sbjct: 283 MTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEAAFLAF 342
Query: 375 KAEFLKQFSGFP--SAPGFSILDTCF----NLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
K +K+ S P + D CF N + + P+V M F + ++
Sbjct: 343 KDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENY 402
Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
++ CL + + D+T ++G +N V+YD + +++GF +C+ +
Sbjct: 403 MFRHSKVRGAYCLGIFQ-NGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCAEL 457
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 154/337 (45%), Gaps = 32/337 (9%)
Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
+D SDL W C F+P S + V C C +FA G +S
Sbjct: 117 LDISSDLVWTACGATAP--------FNPVRSTTVADVPCTDDACQ--QFAPQTCGAGAS- 165
Query: 210 SPPDCNYFVSYGDGSY-TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
+C Y YG G+ T G LG E G ++ +FGCG N G F GVSG++GLGR
Sbjct: 166 ---ECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGR 222
Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
+LSLVSQ FSY + D+ + S IL G+ + + S ++ T ++ +
Sbjct: 223 GNLSLVSQLQV---DRFSYHF-APDDSVDTQSFILFGDDATPQTSHTLS-TRLLASDANP 277
Query: 329 TFYILNLTGISIGGKQLQ-ASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLK 380
+ Y + L GI + GK L SG GG+ + ++T L + Y L+
Sbjct: 278 SLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVAS 337
Query: 381 QFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
+ G P+ G ++ LD C+ + + +P + + F G A M +++ G +++ S
Sbjct: 338 KI-GLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELEL-GNYFYMDSTTGLA 395
Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
CL + S D + ++G+ Q ++YD S+L F
Sbjct: 396 CLTILPSSAGDGS-VLGSLIQVGTHMMYDINGSKLVF 431
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 87/276 (31%), Positives = 137/276 (49%), Gaps = 34/276 (12%)
Query: 92 LILDNLHVQYLQSRIK-----NMISGNIKDVSNTEI----PLTS-GIRLQTLNYIATIE- 140
L D V Y+Q R+ N ++G D T++ P ++ G+ + + A +
Sbjct: 96 LDADQHRVAYIQKRLAGGDTSNGVAGASWDGQTTDVGTYLPASNVGVGAKMIGTTAAPDG 155
Query: 141 LGGRNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-E 197
TVI+D+GSD+ WVQCQPC C+ Q+DP+FDP+ S +Y V C+S+ C L
Sbjct: 156 TSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGP 215
Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKG- 255
+ G CS++ C + +Y DG+ G + L LG V F+FGC ++G
Sbjct: 216 YRRG----CSANV--QCQFGFTYTDGATATGTYSSDDLTLGPYDVVRGFLFGCAHADRGS 269
Query: 256 -LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFK 311
VSG + LG S V QT+ +G +FSYC+P + + G + LG +++
Sbjct: 270 TFSFDVSGTLALGGGAQSFVQQTATQYGRVFSYCIPPSPS--SLGFITLGVPPQRAALVP 327
Query: 312 N--STPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
STP+ ++ +P TFY + L I + G+ L
Sbjct: 328 TFVSTPLLSSSSMP----PTFYRVLLRAIIVAGRPL 359
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 114/397 (28%), Positives = 167/397 (42%), Gaps = 72/397 (18%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQ----------DP 173
LT+G L YI T + +IVD+GS +T+V C C+ C N Q DP
Sbjct: 87 LTNGYYTTRL-YIGT---PSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDP 142
Query: 174 VFDPSISPSYKKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
F P +S +Y V CN TC + C Y Y + S + G LG
Sbjct: 143 RFQPDLSSTYSPVKCNVDCTC--------------DNERSQCTYERQYAEMSSSSGVLGE 188
Query: 233 EHLGLGKAS---VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLF 285
+ + GK S +FGC G LF G+MGLGR LS++ Q E + F
Sbjct: 189 DIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSF 248
Query: 286 SYCLPSTQDAGASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
S C D G G+++LGG + VF +S P+ + +Y + L I +
Sbjct: 249 SLCY-GGMDVGG-GTMVLGGMPAPPDMVFSHSNPVR----------SPYYNIELKEIHVA 296
Query: 342 GKQLQASGF---AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFS 392
GK L+ +K G ++DSGT LP + A K LK+ G P +
Sbjct: 297 GKALRLDPKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRG----PDPN 352
Query: 393 ILDTCF-----NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
D CF N+S EV P V M F ++++ ++ CL + +
Sbjct: 353 YKDICFAGAGRNVSQLSEV-FPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-N 410
Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+D T ++G +N V YD N ++GF +CS +
Sbjct: 411 GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSEL 447
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F G FSYCLP + + +G L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 161/384 (41%), Gaps = 60/384 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPV----FDPSISPSYKKVLCNSSTCHA 195
+N++ I DTGS L W C C C + DP F P +S S K V C + C
Sbjct: 143 QNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAW 202
Query: 196 L---------EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
+ S CS S P Y + YG G+ T G L E L L V DF+
Sbjct: 203 IFGPNLKSRCRNCNSKSRKCSDSCP---GYGLQYGSGA-TAGILLSETLDLENKRVPDFL 258
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST--QDAGASGSLILG 304
GC + +G+ G GR SL SQ FS+CL S D+ S L+L
Sbjct: 259 VGCSVMS---VHQPAGIAGFGRGPESLPSQMRL---KRFSHCLVSRGFDDSPVSSPLVLD 312
Query: 305 GNSSVFKNST------PITYTNMIPNPQLATFYILNLTGISIGGKQLQ-------ASGFA 351
S ++ T P + N +Y L+L I IGGK ++
Sbjct: 313 SGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTG 372
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQE-VN 407
GG +IDSG+ T L I+ A+ E KQ +P A S L CFN+ +E
Sbjct: 373 NGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEESAE 432
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---------IIGNY 458
P V ++F+G ++++ + V +D VCL + + DE I+G +
Sbjct: 433 FPDVVLKFKGGGKLSLAAENYLAMV-TDEGVVCLTMMT----DEAVVGGGGGPAIILGAF 487
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
QQ+N V YD ++GF + C+
Sbjct: 488 QQQNVLVEYDLAKQRIGFRKQKCT 511
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 114/397 (28%), Positives = 167/397 (42%), Gaps = 72/397 (18%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQ----------DP 173
LT+G L YI T + +IVD+GS +T+V C C+ C N Q DP
Sbjct: 86 LTNGYYTTRL-YIGT---PSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDP 141
Query: 174 VFDPSISPSYKKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
F P +S +Y V CN TC + C Y Y + S + G LG
Sbjct: 142 RFQPDLSSTYSPVKCNVDCTC--------------DNERSQCTYERQYAEMSSSSGVLGE 187
Query: 233 EHLGLGKAS---VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLF 285
+ + GK S +FGC G LF G+MGLGR LS++ Q E + F
Sbjct: 188 DIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSF 247
Query: 286 SYCLPSTQDAGASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
S C D G G+++LGG + VF +S P+ + +Y + L I +
Sbjct: 248 SLCY-GGMDVGG-GTMVLGGMPAPPDMVFSHSNPVR----------SPYYNIELKEIHVA 295
Query: 342 GKQLQASGF---AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFS 392
GK L+ +K G ++DSGT LP + A K LK+ G P +
Sbjct: 296 GKALRLDPKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRG----PDPN 351
Query: 393 ILDTCF-----NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
D CF N+S EV P V M F ++++ ++ CL + +
Sbjct: 352 YKDICFAGAGRNVSQLSEV-FPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-N 409
Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
+D T ++G +N V YD N ++GF +CS +
Sbjct: 410 GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSEL 446
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 178/400 (44%), Gaps = 54/400 (13%)
Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++ ++PL G+ T Y I+LG ++ V VDTGSD+ WV C C+ C ++
Sbjct: 67 LAAADLPLGGLGLPTDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGL 126
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
++DP S + V+C+ + C A F G C ++ P C Y V+YGDGS T G
Sbjct: 127 GLDLTLYDPKASSTGSMVMCDQAFC-AATFG-GKLPKCGANVP--CEYSVTYGDGSSTIG 182
Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ L + + + IFGCG G G + G++G G ++ S++SQ
Sbjct: 183 SFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQ 242
Query: 277 --TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
T+ +F++CL + + G +G +TP+ + P Y +N
Sbjct: 243 LTTAGKVKKIFAHCLDTIK---GGGIFSIGDVVQPKVKTTPL----VADKPH----YNVN 291
Query: 335 LTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIY-SALKAEFLK-QFSGFPS 387
L I +GG LQ K G +IDSGT +T LP ++ + A F K Q F
Sbjct: 292 LKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITFHD 351
Query: 388 APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
GF CF + P + FE ++ + V YF + C+ + +
Sbjct: 352 VQGF----LCFQYPGSVDDGFPTITFHFED--DLALHVYPHEYFFANGNDVYCVGFQNGA 405
Query: 448 YEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + G ++G+ N+ VIYD +N +G+ +CSS
Sbjct: 406 SQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCSS 445
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 162/375 (43%), Gaps = 43/375 (11%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y +I +G R + VDTGSDLTW+QC PC +C P++ P+ K V
Sbjct: 187 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKE---KIVPPRDL 243
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
C L+ GN C + C+Y + Y D S + G L R+ + L G DF+F
Sbjct: 244 LCQELQ---GNQNYCETCK--QCDYEIEYADQSSSMGVLARDDMHLIATNGGREKLDFVF 298
Query: 248 GCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSL 301
GC + +G G++GL + +SL SQ + I +F +C+ T++ G G +
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCI--TREQGGGGYM 356
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--ILIDS 359
LG + + IT+T++ P Y + G +QL+ A ++ DS
Sbjct: 357 FLGDD---YVPRWGITWTSIRSGPD--NLYHTEAHHVKYGDQQLRMREQAGNTVQVIFDS 411
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN-------LSAYQEVNIPLVK 412
G+ T LP IY L A GF L C+ L ++ PL
Sbjct: 412 GSSYTYLPDEIYENLVAAIKYASPGFVQDSSDRTLPLCWKADFPVRYLEDVKQFFKPL-N 470
Query: 413 MEFEGN---AEMTVDVTGIVYFVKSDASQVCLALASLSYEDE--TGIIGNYQQKNQRVIY 467
+ F T ++ Y + SD VCL L + + + T I+G+ + + V+Y
Sbjct: 471 LHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVY 530
Query: 468 DTKNSQLGFAGEDCS 482
D + Q+G+ DC+
Sbjct: 531 DNQRRQIGWTNSDCT 545
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 167/383 (43%), Gaps = 54/383 (14%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
Y ++LG + V +DTGSD+ WV C PC C N Q F+P S + K+
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE-----------LGREHLG 236
C+ C A + VC +S C Y +YGDGS T G +G E
Sbjct: 151 CSDDRCTAA--LQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTA 208
Query: 237 LGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLP 290
AS+ +FGC + G V G+ G G+ LS+VSQ + + +FS+CL
Sbjct: 209 NSSASI---VFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 265
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QAS 348
+ + G G L+LG + + + YT ++P+ Y LNL I + G++L +S
Sbjct: 266 GSDNGG--GILVLG---EIVEPG--LVYTPLVPS---QPHYNLNLESIVVNGQKLPIDSS 315
Query: 349 GFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFNLSAYQ 404
F G ++DSGT + L Y S PS S + CF S+
Sbjct: 316 LFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKGNQCFVTSSSV 373
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQ 460
+ + P V + F G MTV ++ AS L + ++ G I+G+
Sbjct: 374 DSSFPTVSLYFMGGVAMTVKPEN---YLLQQASIDNNVLWCIGWQRNQGQQITILGDLVL 430
Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
K++ +YD N ++G+ DCS+
Sbjct: 431 KDKIFVYDLANMRMGWTDYDCST 453
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 160/361 (44%), Gaps = 41/361 (11%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL--EFATGNSGV 205
+++DTGS L+W+QC FDPS+S ++ + C C +F S
Sbjct: 112 MVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCKPRIPDFTLPTS-- 169
Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND-FIFGCGRNNKGLFGGVSGLM 264
C + C+Y Y DG+Y G L RE ++ I GC + G++
Sbjct: 170 CDQNR--LCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATESTD----PRGIL 223
Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG---ASGSLILG--GNSSVFKNSTPITYT 319
G+ R LS SQ+ FSYC+P+ +GS LG NS+ F+ +T+
Sbjct: 224 GMNRGRLSFASQSKIT---KFSYCVPTRVTRPGYTPTGSFYLGHNPNSNTFRYIEMLTFA 280
Query: 320 NMIPNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIY 371
P L Y + L GI IGG++L +A G ++DSG+ T L Y
Sbjct: 281 RSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAY 340
Query: 372 SALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVTG 427
++AE ++ G G+ + D CF+ +A + I + EFE ++ V
Sbjct: 341 DKVRAEVVRAV-GPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFEFEKGVQIVVPKER 399
Query: 428 IVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ V+ C+ +A+ D+ G IIGN+ Q+N V +D N ++GF DCS
Sbjct: 400 VLATVEGGVH--CIGIAN---SDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADCSR 454
Query: 484 M 484
+
Sbjct: 455 L 455
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T IV DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F FSYCLP + + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 99/332 (29%), Positives = 153/332 (46%), Gaps = 34/332 (10%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+A +G + ++ +VD +L W QC PC+ C+ Q P+FDP+ S +++ + C S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 193 CHALEFATGN--SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
C ++ ++ N S VC +P GD T G+ G + +G A FGC
Sbjct: 117 CESIPESSRNCTSDVCIYEAP------TKAGD---TGGKAGTDTFAIGAAK-ETLGFGCV 166
Query: 250 GRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA----GASGSLIL 303
+K L GG SG++GLGR+ SLV+Q + FSYCL GA+ +
Sbjct: 167 VMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLA 223
Query: 304 GG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
GG +S+ F T ++ NP +Y++ L GI GG LQA+ + +L+D+ +
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNP----YYMVKLAGIKTGGAPLQAASSSGSTVLLDTVS 279
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
+ L Y ALK P A D CF + + P + F+G A +
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA--PELVFTFDGGAAL 337
Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
TV Y + S VCL + S + + TG
Sbjct: 338 TVPPAN--YLLASGNGTVCLTIGSSASLNLTG 367
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 156/361 (43%), Gaps = 46/361 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C+ C QDP F P +S +Y+ V CN C+
Sbjct: 24 QRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNID-CN--------- 73
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV---NDFIFGCGRNNKG-LFG- 258
C C Y Y + S + G LG + + G S +FGC G L+
Sbjct: 74 --CDDEK-QQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENMETGDLYSQ 130
Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKN 312
G+MG+GR DLS+V + + FS C G+++LGG S VF
Sbjct: 131 HADGIMGMGRGDLSIVDHLVDKGVINDSFSLCY--GGMGIGGGAMVLGGISPPSNMVFSQ 188
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
S P+ + +Y ++L I + GK L + K G ++DSGT LP +
Sbjct: 189 SDPVR----------SPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYLPEA 238
Query: 370 IYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTV 423
+ + K +K+ P + D CF+ + + P V+M F ++ +
Sbjct: 239 AFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLL 298
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ CL + + +D T ++G +N V+YD +NS++GF +CS
Sbjct: 299 SPENYLFRHSKVHGAYCLGIFQ-NGKDPTTLLGGIVVRNTLVLYDRENSKIGFWKTNCSE 357
Query: 484 M 484
+
Sbjct: 358 L 358
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S + PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F FSYCLP + + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L + +A S + C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVLSQRIRELLLKRGAAEEESERN-CYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 167/384 (43%), Gaps = 56/384 (14%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
Y ++LG + V +DTGSD+ WV C PC C N Q F+P S + K+
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE-----------LGREHLG 236
C+ C A + VC +S C Y +YGDGS T G +G E
Sbjct: 151 CSDDRCTAA--LQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208
Query: 237 LGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLP 290
AS+ +FGC + G V G+ G G+ LS+VSQ + + +FS+CL
Sbjct: 209 NSSASI---VFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 265
Query: 291 STQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL--QA 347
+ + G G L+LG + P + YT ++P+ Y LNL I + G++L +
Sbjct: 266 GSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIVVNGQKLPIDS 314
Query: 348 SGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFNLSAY 403
S F G ++DSGT + L Y S PS S + CF S+
Sbjct: 315 SLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKGNQCFVTSSS 372
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQ 459
+ + P V + F G MTV ++ AS L + ++ G I+G+
Sbjct: 373 VDSSFPTVSLYFMGGVAMTVKPEN---YLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 429
Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
K++ +YD N ++G+ DCS+
Sbjct: 430 LKDKIFVYDLANMRMGWTDYDCST 453
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 179/398 (44%), Gaps = 58/398 (14%)
Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
++PL SG+ +T Y I +G + V VDTGSD+ WV C C C + +
Sbjct: 75 DLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIEL 134
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
++DP S S + V C+ C A S C+S+SP C Y +SYGDGS T G
Sbjct: 135 TMYDPRGSQSGELVTCDQQFCVANYGGVLPS--CTSTSP--CEYSISYGDGSSTAGFFVT 190
Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI 280
+ L + S + FGCG G G + G++G G+S+ S++SQ +
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAA 250
Query: 281 --FGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
+F++CL + G A G+++ + T ++P+ Y + L
Sbjct: 251 GKVRKMFAHCLDTVNGGGIFAIGNVV----------QPKVKTTPLVPD---MPHYNVILK 297
Query: 337 GISIGGKQLQ------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
GI +GG L SG +KG I IDSGT + +P +Y AL A +
Sbjct: 298 GIDVGGTALGLPTNIFDSGNSKGTI-IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQ-- 354
Query: 391 FSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
++ D +CF S + P V FEG+ + V Y ++ + C+ + +
Sbjct: 355 -TLQDFSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD--YLFQNGKNLYCMGFQNGGVQ 411
Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ G ++G+ N+ V+YD +N +G+A +CSS
Sbjct: 412 TKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 76/235 (32%), Positives = 118/235 (50%), Gaps = 19/235 (8%)
Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
+F G +GL+GLG +S V Q GG FSYCL S + +SGSL G + S P
Sbjct: 1 MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVS-RGTESSGSLEFG------RESVP 53
Query: 316 I--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFAKGGILIDSGTVITRL 366
+ ++ ++I NP+ +FY + L+G + I + + +GG+++D+GT +TRL
Sbjct: 54 VGASWVSLIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRL 113
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
P + Y+A + F+ Q + P G SI DTC++L+ + V +P + F G +T+
Sbjct: 114 PAAAYNAFRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPAR 173
Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ V S C A A S IIGN QQ+ + D N +GF C
Sbjct: 174 NFLIPVDS-VGTFCFAFAPSS--SGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + T IV DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F FSYCLP + + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++LT IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGRGGV--FVERSVQEQDVWCLAFA 312
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 112/380 (29%), Positives = 171/380 (45%), Gaps = 55/380 (14%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC--HA 195
T+ +N+T+++DTGS+L+W+ C ++ + F+P S SY + C+SSTC
Sbjct: 78 TVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS-TFNPVWSSSYSPIPCSSSTCTDQT 136
Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----R 251
+F S C S+ C+ +SY D S + G L + +G + + + +FGC
Sbjct: 137 RDFPIRPS--CDSNQ--FCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFS 192
Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
+N +GLMG+ R LS VSQ FSYC+ + SG L+LG + F
Sbjct: 193 SNSEEDSKNTGLMGMNRGSLSFVSQMG---FPKFSYCI---SEYDFSGLLLLGDAN--FS 244
Query: 312 NSTPITYTNMI----PNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDS 359
P+ YT +I P P Y + L GI + K L + G ++DS
Sbjct: 245 WLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDS 304
Query: 360 GTVITRLPPSIYSALKAEFLKQFSG-----------FPSAPGFSILDTCFNLSAYQEVNI 408
GT T L Y+AL+ FL + +G F A +D C+ + Q
Sbjct: 305 GTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGA-----MDLCYRVPTNQTRLP 359
Query: 409 PL--VKMEFEGNAEMTVDVTGIVYFV----KSDASQVCLALASLSYED-ETGIIGNYQQK 461
PL V + F G AEMTV I+Y V + + S C + E +IG+ Q+
Sbjct: 360 PLPSVTLVFRG-AEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 418
Query: 462 NQRVIYDTKNSQLGFAGEDC 481
N + +D K S++G A C
Sbjct: 419 NVWMEFDLKKSRIGLAEIRC 438
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 171/388 (44%), Gaps = 50/388 (12%)
Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY--------K 184
Y + LG T ++DTGS L W C C + P DP+ P++ K
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPP---DCN-----YFVSYGDGSYTRGELGREHLG 236
+ C + C L F C P +C+ Y + YG G+ T G L ++L
Sbjct: 148 LLGCRNPKCGYL-FGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDNLN 205
Query: 237 LGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--D 294
+V F+ GC + SG+ G GR SL SQ + FSYCL S + D
Sbjct: 206 FPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNL---KRFSYCLVSHRFDD 259
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQ----LATFYILNLTGISIGG-------K 343
S L+L +S+ + ++YT NP +Y + L + +GG K
Sbjct: 260 TPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYK 319
Query: 344 QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ----FSGFPSAPGFSILDTCFN 399
L+ GG ++DSG+ T + +Y+ + EFL+Q +S + S L CFN
Sbjct: 320 FLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFN 379
Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL--SYEDETG---- 453
+S + ++ P +F+G A+M+ + FV DA +C + S + + +T
Sbjct: 380 ISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFV-GDAEVLCFTVVSDGGAGQPKTAGPAI 438
Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I+GNYQQ+N V YD +N + GF +C
Sbjct: 439 ILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 167/382 (43%), Gaps = 50/382 (13%)
Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWV-----QCQPCKSCY-----NQQDPV 174
G L L+Y I++G N++ +V D GSDL WV QC P + Y ++
Sbjct: 100 GNELDWLHY-TWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCAPLSASYYNISLDRDLSE 158
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD--GSYTRGELGR 232
+ PS+S + + + C+ C E+ + C + P C Y +Y D + + G L
Sbjct: 159 YSPSLSSTSRHLSCDHQLC---EWGSN----CKNPKDP-CPYIFNYDDFENTTSAGFLVE 210
Query: 233 EHLGLGKASVNDF----------IFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE 279
+ L L ASV D + GCGR G F G+MGLG D+S+ S ++
Sbjct: 211 DKLHL--ASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAK 268
Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
GL C D SG ++ G + STP +P Y + +
Sbjct: 269 --AGLIQNCFSLCFDENDSGRILFGDRGHASQQSTP-----FLPIQGTYVAYFVGVESYC 321
Query: 340 IGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
+G L+ SGF L+DSG+ T LP +Y+ L +EF KQ + + + D C+N
Sbjct: 322 VGNSCLKRSGFKA---LVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYN 378
Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQ 459
S+ + +IP ++++F N V + CL+L + GIIG
Sbjct: 379 ASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPT--DGSYGIIGQNF 436
Query: 460 QKNQRVIYDTKNSQLGFAGEDC 481
R+++D +N +LG++ C
Sbjct: 437 MIGYRMVFDIENLKLGWSNSSC 458
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 165/383 (43%), Gaps = 55/383 (14%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
Y IE+G + V VDTGSD+ WV C C C + ++DP S S V
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
C++ C A + C++ P C Y YGDGS T G + L + S N
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKP--CEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTR 204
Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
+ IFGCG G + G++G G+S+ S +SQ + +FS+CL + +
Sbjct: 205 HAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIK 264
Query: 294 DAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
G A G ++ + T ++PN + Y +NL I + G LQ
Sbjct: 265 GGGIFAIGEVV----------QPKVKSTPLLPN---MSHYNVNLQSIDVAGNALQLPPHI 311
Query: 351 ----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQ 404
K G +IDSGT +T LP +Y + A ++ F + GF CF S
Sbjct: 312 FETSEKRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGF----LCFEYSESV 367
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQ 460
+ P + FE ++ ++V YF ++ + CL + ++ + ++G+
Sbjct: 368 DDGFPKITFHFE--DDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVL 425
Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
N+ V+YD + +G+ +CSS
Sbjct: 426 SNKVVVYDLEKQVIGWTDYNCSS 448
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 167/383 (43%), Gaps = 54/383 (14%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
Y ++LG + V +DTGSD+ WV C PC C N Q F+P S + K+
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE-----------LGREHLG 236
C+ C A + VC +S C Y +YGDGS T G +G E
Sbjct: 177 CSDDRCTAA--LQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 234
Query: 237 LGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLP 290
AS+ +FGC + G V G+ G G+ LS+VSQ + + +FS+CL
Sbjct: 235 NSSASI---VFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 291
Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QAS 348
+ + G G L+LG + + + YT ++P+ Y LNL I + G++L +S
Sbjct: 292 GSDNGG--GILVLG---EIVEPG--LVYTPLVPS---QPHYNLNLESIVVNGQKLPIDSS 341
Query: 349 GFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFNLSAYQ 404
F G ++DSGT + L Y S PS S + CF S+
Sbjct: 342 LFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKGNQCFVTSSSV 399
Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQ 460
+ + P V + F G MTV ++ AS L + ++ G I+G+
Sbjct: 400 DSSFPTVSLYFMGGVAMTVKPEN---YLLQQASIDNNVLWCIGWQRNQGQQITILGDLVL 456
Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
K++ +YD N ++G+ DCS+
Sbjct: 457 KDKIFVYDLANMRMGWTDYDCST 479
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/399 (27%), Positives = 176/399 (44%), Gaps = 52/399 (13%)
Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++ ++PL G+ T Y + LG + V VDTGSD+ WV C C C ++
Sbjct: 69 LATADLPLGGLGLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGL 128
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
++DP S + V+C+ C A F G CS++ P C Y V+YGDGS T G
Sbjct: 129 GLDLTLYDPKASSTGSTVMCDQGFC-ADTFG-GRLPKCSANVP--CEYSVTYGDGSSTVG 184
Query: 229 ELGREHL--------GLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
+ L G + + IFGCG G G + G++G G ++ S++SQ
Sbjct: 185 SFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQ 244
Query: 277 --TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
T+ +F++CL + + G +G +TP+ + P Y +N
Sbjct: 245 LATAGKVKKIFAHCLDTIK---GGGIFAIGDVVQPKVKTTPL----VADKPH----YNVN 293
Query: 335 LTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
L I +GG L+ A F G G +IDSGT +T LP ++ K L F+
Sbjct: 294 LKTIDVGGTTLELPADIFKPGEKRGTIIDSGTTLTYLPELVF---KKVMLAVFNKHQDIT 350
Query: 390 GFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
+ D CF S + P + FE ++ + V YF + C+ + +
Sbjct: 351 FHDVQDFLCFEYSGSVDDGFPTLTFHFE--DDLALHVYPHEYFFPNGNDVYCVGFQNGAL 408
Query: 449 EDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + G ++G+ N+ V+YD +N +G+ +CSS
Sbjct: 409 QSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDYNCSS 447
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 163/376 (43%), Gaps = 43/376 (11%)
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y +I +G R + VDTGSDLTW+QC PC +C P++ P+ K V
Sbjct: 186 QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKE---KIVPPRD 242
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
C L+ GN C + C+Y + Y D S + G L R+ + + G DF+
Sbjct: 243 LLCQELQ---GNQNYCETCK--QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFV 297
Query: 247 FGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGS 300
FGC + +G G++GL + +S SQ + I +F +C+ T++ G G
Sbjct: 298 FGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCI--TREQGGGGY 355
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--ILID 358
+ LG + + +T+T++ P Y + G +QL+ A ++ D
Sbjct: 356 MFLGDD---YVPRWGVTWTSIRSGPD--NLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE-- 416
SG+ T LP IY L A GF L C+ + + + VK FE
Sbjct: 411 SGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWK-ADFPVRYLEDVKQFFEPL 469
Query: 417 ----GNAEMTVDVTGIV----YFVKSDASQVCLALASLSY--EDETGIIGNYQQKNQRVI 466
G + + T + Y + SD VCL L + + T I+G+ + + V+
Sbjct: 470 NLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVV 529
Query: 467 YDTKNSQLGFAGEDCS 482
YD + Q+G+A DC+
Sbjct: 530 YDNQRKQIGWADSDCT 545
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 128/415 (30%), Positives = 187/415 (45%), Gaps = 81/415 (19%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD--------PVFDPSISPSYK 184
Y ++ LG + + V++DTGS LTWV PC S Y Q+ PVF P S S
Sbjct: 86 YAFSLSLGTPPQPLPVLLDTGSHLTWV---PCTSNYQCQNCSAAAGSFPVFHPKSSSSSL 142
Query: 185 KVLCNSSTC---HA---LEFATGNSGVCSSSS-----------PPDCNYFVSYGDGSYTR 227
V C+S +C H+ L +S C S+ PP Y V YG GS T
Sbjct: 143 LVSCSSPSCLWIHSKSHLSDCARDSAPCRPSTANCSATATNVCPP---YLVVYGSGS-TA 198
Query: 228 GELGREHLGLGK--ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
G L + L L A+ +F GC + + SGL G GR S+ +Q F
Sbjct: 199 GLLVSDTLRLSPRGAASRNFAVGCSLAS--VHQPPSGLAGFGRGAPSVPAQLGV---NKF 253
Query: 286 SYCLPSTQ---DAGASGSLILGGNSSVFKNSTPITYTNMIPN----PQLATFYILNLTGI 338
SYCL S + DA SG L+LG SS K + Y ++ N P + +Y L+LTGI
Sbjct: 254 SYCLLSRRFDDDAAISGELVLGA-SSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGI 312
Query: 339 SIGGKQLQASGFAKGGI--------LIDSGTVITRLPPSIYSALKAEFLKQFSGF----P 386
++GGK + A + +IDSGT T L P+++ + A + G
Sbjct: 313 AVGGKSVALPARALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSK 372
Query: 387 SAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ------V 439
G L CF L A + +++P + + F G AEM + + YF+ + + +
Sbjct: 373 DVEGALGLRPCFALPAGARTMDLPELSLHFSGGAEMRLPIEN--YFLAAGPASGVAPEAI 430
Query: 440 CLALAS-----------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
CLA+ S I+G++QQ+N +V YD + ++LGF + CSS
Sbjct: 431 CLAVVSDVSSASGGAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPCSS 485
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/408 (24%), Positives = 173/408 (42%), Gaps = 54/408 (13%)
Query: 121 EIPLTSGIRLQTLN-YIATIELGGRNM--TVIVDTGSDLTWVQCQPCK---SCYNQQDP- 173
E+P+ S + + + Y+ ++ +G + +++DT +DLTW+ C+ + Y +Q
Sbjct: 110 ELPMRSALNIAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTG 169
Query: 174 ----------------VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-CNY 216
+ P+ S S++++ C+ C L + T C S S + C+Y
Sbjct: 170 QTMSMGGEGAKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNT-----CQSPSKAESCSY 224
Query: 217 FVSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSD 270
F DG+ T G G+E + G+ A + I GC G G++ LG D
Sbjct: 225 FQKTQDGTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGD 284
Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
+S ++ FG FS+CL S + AS L G N +V T T+++ N +
Sbjct: 285 MSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGT--METDILYNVDVKP 342
Query: 330 FYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
Y +TG+ +GG++L A F GG+++D+ T +T L P Y+ + A +
Sbjct: 343 AYGAQVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHL 402
Query: 383 SGFPSAPGFSILDTCFN-------LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
S P + C+ + V IP +E G A + + +V + +
Sbjct: 403 SHLPRVYELEGFEYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVV-MPEVE 461
Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
CLA L GI+GN + D + ++ F + C++
Sbjct: 462 PGVACLAFRKL-LRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKCNT 508
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 167/373 (44%), Gaps = 48/373 (12%)
Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSC--YNQQDPVFDPSISPSYKKVLCNSSTCHA 195
TI + + +D G L W QC C S +NQ+ P FDP+ S +Y+ C ++ C
Sbjct: 29 TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALC-- 86
Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC--GRNN 253
EF + CS C Y S +T G++G + + +G A+ FGC +
Sbjct: 87 -EFFPASIRNCSGDV---CAYEASTQLFEHTSGKIGTDAVAIGTATAASVAFGCVMASDI 142
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN- 312
K + GG SG +GL R+ LSLV+Q + FS+CL + D G G NS +F
Sbjct: 143 KLMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCL-APHDGGG------GKNSRLFLGA 192
Query: 313 -------------STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDS 359
+TP ++ P+ + +Y++NL GI G + + + +L+ +
Sbjct: 193 AAKLAGGGKSAAMTTPFVKSS--PDDIKSLYYLINLEGIKAGDEAIITVPQSGRTVLLQT 250
Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKMEFE 416
+ ++ L +Y LK G + P SI D CF P V + F+
Sbjct: 251 FSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSIFDLCFKRGGVS--GAPDVVLTFQ 308
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDTKN 471
G A +TV T + V D VC+A+AS + + T I+G QQ+N +YD +
Sbjct: 309 GAAALTVPPTNYLLDVGDD--TVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEK 366
Query: 472 SQLGFAGEDCSSM 484
L F DCSS+
Sbjct: 367 ETLSFEAADCSSL 379
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 158/350 (45%), Gaps = 33/350 (9%)
Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDP---VFDPSISPSYKKVLCNSSTCHALEFATGNS 203
V +DTGS L+WVQC+ C+ CY+Q +F+P S +Y KV C++ C+ +
Sbjct: 21 VTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVE 80
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGV-S 261
C C Y + YG G Y+ G LG++ L L S+++FIFGCG +N L+ GV +
Sbjct: 81 YGCVEEDDT-CIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGEDN--LYNGVNA 137
Query: 262 GLMGLGRSDLSLVSQT-SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
G++G G S +Q + FSYC P +D GSL +G + + +T
Sbjct: 138 GIIGFGTKSYSFFNQVCQQTDYTAFSYCFP--RDHENEGSLTIGP----YARDINLMWTK 191
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
+I + I L + + G +L+ + ++DSGT T + ++ AL
Sbjct: 192 LIYYDHKPAYAIQQLD-MMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAM 250
Query: 379 LKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
K+ G+ CF N + + P V+M+ T+ + F +S
Sbjct: 251 TKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKL---IRSTLKLPVENAFYESSN 307
Query: 437 SQVCLALASLSYEDETGI-----IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ +C S D+ G+ +GN ++ ++++D + GF C
Sbjct: 308 NVIC----STFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 353
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/375 (29%), Positives = 169/375 (45%), Gaps = 60/375 (16%)
Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK---SCYNQQDPVFDPS 178
+ S + ++ Y+ T+ LG R+M I DTGSDL WV+C+ S FDPS
Sbjct: 90 VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149
Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--- 235
S +Y +V C + C AL AT + G +C Y +YGDGS T G L E
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDGS-------NCAYLYAYGDGSNTTGVLSTETFTFD 202
Query: 236 --GLGKA----SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT--SEIFGGLFSY 287
G G++ + FGC G F GL+GLG +SLV+Q + G FSY
Sbjct: 203 DGGAGRSPRQVRIGGVKFGCSTATAGSF-PADGLVGLGGGAVSLVTQLGGATSLGRRFSY 261
Query: 288 CLPSTQDAGASGSLILGGNSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
CL AS +L G + V + STP+ +G K
Sbjct: 262 CL-VPHSVNASSALNFGALADVTEPGAASTPL-----------------------VGNKT 297
Query: 345 LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
+ ++ ++ I++DSGT +T L PS+ + E ++ + P +L C+N+ A +
Sbjct: 298 VASAASSR--IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNV-AGR 354
Query: 405 EV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
EV +IP + +EF G A + + FV +CLA+ + + + I+GN Q
Sbjct: 355 EVEAGESIPDLTLEFGGGAAVALKPENA--FVAVQEGTLCLAIVATTEQQPVSILGNLAQ 412
Query: 461 KNQRVIYDTKNSQLG 475
+N V YD +G
Sbjct: 413 QNIHVGYDLDAGTVG 427
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 77/153 (50%), Gaps = 9/153 (5%)
Query: 334 NLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
+L ++G K + ++ ++ I++DSGT +T L PS+ + E ++ + P +
Sbjct: 420 DLDAGTVGNKTVASAASSR--IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGL 477
Query: 394 LDTCFNLSAYQEV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
L C+N+ A +EV +IP + +EF G A + + FV +CLA+ + + +
Sbjct: 478 LQLCYNV-AGREVEAGESIPDLTLEFGGGAAVALKPENA--FVAVQEGTLCLAIVATTEQ 534
Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
I+GN Q+N V YD + FA DC+
Sbjct: 535 QPVSILGNLAQQNIHVGYDLDAGTVTFAVADCA 567
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 163/384 (42%), Gaps = 70/384 (18%)
Query: 145 NMTVIVDTGSDLTWVQCQP--CKSCYNQQDP------VFDPSISPSYKKVLCNSSTCHAL 196
++++ +DTGSDL W C P C C + P P I +++ C S C A
Sbjct: 102 SVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID--SRRISCASPLCSAA 159
Query: 197 EFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHLGLGKA-SV 242
+ S +C+++ P C + +YGDGS L R +GL + +V
Sbjct: 160 HSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLV-ANLRRGRVGLAASMAV 218
Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
+F F C G+ G GR LSL +Q + G + DA A G+
Sbjct: 219 ENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSLSG--------STDAAAIGA-- 265
Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGI 355
+ T YT ++ NP+ FY + L +S+GGK++QA GG+
Sbjct: 266 ---------SETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGM 316
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLSAYQEVNIPL 410
++DSGT T LP ++ + EF + + A + L C++ S +P
Sbjct: 317 VVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDRA-VPP 375
Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDE--------TGIIGNYQQ 460
V + F GNA + + KS+ + CL L ++ ++ G +GN+QQ
Sbjct: 376 VALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQ 435
Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
+ V+YD ++GFA C+ +
Sbjct: 436 QGFEVVYDVDAGRVGFARRRCTDL 459
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 130/464 (28%), Positives = 205/464 (44%), Gaps = 76/464 (16%)
Query: 50 SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLH----VQYLQSR 105
SS+S VS K R + +L H NE ++R+ LD H + Y+Q+R
Sbjct: 22 SSTSTVSSAKPR----RLVSKLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQAR 77
Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQP 163
I+ + N D + + P +G + + + +G ++ V++DTGSD+ W+ C P
Sbjct: 78 IEGSLVYN-NDYTASVSPSLTGRTI-----LVNLSIGQPSIPQLVVMDTGSDILWIMCNP 131
Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
C +C N +FDPS+S ++ LC + G G C P + +SY D
Sbjct: 132 CTNCDNHLGLLFDPSMSSTFSP-LCKT--------PCGFKG-CKCDPIP---FTISYVDN 178
Query: 224 SYTRGELGREHLGL-----GKASVNDFIFGCGRN---NKGLFGGVSGLMGLGRSDLSLVS 275
S G GR+ L G + ++D I GCG N N G +G++GL SL +
Sbjct: 179 SSASGTFGRDILVFETTDEGTSQISDVIIGCGHNIGFNSD--PGYNGILGLNNGPNSLAT 236
Query: 276 QTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
Q G FSYC+ + D + L LG + + STP + FY +
Sbjct: 237 Q----IGRKFSYCIGNLADPYYNYNQLRLGEGADLEGYSTPFEVYH--------GFYYVT 284
Query: 335 LTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEF--LKQFSG- 384
+ GIS+G K+L + GG+++DSGT IT L S + L E L ++S
Sbjct: 285 MEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFR 344
Query: 385 ---FPSAPGFSILDTC-FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
F +AP C + + + V P+V F A++ +D TG + + D C
Sbjct: 345 QVIFENAP----WKLCYYGIISRDLVGFPVVTFHFVDGADLALD-TGSFFSQRDDI--FC 397
Query: 441 LALASLSYEDET---GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ ++ S + T +IG Q++ V YD N + F DC
Sbjct: 398 MTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 174/393 (44%), Gaps = 61/393 (15%)
Query: 123 PLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ-QDP-----VFD 176
P +G+ + Y+ T +G V VDTGSD+TW+ C PC SC + Q P +D
Sbjct: 31 PFVTGLYYTKI-YLGTPPVG---YYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYD 86
Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
PS S + + C S C A A G++ V S +S C Y +YGDGS T+G ++ +
Sbjct: 87 PSRSSTDGALSCRDSNCGA---ALGSNEV-SCTSAGYCAYSTTYGDGSSTQGYFIQDVMT 142
Query: 237 L----------GKASVNDFIFGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSEI-- 280
G ASV FGCG G + GL+G G++ +S+ SQ + +
Sbjct: 143 FQEIHNNTQVNGTASV---YFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGK 199
Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
G F++CL G G++++G S + TPI N Y + + I++
Sbjct: 200 VGNRFAHCLQGDNQGG--GTIVIGSVSEPNISYTPIVSRN---------HYAVGMQNIAV 248
Query: 341 GGKQL------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
G+ + + + GG+++DSGT + L Y+ +F+ S F S+ FS
Sbjct: 249 NGRNVTTPASFDTTSTSAGGVIMDSGTTLAYLVDPAYT----QFVNAVSTFESSM-FSSH 303
Query: 395 DTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
C L+ + + P VK+ F+ A M + +Y Q + + G
Sbjct: 304 SQCLQLAWCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAG 363
Query: 454 -----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
I+G+ K+ V+YD N +G+ DC
Sbjct: 364 YLSYSILGDIVLKDHLVVYDNDNRVVGWKSFDC 396
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 158/350 (45%), Gaps = 33/350 (9%)
Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDP---VFDPSISPSYKKVLCNSSTCHALEFATGNS 203
V +DTGS L+WVQC+ C+ CY+Q +F+P S +Y KV C++ C+ +
Sbjct: 40 VTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVE 99
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGV-S 261
C C Y + YG G Y+ G LG++ L L S+++FIFGCG +N L+ GV +
Sbjct: 100 YGCVEEDDT-CIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGEDN--LYNGVNA 156
Query: 262 GLMGLGRSDLSLVSQT-SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
G++G G S +Q + FSYC P +D GSL +G + + +T
Sbjct: 157 GIIGFGTKSYSFFNQVCQQTDYTAFSYCFP--RDHENEGSLTIGP----YARDINLMWTK 210
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
+I + I L + + G +L+ + ++DSGT T + ++ AL
Sbjct: 211 LIYYDHKPAYAIQQLD-MMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAM 269
Query: 379 LKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
K+ G+ CF N + + P V+M+ T+ + F +S
Sbjct: 270 TKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKL---IRSTLKLPVENAFYESSN 326
Query: 437 SQVCLALASLSYEDETGI-----IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ +C S D+ G+ +GN ++ ++++D + GF C
Sbjct: 327 NVIC----STFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 158/350 (45%), Gaps = 33/350 (9%)
Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDP---VFDPSISPSYKKVLCNSSTCHALEFATGNS 203
V +DTGS L+WVQC+ C+ CY+Q +F+P S +Y KV C++ C+ +
Sbjct: 14 VTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVE 73
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGV-S 261
C C Y + YG G Y+ G LG++ L L S+++FIFGCG +N L+ GV +
Sbjct: 74 YGCVEEDDT-CIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGEDN--LYNGVNA 130
Query: 262 GLMGLGRSDLSLVSQT-SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
G++G G S +Q + FSYC P +D GSL +G + + +T
Sbjct: 131 GIIGFGTKSYSFFNQVCQQTDYTAFSYCFP--RDHENEGSLTIGP----YARDINLMWTK 184
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
+I + I L + + G +L+ + ++DSGT T + ++ AL
Sbjct: 185 LIYYDHKPAYAIQQLD-MMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAM 243
Query: 379 LKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
K+ G+ CF N + + P V+M+ T+ + F +S
Sbjct: 244 TKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKL---IRSTLKLPVENAFYESSN 300
Query: 437 SQVCLALASLSYEDETGI-----IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ +C S D+ G+ +GN ++ ++++D + GF C
Sbjct: 301 NVIC----STFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 133/273 (48%), Gaps = 31/273 (11%)
Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
G + T G L + G +V +FGC + G F G SG++G+GR +LSL+SQ
Sbjct: 124 GSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF- 182
Query: 281 FGGLFSYCL--PSTQDAGASGSLILGGNSSVFK----NSTPITYTNMIPNPQLATFYILN 334
G FSY L P D G++ S+I G+ +V K STP+ + + P+ FY +N
Sbjct: 183 --GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPD-----FYYVN 235
Query: 335 LTGISIGGKQLQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
LTG+ + G +L A GG+++ S T +T L + Y ++A + G P
Sbjct: 236 LTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLP 294
Query: 387 SAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF-VKSDASQVCLAL 443
+ G + LD C+N S+ +V +P + + F+G A+M D++ YF + +D CL +
Sbjct: 295 AVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADM--DLSAANYFYIDNDTGLECLTM 352
Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
++G Q +IYD +L F
Sbjct: 353 LP---SQGGSVLGTLLQTGTNMIYDVDAGRLTF 382
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 114/396 (28%), Positives = 176/396 (44%), Gaps = 54/396 (13%)
Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
++PL SG+ +T Y I +G + V VDTGSD+ WV C C C + +
Sbjct: 75 DLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIEL 134
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
++DP S S + V C+ C A S C+S+SP C Y +SYGDGS T G
Sbjct: 135 TMYDPRGSQSGELVTCDQQFCVANYGGVLPS--CTSTSP--CEYSISYGDGSSTAGFFVT 190
Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI 280
+ L + S + FGCG G G + G++G G+S+ S++SQ +
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAA 250
Query: 281 --FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
+F++CL D G + GN K T ++M Y + L GI
Sbjct: 251 GKVRKMFAHCL----DTVNGGGIFAIGNVVQPKVKTTPLVSDM-------PHYNVILKGI 299
Query: 339 SIGGKQLQ------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
+GG L SG +KG I IDSGT + +P +Y AL A + +
Sbjct: 300 DVGGTALGLPTNIFDSGNSKGTI-IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQ---T 355
Query: 393 ILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
+ D +CF S + P V FEG+ + V Y ++ + C+ + + +
Sbjct: 356 LQDFSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD--YLFQNGKNLYCMGFQNGGVQTK 413
Query: 452 TG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
G ++G+ N+ V+YD +N +G+A +CSS
Sbjct: 414 DGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 137/302 (45%), Gaps = 49/302 (16%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SSTCHALEFATGN 202
+ +IVDTGS +T+V C C+ C QDP F+P +S +Y+ V CN TC
Sbjct: 101 QTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCNIDCTC--------- 151
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG 258
+ C Y Y + S + G LG + + G S IFGC G L+
Sbjct: 152 -----DNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQETGDLYS 206
Query: 259 -GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFK 311
G+MGLGR DLS+V Q E + FS C G G++ILGG S VF
Sbjct: 207 QRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGG--GAMILGGISPPSGMVFA 264
Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-AKGGILIDSGTVITRLPP 368
S P+ + +Y ++L I + GKQL S F K G ++DSGT LP
Sbjct: 265 ESDPVR----------SQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTTYAYLPE 314
Query: 369 SIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEM 421
+ ++A K +K+ + P + D CF+ A +V+ P V+M F ++
Sbjct: 315 AAFTAFKDAMMKELTSLKQIHGPDPNYNDICFS-GAESDVSQLSNTFPAVEMVFSNGQKL 373
Query: 422 TV 423
++
Sbjct: 374 SL 375
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 123/417 (29%), Positives = 181/417 (43%), Gaps = 62/417 (14%)
Query: 119 NTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWV------QCQPCKSCYN 169
+ +P T+ + + Y T LG + + V++DTGS LTWV +C+ C S
Sbjct: 82 HPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSA 141
Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS--SPPDCN-----------Y 216
PVF P S S + V C + +C + A + C + SP N Y
Sbjct: 142 SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPY 201
Query: 217 FVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
V YG GS T G L + L +V F+ GC + + SGL G GR S+ +Q
Sbjct: 202 AVVYGSGS-TAGLLIADTLRAPGRAVPGFVLGCSLVS--VHQPPSGLAGFGRGAPSVPAQ 258
Query: 277 TSEIFGGL--FSYCLPSTQ---DAGASGSLILGGNSSVFK-NSTPITYTNMIPNPQLATF 330
GL FSYCL S + +A SGSL+LGG P+ + +
Sbjct: 259 L-----GLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVY 313
Query: 331 YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
Y L L G+++GGK +L A FA GG ++DSGT T L P+++ + +
Sbjct: 314 YYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVG 373
Query: 384 GF----PSAPGFSILDTCFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV---KSD 435
G A L CF L + + +P + FEG A M + V YFV +
Sbjct: 374 GRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVEN--YFVVAGRGA 431
Query: 436 ASQVCLALAS---------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+CLA+ + I+G++QQ+N V YD + +LGF + C+S
Sbjct: 432 VEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTS 488
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 108/400 (27%), Positives = 165/400 (41%), Gaps = 69/400 (17%)
Query: 146 MTVIVDTGSDLTWVQCQP--CKSCY----------------NQQDPVFDPSISPSYKKV- 186
+++ +DTGSDL W C P C C +++ P P S ++
Sbjct: 105 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRRIPCASPLCSAAHASAP 164
Query: 187 ---LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG-ELGREHLGLGK--- 239
LC + C + TG+ G S + PP + +YGDGS GR LG G
Sbjct: 165 PSDLCAVARCPLEDIETGSCGA-SHACPP---LYYAYGDGSLVAHLRRGRVALGAGARAS 220
Query: 240 --ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG- 296
+V++F F C G G+ G GR LSL Q S G FSYCL S
Sbjct: 221 VAVAVDNFTFACAHTA---LGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRAD 277
Query: 297 ---ASGSLILGGNSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG- 349
LILG + + YT ++ NP+ FY + L +S+G ++QA
Sbjct: 278 RLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPE 337
Query: 350 ------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF-----PSAPGFSILDTCF 398
GG+++DSGT T LP +Y+ + F + + A + L C+
Sbjct: 338 LARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCY 397
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--------CLAL---ASLS 447
+A + +P + + F GNA + + KS+ + CL L S
Sbjct: 398 RYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDAS 456
Query: 448 YED---ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
E+ G +GN+QQ+ V+YD ++GFA C+ +
Sbjct: 457 GEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 496
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 157/366 (42%), Gaps = 57/366 (15%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ +IVDTGS +T+V C C+ C QDP F P S +Y V CN ++ +
Sbjct: 99 QEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN------MDCNCDHD 152
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
GV +C Y Y + S + G LG + + G S +FGC G L+
Sbjct: 153 GV-------NCVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENVETGDLYSQ 205
Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGG----NSSVFKN 312
G+MGLGR LS+V Q + + FS C G G+++LGG VF
Sbjct: 206 RADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGG--GAMVLGGIPPPPDMVFSR 263
Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
S +P + +Y + L I + GK L+ S K G ++DSGT LP
Sbjct: 264 S----------DPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAYLPEE 313
Query: 370 IYSAL------KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGN 418
+ A K+ LKQ G P + D CF+ A ++V+ P V M F
Sbjct: 314 AFVAFRDAIIKKSHNLKQIHG----PDPNYNDICFS-GAGRDVSQLSKAFPEVDMVFSNG 368
Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
++++ ++ CL + D T ++G +N V YD +N ++GF
Sbjct: 369 QKLSLTPENYLFQHTKVHGAYCLGI--FRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWK 426
Query: 479 EDCSSM 484
+CS +
Sbjct: 427 TNCSEL 432
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 159/378 (42%), Gaps = 49/378 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDP-VFDPSISPSYKKVLCNSSTCHALEFAT 200
+N+T+++DTGS+L+W++C + S Q P F+ S S +Y C+S C
Sbjct: 71 QNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPECQWRGRDL 130
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-------GRNN 253
C+ C +SY D S G L + LG A +FGC N
Sbjct: 131 PVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLLGGAPPVXALFGCVTSYSSATATN 190
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
+GL+G+ R LS V+QT+ + F+YC+ G L+LGG+ +
Sbjct: 191 SSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCI---APGDGPGLLVLGGDGAALAPQ 244
Query: 314 TPITYTNMI----PNPQL-ATFYILNLTGISIGGK-------QLQASGFAKGGILIDSGT 361
+ YT +I P P Y + L GI +G L G ++DSGT
Sbjct: 245 --LNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGT 302
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFS------ILDTCFNLS----AYQEVNIPLV 411
T L Y+ LK EFL Q S + G S D CF S A +P V
Sbjct: 303 QFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASXMLPEV 362
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD------ASQV-CLALASLSYEDETG-IIGNYQQKNQ 463
+ G AE+ V ++Y V + A V CL + + +IG++ Q+N
Sbjct: 363 GLVLRG-AEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNV 421
Query: 464 RVIYDTKNSQLGFAGEDC 481
V YD +N ++GFA C
Sbjct: 422 WVEYDLQNGRVGFAPARC 439
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 115/401 (28%), Positives = 178/401 (44%), Gaps = 56/401 (13%)
Query: 117 VSNTEIPLTSGIRLQTLNYIATIELG----GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-- 170
++ +IPL G+ L T + E+G + V VDTGSD+ WV C C C +
Sbjct: 70 LAAADIPL-GGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG 128
Query: 171 ---QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
+ ++DP S + KV C+ C A G C++S P C Y V+YGDGS T
Sbjct: 129 LGLELTLYDPKDSSTGSKVSCDQGFCAAT--YGGLLPGCTTSLP--CEYSVTYGDGSSTT 184
Query: 228 GELGREHL--------GLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVS 275
G + L G + + + FGCG G G + G++G G+S+ S++S
Sbjct: 185 GYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLS 244
Query: 276 QTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
Q S +F++CL D G + GN K T T ++PN Y +
Sbjct: 245 QLSAAGKVKKIFAHCL----DTINGGGIFAIGNVVQPKVKT----TPLVPN---MPHYNV 293
Query: 334 NLTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIYSALK-AEFLKQFS-GFP 386
NL I +GG L+ K G +IDSGT +T LP +Y + A F K F
Sbjct: 294 NLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFH 353
Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL--A 444
+ F CF + + P + FE ++ ++V YF ++ + C+
Sbjct: 354 NVQEF----LCFQYVGRVDDDFPKITFHFEN--DLPLNVYPHDYFFENGDNLYCVGFQNG 407
Query: 445 SLSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
L +D G++ G+ N+ V+YD +N +G+ +CSS
Sbjct: 408 GLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNCSS 448
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 164/375 (43%), Gaps = 41/375 (10%)
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y +I +G R + VDTGSDLTW+QC PC +C P++ P+ K V
Sbjct: 193 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKE---KIVPPRD 249
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
C L+ G+ C++ C+Y + Y D S + G L ++ + + G DF+
Sbjct: 250 LLCQELQ---GDQNYCATCK--QCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLDFV 304
Query: 247 FGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGS 300
FGC + +G G++GL + +SL SQ + I +F +C+ T++ G
Sbjct: 305 FGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCI--TKEPNGGGY 362
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI--LID 358
+ LG + + +T+ + P Y ++ G +QL+ G A I + D
Sbjct: 363 MFLGDD---YVPRWGMTWAPIRGGPD--NLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC----FNLSAYQEVNIPLVKME 414
SG+ T LP IY L + F + L C F++ ++V +
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTTLPLCWKADFDVRYLEDVKQFFKPLN 477
Query: 415 FE-GNAEMTVDVTGIV----YFVKSDASQVCLALASLSYEDE--TGIIGNYQQKNQRVIY 467
GN + T + Y + SD VCL L + + D T I+G+ + + V+Y
Sbjct: 478 LHFGNRWFVIPRTFTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDVSLRGKLVVY 537
Query: 468 DTKNSQLGFAGEDCS 482
D + Q+G+A +C+
Sbjct: 538 DNERRQIGWADSECT 552
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 167/375 (44%), Gaps = 42/375 (11%)
Query: 132 TLNYIATIELGGRNMT--VIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLC 188
++ I ++ +G T +++DTGS L+W+QC+ P K+ FDP +S S+ + C
Sbjct: 75 SMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKT----PPTAFDPLLSSSFSVLPC 130
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIF 247
N S C C + C+Y Y DG+Y G L RE + + I
Sbjct: 131 NHSLCKPRVPDYTLPTSCDQNR--LCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLIL 188
Query: 248 GCGRNN---KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSL 301
GC ++ +G+ G M LGR S +++ S+ FSYC+P S + +GS
Sbjct: 189 GCATDSSDTQGILG-----MNLGRLSFSSLAKISK-----FSYCVPPRRSQSGSSPTGSF 238
Query: 302 ILGGNSSV--FKNSTPITYTNMIPNPQLATF-YILNLTGISIGGKQLQASGFA------- 351
LG N S FK +TY P L Y L + GI I GK+L S A
Sbjct: 239 YLGPNPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSG 298
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQ-EVN 407
G LIDSGT T L YS +K E +K +G G+ LD CF+ A
Sbjct: 299 AGQTLIDSGTWFTFLVDEAYSKVKEEIVK-LAGPKLKKGYVYGGSLDMCFDGDAMVIGRM 357
Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
I + EFE E+ V+ ++ V + + + L + IIGN+ Q++ V +
Sbjct: 358 IGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDL-LGVASNIIGNFHQQDLWVEF 416
Query: 468 DTKNSQLGFAGEDCS 482
D ++GF DCS
Sbjct: 417 DLVGRRVGFGRTDCS 431
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y+ ++ LG + V +DTGS +WV C+ C C+ F S S + KV C +S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58
Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
C G+ C S + PDC + VSY DGS + G L ++ L + F FGC
Sbjct: 59 C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114
Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
++ G FG V GL+G+G +S++ Q+S F FSYCLP + + +G L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
G ++ T + YT M+ + + ++L IS+ G++ L S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKGVVFDSGS 229
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
++ +P S L ++ +++ A C+++ + E ++P + + F+ A
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDAARF 288
Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
+ G+ FV+ + CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 112/401 (27%), Positives = 176/401 (43%), Gaps = 63/401 (15%)
Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
++PL +G+ T Y + LG + V VDTGSD+ WV C C +C +
Sbjct: 57 DVPLGGNGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDL 116
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
++DP+ S + V C C T + + C Y ++YGDGS T G
Sbjct: 117 TLYDPNGSKTSNAVPCGDGFCT----DTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVN 172
Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG-----GVSGLMGLGRSDLSLVSQ--T 277
+ L + S N IFGCG G + G++G G+++ S++SQ
Sbjct: 173 DSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAA 232
Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
S +FS+CL S G +G N+TP+ P++A + ++ L
Sbjct: 233 SGKVKRIFSHCLDSHHGGGI---FSIGQVMEPKFNTTPLV-------PRMAHYNVI-LKD 281
Query: 338 ISIGGKQ------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
+ + G+ L SG +G I IDSGT + LP SIY+ L + L + PG
Sbjct: 282 MDVDGEPILLPLYLFDSGSGRGTI-IDSGTTLAYLPLSIYNQLLPKVLGR------QPGL 334
Query: 392 SILD-----TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
++ TCF+ S + P+VK FEG +TV ++ K D C+
Sbjct: 335 KLMIVEDQFTCFHYSDKLDEGFPVVKFHFEG-LSLTVHPHDYLFLYKEDI--YCIGWQKS 391
Query: 447 SYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
S + + G +IG+ N+ V+YD +N +G+ +CSS
Sbjct: 392 STQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSS 432
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 165/363 (45%), Gaps = 47/363 (12%)
Query: 150 VDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
+DTGSDLTW+QC PC+SC ++DP + + V C TC ++ G CS
Sbjct: 48 MDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRA---RVVDCRRPTCAQVQ--RGGQFTCSG 102
Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF----IFGCGRNNKGLFGGV---- 260
C+Y V Y DGS T G L + + L + F + GCG + +G
Sbjct: 103 DV-RQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVIGCGYDQQGTLAKAPAVT 161
Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
G++GL S +SL SQ + I + +CL + G G L G + +T+
Sbjct: 162 DGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGG--GYLFFG---DTLVPALGMTW 216
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFAK--GGILIDSGTVITRLPPSIYSALKA 376
T MI P L Y L I GG+ L+ G GG + DSGT T L P+ Y+A+ +
Sbjct: 217 TPMIGRP-LVEGYQARLRSIKYGGEVLELEGTTDDVGGAMFDSGTSFTYLVPNAYTAVLS 275
Query: 377 EFLKQF--SGFPSAPGFSILDTCF----------NLSAYQEVNIPLVKMEFEGNAEMT-- 422
++Q SG + L C+ ++SAY + V ++F G+ +
Sbjct: 276 AVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYFKT----VTLDFGGSTWWSSG 331
Query: 423 --VDVTGIVYFVKSDASQVCLAL--ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
++++ Y + S VCL + AS++ + T I+G+ + V+YD Q+G+
Sbjct: 332 KLLELSPEGYLIVSTQGNVCLGVLDASVASLEVTNILGDISMRGYLVVYDNMREQIGWVR 391
Query: 479 EDC 481
+C
Sbjct: 392 RNC 394
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 164/374 (43%), Gaps = 49/374 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYN------QQDPVFDPSISPSYKKVLCNSSTCH--- 194
+ ++ ++DTGS + W C +C N ++ P+F+P +S S K + C C
Sbjct: 98 QKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTS 157
Query: 195 ------ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
GNS CS + P Y + YG G+ G E+L +++ F+ G
Sbjct: 158 SPBVHLGXPRCNGNSKKCSHACP---QYTLQYGTGA-ASGFFLLENLDFPGKTIHKFLVG 213
Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLILGGN 306
C + L G GR+ SL Q F+YCL S D SG LIL +
Sbjct: 214 C-TTSADREPSSDALAGFGRTMFSLPMQMGV---KKFAYCLNSHDYDDTRNSGKLILDYS 269
Query: 307 SSVFKNSTPITYTNMIPNP-QLATFYILNLTGISIGGKQLQASGF-------AKGGILID 358
+ ++Y NP +Y L + + IG K L+ G ++GG++ID
Sbjct: 270 DG---ETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVID 326
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQEVNIPLVKMEF 415
SG + + ++ + E KQ S + + + C+N + ++ + IP + +F
Sbjct: 327 SGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQF 386
Query: 416 EGNAEMTVDVTGIVYFVK-SDASQVCLALASLS----YEDETG---IIGNYQQKNQRVIY 467
G A M V G+ YF+ S+AS C + + S E G I+GNYQQ + V +
Sbjct: 387 TGGANMVVP--GMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEF 444
Query: 468 DTKNSQLGFAGEDC 481
D KN +LGF + C
Sbjct: 445 DLKNERLGFRQQTC 458
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 120/408 (29%), Positives = 184/408 (45%), Gaps = 44/408 (10%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
Q+R ++ G + V + + TS L L Y ++LG R V +DTGSD+ WV
Sbjct: 55 QARHGRLLRGVVGGVVDFTVYGTSDPYLVGL-YFTKVKLGSPPREFNVQIDTGSDILWVT 113
Query: 161 CQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 215
C C C + FDPS S + V C+ C +L T CS S C+
Sbjct: 114 CNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAE--CSPQS-NQCS 170
Query: 216 YFVSYGDGSYTRGELGREHL--------GLGKASVNDFIFGCGRNNKG----LFGGVSGL 263
Y YGDGS T G + L L S +FGC G + + G+
Sbjct: 171 YSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGI 230
Query: 264 MGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
G G+ DLS+VSQ S I +FS+CL D G G L+LG + + + I Y+ +
Sbjct: 231 FGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGG--GKLVLG---EILEPN--IIYSPL 283
Query: 322 IPNPQLATFYILNLTGISIGGKQL--QASGFAKG---GILIDSGTVITRLPPSIYSALKA 376
+P+ + Y LNL IS+ G+ L + FA G ++DSGT +T L + Y +
Sbjct: 284 VPS---QSHYNLNLQSISVNGQLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVS 340
Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVYFVKSD 435
S + P S + C+ +S + P V + F G A M + +++ SD
Sbjct: 341 AITATVSS-STTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSD 399
Query: 436 -ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
A+ C+ ++ E I+G+ K++ +YD + ++G+A DCS
Sbjct: 400 GAAMWCIGFQKVA-EPGITILGDLVLKDKIFVYDLAHQRIGWANYDCS 446
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 111/390 (28%), Positives = 165/390 (42%), Gaps = 72/390 (18%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
+ + + +DTGSDL W QC C C+ Q P FD S + V C+ C + ++
Sbjct: 112 QRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSDPICTSGKYPLSGC 170
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGEL---------------GREHLGLGKASVNDFIFG 248
++ C Y Y D S T G + + H G+ +V + FG
Sbjct: 171 TFNDNT----CFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGV---AVPNVRFG 223
Query: 249 CGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA--------GASG 299
CG+ NKG+F SG+ G R +SL SQ FS+C + DA GA G
Sbjct: 224 CGQYNKGIFKSNESGIAGFSRGPMSLPSQLKV---ARFSHCFTAIADARTSPVFLGGAPG 280
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------- 351
LG +++ STP +N + Y L L GI++G +L + A
Sbjct: 281 PDNLGAHATGPVQSTPFANSN-------GSLYYLTLKGITVGKTRLPLNALAFAGKGTGS 333
Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT----CFNLS----- 401
GG +IDSGT I LP +Y +L+A F+ + P A S D CF +
Sbjct: 334 GSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVK-LPVA-NESAADAESTLCFEAARSASL 391
Query: 402 -------AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
A +V + + +++ E V ++ S +CL + S D T I
Sbjct: 392 PPEAPAPALPKVVLHVAGADWDLPRESYV--LDLLEDEDGSGSGLCLVMNSAGDSDLT-I 448
Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IGN+QQ+N V YD + ++L F C M
Sbjct: 449 IGNFQQQNMHVAYDLEKNKLVFVPARCDKM 478
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 162/376 (43%), Gaps = 52/376 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYN--------QQDPVFDPSISPSYKKVLCNSSTCHA 195
+ ++ +VDTGSD+ W C +C N ++ P+FDP +S S K + C + C +
Sbjct: 89 QKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVS 148
Query: 196 LEFA---------TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
F GNS CS + C Y YG G+ + G E+L + ++ +F+
Sbjct: 149 TYFPYVHLGCPRCNGNSKHCSYA----CPYSTQYGTGA-SSGYFLLENLKFPRKTIRNFL 203
Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST--QDAGASGSLILG 304
GC + L G GRS SL Q F+YCL S D SG LIL
Sbjct: 204 LGCTTSAARELSS-DALAGFGRSMFSLPIQMGV---KKFAYCLNSHDYDDTRNSGKLILD 259
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYI-LNLTGISIGGKQLQ------ASGF-AKGGIL 356
K ++YT + +P + FY L + I IG K L+ A G + G++
Sbjct: 260 YRDGKTKG---LSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVI 316
Query: 357 IDSGT-VITRLPPSIYSALKAEFLKQFSGFP---SAPGFSILDTCFNLSAYQEVNIPLVK 412
IDSG + ++ + E KQ S + A + L C+N + ++ + IP +
Sbjct: 317 IDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLI 376
Query: 413 MEFEGNAEMTVDVTGIVYF-VKSDASQVCLAL------ASLSYEDETGIIGNYQQKNQRV 465
+F G A M V G YF + S C + A D + I+GN Q + V
Sbjct: 377 YQFRGGANMV--VPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYV 434
Query: 466 IYDTKNSQLGFAGEDC 481
YD KN + GF + C
Sbjct: 435 EYDLKNDRFGFRRQTC 450
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 109/401 (27%), Positives = 180/401 (44%), Gaps = 54/401 (13%)
Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQP 163
I++++SGNI S+ + P+ S + Y+ +G + I D+GS L W+QC
Sbjct: 75 IRSIMSGNI--TSSMKYPI-SRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGT 131
Query: 164 --CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
C++CY Q+ P+F+PS S +Y K LCN++ C A G+ C Y Y
Sbjct: 132 PYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRV---ALGDEYWRCKKPNQICKYHEDYL 188
Query: 222 DGSYTRGELGR------EHL-GLGKASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSL 273
D SYT G + EH+ G G ++ IFGCG NN GL+GL + SL
Sbjct: 189 DDSYTEGVISTDIFTFPEHISGFGNYTLR-IIFGCGYNNSDPQHFYPPGLVGLTNNKASL 247
Query: 274 VSQTSEIFGGLFSYC--LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
V Q FSYC + + Q+ S + G +S+ +S T ++PN +Y
Sbjct: 248 VGQMDV---DQFSYCVSIDTEQNLKGSMEIRFGLAASISGHS-----TQLVPNSD--GWY 297
Query: 332 IL-NLTGISIGGKQLQASGF----------AKGGILIDSGTVITRLPPSIYSALKAEFLK 380
I N+ GI + + + G+ +GG+ +D+GT T L S+ L +
Sbjct: 298 IFKNVDGIYV--NEFEVEGYPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEE 355
Query: 381 QFSGFP----SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
+ P S GF + C+ + +P +++ F N + + +
Sbjct: 356 HITIVPEKDYSNSGFEL---CYFSDDFLGATLPDIELRFTDNKDTYFSFNTRNAWTPNGR 412
Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
SQ+CLA+ + IIG +Q ++ ++ YD ++ + F
Sbjct: 413 SQMCLAMFR---TNGMSIIGMHQLRDIKIGYDLHHNIVSFT 450
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 117/424 (27%), Positives = 180/424 (42%), Gaps = 64/424 (15%)
Query: 98 HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLT 157
H+++ ++ IS + IPL+ G Q L+++ VDTGS +
Sbjct: 65 HLKHGKTSPLTQISLSPHSYGGHSIPLSFGTPPQKLSFL-------------VDTGSHVV 111
Query: 158 WVQCQPCKSCYN--------QQDPVFDPSISPSYKKVLCNSSTC--------H-ALEFAT 200
W C +C N ++ P+F+P +S S K + C + C H
Sbjct: 112 WAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCN 171
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGV 260
GNS CS + PP Y + YG G+ + G+ E+L ++++F+ GC + G
Sbjct: 172 GNSKNCSHACPP---YSLQYGTGA-SSGDFLLENLNFPGKTIHEFLVGCTTSAVGEVTSA 227
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST--QDAGASGSLILGGNSSVFKNSTPITY 318
+ L G GRS SL Q F+YCL S D S LIL + K ++Y
Sbjct: 228 A-LAGFGRSMFSLPMQMGV---KKFAYCLNSHDYDDTRNSSKLILDYSDGETKG---LSY 280
Query: 319 TNMIPN-PQLATFYILNLTGISIGGKQLQ------ASGF-AKGGILIDSGTVITRLPPSI 370
+ N P +Y L + I IG K L+ A G +GG++IDSG + +
Sbjct: 281 APFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPV 340
Query: 371 YSALKAEFLKQFSGFP---SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
+ + E K+ S + A + C+N + + + IP + +F G A M V G
Sbjct: 341 FKKVTNELKKRMSKYRRSLEAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVP--G 398
Query: 428 IVYFVK-SDASQVCLALA------SLSYEDETGII-GNYQQKNQRVIYDTKNSQLGFAGE 479
YFV + S C L +L + II GN Q + V +D KN +LGF +
Sbjct: 399 KNYFVLIPEISLACFPLTTDAGTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQ 458
Query: 480 DCSS 483
C S
Sbjct: 459 TCQS 462
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 189/424 (44%), Gaps = 73/424 (17%)
Query: 119 NTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWV------QCQPCKSCYN 169
+ IP T+ + + Y T LG + + V++DTGS LTWV C+ C S +
Sbjct: 86 HKSIPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFA 145
Query: 170 QQDPVFDPSISPSYKKVLCNSSTC---HALEFATGNSGVCSSSS---------PPDCNYF 217
PVF P S S + V C + +C H+ E CS + PP Y
Sbjct: 146 AAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPP---YA 202
Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
V YG GS T G L + L +V+ F+ GC + + SGL G GR S+ +Q
Sbjct: 203 VVYGSGS-TAGLLIADTLRAPGRAVSGFVLGCSLVS--VHQPPSGLAGFGRGAPSVPAQL 259
Query: 278 SEIFGGL--FSYCLPSTQ---DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
GL FSYCL S + +A SGSL+LGG++ + P+ + A +Y
Sbjct: 260 -----GLSKFSYCLLSRRFDDNAAVSGSLVLGGDNDGMQY-VPLVKSAAGDKQPYAVYYY 313
Query: 333 LNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG- 384
L L+G+++GGK ++ A+ GG ++DSGT T L P+++ + + G
Sbjct: 314 LALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGR 373
Query: 385 FPSAPGFSI---LDTCFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV- 439
+ + L CF L + + +P + + F+G A M + + YFV + + V
Sbjct: 374 YKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQLPLEN--YFVVAGRAPVP 431
Query: 440 ------------CLALAS--------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
CLA+ + I+G++QQ+N V YD + +LGF +
Sbjct: 432 GAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQ 491
Query: 480 DCSS 483
C+S
Sbjct: 492 PCAS 495
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 98/412 (23%), Positives = 173/412 (41%), Gaps = 58/412 (14%)
Query: 121 EIPLTSGIRLQTLN-YIATIELGGRNM--TVIVDTGSDLTWVQCQPCK---SCYNQQ--- 171
E+P+ S + + + Y+ ++ +G + +++DT +DLTW+ C+ + Y +Q
Sbjct: 109 ELPMRSALNIAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMG 168
Query: 172 ------------------DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD 213
+ P+ S S++++ C+ C L + T C S S +
Sbjct: 169 QTMSVGGEGATAAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNT-----CQSPSKAE 223
Query: 214 -CNYFVSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCG-RNNKGLFGGVSGLMGL 266
C+YF DG+ T G G+E + G+ A + I GC G G++ L
Sbjct: 224 SCSYFQKTQDGTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSL 283
Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNP 325
G D+S ++ FG FS+CL S + AS L G N +V T T+++ N
Sbjct: 284 GNGDMSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGT--METDILYNV 341
Query: 326 QLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
+ Y +TG+ +GG++L A F GG+++D+ T +T L P Y+ + A
Sbjct: 342 DVKPAYGAKVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAAL 401
Query: 379 LKQFSGFPSAPGFSILDTCFN-------LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
+ S P + C+ + V IP +E G A + + +V
Sbjct: 402 DRHLSHLPRVYELEGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVV-M 460
Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + CLA L GI+GN + D + ++ F + C++
Sbjct: 461 PEVEPGVACLAFRKL-LRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKCNT 511
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 168/371 (45%), Gaps = 38/371 (10%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y I LG + + VIVDTGSD+ WV+C PC+SC ++QD + P + +S +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQD-IIPPLSIYNLSASSTSSVS 141
Query: 193 CHALEFATGNSGVCSSS-SPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
+ TG VCS S S C Y +SY D S + G ++ + G A+ + F
Sbjct: 142 SCSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATTSHIFF 201
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG 305
GC N G + G+MG G+ ++ +Q T +FS+CL + G G L G
Sbjct: 202 GCAINITGSW-PADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGG--GILEFGE 258
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ---------ASGFAKGGIL 356
N+T + +T ++ + T Y ++L IS+ K L ++ + G++
Sbjct: 259 EP----NTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVI 311
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKME 414
IDSGT L L +E +K + P L CF L + E + P V +
Sbjct: 312 IDSGTSFALLATKANRILFSE-IKNLTTAKLGPKLEGLQ-CFYLKSGLTVETSFPNVTLT 369
Query: 415 FEGNAEMTVDVTGIVYFV--KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G + M + + V K + C A +S D I G K++ V YD +N
Sbjct: 370 FSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSS---ADGLTIFGEIVLKDKLVFYDVENR 426
Query: 473 QLGFAGEDCSS 483
++G+ G++CSS
Sbjct: 427 RIGWKGQNCSS 437
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 123/438 (28%), Positives = 188/438 (42%), Gaps = 77/438 (17%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
Q RIK K +S+ ++ + +R Y+ T+ +G + + V +DTGSDLTWV
Sbjct: 59 QERIK-------KPLSSVDV-VMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVP 110
Query: 161 CQ----PCKSCYNQQD------PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC---- 206
C C CY+ ++ VF P S + + C SS C + + C
Sbjct: 111 CGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAG 170
Query: 207 --------SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
S+ P ++ +YG+G G L R+ L V F FGC + +
Sbjct: 171 CSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTST---YR 227
Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPS--TQDAGASGSLILGGNSSVFKNSTP 315
G+ G GR LSL SQ + G FS+C LP + S LILG ++ +
Sbjct: 228 EPIGIAGFGRGLLSLPSQLGFLEKG-FSHCFLPFKFVNNPNISSPLILGASALSINLTDS 286
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGG-----------KQLQASGFAKGGILIDSGTVIT 364
+ +T M+ P Y + L I+IG +Q + G GG+L+DSGT T
Sbjct: 287 LQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQG--NGGMLVDSGTTYT 344
Query: 365 RLPPSIYSALKAEFLKQFSGFPSAP------GFSILDTCF-------NLSAYQE---VNI 408
LP YS L L+ +P A GF D C+ NL++ + +
Sbjct: 345 HLPEPFYSQLLTT-LQSTITYPRATETESRTGF---DLCYKVPCPNNNLTSLENDVMMIF 400
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFV--KSDASQV-CLALASLSYED--ETGIIGNYQQKNQ 463
P + F NA + + Y + SD S V CL ++ D G+ G++QQ+N
Sbjct: 401 PSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNV 460
Query: 464 RVIYDTKNSQLGFAGEDC 481
+V+YD + ++GF DC
Sbjct: 461 KVVYDLEKERIGFQAMDC 478
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 159/378 (42%), Gaps = 49/378 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDP-VFDPSISPSYKKVLCNSSTCHALEFAT 200
+N+T+++DTGS+L+W++C + S Q P F+ S S +Y C+S C
Sbjct: 73 QNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPECQWRGRDL 132
Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-------GRNN 253
C+ C +SY D S G L + LG A +FGC N
Sbjct: 133 PVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRALFGCVTSYSSATATN 192
Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
+GL+G+ R LS V+QT+ + F+YC+ G L+LGG+ +
Sbjct: 193 SSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCI---APGDGPGLLVLGGDGAALAPQ 246
Query: 314 TPITYTNMI----PNPQL-ATFYILNLTGISIGGK-------QLQASGFAKGGILIDSGT 361
+ YT +I P P Y + L GI +G L G ++DSGT
Sbjct: 247 --LNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGT 304
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFS------ILDTCFNLS----AYQEVNIPLV 411
T L Y+ LK EFL Q S + G S D CF S A +P V
Sbjct: 305 QFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASQMLPEV 364
Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD------ASQV-CLALASLSYEDETG-IIGNYQQKNQ 463
+ G AE+ V ++Y V + A V CL + + +IG++ Q+N
Sbjct: 365 GLVLRG-AEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNV 423
Query: 464 RVIYDTKNSQLGFAGEDC 481
V YD +N ++GFA C
Sbjct: 424 WVEYDLQNGRVGFAPARC 441
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 111/414 (26%), Positives = 178/414 (42%), Gaps = 64/414 (15%)
Query: 100 QYLQSRIKNMISGNIK-DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDL 156
++ Q R++ ++ + +S + T+G+ Y I LG + V VDTGSD+
Sbjct: 18 EHDQRRLRRILPEVVAFPISGDDDTFTTGL------YYTRIYLGTPPQQFYVHVDTGSDV 71
Query: 157 TWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
WV C PC +C + +FDP S S + C C+ ++ CS +S
Sbjct: 72 AWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYL-----ASNSKCSFNS- 125
Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVND---------FIFGCGRNNKGLFGGVSG 262
C Y YGDGS T G L + L + + FGCG N G + G
Sbjct: 126 MSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW-LTDG 184
Query: 263 LMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
L+G G++++SL SQ S+ + +F++CL D SG+L++G + YT
Sbjct: 185 LVGFGQAEVSLPSQLSKQNVSVNIFAHCLQG--DNKGSGTLVIG-----HIREPGLVYTP 237
Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYSALKAE 377
++P +LN+ G+S G + F GG+++DSGT +T L Y +A+
Sbjct: 238 IVPKQSHYNVELLNI-GVS-GTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAK 295
Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY--FVKSD 435
+L F E P V + F G A M + + +Y + +
Sbjct: 296 VRDCMRS-------GVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTG 348
Query: 436 ASQVCLALAS-------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S C + LSY I G+ K+Q V+YD N+++G+ DC+
Sbjct: 349 LSAYCFSWLESTSVYGYLSYT----IFGDNVLKDQLVVYDNVNNRIGWKNFDCT 398
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 100/419 (23%), Positives = 169/419 (40%), Gaps = 58/419 (13%)
Query: 99 VQYLQSRIKNMISGNIKDVSNTEIPLTSGIR--LQTLNYIATIELGGRNMTVIVDTGSDL 156
+Q + ++K I I SN PL + + + E G +N + +D G L
Sbjct: 62 LQRAKEQVKCRIKHQILPTSNEMRPLMCPLEDAVYAVVVGVGTEAGFQNYQLALDMGGGL 121
Query: 157 TWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNY 216
+W+QC PC+ C Q PVFDP+ SP++ + +++ + +G C +
Sbjct: 122 SWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGACG--------F 173
Query: 217 FVSYGDGSYTRGELGREHLGLGKASVNDF------IFGCGRNNKGLFG--GVSGLMGL-- 266
++Y D ++ G L R+ A +DF +FGC + V+G++GL
Sbjct: 174 DIAYRDNTHASGYLARDTFSF-PAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGILGLGM 232
Query: 267 ---GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN------SSVFKNSTPIT 317
G+ + Q GG FSYC P L G + +V + STP+
Sbjct: 233 GPAGKPPTAFTKQVLPAHGGRFSYC-PFVPGMSMYSYLRFGSDIPSHPPPNVHRQSTPV- 290
Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQL--------QASGFAKGGILIDSGTVITRLPPS 369
+ + Y + L G+S+G +L + + GG ++D GT +T S
Sbjct: 291 ----LAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHS 346
Query: 370 IY----SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
Y A++ ++ + G +TC A +P + + FE A + V
Sbjct: 347 AYVHIDHAVRQHLQRRGAHIVVVRG----NTCVQQPAPHHDVLPSMTLHFENGAWLRVMP 402
Query: 426 TGI-VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS--QLGFAGEDC 481
+ + FV C S + +IG QQ N R I+D ++ + F EDC
Sbjct: 403 EHVFMPFVVGGHHYQCFGFVS---STDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 170/403 (42%), Gaps = 60/403 (14%)
Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
++ ++PL G+ T Y I+LG + V VDTGSD+ WV C C+ C +
Sbjct: 65 LAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGL 124
Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
+DP S S V C+ C A G C+++ P C Y V YGDGS T G
Sbjct: 125 GLDLTFYDPKASSSGSTVSCDQGFCAATY--GGKLPGCTANVP--CEYSVMYGDGSSTTG 180
Query: 229 ELGREHLGL-----------GKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSL 273
+ L G A+V FGCG G G + G++G G+++ S+
Sbjct: 181 FFVTDALQFDQVTGDGQTQPGNATVT---FGCGAQQGGDLGSSNQALDGILGFGQANTSM 237
Query: 274 VSQTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
+SQ + +F++CL D G + GN K T +M Y
Sbjct: 238 LSQLAAAGKVKKIFAHCL----DTIKGGGIFAIGNVVQPKVKTTPLVADM-------PHY 286
Query: 332 ILNLTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSG-- 384
+NL I +GG LQ A F G G +IDSGT +T LP ++ + A +
Sbjct: 287 NVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIV 346
Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
F + F CF + P + FE ++ + V YF + C+
Sbjct: 347 FHNVQDF----MCFQYPGSVDDGFPTITFHFE--DDLALHVYPHEYFFPNGNDMYCVGFQ 400
Query: 445 SLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ + + + G ++G+ N+ VIYD +N +G+ +CSS
Sbjct: 401 NGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNCSS 443
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 141/298 (47%), Gaps = 31/298 (10%)
Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-------FGCGRNNKGLFGG 259
S P C Y +YGDG+ T G E + FGCG N G
Sbjct: 15 SCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNN 74
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITY 318
SG++G GR+ LSLVSQ S FSYCL S S L + V+ ++T +
Sbjct: 75 GSGIVGFGRNPLSLVSQLSI---RRFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQT 131
Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIY 371
T ++ +PQ TFY ++ TG+++G ++L+ S FA GG+++DSGT +T LP ++
Sbjct: 132 TPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVL 191
Query: 372 SALKAEFLKQFSGFPSAPGFSILD-TCFNL-------SAYQEVNIPLVKMEFEGNAEMTV 423
+ + F +Q P A G + D CF + S+ ++ +P + + F+G A++ +
Sbjct: 192 AEVVRAFRQQLR-LPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQG-ADLDL 249
Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
V ++CL LA D+ IGN Q++ RV+YD + L A C
Sbjct: 250 PRRNYV-LDDHRRGRLCLLLADSG--DDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 304
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 163/374 (43%), Gaps = 51/374 (13%)
Query: 142 GGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
G R + +D ++L W+QC+P + + Q P F+P+ SPS++++ N++ C L G
Sbjct: 95 GRRTYVLALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFRRLPGNNAFC--LPAPRG 152
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSY-TRGELGREHLGLG-----KASVNDFIFGCGRNNKG 255
+ C + DGS RG L E L + V + GC N+KG
Sbjct: 153 HRRTVQDP----CKFHSIRLDGSADARGVLSNETLAFAASGQQQTEVTGVVIGCTHNSKG 208
Query: 256 L----FGGVSGLMGLGRSDLSLVSQTSEIFGGL-----FSYCLPSTQDAGASGSLILGGN 306
G ++G++GLGR SL+ + G FSYCLPS + + L +
Sbjct: 209 FNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPSHGSSSSDHHTFLRFD 268
Query: 307 SSVFKN----STPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-------- 352
V ST I Y + + +++ +LTGIS+ GK LQ F +
Sbjct: 269 DDVPNTQHMVSTKIMYMDSTTSRDFRAYFV-SLTGISVAGKPLQDVKELFKRHVHGQVWT 327
Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL----DTCFNLSAYQEVNI 408
G D+GT + Y+ LK ++ G I+ CF ++ ++
Sbjct: 328 SGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPL----GLQIVSGQYHLCFRATSQLWQHL 383
Query: 409 PLVKMEF-EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
P V ++F E A + + + V D +CLA+ SY + IIG QQ ++R +Y
Sbjct: 384 PTVMLQFAETEARLVLPPQRLFVAVGYD---ICLAVVR-SY--DITIIGAMQQVDKRFVY 437
Query: 468 DTKNSQLGFAGEDC 481
D ++ ++ F E+
Sbjct: 438 DVRHGRIYFVPENA 451
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 164/370 (44%), Gaps = 51/370 (13%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEF 198
+ V VDTGSD+ WV C C C + + ++DP S + KV C+ C A
Sbjct: 15 KRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAAT-- 72
Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--------GLGKASVNDFIFGCG 250
G C++S P C Y V+YGDGS T G + L G + + + FGCG
Sbjct: 73 YGGLLPGCTTSLP--CEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCG 130
Query: 251 RNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLILG 304
G G + G++G G+S+ S++SQ S +F++CL D G +
Sbjct: 131 SQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL----DTINGGGIFAI 186
Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-----AKGGILIDS 359
GN K T T ++PN Y +NL I +GG L+ K G +IDS
Sbjct: 187 GNVVQPKVKT----TPLVPN---MPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 239
Query: 360 GTVITRLPPSIYSALK-AEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
GT +T LP +Y + A F K F + F CF + + P + FE
Sbjct: 240 GTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF----LCFQYVGRVDDDFPKITFHFEN 295
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLAL--ASLSYEDETGII--GNYQQKNQRVIYDTKNSQ 473
++ ++V YF ++ + C+ L +D G++ G+ N+ V+YD +N
Sbjct: 296 --DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQV 353
Query: 474 LGFAGEDCSS 483
+G+ +CSS
Sbjct: 354 IGWTEYNCSS 363
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 104/386 (26%), Positives = 167/386 (43%), Gaps = 52/386 (13%)
Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWV-----QCQPCK-SCYNQQD---PVF 175
G L L+Y I++G N++ +V D GSDL+WV QC P S Y D +
Sbjct: 95 GNDLDWLHY-TWIDIGTPNVSFLVALDAGSDLSWVPCDCIQCAPLSASLYKPLDRDLSEY 153
Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
PS+S + + + CN C E + C + P C Y Y D + + E +
Sbjct: 154 RPSLSTTSRHLSCNHQLC---ELGSH----CKNLKDP-CPYIADYADPNTSSSGFLVEDI 205
Query: 236 GLGKASVND------------FIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE- 279
L ASV+D I GCGR G + G+MGLG +S+ S ++
Sbjct: 206 -LHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKA 264
Query: 280 -IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
+ FS C D SG+++ G + STP ++P Y++ +
Sbjct: 265 GLIRKSFSLCF----DVNGSGTILFGDQGHTSQKSTP-----LLPTQGNYDAYLIEVESY 315
Query: 339 SIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF 398
+G L+ SGF L+DSG T LP +Y+ + EF KQ + + + C+
Sbjct: 316 CVGNSCLKQSGFKA---LVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGGPWNYCY 372
Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
N S+ Q N+P +++ F N + + + Y+V + L + GIIG
Sbjct: 373 NTSSKQLDNVPAMRLSFLMNQSLLIHNS--TYYVPQNQEFAVFCLTLQPTDLNYGIIGQN 430
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
RV++D +N +LG++ +C +
Sbjct: 431 YMTGYRVVFDMENLKLGWSSSNCKDI 456
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 140/345 (40%), Gaps = 97/345 (28%)
Query: 145 NMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATG 201
+ TVI+D+GSD+ WVQCQPC C+ Q+DP+FDP+ S +Y V C+S+ C L + G
Sbjct: 80 SQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRG 139
Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGV 260
C ++S C + ++Y +G+ G + L LG V F+FGC ++G
Sbjct: 140 ----CLANS--QCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQG----- 188
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
FSY +G+L LGG S F T Y+
Sbjct: 189 ----------------------STFSY--------DVAGTLALGGGSQSFVQQTASQYSR 218
Query: 321 M----IPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKA 376
+ +P P ++F G ++ +PP +A
Sbjct: 219 VFSYCVP-PSTSSF-----------------------------GFIMFGVPPQ-----RA 243
Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
+ F P +L + + + +P + + F+G A + +D GI+
Sbjct: 244 ALVPTFVSTP------LLSSSTMSPTFYSITLPSIALVFDGGATVNLDAAGILL------ 291
Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
Q CLA A + + G IGN QQ+ V+YD + F C
Sbjct: 292 -QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 335
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 167/368 (45%), Gaps = 59/368 (16%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDP--VFDPSISPSYKKVLCNSSTCHAL--EFATGNS 203
+I+DTGS L+W+QC K + P VFDPS+S S+ + CN C +F S
Sbjct: 97 MILDTGSQLSWIQCH--KKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTS 154
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNN---KGLFGG 259
C + C+Y Y DG+ G L RE + ++ S I GC + KG+ G
Sbjct: 155 --CDQNR--LCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESSDAKGILG- 209
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ---DAGASGSLILG--GNSSVFKNST 314
M LGR LS SQ FSYC+P+ Q +GS LG NS F+
Sbjct: 210 ----MNLGR--LSFASQAKLT---KFSYCVPTRQVRPGFTPTGSFYLGENPNSGGFRYIN 260
Query: 315 PITYTNMIPNPQLATF-YILNLTGISIGGKQLQA--SGF-----AKGGILIDSGTVITRL 366
+T++ P L Y + + GI IG ++L S F G +IDSG+ T L
Sbjct: 261 LLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYL 320
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKM--EFEGNAEM 421
Y+ ++ E ++ G G+ + D CFN +A E+ + M EF+ E+
Sbjct: 321 VDEAYNKVREEVVR-LVGARLKKGYVYGGVSDMCFNGNAI-EIGRLIGNMVFEFDKGVEI 378
Query: 422 TV-------DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
V DV G V+ V S++ L AS IIGN+ Q+N V +D N ++
Sbjct: 379 VVEKERVLADVGGGVHCVGIGRSEM-LGAAS-------NIIGNFHQQNIWVEFDLANRRV 430
Query: 475 GFAGEDCS 482
GF DCS
Sbjct: 431 GFGKADCS 438
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 107/427 (25%), Positives = 184/427 (43%), Gaps = 53/427 (12%)
Query: 88 QQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
QQ+ L+L L+ R +I+ + + N +PL ++ Y AT+ LG R
Sbjct: 24 QQDSLVLP------LRRRDGGIIARGL--LRNATLPLHGAVKDYGYFY-ATLHLGTPARQ 74
Query: 146 MTVIVDTGSDLTWVQCQPC-KSC-YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
VIVDTGS +T+V C C ++C + +D FDP+ S S + C+S C
Sbjct: 75 FAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGCDSDKC------ICGR 128
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG--VS 261
C S +C Y +Y + S + G L + L L +V + +FGC G
Sbjct: 129 PPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLRDGAV-EVVFGCETKETGEIYNQEAD 187
Query: 262 GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
G++GLG S++SLV+Q S + +F+ C S + GA ++ G+ + + YT
Sbjct: 188 GILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGA----LMLGDVDAAEYDVALQYT 243
Query: 320 NMIPNPQLATFYILNLTGISIGGKQL--QASGFAKG-GILIDSGTVITRLPPSIYSALKA 376
++ + +Y + L + +GG+QL + + +G G ++DSGT T LP + K
Sbjct: 244 ALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGTVLDSGTTFTYLPSEAFQLFK- 302
Query: 377 EFLKQFS---GFPSAPG--------FSILDTCFNLSAYQ--------EVNIPLVKMEFEG 417
E + ++ G S G D CF + + E P+ +++F
Sbjct: 303 EAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHADQSKLEKVFPVFELQFAD 362
Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
+ ++ + CL + ++G +N V YD +N ++GF
Sbjct: 363 GVRLRTGPLNYLFMHTGEMGAYCLGV--FDNGASGTLLGGISFRNILVQYDRRNRRVGFG 420
Query: 478 GEDCSSM 484
C +
Sbjct: 421 AASCQEI 427
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 162/376 (43%), Gaps = 43/376 (11%)
Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNS 190
Y +I +G R + VDTGSDLTW+QC PC + P++ P+ K V
Sbjct: 186 QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKE---KIVPPRD 242
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
C L+ GN C + C+Y + Y D S + G L R+ + + G DF+
Sbjct: 243 LLCQELQ---GNQNYCETCK--QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFV 297
Query: 247 FGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGS 300
FGC + +G G++GL + +S SQ + I +F +C+ T++ G G
Sbjct: 298 FGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCI--TREQGGGGY 355
Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--ILID 358
+ LG + + +T+T++ P Y + G +QL+ A ++ D
Sbjct: 356 MFLGDD---YVPRWGVTWTSIRSGPD--NLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE-- 416
SG+ T LP IY L A GF L C+ + + + VK FE
Sbjct: 411 SGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWK-ADFPVRYLEDVKQFFEPL 469
Query: 417 ----GNAEMTVDVTGIV----YFVKSDASQVCLALASLSY--EDETGIIGNYQQKNQRVI 466
G + + T + Y + SD VCL L + + T I+G+ + + V+
Sbjct: 470 NLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVV 529
Query: 467 YDTKNSQLGFAGEDCS 482
YD + Q+G+A DC+
Sbjct: 530 YDNQRKQIGWADSDCT 545
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 168/381 (44%), Gaps = 52/381 (13%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
Y ++LG + V +DTGSD+ WV C PC C N Q F+P S + ++
Sbjct: 89 YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRG-----------ELGREH 234
C+ C A TG + VC SS P C Y +YGDGS T G +G E
Sbjct: 149 CSDDRCTA-ALQTGEA-VCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQ 206
Query: 235 LGLGKASVNDFIFGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYC 288
ASV +FGC + G V G+ G G+ LS+VSQ + FS+C
Sbjct: 207 TANSSASV---VFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHC 263
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--Q 346
L + + G G L+LG + + + +T ++P+ Y LNL I++ G++L
Sbjct: 264 LKGSDNGG--GILVLG---EIVEPG--LVFTPLVPS---QPHYNLNLESIAVSGQKLPID 313
Query: 347 ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
+S FA G ++DSGT + L Y S + + CF ++
Sbjct: 314 SSLFATSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSS 372
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--IIGNYQQK 461
+ + P + F+G MTV + S + V L + ++ G I+G+ K
Sbjct: 373 VDSSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNV---LWCIGWQRSQGITILGDLVLK 429
Query: 462 NQRVIYDTKNSQLGFAGEDCS 482
++ +YD N ++G+A DCS
Sbjct: 430 DKIFVYDLANMRMGWADYDCS 450
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 168/371 (45%), Gaps = 38/371 (10%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
Y I LG + + VIVDTGSD+ WV+C PC+SC ++QD + P + +S +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQD-IIPPLSIYNLSASSTSSVS 141
Query: 193 CHALEFATGNSGVCSSS-SPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
+ TG VCS S + C Y SY D S + G R+ + G A+ + F
Sbjct: 142 SCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIFF 201
Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG 305
GC N G + V G+MG G ++ +Q T +FS+CL + G G L G
Sbjct: 202 GCATNITGSW-PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGG--GILEFGE 258
Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGFAKG-----GIL 356
N+T + +T ++ + T Y ++L IS+ K L + + + G++
Sbjct: 259 AP----NTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVI 311
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKME 414
IDSGT L L E +K + P L+ CF L + E + P V +
Sbjct: 312 IDSGTTFVLLTTKANRMLFQE-IKSLTTAKLGPKLEGLE-CFYLKSGLTMETSFPNVTLT 369
Query: 415 FEGNAEMTV--DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
F G + M + D ++ K + C A +S D I G K++ V YD +N
Sbjct: 370 FSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSS---ADGLTIFGEIVLKDKLVFYDVENR 426
Query: 473 QLGFAGEDCSS 483
++G+ G++CSS
Sbjct: 427 RIGWKGQNCSS 437
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 168/382 (43%), Gaps = 57/382 (14%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVL 187
Y ++LG R V +DTGSD+ WV C PC C + + +FD + S S + +
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG----LGKASVN 243
C C A+ T C + + C+Y Y D S T G + + LG++++
Sbjct: 144 CTDPICAAVSTTTDQ---CLTQT-DHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIA 199
Query: 244 D----FIFGCG--------RNNKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCL 289
+ +FGC R K L G+ G G+ + S++SQ S I +FS+CL
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKAL----DGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-QAS 348
++ G G L+LG + + S I Y+ +IP+ Y L L I++ G+ +
Sbjct: 256 KGGENGG--GILVLG---EILEPS--IVYSPLIPS---QPHYTLKLQSIALSGQLFPNPT 305
Query: 349 GFA---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE 405
F G +IDSGT + L +Y + + S + P S CF +S
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQ-SATPTISRGSQCFRVSMSVA 364
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL------SYEDETGIIGNYQ 459
P+++ FEG A M V + F D+ C ASL ED I+G+
Sbjct: 365 DIFPVLRFNFEGIASMVVTPEEYLQF---DSIVSCYKFASLWCIGFQKAEDGLNILGDLV 421
Query: 460 QKNQRVIYDTKNSQLGFAGEDC 481
K++ ++YD ++G+A DC
Sbjct: 422 LKDKIIVYDLAQQRIGWANYDC 443
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 168/376 (44%), Gaps = 43/376 (11%)
Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCN 189
+Y+A +G V IVD +L W QC C+S C+ Q+ PVFDPS S +Y+ C
Sbjct: 61 HYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCG 120
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVS--YGDGSYTRGELGREHLGLGKASVNDFIF 247
S C ++ T N CS +C Y +GD T G + + +G A F
Sbjct: 121 SPLCKSIP--TRN---CSGDG--ECGYEAPSMFGD---TFGIASTDAIAIGNAE-GRLAF 169
Query: 248 GC----GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
GC + G G SG +GLGR+ SLV Q++ FSYCL + G +L L
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCL-ALHGPGKKSALFL 225
Query: 304 GGNSSV--FKNSTPIT-----YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGIL 356
G ++ + S P T + + + +Y + L GI G + A+ G I
Sbjct: 226 GASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAIT 285
Query: 357 I---DSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSILDTCFNLSAYQEVNIPLVK 412
+ ++ ++ LP + Y AL+ + + G PS A D CF +A +P +
Sbjct: 286 VLQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMANPPEPFDLCFQNAAVS--GVPDLV 342
Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL----SYEDETGIIGNYQQKNQRVIYD 468
F+G A +T + + + VCL++ S S +D I+G+ Q+N ++D
Sbjct: 343 FTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFD 402
Query: 469 TKNSQLGFAGEDCSSM 484
+ L F DCSS+
Sbjct: 403 LEKETLSFEPADCSSL 418
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 167/375 (44%), Gaps = 43/375 (11%)
Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNS 190
Y+A +G V IVD +L W QC C+S C+ Q+ PVFDPS S +Y+ C S
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVS--YGDGSYTRGELGREHLGLGKASVNDFIFG 248
C ++ T N CS +C Y +GD T G + + +G A FG
Sbjct: 122 PLCKSIP--TRN---CSGDG--ECGYEAPSMFGD---TFGIASTDAIAIGNAE-GRLAFG 170
Query: 249 C----GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
C + G G SG +GLGR+ SLV Q++ FSYCL + G +L LG
Sbjct: 171 CVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCL-APHGPGKKSALFLG 226
Query: 305 GNSSV--FKNSTPIT-----YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI 357
++ + S P T + + + +Y + L GI G + A+ G I I
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITI 286
Query: 358 ---DSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSILDTCFNLSAYQEVNIPLVKM 413
++ ++ LP + Y AL+ + + G PS A D CF +A +P +
Sbjct: 287 LQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMANPPEPFDLCFQNAAVS--GVPDLVF 343
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASL----SYEDETGIIGNYQQKNQRVIYDT 469
F+G A +T + + + VCL++ S S +D I+G+ Q+N ++D
Sbjct: 344 TFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDL 403
Query: 470 KNSQLGFAGEDCSSM 484
+ L F DCSS+
Sbjct: 404 EKETLSFEPADCSSL 418
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 166/374 (44%), Gaps = 50/374 (13%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD---PV--FDPSISPSYKKVL 187
Y ++LG R + VDTGSDL WV C PC C D P+ +D S S KV
Sbjct: 36 YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
C+ +C L SG C+ + C Y YGDGS T G L + L + IF
Sbjct: 96 CSDPSC-TLITQISESG-CNDQN--QCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVIF 151
Query: 248 GCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSL 301
GCG G + G++G G SDLS SQ ++ +F++CL + G G L
Sbjct: 152 GCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGG--GIL 209
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-----SGFAKGGIL 356
+LG +V + I YT ++P + Y + L IS+ L S G +
Sbjct: 210 VLG---NVIEPD--IQYTPLVP---YMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261
Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
DSGT + LP Y A Q AP F + DT LS + P V + FE
Sbjct: 262 FDSGTTLAYLPDEAYQA-----FTQAVSLVVAP-FLLCDT--RLSRFIYKLFPNVVLYFE 313
Query: 417 GNAEMTVDVTGIVYFVK----SDASQVCL---ALASLSYEDETGIIGNYQQKNQRVIYDT 469
G A MT +T Y ++ ++A C+ ++ S E + I G+ KN+ V+YD
Sbjct: 314 G-ASMT--LTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDL 370
Query: 470 KNSQLGFAGEDCSS 483
+ ++G+ DC +
Sbjct: 371 ERGRIGWRPFDCKT 384
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 125/475 (26%), Positives = 200/475 (42%), Gaps = 81/475 (17%)
Query: 51 SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNL----HVQYLQSRI 106
SS+ ++ + SR+ +L H+N + D NE ++R + +L+S+I
Sbjct: 27 SSTLITTKPSRLAT-----KLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKI 81
Query: 107 KNMIS-GNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQP 163
K + S GN + ++ IP G ++ + +G +T V+VDTGS L WVQC P
Sbjct: 82 KELKSVGN--EARSSLIPFNRGS-----GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
C +C+ Q FDP S S+K + C + + N C+ + + Y + Y G
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYI-----NGYKCNRFNQAE--YKLRYLGG 187
Query: 224 SYTRGELGREHL------------------GLGKASVNDFIFGCGR-----NNKGLFGGV 260
++G L +E L + K ++ FGCG NN + GV
Sbjct: 188 DSSQGILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYNGV 247
Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYT 319
GL +++ +Q G FSYC+ + L+LG S + +STP+
Sbjct: 248 FGLGAY--PHITMATQ----LGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQI- 300
Query: 320 NMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYS 372
+Y+ L IS+G K L + S GG+LIDSG T+L +
Sbjct: 301 ------HFGHYYV-TLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFE 353
Query: 373 ALKAEFLKQFSGF----PSAPGFSILDTCFN-LSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
L E + G P+ F L CF + + V P V F G A++ ++
Sbjct: 354 LLYDEIVDLMKGLLERIPTQRKFEGL--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGS 411
Query: 428 IVYFVKSDASQVCLA-LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ F + + CLA L S S +IG Q+N V +D + ++ F DC
Sbjct: 412 L--FRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 464
>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 654
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 162/362 (44%), Gaps = 45/362 (12%)
Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH-ALEFATGN 202
+ +VI DTGS L C C C + D F S + V C+ H + T
Sbjct: 76 QRASVIADTGSGLMAFPCSGCDGCGSHTDQPFQADNSSTLIHVTCSQQQSHFQCKECTEK 135
Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVND----------FIFGCGR 251
S C+ S SY +GS + + + + L G++S +D F FGC
Sbjct: 136 SDTCAISQ--------SYMEGSSWKASVVEDVVYLGGESSFHDEAMRDRYGTHFQFGCQS 187
Query: 252 NNKGLFGG--VSGLMGLGRSDLSLVS---QTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
+ GLF G+MGL SD +V+ + ++I LFS C +G + G
Sbjct: 188 SETGLFVTQVADGIMGLSNSDTHIVAKLHRENKIPSNLFSLCFTE------NGGTMSVGE 241
Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVIT 364
+ + I+Y +I + FY +N+ I IGGK + A + +G ++DSGT +
Sbjct: 242 PNTKAHRGEISYAKVIKDRSAGHFYNVNMKDIRIGGKSINAKEEAYTRGHYIVDSGTTDS 301
Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG----NAE 420
LP A+K EFL+ F + + + +C + ++P +++ E N E
Sbjct: 302 YLP----RAMKNEFLQVFKEV-AGRDYQVGTSCHGYTNEDLASLPKIQLVMEAYGDENGE 356
Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
+ +D+ Y + +D S C ++ LS E+ G+IG N+ VI+D N ++GF D
Sbjct: 357 VIIDIPPEQYLLHNDNS-YCGSIY-LS-ENAGGVIGANLMMNRDVIFDNGNQRVGFVDAD 413
Query: 481 CS 482
C+
Sbjct: 414 CA 415
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 158/368 (42%), Gaps = 42/368 (11%)
Query: 135 YIATIELGGRNMTVIV--DTGSDLTWV-----QCQPCKSCYNQQDP---VFDPSISPSYK 184
Y A +++G + +V DTGSDL WV QC P S D ++ P+ S + +
Sbjct: 100 YYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSR 159
Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY-GDGSYTRGELGREHLGL----GK 239
+ C+ C SG + P C Y + Y + + + G L + L L G
Sbjct: 160 HLPCSHELCQP------GSGCTNPKQP--CTYNIDYFSENTTSSGLLIEDSLHLNSREGH 211
Query: 240 ASVN-DFIFGCGRNNKGLF-GGVS--GLMGLGRSDLSLVS--QTSEIFGGLFSYCLPSTQ 293
A VN I GCGR G + G++ GL+GLG +D+S+ S + + FS C
Sbjct: 212 APVNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCF---- 267
Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG 353
+SG + G + STP +P Y +N+ IG K L+ S F
Sbjct: 268 KEDSSGRIFFGDQGVSSQQSTPF-----VPLYGKLQTYAVNVDKSCIGHKCLEGSSFQA- 321
Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
L+DSGT T LPP +Y A EF KQ + S C++ S + ++P + +
Sbjct: 322 --LVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASPLEMPDVPTIIL 379
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
F N V I+ F + LA L + GIIG V++D ++ +
Sbjct: 380 AFAANKSFQA-VNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGYHVVFDRESMK 438
Query: 474 LGFAGEDC 481
LG+ +C
Sbjct: 439 LGWYRSEC 446
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 177/390 (45%), Gaps = 33/390 (8%)
Query: 105 RIKNMISGNIKD-VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
R+ N +S D + + + +G+ +Y+ ++LG + V+VDT S L+WV C
Sbjct: 95 RLANRLSSCPADEATASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGC 154
Query: 162 QPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
+PC +C P F+P+ S +YK V C S+ C+A+ AT C + + C+Y SY
Sbjct: 155 EPCINACL---IPTFNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPT-EGCSYRQSY 210
Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
D S + G + + L G S FIFGC +G+ G SG++G+ + SL SQ +
Sbjct: 211 HDYSLSVGVVSSDTLTYGLGS-QKFIFGCCNLFRGVGGRYSGILGMSVNKFSLFSQMT-- 267
Query: 281 FGGLF---SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
G + SYC P ++ G + G K+ T + N Y ++++
Sbjct: 268 VGHRYRAMSYCFPHPRNQG----FLQFGRYDEHKSLLRFTPLYIDGNN-----YFVHVSN 318
Query: 338 ISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
+ + L Q+SG D+GT T LP S++ +L G+ G S
Sbjct: 319 VMVETMSLDVQSSGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGYYRV-GASTGQ 377
Query: 396 TCFNLSAYQ---EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
TCF ++ +P VK+EF+ A +T++ +++ + + CLA D
Sbjct: 378 TCFQADGNWIEGDLYMPTVKIEFQNGARITLNSEDLMFMEEPNV--FCLAFKMNDGGDI- 434
Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
++G+ + D + +G G+ C+
Sbjct: 435 -VLGSRHLMGVHTVVDLEMMTMGLRGQGCN 463
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 167/382 (43%), Gaps = 53/382 (13%)
Query: 135 YIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCN 189
Y I LG + V VDTGSD WV C C +C + + ++DP+ S + K V C+
Sbjct: 77 YYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCD 136
Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVND 244
C +T + + C Y ++YGDGS T G ++ L + +V D
Sbjct: 137 DEFCT----STYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 192
Query: 245 ---FIFGCGRNNKGLFGGVS-----GLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQD 294
IFGCG G + G++G G+++ S++SQ + +FS+CL +
Sbjct: 193 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVN- 251
Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-----SG 349
G +G +TP+ P++A + ++ L I + G +Q
Sbjct: 252 --GGGIFAIGEVVQPKVKTTPLV-------PRMAHYNVV-LKDIEVAGDPIQLPTDIFDS 301
Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAYQEVN 407
+ G +IDSGT + LP SIY L + L Q SG + + D TCF+ S + ++
Sbjct: 302 TSGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMEL---YLVEDQFTCFHYSDEKSLD 358
Query: 408 --IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQK 461
P VK FE +T ++ K D C+ + + + G ++G+
Sbjct: 359 DAFPTVKFTFEEGLTLTAYPHDYLFPFKED--MWCIGWQKSTAQTKDGKDLILLGDLVLT 416
Query: 462 NQRVIYDTKNSQLGFAGEDCSS 483
N+ IYD N +G+ +CSS
Sbjct: 417 NKLFIYDLDNMSIGWTDYNCSS 438
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 68/401 (16%)
Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
S+T + G T +Y T+ +G + + VDTGSDLTW+QC PC+SC P+
Sbjct: 36 SSTAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPL 95
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
+ P+ + + V C ++ C AL G++ C S P C+Y + Y D + ++G L +
Sbjct: 96 YRPTAN---RLVPCANALCTALHSGQGSNNKCPS--PKQCDYQIKYTDSASSQGVLINDS 150
Query: 235 LGLGKASVN---DFIFGCGRN-----NKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGL 284
L S N FGCG + N + + G++GLGR +SLVSQ + I +
Sbjct: 151 FSLPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNV 210
Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT--FY-----ILNLTG 337
+CL + G + G+ V P + +P Q + +Y L
Sbjct: 211 VGHCLSTN-----GGGFLFFGDDVV-----PSSRVTWVPMAQRTSGNYYSPGSGTLYFDR 260
Query: 338 ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIY----SALK---AEFLKQFSGFPSAP- 389
S+G K ++ ++ DSG+ T Y SALK ++ LKQ S P+ P
Sbjct: 261 RSLGVKPME--------VVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSD-PTLPL 311
Query: 390 ---GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL--- 443
G + F++ E + NA M + Y + + VCL +
Sbjct: 312 CWKGQKAFKSVFDVK--NEFKSMFLSFSSAKNAAMEIPPEN--YLIVTKNGNVCLGILDG 367
Query: 444 --ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
A LS+ +IG+ ++Q VIYD + SQLG+A C+
Sbjct: 368 TAAKLSFN----VIGDITMQDQMVIYDNEKSQLGWARGACT 404
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 121/418 (28%), Positives = 184/418 (44%), Gaps = 64/418 (15%)
Query: 119 NTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWVQCQ---PCKSCYNQQD 172
+ +P T+ + + Y T LG + + V++DTGS LTWV C C++C +
Sbjct: 50 HPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSA 109
Query: 173 ---PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS--SPPDCN-----------Y 216
PVF P S S + V C + +C + A + C + SP N Y
Sbjct: 110 SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPY 169
Query: 217 FVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
V YG GS T G L + L +V F+ GC + + SGL G GR S+ +Q
Sbjct: 170 AVVYGSGS-TAGLLIADTLRAPGRAVPGFVLGCSLVS--VHQPPSGLAGFGRGAPSVPAQ 226
Query: 277 TSEIFGGL--FSYCLPSTQ---DAGASGSLILGGNSSVFK-NSTPITYTNMIPNPQLATF 330
GL FSYCL S + +A SGSL+LGG P+ + +
Sbjct: 227 L-----GLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVY 281
Query: 331 YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
Y L L G+++GGK ++ A+ GG ++DSGT T L P+++ + +
Sbjct: 282 YYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVG 341
Query: 384 GF----PSAPGFSILDTCFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV---KSD 435
G A L CF L + + +P + FEG A M + V YFV +
Sbjct: 342 GRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVEN--YFVVAGRGA 399
Query: 436 ASQVCLALASLSYEDETG----------IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+CLA+ + + +G I+G++QQ+N V YD + +LGF + C+S
Sbjct: 400 VEAICLAVVT-DFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTS 456
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 68/401 (16%)
Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
S+T + G T +Y T+ +G + + VDTGSDLTW+QC PC+SC P+
Sbjct: 36 SSTAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPL 95
Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
+ P+ + + V C ++ C AL G++ C S P C+Y + Y D + ++G L +
Sbjct: 96 YRPTAN---RLVPCANALCTALHSGQGSNNKCPS--PKQCDYQIKYTDSASSQGVLINDS 150
Query: 235 LGLGKASVN---DFIFGCGRN-----NKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGL 284
L S N FGCG + N + + G++GLGR +SLVSQ + I +
Sbjct: 151 FSLPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNV 210
Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT--FY-----ILNLTG 337
+CL + G + G+ V P + +P Q + +Y L
Sbjct: 211 VGHCLSTN-----GGGFLFFGDDVV-----PSSRVTWVPMAQRTSGNYYSPGSGTLYFDR 260
Query: 338 ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIY----SALK---AEFLKQFSGFPSAP- 389
S+G K ++ ++ DSG+ T Y SALK ++ LKQ S P+ P
Sbjct: 261 RSLGVKPME--------VVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSD-PTLPL 311
Query: 390 ---GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL--- 443
G + F++ E + NA M + Y + + VCL +
Sbjct: 312 CWKGQKAFKSVFDVK--NEFKSMFLSFASAKNAAMEIPPEN--YLIVTKNGNVCLGILDG 367
Query: 444 --ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
A LS+ +IG+ ++Q VIYD + SQLG+A C+
Sbjct: 368 TAAKLSFN----VIGDITMQDQMVIYDNEKSQLGWARGACT 404
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 118/416 (28%), Positives = 182/416 (43%), Gaps = 61/416 (14%)
Query: 98 HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
H + LQS + +++ + S+ P G+ Y ++LG R V +DTGSD
Sbjct: 56 HGRLLQSPVGGVVNFPVDGASD---PFLVGL------YYTKVKLGTPPREFNVQIDTGSD 106
Query: 156 LTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
+ WV C C C Q FDP +S S V C+ C++ N S S
Sbjct: 107 VLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYS------NFQTESGCS 160
Query: 211 PPD-CNYFVSYGDGSYTRGELGREHLG--------LGKASVNDFIFGCGRNNKGLF---- 257
P + C+Y YGDGS T G + + L S F+FGC G
Sbjct: 161 PNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPR 220
Query: 258 GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
V G+ GLG+ LS++SQ + + +FS+CL + G G ++LG + + T
Sbjct: 221 RAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGG--GIMVLG---QIKRPDT- 274
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAKG-GILIDSGTVITRLPPSI 370
YT ++P+ Y +NL I++ G+ L A G G +ID+GT + LP
Sbjct: 275 -VYTPLVPS---QPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEA 330
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDT---CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
YS F++ + S G I CF ++A P V + F G A M +
Sbjct: 331 YS----PFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASMVLGPRA 386
Query: 428 IVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
+ S S + C+ +S+ T I+G+ K++ V+YD ++G+A DCS
Sbjct: 387 YLQIFSSSGSSIWCIGFQRMSHRRIT-ILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 123/409 (30%), Positives = 171/409 (41%), Gaps = 88/409 (21%)
Query: 150 VDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPSYKKV-------------------LC 188
+DTGSDL W C+P C C ++ P P S LC
Sbjct: 98 LDTGSDLVWFPCRPFTCILCESKPLPPSPPPTLSSSATTVSCSSPSCSAAHSSLPSSDLC 157
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
S C TG+ C++SS P ++ +YGDGS +L + L L SV +F FG
Sbjct: 158 AISNCPLDYIETGD---CNTSSYPCPPFYYAYGDGSLV-AKLFSDSLSLPSVSVANFTFG 213
Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQ---TSEIFGGLFSYCLPS----TQDAGASGSL 301
C G+ G GR LSL +Q S G FSYCL S + L
Sbjct: 214 CAHTT---LAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRVRRPSPL 270
Query: 302 ILG-----------------GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
ILG K +T M+ NP+ FY ++L GISIG +
Sbjct: 271 ILGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKHPYFYSVSLQGISIGKRN 330
Query: 345 LQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF--------PSAP 389
+ A + GG+++DSGT T LP Y+++ EF + PS
Sbjct: 331 IPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPS-- 388
Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV--------KSDASQV-C 440
S + C+ L+ Q V +P + + F GN TV + YF K + +V C
Sbjct: 389 --SGMSPCYYLN--QTVKVPALVLHFAGNGS-TVTLPRRNYFYEFMDGGDGKEEKRKVGC 443
Query: 441 LALASLSYEDE----TG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
L L + E E TG I+GNYQQ+ V+YD N ++GFA C+S+
Sbjct: 444 LMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASL 492
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 169/368 (45%), Gaps = 59/368 (16%)
Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDP--VFDPSISPSYKKVLCNSSTCHAL--EFATGNS 203
+I+DTGS L+W+QC K + P VFDPS+S S+ + CN C +F S
Sbjct: 92 MILDTGSQLSWIQCH--KKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTS 149
Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRN---NKGLFGG 259
C + C+Y Y DG+ G L RE + + S I GC + +KG+ G
Sbjct: 150 --CDLNR--LCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDASDDKGILG- 204
Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG---ASGSLILG--GNSSVFKNST 314
M LGR LS SQ FSYC+P+ Q +GS LG NS+ F+ +
Sbjct: 205 ----MNLGR--LSFASQAKIT---KFSYCVPTRQVRPGFTPTGSFYLGENPNSAGFQYIS 255
Query: 315 PITYTNMIPNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRL 366
+T++ P L + + L GI IG K+L +A G +IDSG+ T L
Sbjct: 256 LLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYL 315
Query: 367 PPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKM--EFEGNAEM 421
Y+ ++ E ++ +G G+ + D CF+ +A E+ + M EF+ E+
Sbjct: 316 VDVAYNKVREEVVR-LAGPRLKKGYVYSGVSDMCFDGNA-MEIGRLIGNMVFEFDKGVEI 373
Query: 422 TV-------DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
+ DV G V+ V S++ A + IIGN+ Q+N V +D N ++
Sbjct: 374 VIEKGRVLADVGGGVHCVGIGRSEMLGA--------ASNIIGNFHQQNLWVEFDIANRRV 425
Query: 475 GFAGEDCS 482
GF DCS
Sbjct: 426 GFGKADCS 433
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 115/413 (27%), Positives = 178/413 (43%), Gaps = 55/413 (13%)
Query: 98 HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
H + LQS + +++ + S+ P G+ Y ++LG R V +DTGSD
Sbjct: 56 HGRLLQSPVGGVVNFPVDGASD---PFLVGL------YYTKVKLGTPPREFNVQIDTGSD 106
Query: 156 LTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
+ WV C C C Q FDP +S S V C+ C++ N S S
Sbjct: 107 VLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYS------NFQTESGCS 160
Query: 211 PPD-CNYFVSYGDGSYTRGELGREHLG--------LGKASVNDFIFGCGRNNKGLF---- 257
P + C+Y YGDGS T G + + L S F+FGC G
Sbjct: 161 PNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPR 220
Query: 258 GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
V G+ GLG+ LS++SQ + + +FS+CL + G G ++LG + + T
Sbjct: 221 RAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGG--GIMVLG---QIKRPDT- 274
Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAKG-GILIDSGTVITRLPPSI 370
YT ++P+ Y +NL I++ G+ L A G G +ID+GT + LP
Sbjct: 275 -VYTPLVPS---QPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEA 330
Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
YS S + + CF ++A P V + F G A M + +
Sbjct: 331 YSPFIQAIANAVSQYGRPITYESYQ-CFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQ 389
Query: 431 FVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
S S + C+ +S+ T I+G+ K++ V+YD ++G+A DCS
Sbjct: 390 IFSSSGSSIWCIGFQRMSHRRIT-ILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 96/309 (31%), Positives = 134/309 (43%), Gaps = 30/309 (9%)
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GK-ASV 242
C+S CH L+ +GVCS CNY YGD S T+G L ++ GK S+
Sbjct: 21 CDSPLCHKLD-----TGVCSPEK--RCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSL 73
Query: 243 NDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASG 299
+ F+FGCG NN G F GL+GLG SL+SQ +FGG FS CL P D S
Sbjct: 74 SRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISS 133
Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-SGFAKGGILID 358
+ G S V + + T ++ Q T Y + L GIS+ L S KG +L+D
Sbjct: 134 RMSFGKGSQVLGDG--VVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVD 191
Query: 359 SGTVITRLPPSIYSALKAEF-----LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
SGT LP +Y + E L+ + PS T NL P +
Sbjct: 192 SGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLKG------PTLTY 245
Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
FEG + + + CLA+ + + G+ GN+ Q N + +D
Sbjct: 246 HFEGANLLLTPIQTFIPPTPETKGVFCLAINNYT-NSNGGVYGNFAQSNYLIGFDLDRQV 304
Query: 474 LGFAGEDCS 482
+ F DC+
Sbjct: 305 VSFKATDCT 313
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 161/368 (43%), Gaps = 39/368 (10%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
Y +I +G R + VDTGSDLTW+QC PC +C P++ P+ K V S
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKE---KIVPPRDS 247
Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
C L+ G+ C + C+Y + Y D S + G L ++ + L G DF+F
Sbjct: 248 LCQELQ---GDQNYCETCK--QCDYEIEYADRSSSMGVLAKDDMHLIATNGGREKLDFVF 302
Query: 248 GCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSL 301
GC + +G G++GL + +SL SQ + I +F +C+ T++ G +
Sbjct: 303 GCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCI--TRETNGGGYM 360
Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
LG + + +T+ + P Y ++ G ++L A + ++ DSG+
Sbjct: 361 FLGDD---YVPRWGMTWAPIRGGPD--NLYHTEAQKVNYGDQELHAGNSVQ--VIFDSGS 413
Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL-----SAYQEVNIPLVKMEFE 416
T LP +Y L + F + L C+ S ++ +N+ + F
Sbjct: 414 SYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRWFV 473
Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSY--EDETGIIGNYQQKNQRVIYDTKNSQL 474
T + Y + SD VCL L + + T I+G+ + + V+YD + Q+
Sbjct: 474 --VPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQI 531
Query: 475 GFAGEDCS 482
G+A +C+
Sbjct: 532 GWANSECT 539
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 167/382 (43%), Gaps = 54/382 (14%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
Y A I +G + V VDTGSD+ WV C C +C + D +++P S + +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
C+ C AT ++ + C Y V YGDGS T G +++ L +A N
Sbjct: 133 CDQPFCS----ATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188
Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
+FGCG G G + G++G G+++ S++SQ + +F++CL S
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSIS 248
Query: 294 DAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----- 346
G A G ++ + T ++PN Y + L G+ +G L
Sbjct: 249 GGGIFAIGEVV----------EPKLXNTPVVPN---QAHYNVVLNGVKVGDTALDLPLGL 295
Query: 347 -ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAY 403
+ + +G I IDSGT + LP SIY L + L P ++ D TCF
Sbjct: 296 FETSYKRGAI-IDSGTTLAYLPESIYLPLMEKIL---GAQPDLKLRTVDDQFTCFVFDKN 351
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED--ETGIIGNYQQK 461
+ P V +FE + +T+ ++ ++ D V + +D E ++G+ +
Sbjct: 352 VDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411
Query: 462 NQRVIYDTKNSQLGFAGEDCSS 483
N+ V Y+ +N +G+ +CSS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCSS 433
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 175/401 (43%), Gaps = 52/401 (12%)
Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG----GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQ 170
D S T P+ + L Y I +G G+ + +DTGSDLTW+QC PC SC
Sbjct: 180 DSSTTIFPVGGNVYPDGL-YYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKG 238
Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGN-SGVCSSSSPPDCNYFVSYGDGSYTRGE 229
+ ++ P K L SS +E + C S C+Y + Y D SY+ G
Sbjct: 239 ANQLYKPR-----KDNLVRSSEPFCVEVQRNQLTEHCESCH--QCDYEIEYADHSYSMGV 291
Query: 230 LGRE--HLGL--GKASVNDFIFGCGRNNKGLFGG----VSGLMGLGRSDLSLVSQTSE-- 279
L ++ HL L G + +D +FGCG + +GL G++GL R+ +SL SQ +
Sbjct: 292 LTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRG 351
Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
I + +CL S D G + +G + S +T+ M+ +P L Y + +T +S
Sbjct: 352 IISNVVGHCLAS--DLNGEGYIFMGSD---LVPSHGMTWVPMLHHPHLEV-YQMQVTKMS 405
Query: 340 IGGKQLQASGF--AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
G L G G +L D+G+ T P YS L L++ S S D
Sbjct: 406 YGNAMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTS-LQEVSDLELTRDDS--DEA 462
Query: 398 FNLSAYQEVNIPL-----VKMEFE------GNAEMTVDVTGIV----YFVKSDASQVCLA 442
+ + N P+ VK F G+ + + ++ Y + S+ VCL
Sbjct: 463 LPICWRAKTNSPISSLSDVKKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLG 522
Query: 443 L--ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
+ S ++ T IIG+ + + ++YD ++G+ DC
Sbjct: 523 ILDGSNVHDGSTIIIGDISMRGRLIVYDNVKQRIGWMKSDC 563
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 115/384 (29%), Positives = 170/384 (44%), Gaps = 59/384 (15%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV----------FDPSISPS 182
Y ++LG R+ V +DTGSD+ WV C C C PV FDP SP+
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGC-----PVNSGLHIPLNFFDPGSSPT 106
Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG----LG 238
+ C+ C +L + +S VCS+ + C Y YGDGS T G + L LG
Sbjct: 107 ASLISCSDQRC-SLGLQSSDS-VCSAQNNL-CGYNFQYGDGSGTSGYYVSDLLHFDTVLG 163
Query: 239 KASVND----FIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYC 288
+ +N+ +FGC G V G+ G G+ D+S+VSQ + I FS+C
Sbjct: 164 GSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHC 223
Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
L G G L+LG V N I YT ++P+ Y LN+ IS+ G+ L
Sbjct: 224 LKGDDSGG--GILVLG--EIVEPN---IVYTPLVPS---QPHYNLNMQSISVNGQTLAID 273
Query: 349 GFAKG-----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA-PGFSILDTCFNLSA 402
G G +IDSGT + L + Y + S PS P S + C+ +S+
Sbjct: 274 PSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVS--PSVRPYLSKGNHCYLISS 331
Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD----ASQVCLALASLSYEDETGIIGNY 458
P V + F G A M + Y ++ A+ C+ + + T I+G+
Sbjct: 332 SINDIFPQVSLNFAGGASMILIPQD--YLIQQSSIGGAALWCIGFQKIQGQGIT-ILGDL 388
Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
K++ +YD N ++G+A DCS
Sbjct: 389 VLKDKIFVYDIANQRIGWANYDCS 412
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 167/382 (43%), Gaps = 54/382 (14%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
Y A I +G + V VDTGSD+ WV C C +C + D +++P S + +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
C+ C AT ++ + C Y V YGDGS T G +++ L +A N
Sbjct: 133 CDQPFCS----ATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188
Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
+FGCG G G + G++G G+++ S++SQ + +F++CL S
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSIS 248
Query: 294 DAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----- 346
G A G ++ + T ++PN Y + L G+ +G L
Sbjct: 249 GGGIFAIGEVV----------EPKLKTTPVVPN---QAHYNVVLNGVKVGDTALDLPLGL 295
Query: 347 -ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAY 403
+ + +G I IDSGT + LP SIY L + L P ++ D TCF
Sbjct: 296 FETSYKRGAI-IDSGTTLAYLPDSIYLPLMEKIL---GAQPDLKLRTVDDQFTCFVFDKN 351
Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED--ETGIIGNYQQK 461
+ P V +FE + +T+ ++ ++ D V + +D E ++G+ +
Sbjct: 352 VDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411
Query: 462 NQRVIYDTKNSQLGFAGEDCSS 483
N+ V Y+ +N +G+ +CSS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCSS 433
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 106/415 (25%), Positives = 172/415 (41%), Gaps = 70/415 (16%)
Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
+ R +N+++ + + IP +G+ Y I +G V +DTGS WV
Sbjct: 58 RHRRRNLMAAELP-LGGFNIPYGTGL------YYTDIGIGTPAVKYYVQLDTGSKAFWVN 110
Query: 161 CQPCKSCYNQQDPV-----FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP--- 212
CK C ++ D + +DP S S K+V C+ + C +S PP
Sbjct: 111 GISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC--------------TSRPPCNM 156
Query: 213 --DCNYFVSYGDGSYTRGELGREHL--------GLGKASVNDFIFGCGRNNKGLFG---- 258
C Y Y DG T G L + L G + + FGCG G
Sbjct: 157 TLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAV 216
Query: 259 GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
+ G++G G S+ + +SQ + +FS+CL ST G +G +TPI
Sbjct: 217 AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI---FAIGEVVEPKVKTTPI 273
Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-----GILIDSGTVITRLPPSIY 371
N + ++++NL I++ G LQ G G IDSG+ + LP IY
Sbjct: 274 VKNNEV-------YHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIY 326
Query: 372 SALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
S L L F+ P ++ + CF+ + P + FE ++T+DV Y
Sbjct: 327 SEL---ILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFEN--DLTLDVYPYDY 381
Query: 431 FVKSDASQVCLAL--ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
++ + +Q C A + + I+G+ N+ V+YD + +G+ +CSS
Sbjct: 382 LLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHNCSS 436
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 179/398 (44%), Gaps = 58/398 (14%)
Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
++PL SG+ +T Y I +G + V VDTGSD+ WV C C C + +
Sbjct: 75 DLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIEL 134
Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
++DP S S + V C+ C A S C+S+SP C Y +SYGDGS T G
Sbjct: 135 TMYDPRGSQSGELVTCDQQFCVANYGGVLPS--CTSTSP--CEYSISYGDGSSTAGFFVT 190
Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI 280
+ L + S + FGCG G G + G++G G+S+ S++SQ +
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAA 250
Query: 281 --FGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
+F++CL + G A G+++ + T ++P+ Y + L
Sbjct: 251 GKVRKMFAHCLDTVNGGGIFAIGNVV----------QPKVKTTPLVPD---MPHYNVILK 297
Query: 337 GISIGGKQLQ------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
GI +GG L SG +KG I IDSGT + +P +Y AL A +
Sbjct: 298 GIDVGGTALGLPTNIFDSGNSKGTI-IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQ-- 354
Query: 391 FSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
++ D +CF S + P V FEG+ + V Y ++ + C+ + +
Sbjct: 355 -TLQDFSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD--YLFQNGKNLYCMGFQNGGGK 411
Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
+ G ++G+ N+ V+YD +N +G+A +CSS
Sbjct: 412 TKDGKDLGLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 91/275 (33%), Positives = 133/275 (48%), Gaps = 29/275 (10%)
Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
R EL RE + + +A FGCG + G G SGLMGL +SL+SQ S FS
Sbjct: 80 RQELHREPVHVRRA----LGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSV---PRFS 132
Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMIPNPQLATF-YILNLTGISIGGK 343
YCL + S ++ G + + K +T PI T ++ NP + TF Y + L G+S+G K
Sbjct: 133 YCLTPFAERKTS-PMLFGAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTK 191
Query: 344 QLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILD 395
+L+ A+ A GG ++DSG+ + L + A+K L+ P G +
Sbjct: 192 RLRVPAASLAINPDGTGGTIVDSGSTMAHLAGKAFDAVKKAVLEAVK-LPVFNGTVEDYE 250
Query: 396 TCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED-- 450
CF + A V P + + F+G A M + YF + A +CLA+A S ED
Sbjct: 251 LCFAVPSGVAMAAVKTPPLVLHFDGGAAMALPRDN--YFQEPRAGLMCLAVAR-SPEDLG 307
Query: 451 -ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
IIGN QQ+N V++D N + FA C +
Sbjct: 308 APISIIGNVQQQNMHVLFDVHNQKFSFAPTKCHDI 342
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 139/323 (43%), Gaps = 62/323 (19%)
Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
LT+G L YI T + +IVD+GS +T+V C C+ C N QDP F P +S SY
Sbjct: 84 LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSY 139
Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
V CN TC S C Y Y + S + G LG + + G+ S
Sbjct: 140 SPVKCNVDCTC--------------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE 185
Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
+FGC + G LF G+MGLGR LS++ Q E + FS C
Sbjct: 186 LKAQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIG 245
Query: 296 GASGSLILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
G G+++LGG + VF S P+ + +Y + L I + GK L+
Sbjct: 246 G--GAMVLGGVPTPSDMVFSRSDPLR----------SPYYNIELKEIHVAGKALRVDSRI 293
Query: 351 --AKGGILIDSGTVITRLPPSIYSAL------KAEFLKQFSGFPSAPGFSILDTCF---- 398
+K G ++DSGT LP + A K LK+ G P S D CF
Sbjct: 294 FDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRG----PDPSYKDICFAGAR 349
Query: 399 -NLSAYQEVNIPLVKMEFEGNAE 420
N+S EV P V M F GN +
Sbjct: 350 RNVSKLHEV-FPDVDMVF-GNGQ 370
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 165/381 (43%), Gaps = 51/381 (13%)
Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQ----PCKSCYNQQDPVFDPSISPSYKKVLC 188
Y +I +G + + +DTGSDLTWVQC PCK C +D ++ P+ + V C
Sbjct: 62 YTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKDKLYKPN---GKQVVKC 118
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE--HLGLGKASVNDFI 246
+ C A + +CS SPP C Y V Y D + T G L R+ H+G +S D +
Sbjct: 119 SDPICVATQSTHVLGQICSKQSPP-CVYNVQYADHASTLGVLVRDYMHIGSPSSSTKDPL 177
Query: 247 --FGCGRNNKGLFGGVS-------GLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDA 295
FGCG K F G + G++GLG S++SQ + I + +CL A
Sbjct: 178 VAFGCGYEQK--FSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCL----SA 231
Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI 355
G L LG F S+ I +T +I + L Y + GK A G I
Sbjct: 232 EGGGYLFLGDK---FVPSSGIVWTPIIQS-SLEKHYNTGPVDLFFNGKPTPAKGLQ---I 284
Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFN----LSAYQEVN--I 408
+ DSG+ T +Y+ + G P S L C+ + EVN
Sbjct: 285 IFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSLNEVNNYF 344
Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQ 463
+ + F + + + + Y + + VCL + + +E G ++G+ +++
Sbjct: 345 KPLTLSFTKSKNLQFQLPPVAYLIITKYGNVCLGILN---GNEAGLGNRNVVGDISLQDK 401
Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
V+YD + Q+G+A +C +
Sbjct: 402 VVVYDNEKQQIGWASANCKQI 422
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 166/376 (44%), Gaps = 40/376 (10%)
Query: 132 TLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPV-FDPSISPSYKKVLC 188
++ I ++ +G T +++DTGS L+W+QC FDPS+S S+ + C
Sbjct: 77 SMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPC 136
Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIF 247
N C C + C+Y Y DG+Y G L RE + + S I
Sbjct: 137 NHPLCKPRIPDFTLPTTCDQNR--LCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLIL 194
Query: 248 GCGR---NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ-DAG--ASGSL 301
GC + KG+ G M LGR + ++ S+ FSYC+P+ Q AG ++GS
Sbjct: 195 GCAEASTDEKGILG-----MNLGRRSFASQAKISK-----FSYCVPTRQARAGLSSTGSF 244
Query: 302 ILGGN--SSVFKNSTPITYTNMIPNPQLATF-YILNLTGISIGGKQLQASGF-------A 351
LG N S F+ +T+T +P L Y + + GI +G +L S
Sbjct: 245 YLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSG 304
Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNI 408
G +IDSG+ T L Y+ ++ E ++ G G+ + D CF+ + E+
Sbjct: 305 AGQTIIDSGSEFTYLVDEAYNKVREEVVR-LVGPKLKKGYVYGGVSDMCFDGNP-MEIGR 362
Query: 409 PLVKM--EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
+ M EFE E+ +D ++ V + + + + + IIGN+ Q+N V
Sbjct: 363 LIGNMVFEFEKGVEIVIDKWRVLADVGGGVHCIGIGRSEM-LGAASNIIGNFHQQNLWVE 421
Query: 467 YDTKNSQLGFAGEDCS 482
YD N ++G DCS
Sbjct: 422 YDLANRRIGLGKADCS 437
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 123/411 (29%), Positives = 178/411 (43%), Gaps = 79/411 (19%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN--------QQDPVFDPSISPSYK 184
Y ++ LG + + V++DTGS L+WV C C N VF P S S +
Sbjct: 91 YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150
Query: 185 KVLCNSSTCHALEF-------ATGNSG---VCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
V C + C + +TGN+G VC PP Y V YG GS T G L +
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVC----PP---YLVVYGSGS-TSGLLISDT 202
Query: 235 LGLGKAS-------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
L L +S +F GC + + SGL G GR S+ SQ FSY
Sbjct: 203 LRLSPSSSSSAPAPFRNFAIGC--SIVSVHQPPSGLAGFGRGAPSVPSQLKVP---KFSY 257
Query: 288 CLPSTQ---DAGASGSLILG-GNSSVFKNSTPITYTNMIPN----PQLATFYILNLTGIS 339
CL S + ++ SG L+LG K T + Y ++ N P + +Y L LTGIS
Sbjct: 258 CLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGIS 317
Query: 340 IGGK--QLQASGFAK---GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF-----PSAP 389
+GGK L + F GG +IDSGT T L P+++ + A G P
Sbjct: 318 VGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVED 377
Query: 390 GFSILDTCFNLSAYQ--EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ--------V 439
L CF L + +P ++++F+G A M + V YFV + + +
Sbjct: 378 ALG-LRPCFALPPGPGGAMELPDLELKFKGGAVMRLPVEN--YFVAAGPAGGPAAGPVAI 434
Query: 440 CLALAS--------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
CLA+ S + I+G++QQ+N + YD +LGF + C+
Sbjct: 435 CLAVVSDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPCA 485
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 167/378 (44%), Gaps = 52/378 (13%)
Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVL 187
Y ++LG R V +DTGSD+ WV C PC C + + +FD + S S + +
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG----LGKASVN 243
C C A+ T C + + C+Y Y D S T G + + LG++++
Sbjct: 144 CTDPICAAVSTTTDQ---CLTQT-DHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIA 199
Query: 244 D----FIFGCG--------RNNKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCL 289
+ +FGC R K L G+ G G+ + S++SQ S I +FS+CL
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKAL----DGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-QAS 348
++ G G L+LG + + S I Y+ +IP+ Y L L I++ G+ +
Sbjct: 256 KGGENGG--GILVLG---EILEPS--IVYSPLIPS---QPHYTLKLQSIALSGQLFPNPT 305
Query: 349 GFA---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE 405
F G +IDSGT + L +Y + + S + P S CF +S
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQ-SATPTISRGSQCFRVSMSVA 364
Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYF--VKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
P+++ FEG A M V + F + + + C+ ED I+G+ K++
Sbjct: 365 DIFPVLRFNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKA--EDGLNILGDLVLKDK 422
Query: 464 RVIYDTKNSQLGFAGEDC 481
++YD ++G+A DC
Sbjct: 423 IIVYDLARQRIGWANYDC 440
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,647,504,905
Number of Sequences: 23463169
Number of extensions: 335431734
Number of successful extensions: 859219
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 894
Number of HSP's successfully gapped in prelim test: 2806
Number of HSP's that attempted gapping in prelim test: 849690
Number of HSP's gapped (non-prelim): 4478
length of query: 484
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 337
effective length of database: 8,910,109,524
effective search space: 3002706909588
effective search space used: 3002706909588
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)