BLASTP 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 010525
(508 letters)

Database: nr
23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/505 (70%), Positives = 412/505 (81%), Gaps = 25/505 (4%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVS-KNRNATSWPAKKSFEYYQVLL 66
+++ V L AE V FS++LIHRFS+EVKAL VS K+ + SWP KKS +YYQ+L+
Sbjct: 18 LFILVMASLLIDKSAE-VTFSSRLIHRFSDEVKALRVSRKDSLSYSWPEKKSMDYYQILV 76
Query: 67 SSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFG-----------------------CDL 103
+SD Q+QKMK GPQ+Q LFPSQGSKTMSLG+DFG DL
Sbjct: 77 NSDFQRQKMKLGPQYQFLFPSQGSKTMSLGDDFGWLHYTWIDIGTPHVSFLVALDAGSDL 136
Query: 104 LWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 163
LW+PCDC++CAPLSASYY+SLDRDLNEYSPS SSTSKHLSCSH+LC+LG +C +PKQPCP
Sbjct: 137 LWVPCDCLQCAPLSASYYSSLDRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCP 196
Query: 164 YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 223
Y+MDYYTENTSSSGLLVEDILHL S GDNAL SV+A V+IGCGMKQSGGYLDGVAPDGL
Sbjct: 197 YSMDYYTENTSSSGLLVEDILHLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGVAPDGL 256
Query: 224 IGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYIT 283
+GLGL EISVPS LAKAGLIRNSFSMCFD+DDSGRIFFGDQGP TQQST FL +G Y T
Sbjct: 257 MGLGLAEISVPSFLAKAGLIRNSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLDGNYTT 316
Query: 284 YIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 343
Y++GVE C+GSSCLKQTSF+A+VD+G+SFTFLP VYE I EFDRQVN TI+SF GYP
Sbjct: 317 YVVGVEGFCVGSSCLKQTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATISSFNGYP 376
Query: 344 WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTI 403
WK CYKSSS L K+PSVKL+FP NNSFV++NPVF+IYG Q +TGFCLAIQP +GDIGTI
Sbjct: 377 WKYCYKSSSNHLTKVPSVKLIFPLNNSFVIHNPVFMIYGIQGITGFCLAIQPTEGDIGTI 436
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGH 463
GQNFM GYRVVFDREN+KLGWSHS+C+D ++ + PLT GT NPLP N++QSSPGGH
Sbjct: 437 GQNFMAGYRVVFDRENMKLGWSHSSCEDRSNDKRMPLTSPNGTLVNPLPTNEQQSSPGGH 496
Query: 464 AVGPAVAGRAPSKPSTASTQLISSR 488
AV PAVAGRAPSKPS A+ QL+ SR
Sbjct: 497 AVSPAVAGRAPSKPSAAAVQLLPSR 521
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/479 (69%), Positives = 388/479 (81%), Gaps = 26/479 (5%)
Query: 22 AETVMFSTKLIHRFSEEVKALGVSK--NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGP 79
E FS++LIHRFS+E K + VS+ + N T WP KKS EYYQ+L+SSD+++QK+K GP
Sbjct: 15 VELATFSSRLIHRFSKEYKEVSVSRGGDVNGTWWPEKKSKEYYQILVSSDLKRQKLKLGP 74
Query: 80 QFQMLFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPL 116
+Q+LFPSQGSKTMSLGNDFG DL W+PCDCV+CAPL
Sbjct: 75 HYQLLFPSQGSKTMSLGNDFGWLHYTWIDIGTPHVSFMVALDSGSDLFWVPCDCVQCAPL 134
Query: 117 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 176
SAS+Y+SLDRDL+EYSPS SSTSK LSCSHRLCD+G +C+NPKQ CPY+++YYTE+TSSS
Sbjct: 135 SASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSS 194
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
GLLVEDI+HL SGGD+ L SV+A VIIGCGMKQSGGYLDGVAPDGL+GLGL EISVPS
Sbjct: 195 GLLVEDIIHLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDGVAPDGLLGLGLQEISVPSF 254
Query: 237 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 296
LAKAGLI+NSFSMCF++DDSGRIFFGDQGPATQQS FL NG Y TYI+GVE CC+G+S
Sbjct: 255 LAKAGLIQNSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKLNGNYTTYIVGVEVCCVGTS 314
Query: 297 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 356
CLKQ+SF A+VDSG+SFTFLP +V+E IA EFD QVN + +SFEGY WK CYK+SSQ LP
Sbjct: 315 CLKQSSFSALVDSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYKTSSQDLP 374
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
K+PS++L+FPQNNSF+V NPVF+IYG Q V GFCLAIQP DGDIGTIGQNFM GYRVVFD
Sbjct: 375 KIPSLRLIFPQNNSFMVQNPVFMIYGIQGVIGFCLAIQPADGDIGTIGQNFMMGYRVVFD 434
Query: 417 RENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPS 475
RENLKLGWS SNC+ PLTP GTP NPLP N++QS+PGGHAV PAVA APS
Sbjct: 435 RENLKLGWSRSNCEFSGISYTLPLTPS-GTPQNPLPTNEQQSTPGGHAVSPAVAVNAPS 492
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 684 bits (1764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/505 (65%), Positives = 398/505 (78%), Gaps = 26/505 (5%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLS 67
+ ++V LL ES A MFS +LIHRFS+EVKA +++ + SWP ++ EYY++L+
Sbjct: 7 VAMSVVVLLIESCMA--AMFSARLIHRFSDEVKAFRAARSGLSGSWPEWRTMEYYKMLVR 64
Query: 68 SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFG-----------------------CDLL 104
SD ++QK+ G ++Q LFPS+GSKTMS GND+G DLL
Sbjct: 65 SDWERQKVMLGSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLL 124
Query: 105 WIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPY 164
WIPCDC++CAPLSASYY SLDRDLN+YSPS SSTSKHLSCSH+LC+ +C +PKQ CPY
Sbjct: 125 WIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPY 184
Query: 165 TMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLI 224
T++YY+ENTSSSGLL+EDILHL SG D+A +SV+A VIIGCGM+Q+GGYLDGVAPDGL+
Sbjct: 185 TINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLM 244
Query: 225 GLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITY 284
GLGLGEISVPS L+KAGL++NSFS+CF+ DDSGRIFFGDQG ATQQ+T FL S+GKY TY
Sbjct: 245 GLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETY 304
Query: 285 IIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 344
I+GVE CCIGSSC+KQTSF+A+VDSG+SFTFLP E Y + EFD+QVN T SFEGYPW
Sbjct: 305 IVGVEACCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPW 364
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIG 404
+ CYKSSS+ L K PSV L F NNSFVV+NPVFV++G Q V GFCLAIQP DGDIG +G
Sbjct: 365 EYCYKSSSKELLKNPSVILKFALNNSFVVHNPVFVVHGYQGVVGFCLAIQPADGDIGILG 424
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPG-TPSNPLPANQEQSSPGGH 463
QNFMTGYR+VFDRENLKLGWS SNCQDL DG + PLTP P P NPLPAN++Q++ GH
Sbjct: 425 QNFMTGYRMVFDRENLKLGWSRSNCQDLTDGERMPLTPSPNDRPPNPLPANEQQNTHSGH 484
Query: 464 AVGPAVAGRAPSKPSTASTQLISSR 488
+ PAVAGRAPS PS ASTQLI S+
Sbjct: 485 TITPAVAGRAPSNPSAASTQLILSQ 509
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/489 (66%), Positives = 389/489 (79%), Gaps = 24/489 (4%)
Query: 24 TVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQM 83
MFS +LIHRFS+EVKA +++ + SWP ++ EYY++L+ SD ++QK+ G ++Q
Sbjct: 2 AAMFSARLIHRFSDEVKAFRAARSGLSGSWPEWRTMEYYKMLVRSDWERQKVMLGSKYQF 61
Query: 84 LFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASY 120
LFPS+GSKTMS GND+G DLLWIPCDC++CAPLSASY
Sbjct: 62 LFPSEGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASY 121
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 180
Y SLDRDLN+YSPS SSTSKHLSCSH+LC+ +C +PKQ CPYT++YY+ENTSSSGLL+
Sbjct: 122 YGSLDRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLI 181
Query: 181 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
EDILHL SG D+A +SV+A VIIGCGM+Q+GGYLDGVAPDGL+GLGLGEISVPS L+KA
Sbjct: 182 EDILHLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKA 241
Query: 241 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 300
GL++NSFS+CF+ DDSGRIFFGDQG ATQQ+T FL S+GKY TYI+GVE CCIGSSC+KQ
Sbjct: 242 GLVKNSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQ 301
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 360
TSF+A+VDSG+SFTFLP E Y + EFD+QVN T SFEGYPW+ CYKSSS+ L K PS
Sbjct: 302 TSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNPS 361
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
V L F NNSFVV+NPVFV++G Q V GFCLAIQP DGDIG +GQNFMTGYR+VFDRENL
Sbjct: 362 VILKFALNNSFVVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRENL 421
Query: 421 KLGWSHSNCQDLNDGTKSPLTPGPG-TPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPST 479
KLGWS SNCQDL DG + PLTP P P NPLPAN++Q++ GH + PAVAGRAPS PS
Sbjct: 422 KLGWSRSNCQDLTDGERMPLTPSPNDRPPNPLPANEQQNTHSGHTITPAVAGRAPSNPSA 481
Query: 480 ASTQLISSR 488
ASTQLI S+
Sbjct: 482 ASTQLILSQ 490
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/504 (63%), Positives = 397/504 (78%), Gaps = 28/504 (5%)
Query: 24 TVMFSTKLIHRFSEEVKALGVSKNRNAT---SWPAKKSFEYYQVLLSSDVQKQKMKTGPQ 80
++ F+++++HRFSEE+KAL S + N + SWP K S EYYQ L+S D ++QKMK G +
Sbjct: 21 SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKLGSR 80
Query: 81 FQMLFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLS 117
FQ+LFPS+GSKT++LGNDFG DLLW+PC+C++CAPLS
Sbjct: 81 FQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLS 140
Query: 118 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSG 177
ASYY SLD+DLNEY PS+SSTSKH+SCSH LCD G SCQ+PKQ CPY +DY TENTSSSG
Sbjct: 141 ASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSG 200
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
LL++D+LHL SG +N+ ++QA VI+GCGMKQSGGYL GVAPDGL GLGLGEISV S L
Sbjct: 201 LLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSL 260
Query: 238 AKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 297
AK L++NSFS+CF++D SGRIFFGD+GPA+QQ+TSF+ +GKY TYI+GVE CCI +SC
Sbjct: 261 AKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSC 320
Query: 298 LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYKSSSQRLP 356
LKQTSFKA++DSG+SFT+LP+E YE I EFD+++N T SF+GYPWK CYK S+ +P
Sbjct: 321 LKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMP 380
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
K+PSV L+FP NNSFVV++PVF IYG Q + GFC AI P DGDIG +GQN+MTGYR+VFD
Sbjct: 381 KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFD 440
Query: 417 RENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSK 476
R+NLKLGWSH+NCQDL++ K PLTP TP NPLPA+++QS+ GGHAV PAVAGRAPSK
Sbjct: 441 RDNLKLGWSHANCQDLSNEKKMPLTPAKETPPNPLPADEQQSASGGHAVAPAVAGRAPSK 500
Query: 477 PSTASTQLISSRSSSLKVLPFLLL 500
PS A+ I SR S++ LP LLL
Sbjct: 501 PSAATPCFIPSRFYSIR-LPHLLL 523
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 616 bits (1589), Expect = e-174, Method: Compositional matrix adjust.
Identities = 312/489 (63%), Positives = 376/489 (76%), Gaps = 35/489 (7%)
Query: 25 VMFSTKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQKMKTG-PQFQ 82
+ FS +L+HRF++E+K + R T WP ++S YYQ+LL+ D+ ++K+K G ++Q
Sbjct: 22 ITFSARLVHRFADEMKPV-----RPPTGYWPDQRSMRYYQMLLTGDILRRKIKVGGTRYQ 76
Query: 83 MLFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSAS 119
+LFPS GSKTMSLGNDFG DLLWIPCDCV+CAPLS+S
Sbjct: 77 LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSS 136
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 179
YY++LDRDLNEYSPS S +SKHLSCSHRLCD G++C++ +Q CPY + Y +ENTSSSGLL
Sbjct: 137 YYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLL 196
Query: 180 VEDILHLISGGDNALKNS-VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 238
VEDILHL SGG L NS VQA V++GCGMKQSGGYLDGVAPDGL+GLG GE SVPS LA
Sbjct: 197 VEDILHLQSGG--TLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLA 254
Query: 239 KAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
K+GLI SFS+CF++DDSGR+FFGDQGP +QQSTSFL +G Y TYIIGVE+CCIG+SCL
Sbjct: 255 KSGLIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCL 314
Query: 299 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
K TSFKA VDSG+SFTFLP VY I EFD+QVN + +SFEG PW+ CY SSQ LPK+
Sbjct: 315 KMTSFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWEYCYVPSSQDLPKV 374
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
PS LMF +NNSFVV +PVFV YG + V GFCLAI P +GD+GTIGQNFMTGYR+VFDR
Sbjct: 375 PSFTLMFQRNNSFVVYDPVFVFYGNEGVIGFCLAILPTEGDMGTIGQNFMTGYRLVFDRG 434
Query: 419 NLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPS 478
N KL WS SNCQDL+ G + PL+P T SNPLP +++Q + GHAV PAVAGRAP KPS
Sbjct: 435 NKKLAWSRSNCQDLSLGKRMPLSPNE-TSSNPLPTDEQQRT-NGHAVAPAVAGRAPHKPS 492
Query: 479 TASTQLISS 487
AS+++ISS
Sbjct: 493 AASSRMISS 501
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 310/489 (63%), Positives = 376/489 (76%), Gaps = 35/489 (7%)
Query: 25 VMFSTKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQKMKTG-PQFQ 82
+ FS +L+HRF++E+K + R T WP + S YY++LL+ D+ ++K+K G ++Q
Sbjct: 21 ITFSARLVHRFADEMKPV-----RPPTGYWPDRWSMGYYRMLLTGDILRRKIKVGGARYQ 75
Query: 83 MLFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSAS 119
+LFPS GSKTMSLGNDFG DLLWIPCDCV+CAPLS+S
Sbjct: 76 LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSS 135
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 179
YY++LDRDLNEYSPS S +SKHLSCSH+LCD G++C++ +Q CPY + Y +ENTSSSGLL
Sbjct: 136 YYSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLL 195
Query: 180 VEDILHLISGGDNALKNS-VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 238
VEDILHL SGG +L NS VQA V++GCGMKQSGGYLDGVAPDGL+GLG GE SVPS LA
Sbjct: 196 VEDILHLQSGG--SLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLA 253
Query: 239 KAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
K+GLI +SFS+CF++DDSGRIFFGDQGP QQSTSFL +G Y TYIIGVE+CC+G+SCL
Sbjct: 254 KSGLIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCL 313
Query: 299 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
K TSFK VDSG+SFTFLP VY IA EFD+QVN + +SFEG PW+ CY SSQ LPK+
Sbjct: 314 KMTSFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPSSQELPKV 373
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
PS+ L F QNNSFVV +PVFV YG + V GFCLAIQP +GD+GTIGQNFMTGYR+VFDR
Sbjct: 374 PSLTLTFQQNNSFVVYDPVFVFYGNEGVIGFCLAIQPTEGDMGTIGQNFMTGYRLVFDRG 433
Query: 419 NLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPS 478
N KL WS SNCQDL+ G + PL+P T SNPLP +++Q + GHAV PAVAGRAP KPS
Sbjct: 434 NKKLAWSRSNCQDLSLGKRMPLSPNE-TSSNPLPTDEQQRT-NGHAVAPAVAGRAPHKPS 491
Query: 479 TASTQLISS 487
A +++ISS
Sbjct: 492 AAPSRMISS 500
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 291/504 (57%), Positives = 365/504 (72%), Gaps = 29/504 (5%)
Query: 5 SLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRN--ATSWPAKKSFEYY 62
SL L + L+ +++ A V FS+KLIHRFS+E KA VS+N N A SWP K+SF+YY
Sbjct: 5 SLIPLLMAYLLVVDAAIA--VTFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYY 62
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFG---------------------- 100
++LLSSD+++QK+K G ++Q+LFPS+GS + LGN+FG
Sbjct: 63 RLLLSSDLKRQKLKLGAEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDA 122
Query: 101 -CDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 159
DLLW+PCDC++CAPLSASYY+ L RDLNEYSPS SSTSK LSC+ +LC+LG+ C++ K
Sbjct: 123 GSDLLWVPCDCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSK 182
Query: 160 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 219
PCPY YY+ENTSSSGLL+ED LHL ++A ++SV ASVIIGCG KQSG + DG A
Sbjct: 183 DPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAA 242
Query: 220 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG 279
PDGL+GLG G++SVPSLLAKAGL+RN+FS+CFD + SG I FGDQG TQ+STSF+ G
Sbjct: 243 PDGLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEG 302
Query: 280 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
K++TY+I VE +GSS LK F+A+VDSG+SFTFLP E+YE I EFD+QVN T +SF
Sbjct: 303 KFVTYLIEVEGYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSF 362
Query: 340 EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF-VIYGTQVVTGFCLAIQPVDG 398
+G PWK CY SSSQ L +P+V L+F N SF+V+NPV +I + FCL IQP+
Sbjct: 363 KGSPWKYCYNSSSQELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHE 422
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPS-NPLPANQEQ 457
+ G IGQNFM GYR+VFDRENLKLGWS SNCQD+ DG LTP P S NPLP NQ+Q
Sbjct: 423 EFGIIGQNFMWGYRMVFDRENLKLGWSTSNCQDITDGKIMHLTPPPNDRSPNPLPTNQQQ 482
Query: 458 SSPGGHAVGPAVAGRAPSKPSTAS 481
+P HAV PAVAGR P+K + S
Sbjct: 483 MTPSRHAVAPAVAGRTPAKSAAVS 506
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 580 bits (1496), Expect = e-163, Method: Compositional matrix adjust.
Identities = 298/482 (61%), Positives = 361/482 (74%), Gaps = 30/482 (6%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTG-PQFQMLF 85
FS KL HRFSEE+K + V WP +++ Y++ LL +D + K+ G + ++LF
Sbjct: 27 FSVKLFHRFSEEMKPVQVQTG----DWPDRRTLHYHEKLLRNDFLRHKINLGGARHKLLF 82
Query: 86 PSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYN 122
PSQGSKTMS GNDFG DLLW+PCDC+ CAPLSAS+Y+
Sbjct: 83 PSQGSKTMSFGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCIHCAPLSASFYS 142
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQP-CPYTMDYYTENTSSSGLLVE 181
+LDRDLNEYSPS S +SKHLSCSHRLCD+G++C+ KQ CPYT++Y ++NTSSSGLLVE
Sbjct: 143 NLDRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVE 202
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
DI HL SG + +SVQA V++GCGMKQSGGYLDG APDGLIGLG GE SVPS LAK+G
Sbjct: 203 DIFHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLAKSG 262
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
LIR+SFS+CF++DDSGR+FFGDQG QQST FL +G + TYI+GVETCCIG+SC K T
Sbjct: 263 LIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVT 322
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
SF A DSG+SFTFLP Y IA EFD+QVN T ++F+G PW+ CY SSQ+LPK+P++
Sbjct: 323 SFNAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGSPWEYCYVPSSQQLPKIPTL 382
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
LMF QNNSFVV NPVFV Y Q V GFCLAIQP +G +GTIGQNFMTGYR+VFDREN K
Sbjct: 383 TLMFQQNNSFVVYNPVFVSYNEQGVDGFCLAIQPTEGGMGTIGQNFMTGYRLVFDRENKK 442
Query: 422 LGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAS 481
L WSHSNCQDL+ G + PL+P GT S+ LPA+++Q + GHAV PAVA RAP KPS AS
Sbjct: 443 LAWSHSNCQDLSLGKRMPLSPPNGTSSSQLPADEQQRTK-GHAVAPAVAVRAPQKPSVAS 501
Query: 482 TQ 483
+Q
Sbjct: 502 SQ 503
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 287/488 (58%), Positives = 356/488 (72%), Gaps = 27/488 (5%)
Query: 21 GAETVMFSTKLIHRFSEEVKALGVSKNRN--ATSWPAKKSFEYYQVLLSSDVQKQKMKTG 78
A V FS+KLIHRFS+E KA VS+N N A SWP K+SF+YY++LLSSD+++QK+K G
Sbjct: 9 AAIAVTFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLG 68
Query: 79 PQFQMLFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAP 115
++Q+LFPS+GS + LGN+FG DLLW+PCDC++CAP
Sbjct: 69 AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAP 128
Query: 116 LSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSS 175
LSASYY+ L RDLNEYSPS SSTSK LSC+ +LC+LG+ C++ K PCPY YY+ENTSS
Sbjct: 129 LSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSS 188
Query: 176 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 235
SGLL+ED LHL ++A ++SV ASVIIGCG KQSG + DG APDGL+GLG G++SVPS
Sbjct: 189 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 248
Query: 236 LLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS 295
LLAKAGL+RN+FS+CFD + SG I FGDQG TQ+STSF+ GK++TY+I VE +GS
Sbjct: 249 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS 308
Query: 296 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 355
S LK F+A+VDSG+SFTFLP E+YE I EFD+QVN T +SF+G PWK CY SSSQ L
Sbjct: 309 SSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQEL 368
Query: 356 PKLPSVKLMFPQNNSFVVNNPVF-VIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVV 414
+P+V L+F N SF+V+NPV +I + FCL IQP+ + G IGQNFM GYR+V
Sbjct: 369 LNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMV 428
Query: 415 FDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPS-NPLPANQEQSSPGGHAVGPAVAGRA 473
FDRENLKLGWS SNCQD+ DG LTP P S NPLP NQ+Q +P HAV PAVAGR
Sbjct: 429 FDRENLKLGWSTSNCQDITDGKIMHLTPPPNDRSPNPLPTNQQQMTPSRHAVAPAVAGRT 488
Query: 474 PSKPSTAS 481
P+K + S
Sbjct: 489 PAKSAAVS 496
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 273/486 (56%), Positives = 352/486 (72%), Gaps = 29/486 (5%)
Query: 25 VMFSTKLIHRFSEEVKALGVSKNRNATS--WPAKKSFEYYQVLLSSDVQKQKMKTGPQF- 81
+ FS+KLIHRFS+E K++ +S+ NA+ WP + SFEY+Q+LL +D+++Q+MK G Q
Sbjct: 26 LTFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDLKRQRMKLGSQKN 85
Query: 82 QMLFPSQGSKTMSLGN-----------------------DFGCDLLWIPCDCVRCAPLSA 118
Q+LFPSQGS+ + GN D G DLLW+PCDC++CAPLSA
Sbjct: 86 QLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCAPLSA 145
Query: 119 SYYN-SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT-ENTSSS 176
SYYN SLDRDL+EYSPS SSTS+HLSC H+LC+ G++C+NPK PCPY +Y ENT+S+
Sbjct: 146 SYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFENTTSA 205
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
G LVED LHL S GD+ + +QASV++GCG KQ G + DG APDG++GLG G+ISVPSL
Sbjct: 206 GFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSL 265
Query: 237 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 296
LAKAGLI+N FS+CFD++DSGRI FGD+G A+QQST FL G Y+ Y +GVE+ C+G+S
Sbjct: 266 LAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGNS 325
Query: 297 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 356
CLK++ FKA+VDSGSSFT+LP EVY + +EFD+QVN SF+ W CY +SSQ L
Sbjct: 326 CLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYNASSQELH 385
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
+P+++L FP+N +FVV+NP + I Q T FCL++QP DG G IGQNFM GYR+VFD
Sbjct: 386 DIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGSYGIIGQNFMIGYRMVFD 445
Query: 417 RENLKLGWSHSNCQDLNDGTKSPLTPGPGTPS-NPLPANQEQSSPGGHAVGPAVAGRAPS 475
ENLKLGWS+S+CQD +D L P P S NPLP N++QS P +V PAVAGR S
Sbjct: 446 IENLKLGWSNSSCQDTSDSADVHLAPPPDNKSPNPLPTNEQQSIPRTPSVAPAVAGRTSS 505
Query: 476 KPSTAS 481
+ S AS
Sbjct: 506 ESSAAS 511
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 276/505 (54%), Positives = 346/505 (68%), Gaps = 30/505 (5%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATS--WPAKKSFEYYQVL 65
+++ F L+ S T FS+KLIHRFSEE K+L +S N N +S WP K SF+Y Q+L
Sbjct: 7 LFVICFCFLSNHSIGLT--FSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLL 64
Query: 66 LSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDF-----------------------GCD 102
L +D+++QKMK G Q Q+LFPS GS T GND G D
Sbjct: 65 LDNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSD 124
Query: 103 LLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPC 162
L W+PCDC++CAPLSAS Y LDRDL+EY PS S+TS+HLSC+H+LC+LG+ C+N K PC
Sbjct: 125 LSWVPCDCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPC 184
Query: 163 PYTMDYYTENTSSSGLLVEDILHLISGGD--NALKNSVQASVIIGCGMKQSGGYLDGVAP 220
PY DY NTSSSG LVEDILHL S D N+ + VQASVI+GCG KQ+GGYLDG AP
Sbjct: 185 PYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAP 244
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGK 280
DG++GLG G ISVPSLLAKAGLIR SFS+CFD + SG I FGDQG +Q+ST L + G
Sbjct: 245 DGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGN 304
Query: 281 YITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
Y Y+I VE+ C+G+SCLKQ+ FKA+VDSG+SFT+LP +VY I EFD+QVN S +
Sbjct: 305 YDAYLIEVESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQ 364
Query: 341 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDI 400
G PW CY +SS++L +P+++L F N S +++N + + Q FCL +QP D +
Sbjct: 365 GGPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNY 424
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPS-NPLPANQEQSS 459
G IGQN+MTGYRVVFD ENLKLGWS SNC+D++D T+ L P P S NPLP N++QS
Sbjct: 425 GIIGQNYMTGYRVVFDMENLKLGWSSSNCKDISDETEVTLAPSPNDQSPNPLPTNEQQSV 484
Query: 460 PGGHAVGPAVAGRAPSKPSTASTQL 484
P V PAVAGR SK S AS +
Sbjct: 485 PNKQGVAPAVAGRTSSKHSVASQHI 509
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 268/494 (54%), Positives = 352/494 (71%), Gaps = 38/494 (7%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLS 67
+ V +L TE + A +FS++LIHRFS+E +A + ++ S P K+S EYY++L
Sbjct: 8 LLFCVLFLATEETLAS--LFSSRLIHRFSDEGRA-SIKTPSSSDSLPNKQSLEYYRLLAE 64
Query: 68 SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDF-----------------------GCDLL 104
SD ++Q+M G + Q L PS+GSKT+S GNDF G +LL
Sbjct: 65 SDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLL 124
Query: 105 WIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 163
WIPC+CV+CAPL+++YY+SL +DLNEY+PS+SSTSK CSH+LCD + C++PK+ CP
Sbjct: 125 WIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCP 184
Query: 164 YTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SVQASVIIGCGMKQSGGYLDGVAP 220
YT++Y + NTSSSGLLVEDILHL +N L N SV+A V+IGCG KQSG YLDGVAP
Sbjct: 185 YTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAP 244
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA-SNG 279
DGL+GLG EISVPS L+KAGL+RNSFS+CFD++DSGRI+FGD GP+ QQST FL N
Sbjct: 245 DGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNN 304
Query: 280 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
KY YI+GVE CCIG+SCLKQTSF +DSG SFT+LP+E+Y +A E DR +N T +F
Sbjct: 305 KYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNF 364
Query: 340 EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD 399
EG W+ CY+SS++ PK+P++KL F NN+FV++ P+FV +Q + FCL I P +
Sbjct: 365 EGVSWEYCYESSAE--PKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQE 422
Query: 400 -IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSP-LTPGPGTPSNPLPANQEQ 457
IG+IGQN+M GYR+VFDREN+KLGWS S CQ+ D + P +PG + NPLP +++Q
Sbjct: 423 GIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKIEPPQASPGSTSSPNPLPTDEQQ 480
Query: 458 SSPGGHAVGPAVAG 471
S GGHAV PA+AG
Sbjct: 481 SR-GGHAVSPAIAG 493
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 266/520 (51%), Positives = 359/520 (69%), Gaps = 37/520 (7%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
M S I L + L++E S A +FS++LIHRFS+E G + ++ S+P K+SFE
Sbjct: 1 MASRSAFILLFILSLVSEKSLAS--LFSSRLIHRFSDE----GRASIKSPGSFPEKRSFE 54
Query: 61 YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFG-------------------- 100
YY++L S D ++QKM G +FQ L PS+GSKT+S GN FG
Sbjct: 55 YYRLLTSIDSRRQKMNLGAKFQSLVPSEGSKTISPGNYFGWLHYTWIDIGTPSVSFLVAL 114
Query: 101 ---CDLLWIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
DLLWIPC+CV+CAPLS++YY+SL +DLNE+ PSAS+TSK CSH+LC+ +C+
Sbjct: 115 DSGSDLLWIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACE 174
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+PK+ CPYT+ Y +ENTSSSGLLVED+LHL + + +SV+A V++GCG KQSG +L
Sbjct: 175 SPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSANAS--SSVKARVVVGCGEKQSGEFLK 232
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 276
G+APDG++GLG GEISVPS LAKAGL+RNSFSMCFD++DSGRI+FGD GP+TQQST FL
Sbjct: 233 GIAPDGVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLP 292
Query: 277 SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 336
+++ Y +GVE CC+G+SCLKQ+SF ++DSG SFTFLP+E+Y +A E D +N T+
Sbjct: 293 YKNEFVAYFVGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATV 352
Query: 337 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
EG PW+ CY++S + PK+P++KL F NN+FV++ P+FV+ ++ + FCL I
Sbjct: 353 KKIEGGPWEYCYETSFE--PKVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQFCLPISAS 410
Query: 397 -DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQ 455
+G G IGQN+M GYR+VFDREN+KLGWS S CQ+ +PG + NPLP +
Sbjct: 411 EEGTGGVIGQNYMAGYRIVFDRENMKLGWSASKCQEDKIAPPQEASPGSTSSPNPLPTEE 470
Query: 456 EQSSPGGHAVGPAVAGRAPSKPSTASTQLISSRSSSLKVL 495
+QS HAV PA+AG+ PSK S+AS S R S +L
Sbjct: 471 QQSRT--HAVSPAIAGKTPSKTSSASCCFSSMRLLSSSIL 508
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 255/497 (51%), Positives = 337/497 (67%), Gaps = 32/497 (6%)
Query: 21 GAETVMFSTKLIHRFSEEVKALGVSKNRNAT----SWPAKKSFEYYQVLLSSDVQKQKMK 76
GA V FS++LIHRFSEE KA S+ + + +WP + S EY+++LL SDV +Q+M+
Sbjct: 19 GAVGVTFSSRLIHRFSEEAKAHLASRGSDGSVLLQAWPERNSSEYFRLLLRSDVTRQRMR 78
Query: 77 TGPQFQMLFPSQGSKTMSLGN-----------------------DFGCDLLWIPCDCVRC 113
G Q++ML+P +G +T GN D G D+LW+PCDC+ C
Sbjct: 79 LGSQYEMLYPFEGGQTFLFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIEC 138
Query: 114 APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENT 173
A LSA YN LDRDLN+Y PS S+TS+HL C H+LCD+ + C+ K PCPY + Y + NT
Sbjct: 139 ASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKGSKDPCPYAVQYSSANT 198
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
SSSG + ED LHL S G +A +NSVQAS+I+GCG KQ+G YL G PDG++GLG G ISV
Sbjct: 199 SSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRGAGPDGVLGLGPGNISV 258
Query: 234 PSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 293
PSLLAKAGLI+NSFS+CF++++SGRI FGDQG TQ ST FL +GK+ YI+GVE+ C+
Sbjct: 259 PSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIVGVESFCV 318
Query: 294 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 353
GS CLK+T F+A++DSGSSFTFLP EVY+ + EFD+QVN T + W+ CY +SSQ
Sbjct: 319 GSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSIVLQN-SWEYCYNASSQ 377
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRV 413
L +P + L F +N ++++ NP+F+ +Q T FCL + P D D IGQNF+ GYR+
Sbjct: 378 ELISIPPLNLAFSRNQTYLIQNPIFIDPASQEYTIFCLPVSPSDDDYAAIGQNFLMGYRM 437
Query: 414 VFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRA 473
VFDRENL+ WS NCQD SP + G+P NPLP +Q+QS P H + PA+AG
Sbjct: 438 VFDRENLRFSWSRWNCQD-RASFSSPYS--VGSP-NPLPVDQQQSFPNAHGIPPAIAGHT 493
Query: 474 PSKPSTASTQLISSRSS 490
KPS A+ +LI+SR S
Sbjct: 494 SPKPSAATPELITSRHS 510
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 264/498 (53%), Positives = 343/498 (68%), Gaps = 37/498 (7%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
M S+ I V +L TE + A +FS+++IHRFS+E +A + ++ S P K+S E
Sbjct: 1 MASRSVFILFCVLFLATEETLAS--VFSSRMIHRFSDEGRA-SIRTPSSSESLPEKQSLE 57
Query: 61 YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFG-------------------- 100
YY++L SD ++Q+M G +FQ L PS+GSKT+S GNDFG
Sbjct: 58 YYRLLAKSDFRRQRMNLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVAL 117
Query: 101 ---CDLLWIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
DLLWIPC+CV+CAPL+++YY+SL +DLNEY+PS+SSTSK CSH+LCD + C+
Sbjct: 118 DTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCE 177
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SVQASVIIGCGMKQSGG 213
+PK+ CPYT++Y + NTSSSGLLVEDILHL +N L N SV+A V+IGCG KQSG
Sbjct: 178 SPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGD 237
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
YLDGVAPDGL+GLG EISVPS L+KAGL+RNSFS+CFD++DSGRI+FGD GP+ QQST
Sbjct: 238 YLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTP 297
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 333
FL YI+GVE CCIG+SCLKQTSF +DSG SFT+LP+E+Y +A E DR +N
Sbjct: 298 FLQLENNS-GYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHIN 356
Query: 334 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
T SFEG W+ CY+SS + PK+P++KL F NN+FV++ P+FV +Q + FCL I
Sbjct: 357 ATSKSFEGVSWEYCYESSVE--PKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPI 414
Query: 394 QPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLP 452
P + IG+IGQN+M GYR+VFDREN+KL WS S CQ + P PG+ S+P P
Sbjct: 415 SPSGQEGIGSIGQNYMRGYRMVFDRENMKLRWSASKCQ---EEKIEPPQASPGSTSSPYP 471
Query: 453 ANQEQSSPGGHAVGPAVA 470
E+ GHAV PA+A
Sbjct: 472 LPTEEQQSRGHAVSPAIA 489
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 255/515 (49%), Positives = 335/515 (65%), Gaps = 38/515 (7%)
Query: 21 GAETVMFSTKLIHRFSEEVKALGVSKNRNAT----SWPAKKSFEYYQVLLSSDVQKQKMK 76
GA FS++LIHRFSEE KA S+ ++ +WP + S EY+++LL SDV +Q+M+
Sbjct: 19 GAVGATFSSRLIHRFSEEAKAHLASRGNKSSVLLQAWPQRNSSEYFRLLLRSDVARQRMR 78
Query: 77 TGPQFQMLFPSQGSKTMSLGN-----------------------DFGCDLLWIPCDCVRC 113
G Q++ L+PS+G +T GN D G D+LW+PCDC+ C
Sbjct: 79 LGSQYETLYPSEGGQTFFFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIEC 138
Query: 114 APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENT 173
A LSA YN LDRDLN+Y PS S+TS+HL C H+LCD+ + C+ K PCPY + Y + NT
Sbjct: 139 ASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPCPYEVQYASANT 198
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
SSSG + ED LHL S G +A +NSVQAS+I+GCG KQ+G YL G PDG++GLG G ISV
Sbjct: 199 SSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVLGLGPGNISV 258
Query: 234 PSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 293
PSLLAKAGLI+NSFS+C D+++SGRI FGDQG TQ ST FL I Y++GVE+ C+
Sbjct: 259 PSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFL----PIIAYMVGVESFCV 314
Query: 294 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 353
GS CLK+T F+A++DSGSSFTFLP EVY+ + EFD+QVN + + W+ CY +SSQ
Sbjct: 315 GSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQS-SWEYCYNASSQ 373
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGT--QVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
L +P +KL F +N +F++ NP+F + Q T FCL + P D IGQNF+ GY
Sbjct: 374 ELVNIPPLKLAFSRNQTFLIQNPIFYDPASQEQEYTIFCLPVSPSADDYAAIGQNFLMGY 433
Query: 412 RVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAG 471
R+VFDRENL+ GWS NCQD T +P G NPLPANQ+Q+ P V PA+AG
Sbjct: 434 RLVFDRENLRFGWSRWNCQDRASFT----SPSNGGSPNPLPANQQQTVPNARGVPPAIAG 489
Query: 472 RAPSKPSTASTQLISSRSSSLKVLPFLLLLRLLVS 506
KPS A+ L+++ SL L + L L +S
Sbjct: 490 HTSPKPSAATPGLVTTSRHSLASLLLICHLWLWLS 524
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 503 bits (1295), Expect = e-139, Method: Compositional matrix adjust.
Identities = 277/536 (51%), Positives = 363/536 (67%), Gaps = 39/536 (7%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
M S I V +L TE G +FS++LIHRFS+E +A + ++ S P K+S
Sbjct: 1 MASRSAFILFCVLFLATE--GTLASVFSSRLIHRFSDEGRA-SIKTPSSSESLPEKQSLA 57
Query: 61 YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFG-------------------- 100
YY++L SD ++Q+M G +FQ L PS+GSKT+S GNDFG
Sbjct: 58 YYRLLAKSDFRRQRMNLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVAL 117
Query: 101 ---CDLLWIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
DLLWIPC+CV+CAPL+++YY+SL +DLNEY+PS+SS+SK CSH+LC + C
Sbjct: 118 DTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVFLCSHKLCGSASDCD 177
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SVQASVIIGCGMKQSGG 213
+PK+ C YT+ Y + NTSSSGLLVEDILHL +N L N SV+A V++GCG KQSG
Sbjct: 178 SPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVVGCGKKQSGD 237
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
YLDGVAPDGL+GLG EISVPS L+KAGL+RNSFS+CFD++DSGRI+FGD GP+ QQS
Sbjct: 238 YLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSAP 297
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 333
FL YI+GVE CCIG+SCLKQTSF +DSG SFT+LP+E+Y +A E DR +N
Sbjct: 298 FLQLENNS-GYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHIN 356
Query: 334 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
T SFEG W+ CY+SS + PK+P++KL F NN+FV++ P+FV +Q + FCL I
Sbjct: 357 ATSKSFEGVSWEYCYESSVE--PKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPI 414
Query: 394 QPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLP 452
P + + IG+IGQN+M GYR+VFDREN+KLGWS S CQ+ D T+ P PG+ S+P P
Sbjct: 415 SPSEQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKTEPP-QASPGSTSSPYP 471
Query: 453 ANQEQSSPGGHAVGPAVAGRAPSKPSTASTQLISS--RSSSLKVLPFLLLLRLLVS 506
E+ GHAV PA+AG+ PSK ++S+ SS SS +++ LLLL +VS
Sbjct: 472 LPTEEQQSRGHAVSPAIAGKTPSKTPSSSSSSKSSCIFSSMMRLFNSLLLLHWVVS 527
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 242/388 (62%), Positives = 304/388 (78%), Gaps = 27/388 (6%)
Query: 24 TVMFSTKLIHRFSEEVKALGVSKNRNAT---SWPAKKSFEYYQVLLSSDVQKQKMKTGPQ 80
++ F+++++HRFSEE+KAL S + N + SWP K S EYYQ L+S D ++QKMK G +
Sbjct: 21 SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKLGSR 80
Query: 81 FQMLFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLS 117
FQ+LFPS+GS T++LGNDFG DLLW+PC+C++CAPLS
Sbjct: 81 FQLLFPSEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLS 140
Query: 118 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSG 177
ASYY SLD+DLNEY PS+SSTSKH+SCSH LCD G SCQ+PKQ CPY +DY TENTSSSG
Sbjct: 141 ASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSG 200
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
LL++D+LHL SG +N+ ++QA VI+GCGMKQSGGYL GVAPDGL GLGLGEISV S L
Sbjct: 201 LLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSL 260
Query: 238 AKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 297
AK L++NSFS+CF++D SGRIFFGD+GPA+QQ+TSF+ +GKY TYI+GVE CCI +SC
Sbjct: 261 AKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSC 320
Query: 298 LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYKSSSQRLP 356
LKQTSFKA++DSG+SFT+LP+E YE I EFD+++N T SF+GYPWK CYK S+ +P
Sbjct: 321 LKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMP 380
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
K+PSV L+FP NNSFVV++PVF IYG Q
Sbjct: 381 KVPSVTLLFPLNNSFVVHDPVFPIYGDQ 408
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 239/493 (48%), Positives = 319/493 (64%), Gaps = 34/493 (6%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S +++HR S+E + ++ + S +Y++ L+ SD+Q+QK + G ++Q+L S
Sbjct: 29 SARMVHRLSDEARLAAGARGGRRWP--RRGSGDYFRALVRSDLQRQKRRVGGKYQLLSLS 86
Query: 88 QGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNSL 124
QG GND G DL W+PCDC++CAPLS SY+ SL
Sbjct: 87 QGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLS-SYHGSL 145
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
DRDL Y PS S+TS+HL CSH LC + C NPKQPCPY +DY++ENT+SSGLL+ED+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
HL S +A V ASVIIGCG KQSG YL+G+APDGL+GLG+ +ISVPS LA+AGL+R
Sbjct: 206 HLDSREGHA---PVNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF KDDSGRIFFGDQG TQQST F+ NGK TY + V+ CIG C + F+
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
A+VD+G+SFT LP + Y++I EFD+Q+N + S + Y ++ CY + +P +P++ L
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382
Query: 365 FPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F +N SF NP+ Q FCLA+ P +G IGQNFM GY VVFDREN+KLG
Sbjct: 383 FAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHVVFDRENMKLG 442
Query: 424 WSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAST 482
W S C DL++ T L P +P +PLP+N++Q+SP AV PAVAGRAPS + +
Sbjct: 443 WYRSECHDLDNSTMVSLGPSQHNSPEDPLPSNEQQTSP---AVTPAVAGRAPSSGGSTTL 499
Query: 483 QLISSRSSSLKVL 495
Q + + S+ L +L
Sbjct: 500 QNLLANSNMLLLL 512
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 239/493 (48%), Positives = 319/493 (64%), Gaps = 34/493 (6%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S +++HR S+E + ++ + S +Y++ L+ SD+Q+QK + G ++Q+L S
Sbjct: 29 SARMVHRLSDEARLAAGARGGRRWP--RRGSGDYFRALVRSDLQRQKRRVGGKYQLLSLS 86
Query: 88 QGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNSL 124
QG GND G DL W+PCDC++CAPLS SY+ SL
Sbjct: 87 QGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLS-SYHGSL 145
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
DRDL Y PS S+TS+HL CSH LC + C NPKQPCPY +DY++ENT+SSGLL+ED+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
HL S +A V ASVIIGCG KQSG YL+G+APDGL+GLG+ +ISVPS LA+AGL+R
Sbjct: 206 HLDSREGHA---PVNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF KDDSGRIFFGDQG TQQST F+ NGK TY + V+ CIG C + F+
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
A+VD+G+SFT LP + Y++I EFD+Q+N + S + Y ++ CY + +P +P++ L
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382
Query: 365 FPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F +N SF NP+ Q FCLA+ P +G IGQNFM GY VVFDREN+KLG
Sbjct: 383 FAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHVVFDRENMKLG 442
Query: 424 WSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAST 482
W S C DL++ T L P +P +PLP+N++Q+SP AV PAVAGRAPS + +
Sbjct: 443 WYRSECHDLDNSTTVSLGPSQHNSPEDPLPSNEQQTSP---AVTPAVAGRAPSSGGSTTL 499
Query: 483 QLISSRSSSLKVL 495
Q + + S+ L +L
Sbjct: 500 QNLLANSNMLLLL 512
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 255/533 (47%), Positives = 348/533 (65%), Gaps = 42/533 (7%)
Query: 6 LTIYLAVFWLLTESSGAETVM---FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEY 61
+ + + ++ LL + ETV+ FS+++IHRFS+E K L + N SWP + S EY
Sbjct: 1 MAVGVLLWLLLAKGFVLETVIAVTFSSRIIHRFSDEAKVHLRNNGGENVQSWPKRGSSEY 60
Query: 62 YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDF---------------------- 99
+++LL+SD+ +QKMK G Q Q +PS+GSKT+S GNDF
Sbjct: 61 FRLLLNSDLTRQKMKLGSQDQSFYPSEGSKTLSFGNDFVWLHYTWIDIGTPNVSFLVALD 120
Query: 100 -GCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 158
G D+ W+PCDC+ CAPLSA++YN+LDRDLN+YSPS SS+S+HL C H+LC+ ++C+
Sbjct: 121 TGSDMFWVPCDCIECAPLSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQNSNCKGF 180
Query: 159 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 218
K CPY +Y ++NTSSSG L+ED LHL S +NA KNS+QASVI+GCG KQSG +L+G
Sbjct: 181 KDRCPYIKEYTSDNTSSSGFLIEDKLHLAS--NNATKNSIQASVILGCGRKQSGYFLEGA 238
Query: 219 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ-QSTSFLAS 277
AP+G++GLG G ISVP+LLAKAGLIRNS S+C ++ SGRI FGDQG ATQ +ST FL
Sbjct: 239 APNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSGRILFGDQGHATQRRSTPFLLD 298
Query: 278 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-I 336
+G+ + Y +GVE C+GS C K+T FKA +D+G+SFT+LPK VYET+ AEF++QV+ T I
Sbjct: 299 DGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRI 358
Query: 337 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
TS + CCY +SS+ P +K F +N SF++ NP I Q T CLA+
Sbjct: 359 TSQIQSDFNCCYNASSRESNNFPPMKFTFSKNQSFIIQNP--FISMDQEDTTICLAVVQS 416
Query: 397 DGDIGTIG-------QNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSN 449
D ++ TIG QNF+ GY +VFDRENL+ GW SNCQD + + +P G +
Sbjct: 417 DDELITIGRKYTIACQNFLMGYDMVFDRENLRFGWFRSNCQDSMGESANFTSPSIGGSPD 476
Query: 450 PLPANQEQSSPGG-HAVGPAVAGRAPSKPSTASTQLISSR-SSSLKVLPFLLL 500
+P+NQ+Q P +V PA+AG+ KPS A L S +SL ++ LL
Sbjct: 477 SIPSNQQQRVPNNTRSVPPAIAGKTSPKPSAAKPGLNSWHLLNSLSLICLLLF 529
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 239/494 (48%), Positives = 317/494 (64%), Gaps = 35/494 (7%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
FS++++HR S+E + + WP + S YY+ LL SD+Q+QK + + Q+L
Sbjct: 27 FSSRMVHRLSDEAR---LEAGPRMGLWPQRGSGGYYRALLRSDLQRQKRRLAGKNQLLSL 83
Query: 87 SQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNS 123
S+G T S GND G DL W+PCDC++CAPLS SY +
Sbjct: 84 SKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQCAPLS-SYRGN 142
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
LDRDL Y P+ S+TS+HL CSH LC G+ C NPKQPC Y +DY++ENT+SSGLL+ED
Sbjct: 143 LDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSENTTSSGLLIEDS 202
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 243
LHL S +A V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL+
Sbjct: 203 LHLNSREGHA---PVNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLV 259
Query: 244 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 303
RNSFSMCF +D SGRIFFGDQG ++QQST F+ GK TY + V+ CIG CL+ +SF
Sbjct: 260 RNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSF 319
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 363
+A+VDSG+SFT LP +VY+ EFD+Q+N + +E WK CY +S +P +P++ L
Sbjct: 320 QALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASPLEMPDVPTIIL 379
Query: 364 MFPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
F N SF NP+ Q + FCLA+ P IG IGQNF+ GY VVFDRE++KL
Sbjct: 380 AFAANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGYHVVFDRESMKL 439
Query: 423 GWSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAS 481
GW S C+D+++ T PL P G+ +PLP+N++Q+SP V PA G AP +T +
Sbjct: 440 GWYRSECRDVDNSTTVPLGPSQHGSSEDPLPSNEQQTSP---PVTPATTGTAPPSSATTN 496
Query: 482 TQLISSRSSSLKVL 495
Q++ + S L L
Sbjct: 497 RQMLFASSYPLLFL 510
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 226/469 (48%), Positives = 310/469 (66%), Gaps = 34/469 (7%)
Query: 25 VMFSTKLIHRFSEEVKALGVSK---NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
+ S L+HRFS+E K+L S+ N +A WP S +Y+Q+L+ D++++++ G ++
Sbjct: 22 LTLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLKYFQMLMDYDLKRRRLNIGSKY 81
Query: 82 QMLFPSQGSKTMSLGNDF-----------------------GCDLLWIPCDCVRCAPLSA 118
+LFPS+GS+ + GN+F G DLLW+PCDC++CAPLSA
Sbjct: 82 DVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSA 141
Query: 119 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGL 178
+YY+ LDRDL+EY+P+ SSTSKHL C H+LC T+C++ PC Y DYY++NTS+SG
Sbjct: 142 NYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGF 201
Query: 179 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 238
++ED L L S + + +QASV+ GCG KQSG YLDG APDG++GLG G ISVP+LLA
Sbjct: 202 MIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLA 261
Query: 239 KAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
+ GL+RN+FS+CFD + SGRI FGD GPATQQ+T FL G++ Y IGVE+ C+GSSCL
Sbjct: 262 QEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPLFGEFAAYFIGVESFCVGSSCL 321
Query: 299 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQ--VNDTITSFEGYPWKCCYKSSSQRLP 356
+++ F+A+VDSGSSFT+LP EVY+ I EFD+Q VN T PW CY S+
Sbjct: 322 QRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELPWNYCYNISTLVSF 381
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
+PS++L+FP N F +++PV+V+ Q FCL ++ D D G IGQN M GYR+VFD
Sbjct: 382 NIPSMQLVFPLNQIF-IHDPVYVLPANQGYKVFCLTLEETDEDYGVIGQNLMVGYRMVFD 440
Query: 417 RENLKLGWSHSNCQDLNDGTKSPLTP--GPGTPSNPL---PANQEQSSP 460
RENLKLGWS S C D+N T P G +P+ P N++ +P
Sbjct: 441 RENLKLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALPPTNRQAIAP 489
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 241/472 (51%), Positives = 309/472 (65%), Gaps = 37/472 (7%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGP-QFQMLFP 86
ST++++R S+E + ++ WP + S +YY+ L+ SD+Q+QK + G + Q+L
Sbjct: 135 STRMVYRLSDEAR---MAAGTRGARWPRRGSGDYYRSLVRSDLQRQKRRLGGGKHQLLSF 191
Query: 87 SQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNS 123
S+ + GNDFG DL WIPCDC+ CAPLS Y+ S
Sbjct: 192 SKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWIPCDCIECAPLSG-YHGS 250
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
LDRDL Y P+ S+TS+HL CSH LC LG+ C N KQPCPY Y ENT+SSGLLVEDI
Sbjct: 251 LDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDI 310
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 243
LHL S +A V+ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL+
Sbjct: 311 LHLDSRESHA---PVKASVIIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLV 367
Query: 244 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 303
RNSFSMCF KD SGRIFFGDQG +TQQST F+ GK TY + V+ C+G C + TSF
Sbjct: 368 RNSFSMCFTKD-SGRIFFGDQGVSTQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSF 426
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 363
+AIVDSG+SFT LP ++Y+ +A EFD+QVN + E + CY +S +P +P+V L
Sbjct: 427 QAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLVMPDVPTVTL 486
Query: 364 MFPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
F N SF NP F+++ + V GFCLA+ IG I QNF+ GY VVFDREN+KL
Sbjct: 487 TFAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIGIIAQNFLLGYHVVFDRENMKL 546
Query: 423 GWSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRA 473
GW S C DL++ T PL P +P +PLP+N++Q+SP AV PAVAGRA
Sbjct: 547 GWYRSECHDLDNSTTVPLGPSQHNSPEDPLPSNEQQTSP---AVTPAVAGRA 595
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 237/490 (48%), Positives = 308/490 (62%), Gaps = 39/490 (7%)
Query: 31 LIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS 90
++HR S+E + + WP + S EYY+ L+ SD+Q+QK + +L S+G
Sbjct: 1 MVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLSKGG 53
Query: 91 KTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNSLDRD 127
T S GND G DL W+PCDC++CAPLS Y +LDRD
Sbjct: 54 STFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNLDRD 112
Query: 128 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
L Y P+ S+TS+HL CSH LC C NPKQPCPY +DY++ENT+SSGLL+ED LHL
Sbjct: 113 LRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLN 172
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
D+ V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++NSF
Sbjct: 173 YREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSF 229
Query: 248 SMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 307
SMCF +D SGRIFFGDQG +QQST F+ GK TY + V+ CIG CL+ TSFKA+V
Sbjct: 230 SMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALV 289
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSG+SFT LP +VY+ EFD+Q+N T +E WK CY +S +P +P++ L F
Sbjct: 290 DSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTFAA 349
Query: 368 NNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
+ S NP+ Q + GFCLA+ P IG I QNF+ GY VVFDRE++KLGW
Sbjct: 350 DKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLGWYR 409
Query: 427 SNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQLI 485
S C D+ D T PL P +P +PLP+N++Q+SP AV PA AG AP +T + Q++
Sbjct: 410 SECHDVEDSTTVPLGPSQRDSPEDPLPSNEQQTSP---AVTPATAGTAPLSCATTNLQML 466
Query: 486 SSRSSSLKVL 495
+ S L +L
Sbjct: 467 LASSYPLLLL 476
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 238/486 (48%), Positives = 312/486 (64%), Gaps = 39/486 (8%)
Query: 24 TVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQM 83
+ ST+++HR S+E + ++ + WP S YY+ L+ SD+Q+QK K Q+
Sbjct: 71 SATLSTRMVHRLSDEAR---LAAGPHGARWPRHGSGGYYRALVRSDLQRQKRK----HQL 123
Query: 84 LFPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASY 120
L S+ S GNDFG DL W+PCDC+ CAPL A Y
Sbjct: 124 LSVSEAGGIFSPGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPL-AGY 182
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 180
+LDRDL Y P+ S+TS+HL CSH LC G+ C +PKQPCPY+ DY ENT+SSGLL+
Sbjct: 183 RETLDRDLGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLI 242
Query: 181 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
EDILHL S +A V+ASV+IGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+A
Sbjct: 243 EDILHLDSRESHA---PVKASVVIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARA 299
Query: 241 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 300
GL+RNSFSMCF K+DSGRIFFGDQG + QQST F+ GKY TY + V+ C+G C +
Sbjct: 300 GLVRNSFSMCF-KEDSGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEA 358
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 360
TSF+A+VDSG+SFT LP VY+ +A EFD+QV+ + E ++ CY +S ++P +P+
Sbjct: 359 TSFEALVDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSASPLKMPDVPT 418
Query: 361 VKLMFPQNNSFVVNNPVFVIY-GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
V L F N SF NP V+ G V GFCLA+Q IG IGQNF+TGY +VFD+EN
Sbjct: 419 VTLTFAANKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNFLTGYHIVFDKEN 478
Query: 420 LKLGWSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPS 478
+KLGW S C D ++ T PL P +P PLP++++Q+SP PAVAG+AP+ S
Sbjct: 479 MKLGWYRSECHDPDNSTTVPLGPSQHNSPGVPLPSSEQQTSPT--VTPPAVAGKAPTSSS 536
Query: 479 TASTQL 484
+ L
Sbjct: 537 GPPSNL 542
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 237/493 (48%), Positives = 311/493 (63%), Gaps = 39/493 (7%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S++++HR S+E + + WP + S EYY+ L+ SD+Q+QK + +L S
Sbjct: 28 SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80
Query: 88 QGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNSL 124
+G T S GND G DL W+PCDC++CAPLS Y +L
Sbjct: 81 KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
DRDL Y P+ S+TS+HL CSH LC C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
HL D+ V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++
Sbjct: 200 HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF +D SGRIFFGDQG +QQST F+ GK TY + V+ CIG CL+ TSFK
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
A+VDSG+SFT LP +VY+ EFD+Q+N T +E WK CY +S +P +P++ L
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376
Query: 365 FPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + S NP+ Q + GFCLA+ P IG I QNF+ GY VVFDRE++KLG
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436
Query: 424 WSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAST 482
W S C+ + D T PL P +P +PLP+N++Q+SP AV PA AG AP +T +
Sbjct: 437 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTSP---AVTPATAGTAPLSCATTNL 493
Query: 483 QLISSRSSSLKVL 495
Q++ + S L +L
Sbjct: 494 QMLLASSYPLLLL 506
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 236/493 (47%), Positives = 310/493 (62%), Gaps = 39/493 (7%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S++++HR S+E + + WP + S EYY+ L+ SD+Q+QK + +L S
Sbjct: 28 SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80
Query: 88 QGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNSL 124
+G T S GND G DL W+PCDC++CAPLS Y +L
Sbjct: 81 KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
DRDL Y P+ S+TS+HL CSH LC C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
HL D+ V ASVIIGCG KQSG YLDG+APDGL+ LG+ +ISVPS LA+AGL++
Sbjct: 200 HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQ 256
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF +D SGRIFFGDQG +QQST F+ GK TY + V+ CIG CL+ TSFK
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
A+VDSG+SFT LP +VY+ EFD+Q+N T +E WK CY +S +P +P++ L
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376
Query: 365 FPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + S NP+ Q + GFCLA+ P IG I QNF+ GY VVFDRE++KLG
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436
Query: 424 WSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAST 482
W S C+ + D T PL P +P +PLP+N++Q+SP AV PA AG AP +T +
Sbjct: 437 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTSP---AVTPATAGTAPLSCATTNL 493
Query: 483 QLISSRSSSLKVL 495
Q++ + S L +L
Sbjct: 494 QMLLASSYPLLLL 506
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 211/427 (49%), Positives = 272/427 (63%), Gaps = 35/427 (8%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S++++HR S+E + + WP + S EYY+ L+ SD+Q+QK + +L S
Sbjct: 28 SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80
Query: 88 QGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYYNSL 124
+G T S GND G DL W+PCDC++CAPLS Y +L
Sbjct: 81 KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
DRDL Y P+ S+TS+HL CSH LC C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
HL D+ V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++
Sbjct: 200 HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF +D SGRIFFGDQG +QQST F+ GK TY + V+ CIG CL+ TSFK
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
A+VDSG+SFT LP +VY+ EFD+Q+N T +E WK CY +S +P +P++ L
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376
Query: 365 FPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + S NP+ Q + GFCLA+ P IG I QNF+ GY VVFDRE++KLG
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436
Query: 424 WSHSNCQ 430
W S C+
Sbjct: 437 WYRSECK 443
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 197/373 (52%), Positives = 252/373 (67%), Gaps = 8/373 (2%)
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
DRDL Y P+ S+TS+HL CSH LC C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 3 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 62
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
HL D+ V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++
Sbjct: 63 HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 119
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF +D SGRIFFGDQG +QQST F+ GK TY + V+ CIG CL+ TSFK
Sbjct: 120 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 179
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
A+VDSG+SFT LP +VY+ EFD+Q+N T +E WK CY +S +P +P++ L
Sbjct: 180 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 239
Query: 365 FPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + S NP+ Q + GFCLA+ P IG I QNF+ GY VVFDRE++KLG
Sbjct: 240 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 299
Query: 424 WSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAST 482
W S C+ + D T PL P +P +PLP+N++Q+SP AV PA AG AP +T +
Sbjct: 300 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTSP---AVTPATAGTAPLSCATTNL 356
Query: 483 QLISSRSSSLKVL 495
Q++ + S L +L
Sbjct: 357 QMLLASSYPLLLL 369
>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
Length = 313
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 166/279 (59%), Positives = 212/279 (75%), Gaps = 8/279 (2%)
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 255
+SV+A V+IGCG KQSG YLDGVAPDGL+GLG EISVPS L+KAGL+RNSFS+CFD++D
Sbjct: 5 SSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED 64
Query: 256 SGRIFFGDQGPATQQSTSFLA-SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
SGRI+FGD GP+ QQST FL N KY YI+GVE CCIG+SCLKQTSF +DSG SFT
Sbjct: 65 SGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 124
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
+LP+E+Y +A E DR +N T +FEG W+ CY+SS++ PK+P++KL F NN+FV++
Sbjct: 125 YLPEEIYRKVALEIDRHINATSKNFEGVSWEYCYESSAE--PKVPAIKLKFSHNNTFVIH 182
Query: 375 NPVFVIYGTQVVTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLN 433
P+FV +Q + FCL I P + IG+IGQN+M GYR+VFDREN+KLGWS S CQ+
Sbjct: 183 KPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE-- 240
Query: 434 DGTKSP-LTPGPGTPSNPLPANQEQSSPGGHAVGPAVAG 471
D + P +PG + NPLP +++QS GGHAV PA+AG
Sbjct: 241 DKIEPPQASPGSTSSPNPLPTDEQQSR-GGHAVSPAIAG 278
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 165/416 (39%), Positives = 235/416 (56%), Gaps = 31/416 (7%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC++CAP + Y SL D+ YSP+ S+TS+ + CS LCDL +C++
Sbjct: 80 DTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRS 137
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY++ Y ++NTSSSG+LVED+L+L S D+A V A ++ GCG Q+G +L
Sbjct: 138 KSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSAQSKIVTAPIMFGCGQVQTGSFLGS 195
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFL 275
AP+GL+GLG+ SVPSLLA GL NSFSMCF D GRI FGD G + Q+ T +
Sbjct: 196 AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVY 255
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
N Y I G+ +GS + T F AIVDSG+SFT L +Y I + FD Q+ +
Sbjct: 256 KQNPYYNITITGIT---VGSKSIS-TEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSS 311
Query: 336 ITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGFCLAI 393
+ P++ CY S+ + P+V L + F VN+P+ I G+CLAI
Sbjct: 312 RNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAI 370
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGP-GTPSNP-- 450
+G + IG+NFM+G +VVFDRE + LGW + NC + ++ ++ P+ P P PS P
Sbjct: 371 MKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLPVNPSPSAVPSKPGL 429
Query: 451 -----LPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQLISSRSSSLKVLPFLLLL 501
P + + P G V + +P +P + S + VL FL++L
Sbjct: 430 GPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSVSATI---------VLLFLIVL 476
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 171/465 (36%), Positives = 246/465 (52%), Gaps = 34/465 (7%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKAL--GVSKNRNATSWPAKKSFEY 61
S ++++ + + +FS ++ HRFSE VK G A +WPAK SFEY
Sbjct: 3 FSWSVFIVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEY 62
Query: 62 YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKT-------------MSLGN---------DF 99
Y L D + + +L S G+ T +SLG D
Sbjct: 63 YAELAHRDRALRGRRLSDIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDT 122
Query: 100 GCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 159
G DL W+PCDC RCAP + Y S D +L+ Y+P SSTS+ ++C + LC C
Sbjct: 123 GSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTF 181
Query: 160 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 219
CPY + Y + TS+SG+LVED+LHL + ++ + V+A V GCG Q+G +LD A
Sbjct: 182 SNCPYMVSYVSAETSTSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIAA 239
Query: 220 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG 279
P+GL GLGL +ISVPS+L+K G +SFSMCF D GRI FGD+G Q+ T F N
Sbjct: 240 PNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPF-NLNA 298
Query: 280 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+ TY I V +G++ L F A+ DSG+SFT+L +Y + F Q D+
Sbjct: 299 LHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPP 357
Query: 340 EG-YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVD 397
+ P++ CY S + +PS+ L + F V +P+ +I +Q +C+A+
Sbjct: 358 DSRIPFEFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPIIII-SSQSELIYCMAVVR-S 415
Query: 398 GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTP 442
++ IGQNFMTGYR++FDRE L LGW C D+ + + P+ P
Sbjct: 416 AELNIIGQNFMTGYRIIFDREKLVLGWKEFECDDI-ENSSVPIRP 459
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 165/416 (39%), Positives = 235/416 (56%), Gaps = 31/416 (7%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC++CAP + Y SL D+ YSP+ S+TS+ + CS LCDL +C++
Sbjct: 94 DTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRS 151
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY++ Y ++NTSSSG+LVED+L+L S D+A V A ++ GCG Q+G +L
Sbjct: 152 KSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSAQSKIVTAPIMFGCGQVQTGSFLGS 209
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFL 275
AP+GL+GLG+ SVPSLLA GL NSFSMCF D GRI FGD G + Q+ T +
Sbjct: 210 AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVY 269
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
N Y I G+ +GS + T F AIVDSG+SFT L +Y I + FD Q+ +
Sbjct: 270 KQNPYYNITITGIT---VGSKSI-STEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSS 325
Query: 336 ITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGFCLAI 393
+ P++ CY S+ + P+V L + F VN+P+ I G+CLAI
Sbjct: 326 RNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAI 384
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGP-GTPSNP-- 450
+G + IG+NFM+G +VVFDRE + LGW + NC + ++ ++ P+ P P PS P
Sbjct: 385 MKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLPVNPSPSAVPSKPGL 443
Query: 451 -----LPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQLISSRSSSLKVLPFLLLL 501
P + + P G V + +P +P + S + VL FL++L
Sbjct: 444 GPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSVSATI---------VLLFLIVL 490
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 278 bits (710), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 164/416 (39%), Positives = 235/416 (56%), Gaps = 31/416 (7%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC++CAPL + Y SL D+ YSP+ S+TS+ + CS LCDL +C++
Sbjct: 117 DTGSDLFWVPCDCLKCAPLQSPNYGSLKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRS 174
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY++ Y ++NTSSSG+LVED+L+L S D+A V A ++ GCG Q+G +L
Sbjct: 175 KSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSAQSKIVTAPIMFGCGQVQTGSFLGS 232
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFL 275
AP+GL+GLG+ SVPSLLA GL NSFSMCF D GRI FGD G + Q+ T +
Sbjct: 233 AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVY 292
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
N Y I G+ +GS + T F AIVDSG+SFT L +Y I + FD Q+ +
Sbjct: 293 KQNPYYNITITGIT---VGSKSIS-TEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSS 348
Query: 336 ITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGFCLAI 393
+ P++ CY S+ + P+V L + F VN+P+ I G+CLAI
Sbjct: 349 RNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAI 407
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGT------- 446
+G + IG+NFM+G +VVFDRE + LGW + NC + ++ ++ P+ P P
Sbjct: 408 MKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLPVNPSPSAVPPKPGL 466
Query: 447 -PSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQLISSRSSSLKVLPFLLLL 501
PS+ P + + P G V + +P +P + + VL FL++L
Sbjct: 467 GPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSVFATI---------VLLFLIVL 513
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 178/466 (38%), Positives = 246/466 (52%), Gaps = 43/466 (9%)
Query: 26 MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
+F+ K+ HRFS+ +K L S + + ++P+K SFEYY L D + K L
Sbjct: 27 IFTFKMHHRFSDMLKDL--SDSTTSRNFPSKGSFEYYAELAHRDQMLRGRKLYNVEAPLA 84
Query: 86 PSQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYNS 123
S G+ T + + D G DL W+PCDC +CAP Y S
Sbjct: 85 FSDGNSTFRISSLGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAYAS 144
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
D +L+ Y P SSTSK ++C++ LC C CPY + Y + TS+SG+LVED+
Sbjct: 145 -DFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDV 203
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 243
LHL S N + S++A V GCG QSG +L+ AP+GL GLG+ +ISVPS+L++ GL
Sbjct: 204 LHLTSEDSN--QESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLT 261
Query: 244 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 303
+SFSMCF D GRI FGD+G Q+ T F SN + +Y I V +G++ L F
Sbjct: 262 ADSFSMCFGHDGVGRISFGDKGSPDQEETPF-NSNPSHPSYNISVTQVRVGTT-LVDVDF 319
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKL-PSV 361
A+ DSG+SFT+L +Y ++ F Q D + P++ CY S L PS+
Sbjct: 320 TALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSM 379
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
L F V +P+ VI TQ +CLAI ++ IGQNFMTGYRVVFDRE L
Sbjct: 380 SLTMKGRGHFTVFDPIIVI-TTQNELVYCLAIVK-STELNIIGQNFMTGYRVVFDREKLV 437
Query: 422 LGWSHSNC--QDLNDGTKSP--------LTPGPGTPSNPLPANQEQ 457
LGW ++C Q+ N P + G G S+P NQ++
Sbjct: 438 LGWKETDCYDQEYNSFPTEPHASDVPPAVAAGLGNYSSPHSTNQDR 483
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 153/358 (42%), Positives = 214/358 (59%), Gaps = 15/358 (4%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC++CAP + Y SL D+ YSP+ S+TS+ + CS LCDL +C++
Sbjct: 117 DTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRS 174
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY++ Y ++NTSSSG+LVED+L+L S D+A V A ++ GCG Q+G +L
Sbjct: 175 KSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSAQSKIVTAPIMFGCGQVQTGSFLGS 232
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFL 275
AP+GL+GLG+ SVPSLLA GL NSFSMCF D GRI FGD G + Q+ T +
Sbjct: 233 AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVY 292
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
N Y I G+ +GS + T F AIVDSG+SFT L +Y I + FD Q+ +
Sbjct: 293 KQNPYYNITITGI---TVGSKSI-STEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSS 348
Query: 336 ITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGFCLAI 393
+ P++ CY S+ + P+V L + F VN+P+ I G+CLAI
Sbjct: 349 RNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAI 407
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGP-GTPSNP 450
+G + IG+NFM+G +VVFDRE + LGW + NC + ++ ++ P+ P P PS P
Sbjct: 408 MKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLPVNPSPSAVPSKP 464
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 180/492 (36%), Positives = 246/492 (50%), Gaps = 44/492 (8%)
Query: 31 LIHRFSEEVKALGVSKNRNATSW--PAKKSFEYYQVLLSSD---VQKQKMKTGPQFQMLF 85
L HR S V+ ++ +W A+ + EYY L D + ++ + G +L
Sbjct: 33 LHHRSSPVVRRWAEARGHPGAAWWAEAEGTPEYYAALHRHDRAHLARRGLAEGDGEGLLT 92
Query: 86 PSQGSKTMSLGN--------------------DFGCDLLWIPCDCVRCAPLSASYYNSLD 125
+ G+ T L D G DL W+PCDC +CAP++ +
Sbjct: 93 FASGNLTFRLEGSLHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIANASDLRGG 152
Query: 126 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ---NPKQPCPYTMDYYTENTSSSGLLVED 182
DL YSP SSTSK ++C H LC+ +C N CPYT+ Y + NTSSSG+LVED
Sbjct: 153 PDLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVED 212
Query: 183 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 242
+LHL +V A V++GCG Q+G +LDG A DGL+GLG+ ++SVPS+L AGL
Sbjct: 213 VLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVPSVLHAAGL 272
Query: 243 I-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
+ +SFSMCF D GRI FGD G Q T F N + TY I V + +
Sbjct: 273 VASDSFSMCFSPDGFGRINFGDSGRRGQAETPFTVRN-THPTYNISVTAMSVSGKEVA-A 330
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKLP 359
F AIVDSG+SFT+L Y +A F+ +V + + P++ CY+ Q +P
Sbjct: 331 EFAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYELGRGQTELFVP 390
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQ-----VVTGFCLAIQPVDGDIGTIGQNFMTGYRVV 414
V L F V P+ VIYG V G+CLA+ D I IGQNFMTG +VV
Sbjct: 391 EVSLTTRGGAVFPVTRPIVVIYGETSDGRIVAAGYCLAVLKNDITIDIIGQNFMTGLKVV 450
Query: 415 FDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSS-----PGGHAVGPAV 469
FDRE LGW +C + + PGP +P+ L Q + + PG V P
Sbjct: 451 FDRERSVLGWHEFDCYKDVETEELGAAPGP-SPTTRLKPRQSEVANGTPYPGAVPVTPRQ 509
Query: 470 AGRAPSKPSTAS 481
AG ++PS+ S
Sbjct: 510 AGSGGNRPSSFS 521
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 180/483 (37%), Positives = 248/483 (51%), Gaps = 49/483 (10%)
Query: 26 MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
+++ + HR SE V+ S + P K + EYY L D + K L
Sbjct: 20 VYTFTMHHRHSEPVRKWSHSTASGIPAPPEKGTVEYYAELADRDRLLRGRKLSQIDDGLA 79
Query: 86 PSQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYNS 123
S G+ T + + D G DL W+PCDC RCA +S + S
Sbjct: 80 FSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRCAATDSSAFAS 139
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
D DLN Y+P+ SSTSK ++C++ LC + C CPY + Y + TS+SG+LVED+
Sbjct: 140 -DFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAETSTSGILVEDV 198
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 243
LHL ++ + V+A+VI GCG QSG +LD AP+GL GLG+ +ISVPS+L++ G
Sbjct: 199 LHLTQ--EDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFT 256
Query: 244 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 303
+SFSMCF +D GRI FGD+G Q T F N + TY I V +G++ L F
Sbjct: 257 ADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRVGTT-LIDVEF 314
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYKSSSQRLPKL-PSV 361
A+ DSG+SFT+L Y + F QV D S P++ CY S L PSV
Sbjct: 315 TALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSV 374
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
L + F V +P+ +I TQ +CLA+ ++ IGQNFMTGYRVVFDRE L
Sbjct: 375 SLTMGGGSHFAVYDPIIII-STQSELVYCLAVVKT-AELNIIGQNFMTGYRVVFDREKLV 432
Query: 422 LGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHA-VGPAVAGRAPSKPSTA 480
LGW +C D+ D ++ +P + P HA V PAVA + P+T
Sbjct: 433 LGWKKFDCYDIEDH------------NDAIP-----TRPHSHADVPPAVAAGLGNYPATD 475
Query: 481 STQ 483
T+
Sbjct: 476 PTR 478
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 179/483 (37%), Positives = 242/483 (50%), Gaps = 54/483 (11%)
Query: 31 LIHRFSEEVKALGVSKNR-NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQG 89
L HRFS VK S+ R A +W + S EYY L + D ++ + G +L + G
Sbjct: 13 LHHRFSPVVKRWAESRGRPAAAAWWPEGSPEYYSALSAHDRARRVLAGGKGESLLSFADG 72
Query: 90 SKT-----------MSLGN---------DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLN 129
+ T ++LG D G DL W+PCDC RCAP++ + L
Sbjct: 73 NSTTRHAGSLHYAKVALGTPNATFVVALDTGSDLFWVPCDCKRCAPIA-----NTSELLK 127
Query: 130 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI-- 187
YSP SSTSK ++CSH LCD +C N CPYT+ Y + NTSSSG+LVED+L++
Sbjct: 128 PYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMTRQ 187
Query: 188 -----SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 242
SG + +V A V+ GCG +Q+G +LDG A +GL+GLG+ +SVPSLLA AGL
Sbjct: 188 SSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDRVSVPSLLAAAGL 247
Query: 243 I-RNSFSMCFDKDDSGRIFFGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 300
+ +SFSMCF D +GRI FG+ A Q T F+ S + TY I V +
Sbjct: 248 VGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTR-PTYNISVTAVNVKGKGAMA 306
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKL 358
F A+VDSG+SFT+L Y +A F+ QV + + P++ CY S Q +
Sbjct: 307 AEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCYALSRGQTEVLM 366
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQV-----VTGFCLAIQPVDGDIGTIGQNFMTGYRV 413
P V L F V P ++ G G+CLA+ D I IGQNFMTG +V
Sbjct: 367 PEVSLTTRGGAVFPVTRPFVIVAGETTDGQVHAVGYCLAVFKSDIPIDIIGQNFMTGLKV 426
Query: 414 VFDRENLKLGWSHSNC---QDLNDGTKSPLTPG--------PGTPSNPLPANQEQSSPGG 462
VFDR+ LGW+ +C + D PG P P P + S G
Sbjct: 427 VFDRQRSVLGWTKFDCYKNMKVEDDGSPAAAPGPMPVTQLRPRQSDTPFPGAVQPRSAAG 486
Query: 463 HAV 465
HA+
Sbjct: 487 HAL 489
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 169/456 (37%), Positives = 249/456 (54%), Gaps = 36/456 (7%)
Query: 7 TIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLL 66
T++L +L +F+ ++ HRFS+EVK S R A +P K SFEY+ L+
Sbjct: 9 TLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFNALV 67
Query: 67 SSD--VQKQKMKTGPQFQM--LFPSQGSKT-------------MSLGN---------DFG 100
D ++ +++ L S G+ T + LG D G
Sbjct: 68 LRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTG 127
Query: 101 CDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ 160
DL W+PCDC +CAP + Y S + +L+ Y+P S+T+K ++C++ LC C
Sbjct: 128 SDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFS 186
Query: 161 PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP 220
CPY + Y + TS+SG+L+ED++HL + N + V+A V GCG QSG +LD AP
Sbjct: 187 TCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAP 244
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGK 280
+GL GLG+ +ISVPS+LA+ GL+ +SFSMCF D GRI FGD+G + Q+ T F N
Sbjct: 245 NGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPS 303
Query: 281 YITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+ Y I V +G++ L F A+ D+G+SFT+L +Y T++ F Q D S +
Sbjct: 304 HPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPD 362
Query: 341 G-YPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG 398
P++ CY S+ L PS+ L N+ F +N+P+ VI T+ +CLAI
Sbjct: 363 SRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAIVK-SS 420
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
++ IGQN+MTGYRVVFDRE L L W +C D+ +
Sbjct: 421 ELNIIGQNYMTGYRVVFDREKLVLAWKKFDCYDIEE 456
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 181/505 (35%), Positives = 256/505 (50%), Gaps = 52/505 (10%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQ 63
+ + + L W + G +++ + HR SE V+ S + P + + EYY
Sbjct: 5 VFIIVSLLSLWECCQCHGH---VYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVEYYA 61
Query: 64 VLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN----------------------DFGC 101
L D + K L S G+ T + + D G
Sbjct: 62 ELADRDRLLRGRKLSQIDAGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTGS 121
Query: 102 DLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQP 161
DL W+PCDC RCA ++ + S D DLN Y+P+ SSTSK ++C++ LC + C
Sbjct: 122 DLFWVPCDCTRCAASDSTAFAS-DFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSN 180
Query: 162 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD 221
CPY + Y + TS+SG+LVED+LHL ++ + V+A+VI GCG QSG +LD AP+
Sbjct: 181 CPYMVSYVSAETSTSGILVEDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPN 238
Query: 222 GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKY 281
GL GLG+ +ISVPS+L++ G +SFSMCF +D GRI FGD+G Q T F N +
Sbjct: 239 GLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSH 297
Query: 282 ITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITSFE 340
TY I V +G++ + F A+ DSG+SFT+L Y + F QV D S
Sbjct: 298 PTYNITVTQVRVGTTVI-DVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDS 356
Query: 341 GYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD 399
P++ CY S L PSV L + F V +P+ +I TQ +CLA+ +
Sbjct: 357 RIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIII-STQSELVYCLAVVK-SAE 414
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSS 459
+ IGQNFMTGYRVVFDRE L LGW +C D+ D ++ +P +
Sbjct: 415 LNIIGQNFMTGYRVVFDREKLVLGWKKFDCYDIEDH------------NDAIP-----TR 457
Query: 460 PGGHA-VGPAVAGRAPSKPSTASTQ 483
P HA V PAVA + P+T ST+
Sbjct: 458 PRSHADVPPAVAAGLGNYPATDSTR 482
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 271 bits (692), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 165/435 (37%), Positives = 242/435 (55%), Gaps = 34/435 (7%)
Query: 26 MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
+F+ ++ HRFS+EVK S R +P K SFEY+ L+ D ++ +++
Sbjct: 28 IFTFEMHHRFSDEVKQWSDSTGR-FVKFPPKGSFEYFNALVLRDWLIRGRRLSDSESESS 86
Query: 84 LFPSQGSKT-------------MSLGN---------DFGCDLLWIPCDCVRCAPLSASYY 121
L S G+ T + LG D G DL W+PCDC +CAP + Y
Sbjct: 87 LTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATY 146
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
S + +L+ Y+P S+T+K ++C++ LC C CPY + Y + TS+SG+L+E
Sbjct: 147 AS-EFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILME 205
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D++HL + N + V+A V GCG QSG +LD AP+GL GLG+ +ISVPS+LA+ G
Sbjct: 206 DVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREG 263
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
L+ +SFSMCF D GRI FGD+G + Q+ T F N + Y I V +G++ L
Sbjct: 264 LVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-LIDD 321
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYKSSSQRLPKL-P 359
F A+ D+G+SFT+L +Y T++ F Q D S + P++ CY S+ L P
Sbjct: 322 EFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIP 381
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
S+ L N+ F +N+P+ VI T+ +CLAI ++ IGQN+MTGYRVVFDRE
Sbjct: 382 SLSLTMKGNSHFTINDPIIVI-STEGELVYCLAIVK-SSELNIIGQNYMTGYRVVFDREK 439
Query: 420 LKLGWSHSNCQDLND 434
L L W +C D+ +
Sbjct: 440 LVLAWKKFDCYDIEE 454
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 160/413 (38%), Positives = 225/413 (54%), Gaps = 16/413 (3%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC++CAPLS+ Y +L D+ YSP SSTS+ + CS +CDL T C
Sbjct: 126 DTGSDLFWVPCDCLKCAPLSSPDYGNLKFDV--YSPRKSSTSRKVPCSSNMCDLQTECSA 183
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY ++Y ++NTSS G+LVED+++L + ++ QA + GCG Q+G +L
Sbjct: 184 ASNSCPYKIEYLSDNTSSKGVLVEDVMYLAT--ESGHSKITQAPITFGCGQVQTGSFLGS 241
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFL 275
AP+GL+GLG+ SVPSLLA G+ NSFSMCF +D GRI FGD G A Q T +
Sbjct: 242 AAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGEDGHGRINFGDTGSADQLETPLNIY 301
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
N Y I+G + T F A+VDSG+SFT L +Y I + FD+QV +
Sbjct: 302 KHNPYYNISIVGA----MAGGKTFSTKFSAVVDSGTSFTALSDPMYTEITSAFDKQVKEK 357
Query: 336 ITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG-TQVVTGFCLAI 393
+ P++ CY SS+ P++ L + F V +P+ I + G+CLAI
Sbjct: 358 RNPADSSLPFEYCYTISSKGAVSPPNISLTAKGGSVFPVKDPIITITDISSSPVGYCLAI 417
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPG-PGTPSNPLP 452
+G + IG+NFM+G +VVFDRE L LGW NC ++ TK P++P P P+
Sbjct: 418 MKSEG-VNLIGENFMSGLKVVFDRERLVLGWKSFNCYSVDHSTKLPVSPNSSAIPPKPVS 476
Query: 453 ANQEQSSPGGHAVGPAVAGRAPSKPSTASTQL--ISSRSSSLKVLPFLLLLRL 503
+ P + +KPS+ S+ L SSR+ + L L L
Sbjct: 477 GPGSSNPEAAKRPSPNITQIDAAKPSSGSSTLFHFSSRTFFFTAITPLFLAIL 529
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 166/453 (36%), Positives = 239/453 (52%), Gaps = 39/453 (8%)
Query: 2 NRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKAL-GVSKNRNATSWPAKKSFE 60
++++ L W+ +++ +F+ K+ HRFS+ K G+++N WP K SFE
Sbjct: 3 SKLTFFFLLITIWVFSKTCKGR--VFTFKMHHRFSDSFKNWSGLTRN-----WPEKGSFE 55
Query: 61 YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN----------------------D 98
YY L D + + L S G+ T + + D
Sbjct: 56 YYAALAHRDQMLRGRRLSDADASLAFSDGNSTFRISSLGFLHYTTVELGTPGVKFMVALD 115
Query: 99 FGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 158
G DL W+PCDC RCAP + Y S D +L+ Y+P SSTSK ++C++ +C C
Sbjct: 116 TGSDLFWVPCDCSRCAPTHGASYAS-DFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGT 174
Query: 159 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 218
CPY + Y + TS+SG+LV+D+LHL + ++ + V+A V GCG QSG +LD
Sbjct: 175 FSSCPYIVSYVSAQTSTSGILVKDVLHLTT--EDGGREFVEAYVTFGCGQVQSGSFLDIA 232
Query: 219 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASN 278
AP+GL GLG+ +ISVPS+L++ GLI +SFSMCF D GRI FGD+G Q+ T F N
Sbjct: 233 APNGLFGLGMEKISVPSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPFNV-N 291
Query: 279 GKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 338
+ TY + V +G + L F A+ DSG+SFT++ Y ++ +F D
Sbjct: 292 PAHPTYNVTVTQARVG-TMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRP 350
Query: 339 FE-GYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
+ P++ CY S L PS+ L F V +P+ VI TQ +CLA+
Sbjct: 351 PDPRIPFEYCYDMSPDANASLVPSMSLTMKGGRHFTVYDPIIVI-STQNEIVYCLAVVK- 408
Query: 397 DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ IGQNFMTGYRVVFDRE L LGW +C
Sbjct: 409 STELNIIGQNFMTGYRVVFDREKLVLGWKKFDC 441
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 177/471 (37%), Positives = 244/471 (51%), Gaps = 43/471 (9%)
Query: 26 MFSTKLIHRFSEEVKAL-GVS-KNRNATSWPAKKSFEYYQVLLSSDV----QKQKMKTGP 79
+FS K+ HRFS+++K GVS K SWP K + EYY L D Q+ GP
Sbjct: 27 IFSFKMHHRFSDQLKNWSGVSGKFTLPDSWPVKGTIEYYAQLAFRDRFFRGQRLSEFDGP 86
Query: 80 --------QFQM------------------LFPSQGSKTMSLGNDFGCDLLWIPCDCVRC 113
F++ + G+K M + D G DL W+PCDC RC
Sbjct: 87 LAFSDGNSSFRISSLGFALFDVFFFFYTTVQLGTPGTKFM-VALDTGSDLFWVPCDCSRC 145
Query: 114 APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENT 173
AP S Y S D +L+ YSP SSTSK + C++ LC C CPY + Y + T
Sbjct: 146 APTEGSPYAS-DFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVVSYVSAET 204
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
S++G+L+ED+LHL + ++ +QA + GCG QSG +LD AP+GL GLG+ +ISV
Sbjct: 205 STTGILIEDLLHLKT--EHKHSEPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 262
Query: 234 PSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 293
PS+L++ GL+ NSFSMCF D GRI FGD+G Q+ T F N + Y I V + +
Sbjct: 263 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRV 321
Query: 294 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYKSSS 352
G++ L A+ DSG+SF++ +Y ++A F Q D P++ CY S
Sbjct: 322 GTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSP 380
Query: 353 QRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
L P + L F V +P+ VI TQ +CLA+ ++ IGQNFMTGY
Sbjct: 381 DANASLTPGISLTMKGGGPFPVYDPIIVI-STQNELIYCLAVVK-SAELNIIGQNFMTGY 438
Query: 412 RVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGT-PSNPLPANQEQSSPG 461
R+VFDRE L LGW +C D+ + + P+ P T P SSPG
Sbjct: 439 RIVFDREKLVLGWKKFDCYDIEEKSLFPMKPDVTTVPPAVAAGVGNHSSPG 489
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 180/510 (35%), Positives = 253/510 (49%), Gaps = 47/510 (9%)
Query: 2 NRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR--NATSWPAKKSF 59
R L + +AV + + + A+ F L HRFS V+ ++ A WPA+ +
Sbjct: 9 RRTGLLLAMAVVVVASLIAAADASSFGFDLHHRFSPVVRRWAEARGGPLAADQWPARGTP 68
Query: 60 EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN-------------------DFG 100
EYY L D ++ + G +L + G+ T G D G
Sbjct: 69 EYYSALSRHDRARRALAGGADDGLLTFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTG 128
Query: 101 CDLLWIPCDCVRCAPLSASYYNSLDR-DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 159
DL W+PCDC +CA + ++ D L YSP SSTSK ++C + LC C
Sbjct: 129 SDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQRNGCSAAT 188
Query: 160 Q-PCPYTMDYYTENTSSSGLLVEDILHLISG--GDNALKNSVQASVIIGCGMKQSGGYLD 216
CPY + Y + NTSSSG+LV+D+LHL G A ++QA V+ GCG Q+G +LD
Sbjct: 189 NGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLD 248
Query: 217 GV--APDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
G A DGL+GLG+G++SVPS LA +GL+ +SFSMCF D GR+ FGD G Q T
Sbjct: 249 GGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETP 308
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 333
F + TY + + +GS + F A++DSG+SFT+L Y +A +F+ QV+
Sbjct: 309 FTVRS-LNPTYNVSFTSIGVGSESVA-AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVS 366
Query: 334 DTITSF-----EGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG--TQV 385
+ +F + +P++ CY+ S +Q +P V L F V P F+ G T
Sbjct: 367 ERRVNFSSGSADPFPFEYCYRLSPNQTEVAMPDVSLTAKGGALFPVTQP-FIPVGDTTGR 425
Query: 386 VTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSNC------QDLNDGTK 437
G+CLAI D IG IGQNFMTG +VVFDRE LGW +C D DG+
Sbjct: 426 AVGYCLAIMRNDMAIGIDIIGQNFMTGLKVVFDRERSVLGWEKFDCYRNARVADAPDGSP 485
Query: 438 SPLTPGPGTPSNPLPANQEQSSPGGHAVGP 467
P + P+ P + S G P
Sbjct: 486 GPSSAPAAGPTKITPRQNDGSGSGYPGAAP 515
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 180/511 (35%), Positives = 247/511 (48%), Gaps = 76/511 (14%)
Query: 26 MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
+F+ + HR+SE VK S + WP K S EYY L D + + L
Sbjct: 25 IFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAGLA 84
Query: 86 PSQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAP---LSASY 120
S G+ T + + D G DL W+PCDC RC+ + +
Sbjct: 85 FSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFAS 144
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 180
+ D DL+ Y+P+ SSTSK ++C++ LC C CPY + Y + TS+SG+LV
Sbjct: 145 ALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILV 204
Query: 181 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
ED+LHL DN + V+A+VI GCG QSG +LD AP+GL GLG+ +ISVPS+L++
Sbjct: 205 EDVLHLTQPDDN--HDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSRE 262
Query: 241 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 300
G +SFSMCF +D GRI FGD+G Q T F N + TY I + +G++ L
Sbjct: 263 GFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV-NPSHPTYNITINQVRVGTT-LID 320
Query: 301 TSFKAIVDSGSSFTFLPKEVY--------------------------ETIAAEFDRQVND 334
F A+ DSG+SFT+L Y E +F QV D
Sbjct: 321 VEFTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVED 380
Query: 335 TITSFEG-YPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLA 392
+ P+ CY S L PS+ L + FVV +P+ +I TQ +CLA
Sbjct: 381 RRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIII-STQSELVYCLA 439
Query: 393 IQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLP 452
+ ++ IGQNFMTGYRVVFDRE L LGW S+C D+ D +N +P
Sbjct: 440 VVK-SAELNIIGQNFMTGYRVVFDREKLILGWKKSDCYDIEDH------------NNAIP 486
Query: 453 ANQEQSSPGGHAVGPAVAGRAPSKPSTASTQ 483
Q V PAVA P+T S++
Sbjct: 487 IGQHSD-----KVPPAVAAGLGDYPTTDSSR 512
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 162/446 (36%), Positives = 230/446 (51%), Gaps = 42/446 (9%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQM---L 84
S + HR+S V+ L + + P + EYY L D++++ + L
Sbjct: 26 SLDVHHRYSAAVRGLA----GHLRAPPPAGTAEYYAALAGHDLRRRSLAAAAGGGGAGNL 81
Query: 85 FPSQGSKTMSLGNDFG-----------------------CDLLWIPCDCVRCAPLSASYY 121
+ G+ T L NDFG DL W+PCDC++CAPL++ Y
Sbjct: 82 AFADGNDTYRL-NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDY 140
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
L D+ YSP SSTS+ + CS LCD C CPY++ Y +ENTSS G+LVE
Sbjct: 141 GDLKFDM--YSPRKSSTSRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVE 198
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+L+L + ++ QA + GCG QSG +L AP+GL+GLG+ SVPSLLA G
Sbjct: 199 DVLYLTT--ESGQSKITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKG 256
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF-LASNGKYITYIIGVETCCIGSSCLKQ 300
+ NSFSMCF +D GRI FGD G + Q T + Y Y I + +G
Sbjct: 257 IAANSFSMCFGEDGHGRINFGDTGSSDQLETPLNIYKQNPY--YNISITGAMVGGKSF-D 313
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYKSSSQRLPKLP 359
T F A+VDSG+SFT L +Y I + F+ QV ++ + P++ CY S+Q P
Sbjct: 314 TKFSAVVDSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSISAQGAVNPP 373
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVV-TGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
++ L + F VN P+ I T +CLAI +G + IG+NFM+G ++VFDRE
Sbjct: 374 NISLTAKGGSIFPVNGPIITITDTSSRPIAYCLAIMKSEG-VNLIGENFMSGLKIVFDRE 432
Query: 419 NLKLGWSHSNCQDLNDGTKSPLTPGP 444
L LGW NC + ++ +K P+ P
Sbjct: 433 RLVLGWKTFNCYNFDNSSKLPVNRNP 458
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 173/485 (35%), Positives = 248/485 (51%), Gaps = 46/485 (9%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV--QKQKMKTGPQFQML 84
F L HR+S+ VK + + P K S YY + D+ +K+ + L
Sbjct: 41 FGFDLHHRYSDPVKGM-----LSVDDLPEKGSLHYYASMAHRDILIHGRKLVSDNTSTPL 95
Query: 85 FPSQGSKT-------------MSLGN---------DFGCDLLWIPCDCVRCAPLSASYYN 122
G++T +S+G D G DL W+PCDC + +
Sbjct: 96 TFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFP 155
Query: 123 SLDR-DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
S ++ D N Y P+ASSTS+ + C++ LC + C + + CPY + Y + TSS+G+LVE
Sbjct: 156 SGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGVLVE 215
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+LHL + D+A ++ A +I GCG Q+G +LDG AP+GL GLG+ ISVPS LA+ G
Sbjct: 216 DLLHLTT--DDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREG 273
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
NSFSMCF +D GRI FGD G + Q T F + TY + + +G
Sbjct: 274 YTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQ-LHPTYNVSITKINVGGRD-ADL 331
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYK-SSSQRLPKLP 359
F AI DSG+SFT+L Y I+ F+ + +S P++ CY+ SS+Q ++P
Sbjct: 332 EFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEIP 391
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
+V L+ + F V +P+ ++ + +CLAI GD+ IGQNFMTGYR+VF+RE
Sbjct: 392 TVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVK-SGDVNIIGQNFMTGYRIVFNRER 450
Query: 420 LKLGWSHSNCQDLNDGTKSPLTP-GPGTPSNPLPANQEQSSPGG------HAVGPAVAGR 472
LGW S+C D D T P+ P PG P P A Q++ G P V
Sbjct: 451 NVLGWKASDCYDDMDTTTFPVDPISPGIP--PATAVNPQATAGSGNTTEVSGTPPPVGNN 508
Query: 473 APSKP 477
AP P
Sbjct: 509 APKLP 513
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 156/379 (41%), Positives = 215/379 (56%), Gaps = 19/379 (5%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC+ CAPL + Y L D YSP SSTS+ + CS LCDL ++C++
Sbjct: 122 DTGSDLFWVPCDCINCAPLVSPNYRDLKFD--TYSPQKSSTSRKVPCSSNLCDLQSACRS 179
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY+++Y ++NTSS+G+LVED+L+LI+ + V A + GCG Q+G +L
Sbjct: 180 ASSSCPYSIEYLSDNTSSTGVLVEDVLYLIT--EYGQPKIVTAPITFGCGRIQTGSFLGS 237
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF-LA 276
AP+GL+GLG+ ISVPSLLA G+ NSFSMCF D GRI FGD G + QQ T +
Sbjct: 238 AAPNGLLGLGMDSISVPSLLASEGVAANSFSMCFGDDGRGRINFGDTGSSDQQETPLNIY 297
Query: 277 SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 336
Y Y I + +GS T+F AIVDSG+SFT L +Y I + F+ QV D
Sbjct: 298 KQNPY--YNISITGAMVGSKSF-NTNFNAIVDSGTSFTALSDPMYSEITSSFNSQVQDKP 354
Query: 337 TSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY-GTQVVTGFCLAIQ 394
T + P++ CY S + P++ LM + F VN+P+ I +CLA+
Sbjct: 355 TQLDSSLPFEFCYSISPKGSVNPPNISLMAKGGSIFPVNDPIITITDDASNPMAYCLAVM 414
Query: 395 PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGP-GTPSNP--- 450
+G + IG+NFM+G +VVFDRE LGW NC +++ + P+ P P G P P
Sbjct: 415 KSEG-VNLIGENFMSGLKVVFDRERKVLGWKKFNCYSVDNSSNLPVNPNPSGVPPKPALG 473
Query: 451 ----LPANQEQSSPGGHAV 465
P + +SP G V
Sbjct: 474 PNSYTPEATKGTSPNGTQV 492
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 186/501 (37%), Positives = 261/501 (52%), Gaps = 56/501 (11%)
Query: 33 HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQMLFPSQGS 90
HRFS++V +GV P + S +YY+V+ D ++ +++ Q + F S G+
Sbjct: 39 HRFSDQV--VGVLP---GDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SDGN 92
Query: 91 KTMSLGN----------------------DFGCDLLWIPCDCVRCA-PLSASYYNSLDRD 127
+T+ + D G DL W+PCDC C L A +SLD
Sbjct: 93 ETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLD-- 150
Query: 128 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
LN YSP+ASSTS + C+ LC G C +P+ CPY + Y + TSS+G+LVED+LHL+
Sbjct: 151 LNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLV 210
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
S ++ ++ A V GCG Q+G + DG AP+GL GLGL +ISVPS+LAK G+ NSF
Sbjct: 211 S--NDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSF 268
Query: 248 SMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 307
SMCF D +GRI FGD+G Q+ T L + TY I V +G + F A+
Sbjct: 269 SMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAVF 326
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLP-KLPSVKLM 364
DSG+SFT+L Y I+ F+ D T+ P++ CY S + + P+V L
Sbjct: 327 DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLT 386
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
+S+ V +P+ VI + +CLAI ++ DI IGQNFMTGYRVVFDRE L LGW
Sbjct: 387 MKGGSSYPVYHPLVVI-PMKDTDVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLILGW 444
Query: 425 SHSNCQDLNDGTKSPLTPGPGTPSNPLPAN--QEQSSPGGHAVGPAVAGRAPSKPSTAST 482
S+C G S T LP+N + P + P +P+T++T
Sbjct: 445 KESDCY---TGETSART---------LPSNRSSSSARPPASSFDPEATNIPSQRPNTSTT 492
Query: 483 QLISSRSSSLKVLPFLLLLRL 503
S S SL + F +L L
Sbjct: 493 SAAYSLSISLSLFFFSILAIL 513
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 234/425 (55%), Gaps = 42/425 (9%)
Query: 33 HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQMLFPSQGS 90
HRFS++V +GV P + S +YY+V+ D ++ +++ Q + F S G+
Sbjct: 39 HRFSDQV--VGVLP---GDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SDGN 92
Query: 91 KTMSLGN----------------------DFGCDLLWIPCDCVRCA-PLSASYYNSLDRD 127
+T+ + D G DL W+PCDC C L A +SLD
Sbjct: 93 ETIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLD-- 150
Query: 128 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
LN YSP+ASSTS + C+ LC G C +P+ CPY + Y + TSS+G+LVED+LHL+
Sbjct: 151 LNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLV 210
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
S ++ ++ A V +GCG Q+G + DG AP+GL GLGL +ISVPS+LAK G+ NSF
Sbjct: 211 S--NDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSF 268
Query: 248 SMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 307
SMCF D +GRI FGD+G Q+ T L + TY I V + + F A+
Sbjct: 269 SMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVEGNT-GDLEFDAVF 326
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLP-KLPSVKLM 364
DSG+SFT+L Y I+ F+ D T+ P++ CY S + + P+V L
Sbjct: 327 DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLT 386
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
+S+ V +P+ VI + +CLAI ++ DI IGQNFMTGYRVVFDRE L LGW
Sbjct: 387 MKGGSSYPVYHPLVVI-PMKDTDVYCLAILKIE-DISIIGQNFMTGYRVVFDREKLILGW 444
Query: 425 SHSNC 429
S+C
Sbjct: 445 KESDC 449
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 175/485 (36%), Positives = 242/485 (49%), Gaps = 47/485 (9%)
Query: 27 FSTKLIHRFSEEVKALGVSKNR--NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQML 84
F L HRFS V+ ++ A WPA+ + EYY L D ++ + G +L
Sbjct: 36 FGFDLHHRFSPVVRRWAEARGGPLAADRWPARGTPEYYSALSRHDRARRALAGGADDGLL 95
Query: 85 FPSQGSKTMSLGN-------------------DFGCDLLWIPCDCVRCAPLSASYYNSLD 125
+ G+ T G D G DL W+PCDC +CA + ++ D
Sbjct: 96 TFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANATGPD 155
Query: 126 RD-LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDI 183
L YSP SSTS+ ++C + LC C CPY + Y + NTSSSG+LV+D+
Sbjct: 156 APPLRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDV 215
Query: 184 LHLISG--GDNALKNSVQASVIIGCGMKQSGGYLD--GVAPDGLIGLGLGEISVPSLLAK 239
LHL G A ++QA V+ GCG Q+G +LD G A DGL+GLG+G++SVPS LA
Sbjct: 216 LHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSALAA 275
Query: 240 AGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
+GL+ +SFSMCF D GR+ FGD G Q T F + TY + + IGS +
Sbjct: 276 SGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRS-LNPTYNVSFTSIGIGSESV 334
Query: 299 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF-----EGYPWKCCYK-SSS 352
F A++DSG+SFT+L Y +A +F+ QV++ +F + +P++ CY+ S +
Sbjct: 335 A-AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRLSPN 393
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNPVFVIYG--TQVVTGFCLAIQPVDGDIGT--IGQNFM 408
Q +P V L F V P F+ G T G+CLAI D IG IGQNFM
Sbjct: 394 QTEVAMPDVSLTAKGGALFPVTQP-FIPVGDTTGRAIGYCLAIMRNDMAIGIDIIGQNFM 452
Query: 409 TGYRVVFDRENLKLGWSHSNC------QDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGG 462
TG +VVFDRE LGW +C D DG+ P + P+ P + S G
Sbjct: 453 TGLKVVFDRERSVLGWEKFDCYRNARVADAPDGSPGPSSAPAAGPTKITPRQNDGSGSGY 512
Query: 463 HAVGP 467
P
Sbjct: 513 PGAAP 517
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 174/508 (34%), Positives = 252/508 (49%), Gaps = 52/508 (10%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQML 84
F+ + H +S V+ + S+P + + +YY ++ +D V +++ + L
Sbjct: 35 FTFNIHHLYSPAVRQI-----LPFHSFPDEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPL 89
Query: 85 FPSQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYN 122
G++T+ + D G DL W+PCDCV C ++
Sbjct: 90 TFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNTT 147
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 182
+ N YSP+ SSTSK + CS LC C +P CPY + Y ++NTSS+G LVED
Sbjct: 148 QGPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 207
Query: 183 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 242
ILHL + ++ V A + +GCG QSG +L AP+GL GLG+ +SVPS+LA AGL
Sbjct: 208 ILHLTT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGL 265
Query: 243 IRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 302
I NSFS+CF GRI FGD+G Q T F ++ TY + + +G +
Sbjct: 266 ISNSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGR-RHPTYNVSITQIGVGGH-ISDLD 323
Query: 303 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYK-SSSQRLPKLPS 360
I DSG+SFT+L Y A +F V + T P++ CY+ S +Q P
Sbjct: 324 VAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPL 383
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
+ L FV+N+P+ V+ T+ FCLAI D I IGQNFMTGY +VFDRE +
Sbjct: 384 MNLTMKGGGHFVINHPI-VLISTESKRLFCLAIARSD-SINIIGQNFMTGYHIVFDREKM 441
Query: 421 KLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTA 480
LGW SNC D + L GP P PA ++PG A+ P +A S +
Sbjct: 442 VLGWKESNCTGYEDENTNNLPVGP----TPTPA----AAPGTTAIKP----QANSNINNT 489
Query: 481 STQLISSRSSSLKV-LPFLLLLRLLVSA 507
+ + R S++ LP ++L L+S
Sbjct: 490 TQTIEKPRPSNISSKLPTSVILTFLISV 517
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 166/500 (33%), Positives = 254/500 (50%), Gaps = 61/500 (12%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRN----ATSWPAKKSF 59
+ + ++ V W+L + M L H+FS++ A+ ++RN A WP + +
Sbjct: 10 VLVMVHCCVLWMLATTFANALRM---DLFHKFSKQ--AIEAMRSRNGMDYAQDWPTEGTI 64
Query: 60 EYYQVLLSSDVQK-----QKMKTGPQFQMLFPSQGSKTMSLGN----------------- 97
E+ +L DV + +++ QG+ T L
Sbjct: 65 EFQTMLRDHDVARHTRTARRILAASSMDQYVLIQGNATEQLFGGGLHYSYIDIGTPNVQF 124
Query: 98 ----DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT 153
D G DLLWIPC+C CAPLSA + LN Y+PS SST+K + CS LC++ +
Sbjct: 125 LVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVLCSDPLCEMSS 184
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI--SGGDNALKNSVQASVIIGCGMKQS 211
+C P CPY ++Y + NTS+SG L ED ++ + SGG N V+ V +GCG Q+
Sbjct: 185 TCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGG-----NPVKLPVYLGCGKVQT 239
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
G L G AP+GL+GLG +ISVP+ LA G + +SFS+C SG + FGD+GPA Q++
Sbjct: 240 GSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEGPAAQRT 299
Query: 272 TSFLASNGKYI-TYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
T + + + TYI+ +++ +G++ L S A+ D+G+SFT+L K VY +D
Sbjct: 300 TPIIPKSVSMLDTYIVEIDSITVGNTNLLMAS-HALFDTGTSFTYLSKTVYPQFVQAYDA 358
Query: 331 QV-----NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF-VVNNPVFVIYGTQ 384
Q+ ND S W CY++S+ ++P V L NS VV+ ++
Sbjct: 359 QMSLPKWNDPRFS----KWDLCYQTSNTNF-QVPVVSLALSGGNSLDVVSGLKSIVDDNN 413
Query: 385 VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPG- 443
+ C+ + + IGQNFMT Y + ++R + +GW+ S+C D T S TPG
Sbjct: 414 AMIAVCVTVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGWTPSDCS--TDLTLSNSTPGS 471
Query: 444 -PGT--PSNPLPANQEQSSP 460
P P+ PLPA +SP
Sbjct: 472 VPAALPPTAPLPAVPRPASP 491
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 174/508 (34%), Positives = 252/508 (49%), Gaps = 52/508 (10%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQML 84
F+ + H +S V+ + S+P + + +YY ++ +D V +++ + L
Sbjct: 58 FTFNIHHLYSPAVRQI-----LPFHSFPDEGTLDYYAAMVRTDHFVHSRRLGQVQDHRPL 112
Query: 85 FPSQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYN 122
G++T+ + D G DL W+PCDCV C ++
Sbjct: 113 TFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNTT 170
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 182
+ N YSP+ SSTSK + CS LC C +P CPY + Y ++NTSS+G LVED
Sbjct: 171 QGPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 230
Query: 183 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 242
ILHL + ++ V A + +GCG QSG +L AP+GL GLG+ +SVPS+LA AGL
Sbjct: 231 ILHLTT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGL 288
Query: 243 IRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 302
I NSFS+CF GRI FGD+G Q T F ++ TY + + +G +
Sbjct: 289 ISNSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGR-RHPTYNVSITQIGVGGH-ISDLD 346
Query: 303 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYK-SSSQRLPKLPS 360
I DSG+SFT+L Y A +F V + T P++ CY+ S +Q P
Sbjct: 347 VAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPL 406
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
+ L FV+N+P+ V+ T+ FCLAI D I IGQNFMTGY +VFDRE +
Sbjct: 407 MNLTMKGGGHFVINHPI-VLISTESKRLFCLAIARSDS-INIIGQNFMTGYHIVFDREKM 464
Query: 421 KLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTA 480
LGW SNC D + L GP P PA ++PG A+ P +A S +
Sbjct: 465 VLGWKESNCTGYEDENTNNLPVGP----TPTPA----AAPGTTAIKP----QANSNINNT 512
Query: 481 STQLISSRSSSLKV-LPFLLLLRLLVSA 507
+ + R S++ LP ++L L+S
Sbjct: 513 TQTIEKPRPSNISSKLPTSVILTFLISV 540
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 153/376 (40%), Positives = 209/376 (55%), Gaps = 11/376 (2%)
Query: 89 GSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
G+K M + D G DL W+PCDC RCAP S Y S D +L+ YSP SSTSK + C++ L
Sbjct: 14 GTKFM-VALDTGSDLFWVPCDCSRCAPTEGSPYAS-DFELSVYSPKKSSTSKTVPCNNSL 71
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C C CPY + Y + TS++G+L+ED+LHL + +N +QA + GCG
Sbjct: 72 CAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKT--ENKHSEPIQAYITFGCGQ 129
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPAT 268
QSG +LD AP+GL GLG+ +ISVPS+L++ GL+ NSFSMCF D GRI FGD+G
Sbjct: 130 VQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRINFGDKGSLE 189
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
Q+ T F N + Y I V + +G++ L A+ DSG+SF++ +Y ++A F
Sbjct: 190 QEETPF-NLNQLHPNYNITVTSIRVGTT-LIDADITALFDSGTSFSYFTDPIYSKLSASF 247
Query: 329 DRQVND-TITSFEGYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVV 386
Q D P++ CY S L P + L F V +P+ VI TQ
Sbjct: 248 HAQTRDGRHPPNPRIPFEYCYNMSPDANASLTPGISLTMKGGGPFPVYDPIIVI-STQNE 306
Query: 387 TGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGT 446
+CLA+ ++ IGQNFMTGYR+VFDRE L LGW +C D+ + + P+ P T
Sbjct: 307 LIYCLAVVK-SAELNIIGQNFMTGYRIVFDREKLVLGWKKFDCYDIEEKSLFPMKPDVTT 365
Query: 447 -PSNPLPANQEQSSPG 461
P SSPG
Sbjct: 366 VPPAVAAGVGNHSSPG 381
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 254 bits (649), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 176/509 (34%), Positives = 248/509 (48%), Gaps = 60/509 (11%)
Query: 6 LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR--NATSWPAKKSFEYYQ 63
+ + + L + A +V F L HRFS V+ ++ A WPA+ S EYY
Sbjct: 15 VAVAIVAVSFLVAAGDASSVGF--DLHHRFSPVVRQWAEARGHPFAAQDWPARGSPEYYS 72
Query: 64 VLLSSD---VQKQKMKTGPQFQMLFPSQGSKTMSLGN-------------------DFGC 101
L D + ++ + G + F + +G+ D G
Sbjct: 73 ALSRHDRAVLSRRALADGADGLVTFAAGNDTLQYIGSLYYAVVEVGTPNATFLVALDTGS 132
Query: 102 DLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ- 160
DL W+PCDC +CA + A+ L YSP SSTSK ++C + LCD C
Sbjct: 133 DLFWVPCDCKQCASI-ANVTGQPATALRPYSPRESSTSKQVTCDNALCDRPNGCSAATNG 191
Query: 161 PCPYTMDYYTENTSSSGLLVEDILHLIS---GGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY + Y + NTS+SG+LV+D+LHL G ++QA V+ GCG Q+G +LDG
Sbjct: 192 SCPYEVQYLSANTSTSGVLVQDVLHLTRERPGAAAEAGEALQAPVVFGCGQVQTGTFLDG 251
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 276
A DGL+GLG +SVPS+LA +GL+ +SFSMCF D GRI FGD G + Q T F
Sbjct: 252 AAFDGLMGLGRENVSVPSVLASSGLVASDSFSMCFGDDGVGRINFGDSGSSGQGETPFTG 311
Query: 277 SNGKY-ITYI-IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND 334
Y +++ + VET + + F A++DSG+SFT+L Y +A F+ V +
Sbjct: 312 RRTLYNVSFTAVNVETKSVAA------EFAAVIDSGTSFTYLADPEYTELATNFNSLVRE 365
Query: 335 TITSF-----EGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG 388
T+F + +P++ CY +Q +P V L F V PV + + V G
Sbjct: 366 RRTNFSSGSADPFPFEYCYALGPNQTEALIPDVSLTTKGGARFPVTQPVIGVASGRTVVG 425
Query: 389 FCLAIQPVDGDIGT----IGQNFMTGYRVVFDRENLKLGWSHSNC------QDLNDGTKS 438
+CLAI + D+G IGQNFMTG +VVFDRE LGW +C D DG+ S
Sbjct: 426 YCLAI--MKNDLGVNFNIIGQNFMTGLKVVFDREKSVLGWEKFDCYKNARVADAPDGSPS 483
Query: 439 PLTPGPGTPSNPLPANQEQSSPGGHAVGP 467
P P+ P + SS G A P
Sbjct: 484 PAP--AADPTKITPRQNDGSSNGFPAAAP 510
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 176/514 (34%), Positives = 256/514 (49%), Gaps = 63/514 (12%)
Query: 27 FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLS-------------SDVQK 72
F + HRFS+ VK LG+ + P K S EYY + DV +
Sbjct: 39 FGFDIHHRFSDPVKGILGID------NIPDKGSREYYVAMAHRDRVFRGRRLADGGDVDQ 92
Query: 73 QKMKTGPQ---FQM-LFPSQGSKTMSLGN---------DFGCDLLWIPCDCVRCAPLSAS 119
+ + P +Q+ LF +S+G D G DL W+PC+C +C
Sbjct: 93 KLLTFSPDNTTYQISLFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCNCTKCVH-GIQ 151
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ-PCPYTMDYYTENTSSSGL 178
N Y SSTSK+++C+ LC+ T C + CPY ++Y +ENTS++G
Sbjct: 152 LSTGQKIAFNIYDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGF 211
Query: 179 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 238
LVED+LHLI+ D+ +++ + GCG Q+G +LDG AP+GL GLG+ ++SVPS+LA
Sbjct: 212 LVEDVLHLITDNDDQTQHA-NPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILA 270
Query: 239 KAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
K GL NSFSMCF D GRI FGD + Q + + TY I V +G +
Sbjct: 271 KQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNS- 329
Query: 299 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF---EGYPWKCCYKSSSQRL 355
F AI D+G+SFT+L Y+ I FD ++ SF + P++ CY + +
Sbjct: 330 ADLEFNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRTNQT 389
Query: 356 PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVF 415
++P++ L +++ V +P+ G CLA+ + ++ IGQNFMTGYR+VF
Sbjct: 390 IEVPNINLTMKGGDNYFVMDPIITSGGGNNGV-LCLAVLKSN-NVNIIGQNFMTGYRIVF 447
Query: 416 DRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVA--GRA 473
DREN+ LGW SNC D D S LP N+ + AV PA+A
Sbjct: 448 DRENMTLGWKESNCYD--DELSS------------LPVNRSHAP----AVSPAMAVNPEI 489
Query: 474 PSKPSTASTQLISSRSSSLK-VLPFLLLLRLLVS 506
S PS +L SS S + L F + + LL++
Sbjct: 490 QSNPSNGPQRLPSSHSFKKEPALAFTVAIILLLA 523
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 169/454 (37%), Positives = 244/454 (53%), Gaps = 44/454 (9%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQ 63
+ L + L W+L G F + HRFS++V +GV P + S +YY+
Sbjct: 12 MGLILMLVSSWVLDRCEGLGE--FGFEFHHRFSDQV--VGVLP---GDGLPNRDSSKYYR 64
Query: 64 VLLSSD--VQKQKMKTGPQFQMLFPSQGSKTMSLGN----------------------DF 99
V+ D ++ +++ + Q + F + G++T+ + D
Sbjct: 65 VMAHRDRLIRGRRLASEDQSLVTF-ADGNETIRVNALGFLHYANVTVGTPSDWFLVALDT 123
Query: 100 GCDLLWIPCDC-VRCA-PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
G DL W+PCDC C L A +SLD LN YSP+ASSTS + C+ LC C +
Sbjct: 124 GSDLFWLPCDCSTNCVRELKAPGGSSLD--LNIYSPNASSTSSKVPCNSTLCTRVDRCAS 181
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
P CPY + Y + TSS+G+LVED+LHL+S N+ ++A + +GCG+ Q+G + DG
Sbjct: 182 PLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNS--KPIRARITLGCGLVQTGVFHDG 239
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLAS 277
AP+GL GLGL +ISVPS+LAK G+ NSFSMCF D +GRI FGD+G Q+ T L
Sbjct: 240 AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETP-LNI 298
Query: 278 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 337
+ TY + V +G + F A+ D+G+SFT+L Y I+ F+ D
Sbjct: 299 RQPHPTYNVTVTQISVGGNT-GDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRY 357
Query: 338 SFEG-YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQP 395
+ P++ CY S +++ + P V L +S+ V +P+ V+ V +CLAI
Sbjct: 358 QTDSELPFEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVV-YCLAIMK 416
Query: 396 VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ DI IGQNFMTGYRVVFDRE L LGW S+C
Sbjct: 417 SE-DISIIGQNFMTGYRVVFDREKLILGWKESDC 449
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 170/424 (40%), Positives = 228/424 (53%), Gaps = 41/424 (9%)
Query: 98 DFGCDLLWIPCDCVRCA-PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL W+PCDC C L A +SLD LN YSP+ASSTS + C+ LC G C
Sbjct: 73 DTGSDLFWLPCDCTNCVRELKAPGGSSLD--LNIYSPNASSTSTKVPCNSTLCTRGDRCA 130
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+P+ CPY + Y + TSS+G+LVED+LHL+S ++ ++ A V GCG Q+G + D
Sbjct: 131 SPESDCPYQIRYLSNGTSSTGVLVEDVLHLVS--NDKSSKAIPARVTFGCGQVQTGVFHD 188
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 276
G AP+GL GLGL +ISVPS+LAK G+ NSFSMCF D +GRI FGD+G Q+ T L
Sbjct: 189 GAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETP-LN 247
Query: 277 SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 336
+ TY I V +G + F A+ DSG+SFT+L Y I+ F+ D
Sbjct: 248 IRQPHPTYNITVTKISVGGNT-GDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKR 306
Query: 337 --TSFEGYPWKCCYKSSSQRLP-------------KLPSVKLMFPQNNSFVVNNPVFVIY 381
T+ P++ CY + RLP + P+V L +S+ V +P+ VI
Sbjct: 307 YQTTDSELPFEYCY---ALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPVYHPLVVI- 362
Query: 382 GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLT 441
+ +CLAI ++ DI IGQNFMTGYRVVFDRE L LGW S+C G S T
Sbjct: 363 PMKDTDVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLILGWKESDCY---TGETSART 418
Query: 442 PGPGTPSNPLPAN--QEQSSPGGHAVGPAVAGRAPSKPSTASTQLISSRSSSLKVLPFLL 499
LP+N + P + P +P+T++T S S SL + F +
Sbjct: 419 ---------LPSNRSSSSARPPASSFDPEATNIPSQRPNTSTTSAAYSLSISLSLFFFSI 469
Query: 500 LLRL 503
L L
Sbjct: 470 LAIL 473
>gi|388505672|gb|AFK40902.1| unknown [Lotus japonicus]
Length = 207
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 123/203 (60%), Positives = 147/203 (72%), Gaps = 1/203 (0%)
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 360
TSFKA VDSG+SFTFLP Y I EFD+QVN + +SFEG PW+ CY SSS++LPK+PS
Sbjct: 2 TSFKAQVDSGTSFTFLPGHAYGAITEEFDKQVNASRSSFEGSPWEYCYPSSSEQLPKVPS 61
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
+ LMF QNNSFVV NPVF Y Q V GFCLAIQP +GD+GTIGQNFMTGYR+VFDREN
Sbjct: 62 LTLMFQQNNSFVVYNPVFTFYDNQGVVGFCLAIQPTEGDMGTIGQNFMTGYRLVFDRENK 121
Query: 421 KLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTA 480
L WS SNCQDL+ G + PL+P T S PLP +++Q + GHAV PA+AGRA KPS A
Sbjct: 122 NLAWSPSNCQDLSLGKRMPLSPPNKTSSAPLPTDEQQRT-NGHAVAPAIAGRASPKPSAA 180
Query: 481 STQLISSRSSSLKVLPFLLLLRL 503
+++IS + FLL L
Sbjct: 181 PSRIISCQVHYWHSYWFLLFQLL 203
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 166/462 (35%), Positives = 226/462 (48%), Gaps = 40/462 (8%)
Query: 28 STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S + HRFS ++ ++ R WPA S Y L D + G P
Sbjct: 31 SLEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGGSSSDAPP 90
Query: 87 ---SQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYY 121
++G+ T+ + N D G DL W+PC C C P + +
Sbjct: 91 LTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAAS 150
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
S Y P SSTSK + C+ CDL C Q CPY M Y + TSSSG LVE
Sbjct: 151 GSFQATF--YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVE 207
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+L+L + +NA ++A +++GCG Q+G +LD AP+GL GLG+ E+SVPS+LA+ G
Sbjct: 208 DVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKG 265
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
L NSFSMCF +D GRI FGDQ + Q+ T L N ++ TY I + +G+
Sbjct: 266 LTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDM 323
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK--SSSQRLPKL 358
F I D+G+SFT+L Y I F QV + + P++ CY SS R P +
Sbjct: 324 DFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP-I 382
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
P + L + F V +P VI + +CLAI + IGQNFMTG RVVFDRE
Sbjct: 383 PDIILRTVTGSMFPVIDPGQVISIQEHEYVYCLAIVK-SMKLNIIGQNFMTGLRVVFDRE 441
Query: 419 NLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSP 460
LGW NC D + + +PL+ S P+ E SP
Sbjct: 442 RKILGWKKFNCYDTD--SSNPLSINSRNSSGFSPSTSENYSP 481
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 155/431 (35%), Positives = 215/431 (49%), Gaps = 43/431 (9%)
Query: 27 FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
F + HRFS+ VK LGV P K + YY V+ D + +++
Sbjct: 30 FGFDIHHRFSDPVKEILGVH------DLPDKGTRLYYVVMAHRDRIFRGRRLAAAVHHSP 83
Query: 84 LFPSQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYY 121
L ++T +G D G DL W+PC+C +C S
Sbjct: 84 LTFVPANETYQIGAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGVES-- 141
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
N N Y SSTS+ + C+ LC+L C + CPY ++Y + TS++G LVE
Sbjct: 142 NGEKIAFNIYDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNGTSTTGFLVE 201
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+LHLI+ D + GCG Q+G +LDG AP+GL GLG+G SVPS+LAK G
Sbjct: 202 DVLHLITDDDET--KDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKEG 259
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
L NSFSMCF D GRI FGD Q T F + TY I V +G +
Sbjct: 260 LTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIVGGNA-ADL 317
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQV---NDTITSFEGYPWKCCYKSSSQRLPKL 358
F AI DSG+SFT L Y+ I F+ + + +S + P++ CY SS + +L
Sbjct: 318 EFHAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKTVEL 377
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
P + L ++++V +P+ I G + V CL + + ++ IGQNFMTGYR+VFDRE
Sbjct: 378 P-INLTMKGGDNYLVTDPIVTISG-EGVNLLCLGVLKSN-NVNIIGQNFMTGYRIVFDRE 434
Query: 419 NLKLGWSHSNC 429
N+ LGW SNC
Sbjct: 435 NMILGWRESNC 445
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 159/463 (34%), Positives = 225/463 (48%), Gaps = 88/463 (19%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKAL--GVSKNRNATSWPAKKSFEY 61
S ++++ + + +FS ++ HRFSE VK G A +WPAK SFEY
Sbjct: 3 FSWSVFIVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEY 62
Query: 62 YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKT-------------MSLGN---------DF 99
Y L D + + +L S G+ T +SLG D
Sbjct: 63 YAELAHRDRALRGRRLSDIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDT 122
Query: 100 GCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 159
G DL W+PCDC RCAP + Y S D +L+ Y+P SSTS+ ++C++ LC C
Sbjct: 123 GSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCNNSLCAHRNRCLGTF 181
Query: 160 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 219
CPY + Y + TS+SG+LVED+LHL + ++ + V+A V GCG Q+G +LD A
Sbjct: 182 SNCPYMVSYVSAETSTSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIAA 239
Query: 220 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG 279
P+GL GLGL +ISVPS+L+K G +SFSMCF D GRI FGD+G Q+ T F N
Sbjct: 240 PNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPF-NLNA 298
Query: 280 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+ TY I V +G++ L F A+ DSG+SFT+L +Y +
Sbjct: 299 LHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNV--------------- 342
Query: 340 EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD 399
L S +L++ C+A+ +
Sbjct: 343 ------------------LKSSELIY------------------------CMAVVR-SAE 359
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTP 442
+ IGQNFMTGYR++FDRE L LGW C D+ + + P+ P
Sbjct: 360 LNIIGQNFMTGYRIIFDREKLVLGWKEFECDDIEN-SSVPIRP 401
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 149/391 (38%), Positives = 202/391 (51%), Gaps = 27/391 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--C 155
D G +LLW+PCDC C S ++D LN YSP+ SSTS+ + C+ LC C
Sbjct: 80 DTGSNLLWLPCDCSSCVHSLRSPSGTVD--LNIYSPNTSSTSEKVPCNSTLCSQTQRDRC 137
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ + CPY + Y + TS++G +V+D+LHLIS D++ +V A + GCG Q+G +L
Sbjct: 138 PSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLIS--DDSQSKAVDAKITFGCGKVQTGSFL 195
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFL 275
G AP+GL GLG+ ISVPS LA G SFSMCF + GRI FGD+G Q TSF
Sbjct: 196 TGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNGIGRISFGDKGSTGQGETSFN 255
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
+ Y I + IG + AI DSG+SFT+L Y IA F++ V +T
Sbjct: 256 QGQPRSSLYNISITQTSIGGQA-SDLVYSAIFDSGTSFTYLNDPAYTLIAESFNKLVKET 314
Query: 336 ITSFEGYPWKCCYKSSS---------------QRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
S P+ CY S Q P +P+V L+ + F V +P+ ++
Sbjct: 315 RRSSTQVPFDYCYDIRSFISAQILPFSCAYANQTEPTIPAVTLVMSGGDYFNVTDPIVLV 374
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
+CL + GD+ IGQNFMTG+R+VFDRE + LGW SNC D D +
Sbjct: 375 QLADGSAVYCLGMIK-SGDVNIIGQNFMTGHRIVFDRERMILGWKPSNCYDNMDTNTLAV 433
Query: 441 TPG----PGTPSNPLPANQEQSSPGGHAVGP 467
+P P T NP SSP G + P
Sbjct: 434 SPNTAVPPATAVNPEAKQIPASSPPGGSHSP 464
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 162/509 (31%), Positives = 235/509 (46%), Gaps = 61/509 (11%)
Query: 27 FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
F + HRFS+++K LG+ P K + +YY V+ D + +++
Sbjct: 33 FGFDIHHRFSDQIKGMLGIDD------VPQKGTPQYYAVMAHRDRVFRGRRLAGADHHSP 86
Query: 84 LFPSQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYY 121
L + G+ T + + D G DL W+PCDC+ C
Sbjct: 87 LTFAAGNDTHQIASSGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCDCISCVHGGLRTR 146
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHR-LCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 180
N Y SSTS +SC++ C C + C Y +DY + +TSS G +V
Sbjct: 147 TGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVV 206
Query: 181 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
ED+LHLI+ D + GCG Q+G +L+G AP+GL GLG+ ISVPS+LA+
Sbjct: 207 EDVLHLITDDDQT--KDADTRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILARE 264
Query: 241 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 300
GLI NSFSMCF D +GRI FGD G Q+ T F + TY I + + S +
Sbjct: 265 GLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPFNVRK-LHPTYNITITKIIVEDS-VAD 322
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE----GYPWKCCYKSSSQRLP 356
F AI DSG+SFT++ Y I ++ +V S + P+ CY S +
Sbjct: 323 LEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQTI 382
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
++P + L + + V +P+ + + CL IQ D + IGQNFMTGY++VFD
Sbjct: 383 EVPFLNLTMKGGDDYYVMDPIIQVSSEEEGDLLCLGIQKSDS-VNIIGQNFMTGYKIVFD 441
Query: 417 RENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSK 476
R+N+ LGW +NC D SN P N SP AV PA+A
Sbjct: 442 RDNMNLGWKETNCSD-------------DVLSNTSPINTPSHSP---AVSPAIA----VN 481
Query: 477 PSTASTQLISSRSSSLKVLPFLLLLRLLV 505
P S I+ + S + P + +L+
Sbjct: 482 PVARSNPSINPPNRSFMIKPTFTFVVVLL 510
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 171/510 (33%), Positives = 251/510 (49%), Gaps = 63/510 (12%)
Query: 27 FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
F + HRFS+ VK LGV P K + +YY + D + +++ G +
Sbjct: 30 FGFDIHHRFSDPVKEILGVHD------LPDKGTRQYYVAMAHRDRIFRGRRLAAGYHSPL 83
Query: 84 LF-PSQGS-----------KTMSLGN---------DFGCDLLWIPCDCVRCAPLSASYYN 122
F PS + +S+G D G DL W+PC+C +C N
Sbjct: 84 TFIPSNETYQIEAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVH-GIGLSN 142
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 182
N Y SSTS+ + C+ LC+L C + CPY ++Y + TS++G LVED
Sbjct: 143 GEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGTSTTGFLVED 202
Query: 183 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 242
+LHLI+ D + + GCG Q+G +LDG AP+GL GLG+ SVPS+LAK GL
Sbjct: 203 VLHLITDDDKTKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMSNESVPSILAKEGL 260
Query: 243 IRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 302
NSFSMCF D GRI FGD Q T F + TY I V +G +
Sbjct: 261 TSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIVGEK-VDDLE 318
Query: 303 FKAIVDSGSSFTFLPKEVYETIAAEFDRQV---NDTITSFEGYPWKCCYKSSSQRLPKLP 359
F AI DSG+SFT+L Y+ I F+ ++ + +S P++ CY+ S + +L
Sbjct: 319 FHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYELSPNQTVEL- 377
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
S+ L ++++V +P+ + G + + CL + + ++ IGQNFMTGYR+VFDREN
Sbjct: 378 SINLTMKGGDNYLVTDPIVTVSG-EGINLLCLGVLKSN-NVNIIGQNFMTGYRIVFDREN 435
Query: 420 LKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPST 479
+ LGW SNC D T LP N+ + A+ PA+A P S+
Sbjct: 436 MILGWRESNCYDDELST--------------LPINRSNTP----AISPAIAVN-PEARSS 476
Query: 480 ASTQLISSRSSSLKVLP---FLLLLRLLVS 506
S + S + S K+ P F++ L +L++
Sbjct: 477 QSNNPVLSPNLSFKIKPTSAFMMALFVLLA 506
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 241 bits (614), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 178/488 (36%), Positives = 240/488 (49%), Gaps = 50/488 (10%)
Query: 31 LIHRFSEEVKALGVSKNRNATSWPAKK----SFEYYQVLLSSD---------VQKQKMKT 77
L HR+S V+ + SWPA S EYY L D Q + T
Sbjct: 31 LHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDGLVT 90
Query: 78 GPQFQMLFPSQGS------------KTMSLGNDFGCDLLWIPCDCVRCAPLS--ASYYNS 123
+ GS T + D G DL W+PCDC +CAPL +
Sbjct: 91 FADGNITLRLDGSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGG 150
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
+L +YSPS SSTSK ++C+ LCD +C CPY + Y NTSSSG LVED+
Sbjct: 151 GGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDV 210
Query: 184 LHLI---SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
L+L A +V+ V+ GCG Q+G +LDG A DGL+GLG+ ++SVPS+LA
Sbjct: 211 LYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILAST 270
Query: 241 GLIR-NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 299
G+++ NSFSMCF KD GRI FGD G A Q T F+ + + Y I + + +G L
Sbjct: 271 GVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKS-THSYYNISITSMSVGDKNLP 329
Query: 300 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCYK-SSS 352
F AI DSG+SFT+L Y F+ Q+++ +F G +P++ CY S
Sbjct: 330 -LGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPD 388
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNPVFVIY-----GTQVVTGFCLAIQPVDGDIGTIGQNF 407
Q +LP V L F V +PV+ I G + G+CLA+ D I IGQNF
Sbjct: 389 QTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNF 448
Query: 408 MTGYRVVFDRENLKLGWSHSNC---QDLNDG--TKSPLTPGPGTPSNPLPANQEQSSPGG 462
MTG +VVF+RE LGW +C + + D + +P PG ++ P QE SP G
Sbjct: 449 MTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQESDSPAG 508
Query: 463 HAVGPAVA 470
P A
Sbjct: 509 RTPIPGAA 516
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 178/488 (36%), Positives = 240/488 (49%), Gaps = 50/488 (10%)
Query: 31 LIHRFSEEVKALGVSKNRNATSWPAKK----SFEYYQVLLSSD---------VQKQKMKT 77
L HR+S V+ + SWPA S EYY L D Q + T
Sbjct: 31 LHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDGLVT 90
Query: 78 GPQFQMLFPSQGS------------KTMSLGNDFGCDLLWIPCDCVRCAPLS--ASYYNS 123
+ GS T + D G DL W+PCDC +CAPL +
Sbjct: 91 FADGNITLRLDGSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGG 150
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
+L +YSPS SSTSK ++C+ LCD +C CPY + Y NTSSSG LVED+
Sbjct: 151 GGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDV 210
Query: 184 LHLI---SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
L+L A +V+ V+ GCG Q+G +LDG A DGL+GLG+ ++SVPS+LA
Sbjct: 211 LYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILAST 270
Query: 241 GLIR-NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 299
G+++ NSFSMCF KD GRI FGD G A Q T F+ + + Y I + + +G L
Sbjct: 271 GVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKS-THSYYNISITSMSVGDKNLP 329
Query: 300 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCYK-SSS 352
F AI DSG+SFT+L Y F+ Q+++ +F G +P++ CY S
Sbjct: 330 -LGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPD 388
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNPVFVIY-----GTQVVTGFCLAIQPVDGDIGTIGQNF 407
Q +LP V L F V +PV+ I G + G+CLA+ D I IGQNF
Sbjct: 389 QTTVELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNF 448
Query: 408 MTGYRVVFDRENLKLGWSHSNC---QDLNDG--TKSPLTPGPGTPSNPLPANQEQSSPGG 462
MTG +VVF+RE LGW +C + + D + +P PG ++ P QE SP G
Sbjct: 449 MTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQESDSPAG 508
Query: 463 HAVGPAVA 470
P A
Sbjct: 509 RTPIPGAA 516
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 173/502 (34%), Positives = 245/502 (48%), Gaps = 46/502 (9%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S + HRFS V+ S+ WP+ F Y L D + G + + F
Sbjct: 24 SLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRHRALSAAGGRPPLTF- 82
Query: 87 SQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYNSL 124
S+G+ T+ + N D G DL W+PC C C + ++
Sbjct: 83 SEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPPPSSAA 139
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
+ Y PS SSTS+ + C+ C L C CPY M Y + +TSSSG LVED+L
Sbjct: 140 SAPASFYIPSLSSTSQAVPCNSDFCGLRKECSKTSS-CPYKMVYVSADTSSSGFLVEDVL 198
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
+L + ++ ++A ++ GCG Q+G +LD AP+GL GLG+ ISVPS+LA+ GL
Sbjct: 199 YLST--EDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTS 256
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF +D GRI FGDQG + Q+ T L N K+ TY I + +G++ L
Sbjct: 257 NSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEVS 314
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSVK 362
I D+G+SFT+L Y I F QV + + P++ CY SSS+ + PS+
Sbjct: 315 TIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSIS 374
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
L + F +P VI Q +CLAI + IGQNFMTG RVVFDRE L
Sbjct: 375 LRTVGGSLFPAIDPGQVISIQQHEYVYCLAIVK-STKLNIIGQNFMTGVRVVFDRERKIL 433
Query: 423 GWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAST 482
GW NC D + + TP N P QE +P AG + + ++S
Sbjct: 434 GWKKFNCYDTDSLNPLSINSRNSTPENYSP--QETKNP---------AGASQLRHVSSSP 482
Query: 483 QLISSRSSSLKVLPFLLLLRLL 504
L+ ++SL ++ F+LL L+
Sbjct: 483 PLVWWHNNSLLLMMFVLLHLLI 504
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 160/431 (37%), Positives = 214/431 (49%), Gaps = 40/431 (9%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S + HRFS ++ ++ R WPA S Y L D + G P
Sbjct: 30 SLEFHHRFSAPLRRWAEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGGGGSGTPP 89
Query: 87 ---SQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYY 121
++G+ T+ + N D G DL W+PC C C P + +
Sbjct: 90 LTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAAS 149
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
S Y P SSTSK + C+ CDL C Q CPY M Y + TSSSG LVE
Sbjct: 150 GSA----TFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVE 204
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+L+L + +NA ++A +++GCG Q+G +LD AP+GL GLG+ E+SVPS+LA+ G
Sbjct: 205 DVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKG 262
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
L NSFSMCF +D GRI FGDQG + Q+ T L N ++ TY I + IG+
Sbjct: 263 LTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LNINQQHPTYAITISGITIGNKP-TDL 320
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCY--KSSSQRLPKL 358
F I D+G+SFT+L Y I F QV + + P++ CY SS R P +
Sbjct: 321 DFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP-I 379
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
P + L + F V +P VI + +CLAI + IGQNFMTG RVVFDRE
Sbjct: 380 PDIILRTVSGSLFPVIDPGQVISIQEHEYVYCLAIVK-SRKLNIIGQNFMTGLRVVFDRE 438
Query: 419 NLKLGWSHSNC 429
LGW NC
Sbjct: 439 RKILGWKKFNC 449
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 174/502 (34%), Positives = 243/502 (48%), Gaps = 46/502 (9%)
Query: 28 STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S + HRFS V+ S+ WP+ F Y L D + G + + F
Sbjct: 24 SLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRHRALSAAGGRPPLTF- 82
Query: 87 SQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYNSL 124
S+G+ T+ + N D G DL W+PC C C + ++
Sbjct: 83 SEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPPPSSAA 139
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
+ Y PS SSTS+ + C+ C L C CPY M Y + +TSSSG LVED+L
Sbjct: 140 SAPASFYIPSLSSTSQAVPCNSDFCGLRKECSKTSS-CPYKMVYVSADTSSSGFLVEDVL 198
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
+L + ++ ++A ++ GCG Q+G +LD AP+GL GLG+ ISVPS+LA+ GL
Sbjct: 199 YLST--EDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTS 256
Query: 245 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
NSFSMCF +D GRI FGDQG + Q+ T L N K+ TY I + +G++ L
Sbjct: 257 NSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEVS 314
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSVK 362
I D+G+SFT+L Y I F QV + + P++ CY SSS+ + PS+
Sbjct: 315 TIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSIS 374
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
L + F +P VI Q +CLAI + IGQNFMTG RVVFDRE L
Sbjct: 375 LRTVGGSLFPAIDPGQVISIQQHEYVYCLAIVK-STKLNIIGQNFMTGVRVVFDRERKIL 433
Query: 423 GWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTAST 482
GW NC D + + TP N P QE +P G + G S P
Sbjct: 434 GWKKFNCYDTDSLNPLSINSRNSTPENYSP--QETKNP----AGASQLGHVSSSPP---- 483
Query: 483 QLISSRSSSLKVLPFLLLLRLL 504
L+ ++SL ++ F+LL L+
Sbjct: 484 -LVWWHNNSLLLMMFVLLHLLI 504
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 157/429 (36%), Positives = 212/429 (49%), Gaps = 38/429 (8%)
Query: 28 STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S + HRFS ++ ++ R WPA S Y L D + G P
Sbjct: 31 SLEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGGSSSDAPP 90
Query: 87 ---SQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYY 121
++G+ T+ + N D G DL W+PC C C P + +
Sbjct: 91 LTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAAS 150
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
S Y P SSTSK + C+ CDL C Q CPY M Y + TSSSG LVE
Sbjct: 151 GSA----TFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVE 205
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+L+L + +NA ++A +++GCG Q+G +LD AP+GL GLG+ E+SVPS+LA+ G
Sbjct: 206 DVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKG 263
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
L NSFSMCF +D GRI FGDQ + Q+ T L N ++ TY I + +G+
Sbjct: 264 LTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDM 321
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYKSSSQRLPKLPS 360
F I D+G+SFT+L Y I F QV + + P++ CY S R P +P
Sbjct: 322 DFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSEARFP-IPD 380
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
+ L + F V +P VI + +CLAI + IGQNFMTG RVVFDRE
Sbjct: 381 IILRTVTGSMFPVIDPGQVISIQEHEYVYCLAIVK-SMKLNIIGQNFMTGLRVVFDRERK 439
Query: 421 KLGWSHSNC 429
LGW NC
Sbjct: 440 ILGWKKFNC 448
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 237 bits (605), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 153/395 (38%), Positives = 210/395 (53%), Gaps = 36/395 (9%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T + D G DL W+PC C C P +++ S Y PS SSTS+ + C+ + C+
Sbjct: 127 QTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASF----YIPSMSSTSQAVPCNSQFCE 182
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
L C Q CPY M Y + +TSSSG LVED+L+L + ++A+ ++A ++ GCG Q
Sbjct: 183 LRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST--EDAIPQILKAQILFGCGQVQ 239
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
+G +LD AP+GL GLG+ IS+PS+LA+ GL NSF+MCF +D GRI FGDQG + Q+
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQE 299
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
T L N ++ TY I + +G+S L F I D+G+SFT+L Y I F
Sbjct: 300 ETP-LDVNPQHPTYTISISEITVGNS-LTDLEFSTIFDTGTSFTYLADPAYTYITQSFHA 357
Query: 331 QVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG 388
QV+ + + P++ CY SSS+ + PS+ L + F V + VI Q
Sbjct: 358 QVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFPVIDEGQVISIQQHEYV 417
Query: 389 FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPS 448
+CLAI + IGQNFMTG RVVFDRE LGW NC D + S
Sbjct: 418 YCLAIVK-SAKLNIIGQNFMTGLRVVFDRERKILGWKKFNCYDTDS-------------S 463
Query: 449 NPLPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQ 483
NPL N SS G +PS P S +
Sbjct: 464 NPLSINSRNSS-----------GFSPSAPENYSPE 487
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 237 bits (605), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 203/371 (54%), Gaps = 25/371 (6%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T + D G DL W+PC C C P +++ S Y PS SSTS+ + C+ + C+
Sbjct: 127 QTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASF----YIPSMSSTSQAVPCNSQFCE 182
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
L C Q CPY M Y + +TSSSG LVED+L+L + ++A+ ++A ++ GCG Q
Sbjct: 183 LRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST--EDAIPQILKAQILFGCGQVQ 239
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
+G +LD AP+GL GLG+ IS+PS+LA+ GL NSF+MCF +D GRI FGDQG + Q+
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQE 299
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
T L N ++ TY I + +G+S L F I D+G+SFT+L Y I F
Sbjct: 300 ETP-LDVNPQHPTYTISISEITVGNS-LTDLEFSTIFDTGTSFTYLADPAYTYITQSFHA 357
Query: 331 QVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG 388
QV+ + + P++ CY SSS+ + PS+ L + F V + VI Q
Sbjct: 358 QVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFPVIDEGQVISIQQHEYV 417
Query: 389 FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPS 448
+CLAI + IGQNFMTG RVVFDRE LGW NC D + S
Sbjct: 418 YCLAIVK-SAKLNIIGQNFMTGLRVVFDRERKILGWKKFNCYDTDS-------------S 463
Query: 449 NPLPANQEQSS 459
NPL N SS
Sbjct: 464 NPLSINSRNSS 474
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 203/371 (54%), Gaps = 25/371 (6%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T + D G DL W+PC C C P +++ S Y PS SSTS+ + C+ + C+
Sbjct: 127 QTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASF----YIPSMSSTSQAVPCNSQFCE 182
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
L C Q CPY M Y + +TSSSG LVED+L+L + ++A+ ++A ++ GCG Q
Sbjct: 183 LRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST--EDAIPQILKAQILFGCGQVQ 239
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
+G +LD AP+GL GLG+ IS+PS+LA+ GL NSF+MCF +D GRI FGDQG + Q+
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQE 299
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
T L N ++ TY I + +G+S L F I D+G+SFT+L Y I F
Sbjct: 300 ETP-LDVNPQHPTYTISISEMTVGNS-LTDLEFSTIFDTGTSFTYLADPAYTYITQSFHA 357
Query: 331 QVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG 388
QV+ + + P++ CY SSS+ + PS+ L + F V + VI Q
Sbjct: 358 QVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFPVIDEGQVISIQQHEYV 417
Query: 389 FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPS 448
+CLAI + IGQNFMTG RVVFDRE LGW NC D + S
Sbjct: 418 YCLAIVK-SAKLNIIGQNFMTGLRVVFDRERKILGWKKFNCYDTDS-------------S 463
Query: 449 NPLPANQEQSS 459
NPL N SS
Sbjct: 464 NPLSINSRNSS 474
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 164/485 (33%), Positives = 234/485 (48%), Gaps = 44/485 (9%)
Query: 6 LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVL 65
+ + L+VF+L F + HRFS+ +K + S+ P K + YY +
Sbjct: 11 MLLVLSVFFLAGGLRSGHAASFKFTIHHRFSDSIKEIFGSE-----GLPEKHTPGYYAAM 65
Query: 66 LSSD--VQKQKMKTGPQFQMLFPSQGSKTMSL---GN-------------------DFGC 101
+ D + + + T L S G++T L GN D G
Sbjct: 66 VHRDRLLHGRNLATTNGDTPLMFSYGNETYELSGLGNLYYANVSIGTPGLYFLVALDTGS 125
Query: 102 DLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQP 161
DL W+PC+C +C P + ++ LN YS +ASSTS + CS LC+L C + K
Sbjct: 126 DLFWLPCECTKC-PTYLTKRDNGKFWLNHYSSNASSTSIRVPCSSSLCELANQCSSNKSS 184
Query: 162 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD 221
CPY Y +EN+SS+G LV+DILH+ + D++ V V +GCG Q+G + + AP+
Sbjct: 185 CPYQTHYLSENSSSAGYLVQDILHMAT--DDSQLKPVDVKVTLGCGKVQTGKFSNVTAPN 242
Query: 222 GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKY 281
GLIGLG+G++SVPS LA GL +SFSMCF GRI FGD GP Q+ T F ++ Y
Sbjct: 243 GLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYYGYGRIDFGDIGPVGQRETPFNPASLSY 302
Query: 282 ITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFE 340
I+ + I ++ AI+DSG+SFT+L Y I D + + I S
Sbjct: 303 NVTILQI----IVTNRPTNVHLTAIIDSGASFTYLTDPFYSIITENMDAAMELERIKSDS 358
Query: 341 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDI 400
+P++ CY+ S + + P++ F V +V T CLAI DI
Sbjct: 359 DFPFEYCYRLSLATIFQQPNLNFTMEGGRKFDVITS-YVSVDTDDGPALCLAIVK-STDI 416
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGT-----KSPLTPGPGTPSNPLPANQ 455
IG NF GYRVVF+RE + LGW +C + T P T S P +N
Sbjct: 417 NVIGHNFFGGYRVVFNREKMTLGWKEVDCDSYDANTSSDDSPPPSGDSSPTTSTPRKSNS 476
Query: 456 EQSSP 460
Q SP
Sbjct: 477 TQPSP 481
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 158/431 (36%), Positives = 213/431 (49%), Gaps = 40/431 (9%)
Query: 28 STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S + HRFS ++ ++ R WPA S Y L D + G P
Sbjct: 31 SLEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGGSSSDAPP 90
Query: 87 ---SQGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYY 121
++G+ T+ + N D G DL W+PC C C P + +
Sbjct: 91 LTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAAS 150
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
S Y P SSTSK + C+ CDL C Q CPY M Y + TSSSG LVE
Sbjct: 151 GSA----TFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVE 205
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+L+L + +NA ++A +++GCG Q+G +LD AP+GL GLG+ E+SVPS+LA+ G
Sbjct: 206 DVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKG 263
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
L NSFSMCF +D GRI FGDQ + Q+ T L N ++ TY I + +G+
Sbjct: 264 LTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDM 321
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK--SSSQRLPKL 358
F I D+G+SFT+L Y I F QV + + P++ CY SS R P +
Sbjct: 322 DFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP-I 380
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
P + L + F V +P VI + +CLAI + IGQNFMTG RVVFDRE
Sbjct: 381 PDIILRTVTGSMFPVIDPGQVISIQEHEYVYCLAIVK-SMKLNIIGQNFMTGLRVVFDRE 439
Query: 419 NLKLGWSHSNC 429
LGW NC
Sbjct: 440 RKILGWKKFNC 450
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 169/478 (35%), Positives = 238/478 (49%), Gaps = 60/478 (12%)
Query: 6 LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVL 65
+ + L+VF L + F + HRFS+ +K + S+ P K + YY +
Sbjct: 11 MLLVLSVFILAGSLRSGDAASFKFDIHHRFSDSIKGIFHSEG-----LPEKHTPGYYATM 65
Query: 66 LSSD--VQKQKMKTGPQFQMLFPSQGSKT-------------MSLGN---------DFGC 101
+ D V+ +++ L + G+ T +S+G D G
Sbjct: 66 VHRDRLVRGRRLAASDVDTQLTFAYGNDTAFIPDLGFLYYANVSVGTPSLDFLVALDTGS 125
Query: 102 DLLWIPCDCVRCAPLSASYYNSLDRD---LNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 158
DL W+PC+C C +Y N+ + LN YSP+ S+TS + C+ LC+ TS QN
Sbjct: 126 DLFWLPCECSSCF----TYLNTSNGGKFMLNHYSPNDSTTSSTVPCTSSLCNRCTSNQNV 181
Query: 159 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 218
CPY M Y + NTSS G LVED+LHL + D++L V+A + GCG Q+G +
Sbjct: 182 ---CPYEMRYLSANTSSIGYLVEDVLHLAT--DDSLLKPVEAKITFGCGTVQTGIFATTA 236
Query: 219 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASN 278
AP+GLIGLG+ +ISVPS LA GL NSFSMCF D GRI FGD GPA Q+ T F +
Sbjct: 237 APNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFGDTGPADQKQTPF-NTM 295
Query: 279 GKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 338
+Y +Y + +G F AI DSG+SFT+L + Y TI + D + S
Sbjct: 296 LEYQSYNVTFNVINVGGEP-NDVPFTAIFDSGTSFTYLTEPAYSTITKQMDAGMKLKRYS 354
Query: 339 FEG--YPWKCCYK--SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG------ 388
G +P++ CY+ ++ L ++ + F + +FV V T
Sbjct: 355 LFGPNFPFEYCYEIPPGAKEFQYL-TLNFTMKGGDEFTPTD-IFVFLPVDVSTMNIIFEE 412
Query: 389 ----FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTP 442
CLAI DI IGQNFMTGYR+ F+R+ + LGWS S+C D GT S TP
Sbjct: 413 TTHVACLAIAK-STDIDLIGQNFMTGYRITFNRDQMVLGWSSSDCYDNGVGTPSGDTP 469
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 149/373 (39%), Positives = 200/373 (53%), Gaps = 16/373 (4%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T + D G DL W+PC C C P + + S Y P SSTSK + C+ CD
Sbjct: 18 QTFMVALDTGSDLFWLPCQCDGCTPPATAASGSA----TFYIPGMSSTSKAVPCNSNFCD 73
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
L C Q CPY M Y + TSSSG LVED+L+L + +NA ++A +++GCG Q
Sbjct: 74 LQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLST--ENAHPQILKAQIMLGCGQTQ 130
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
+G +LD AP+GL GLG+ E+SVPS+LA+ GL NSFSMCF +D GRI FGDQ + Q+
Sbjct: 131 TGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQE 190
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
T L N ++ TY I + +G+ F I D+G+SFT+L Y I F
Sbjct: 191 ETP-LDINRQHPTYAITISGITVGNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHA 248
Query: 331 QVNDTITSFEG-YPWKCCYK--SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVT 387
QV + + P++ CY SS R P +P + L + F V +P VI +
Sbjct: 249 QVQANRHAADSRIPFEYCYDLSSSEARFP-IPDIILRTVTGSMFPVIDPGQVISIQEHEY 307
Query: 388 GFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTP 447
+CLAI + IGQNFMTG RVVFDRE LGW NC D + + +PL+
Sbjct: 308 VYCLAIVK-SMKLNIIGQNFMTGLRVVFDRERKILGWKKFNCYDTD--SSNPLSINSRNS 364
Query: 448 SNPLPANQEQSSP 460
S P+ E SP
Sbjct: 365 SGFSPSTSENYSP 377
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 161/490 (32%), Positives = 237/490 (48%), Gaps = 66/490 (13%)
Query: 27 FSTKLIHRFSEEV-KALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
F + HRFS+ V + LG+ N P K + +YY ++ D +++ +
Sbjct: 39 FGLDIHHRFSDPVTEILGIG---NDELLPHKGTPQYYAAMVHRDRVFHGRRLADDRDTPI 95
Query: 84 LFPSQGSKT-------------MSLGN---------DFGCDLLWIPCDCVRCAPLSASYY 121
F + G++T +S+G D G DL W+PC+C C
Sbjct: 96 TF-AAGNETHQIAAFGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCNCTSCV-RGLKTQ 153
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
N DLN Y SST K++ C+ +C T C + C Y ++Y + +TSSSG LVE
Sbjct: 154 NGKVIDLNIYELDKSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSNDTSSSGFLVE 212
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D+LHLI+ DN + + IGCG Q+G +L+G AP+GL GLG+ +SVPS+LA+ G
Sbjct: 213 DVLHLIT--DNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILAQKG 270
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
LI +SFSMCF D SGRI FGD G + Q T F + TY + + +G
Sbjct: 271 LISDSFSMCFGSDGSGRITFGDTGSSDQGKTPFNLRE-SHPTYNVTITQIIVGGYAADH- 328
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQV----NDTITSFEGYPWKCCYKSSSQRLPK 357
F AI DSG+SFT+L Y I+ +F+ V + ++ P++ CY S + +
Sbjct: 329 EFHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQTIE 388
Query: 358 LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIG------------- 404
+P + L + + V +P+ + CL IQ D ++ IG
Sbjct: 389 VPFLNLTMKGGDDYYVTDPIVPVSSEVEGNLLCLGIQKSD-NLNIIGREYTTEEEFLHLK 447
Query: 405 ---------QNFMTGYRVVFDRENLKLGWSHSNCQD--LNDGTKSPLTPG--PGTPSNPL 451
+NFMTGYR+VFDREN+ LGW SNC + L+ T +P P NP+
Sbjct: 448 HMIIKFFIQKNFMTGYRIVFDRENMNLGWKESNCTEEVLSIPTNKSHSPAISPAIAVNPV 507
Query: 452 PANQEQSSPG 461
+ S+PG
Sbjct: 508 ARSDPSSNPG 517
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 184/320 (57%), Gaps = 13/320 (4%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC++CAP + Y SL D+ YSP+ S+TS+ + CS LCDL +C++
Sbjct: 53 DTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRS 110
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY++ Y ++NTSSSG+LVED+L+L S D+A V A ++ GCG Q+G +L
Sbjct: 111 KSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSAQSKIVTAPIMFGCGQVQTGSFLGS 168
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFL 275
AP+GL+GLG+ SVPSLLA GL NSFSMCF D GRI FGD G + Q+ T +
Sbjct: 169 AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVY 228
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
N Y I G+ +GS + T F AIVDSG+SFT L +Y I + FD Q+ +
Sbjct: 229 KQNPYYNITITGI---TVGSKSIS-TEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSS 284
Query: 336 ITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGFCLAI 393
+ P++ CY S+ + P+V L + F VN+P+ I G+CLAI
Sbjct: 285 RNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAI 343
Query: 394 QPVDGDIGTIGQNFMTGYRV 413
+G G NF R+
Sbjct: 344 MKSEGVNLIGGYNFDESSRL 363
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 170/508 (33%), Positives = 235/508 (46%), Gaps = 59/508 (11%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S + HRFS ++ ++ Y L+ + + + + F S
Sbjct: 29 SLEFHHRFSARLRGWADARGHELPGGWPPPGGAAYVAALAGHDRHRALAAADHPPLTF-S 87
Query: 88 QGSKTMSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYNSLD 125
+G+ T+ + N D G DL W+PC C C P ++ S
Sbjct: 88 EGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGASGSA- 146
Query: 126 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 185
+ Y PS SSTS+ + C+ CD C CPY M Y + +TSSSG LVED+L+
Sbjct: 147 ---SFYIPSMSSTSQAVPCNSDFCDHRKDCSTTSS-CPYKMVYVSADTSSSGFLVEDVLY 202
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 245
L S DN ++A ++ GCG Q+G +LD AP+GL GLG+ ISVPS+LA GL +
Sbjct: 203 L-STEDNH-PQILKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSD 260
Query: 246 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 305
SFSMCF +D GRI FGDQG + Q+ T L N K+ TY I + +G+ + F
Sbjct: 261 SFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGITVGTEPM-DLEFST 318
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSVKL 363
I D+G++FT+L Y I F QV + + P++ CY SSS+ + P V
Sbjct: 319 IFDTGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPFEYCYDLSSSEARIQTPGVSF 378
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
+ F V + VI Q +CLAI + IGQNFMTG RVVFDRE LG
Sbjct: 379 RTVGGSLFPVIDLGQVISIQQHEYVYCLAIVK-STKLNIIGQNFMTGVRVVFDRERKILG 437
Query: 424 WSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQ 483
W NC D + +NPL N SS P+ +K +TQ
Sbjct: 438 WKKFNCYDTDS-------------TNPLSINSRNSS----GFSPSTYSPQETKNPAGATQ 480
Query: 484 LISSRSS-------SLKVLPFLLLLRLL 504
L SS + VL FLL+ +L
Sbjct: 481 LRHLNSSPPVMWHNNSLVLMFLLVHSVL 508
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 162/468 (34%), Positives = 230/468 (49%), Gaps = 70/468 (14%)
Query: 17 TESSGAETVMFSTKLIHRFSEEVK-----ALGVSKNRNATSW------PAKKSFEYYQVL 65
TE+SG L HRFS V+ A G +SW PA S EYY L
Sbjct: 24 TEASGG----IGFNLHHRFSPVVRQWMVDARGGGHGVPGSSWLLPEEAPAVGSPEYYSAL 79
Query: 66 LSSD----VQKQKMKTGPQFQ--MLFPSQGSKT------------MSLGN---------D 98
L D +++ + + Q L + G+ T + +G D
Sbjct: 80 LRHDRALFTRRRGLASAADGQSTTLTFADGNATRLDTYEYLHYAEVEVGTPSSKFLVALD 139
Query: 99 FGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 158
G DL W+PC+C CA ++ Y SPS SSTSK + C H LC+ +C
Sbjct: 140 TGSDLFWLPCECKLCAKNGSTMY----------SPSLSSTSKTVPCGHPLCERPDACATA 189
Query: 159 KQP---CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ CPY + Y + NT SSG+LVED+LHL+ GG +VQA ++ GCG Q+G +L
Sbjct: 190 GKSSSSCPYEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAFL 249
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSF 274
G A GL+GLGL ++SVPS LA +GL+ +SFSMCF +D GRI FGD G Q T
Sbjct: 250 RGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDAGSPDQAETPL 309
Query: 275 LASNGKYITYI-IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 333
+A+ +Y I V + S + F A+VDSG+SFT+L Y + F+ +V+
Sbjct: 310 IAAGSLQPSYYNISVGAITVDSKAMA-VEFTAVVDSGTSFTYLDDPAYTFLTTNFNSRVS 368
Query: 334 DTITSF-EGYP-WKCCYKSSSQR--LPKLPSVKLMFPQNNSFVVNNPVFVIY-----GTQ 384
+ ++ GY ++ CY+ S + + +LP++ L F + P+ + G
Sbjct: 369 EASETYGSGYEKFEFCYRLSPGQTSMKRLPAMSLTTKGGAVFPITWPIIPVLASTNGGPY 428
Query: 385 VVTGFCLAI---QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G+CL I + + TIGQNFMTG +VVFDR LGW +C
Sbjct: 429 HPIGYCLGIIKTSILSTEDATIGQNFMTGLKVVFDRRKSVLGWEKFDC 476
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 155/443 (34%), Positives = 227/443 (51%), Gaps = 44/443 (9%)
Query: 18 ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQK 74
E+SG FS ++ H FS+ VK +LG+ P K S EY++VL D ++ +
Sbjct: 24 EASGK----FSFEVHHMFSDRVKQSLGLDD-----LVPEKGSLEYFKVLAQRDRLIRGRG 74
Query: 75 MKTGPQFQMLFPSQGSKTMSL----------------------GNDFGCDLLWIPCDC-V 111
+ + + + +G++T+S+ D G DL W+PC+C
Sbjct: 75 LASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGS 134
Query: 112 RCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTE 171
C S R LN YSP+ SSTS + CS C + C +P CPY + Y ++
Sbjct: 135 TCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSK 194
Query: 172 NTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEI 231
+T ++G L ED+LHL++ D L+ V+A++ +GCG Q+G A +GL+GLGL +
Sbjct: 195 DTFTTGTLFEDVLHLVT-EDEGLE-PVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDY 252
Query: 232 SVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVE 289
SVPS+LAKA + NSFSMCF D GRI FGD+G Q T L + TY + V
Sbjct: 253 SVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS-PTYAVSVT 311
Query: 290 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCY 348
+G + A+ D+G+SFT L + Y I FD V D + P++ CY
Sbjct: 312 EVSVGGDAVG-VQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCY 370
Query: 349 KSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI-QPVDGDIGTIGQN 406
S + L P V + F + + NP+F+++ +CL I + VD I IGQN
Sbjct: 371 DLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQN 430
Query: 407 FMTGYRVVFDRENLKLGWSHSNC 429
FM+GYR+VFDRE + LGW S+C
Sbjct: 431 FMSGYRIVFDRERMILGWKRSDC 453
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 180/329 (54%), Gaps = 24/329 (7%)
Query: 128 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
LN YSP+ S+TS + C+ LC+ TS QN CPY M Y + NTSS G LVED+LHL
Sbjct: 3 LNHYSPNDSTTSSTVPCTSSLCNRCTSNQNV---CPYEMRYLSANTSSIGYLVEDVLHLA 59
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
+ D++L V+A + GCG Q+G + AP+GLIGLG+ +ISVPS LA GL NSF
Sbjct: 60 T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 117
Query: 248 SMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 307
SMCF D GRI FGD GPA Q+ T F + +Y +Y + +G F AI
Sbjct: 118 SMCFGADGYGRIDFGDTGPADQKQTPF-NTMLEYQSYNVTFNVINVGGEP-NDVPFTAIF 175
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG--YPWKCCYK--SSSQRLPKLPSVKL 363
DSG+SFT+L + Y TI + D + S G +P++ CY+ ++ L ++
Sbjct: 176 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYL-TLNF 234
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTG----------FCLAIQPVDGDIGTIGQNFMTGYRV 413
+ F + +FV V T CLAI DI IGQNFMTGYR+
Sbjct: 235 TMKGGDEFTPTD-IFVFLPVDVSTMNIIFEETTHVACLAIAK-STDIDLIGQNFMTGYRI 292
Query: 414 VFDRENLKLGWSHSNCQDLNDGTKSPLTP 442
F+R+ + LGWS S+C D GT S TP
Sbjct: 293 TFNRDQMVLGWSSSDCYDNGVGTPSGDTP 321
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 131/362 (36%), Positives = 196/362 (54%), Gaps = 34/362 (9%)
Query: 7 TIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLL 66
T++L +L +F+ ++ HRFS+EVK S R A +P K SFEY+ L+
Sbjct: 9 TLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFNALV 67
Query: 67 SSD--VQKQKMKTGPQFQM--LFPSQGSKT-------------MSLGN---------DFG 100
D ++ +++ L S G+ T + LG D G
Sbjct: 68 LRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTG 127
Query: 101 CDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ 160
DL W+PCDC +CAP + Y S + +L+ Y+P S+T+K ++C++ LC C
Sbjct: 128 SDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFS 186
Query: 161 PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP 220
CPY + Y + TS+SG+L+ED++HL + N + V+A V GCG QSG +LD AP
Sbjct: 187 TCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAP 244
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGK 280
+GL GLG+ +ISVPS+LA+ GL+ +SFSMCF D GRI FGD+G + Q+ T F N
Sbjct: 245 NGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPS 303
Query: 281 YITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI--AAEFDRQVNDTITS 338
+ Y I V +G++ L F A+ D+G+SFT+L +Y T+ +A+ R D+
Sbjct: 304 HPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESAQDKRHSPDSRIP 362
Query: 339 FE 340
FE
Sbjct: 363 FE 364
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 157/450 (34%), Positives = 227/450 (50%), Gaps = 46/450 (10%)
Query: 13 FWLLT--ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD 69
FW L E+SG FS ++ H FS+ VK LG+ P K S EY++VL D
Sbjct: 18 FWGLERCEASGK----FSFEVHHMFSDRVKQTLGLDD-----LVPEKGSLEYFKVLAQRD 68
Query: 70 --VQKQKMKTGPQFQMLFPSQGSKTMSL----------------------GNDFGCDLLW 105
++ + + + + + +G++T+S+ D G +L W
Sbjct: 69 RLIRGRGLASNNEETPITFMRGNRTVSIDFLGFLHYANVSVGTPATWFLVALDTGSNLFW 128
Query: 106 IPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPY 164
+PC+C C S R LN YSP+ SSTS + C+ C + C +P CPY
Sbjct: 129 LPCNCGSTCIRDLKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPY 188
Query: 165 TMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLI 224
+ Y +++T ++G L ED+LHL++ D LK V+A++ +GCG Q+G A +GL+
Sbjct: 189 QIQYLSKDTFTTGTLFEDVLHLVT-EDVDLK-PVKANITLGCGRNQTGFLQSSAAINGLL 246
Query: 225 GLGLGEISVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYI 282
GLG+ + SVPS+LAKA + NSFSMCF D GRI FGD+G Q T L +
Sbjct: 247 GLGMKDYSVPSILAKAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPTEPS-P 305
Query: 283 TYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-G 341
TY + V T + A+ D+G+SFT L + Y I FD V D +
Sbjct: 306 TYAVNV-TEVSVGGDVVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPE 364
Query: 342 YPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI-QPVDGD 399
P++ CY S L P V + F + + NP+F+++ +CL I + VD
Sbjct: 365 IPFEFCYDLSPNSTTILFPRVAMTFEGGSLMFLRNPLFIVWNEDNTAMYCLGILKSVDFK 424
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I IGQNFM+GYRVVFDRE + LGW S+C
Sbjct: 425 INIIGQNFMSGYRVVFDRERMILGWKRSDC 454
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 154/443 (34%), Positives = 224/443 (50%), Gaps = 54/443 (12%)
Query: 18 ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQK 74
E+SG FS ++ H FS+ VK +LG+ P K S EY++VL D ++ +
Sbjct: 24 EASGK----FSFEVHHMFSDRVKQSLGLDD-----LVPEKGSLEYFKVLAQRDRLIRGRG 74
Query: 75 MKTGPQFQMLFPSQGSKTMSL----------------------GNDFGCDLLWIPCDC-V 111
+ + + + +G++T+S+ D G DL W+PC+C
Sbjct: 75 LASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGS 134
Query: 112 RCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTE 171
C S R LN YSP+ SSTS + CS C + C +P CPY + Y ++
Sbjct: 135 TCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSK 194
Query: 172 NTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEI 231
+T ++G L ED+LHL++ D L+ V+A++ +GCG Q+G A +GL+GLGL +
Sbjct: 195 DTFTTGTLFEDVLHLVT-EDEGLE-PVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDY 252
Query: 232 SVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVE 289
SVPS+LAKA + NSFSMCF D GRI FGD+G Q T L + +G +
Sbjct: 253 SVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSVTEVSVGGD 312
Query: 290 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCY 348
+G L A+ D+G+SFT L + Y I FD V D + P++ CY
Sbjct: 313 A--VGVQLL------ALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCY 364
Query: 349 KSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI-QPVDGDIGTIGQN 406
S + L P V + F + + NP+F+ +CL I + VD I IGQN
Sbjct: 365 DLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIDNSAM----YCLGILKSVDFKINIIGQN 420
Query: 407 FMTGYRVVFDRENLKLGWSHSNC 429
FM+GYR+VFDRE + LGW S+C
Sbjct: 421 FMSGYRIVFDRERMILGWKRSDC 443
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 146/461 (31%), Positives = 225/461 (48%), Gaps = 50/461 (10%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S ++ HRFSE+VK + P S +YY+ L+ D +Q +
Sbjct: 22 LSFEIHHRFSEQVKTV-----LGGHGLPEMGSLDYYKALVHRDRGRQLTSNNNNQTTISF 76
Query: 87 SQGSKT----------MSLGN---------DFGCDLLWIPCDCVRCAPLSASYYNSLDRD 127
+QG+ T +++G D G DL W+PC+C S
Sbjct: 77 AQGNSTEEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIK 136
Query: 128 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
LN Y+PS S +S ++C+ LC L C +P CPY + Y + + S+G+LVED++H+
Sbjct: 137 LNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMS 196
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
+ A A + GC Q G + + VA +G++GL + +I+VP++L KAG+ +SF
Sbjct: 197 TEEGEAR----DARITFGCSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSF 251
Query: 248 SMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 307
SMCF + G I FGD+G + Q T L+ + Y + + +G + T F A
Sbjct: 252 SMCFGPNGKGTISFGDKGSSDQLETP-LSGTISPMFYDVSITKFKVGKVTV-DTEFTATF 309
Query: 308 DSGSSFTFLPKEVYETIAAEF-----DRQVNDTITSFEGYPWKCCY-KSSSQRLPKLPSV 361
DSG++ T+L + Y + F DR+++ ++ S P++ CY +S+ KLPSV
Sbjct: 310 DSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDS----PFEFCYIITSTSDEDKLPSV 365
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQ-VVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDREN 419
++ V +P+ V + +CLA+ + V+ D IGQNFMT YR+V DRE
Sbjct: 366 SFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRER 425
Query: 420 LKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSP 460
LGW SNC D N T GP + P P+ SSP
Sbjct: 426 RILGWKKSNCNDTNGFT------GPTALAKP-PSMAPTSSP 459
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 143/437 (32%), Positives = 217/437 (49%), Gaps = 41/437 (9%)
Query: 24 TVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQ 80
T F ++ H FS+ VK +LG+ P + S EY++VL D ++ + + +
Sbjct: 26 TGKFGFEVHHIFSDSVKQSLGLGD-----LVPEQGSLEYFKVLAHRDRLIRGRGLASNND 80
Query: 81 FQMLFPSQGSKTMS---LGN-------------------DFGCDLLWIPCDC-VRCAPLS 117
+ G+ T+S LG+ D G DL W+PC+C C
Sbjct: 81 ETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDL 140
Query: 118 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSG 177
LN Y+P+AS+TS + CS + C C +P CPY + Y + +T + G
Sbjct: 141 EDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNSTGTKG 199
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
L++D+LHL + +N V+A+V +GCG KQ+G + + +G++GLG+ SVPSLL
Sbjct: 200 TLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLL 257
Query: 238 AKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS 295
AKA + NSFSMCF + + GRI FGD+G Q+ T F+ S Y + + +
Sbjct: 258 AKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFI-SVAPSTAYGVNISGVSVAG 316
Query: 296 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYK-SSSQ 353
+ F A D+GSSFT L + Y + FD V D + P++ CY S +
Sbjct: 317 DPVDIRLF-AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNA 375
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI-QPVDGDIGTIGQNFMTGYR 412
+ P V++ F + ++NNP F + +CL + + V I IGQNF+ GYR
Sbjct: 376 TTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYR 435
Query: 413 VVFDRENLKLGWSHSNC 429
+VFDRE + LGW S C
Sbjct: 436 IVFDRERMILGWKQSLC 452
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 197 bits (502), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 140/453 (30%), Positives = 219/453 (48%), Gaps = 63/453 (13%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S ++ HRFSE+VK + P S +YY+ L+ D ++ Q + F
Sbjct: 32 LSFEIHHRFSEQVKTV-----LGGHGLPEMGSLDYYKALVHRDRGRRLTSNNNQTTISF- 85
Query: 87 SQGSKT-----------------------MSLGN---------DFGCDLLWIPCDC---- 110
+QG+ T +++G D G DL W+PC+C
Sbjct: 86 AQGNSTEEISLYDQNLAPPLFFNYLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTC 145
Query: 111 VRCAPLS--ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY 168
VR ++ N+ LN Y+PS S++S ++C+ LC L C +P CPY + Y
Sbjct: 146 VRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCNSTLCALRNRCISPLSDCPYRIRY 205
Query: 169 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 228
+ + S+G+LVED++H+ + A A + GC Q G + + VA +G++GL +
Sbjct: 206 LSPGSKSTGVLVEDVIHMSTEEGEAR----DARITFGCSETQLGLFQE-VAVNGIMGLAM 260
Query: 229 GEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGV 288
+I+VP++L KAG+ +SFSMCF + G I FGD+G + Q T L + Y + +
Sbjct: 261 ADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQHETP-LGGTISPLFYDVSI 319
Query: 289 ETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF-----DRQVNDTITSFEGYP 343
+G + +T F AI DSG++ T+L Y + F DR++ + S
Sbjct: 320 TKFKVGKVTV-ETKFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDS----T 374
Query: 344 WKCCY-KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQPVD-GDI 400
++ CY +S+ KLPS+ ++ V +P+ V + +CLA+ D D
Sbjct: 375 FEFCYIITSTSDEEKLPSISFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQDKADF 434
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLN 433
IGQNFMT YR+V DRE + LGW SNC D N
Sbjct: 435 NIIGQNFMTNYRIVHDRERMILGWKKSNCNDTN 467
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 146/439 (33%), Positives = 220/439 (50%), Gaps = 47/439 (10%)
Query: 27 FSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
F ++ H FS+ VK +LG+ P + S EY++VL D ++ + + + +
Sbjct: 29 FGFEVHHIFSDAVKQSLGLDD-----LVPEQGSLEYFKVLAHRDRLIRGRGLASNNEDTP 83
Query: 84 LFPSQGSKTMS---LGN-------------------DFGCDLLWIPCDC-VRCAPLSASY 120
+ G+ T+S LG+ D G DL W+PC+C C
Sbjct: 84 VTFDGGNLTVSIKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDI 143
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 180
LN Y+P+AS+TS + CS + C C +PK CPY + Y + +T ++G L+
Sbjct: 144 GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISY-SNSTGTTGTLL 202
Query: 181 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
+D+LHL + +N V+ +V +GCG KQ+G + + +G++GLG+ SVPSLLAKA
Sbjct: 203 QDVLHLATEDENL--TPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKA 260
Query: 241 GLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
+ +SFSMCF + + GRI FGD+G Q+ T F+ S Y + V +G +
Sbjct: 261 NITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFI-SVAPSTAYGLNVTGVSVGGDPV 319
Query: 299 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLP- 356
F A D+GSSFT L + Y + FD V D + P++ CY S
Sbjct: 320 GTRLF-AKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDLSPNATSI 378
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG-----FCLAI-QPVDGDIGTIGQNFMTG 410
+ P V++ F + ++NNP F TQ G +CL + + V I IGQNF+ G
Sbjct: 379 EFPFVEMTFVGGSKIILNNPFFTAR-TQARHGEGNVMYCLGVLKSVGLKINVIGQNFVAG 437
Query: 411 YRVVFDRENLKLGWSHSNC 429
YR+VFDRE + LGW S C
Sbjct: 438 YRIVFDRERMILGWKPSLC 456
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 155/467 (33%), Positives = 229/467 (49%), Gaps = 57/467 (12%)
Query: 4 ISLTIYLAVFWLLT--ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFE 60
+ L++ + +FW L E+SG FS ++ H FS+ VK LG P S E
Sbjct: 9 VLLSMLVLIFWGLERCEASGK----FSFEVHHMFSDVVKQTLGFDD-----LVPENGSLE 59
Query: 61 YYQVLLSSD--VQKQKMKTGPQFQMLFPSQGSK--------------TMSLGN------- 97
Y++VL D ++ + + + + + S GS +SLG
Sbjct: 60 YFKVLAHRDRFIRGRGLASNNE-ETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLV 118
Query: 98 --DFGCDLLWIPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS 154
D G DL W+PC+C C S LN Y+P+AS+TS + CS + C
Sbjct: 119 ALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGK 178
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C +P+ CPY + + NT ++G L++D+LHL++ D LK V A+V +GCG Q+G +
Sbjct: 179 CSSPESICPYQI-ALSSNTVTTGTLLQDVLHLVTE-DEDLK-PVNANVTLGCGQNQTGAF 235
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQST 272
+A +G++GL + E SVPSLLAKA + NSFSMCF + S GRI FGD+G Q+ T
Sbjct: 236 QTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEET 295
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 332
L S Y + V +G + F A+ D+GSSFT L + Y FD +
Sbjct: 296 P-LVSLETSTAYGVNVTGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLM 353
Query: 333 NDTITSFE-GYPWKCCYKSSSQRL-----PKLPSVKLMFPQNNSF---VVNNP-VFVIYG 382
D + +P++ CY + L P+ K P + F + N+ V Y
Sbjct: 354 EDKRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYS 413
Query: 383 TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +CL I ++ IGQN M+G+R+VFDRE + LGW SNC
Sbjct: 414 NEGTKMYCLGILK-SINLNIIGQNLMSGHRIVFDRERMILGWKQSNC 459
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 154/463 (33%), Positives = 226/463 (48%), Gaps = 57/463 (12%)
Query: 8 IYLAVFWLLT--ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQV 64
+ + +FW L E+SG FS ++ H FS+ VK LG P S EY++V
Sbjct: 1 MLVLIFWGLERCEASGK----FSFEVHHMFSDVVKQTLGFDD-----LVPENGSLEYFKV 51
Query: 65 LLSSD--VQKQKMKTGPQFQMLFPSQGSK--------------TMSLGN---------DF 99
L D ++ + + + + + S GS +SLG D
Sbjct: 52 LAHRDRFIRGRGLASNNE-ETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDT 110
Query: 100 GCDLLWIPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 158
G DL W+PC+C C S LN Y+P+AS+TS + CS + C C +P
Sbjct: 111 GSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSP 170
Query: 159 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 218
+ CPY + + NT ++G L++D+LHL++ D LK V A+V +GCG Q+G + +
Sbjct: 171 ESICPYQI-ALSSNTVTTGTLLQDVLHLVTE-DEDLK-PVNANVTLGCGQNQTGAFQTDI 227
Query: 219 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLA 276
A +G++GL + E SVPSLLAKA + NSFSMCF + S GRI FGD+G Q+ T L
Sbjct: 228 AVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETP-LV 286
Query: 277 SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 336
S Y + V +G + F A+ D+GSSFT L + Y FD + D
Sbjct: 287 SLETSTAYGVNVTGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLMEDKR 345
Query: 337 TSFE-GYPWKCCYKSSSQRL-----PKLPSVKLMFPQNNSF---VVNNP-VFVIYGTQVV 386
+ +P++ CY + L P+ K P + F + N+ V Y +
Sbjct: 346 RPVDPDFPFEFCYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGT 405
Query: 387 TGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+CL I ++ IGQN M+G+R+VFDRE + LGW SNC
Sbjct: 406 KMYCLGILK-SINLNIIGQNLMSGHRIVFDRERMILGWKQSNC 447
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/245 (42%), Positives = 150/245 (61%), Gaps = 7/245 (2%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W+PCDC +CAP + Y S + +L+ Y+P S+T+K ++C++ LC C
Sbjct: 5 DTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLG 63
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
CPY + Y + TS+SG+L+ED++HL + N + V+A V GCG QSG +LD
Sbjct: 64 TFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDI 121
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLAS 277
AP+GL GLG+ +ISVPS+LA+ GL+ +SFSMCF D GRI FGD+G + Q+ T F
Sbjct: 122 AAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NL 180
Query: 278 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI--AAEFDRQVNDT 335
N + Y I V +G++ L F A+ D+G+SFT+L +Y T+ +A+ R D+
Sbjct: 181 NPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESAQDKRHSPDS 239
Query: 336 ITSFE 340
FE
Sbjct: 240 RIPFE 244
>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
vinifera]
Length = 294
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 158/272 (58%), Gaps = 8/272 (2%)
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG 265
CG Q+G +L+G AP+GL GLG+G ISVPS+LAK GL+ +SFSMCF D +GRI FGD+G
Sbjct: 1 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 325
+ Q+ T F S + + Y I + +G + +F AI DSG+SFT+L Y +I+
Sbjct: 61 SSGQEETPFNPSKSQLL-YNISITQISVGGTS-ADLNFDAIFDSGTSFTYLNDPAYTSIS 118
Query: 326 AEFDRQVNDTITSFEG-YPWKCCYKSSSQRLP-KLPSVKLMFPQNNSFVVNNPVFVIYGT 383
F+ + D +S + P++ CY S Q+ + P V L ++F V +P+ VI
Sbjct: 119 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPI-VIVSI 177
Query: 384 QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPG 443
Q +CL + GDI IGQNFMTGYR++FDRE + LGW+ SNC D + P+ P
Sbjct: 178 QGGYVYCLGVVK-SGDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 236
Query: 444 PGTPSNPLPANQEQSSPGGHAVGPAVAGRAPS 475
+P P + E + G+ G ++ APS
Sbjct: 237 -NSPVVPPTVSVEPEATAGNGNGSHIS-EAPS 266
>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
Length = 306
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 158/272 (58%), Gaps = 8/272 (2%)
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG 265
CG Q+G +L+G AP+GL GLG+G ISVPS+LAK GL+ +SFSMCF D +GRI FGD+G
Sbjct: 13 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 325
+ Q+ T F S + + Y I + +G + +F AI DSG+SFT+L Y +I+
Sbjct: 73 SSGQEETPFNPSKSQLL-YNISITQISVGGTS-ADLNFDAIFDSGTSFTYLNDPAYTSIS 130
Query: 326 AEFDRQVNDTITSFEG-YPWKCCYKSSSQRLP-KLPSVKLMFPQNNSFVVNNPVFVIYGT 383
F+ + D +S + P++ CY S Q+ + P V L ++F V +P+ VI
Sbjct: 131 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPI-VIVSI 189
Query: 384 QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPG 443
Q +CL + GDI IGQNFMTGYR++FDRE + LGW+ SNC D + P+ P
Sbjct: 190 QGGYVYCLGVVK-SGDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 248
Query: 444 PGTPSNPLPANQEQSSPGGHAVGPAVAGRAPS 475
+P P + E + G+ G ++ APS
Sbjct: 249 -NSPVVPPTVSVEPEATAGNGNGSHIS-EAPS 278
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 121/323 (37%), Positives = 168/323 (52%), Gaps = 39/323 (12%)
Query: 33 HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKT 92
HR+S V+ + P + EYY L D++++ + G + + G+ T
Sbjct: 28 HRYSATVREWAGHRA------PPAGTAEYYAALAGHDLRRRSLAGGGEVAF---ADGNDT 78
Query: 93 MSLGN----------------------DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNE 130
L D G DL W+PCDC+ CAPL + Y L D
Sbjct: 79 YRLNELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSPNYRDLKFD--T 136
Query: 131 YSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
YSP SSTS+ + CS LCD ++C++ CPY++ Y ++NTSS+G+LVED+L+L++
Sbjct: 137 YSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVTEY 196
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL-IRNSFSM 249
K V A + GCG Q+G +L AP+GL+GLG+ ISVPSLLA G+ NSFSM
Sbjct: 197 GRQPK-IVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVAAANSFSM 255
Query: 250 CFDKDDSGRIFFGDQGPATQQSTSF-LASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 308
CF +D GRI FGD G + QQ T + Y Y I + +GS + T F AIVD
Sbjct: 256 CFAQDGHGRINFGDTGSSDQQETPLNMYKQNPY--YNISITGATVGSKSI-HTKFNAIVD 312
Query: 309 SGSSFTFLPKEVYETIAAEFDRQ 331
SG+SFT L +Y I + Q
Sbjct: 313 SGTSFTALSDPMYTQITSSVSVQ 335
>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
Length = 414
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 135/436 (30%), Positives = 209/436 (47%), Gaps = 68/436 (15%)
Query: 18 ESSGAETVMFSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQK 74
ES+G FS ++ H FS+ VK LG P K S EY+++L D ++ +
Sbjct: 24 ESAGK----FSFEVHHMFSDTVKQNLGF-----GDLVPEKGSLEYFKLLAQRDRLIRGRG 74
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDF-GCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 133
+ + + + G++T+S+ DF G DL W+PC+C
Sbjct: 75 LSSNNEEAPVTFILGNRTVSI--DFLGSDLFWLPCNC----------------------- 109
Query: 134 SASSTSKHLSCSHRLCDLGTS---CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
+C L D+G S C +P CPY + Y TS+ G L ED+LHL++
Sbjct: 110 -------GTTCIRDLEDIGLSQGGCSSPASVCPYQIPYLFNTTSTRGTLFEDVLHLVT-E 161
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
D L+ V+A++ +GCG Q+G Y +A +GL+GLG+ + SVPS+LAK + NSFSMC
Sbjct: 162 DEGLE-PVKANITLGCGQNQTGLYRKSLAVNGLLGLGMKDYSVPSVLAKENITANSFSMC 220
Query: 251 FDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 308
F D GRI FGD+G Q T + TY + V +G L + A+ D
Sbjct: 221 FGNIIDFIGRISFGDRGHTDQLQTPLVPIEPN-PTYAVNVTEVTVGGDIL-EIQMLALFD 278
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQ-RLPKLPSVKLMFP 366
+G+SFT L + Y + FD V D + P++ CY +S + K P V + F
Sbjct: 279 TGTSFTHLLEPAYGLLTKAFDDHVTDKRRPIDPEIPFEFCYDTSPNIKSFKFPRVNMTFV 338
Query: 367 QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD------------IGTIGQNFMTGYRVV 414
+ + +P+F ++ + ++ D + I + +N M+GYR+V
Sbjct: 339 GGSKLTLRDPLFTVWNEARHGAWMSSLTFSDREKKKKEYVLNAFHIWVVSENLMSGYRIV 398
Query: 415 FDRENLKLGWSHSNCQ 430
FDRE + LGW S+C+
Sbjct: 399 FDRERMILGWKRSDCK 414
>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
Length = 263
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 102/245 (41%), Positives = 141/245 (57%), Gaps = 6/245 (2%)
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
V+A ++ GCG Q+G +LD AP+GL GLG+ ++SVPS+LA G NSFSMCF D G
Sbjct: 11 VKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSFSMCFGSDGMG 70
Query: 258 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
RI+FGD G + Q T F N + TY I + +G+S + S AIVDSG+SFT L
Sbjct: 71 RIYFGDTGSSDQGETPFDV-NHSHPTYNISLIGMEVGNSSIDVNS-SAIVDSGTSFTCLA 128
Query: 318 KEVYETIAAEFDRQVNDTITSFE-GYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNN 375
+Y ++ F QV + + G P++ CY S +Q LP + L + F +N+
Sbjct: 129 DPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKGGSQFPIND 188
Query: 376 PVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDG 435
P+ VI Q + +CL I + IGQNFMTG R+VFDRE L LGW S+C + D
Sbjct: 189 PIIVISSEQ-SSFYCLGIVK-SSQLNIIGQNFMTGLRIVFDRERLVLGWKESDCYEAEDS 246
Query: 436 TKSPL 440
+ P+
Sbjct: 247 STLPV 251
>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 217
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 88/201 (43%), Positives = 116/201 (57%), Gaps = 34/201 (16%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S++++HR S+E + + WP + S EYY+ L+ SD+Q+QK + +L S
Sbjct: 28 SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80
Query: 88 QGSKTMSLGNDFGC-----------------------DLLWIPCDCVRCAPLSASYYNSL 124
+G T S GND G DL W+PCDC++CAPLS Y +L
Sbjct: 81 KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139
Query: 125 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 184
DRDL Y P+ S+TS+HL CSH LC C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 185 HLISGGDNALKNSVQASVIIG 205
HL D+ V ASVIIG
Sbjct: 200 HLNYREDHV---PVNASVIIG 217
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 130/439 (29%), Positives = 195/439 (44%), Gaps = 98/439 (22%)
Query: 24 TVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQ 80
T F ++ H FS+ VK +LG+ P + S EY++VL D ++ + + +
Sbjct: 26 TGKFGFEVHHIFSDSVKQSLGL-----GDLVPEQGSLEYFKVLAHRDRLIRGRGLASNND 80
Query: 81 FQMLFPSQGSKTMS---LGN-------------------DFGCDLLWIPCDC-VRCAPLS 117
+ G+ T+S LG+ D G DL W+PC+C C
Sbjct: 81 ETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDL 140
Query: 118 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSG 177
LN Y+P+AS+TS + CS + C C +P CPY + Y + +T + G
Sbjct: 141 EDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNSTGTKG 199
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
L++D+LHL + +N V+A+V +GCG KQ+G + + +G++GLG+ SVPSLL
Sbjct: 200 TLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLL 257
Query: 238 AKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS 295
AKA + NSFSMCF + + GRI FGD+G Q+ T F++ +
Sbjct: 258 AKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPR--------------- 302
Query: 296 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 355
+ VD F F +D N T F
Sbjct: 303 --------RRPVDPELPFEFC-----------YDLSPNATTIQF---------------- 327
Query: 356 PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG-----FCLAIQPVDGDIGTIGQNFMTG 410
P V++ F + ++NNP F TQ G +CL + +G NF+ G
Sbjct: 328 ---PLVEMTFIGGSKIILNNPFFTAR-TQARHGEGNVMYCLGVLK---SVGLKINNFVAG 380
Query: 411 YRVVFDRENLKLGWSHSNC 429
YR+VFDRE + LGW S C
Sbjct: 381 YRIVFDRERMILGWKQSLC 399
>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
Length = 307
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 100/267 (37%), Positives = 139/267 (52%), Gaps = 20/267 (7%)
Query: 222 GLIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGK 280
L+GLG+ ++SVPS+LA G+++ NSFSMCF KD GRI FGD G A Q T F+ +
Sbjct: 8 ALMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKS-T 66
Query: 281 YITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+ Y I + + +G L F AI DSG+SFT+L Y F+ Q+++ +F
Sbjct: 67 HSYYNISITSMSVGDKNLP-LGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFS 125
Query: 341 G------YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY-----GTQVVTG 388
G +P++ CY S Q +LP V L F V +PV+ I G + G
Sbjct: 126 GSTRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIG 185
Query: 389 FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC---QDLNDG--TKSPLTPG 443
+CLA+ D I IGQNFMTG +VVF+RE LGW +C + + D + +P
Sbjct: 186 YCLAVIKSDLPIDIIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPS 245
Query: 444 PGTPSNPLPANQEQSSPGGHAVGPAVA 470
PG ++ P QE SP G P A
Sbjct: 246 PGPTTHVFPQPQESDSPAGRTPIPGAA 272
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 118/444 (26%), Positives = 204/444 (45%), Gaps = 62/444 (13%)
Query: 17 TESSGAETVMFSTKLIHRFS-EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKM 75
T ++ +E ++F + +F+ + VK LG + + + + L D Q + +
Sbjct: 27 TAATASENLVFEVR--SKFAGKRVKDLGALRAHDVHR--HSRLLSAIDIPLGGDSQPESI 82
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP- 133
G F + S+ + D G D+LW+ C C+RC S DL E +P
Sbjct: 83 --GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKS---------DLVELTPY 131
Query: 134 --SASSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 188
ASST+K +SCS C + + C + C Y + Y + +S++G LV+D++HL
Sbjct: 132 DVDASSTAKSVSCSDNFCSYVNQRSECHSGS-TCQYVI-MYGDGSSTNGYLVKDVVHLDL 189
Query: 189 GGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
N S ++I GCG KQSG + A DG++G G S S LA G ++ SF
Sbjct: 190 VTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSF 249
Query: 248 SMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-- 304
+ C D ++ G IF G+ ++T L+ + Y + +E +G+S L+ +S
Sbjct: 250 AHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIE---VGNSVLELSSNAFD 306
Query: 305 ------AIVDSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 353
I+DSG++ +LP VY E +A+ + ++ SF + + +
Sbjct: 307 SGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFTCFHY-------TD 359
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--------IGQ 405
+L + P+V F ++ S V P ++ + T +C Q +G + T +G
Sbjct: 360 KLDRFPTVTFQFDKSVSLAV-YPREYLFQVREDT-WCFGWQ--NGGLQTKGGASLTILGD 415
Query: 406 NFMTGYRVVFDRENLKLGWSHSNC 429
++ VV+D EN +GW++ NC
Sbjct: 416 MALSNKLVVYDIENQVIGWTNHNC 439
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 172/367 (46%), Gaps = 49/367 (13%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
S+ + D G D+LW+ C C+RC P + +L Y ASST+K +SCS
Sbjct: 95 SRDFHVQVDTGSDILWVNCAGCIRC-PRKSDLV-----ELTPYDADASSTAKSVSCSDNF 148
Query: 149 C---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
C + + C + C Y + Y + +S++G LV D++HL N S ++I G
Sbjct: 149 CSYVNQRSECHSGS-TCQYVI-LYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTIIFG 206
Query: 206 CGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGD 263
CG KQSG + A DG++G G S S LA G ++ SF+ C D ++ G IF G+
Sbjct: 207 CGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGE 266
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTF 315
++T L+ + Y + +E +G+S L+ +S I+DSG++ +
Sbjct: 267 VVSPKVKTTPMLSKSAHYSVNLNAIE---VGNSVLQLSSDAFDSGDDKGVIIDSGTTLVY 323
Query: 316 LPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
LP VY + +A+ + ++ SF + + RL + P+V F ++ S
Sbjct: 324 LPDAVYNPLMNQILASHQELNLHTVQDSFTCFHYI-------DRLDRFPTVTFQFDKSVS 376
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--------IGQNFMTGYRVVFDRENLKL 422
V P ++ + T +C Q +G + T +G ++ VV+D EN +
Sbjct: 377 LAV-YPQEYLFQVREDT-WCFGWQ--NGGLQTKGGASLTILGDMALSNKLVVYDIENQVI 432
Query: 423 GWSHSNC 429
GW++ NC
Sbjct: 433 GWTNHNC 439
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 167/376 (44%), Gaps = 35/376 (9%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSA 135
TG F + SK + D G D+LW+ C +C RC S + L Y P
Sbjct: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKS-----DIGIGLTLYDPKR 120
Query: 136 SSTSKHLSCSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 188
S TS+ +SC H C LG +NP CPY++ Y + ++++G V+D L
Sbjct: 121 SKTSEFVSCEHNFCSSTYEGRILGCKAENP---CPYSISY-GDGSATTGYYVQDYLTFNR 176
Query: 189 GGDNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNS 246
N + +S+I GCG QSG + A DG+IG G SV S LA +G ++
Sbjct: 177 VNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKI 236
Query: 247 FSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQT 301
FS C D + G IF G+ ++T + + Y + +E + S
Sbjct: 237 FSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSE 296
Query: 302 SFKA-IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
+ K ++DSG++ +LP+ VY+ + ++ +Q + E C++ + P
Sbjct: 297 NGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVE--EQYSCFQYTGNVDSGFP 354
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG------DIGTIGQNFMTGYRV 413
VKL F + S V P ++ + + +C+ Q D+ +G ++ V
Sbjct: 355 IVKLHFEDSLSLTV-YPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLV 413
Query: 414 VFDRENLKLGWSHSNC 429
V+D EN+ +GW+ NC
Sbjct: 414 VYDLENMTIGWTDYNC 429
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 165/358 (46%), Gaps = 42/358 (11%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC--DLGTS 154
D G D+LW+ C C +C S L DL Y P SS+ +SC ++ C G+
Sbjct: 105 DTGSDILWVNCVSCDKCPTKSG-----LGIDLALYDPKGSSSGSAVSCDNKFCAATYGSG 159
Query: 155 CQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
+ P +PC Y +Y + +S++G V D L NA +A+VI GCG +Q
Sbjct: 160 EKLPGCTAGKPCEYRAEY-GDGSSTAGSFVSDSLQYNQLSGNAQTRHAKANVIFGCGAQQ 218
Query: 211 SGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPA 267
GG L+ A DG+IG G S S LA AG ++ FS C D G IF G+
Sbjct: 219 -GGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKGGGIFAIGEVVQP 277
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFK--AIVDSGSSFTFLPKE 319
+ST L + Y + +++ + + L+ +TS K I+DSG++ T+LP+
Sbjct: 278 KVKSTPLLPNMSH---YNVNLQSIDVAGNALQLPPHIFETSEKRGTIIDSGTTLTYLPEL 334
Query: 320 VYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF-VVNNPV 377
VY+ I AA F + + T + +G+ C++ S P + F + V +
Sbjct: 335 VYKDILAAVFQKHQDITFRTIQGF---LCFEYSESVDDGFPKITFHFEDDLGLNVYPHDY 391
Query: 378 FVIYGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F G + +CL QP D D+ +G ++ VV+D E +GW+ NC
Sbjct: 392 FFQNGDNL---YCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLEKQVIGWTDYNC 446
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 168/375 (44%), Gaps = 34/375 (9%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSAS 136
TG F + SK + D G D+LW+ +C+ C S + L DL Y P+AS
Sbjct: 86 TGLYFTQIGIGTPSKGYYVQVDTGSDILWV--NCISCD--SCPRKSGLGIDLTLYDPTAS 141
Query: 137 STSKHLSCSHRLCDLGTSCQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 191
++SK ++C C T+ P PC Y++ Y + +S++G V D L
Sbjct: 142 ASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITY-GDGSSTTGFFVADFLQYDQVSG 200
Query: 192 NALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
+ N ASV GCG K G VA DG++G G S+ S L AG + FS C
Sbjct: 201 DGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHC 260
Query: 251 FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK---------QT 301
D + G IF + T+ L + Y + ++T +G S L+
Sbjct: 261 LDTVNGGGIFAIGNVVQPKVKTTPLVPGMPH--YNVVLKTIDVGGSTLQLPTNIFDIGGG 318
Query: 302 SFKAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 360
S I+DSG++ +LP+ VY+ + +A F + T+ + + + C++ S P
Sbjct: 319 SRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDF---LCFQYSGSVDNGFPE 375
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRVV 414
V F + VV ++ T+ V +C+ +Q DG D+ +G ++ VV
Sbjct: 376 VTFHFDGDLPLVVYPHDYLFQNTEDV--YCVGFQSGGVQSKDGKDMVLLGDLALSNKLVV 433
Query: 415 FDRENLKLGWSHSNC 429
+D EN +GW++ NC
Sbjct: 434 YDLENQVIGWTNYNC 448
>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
gi|255630909|gb|ACU15817.1| unknown [Glycine max]
Length = 244
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 124/265 (46%), Gaps = 30/265 (11%)
Query: 249 MCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 308
MCF D +GRI FGD G Q+ T F + TY I + + S + F AI D
Sbjct: 1 MCFGPDGAGRITFGDTGSPDQRKTPFNVRK-LHPTYNITITQIVVEDS-VADLEFHAIFD 58
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFE----GYPWKCCYKSSSQRLPKLPSVKLM 364
SG+SFT++ Y + ++ +V S + P++ CY S + ++P + L
Sbjct: 59 SGTSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQTIEVPFLNLT 118
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
+ + V +P+ ++ + CL IQ D + IGQNFM GY++VFDR+N+ LGW
Sbjct: 119 MKGGDDYYVMDPIVQVFSEEEGDLLCLGIQKSDS-VNIIGQNFMIGYKIVFDRDNMNLGW 177
Query: 425 SHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQL 484
+NC D SN P N SP AV PA+A P S
Sbjct: 178 KETNCSD-------------DVLSNTSPINTPSPSP---AVSPAIA----VNPVATSNPS 217
Query: 485 ISSRSSSLKVLP---FLLLLRLLVS 506
I+ + S ++ P F+++L L++
Sbjct: 218 INPPNRSFRIKPTFTFVVVLLPLIA 242
>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
Length = 426
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/401 (26%), Positives = 180/401 (44%), Gaps = 59/401 (14%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
S ++ HRFSE+VK + P S +YY+ L+ D +Q +
Sbjct: 22 LSFEIHHRFSEQVKTV-----LGGHGLPEMGSLDYYKALVHRDRGRQLTSNNNNQTTISF 76
Query: 87 SQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+QG+ T + + Y +L L + A +L+ +
Sbjct: 77 AQGNSTEEI----------------------SLYDKNLAPPLYFHLTQAVICFGYLAIAI 114
Query: 147 RLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
L C +P CPY + Y + + S+G+LVED++H+ + A A
Sbjct: 115 PLVYGVWRLTKARCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEAR----DAR 170
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
+ G + G VA +G++GL + +I+VP++L KAG+ +SFSMCF + G I F
Sbjct: 171 ITFG---ESQLGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISF 227
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVY 321
GD+G + Q T L+ + Y + + +G + T F A DSG++ T+L + Y
Sbjct: 228 GDKGSSDQLETP-LSGTISPMFYDVSITKFKVGKVTV-DTEFTATFDSGTAVTWLIEPYY 285
Query: 322 ETIAAEF-----DRQVNDTITSFEGYPWKCCY-KSSSQRLPKLPSVKLMFPQNNSFVVNN 375
+ F DR+++ ++ S P++ CY +S+ KLPSV ++ V +
Sbjct: 286 TALTTNFHLSVPDRRLSKSVDS----PFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFS 341
Query: 376 PVFVIYGT----QVVTGFCLAI-QPVDGDIGTIGQNFMTGY 411
P+ V + QV +CLA+ + V+ D IG+N G+
Sbjct: 342 PILVFDTSDGSFQV---YCLAVLKQVNADFSIIGRNDTNGF 379
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 158/355 (44%), Gaps = 38/355 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C +C S L DL Y P SS+ +SC + C +
Sbjct: 101 DTGSDILWVNCISCNKCPRKS-----DLGIDLRLYDPKGSSSGSTVSCDQKFCAATYGGK 155
Query: 157 NP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P PC Y++ Y + +S++G V D L + ASVI GCG +Q G
Sbjct: 156 LPGCAKNIPCEYSV-MYGDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQ-G 213
Query: 213 GYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQ 269
G L A DG+IG G S+ S LA AG ++ FS C D G IF GD
Sbjct: 214 GDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIKGGGIFAIGDVVQPKV 273
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPKEVY 321
+ST + Y + +E+ +G + L+ S I+DSG++ T+LP+ VY
Sbjct: 274 KSTPLVPDMPH---YNVNLESINVGGTTLQLPSHMFETGEKKGTIIDSGTTLTYLPELVY 330
Query: 322 -ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-----PSVKLMFPQNNSFVVNN 375
+ +AA F + + T S + + ++S PK+ + L ++ F N
Sbjct: 331 KDVLAAVFAKHPDTTFHSVQDFLCIQYFQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNG 390
Query: 376 PVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G Q G +Q DG D+ +G ++ VV+D EN +GW+ NC
Sbjct: 391 DNLYCFGFQ--NG---GLQSKDGKDMVLLGDLVLSNKVVVYDLENQVVGWTDYNC 440
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 157/355 (44%), Gaps = 38/355 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C +C + + L DL Y P ASST + C C +
Sbjct: 104 DTGSDILWVNCITCEQCP-----HKSGLGLDLTLYDPKASSTGSMVMCDQAFCAATFGGK 158
Query: 157 NPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
PK PC Y++ Y + +S+ G V D L + ASVI GCG +Q G
Sbjct: 159 LPKCGANVPCEYSVTY-GDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGG 217
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQ 270
A DG++G G S+ S L AG ++ F+ C D G IF GD +
Sbjct: 218 DLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGIFSIGDVVQPKVK 277
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSF--------KAIVDSGSSFTFLPKEVY- 321
+T +A Y + ++T +G + L+ + I+DSG++ T+LP+ V+
Sbjct: 278 TTPLVADKPH---YNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTLTYLPELVFK 334
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF-VVNNPVFVI 380
E + A F++ + T +G+ C++ P++ F + + V + F
Sbjct: 335 EVMLAVFNKHQDITFHDVQGF---LCFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFA 391
Query: 381 YGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G V +C+ A Q DG DI +G ++ V++D EN +GW+ NC
Sbjct: 392 NGNDV---YCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNC 443
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 161/355 (45%), Gaps = 38/355 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C +C + + L DL Y P ASST + C C +
Sbjct: 106 DTGSDILWVNCITCDQCP-----HKSGLGLDLTLYDPKASSTGSTVMCDQGFCADTFGGR 160
Query: 157 NPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
PK PC Y++ Y + +S+ G V D L + ASVI GCG +Q G
Sbjct: 161 LPKCSANVPCEYSVTY-GDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGG 219
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQ 270
A DG++G G S+ S LA AG ++ F+ C D G IF GD +
Sbjct: 220 DLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGGIFAIGDVVQPKVK 279
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTS--FK------AIVDSGSSFTFLPKEVYE 322
+T +A Y + ++T +G + L+ + FK I+DSG++ T+LP+ V++
Sbjct: 280 TTPLVADKPH---YNVNLKTIDVGGTTLELPADIFKPGEKRGTIIDSGTTLTYLPELVFK 336
Query: 323 TIA-AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF-VVNNPVFVI 380
+ A F++ + T + + C++ S P++ F + + V + F
Sbjct: 337 KVMLAVFNKHQDITFHDVQDF---LCFEYSGSVDDGFPTLTFHFEDDLALHVYPHEYFFP 393
Query: 381 YGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G V +C+ A+Q DG DI +G ++ VV+D EN +GW+ NC
Sbjct: 394 NGNDV---YCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDYNC 445
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 158/353 (44%), Gaps = 32/353 (9%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---- 152
D G D+LW+ C+ C C S L LN + P +SSTS ++CS + C+ G
Sbjct: 93 DTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSS 147
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+C + C YT Y + + +SG V D++HL + + ++ + A V+ GC +Q+
Sbjct: 148 DATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQT 206
Query: 212 GGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPAT 268
G A DG+ G G E+SV S L+ G+ FS C D SG + G+
Sbjct: 207 GDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPN 266
Query: 269 QQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYE- 322
TS + + Y + + +T I SS ++ + IVDSG++ +L +E Y+
Sbjct: 267 IVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDP 326
Query: 323 ---TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
I A + V+ ++ CY +S P V L F S ++ ++
Sbjct: 327 FVSAITASIPQSVHTVVSR-----GNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDYL 381
Query: 380 IYGTQV--VTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I + +C+ Q + G I +G + VV+D ++GW++ +C
Sbjct: 382 IQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDC 434
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 157/348 (45%), Gaps = 22/348 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C+ P ++ L LN + P +SSTS ++CS + C+ G
Sbjct: 96 DTGSDVLWVSCNSCNGCPQTSG----LQIQLNFFDPGSSSTSSMIACSDQRCNNGKQSSD 151
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+C + C YT Y + + +SG V D++HL + + ++ + A V+ GC +Q+G
Sbjct: 152 ATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPVVFGCSNQQTG 210
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ 269
A DG+ G G E+SV S L+ G+ FS C D SG + G+
Sbjct: 211 DLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEIVEPNI 270
Query: 270 QSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
TS + + Y + + +T I SS ++ + IVDSG++ +L +E Y+
Sbjct: 271 VYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPF 330
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
+ + ++ + + CY +S P V L F S ++ ++I
Sbjct: 331 VSAITAAIPQSVRTVVSRGNQ-CYLITSSVTDVFPQVSLNFAGGASMILRPQDYLIQQNS 389
Query: 385 V--VTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +C+ Q + G I +G + VV+D ++GW++ +C
Sbjct: 390 IGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDC 437
>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
Length = 260
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 93/186 (50%), Gaps = 7/186 (3%)
Query: 249 MCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI 306
MCF D GRI FGD+G Q T L + TY + V +G + A+
Sbjct: 1 MCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS-PTYAVSVTEVSVGGDAVG-VQLLAL 58
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKL-PSVKLM 364
D+G+SFT L + Y I FD V D + P++ CY S + L P V +
Sbjct: 59 FDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMT 118
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + + NP+F+++ +CL I + VD I IGQNFM+GYR+VFDRE + LG
Sbjct: 119 FEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILG 178
Query: 424 WSHSNC 429
W S+C
Sbjct: 179 WKRSDC 184
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 159/372 (42%), Gaps = 26/372 (6%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 134
+TG F L K + D G D+LW+ C C RC S L DL Y P
Sbjct: 66 ETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKS-----DLGIDLTLYDPK 120
Query: 135 ASSTSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S TS+ +SC C P + PCPY++ Y + ++++G V+D L
Sbjct: 121 GSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNHVN 179
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
DN +S+I GCG QSG A DG+IG G SV S LA +G ++ FS
Sbjct: 180 DNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFS 239
Query: 249 MCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSF 303
C D G IF G+ +T + Y + +E + S +
Sbjct: 240 HCLDNIRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNG 299
Query: 304 KA-IVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
K I+DSG++ +LP VY E I RQ + E C++ + P V
Sbjct: 300 KGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVE--QQFSCFQYTGNVDRGFPVV 357
Query: 362 KLMFPQNNSFVV--NNPVFVIYGTQVVTGFCLAI-QPVDG-DIGTIGQNFMTGYRVVFDR 417
KL F + S V ++ +F G+ ++ Q +G D+ +G ++ V++D
Sbjct: 358 KLHFEDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDL 417
Query: 418 ENLKLGWSHSNC 429
EN+ +GW+ NC
Sbjct: 418 ENMAIGWTDYNC 429
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 159/352 (45%), Gaps = 28/352 (7%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C P+S+ L LN + P +S T+ +SCS + C LG
Sbjct: 108 DTGSDVLWVSCSSCNGCPVSSG----LHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSD 163
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQASVIIGCGMKQ 210
+ C C YT Y + + +SG V D+LH I GG + +KNS A ++ GC Q
Sbjct: 164 SVCAAQNNQCGYTFQY-GDGSGTSGYYVSDLLHFDTILGG-SVMKNS-SAPIVFGCSTLQ 220
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPA 267
+G A DG+ G G ++SV S LA G+ FS C DDSG + G+
Sbjct: 221 TGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVEP 280
Query: 268 TQQSTSFLASNGKY-----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVY 321
T + S Y Y+ G +T I S +S + I+DSG++ +L + Y
Sbjct: 281 NIVYTPLVPSQPHYNLNLQSIYVNG-QTLAIDPSVFATSSNQGTIIDSGTTLAYLTEAAY 339
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ + V+ +++ + CY +SS P V L F S ++ ++I
Sbjct: 340 DPFISAITSTVSPSVSPYLS-KGNQCYLTSSSINDVFPQVSLNFAGGTSMILIPQDYLIQ 398
Query: 382 GTQV--VTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + +C+ Q + G +I +G + V+D ++GW++ +C+
Sbjct: 399 QSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYDIAGQRIGWANYDCK 450
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 162/368 (44%), Gaps = 36/368 (9%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C +C S L DL Y P ASS+ +SC C +
Sbjct: 102 DTGSDILWVNCISCEKCPRKSG-----LGLDLTFYDPKASSSGSTVSCDQGFCAATYGGK 156
Query: 157 NP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P PC Y++ Y + +S++G V D L + A+V GCG +Q G
Sbjct: 157 LPGCTANVPCEYSV-MYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATVTFGCGAQQGG 215
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQ 270
A DG++G G S+ S LA AG ++ F+ C D G IF G+ +
Sbjct: 216 DLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGIFAIGNVVQPKVK 275
Query: 271 STSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVY-ETI 324
+T +A Y + +G T + + + K I+DSG++ T+LP+ V+ E +
Sbjct: 276 TTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTLTYLPELVFKEVM 335
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF-VVNNPVFVIYGT 383
AA F++ + + + + C++ P++ F + + V + F G
Sbjct: 336 AAIFNKHQDIVFHNVQDF---MCFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGN 392
Query: 384 QVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQD----LN 433
+ +C+ A+Q DG DI +G ++ V++D EN +GW+ NC +
Sbjct: 393 DM---YCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNCSSSIKIED 449
Query: 434 DGTKSPLT 441
D T +P T
Sbjct: 450 DKTGTPYT 457
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 173/388 (44%), Gaps = 53/388 (13%)
Query: 81 FQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVR--CAPLSASYYNSLDRDLNEYSPSASST 138
+ L+ +K ++ D G + ++PC C P N D + P ASST
Sbjct: 79 YATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGP------NHQD---AAFDPEASST 129
Query: 139 SKHLSCSHRLCDLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 197
+ +SC+ C G+ C Q C YT Y E +SSSG+L+ED+L L G A
Sbjct: 130 ASRISCTSPKCSCGSPRCGCSTQQCTYTRSY-AEQSSSSGILLEDVLALHDGLPGA---- 184
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DDS 256
+I GC +++G A DGL GLG + SV + L KAG+I + FS+CF +
Sbjct: 185 ---PIIFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGD 240
Query: 257 GRIFFGDQ---GPATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLKQTSFKAI 306
G + GD G + Q T L S N K ++ + + + S Q + +
Sbjct: 241 GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQ-GYGTV 299
Query: 307 VDSGSSFTFLPKEVYETIAAEFD--------RQVNDTITSFEGYPWKCCYKS--SSQRLP 356
+DSG++FT++P V++ A + ++V F+ C+ S L
Sbjct: 300 LDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFD----DICFGQAPSHDDLE 355
Query: 357 KL----PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGY 411
L PS+++ F Q S V+ ++ T +CL + +G GT +G
Sbjct: 356 ALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSGKYCLGVFD-NGRAGTLLGGITFRNV 414
Query: 412 RVVFDRENLKLGWSHSNCQDLNDGTKSP 439
V +DR N ++G+ + C++L + + P
Sbjct: 415 LVRYDRANQRVGFGPALCKELGEMQRPP 442
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 171/372 (45%), Gaps = 45/372 (12%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC DC +C ++ P SST + + C ++ +C
Sbjct: 112 DSGSTVTYVPCSDCEQCGKHQDP----------KFQPELSSTYQPVKC-----NMDCNCD 156
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K+ C Y +Y E++SS G+L ED LIS G+ + +A + GC ++G
Sbjct: 157 DDKEQCVYEREY-AEHSSSKGVLGED---LISFGNESQLTPQRA--VFGCETVETGDLYS 210
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG+IGLG G++S+ L GLI NSF +C+ D G I G P+ T
Sbjct: 211 QRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMIFTD 269
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAAE 327
Y Y I + + L S A++DSG+++ +LP +
Sbjct: 270 SDPDRSPY--YNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEA 327
Query: 328 FDRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFV 379
R+V+ + +G + C ++S + +L PSV+++F S++++ ++
Sbjct: 328 VMREVS-PLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENYM 386
Query: 380 IYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKS 438
++V +CL + P D T +G + VV+DREN K+G+ +NC +L+D
Sbjct: 387 FRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSDRLHI 446
Query: 439 PLTPGPGT-PSN 449
P P T PSN
Sbjct: 447 DGAPPPATLPSN 458
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 166/376 (44%), Gaps = 36/376 (9%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 134
+TG F + +K + D G D+LW+ C C C S +L +L Y P
Sbjct: 86 ETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKS-----NLGIELTMYDPR 140
Query: 135 ASSTSKHLSCSHRLCDLGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S + + ++C + C P PC Y++ Y + +S++G V D L
Sbjct: 141 GSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISY-GDGSSTAGFFVTDFLQYNQVS 199
Query: 191 DNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
+ ASV GCG K G +A DG++G G S+ S LA AG +R F+
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259
Query: 250 CFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQ 300
C D + G IF G+ ++T ++ Y + G++ +G + L
Sbjct: 260 CLDTVNGGGIFAIGNVVQPKVKTTPLVSDMPHYNVILKGID---VGGTALGLPTNIFDSG 316
Query: 301 TSFKAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
S I+DSG++ ++P+ VY+ + A FD+ + ++ + + + C++ S P
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---CFQYSGSVDDGFP 373
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRV 413
V F + S +V+ ++ + + +C+ +Q DG D+ +G ++ V
Sbjct: 374 EVTFHFEGDVSLIVSPHDYLFQNGKNL--YCMGFQNGGVQTKDGKDMVLLGDLVLSNKLV 431
Query: 414 VFDRENLKLGWSHSNC 429
++D EN +GW+ NC
Sbjct: 432 LYDLENQAIGWADYNC 447
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 169/368 (45%), Gaps = 44/368 (11%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC DC +C ++ P SST + + C ++ +C
Sbjct: 111 DSGSTVTYVPCSDCEQCGKHQDP----------KFQPEMSSTYQPVKC-----NMDCNCD 155
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ ++ C Y +Y E++SS G+L ED LIS G+ + +A + GC ++G
Sbjct: 156 DDREQCVYEREY-AEHSSSKGVLGED---LISFGNESQLTPQRA--VFGCETVETGDLYS 209
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG+IGLG G++S+ L GLI NSF +C+ D G I G P+ T
Sbjct: 210 QRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTD 268
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAAE 327
Y Y I + + L S A++DSG+++ +LP +
Sbjct: 269 SDPDRSPY--YNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEA 326
Query: 328 FDRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFV 379
R+V+ T+ +G + C ++S + +L PSV+++F S++++ ++
Sbjct: 327 VMREVS-TLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYM 385
Query: 380 IYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKS 438
++V +CL + P D T +G + VV+DREN K+G+ +NC +L+D
Sbjct: 386 FRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSDRLHI 445
Query: 439 PLTPGPGT 446
P P T
Sbjct: 446 DGAPPPAT 453
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 124/528 (23%), Positives = 223/528 (42%), Gaps = 82/528 (15%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQ 63
+SL + + + +++ S+G +F+ + H+F+ + ++L K +A + +
Sbjct: 14 LSLVVIVELGFVVCLSNG--NYVFNVQ--HKFAGKERSLSALKQHDAR--------RHRR 61
Query: 64 VLLSSDV----QKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSA 118
+L + D+ + G F + K + D G D+LW+ C +C +C S
Sbjct: 62 ILSAVDLPLGGNGHPAEAGLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKS- 120
Query: 119 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENT 173
L L Y P +S+++ + C C + C PC Y++ Y + +
Sbjct: 121 ----DLGVKLTLYDPQSSTSATRIYCDDDFCAATYNGVLQGC-TKDLPCQYSV-VYGDGS 174
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEIS 232
S++G V+D L N +S SVI GCG KQSG A DG++G G S
Sbjct: 175 STAGFFVKDNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSS 234
Query: 233 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 292
+ S LA AG ++ F+ C D G IF + + + +T+ + N + Y + ++
Sbjct: 235 MISQLAAAGKVKRVFAHCLDNVKGGGIFAIGEVVSPKVNTTPMVPNQPH--YNVVMKEIE 292
Query: 293 IGSSCLKQTS--------FKAIVDSGSSFTFLPKEVYETIAAEF-DRQVNDTITSFEGYP 343
+G + L+ + I+DSG++ +LP+ VYE++ + Q + + E
Sbjct: 293 VGGNVLELPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVE--E 350
Query: 344 WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCL---AIQPVDG-D 399
C++ + P VK F + S VN ++ + V F +Q DG D
Sbjct: 351 QFTCFQYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRD 410
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQS- 458
+ +G ++ V++D EN +GW+ NC S+ + E S
Sbjct: 411 MTLLGDLVLSNKLVLYDLENQAIGWTDYNC------------------SSSIKVRDESSG 452
Query: 459 ---SPGGHAVGPAVAGRAPSKPSTASTQLISSRSSSLKVLPFLLLLRL 503
S G H + ++++QLIS R + +L F+L R
Sbjct: 453 TVYSVGAHNL-------------SSASQLISGRIMTFLLLVFVLFHRF 487
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 106/373 (28%), Positives = 162/373 (43%), Gaps = 28/373 (7%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 134
+TG F L + + D G D+LW+ C +C RC S L DL Y P
Sbjct: 66 ETGLYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKS-----DLGIDLTLYDPK 120
Query: 135 ASSTSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S TS +SC C P + PCPY++ Y + ++++G V+D L
Sbjct: 121 GSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNRIN 179
Query: 191 DNALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
N + +S+I GCG QSG G A DG+IG G SV S LA +G ++ FS
Sbjct: 180 GNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFS 239
Query: 249 MCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSF 303
C D G IF G+ +T + Y + +E + S +
Sbjct: 240 HCLDNVRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNG 299
Query: 304 KA-IVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKC-CYKSSSQRLPKLPS 360
K ++DSG++ +LP VY E I RQ + E ++C Y + R P
Sbjct: 300 KGTVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVE-QQFRCFLYTGNVDR--GFPV 356
Query: 361 VKLMFPQNNSFVV--NNPVFVIYGTQVVTGFCLAI-QPVDG-DIGTIGQNFMTGYRVVFD 416
VKL F + S V ++ +F G+ ++ Q +G D+ +G ++ V++D
Sbjct: 357 VKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYD 416
Query: 417 RENLKLGWSHSNC 429
EN+ +GW+ NC
Sbjct: 417 LENMVIGWTDYNC 429
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 165/376 (43%), Gaps = 36/376 (9%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 134
+TG F + +K + D G D+LW+ C C C S +L +L Y P
Sbjct: 86 ETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKS-----NLGIELTMYDPR 140
Query: 135 ASSTSKHLSCSHRLCDLGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S + + ++C + C P PC Y++ Y + +S++G V D L
Sbjct: 141 GSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISY-GDGSSTAGFFVTDFLQYNQVS 199
Query: 191 DNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
+ ASV GCG K G +A DG++G G S+ S LA AG +R F+
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259
Query: 250 CFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQ 300
C D + G IF G+ ++T + Y + G++ +G + L
Sbjct: 260 CLDTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSG 316
Query: 301 TSFKAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
S I+DSG++ ++P+ VY+ + A FD+ + ++ + + + C++ S P
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---CFQYSGSVDDGFP 373
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRV 413
V F + S +V+ ++ + + +C+ +Q DG D+ +G ++ V
Sbjct: 374 EVTFHFEGDVSLIVSPHDYLFQNGKNL--YCMGFQNGGVQTKDGKDMVLLGDLVLSNKLV 431
Query: 414 VFDRENLKLGWSHSNC 429
++D EN +GW+ NC
Sbjct: 432 LYDLENQAIGWADYNC 447
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 146/367 (39%), Gaps = 32/367 (8%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + +F + L D G DL WI CD C CA Y + P
Sbjct: 189 GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPR 245
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 193
S + L C+ +C+ C Y ++Y + +SS G+L +D +HLI+ GG
Sbjct: 246 DSLCQELQGDQNYCE---TCKQ----CDYEIEY-ADRSSSMGVLAKDDMHLIATNGGREK 297
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
L + GC Q G L A DG++GL IS+PS LA G+I N F C
Sbjct: 298 L------DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCIT 351
Query: 253 KDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-TSFKAIVDS 309
++ + G +F GD T G Y + G L S + I DS
Sbjct: 352 RETNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDS 411
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF---- 365
GSS+T+LP+E+Y+ + + C+K+ + L F
Sbjct: 412 GSSYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRW 471
Query: 366 ---PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
P+ + V ++ + + V G + G +G + G VV+D E ++
Sbjct: 472 FVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQI 531
Query: 423 GWSHSNC 429
GW++S C
Sbjct: 532 GWANSEC 538
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 111/421 (26%), Positives = 182/421 (43%), Gaps = 56/421 (13%)
Query: 34 RFSEEVKALGVSKNRNATSWPAKK--SFEYYQVLLSSDVQKQKMKTGPQFQMLFP--SQG 89
+ SE ++AL V+K+ W A + S + + ++DV+ G + M + G
Sbjct: 7 KRSEAIRAL-VAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPG 65
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+ ++ D G DL+W+ + C C+ + + P SST + + CS +L
Sbjct: 66 KRFRAIA-DTGSDLVWVQSEPCTGCSGGTI------------FDPRQSSTFREMDCSSQL 112
Query: 149 C-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
C +L SC+ C Y+ +Y + T G D + L + D + K S +GCG
Sbjct: 113 CAELPGSCEPGSSTCSYSYEYGSGETE--GEFARDTISLGTTSDGSQKF---PSFAVGCG 167
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGD 263
M SG DGV DGL+GLG G +S+ S L+ A I + FS C + +S + FG
Sbjct: 168 MVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESSPLLFGP 221
Query: 264 QGP---ATQQSTSFLASNGKYITY-IIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 319
QST + Y TY ++ V + + I+DSG++ T++P
Sbjct: 222 SAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPG-TTIIDSGTTLTYVPSG 280
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNNSFV 372
VY + + + V CY SS R K P++ + P +N F+
Sbjct: 281 VYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFL 340
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQD 431
V + G V CLA+ G + IG GY +++DR + +L + + C+
Sbjct: 341 VVDDS----GDTV----CLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCES 392
Query: 432 L 432
L
Sbjct: 393 L 393
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 169/390 (43%), Gaps = 67/390 (17%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + + +K L D G DL W+ CD C CA Y+
Sbjct: 21 GLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDP------------ 68
Query: 136 SSTSKHLSCSHRLCDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
++ + C LC L +C P + C Y ++Y + +S+ G+L+ED + L+
Sbjct: 69 -KKARLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEY-ADGSSTMGVLMEDTITLL--- 123
Query: 191 DNALKNSVQA--SVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSF 247
L N ++ + IIGCG Q G A DG++GL +IS+PS LAK G++RN
Sbjct: 124 ---LTNGTRSKTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVI 180
Query: 248 SMCF--DKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 304
C + G +FFGD PA + + + GK IT IG ++ G + K
Sbjct: 181 GHCLAGGSNGGGYLFFGDSLVPALGMTWTPIM--GKSITGNIGGKS---GDADDKTGDIG 235
Query: 305 AIV-DSGSSFTFLPKEVYETIAAEFDRQVNDT----ITSFEGYPWKCCYKSSS------- 352
++ DSG+SFT+L E Y + + + QV + I + P+ C++ S
Sbjct: 236 GVMFDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPF--CWRGPSPFESVAD 293
Query: 353 -QRLPKLPSVKLMFPQNNSFVVNNPV------FVIYGTQVVTGFCLAIQPVDGD----IG 401
QR K +V L F + N + + + ++I TQ CL I G
Sbjct: 294 VQRYFK--TVTLDFGKRNWYSASRVLELSPEGYLIVSTQ--GNVCLGILDASGASLEVTN 349
Query: 402 TIGQNFMTGYRVVFDRENLKLGWSHSNCQD 431
IG M GY VV+D ++GW NC +
Sbjct: 350 IIGDVSMRGYLVVYDNARNQIGWVRRNCHN 379
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 156/360 (43%), Gaps = 26/360 (7%)
Query: 89 GSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G K + D G D LW+ C C C S L DL Y P+ S TSK + C
Sbjct: 83 GPKDYYVQVDTGSDTLWVNCVGCTACPKKSG-----LGMDLTLYDPNLSKTSKAVPCDDE 137
Query: 148 LC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASV 202
C D S CPY++ Y +T+S + +D+ + G + ++ SV
Sbjct: 138 FCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDN--TSV 195
Query: 203 IIGCGMKQSGGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
I GCG KQSG + DG+IG G SV S LA AG ++ FS C D G IF
Sbjct: 196 IFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGGGIF 255
Query: 261 -FGDQGPATQQSTSFLASNGKYITYIIGVETCC----IGSSCLKQTSFKA-IVDSGSSFT 314
G+ ++T L Y + +E + S L +S + I+DSG++
Sbjct: 256 AIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTIIDSGTTLA 315
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVV 373
+LP +Y+ + + Q + + C + S + + L P+VK F + +
Sbjct: 316 YLPVSIYDQLLEKILAQRSGMKLYLVEDQFTCFHYSDEESVDDLFPTVKFTFEEGLTLTT 375
Query: 374 --NNPVFVIYGTQVVTGFCLAI-QPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +F+ G+ ++ Q DG ++ +G + VV+D +N+ +GW+ NC
Sbjct: 376 YPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVYDLDNMAIGWADYNC 435
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 111/423 (26%), Positives = 180/423 (42%), Gaps = 60/423 (14%)
Query: 34 RFSEEVKALGVSKNRNATSWPAKK--SFEYYQVLLSSDVQKQKMKTGPQFQMLFP--SQG 89
+ SE ++ L V+K+ W A + S + + ++DV+ G + M + G
Sbjct: 7 KRSEAIRGL-VAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPG 65
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+ ++ D G DL+W+ + C C+ + + P SST + + CS +L
Sbjct: 66 KRFRAIA-DTGSDLVWVQSEPCTGCSGGTI------------FDPRQSSTFREMDCSSQL 112
Query: 149 C-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQASVIIG 205
C +L SC+ C Y+ +Y + T G D + L SGG S +G
Sbjct: 113 CTELPGSCEPGSSACSYSYEYGSGETE--GEFARDTISLGTTSGGSQKFP-----SFAVG 165
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFF 261
CGM SG DGV DGL+GLG G +S+ S L+ A I + FS C + +S + F
Sbjct: 166 CGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESSPLLF 219
Query: 262 GDQGP---ATQQSTSFLASNGKYITY-IIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
G QST + Y TY ++ V + + I+DSG++ T++P
Sbjct: 220 GPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPG-TTIIDSGTTLTYVP 278
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNNS 370
VY + + + V CY SS R K P++ + P +N
Sbjct: 279 SGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNY 338
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F+V + G V CLA+ G + IG GY +++DR + +L + + C
Sbjct: 339 FLVVDDS----GDTV----CLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
Query: 430 QDL 432
+ L
Sbjct: 391 ESL 393
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 100/363 (27%), Positives = 154/363 (42%), Gaps = 50/363 (13%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
+K L D G DL W+ CD C C + +Y N+ P A+S L+ + +
Sbjct: 83 AKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTK---NKIVPCAASLCTSLTPNKK 139
Query: 148 LCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--VQASVIIG 205
C P+Q C Y + Y T+ SS G+L+ D L +L+NS V+A++ G
Sbjct: 140 -------CAVPQQ-CDYQIKY-TDKASSLGVLIADNFTL------SLRNSSTVRANLTFG 184
Query: 206 CGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
CG Q G V A DGL+GLG G +S+ S L + G+ +N CF + G +FFGD
Sbjct: 185 CGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFGD 244
Query: 264 QGPATQQSTSF---LASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEV 320
T + T ++G Y Y G T L + + DSGS++ + E
Sbjct: 245 DIVPTSRVTWVPMARTTSGNY--YSPGSGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEP 302
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQNNSFVV- 373
Y+ + ++ ++ C +KS S+ S+ L F +N+ +
Sbjct: 303 YQATVSALKAGLSKSLKEVSDVSLPLCWKGQKVFKSVSEVKNDFKSLFLSFGKNSVMEIP 362
Query: 374 --NNPVFVIYGTQVVTGFCLAIQPVDG-----DIGTIGQNFMTGYRVVFDRENLKLGWSH 426
N + YG CL I +DG IG M +++D E +LGW
Sbjct: 363 PENYLIVTKYGN-----VCLGI--LDGTTAKLKFNIIGDITMQDQMIIYDNEKGQLGWIR 415
Query: 427 SNC 429
+C
Sbjct: 416 GSC 418
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 152/346 (43%), Gaps = 26/346 (7%)
Query: 98 DFGCDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G W+ C +C S + R L Y P +S +SK + C +C C
Sbjct: 101 DTGSKAFWVNGISCKQCPHES-----DILRKLTFYDPRSSVSSKEVKCDDTICTSRPPC- 154
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N CPY Y + + G+L D+LH N SV GCG++QSG +
Sbjct: 155 NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNN 213
Query: 217 -GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSF 274
VA DG+IG G + S LA AG + FS C D + G IF G+ ++T
Sbjct: 214 SAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPI 273
Query: 275 LASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IVDSGSSFTFLPKEVY-ETIA 325
+ +N Y +++ +++ + + L+ T K +DSGS+ +LP+ +Y E I
Sbjct: 274 VKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELIL 331
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI--YGT 383
A F + + T+ + Y ++C + S K P + F + + V +++ G
Sbjct: 332 AVFAKHPDITMGAM--YNFQCFHFLGSVD-DKFPKITFHFENDLTLDVYPYDYLLEYEGN 388
Query: 384 QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Q GF A D+ +G ++ VV+D E +GW+ NC
Sbjct: 389 QYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHNC 434
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 151/378 (39%), Gaps = 44/378 (11%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + +F + L D G DL WI CD C CA Y + P
Sbjct: 201 GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPK 257
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 193
+ L + C+ +C+ C Y ++Y + +SS G+L D +H+I+ GG
Sbjct: 258 DLLCQELQGNQNYCE---TCKQ----CDYEIEY-ADRSSSMGVLARDDMHIITTNGGREK 309
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
L + GC Q G L A DG++GL IS+PS LA G+I N F C
Sbjct: 310 L------DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCIT 363
Query: 253 KDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKA 305
+D + G +F GD TS + + + G L S +
Sbjct: 364 RDPNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQV 423
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
I DSGSS+T+LP E+Y+ + A + + C ++ + L VK +F
Sbjct: 424 IFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCL-ATDFPVRYLEDVKQLF 482
Query: 366 --------------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
P+ + + +N + + V GF G +G N + G
Sbjct: 483 KPLNLHFGKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGK 542
Query: 412 RVVFDRENLKLGWSHSNC 429
VV+D + ++GW++S+C
Sbjct: 543 LVVYDNQQRQIGWTNSDC 560
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 151/378 (39%), Gaps = 44/378 (11%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + +F + L D G DL WI CD C CA Y + P
Sbjct: 202 GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPK 258
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 193
+ L + C+ +C+ C Y ++Y + +SS G+L D +H+I+ GG
Sbjct: 259 DLLCQELQGNQNYCE---TCKQ----CDYEIEY-ADRSSSMGVLARDDMHIITTNGGREK 310
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
L + GC Q G L A DG++GL IS+PS LA G+I N F C
Sbjct: 311 L------DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCIT 364
Query: 253 KDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKA 305
+D + G +F GD TS + + + G L S +
Sbjct: 365 RDPNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQV 424
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
I DSGSS+T+LP E+Y+ + A + + C ++ + L VK +F
Sbjct: 425 IFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCL-ATDFPVRYLEDVKQLF 483
Query: 366 --------------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
P+ + + +N + + V GF G +G N + G
Sbjct: 484 KPLNLHFGKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGK 543
Query: 412 RVVFDRENLKLGWSHSNC 429
VV+D + ++GW++S+C
Sbjct: 544 LVVYDNQQRQIGWTNSDC 561
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 168/378 (44%), Gaps = 51/378 (13%)
Query: 80 QFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 139
Q ++ PS+G D G D+LW+ +C+RC + + L +L +Y P+ S T+
Sbjct: 88 QIEIGSPSKGYYVQV---DTGSDILWV--NCIRCDGCPTT--SGLGIELTQYDPAGSGTT 140
Query: 140 KHLSCSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
+ C C L +C + PC + + Y + +S++G V D + N
Sbjct: 141 --VGCDQEFCVANSPNGLPPACPSTSSPCQFRI-AYGDGSSTTGFYVSDSVQYNQVSGNG 197
Query: 194 LKNSVQASVIIGCGMKQSGGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
AS+ GCG Q GG L A DG++G G + S+ S LA A +R F+ C
Sbjct: 198 QTTPSNASITFGCG-AQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256
Query: 252 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA---- 305
D G IF + T+ L N + Y + ++ +G + L+ ++F +
Sbjct: 257 DTVHGGGIFAIGNVVQPKVKTTPLVQNVTH--YNVNLQGISVGGATLQLPSSTFDSGDSK 314
Query: 306 --IVDSGSSFTFLPKEVYET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 362
I+DSG++ +LP+EVY T + A FD+ + + +++ + C++ S P V
Sbjct: 315 GTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQDF---VCFQFSGSIDDGFPVVT 371
Query: 363 LMF---------PQNNSFVVNNPVFVIYGTQVVTGFCL-AIQPVDG-DIGTIGQNFMTGY 411
F P + F N ++ + GF +Q DG D+ +G ++
Sbjct: 372 FSFEGEITLNVYPHDYLFQNENDLYCM-------GFLDGGVQTKDGKDMVLLGDLVLSNK 424
Query: 412 RVVFDRENLKLGWSHSNC 429
VV+D E +GW+ NC
Sbjct: 425 LVVYDLEKQVIGWADYNC 442
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/409 (24%), Positives = 173/409 (42%), Gaps = 47/409 (11%)
Query: 62 YQVLLSSDVQKQKMKTGPQFQ------MLFPSQGS----------KTMSLGN-------- 97
Y++ LS ++ +++ G Q + FP QG+ + LG
Sbjct: 9 YKLKLSKLKERDRVRHGRMLQSSGVGVVDFPVQGTFDPFLVGLYYTRLQLGTPPRDFYVQ 68
Query: 98 -DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---- 152
D G D+LW+ C P+++ L LN + P +S T+ +SCS + C LG
Sbjct: 69 IDTGSDVLWVSCGSCNGCPVNSG----LHIPLNFFDPGSSPTASLISCSDQRCSLGLQSS 124
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+ C C Y Y + + +SG V D+LH + ++ N+ A ++ GC Q+
Sbjct: 125 DSVCSAQNNLCGYNFQY-GDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQT 183
Query: 212 GGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPAT 268
G A DG+ G G ++SV S LA G+ +FS C DDSG + G+
Sbjct: 184 GDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEIVEPN 243
Query: 269 QQSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYET 323
T + S Y + + +T I S +S + I+DSG++ +L + Y+
Sbjct: 244 IVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLAEAAYDP 303
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGT 383
+ V+ ++ + CY SS P V L F S ++ ++I +
Sbjct: 304 FISAITSIVSPSVRPYLS-KGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQS 362
Query: 384 QV--VTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +C+ Q + G I +G + V+D N ++GW++ +C
Sbjct: 363 SIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGWANYDC 411
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/351 (26%), Positives = 155/351 (44%), Gaps = 28/351 (7%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C+ P ++ L LN + P +S+T+ +SCS ++C LG
Sbjct: 101 DTGSDVLWVSCNSCNGCPATSG----LQIPLNFFDPGSSTTASLVSCSDQICALGVQSSD 156
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
++C C Y Y + + +SG V D++HL D+++ ++ ASV+ GC Q+G
Sbjct: 157 SACFGQSNQCAYVFQY-GDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTG 215
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ 269
A DG+ G G ++SV S L+ G+ FS C DDSG + G+
Sbjct: 216 DLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIVEPNV 275
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCL--------KQTSFKAIVDSGSSFTFLPKEVY 321
T + S Y + +++ + L +S I+DSG++ +L +E Y
Sbjct: 276 VYTPLVPSQPH---YNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSGTTLAYLAEEAY 332
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
V+ + S CY +SS P V L F S V+ ++I
Sbjct: 333 NAFVVAVTNIVSQSTQSVV-LKGNRCYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQ 391
Query: 382 GTQV--VTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V T +C+ Q + G I +G + ++D N ++GW++ +C
Sbjct: 392 QNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWTNYDC 442
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 158/365 (43%), Gaps = 39/365 (10%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
SK L D G D++W+ C C C S +L DL Y+ SS+ K + C L
Sbjct: 83 SKDYYLQVDTGTDMMWVNCIQCKECPTRS-----NLGMDLTLYNIKESSSGKLVPCDQEL 137
Query: 149 CD-----LGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C L T C + CPY ++ Y + +S++G V+D++ + S SV
Sbjct: 138 CKEINGGLLTGCTSKTNDSCPY-LEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASANGSV 196
Query: 203 IIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
I GCG +QSG Y + A DG++G G S+ S L+ +G ++ F+ C + + G IF
Sbjct: 197 IFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNGVNGGGIF 256
Query: 261 -FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQTSFKAIVDSGS 311
G T +T L Y + ++ +G + L ++ S I+DSG+
Sbjct: 257 AIGHVVQPTVNTTPLLPDQPHYSVNMTAIQ---VGHTFLNLSTDASEQRDSKGTIIDSGT 313
Query: 312 SFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+ +LP +Y+ + + +Q N + + + C++ S P+V F S
Sbjct: 314 TLAYLPDGIYQPLVYKILSQQPNLKVQTL--HDEYTCFQYSGSVDDGFPNVTFYFENGLS 371
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDG------DIGTIGQNFMTGYRVVFDRENLKLGW 424
V ++ + +C+ Q ++ +G ++ V +D EN +GW
Sbjct: 372 LKVYPHDYLFLSENL---WCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGW 428
Query: 425 SHSNC 429
+ NC
Sbjct: 429 TEYNC 433
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 153/354 (43%), Gaps = 36/354 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C RC S L +L Y P SST +SC C
Sbjct: 107 DTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSSTGSKVSCDQGFCAATYGGL 161
Query: 157 NP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P PC Y++ Y + +S++G V D+L + ++V GCG +Q G
Sbjct: 162 LPGCTTSLPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGG 220
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
A DG+IG G S+ S L+ AG ++ F+ C D + G IF +
Sbjct: 221 DLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGIFAIGNVVQPKVK 280
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPKEVY-E 322
T+ L N + Y + +++ +G + LK S I+DSG++ T+LP+ VY E
Sbjct: 281 TTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKE 338
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF-VVNNPVFVIY 381
+ A F + + T + + + C++ + P + F + V + F
Sbjct: 339 IMLAVFAKHKDITFHNVQEF---LCFQYVGRVDDDFPKITFHFENDLPLNVYPHDYFFEN 395
Query: 382 GTQVVTGFCL-----AIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G + +C+ +Q DG + +G ++ VV+D EN +GW+ NC
Sbjct: 396 GDNL---YCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNC 446
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 157/363 (43%), Gaps = 53/363 (14%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC----DLG 152
D G D+LW+ C C C S + DL Y+P +SSTS ++C C D
Sbjct: 91 DTGSDILWVNCVGCSNCPKKS-----DIGVDLQLYNPKSSSTSTLITCDQPFCSATYDAP 145
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P C Y + Y + ++++G V D + L N + S++ GCG KQSG
Sbjct: 146 IPGCKPDLLCQYKV-IYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSG 204
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQ 270
A DG++G G S+ S LA G ++ F+ C D G IF G+ +
Sbjct: 205 ELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLK 264
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLK------QTSFK--AIVDSGSSFTFLPKEVY- 321
+T + + Y + GV+ +G + L +TS+K AI+DSG++ +LP +Y
Sbjct: 265 TTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPDSIYL 321
Query: 322 ----ETIAAEFD---RQVNDTITSF-------EGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
+ + A+ D R V+D T F +G+P S L ++P
Sbjct: 322 PLMEKILGAQPDLKLRTVDDQFTCFVFDKNVDDGFPTVTFKFEESLILT-------IYPH 374
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSH 426
F + + V+ + G Q Q DG ++ +G + V ++ EN +GW+
Sbjct: 375 EYLFQIRDDVWCV-GWQNS-----GAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTE 428
Query: 427 SNC 429
NC
Sbjct: 429 YNC 431
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 153/354 (43%), Gaps = 36/354 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C RC S L +L Y P SST +SC C
Sbjct: 22 DTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSSTGSKVSCDQGFCAATYGGL 76
Query: 157 NP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P PC Y++ Y + +S++G V D+L + ++V GCG +Q G
Sbjct: 77 LPGCTTSLPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGG 135
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
A DG+IG G S+ S L+ AG ++ F+ C D + G IF +
Sbjct: 136 DLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGIFAIGNVVQPKVK 195
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPKEVY-E 322
T+ L N + Y + +++ +G + LK S I+DSG++ T+LP+ VY E
Sbjct: 196 TTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKE 253
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF-VVNNPVFVIY 381
+ A F + + T + + + C++ + P + F + V + F
Sbjct: 254 IMLAVFAKHKDITFHNVQEF---LCFQYVGRVDDDFPKITFHFENDLPLNVYPHDYFFEN 310
Query: 382 GTQVVTGFCL-----AIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G + +C+ +Q DG + +G ++ VV+D EN +GW+ NC
Sbjct: 311 GDNL---YCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNC 361
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 156/363 (42%), Gaps = 39/363 (10%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
SK + D G D+LW+ C C RC S L DL Y AS+TS + C
Sbjct: 165 SKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKASTTSDAVGCDDNF 219
Query: 149 CDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C L C+ P C Y++ Y + +S++G V+D + N +V+
Sbjct: 220 CSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 277
Query: 205 GCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
GCG KQSG A DG++G G S+ S LA +G ++ FS C D D G IF
Sbjct: 278 GCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIG 337
Query: 264 QGPATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLP 317
+ + + + L N + + +G + + S + K I+DSG++ + P
Sbjct: 338 EVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFP 397
Query: 318 KEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+EVY + ++ + D +++ +F C+ + P+V L F ++ S
Sbjct: 398 QEVYVPLIEKILSQQPDLRLHTVEQAF------TCFDYTGNVDDGFPTVTLHFDKSISLT 451
Query: 373 VNNPVFVIYGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSH 426
V ++ Q +C+ Q DG D+ +G ++ VV+D E +GW
Sbjct: 452 VYPHEYLF---QHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVE 508
Query: 427 SNC 429
NC
Sbjct: 509 YNC 511
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 157/361 (43%), Gaps = 34/361 (9%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
SK + D G D+LW+ C C RC S L DL Y AS+TS + C
Sbjct: 84 SKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKASTTSDAVGCDDNF 138
Query: 149 CDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C L C+ P C Y++ Y + +S++G V+D + N +V+
Sbjct: 139 CSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 196
Query: 205 GCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
GCG KQSG A DG++G G S+ S LA +G ++ FS C D D G IF
Sbjct: 197 GCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIG 256
Query: 264 QGPATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLP 317
+ + + + L N + + +G + + S + K I+DSG++ + P
Sbjct: 257 EVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFP 316
Query: 318 KEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+EVY + ++ + D +++ +F C+ + P+V L F ++ S
Sbjct: 317 QEVYVPLIEKILSQQPDLRLHTVEQAF------TCFDYTGNVDDGFPTVTLHFDKSISLT 370
Query: 373 V--NNPVFVIYGTQVVTGF-CLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
V + +F + + G+ Q DG D+ +G ++ VV+D E +GW N
Sbjct: 371 VYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYN 430
Query: 429 C 429
C
Sbjct: 431 C 431
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 157/361 (43%), Gaps = 34/361 (9%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
SK + D G D+LW+ C C RC S L DL Y AS+TS + C
Sbjct: 165 SKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKASTTSDAVGCDDNF 219
Query: 149 CDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C L C+ P C Y++ Y + +S++G V+D + N +V+
Sbjct: 220 CSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 277
Query: 205 GCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
GCG KQSG A DG++G G S+ S LA +G ++ FS C D D G IF
Sbjct: 278 GCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIG 337
Query: 264 QGPATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLP 317
+ + + + L N + + +G + + S + K I+DSG++ + P
Sbjct: 338 EVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFP 397
Query: 318 KEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+EVY + ++ + D +++ +F C+ + P+V L F ++ S
Sbjct: 398 QEVYVPLIEKILSQQPDLRLHTVEQAF------TCFDYTGNVDDGFPTVTLHFDKSISLT 451
Query: 373 V--NNPVFVIYGTQVVTGF-CLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
V + +F + + G+ Q DG D+ +G ++ VV+D E +GW N
Sbjct: 452 VYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYN 511
Query: 429 C 429
C
Sbjct: 512 C 512
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 118/450 (26%), Positives = 181/450 (40%), Gaps = 41/450 (9%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
M+R+S I L VF L ++S A V + + + A+ +R + A
Sbjct: 1 MDRVSGLI-LIVFLLFVDASNANLVFPVQRKFNGPHRSLDAIKAHDDRRRGRFLAAIDVP 59
Query: 61 YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSAS 119
L S TG + + +K + D G D+LW+ C C C S
Sbjct: 60 LGGNGLPS-------STGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSG- 111
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTS 174
L DL Y P+ S TS + C C S C+ CPY++ Y + ++
Sbjct: 112 ----LGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITY-GDGST 165
Query: 175 SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEIS 232
+SG V D L N +SVI GCG KQSG A DG+IG G S
Sbjct: 166 TSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSS 225
Query: 233 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-----IG 287
V S LA +G ++ FS C D G IF Q + +T+ L + I +
Sbjct: 226 VLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVD 285
Query: 288 VETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWK 345
E + S + I+DSG++ +LP +Y + + RQ + E
Sbjct: 286 GEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVE--DQF 343
Query: 346 CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCL-----AIQPVDG-D 399
C+ S + P VK F + V + +Y + +C+ + Q +G D
Sbjct: 344 TCFHYSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKEDI---YCIGWQKSSTQTKEGRD 400
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ IG ++ VV+D EN+ +GW++ NC
Sbjct: 401 LILIGDLVLSNKLVVYDLENMVIGWTNFNC 430
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 162/353 (45%), Gaps = 34/353 (9%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G D+LW+ +C+RC + L +L +Y P+ S T+ + C C +
Sbjct: 102 DTGSDILWV--NCIRCDGCPTR--SGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGV 155
Query: 154 --SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+C + PC + + Y + ++++G V D + N + AS+ GCG Q
Sbjct: 156 PPTCPSTSSPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQL 213
Query: 212 GGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
GG L A DG++G G + S+ S LA A +R F+ C D G IF +
Sbjct: 214 GGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPK 273
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA------IVDSGSSFTFLPKEVY 321
T+ L N + Y + ++ +G + L+ ++F + I+DSG++ +LP+EVY
Sbjct: 274 VKTTPLVPNVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVY 331
Query: 322 ET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NNPVF 378
T +AA FD+ + + +++ + C++ S P + F + + V ++ +F
Sbjct: 332 RTLLAAVFDKYQDLPLHNYQDF---VCFQFSGSIDDGFPVITFSFKGDLTLNVYPDDYLF 388
Query: 379 VIYGTQVVTGFCL-AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
GF +Q DG D+ +G ++ VV+D E +GW+ NC
Sbjct: 389 QNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIGWTDYNC 441
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 157/363 (43%), Gaps = 53/363 (14%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC----DLG 152
D G D+LW+ C C C S + DL Y+P +SSTS ++C C D
Sbjct: 91 DTGSDILWVNCVGCSNCPKKS-----DIGVDLQLYNPKSSSTSTLITCDQPFCSATYDAP 145
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P C Y + Y + ++++G V D + L N + S++ GCG KQSG
Sbjct: 146 IPGCKPDLLCQYKV-IYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSG 204
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQ 270
A DG++G G S+ S LA G ++ F+ C D G IF G+
Sbjct: 205 ELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLX 264
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLK------QTSFK--AIVDSGSSFTFLPKEVY- 321
+T + + Y + GV+ +G + L +TS+K AI+DSG++ +LP+ +Y
Sbjct: 265 NTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPESIYL 321
Query: 322 ----ETIAAEFD---RQVNDTITSF-------EGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
+ + A+ D R V+D T F +G+P S L ++P
Sbjct: 322 PLMEKILGAQPDLKLRTVDDQFTCFVFDKNVDDGFPTVTFKFEESLILT-------IYPH 374
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSH 426
F + + V+ + G Q Q DG ++ +G + V ++ EN +GW+
Sbjct: 375 EYLFQIRDDVWCV-GWQNS-----GAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTE 428
Query: 427 SNC 429
NC
Sbjct: 429 YNC 431
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 162/353 (45%), Gaps = 34/353 (9%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G D+LW+ +C+RC + L +L +Y P+ S T+ + C C +
Sbjct: 102 DTGSDILWV--NCIRCDGCPTR--SGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGV 155
Query: 154 --SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+C + PC + + Y + ++++G V D + N + AS+ GCG Q
Sbjct: 156 PPTCPSTSSPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQL 213
Query: 212 GGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
GG L A DG++G G + S+ S LA A +R F+ C D G IF +
Sbjct: 214 GGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPK 273
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA------IVDSGSSFTFLPKEVY 321
T+ L N + Y + ++ +G + L+ ++F + I+DSG++ +LP+EVY
Sbjct: 274 VKTTPLVPNVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVY 331
Query: 322 ET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NNPVF 378
T +AA FD+ + + +++ + C++ S P + F + + V ++ +F
Sbjct: 332 RTLLAAVFDKYQDLPLHNYQDF---VCFQFSGSIDDGFPVITFSFEGDLTLNVYPDDYLF 388
Query: 379 VIYGTQVVTGFCL-AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
GF +Q DG D+ +G ++ VV+D E +GW+ NC
Sbjct: 389 QNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIGWTDYNC 441
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 160/365 (43%), Gaps = 37/365 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++W+ C C C S SL DL Y+ + S T K + C C Q
Sbjct: 96 DTGSDIMWVNCIQCRECPKTS-----SLGIDLTLYNINESDTGKLVPCDQEFCYEINGGQ 150
Query: 157 NP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P CPY ++ Y + +S++G V+D++ + + SVI GCG +QSG
Sbjct: 151 LPGCTANMSCPY-LEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGCGARQSG 209
Query: 213 --GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQ 269
G + A DG++G G S+ S LA G ++ F+ C D + G IF G
Sbjct: 210 DLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGGIFVIGHVVQPKV 269
Query: 270 QSTSFLASNGKY---ITYI-IGVETCCIGSSCLKQTSFK-AIVDSGSSFTFLPKEVYETI 324
T + + Y +T + +G E + + + K AI+DSG++ +LP+ VY+ +
Sbjct: 270 NMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSGTTLAYLPEMVYKPL 329
Query: 325 AAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV----NNPVF 378
++ Q D T + Y C++ S P+V F NS ++ + +F
Sbjct: 330 VSKIISQQPDLKVHTVRDEYT---CFQYSDSLDDGFPNVTFHF--ENSVILKVYPHEYLF 384
Query: 379 VIYGTQVVTGFCLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC------QD 431
G + +Q D ++ +G ++ V++D EN +GW+ NC QD
Sbjct: 385 PFEGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCSSSIQVQD 444
Query: 432 LNDGT 436
GT
Sbjct: 445 ERTGT 449
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 150/378 (39%), Gaps = 44/378 (11%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + +F + L D G DL WI CD C CA Y + P
Sbjct: 185 GQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPR 241
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 193
+ L + C+ +C+ C Y ++Y + +SS G+L D +H+I+ GG
Sbjct: 242 DLLCQELQGNQNYCE---TCKQ----CDYEIEY-ADQSSSMGVLARDDMHMIATNGGREK 293
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
L + GC Q G L A DG++GL IS PS LA G+I N F C
Sbjct: 294 L------DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCIT 347
Query: 253 KDD--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKA 305
++ G +F GD T +G Y G L++ ++ +
Sbjct: 348 REQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQV 407
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
I DSGSS+T+LP E+YE + A + C+K+ + L VK F
Sbjct: 408 IFDSGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWKADFP-VRYLEDVKQFF 466
Query: 366 -PQN-----------NSFVVNNPVFVIYGTQ--VVTGFCLAIQPVDGDIGTIGQNFMTGY 411
P N +F ++ ++I + V G + G +G + G
Sbjct: 467 EPLNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526
Query: 412 RVVFDRENLKLGWSHSNC 429
VV+D + ++GW+ S+C
Sbjct: 527 LVVYDNQRKQIGWADSDC 544
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 156/359 (43%), Gaps = 26/359 (7%)
Query: 98 DFGCDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G W+ C +C S + R L Y P +S +SK + C +C C
Sbjct: 77 DTGSKAFWVNGISCKQCPHES-----DILRKLTFYDPRSSVSSKEVKCDDTICTSRPPC- 130
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N CPY Y + + G+L D+LH N SV GCG++QSG +
Sbjct: 131 NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNN 189
Query: 217 -GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSF 274
VA DG+IG G + S LA AG + FS C D + G IF G+ ++T
Sbjct: 190 SAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPI 249
Query: 275 LASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IVDSGSSFTFLPKEVY-ETIA 325
+ +N Y +++ +++ + + L+ T K +DSGS+ +LP+ +Y E I
Sbjct: 250 VKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELIL 307
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI--YGT 383
A F + + T+ + Y ++C + S K P + F + + V +++ G
Sbjct: 308 AVFAKHPDITMGAM--YNFQCFHFLGSVD-DKFPKITFHFENDLTLDVYPYDYLLEYEGN 364
Query: 384 QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTP 442
Q GF A D+ +G ++ VV+D E +GW+ N + G L+P
Sbjct: 365 QYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHNSVEEACGGSEGLSP 423
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 148/364 (40%), Gaps = 44/364 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C N + L Y P+ SK + C HRL
Sbjct: 75 KPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT---KSKLVPCVHRL 121
Query: 149 CDL--------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSV 198
C C++P + C Y + Y + SS+G+LV D L L +G +
Sbjct: 122 CASLHNALTGGKHRCESPHEQCDYVIKY-ADQGSSTGVLVNDSFALRLTNG------SVA 174
Query: 199 QASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
+ SV GCG Q D +P DG++GLG G +S+ S L + G+ +N C G
Sbjct: 175 RPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGG 234
Query: 258 RIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
+FFGD Q++T + +A + Y G + G L K + DSGSSFT+
Sbjct: 235 FLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYF 294
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQNNS 370
+ Y+ + ++ T+ C +KS + S+ L F
Sbjct: 295 AAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKK 354
Query: 371 FVVNNP----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
++ P + V G + D+ IG M + V++D E K+GW
Sbjct: 355 TLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDHMVIYDNEKGKIGWIR 414
Query: 427 SNCQ 430
+ C
Sbjct: 415 APCD 418
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 150/378 (39%), Gaps = 44/378 (11%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + +F + L D G DL WI CD C CA Y + P
Sbjct: 185 GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKI---VPPR 241
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 193
+ L + C+ +C+ C Y ++Y + +SS G+L D +HLI+ GG
Sbjct: 242 DLLCQELQGNQNYCE---TCKQ----CDYEIEY-ADQSSSMGVLARDDMHLIATNGGREK 293
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
L + GC Q G L A DG++GL IS+PS LA G+I N F C
Sbjct: 294 L------DFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCIT 347
Query: 253 KDD--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKA 305
++ G +F GD T +G Y G L+ + +
Sbjct: 348 REQGGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQV 407
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
I DSGSS+T+LP E+YE + A + C+K+ + L VK F
Sbjct: 408 IFDSGSSYTYLPDEIYENLVAAIKYASPGFVQDSSDRTLPLCWKADFP-VRYLEDVKQFF 466
Query: 366 -PQN-----------NSFVVNNPVFVIYGTQ--VVTGFCLAIQPVDGDIGTIGQNFMTGY 411
P N +F ++ ++I + V G + G +G + G
Sbjct: 467 KPLNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526
Query: 412 RVVFDRENLKLGWSHSNC 429
VV+D + ++GW++S+C
Sbjct: 527 LVVYDNQRRQIGWTNSDC 544
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 160/366 (43%), Gaps = 55/366 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T +L D G L ++PC C +C +D N + P SST + L CS
Sbjct: 103 QTFALIVDTGSTLTYVPCSTCEQCGK---------HQDPN-FQPDWSSTYQPLKCS---- 148
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
+ +C + C Y Y E +SSSG+L EDI+ G + LK + GC
Sbjct: 149 -MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSF--GKQSELKPQ---RTVFGCENV 201
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGP 266
++G A DG++GLG G++S+ L + G+I NSFS+C+ D G + G P
Sbjct: 202 ETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISPP 260
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEV 320
A T + Y Y I ++ I L + I+DSG+++ +LP+
Sbjct: 261 AGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEPA 318
Query: 321 Y----ETIAAEF---------DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
+ + I E DR ND S G SQ P+V L+F
Sbjct: 319 FKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG-------SDVSQLSKTFPAVDLVFSN 371
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSH 426
N ++ ++ ++ +CL I + D T +G + V++DRE+LK+G+
Sbjct: 372 GNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWK 431
Query: 427 SNCQDL 432
+NC ++
Sbjct: 432 TNCSEI 437
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 160/366 (43%), Gaps = 55/366 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T +L D G L ++PC C +C +D N + P SST + L CS
Sbjct: 103 QTFALIVDTGSTLTYVPCSTCEQCGK---------HQDPN-FQPDWSSTYQPLKCS---- 148
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
+ +C + C Y Y E +SSSG+L EDI+ G + LK + GC
Sbjct: 149 -MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSF--GKQSELKPQ---RTVFGCENV 201
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGP 266
++G A DG++GLG G++S+ L + G+I NSFS+C+ D G + G P
Sbjct: 202 ETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISPP 260
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEV 320
A T + Y Y I ++ I L + I+DSG+++ +LP+
Sbjct: 261 AGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEPA 318
Query: 321 Y----ETIAAEF---------DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
+ + I E DR ND S G SQ P+V L+F
Sbjct: 319 FKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG-------SDVSQLSKTFPAVDLVFSN 371
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSH 426
N ++ ++ ++ +CL I + D T +G + V++DRE+LK+G+
Sbjct: 372 GNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWK 431
Query: 427 SNCQDL 432
+NC ++
Sbjct: 432 TNCSEI 437
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 152/357 (42%), Gaps = 38/357 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC----DLG 152
D G D LW+ C C C S L +L Y P++S TSK + C C D
Sbjct: 93 DTGSDTLWVNCVGCTTCPKKSG-----LGMELTLYDPNSSKTSKVVPCDDEFCTSTYDGP 147
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGMKQS 211
S CPY++ Y +T+S + +D+ + G + ++ SVI GCG KQS
Sbjct: 148 ISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDN--TSVIFGCGSKQS 205
Query: 212 GGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPAT 268
G + DG+IG G SV S LA AG ++ FS C D + G IF G+
Sbjct: 206 GTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGIFAIGEVVQPK 265
Query: 269 QQSTSFLASNGKYITYIIGVETCC----IGSSCLKQTSFKA-IVDSGSSFTFLPKEVYET 323
++T + Y + +E + + TS + I+DSG++ +LP +Y+
Sbjct: 266 VKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQ 325
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK-LPSVKLMF---------PQNNSFVV 373
+ + Q + + C + S + L P+VK F P + F
Sbjct: 326 LLEKTLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFPF 385
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ I G Q T Q DG D+ +G +T ++D +N+ +GW+ NC
Sbjct: 386 KEDMWCI-GWQKSTA-----QTKDGKDLILLGDLVLTNKLFIYDLDNMSIGWTDYNC 436
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 151/345 (43%), Gaps = 26/345 (7%)
Query: 98 DFGCDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G W+ C +C S + R L Y P +S +SK + C +C C
Sbjct: 77 DTGSKAFWVNGISCKQCPHES-----DILRKLTFYDPRSSVSSKEVKCDDTICTSRPPC- 130
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N CPY Y + + G+L D+LH N SV GCG++QSG +
Sbjct: 131 NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNN 189
Query: 217 -GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSF 274
VA DG+IG G + S LA AG + FS C D + G IF G+ ++T
Sbjct: 190 SAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPI 249
Query: 275 LASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IVDSGSSFTFLPKEVY-ETIA 325
+ +N Y +++ +++ + + L+ T K +DSGS+ +LP+ +Y E I
Sbjct: 250 VKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELIL 307
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI--YGT 383
A F + + T+ + Y ++C + S K P + F + + V +++ G
Sbjct: 308 AVFAKHPDITMGAM--YNFQCFHFLGSVD-DKFPKITFHFENDLTLDVYPYDYLLEYEGN 364
Query: 384 QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
Q GF A D+ +G ++ VV+D E +GW+ N
Sbjct: 365 QYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHN 409
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 151/345 (43%), Gaps = 26/345 (7%)
Query: 98 DFGCDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G W+ C +C S + R L Y P +S +SK + C +C C
Sbjct: 101 DTGSKAFWVNGISCKQCPHES-----DILRKLTFYDPRSSVSSKEVKCDDTICTSRPPC- 154
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N CPY Y + + G+L D+LH N SV GCG++QSG +
Sbjct: 155 NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNN 213
Query: 217 -GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSF 274
VA DG+IG G + S LA AG + FS C D + G IF G+ ++T
Sbjct: 214 SAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPI 273
Query: 275 LASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IVDSGSSFTFLPKEVY-ETIA 325
+ +N Y +++ +++ + + L+ T K +DSGS+ +LP+ +Y E I
Sbjct: 274 VKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELIL 331
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI--YGT 383
A F + + T+ + Y ++C + S K P + F + + V +++ G
Sbjct: 332 AVFAKHPDITMGAM--YNFQCFHFLGSVD-DKFPKITFHFENDLTLDVYPYDYLLEYEGN 388
Query: 384 QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
Q GF A D+ +G ++ VV+D E +GW+ N
Sbjct: 389 QYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHN 433
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 167/385 (43%), Gaps = 54/385 (14%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G F +L K+ L D G DL W+ CD C C + +Y P+
Sbjct: 192 GLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHV----------QYKPTR 241
Query: 136 SSTSKHLSCSHRLC-DLGTSCQNPKQP-----CPYTMDYYTENTSSSGLLVEDILHLISG 189
S+ +S LC D+ + +N C Y + Y +++SS G+LV D LHL++
Sbjct: 242 SNV---VSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTT 297
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFS 248
+ K +V+ GCG Q G L+ +A DG++GL ++S+P LA GLI+N
Sbjct: 298 NGSKTK----LNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVG 353
Query: 249 MCFDKDDS--GRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--- 299
C D + G +F GD ++ + Y T I+G+ G+ LK
Sbjct: 354 HCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGIN---YGNRQLKFDG 410
Query: 300 QTSF-KAIVDSGSSFTFLPKEVYETIAAEFDRQ-----VNDTITSFEGYPWKCCYKSSSQ 353
Q+ K DSGSS+T+ PKE Y + A + V D + W+ ++ S
Sbjct: 411 QSKVGKVFFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFQIRSI 470
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVI--YGTQVVTG---FCLAI----QPVDGDIGTIG 404
+ K L + + + + +F I G +++ CL I + DG +G
Sbjct: 471 KDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSKVNDGSSIILG 530
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNC 429
+ GY VV+D K+GW ++C
Sbjct: 531 DISLRGYSVVYDNVKQKIGWKRADC 555
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 165/358 (46%), Gaps = 27/358 (7%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+K+ + D G D++W+ C C +C S +L +L Y+ S + K +SC
Sbjct: 90 AKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESDSGKLVSCDDDF 144
Query: 149 CDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C + S CPY ++ Y + +S++G V+D++ S + + SVI
Sbjct: 145 CYQISGGPLSGCKANMSCPY-LEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIF 203
Query: 205 GCGMKQSGGYLDG---VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
GCG +QSG LD A DG++G G S+ S LA +G ++ F+ C D + G IF
Sbjct: 204 GCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFA 262
Query: 262 GDQGPATQQSTSFLASNGKY----ITYI-IGVETCCIGSSCLKQTSFK-AIVDSGSSFTF 315
+ + + + L N + +T + +G E I + + K AI+DSG++ +
Sbjct: 263 IGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAY 322
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
LP+ +YE + + Q +K C++ S + P+V F +N+ F+
Sbjct: 323 LPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNVTFHF-ENSVFLRVY 380
Query: 376 P---VFVIYGTQVVTGFCLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
P +F G + A+Q D ++ +G ++ V++D EN +GW+ NC
Sbjct: 381 PHDYLFPHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 154/375 (41%), Gaps = 56/375 (14%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA----S 136
MS+GN D G DL W+ CD CV C + Y N+ P S
Sbjct: 61 AMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLYRPTK---NKIVPCVDQLCS 117
Query: 137 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 196
S LS H+ C +PKQ C Y + Y + SS G+L+ D + L N
Sbjct: 118 SLHGGLSGKHK-------CDSPKQQCDYEIKY-ADQGSSLGVLLTDSFAV------RLAN 163
Query: 197 S--VQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 253
S V+ S+ GCG Q G VAP DG++GLG G IS+ S L + G+ +N C
Sbjct: 164 SSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLSI 223
Query: 254 DDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 312
G +FFGD P ++ + + + Y G + G L + ++DSGSS
Sbjct: 224 RGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSGSS 283
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFP 366
FT+ + Y+ + ++ T+ C +KS + S+ L F
Sbjct: 284 FTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPLCWKGKKPFKSVLDVKKEFKSLVLSFS 343
Query: 367 QNNSFVVNNPVFVIYGTQVVTGF---CLAIQPVDG------DIGTIGQNFMTGYRVVFDR 417
++ P +VT F CL I ++G D+ +G M V++D
Sbjct: 344 NGKKALMEIPP---ENYLIVTKFGNACLGI--LNGSEIGLKDLNIVGDITMQDQMVIYDN 398
Query: 418 ENLKLGWSHSNCQDL 432
E ++GW + C +
Sbjct: 399 ERGQIGWIRAPCDRI 413
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/386 (24%), Positives = 166/386 (43%), Gaps = 48/386 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C ++ P SS+ K L C+ +C
Sbjct: 98 DTGSTVTYVPCSTCKQCGKHQDP----------KFQPELSSSYKALKCNP-----DCNCD 142
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ + C Y Y E +SSSG+L ED LIS G+ + +A + GC ++G
Sbjct: 143 DEGKLCVYERRY-AEMSSSSGVLSED---LISFGNESQLTPQRA--VFGCENVETGDLFS 196
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++SV L G+I + FS+C+ + G + G P S
Sbjct: 197 QRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPAGMVFSH 255
Query: 275 LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
+ + Y I ++ + LK ++DSG+++ + PKE + I
Sbjct: 256 -SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIKDAI 314
Query: 329 DRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVI 380
+++ ++ G Y C+ + + + ++ P + + F +++ ++
Sbjct: 315 IKEI-PSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLF 372
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
T+V +CL I P +G + V +DREN KLG+ +NC DL +P
Sbjct: 373 RHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDLWRRLAAPE 432
Query: 441 TPGPGTP------SNPLPANQEQSSP 460
+P P +P SN P+ + SP
Sbjct: 433 SPAPTSPISQNKSSNISPSPAKSESP 458
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 147/363 (40%), Gaps = 43/363 (11%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C N + L Y P+ SK + C HRL
Sbjct: 77 KPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT---KSKLVPCVHRL 123
Query: 149 CDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSVQ 199
C C +P + C Y + Y + SS+G+L+ D L L +G + +
Sbjct: 124 CASLHNGLTGKHRCDSPHEQCDYVIKY-ADQGSSTGVLINDSFALRLTNG------SVAR 176
Query: 200 ASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
SV GCG Q D +P DG++GLG G +S+ S L + G+ +N C G
Sbjct: 177 PSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGF 236
Query: 259 IFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+FFGD Q++T + +A + Y G + G L K + DSGSSFT+
Sbjct: 237 LFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFA 296
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQNNSF 371
+ Y+ + ++ T+ C +KS + S+ L F
Sbjct: 297 AKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKT 356
Query: 372 VVNNP----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
++ P + V G + D+ IG M + V++D E K+GW +
Sbjct: 357 LMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDHMVIYDNEKGKIGWIRA 416
Query: 428 NCQ 430
C
Sbjct: 417 PCD 419
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 165/358 (46%), Gaps = 27/358 (7%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+K+ + D G D++W+ C C +C S +L +L Y+ S + K +SC
Sbjct: 90 AKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESDSGKLVSCDDDF 144
Query: 149 CDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C + S CPY ++ Y + +S++G V+D++ S + + SVI
Sbjct: 145 CYQISGGPLSGCKANMSCPY-LEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIF 203
Query: 205 GCGMKQSGGYLDGV---APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
GCG +QSG LD A DG++G G S+ S LA +G ++ F+ C D + G IF
Sbjct: 204 GCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFA 262
Query: 262 GDQGPATQQSTSFLASNGKY----ITYI-IGVETCCIGSSCLKQTSFK-AIVDSGSSFTF 315
+ + + + L N + +T + +G E I + + K AI+DSG++ +
Sbjct: 263 IGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSGTTLAY 322
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
LP+ +YE + + Q +K C++ S + P+V F +N+ F+
Sbjct: 323 LPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNVTFHF-ENSVFLRVY 380
Query: 376 P---VFVIYGTQVVTGFCLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
P +F G + A+Q D ++ +G ++ V++D EN +GW+ NC
Sbjct: 381 PHDYLFPYEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 167/387 (43%), Gaps = 54/387 (13%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G F +L K+ L D G DL W+ CD C+ C + Y P+
Sbjct: 190 GLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYK----------PTR 239
Query: 136 SSTSKHLSCSHRLC-DLGTSCQNPKQP-----CPYTMDYYTENTSSSGLLVEDILHLISG 189
S+ +S LC D+ + +N C Y + Y +++SS G+LV D LHL++
Sbjct: 240 SNV---VSSVDALCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTT 295
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFS 248
+ K +V+ GCG Q+G L+ + DG++GL ++S+P LA GLI+N
Sbjct: 296 NGSKTK----LNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVG 351
Query: 249 MCFDKDDS--GRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--- 299
C D + G +F GD ++ + Y T I+G+ G+ L+
Sbjct: 352 HCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGIN---YGNRQLRFDG 408
Query: 300 QTSF-KAIVDSGSSFTFLPKEVYETIAAEFDRQ-----VNDTITSFEGYPWKCCYKSSSQ 353
Q+ K + DSGSS+T+ PKE Y + A + V D + W+ + S
Sbjct: 409 QSKVGKMVFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFPIKSV 468
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVI--YGTQVVTG---FCLAI----QPVDGDIGTIG 404
+ K L + + + + +F I G +++ CL I DG +G
Sbjct: 469 KDVKDYFKTLTLRFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSNVNDGSSIILG 528
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNCQD 431
+ GY VV+D K+GW ++C D
Sbjct: 529 DISLRGYSVVYDNVKQKIGWKRADCVD 555
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/412 (26%), Positives = 175/412 (42%), Gaps = 65/412 (15%)
Query: 48 RNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIP 107
+ TS ++S Q+ L+S ++ + + ++ G K MSL D G DL W
Sbjct: 109 KAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVEL-----GGKNMSLIVDTGSDLTW-- 161
Query: 108 CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNP-------- 158
V+C P + Y ++ Y PS SS+ K + C+ C DL + N
Sbjct: 162 ---VQCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNG 214
Query: 159 --KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
K C Y + Y + + L E I+ GD L+N ++ GCG + + G
Sbjct: 215 VVKTTCEYVVSYGDGSYTRGDLASESIVL----GDTKLEN-----LVFGCG-RNNKGLFG 264
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQSTS 273
G + GL+GLG +S+ S K FS C + SG + FG+ + STS
Sbjct: 265 GAS--GLMGLGRSSVSLVSQTLKT--FNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTS 320
Query: 274 F----LASNGKYIT-YIIGVETCCIGSSCLKQTSFKA--IVDSGSSFTFLPKEVYETIAA 326
L N + + YI+ + IG LK SF ++DSG+ T LP +Y+ +
Sbjct: 321 VFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGILIDSGTVITRLPPSIYKAVKT 380
Query: 327 EFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
EF +Q F G+P C+ +S +P++K++F N V+
Sbjct: 381 EFLKQ-------FSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVF 433
Query: 380 IYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + CLA+ + + ++G IG RV++D +LG + NC
Sbjct: 434 YFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 162/376 (43%), Gaps = 36/376 (9%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 134
+TG F + +K + D G D+LW+ C C C S +L +L Y P
Sbjct: 86 ETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKS-----NLGIELTMYDPR 140
Query: 135 ASSTSKHLSCSHRLCDLGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S + + ++C + C P PC Y++ Y + +S++G V D L
Sbjct: 141 GSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISY-GDGSSTAGFFVTDFLQYNQVS 199
Query: 191 DNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
+ ASV GCG K G +A DG++G G S+ S LA AG +R F+
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259
Query: 250 CFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQ 300
C D + G IF G+ ++T + Y + G++ +G + L
Sbjct: 260 CLDTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSG 316
Query: 301 TSFKAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
S I+DSG++ ++P+ VY+ + A FD+ + ++ + + + C++ S P
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---CFQYSGSVDDGFP 373
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDI------GTIGQNFMTGYRV 413
V F + S +V+ ++ + + +C+ Q G G +G ++ V
Sbjct: 374 EVTFHFEGDVSLIVSPHDYLFQNGKNL--YCMGFQNGGGKTKDGKDLGLLGDLVLSNKLV 431
Query: 414 VFDRENLKLGWSHSNC 429
++D EN +GW+ NC
Sbjct: 432 LYDLENQAIGWADYNC 447
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/402 (25%), Positives = 157/402 (39%), Gaps = 68/402 (16%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSP 133
+ TG F +F K + L D G DL WI CD C C + S+Y P
Sbjct: 166 LGTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHY----------YP 215
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
SST +++SC C L +S C+ Q CPY DY + ++ E +
Sbjct: 216 KDSSTYRNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNL 275
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
+ + K V+ GCG G + GL+GLG G IS PS + + +SF
Sbjct: 276 TWPNGKEKFKQVVDVMFGCGHWNKGFFY---GASGLLGLGRGPISFPSQIQ--SIYGHSF 330
Query: 248 SMCF-----DKDDSGRIFFGDQGPATQQS----TSFLASNG--KYITYIIGVETCCIGSS 296
S C + S ++ FG+ T+ LA Y + +++ +G
Sbjct: 331 SYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGE 390
Query: 297 CL---KQT------------SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 341
L +QT I+DSGS+ TF P Y+ I F++++ + +
Sbjct: 391 VLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADD 450
Query: 342 YPWKCCYKSSSQRLP-KLPSVKLM--------FPQNNSFVVNNPVFVIYGTQVVTGFCLA 392
+ CY S + +LP + FP N F P VI CLA
Sbjct: 451 FVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVI---------CLA 501
Query: 393 IQ--PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
I P + IG + +++D + +LG+S C ++
Sbjct: 502 IMKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 543
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/397 (23%), Positives = 170/397 (42%), Gaps = 48/397 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C ++ P S++ + L C + +C
Sbjct: 94 DTGSTVTYVPCSTCKQCG----------KHQDPKFQPELSTSYQALKC-----NPDCNCD 138
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ + C Y Y E +SSSG+L ED LIS G+ + + +A + GC +++G
Sbjct: 139 DEGKLCVYERRY-AEMSSSSGVLSED---LISFGNESQLSPQRA--VFGCENEETGDLFS 192
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++SV L G+I + FS+C+ + G + G P S
Sbjct: 193 QRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPPGMVFSH 251
Query: 275 LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
+ + Y I ++ + LK ++DSG+++ + PKE + I
Sbjct: 252 -SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIKDAV 310
Query: 329 DRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVI 380
+++ ++ G Y C+ + + + ++ P + + F +++ ++
Sbjct: 311 IKEI-PSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLF 368
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
T+V +CL I P +G + V +DREN KLG+ +NC D+ +P
Sbjct: 369 RHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWRRLAAPE 428
Query: 441 TPGPGTP------SNPLPANQEQSSPGGHAVGPAVAG 471
+P P +P SN P+ SP H G G
Sbjct: 429 SPAPTSPISQNKSSNISPSPATSESPTSHLPGSLAFG 465
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 147/363 (40%), Gaps = 43/363 (11%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C N + L Y P+ SK + C HRL
Sbjct: 68 KPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT---KSKLVPCVHRL 114
Query: 149 CDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSVQ 199
C C +P + C Y + Y + SS+G+L+ D L L +G + +
Sbjct: 115 CASLHNGLTGKHRCDSPHEQCDYVIKY-ADQGSSTGVLINDSFALRLTNG------SVAR 167
Query: 200 ASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
SV GCG Q D +P DG++GLG G +S+ S L + G+ +N C G
Sbjct: 168 PSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGF 227
Query: 259 IFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+FFGD Q++T + +A + Y G + G L K + DSGSSFT+
Sbjct: 228 LFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFA 287
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQNNSF 371
+ Y+ + ++ T+ C +KS + S+ L F
Sbjct: 288 AKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKT 347
Query: 372 VVNNP----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
++ P + V G + D+ IG M + V++D E K+GW +
Sbjct: 348 LMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDHMVIYDNEKGKIGWIRA 407
Query: 428 NCQ 430
C
Sbjct: 408 PCD 410
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 169/392 (43%), Gaps = 48/392 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C ++ P S++ + L C + +C
Sbjct: 94 DTGSTVTYVPCSTCKQCG----------KHQDPKFQPELSTSYQALKC-----NPDCNCD 138
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ + C Y Y E +SSSG+L ED LIS G+ + + +A + GC +++G
Sbjct: 139 DEGKLCVYERRY-AEMSSSSGVLSED---LISFGNESQLSPQRA--VFGCENEETGDLFS 192
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++SV L G+I + FS+C+ + G + G P S
Sbjct: 193 QRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPPGMVFSH 251
Query: 275 LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
+ + Y I ++ + LK ++DSG+++ + PKE + I
Sbjct: 252 -SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIKDAV 310
Query: 329 DRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVI 380
+++ ++ G Y C+ + + + ++ P + + F +++ ++
Sbjct: 311 IKEI-PSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLF 368
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
T+V +CL I P +G + V +DREN KLG+ +NC D+ +P
Sbjct: 369 RHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWRRLAAPE 428
Query: 441 TPGPGTP------SNPLPANQEQSSPGGHAVG 466
+P P +P SN P+ SP H G
Sbjct: 429 SPAPTSPISQNKSSNISPSPATSESPTSHLPG 460
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 160/364 (43%), Gaps = 27/364 (7%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 142
L+ S+ +L D G + ++PC C +C + N ++ + P SST +
Sbjct: 95 LYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPV 154
Query: 143 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C ++ +C N + C Y Y E +SSSG+L EDI+ G ++ LK
Sbjct: 155 KC-----NVDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSF--GKESELK---PQRA 203
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
+ GC ++G A DG++GLG G++S+ L + G+I +SFS+C+ D G
Sbjct: 204 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 262
Query: 263 DQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTF 315
G F SN + Y I ++ + L+ + ++DSG+++ +
Sbjct: 263 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 322
Query: 316 LPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNN 369
LP++ + +VN I + C+ + + + +L P V ++F
Sbjct: 323 LPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 382
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSN 428
++ ++ ++V +CL + D T +G + V +DR N K+G+ +N
Sbjct: 383 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTN 442
Query: 429 CQDL 432
C +L
Sbjct: 443 CSEL 446
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/421 (23%), Positives = 172/421 (40%), Gaps = 56/421 (13%)
Query: 52 SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS-------------------KT 92
++P+ E ++ ++ ++M + + FP +G+ +
Sbjct: 30 AFPSNDGVELSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVGLYYTKVKLGTPPRE 89
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
+ + D G D+LW+ C P ++ L LN + P +SSTS +SC R C G
Sbjct: 90 LYVQIDTGSDVLWVSCGSCNGCPQTSG----LQIQLNYFDPGSSSTSSLISCLDRRCRSG 145
Query: 153 T-----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
SC C YT Y + + +SG V D++H S + L + ASV+ GC
Sbjct: 146 VQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCS 204
Query: 208 MKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--RIFFGD- 263
+ Q+G A DG+ G G +SV S L+ G+ FS C D+SG + G+
Sbjct: 205 ILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLVLGEI 264
Query: 264 ------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGS 311
P + ++ NG+ I+ + +S + T IVDSG+
Sbjct: 265 VEPNIVYSPLVPSQPHYNLNLQSISVNGQ----IVRIAPSVFATSNNRGT----IVDSGT 316
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
+ +L +E Y + ++ S +C ++S + P V L F S
Sbjct: 317 TLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASL 376
Query: 372 VVNNPVFVIYGTQVVTG--FCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
V+ +++ + G +C+ Q + G I +G + V+D ++GW++ +
Sbjct: 377 VLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYDLAGQRIGWANYD 436
Query: 429 C 429
C
Sbjct: 437 C 437
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 160/364 (43%), Gaps = 27/364 (7%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 142
L+ S+ +L D G + ++PC C +C + N ++ + P SST +
Sbjct: 96 LYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPV 155
Query: 143 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C ++ +C N + C Y Y E +SSSG+L EDI+ G ++ LK
Sbjct: 156 KC-----NVDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSF--GKESELK---PQRA 204
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
+ GC ++G A DG++GLG G++S+ L + G+I +SFS+C+ D G
Sbjct: 205 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 263
Query: 263 DQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTF 315
G F SN + Y I ++ + L+ + ++DSG+++ +
Sbjct: 264 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 323
Query: 316 LPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNN 369
LP++ + +VN I + C+ + + + +L P V ++F
Sbjct: 324 LPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 383
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSN 428
++ ++ ++V +CL + D T +G + V +DR N K+G+ +N
Sbjct: 384 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTN 443
Query: 429 CQDL 432
C +L
Sbjct: 444 CSEL 447
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 162/391 (41%), Gaps = 56/391 (14%)
Query: 65 LLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYN 122
LLS DV TG + + +K L D G DL W+ CD C C N
Sbjct: 46 LLSGDV----YPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC--------N 93
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTM--DY---YTENTSSSG 177
+ L Y P+ + K + C++ +C S +P + C DY YT+ SS G
Sbjct: 94 KVPHPL--YRPTKN---KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLG 148
Query: 178 LLVEDILHLISGGDNALKN--SVQASVIIGCGMKQSGGYLDGVAP---DGLIGLGLGEIS 232
+LV D L L+N +V+ S+ GCG Q G +G AP DGL+GLG G +S
Sbjct: 149 VLVTDSFSL------PLRNKSNVRPSLSFGCGYDQQVGK-NGAAPATTDGLLGLGRGSVS 201
Query: 233 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST---SFLASNGKYITYIIGVE 289
+ S L + G+ +N C G +FFGD T + T +++G Y Y G
Sbjct: 202 LLSQLKQQGITKNVLGHCLSTSGGGFLFFGDDMVPTSRVTWVPMVRSTSGNY--YSPGSA 259
Query: 290 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC-- 347
T L + + DSGS++T+ + Y+ + ++ ++ C
Sbjct: 260 TLYFDRRSLSTKPMEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWK 319
Query: 348 ----YKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG----- 398
+KS S S++ +F +N + ++I CL I +DG
Sbjct: 320 GQKAFKSVSDVKKDFKSLQFIFGKNAVMEIPPENYLIVTKN--GNVCLGI--LDGSAAKL 375
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
IG M V++D E +LGW +C
Sbjct: 376 SFSIIGDITMQDQMVIYDNEKAQLGWIRGSC 406
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 149/384 (38%), Gaps = 56/384 (14%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + +F + L D G DL WI CD C CA Y + P
Sbjct: 192 GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPR 248
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 193
+ L C +C+ C Y ++Y + +SS G+L +D +H+I+ GG
Sbjct: 249 DLLCQELQGDQNYC---ATCKQ----CDYEIEY-ADRSSSMGVLAKDDMHMIATNGGREK 300
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
L + GC Q G L A DG++GL IS+PS LA G+I N F C
Sbjct: 301 L------DFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCIT 354
Query: 253 KDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKA 305
K+ + G +F GD T G Y + G L+ +S +
Sbjct: 355 KEPNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQV 414
Query: 306 IVDSGSSFTFLPKEVYE----TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
I DSGSS+T+LP E+Y+ I ++ V DT + WK + + L V
Sbjct: 415 IFDSGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTTLPLCWKADFD-----VRYLEDV 469
Query: 362 KLMF-PQNNSFVVNNPVFVIYGT---------------QVVTGFCLAIQPVDGDIGTIGQ 405
K F P N F N FVI T V G + +G
Sbjct: 470 KQFFKPLNLHF--GNRWFVIPRTFTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGD 527
Query: 406 NFMTGYRVVFDRENLKLGWSHSNC 429
+ G VV+D E ++GW+ S C
Sbjct: 528 VSLRGKLVVYDNERRQIGWADSEC 551
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 151/364 (41%), Gaps = 50/364 (13%)
Query: 95 LGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
L D G L WI CD C C Y ++ P S + L + CD
Sbjct: 144 LDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKENI---VPPRDSHCQELQGNQNYCD-- 198
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+C+ C Y + Y + +SS+G+L D + LI+ D +N ++ GC Q G
Sbjct: 199 -TCKQ----CDYEI-AYADRSSSAGVLARDNMELITA-DGEREN---MDLVFGCAHDQQG 248
Query: 213 GYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ 269
L A DG++GL G +S+P+ LAK G+I N F C D SG +F GD
Sbjct: 249 KLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSAYMFLGDDYVPRW 308
Query: 270 QSTSFLASNGK---YITYIIGVETCCIGSSCLKQTS--FKAIVDSGSSFTFLPKEVY--- 321
T NG Y T + V C + +Q + I DSGSS+T+ P E+Y
Sbjct: 309 GMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSSYTYFPHEIYTSL 368
Query: 322 ----ETIAAEFDRQVNDTITSF---EGYPWKCCYKSSSQRLPKL---PSVKLMFPQNNSF 371
E ++ F R +D F +P + P L L+ P+
Sbjct: 369 ITSLEAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLLHFSKTWLVIPRTFEI 428
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQPVDG-DIG-----TIGQNFMTGYRVVFDRENLKLGWS 425
N +I G V CL + +DG +IG IG + G V +D + ++GW+
Sbjct: 429 SPEN-YLIISGKGNV---CLGV--LDGTEIGHSSTIVIGDVSLRGKLVAYDNDANQIGWA 482
Query: 426 HSNC 429
S+C
Sbjct: 483 QSDC 486
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 109/428 (25%), Positives = 182/428 (42%), Gaps = 48/428 (11%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV----QKQKMKTGPQFQ 82
F + H+F+ + K L K+ + SF + ++L + D+ + G F
Sbjct: 28 FVFNVTHKFAGKEKQLSELKSHD--------SFRHARMLANIDLPLGGDSRADSIGLYFT 79
Query: 83 MLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 141
+ K + D G D+LW+ C C +C P+ L L+ Y ASSTSK+
Sbjct: 80 KIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVKTD----LGIPLSLYDSKASSTSKN 134
Query: 142 LSCSHRLCDLGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
+ C C + K+PC Y + Y + ++S G V+D + L N +
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHV-VYGDGSTSDGDFVKDNITLDQVTGNLRTAPLA 193
Query: 200 ASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
V+ GCG QSG G + A DG++G G SV S LA G ++ FS C D + G
Sbjct: 194 QEVVFGCGKNQSGQLGQTES-AVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGG 252
Query: 258 RIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-------IVDS 309
IF G+ ++T + + Y + G++ G S + I+DS
Sbjct: 253 GIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDV--DGEPIDLPPSLASTNGDGGTIIDS 310
Query: 310 GSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
G++ +LP+ +Y E I A+ +++ +F C+ +S P V L F
Sbjct: 311 GTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETF------ACFSFTSNTDKAFPVVNLHF 364
Query: 366 PQNNSFVV--NNPVFVIYGTQVVTGF-CLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLK 421
+ V ++ +F + G+ + DG D+ +G ++ VV+D EN
Sbjct: 365 EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 424
Query: 422 LGWSHSNC 429
+GW+ NC
Sbjct: 425 IGWADHNC 432
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 151/358 (42%), Gaps = 43/358 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----L 151
D G D+LW+ C C C S L LN + ++SST++ + CSH +C
Sbjct: 99 DTGSDVLWVTCSSCSNCPQTSG-----LGIQLNYFDTTSSSTARLVPCSHPICTSQIQTT 153
Query: 152 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
T C C Y Y + + +SG V D + + +L + A+++ GC QS
Sbjct: 154 ATQCPPQSNQCSYAFQY-GDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQS 212
Query: 212 GGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGD----- 263
G A DG+ G G GE+SV S L+ G+ FS C +DSG + G+
Sbjct: 213 GDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGGGILVLGEILEPG 272
Query: 264 --------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTF 315
P +A +G+ ++ ++ +S + T I+D+G++ +
Sbjct: 273 IVYSPLVPSQPHYNLDLQSIAVSGQ----LLPIDPAAFATSSNRGT----IIDTGTTLAY 324
Query: 316 LPKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
L +E Y+ + V+ T T +G CY S+ P V F + ++
Sbjct: 325 LVEEAYDPFVSAITAAVSQLATPTINKG---NQCYLVSNSVSEVFPPVSFNFAGGATMLL 381
Query: 374 NNPVFVIYGTQVVTG--FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+++Y T +C+ Q + G I +G + V+D + ++GW++ +C
Sbjct: 382 KPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 158/363 (43%), Gaps = 52/363 (14%)
Query: 98 DFGCDLLWI-PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-- 154
D G D+LW+ C C S L +L +Y P+ S T+ + C C ++
Sbjct: 103 DTGSDILWVNGISCDGCPTRSG-----LGIELTQYDPAGSGTT--VGCEQEFCVANSAAS 155
Query: 155 -----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
C + PC + + Y + +S++G V D + N S+ GCG
Sbjct: 156 GVPPACPSAASPCQFRITY-GDGSSTTGFYVTDFVQYNQVSGNGQTTPSNVSITFGCG-A 213
Query: 210 QSGGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGP 266
Q GG L A DG++G G + S+ S LA A +R F+ C D G IF G+
Sbjct: 214 QLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGGGIFAIGNVVQ 273
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA------IVDSGSSFTFLPK 318
T+ L N + Y + ++ +G + L+ ++F + I+DSG++ +LP+
Sbjct: 274 PPIVKTTPLVPNATH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPR 331
Query: 319 EVYET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF---------PQN 368
EVY T + A FD+ + + ++E + C++ S + P + F P +
Sbjct: 332 EVYRTLLTAVFDKHPDLAVRNYEDF---ICFQFSGSLDEEFPVITFSFEGDLTLNVYPHD 388
Query: 369 NSFVVNNPVFVIYGTQVVTGFCL-AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSH 426
F N ++ + GF +Q DG D+ +G ++ VV+D E +GW+
Sbjct: 389 YLFQNGNDLYCM-------GFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWTD 441
Query: 427 SNC 429
NC
Sbjct: 442 YNC 444
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 153/353 (43%), Gaps = 27/353 (7%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--- 154
D G D+LW+ C P+++ L L + P +S+T+ +SCS + C G
Sbjct: 102 DTGSDVLWVSCSSCNGCPVTSG----LQIPLTFFDPGSSTTAALVSCSDQRCTAGIQSSD 157
Query: 155 --CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL----ISGGD-NALKNSVQASVIIGCG 207
C + C YT Y + + +SG V D++HL +S G+ + + + +SV C
Sbjct: 158 SLCSSRTNQCGYTFQY-GDGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSVSFMCS 216
Query: 208 MKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQ 264
Q+G A DG+ G G E+SV S LA G+ FS C DDS G + G+
Sbjct: 217 TLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDSGGGVLVLGEI 276
Query: 265 GPATQQSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKE 319
T + S Y Y+ + +T I S +S + IVDSG++ +L +
Sbjct: 277 VEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSNQGTIVDSGTTLAYLAEG 336
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y+ + V+ ++ + CY +S P V L F S ++N ++
Sbjct: 337 AYDPFVSAITSVVSLNARTYLSKGNQ-CYLVTSSVNDVFPQVSLNFAGGASLILNPQDYL 395
Query: 380 IYGTQV--VTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ V +C+ Q G I +G + V+D N ++GW++ +C
Sbjct: 396 LQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDC 448
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 162/391 (41%), Gaps = 56/391 (14%)
Query: 65 LLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYN 122
LLS DV TG + + +K L D G DL W+ CD C C N
Sbjct: 46 LLSGDV----YPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC--------N 93
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTM--DY---YTENTSSSG 177
+ L Y P+ + K + C++ +C S +P + C DY YT+ SS G
Sbjct: 94 KVPHPL--YRPTKN---KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLG 148
Query: 178 LLVEDILHLISGGDNALKN--SVQASVIIGCGMKQSGGYLDGVAP---DGLIGLGLGEIS 232
+LV D L L+N +V+ S+ GCG Q G +G AP DGL+GLG G +S
Sbjct: 149 VLVMDSFSL------PLRNKSNVRPSLSFGCGYDQQVGK-NGAAPATTDGLLGLGRGSVS 201
Query: 233 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST---SFLASNGKYITYIIGVE 289
+ S L + G+ +N C G +FFGD T + T +++G Y Y G
Sbjct: 202 LLSQLKQQGITKNVLGHCLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNY--YSPGSA 259
Query: 290 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC-- 347
T L + + DSGS++T+ + Y+ + ++ ++ C
Sbjct: 260 TLYFDRRSLSTKPMEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWK 319
Query: 348 ----YKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG----- 398
+KS S S++ +F +N + ++I CL I +DG
Sbjct: 320 GQKAFKSVSDVKKDFKSLQFIFGKNAVMDIPPENYLIITKN--GNVCLGI--LDGSAAKL 375
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
IG M V++D E +LGW +C
Sbjct: 376 SFSIIGDITMQDQMVIYDNEKAQLGWIRGSC 406
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 109/425 (25%), Positives = 180/425 (42%), Gaps = 45/425 (10%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV----QKQKMKTGPQFQ 82
F K H+F+ K +N + + + + ++L S D+ + G F
Sbjct: 25 FVFKAQHKFA--------GKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFT 76
Query: 83 MLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 141
+ K + D G D+LWI C C +C + +L+ L+ + +ASSTSK
Sbjct: 77 KIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKT-----NLNFRLSLFDMNASSTSKK 131
Query: 142 LSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 198
+ C C SCQ P C Y + Y E+TS G + D+L L + +
Sbjct: 132 VGCDDDFCSFISQSDSCQ-PALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKTGPL 189
Query: 199 QASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
V+ GCG QSG +G A DG++G G SV S LA G + FS C D G
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 258 RIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC-LKQTSFK---AIVDSGSS 312
IF G ++T + + Y ++G++ G+S L ++ + IVDSG++
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV--DGTSLDLPRSIVRNGGTIVDSGTT 307
Query: 313 FTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
+ PK +Y ETI A +++ +F+ C+ S+ P V F +
Sbjct: 308 LAYFPKVLYDSLIETILARQPVKLHIVEETFQ------CFSFSTNVDEAFPPVSFEFEDS 361
Query: 369 NSFVV--NNPVFVIYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGW 424
V ++ +F + G+ D ++ +G ++ VV+D +N +GW
Sbjct: 362 VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421
Query: 425 SHSNC 429
+ NC
Sbjct: 422 ADHNC 426
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 159/403 (39%), Gaps = 55/403 (13%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
MS+GN D G DL W+ CD CV C+ + Y N+ P
Sbjct: 61 AMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTK---NKLVPCVDQMCA 117
Query: 141 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--V 198
L H C +PKQ C Y + Y + SS G+LV D L L NS V
Sbjct: 118 AL---HGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL------RLANSSIV 167
Query: 199 QASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
+ + GCG Q G V A DG++GLG G +S+ S L + G+ +N C G
Sbjct: 168 RPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGG 227
Query: 258 RIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
+FFGD P ++ + + +A + Y G G L + + DSGSSFT+
Sbjct: 228 FLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYF 287
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQNNS 370
+ Y+ + ++ + + C +KS + +V L F
Sbjct: 288 SAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKK 347
Query: 371 FVVNNP-----VFVIYGTQVVTGFCLAIQPVDG------DIGTIGQNFMTGYRVVFDREN 419
++ P + YG CL I ++G D+ +G M V++D E
Sbjct: 348 ALMEIPPENYLIVTKYGNA-----CLGI--LNGSEVGLKDLNIVGDITMQDQMVIYDNER 400
Query: 420 LKLGWSHSNCQDL-NDGTKSPLTPGPGTPSNP--LPANQEQSS 459
++GW + C + ND T G P P + EQS+
Sbjct: 401 GQIGWIRAPCDRIPNDNTIHGFEDGYCWPQFPNIIGYQNEQSA 443
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 156/368 (42%), Gaps = 48/368 (13%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C +C S L DL Y P ASS+ +SC C +
Sbjct: 105 DTGSDILWVNCISCSKCPRKSG-----LGLDLTFYDPKASSSGSTVSCDQGFCAATYGGK 159
Query: 157 NP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P PC Y++ Y + +S++G + D L + A++ GCG +Q G
Sbjct: 160 LPGCTANVPCEYSV-MYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATITFGCGAQQGG 218
Query: 213 GYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQ 270
+ A DG++G G S+ S LA AG + F+ C D G IF G+
Sbjct: 219 DLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGIFAIGNVVQPKCY 278
Query: 271 STSFLASNGKYI-------------TYIIGVETCCIGSSCLK------QTSFK--AIVDS 309
F A I Y + +++ +G + L+ +T K I+DS
Sbjct: 279 FVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVFETGEKKGTIIDS 338
Query: 310 GSSFTFLPKEVYETIA-AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
G++ T+LP+ V++ + F + + + + + C++ S P++ F +
Sbjct: 339 GTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDF---LCFQYSGSVDDGFPTITFHFEDD 395
Query: 369 NSF-VVNNPVFVIYGTQVVTGFCL-----AIQPVDG-DIGTIGQNFMTGYRVVFDRENLK 421
+ V + F G + +C+ A+Q DG DI +G ++ VV+D EN
Sbjct: 396 LALHVYPHEYFFPNGNDI---YCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENQV 452
Query: 422 LGWSHSNC 429
+GW+ NC
Sbjct: 453 IGWTDYNC 460
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 156/381 (40%), Gaps = 47/381 (12%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 135
+TG F + +K+ + D G D+LW+ C P + L +L Y PS
Sbjct: 77 ETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSG----LGIELTLYDPSG 132
Query: 136 SSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
SS+ ++C C + SC P PC Y++ Y + +S++G V D L
Sbjct: 133 SSSGTGVTCGQDFCVATHGGVIPSCV-PAAPCQYSISY-GDGSSTTGFFVTDFLQYNQVS 190
Query: 191 DNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
N+ S+ GCG K G A DG++G G S+ S LA AG +R F+
Sbjct: 191 GNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAH 250
Query: 250 CFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------QT 301
C D + G IF + ST+ L + Y + +E +G L+
Sbjct: 251 CLDTINGGGIFAIGDVVQPKVSTTPLVPGMPH--YNVNLEAIDVGGVKLQLPTNIFDIGE 308
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC-----CYKSSSQRLP 356
S I+DSG++ +LP VY I ++ Q D P K C++ S
Sbjct: 309 SKGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDM-------PLKNDQDFQCFRYSGSVDD 361
Query: 357 KLPSVKLMF----PQN---NSFVVNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFM 408
P + F P N + ++ N G Q TG +Q DG D+ +G
Sbjct: 362 GFPIITFHFEGGLPLNIHPHDYLFQNGELYCMGFQ--TG---GLQTKDGKDMVLLGDLAF 416
Query: 409 TGYRVVFDRENLKLGWSHSNC 429
+ V++D EN +GW+ NC
Sbjct: 417 SNRLVLYDLENQVIGWTDYNC 437
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 149/361 (41%), Gaps = 35/361 (9%)
Query: 91 KTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
K L D G D++W+ C C C S SL DL Y SS+ K + C C
Sbjct: 94 KNYYLQVDTGSDIMWVNCIQCKECPTRS-----SLGMDLTLYDIKESSSGKLVPCDQEFC 148
Query: 150 D-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
L T C CPY ++ Y + +S++G V+DI+ + +S S++
Sbjct: 149 KEINGGLLTGC-TANISCPY-LEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVF 206
Query: 205 GCGMKQSGGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-F 261
GCG +QSG + A DG++G G S+ S LA +G ++ F+ C + + G IF
Sbjct: 207 GCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAI 266
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-----IVDSGSSFTFL 316
G T L Y + V+ S TS + I+DSG++ +L
Sbjct: 267 GHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSGTTLAYL 326
Query: 317 PKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
P+ +YE + + Q D T + Y C++ S P+V F S V
Sbjct: 327 PEGIYEPLVYKMISQHPDLKVQTLHDEYT---CFQYSESVDDGFPAVTFFFENGLSLKVY 383
Query: 375 NPVFVIYGTQVVTGFCLAIQPV------DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
++ V +C+ Q ++ +G ++ V +D EN +GW+ N
Sbjct: 384 PHDYLF---PSVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWAEYN 440
Query: 429 C 429
C
Sbjct: 441 C 441
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 117/456 (25%), Positives = 190/456 (41%), Gaps = 62/456 (13%)
Query: 3 RISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYY 62
R L I +AVF ++ E + F K+ H+F+ K + + + + +
Sbjct: 4 RRKLCIVVAVFVIVNEFASGN---FVFKVQHKFA--------GKEKKLEHFKSHDTRRHS 52
Query: 63 QVLLSSDV----QKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLS 117
++L S D+ + G F + K + D G D+LW+ C C C +
Sbjct: 53 RMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKT 112
Query: 118 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTS 174
+L+ L+ + +ASSTSK + C C SCQ P C Y + Y E+TS
Sbjct: 113 -----NLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQ-PAVGCSYHIVYADESTS 166
Query: 175 SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEIS 232
G + D L L + + V+ GCG QSG G D A DG++G G S
Sbjct: 167 E-GNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDS-AVDGVMGFGQSNTS 224
Query: 233 VPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETC 291
V S LA G + FS C D G IF G ++T + + Y ++G++
Sbjct: 225 VLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMD-- 282
Query: 292 CIGSSCLK-----QTSFKAIVDSGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGY 342
+ + L + IVDSG++ + PK +Y ETI A +++ +F+
Sbjct: 283 -VDGTALDLPPSIMRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEDTFQ-- 339
Query: 343 PWKCCYKSSSQRLPKLP--------SVKL-MFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
C+ S P SVKL ++P + F + ++ +G Q G
Sbjct: 340 ----CFSFSENVDVAFPPVSFEFEDSVKLTVYPHDYLFTLEKELYC-FGWQ-AGGLTTGE 393
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++ +G ++ VV+D EN +GW+ NC
Sbjct: 394 RT---EVILLGDLVLSNKLVVYDLENEVIGWADHNC 426
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 52/373 (13%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
MS+GN D G DL W+ CD CV C+ + Y N+ P
Sbjct: 61 AMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTK---NKLVPCVDQMCA 117
Query: 141 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--V 198
L H C +PKQ C Y + Y + SS G+LV D L L NS V
Sbjct: 118 AL---HGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL------RLANSSIV 167
Query: 199 QASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
+ + GCG Q G V A DG++GLG G +S+ S L + G+ +N C G
Sbjct: 168 RPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGG 227
Query: 258 RIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
+FFGD P ++ + + +A + Y G G L + + DSGSSFT+
Sbjct: 228 FLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYF 287
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQNNS 370
+ Y+ + ++ + + C +KS + +V L F
Sbjct: 288 SAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTVVLSFSNGKK 347
Query: 371 FVVNNP-----VFVIYGTQVVTGFCLAIQPVDG------DIGTIGQNFMTGYRVVFDREN 419
++ P + YG CL I ++G D+ +G M V++D E
Sbjct: 348 ALMEIPPENYLIVTKYGNA-----CLGI--LNGSEVGLKDLNIVGDITMQDQMVIYDNER 400
Query: 420 LKLGWSHSNCQDL 432
++GW + C +
Sbjct: 401 GQIGWIRAPCDRI 413
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 149/378 (39%), Gaps = 44/378 (11%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + +F + L D G DL WI CD C A Y + P
Sbjct: 185 GQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKI---VPPR 241
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 193
+ L + C+ +C+ C Y ++Y + +SS G+L D +H+I+ GG
Sbjct: 242 DLLCQELQGNQNYCE---TCKQ----CDYEIEY-ADQSSSMGVLARDDMHMIATNGGREK 293
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
L + GC Q G L A DG++GL IS PS LA G+I N F C
Sbjct: 294 L------DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCIT 347
Query: 253 KDD--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKA 305
++ G +F GD T +G Y G L++ ++ +
Sbjct: 348 REQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQV 407
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
I DSGSS+T+LP E+YE + A + C+K+ + L VK F
Sbjct: 408 IFDSGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWKADFP-VRYLEDVKQFF 466
Query: 366 -PQN-----------NSFVVNNPVFVIYGTQ--VVTGFCLAIQPVDGDIGTIGQNFMTGY 411
P N +F ++ ++I + V G + G +G + G
Sbjct: 467 EPLNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526
Query: 412 RVVFDRENLKLGWSHSNC 429
VV+D + ++GW+ S+C
Sbjct: 527 LVVYDNQRKQIGWADSDC 544
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 151/379 (39%), Gaps = 47/379 (12%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
+G + D G DL W+ CD C C Y + LN + P TS H +
Sbjct: 63 KGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLC--TSLHPITN 120
Query: 146 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS--VI 203
H C++ C Y ++Y ++ SS G+LV D + L L N A+ +
Sbjct: 121 HH-------CKSADDQCQYEIEY-ADHGSSLGVLVNDHVPL------KLTNGSLAAPRIA 166
Query: 204 IGCGMKQSGGYLDGVAPD-GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GCG D P G++GLG GE+S S L+ G++RN C D+ G +FFG
Sbjct: 167 FGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL-SDEGGFLFFG 225
Query: 263 DQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVY 321
D+ P++ + + ++ Y G G + DSGSS+T+ + Y
Sbjct: 226 DEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAY 285
Query: 322 ETIAAEFDRQVNDT--ITSFEGYPWKCCYKSS----------------SQRLPKLPSVKL 363
+I A + + E C+K + + R K + ++
Sbjct: 286 NSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALRFTKTKNAQI 345
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
P N ++ V +G ++ G + + GD+ IG + V++D E ++G
Sbjct: 346 QLPPENYLIITKYGNVCFG--ILNGTEVGL----GDLNIIGDISLKDKMVIYDNERRRIG 399
Query: 424 WSHSNCQDLNDGTKSPLTP 442
W +NC +S P
Sbjct: 400 WFPTNCNKFRKEGQSLCQP 418
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 83/301 (27%), Positives = 135/301 (44%), Gaps = 29/301 (9%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---- 152
D G D+LW+ C+ C C S L LN + P +SSTS ++CS + C+ G
Sbjct: 43 DTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSS 97
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+C + C YT Y + + +SG V D++HL + + ++ + A V+ GC +Q+
Sbjct: 98 DATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQT 156
Query: 212 GGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPAT 268
G A DG+ G G E+SV S L+ G+ FS C D SG + G+
Sbjct: 157 GDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPN 216
Query: 269 QQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYE- 322
TS + + Y + + +T I SS ++ + IVDSG++ +L +E Y+
Sbjct: 217 IVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDP 276
Query: 323 ---TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
I A + V+ ++ CY +S P V L F S ++ ++
Sbjct: 277 FVSAITASIPQSVHTAVSR-----GNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDYL 331
Query: 380 I 380
I
Sbjct: 332 I 332
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 52/373 (13%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
MS+GN D G DL W+ CD CV C+ + Y N+ P
Sbjct: 61 AMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTK---NKLVPCVDQMCA 117
Query: 141 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--V 198
L H C +PKQ C Y + Y + SS G+LV D L L NS V
Sbjct: 118 AL---HGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL------RLANSSIV 167
Query: 199 QASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
+ + GCG Q G V A DG++GLG G +S+ S L + G+ +N C G
Sbjct: 168 RPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGG 227
Query: 258 RIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
+FFGD P ++ + + +A + Y G G L + + DSGSSFT+
Sbjct: 228 FLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYF 287
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQNNS 370
+ Y+ + ++ + + C +KS + +V L F
Sbjct: 288 SAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKK 347
Query: 371 FVVNNP-----VFVIYGTQVVTGFCLAIQPVDG------DIGTIGQNFMTGYRVVFDREN 419
++ P + YG CL I ++G D+ +G M V++D E
Sbjct: 348 ALMEIPPENYLIVTKYGNA-----CLGI--LNGSEVGLKDLNIVGDITMQDQMVIYDNER 400
Query: 420 LKLGWSHSNCQDL 432
++GW + C +
Sbjct: 401 GQIGWIRAPCDRI 413
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 161/388 (41%), Gaps = 56/388 (14%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G + L K L D G DL W CD C CA YN A
Sbjct: 38 GLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNP---------KKA 88
Query: 136 SSTSKHLSCSHRLCDLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 194
HL ++ G+ C + + C Y ++Y + +S+ G+LVED L + L
Sbjct: 89 KVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEY-ADGSSTMGVLVEDTLTV------RL 141
Query: 195 KNS--VQASVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
N +Q IIGCG Q G A DG+IGL ++++P+ LA+ G+I+N C
Sbjct: 142 TNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCL 201
Query: 252 --DKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSC--------LKQ 300
+ G +FFGD+ P+ + + + + + Y +++ G L +
Sbjct: 202 ADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLNNDEDLTR 261
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV------NDTITSFEGYPWK--CCYKSSS 352
++ + DSG+SFT+L + Y ++ + +Q +DT Y W+ ++S +
Sbjct: 262 STSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTLP---YCWRGPSPFQSIT 318
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNPV------FVIYGTQVVTGFCLAIQPVDGD----IGT 402
++ L F N F ++ + ++I TQ CL I G
Sbjct: 319 DVHQYFKTLTLDFGGRNWFATDSTLDLSPQGYLIVSTQ--GNVCLGILDASGASLEVTNI 376
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
IG M GY VV+D ++GW NC
Sbjct: 377 IGDVSMRGYLVVYDNVRDRIGWIRRNCH 404
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 155/378 (41%), Gaps = 70/378 (18%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
TM++GN D G DL W+ CD C C N + L Y P+A ++
Sbjct: 56 TMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTA---NR 102
Query: 141 HLSCSHRLCDLGTSCQ--NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALK 195
+ C++ LC S Q N K P P DY YT++ SS G+L+ D L N
Sbjct: 103 LVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--- 159
Query: 196 NSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 253
++ + GCG Q G V A DG++GLG G +S+ S L + G+ +N C
Sbjct: 160 --IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST 217
Query: 254 DDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 312
+ G +FFGD P+++ + +A Y G T L + + DSGS+
Sbjct: 218 NGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGST 277
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+T+ + Y+ + + ++ ++ C+K + K +F N F
Sbjct: 278 YTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQK-------AFKSVFDVKNEF- 329
Query: 373 VNNPVFVIYGTQ-------------VVT---GFCLAIQPVDG-----DIGTIGQNFMTGY 411
+F+ + + +VT CL I +DG IG M
Sbjct: 330 --KSMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGI--LDGTAAKLSFNVIGDITMQDQ 385
Query: 412 RVVFDRENLKLGWSHSNC 429
V++D E +LGW+ C
Sbjct: 386 MVIYDNEKSQLGWARGAC 403
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 154/366 (42%), Gaps = 52/366 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
+K L D G DL W+ CD C C N + L Y P+ + K + C+
Sbjct: 62 AKPYFLDIDTGSDLTWLQCDAPCQSC--------NKVPHPL--YKPTKN---KLVPCAAS 108
Query: 148 LCDLGTSCQNPKQPC--PYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNS--VQA 200
+C S Q+P + C P DY YT++ SS G+LV D L L+NS V+
Sbjct: 109 ICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTL------PLRNSSSVRP 162
Query: 201 SVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
S GCG Q G +GV DGL+GLG G +S+ S L G+ +N C + G
Sbjct: 163 SFTFGCGYDQQVGK-NGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNGGG 221
Query: 258 RIFFGDQGPATQQST---SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
+FFGD T ++T +++G Y Y G T L + + DSGS++T
Sbjct: 222 FLFFGDNVVPTSRATWVPMVRSTSGNY--YSPGSGTLYFDRRSLGVKPMEVVFDSGSTYT 279
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC------YKSSSQRLPKLPSVKLMFPQN 368
+ + Y+ + ++ ++ C +KS S S+ L F +N
Sbjct: 280 YFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKGQKVFKSVSDVKNDFKSLFLSFVKN 339
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD-----IGTIGQNFMTGYRVVFDRENLKLG 423
+ + ++I CL I +DG IG M +++D E +LG
Sbjct: 340 SVLEIPPENYLIVTKN--GNACLGI--LDGSAAKLTFNIIGDITMQDQLIIYDNERGQLG 395
Query: 424 WSHSNC 429
W +C
Sbjct: 396 WIRGSC 401
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/428 (25%), Positives = 180/428 (42%), Gaps = 48/428 (11%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV----QKQKMKTGPQFQ 82
F + H+F+ + K L K+ + SF + ++L + D+ + G F
Sbjct: 29 FVFNVTHKFAGKEKQLSELKSHD--------SFRHARMLANIDLPLGGDSRADSIGLYFT 80
Query: 83 MLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 141
+ K + D G D+LW+ C C +C P+ L L+ Y SSTSK+
Sbjct: 81 KIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVKTD----LGIPLSLYDSKTSSTSKN 135
Query: 142 LSCSHRLCDLGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
+ C C + K+PC Y + Y + ++S G ++D + L N +
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHV-VYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA 194
Query: 200 ASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
V+ GCG QSG G D A DG++G G S+ S LA G + FS C D + G
Sbjct: 195 QEVVFGCGKNQSGQLGQTDS-AVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG 253
Query: 258 RIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-------IVDS 309
IF G+ ++T + + Y + G++ G S + I+DS
Sbjct: 254 GIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDV--DGDPIDLPPSLASTNGDGGTIIDS 311
Query: 310 GSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
G++ +LP+ +Y E I A+ +++ +F C+ +S P V L F
Sbjct: 312 GTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETF------ACFSFTSNTDKAFPVVNLHF 365
Query: 366 PQNNSFVV--NNPVFVIYGTQVVTGF-CLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLK 421
+ V ++ +F + G+ + DG D+ +G ++ VV+D EN
Sbjct: 366 EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 425
Query: 422 LGWSHSNC 429
+GW+ NC
Sbjct: 426 IGWADHNC 433
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 155/378 (41%), Gaps = 70/378 (18%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
TM++GN D G DL W+ CD C C N + L Y P+A ++
Sbjct: 56 TMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTA---NR 102
Query: 141 HLSCSHRLCDLGTSCQ--NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALK 195
+ C++ LC S Q N K P P DY YT++ SS G+L+ D L N
Sbjct: 103 LVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--- 159
Query: 196 NSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 253
++ + GCG Q G V A DG++GLG G +S+ S L + G+ +N C
Sbjct: 160 --IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST 217
Query: 254 DDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 312
+ G +FFGD P+++ + +A Y G T L + + DSGS+
Sbjct: 218 NGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGST 277
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+T+ + Y+ + + ++ ++ C+K + K +F N F
Sbjct: 278 YTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQK-------AFKSVFDVKNEF- 329
Query: 373 VNNPVFVIYGTQ-------------VVT---GFCLAIQPVDG-----DIGTIGQNFMTGY 411
+F+ + + +VT CL I +DG IG M
Sbjct: 330 --KSMFLSFSSAKNAAMEIPPENYLIVTKNGNVCLGI--LDGTAAKLSFNVIGDITMQDQ 385
Query: 412 RVVFDRENLKLGWSHSNC 429
V++D E +LGW+ C
Sbjct: 386 MVIYDNEKSQLGWARGAC 403
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/428 (25%), Positives = 180/428 (42%), Gaps = 48/428 (11%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV----QKQKMKTGPQFQ 82
F + H+F+ + K L K+ + SF + ++L + D+ + G F
Sbjct: 25 FVFNVTHKFAGKEKQLSELKSHD--------SFRHARMLANIDLPLGGDSRADSIGLYFT 76
Query: 83 MLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 141
+ K + D G D+LW+ C C +C P+ L L+ Y SSTSK+
Sbjct: 77 KIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVKTD----LGIPLSLYDSKTSSTSKN 131
Query: 142 LSCSHRLCDLGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
+ C C + K+PC Y + Y + ++S G ++D + L N +
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHV-VYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA 190
Query: 200 ASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
V+ GCG QSG G D A DG++G G S+ S LA G + FS C D + G
Sbjct: 191 QEVVFGCGKNQSGQLGQTDS-AVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG 249
Query: 258 RIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-------IVDS 309
IF G+ ++T + + Y + G++ G S + I+DS
Sbjct: 250 GIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDV--DGDPIDLPPSLASTNGDGGTIIDS 307
Query: 310 GSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
G++ +LP+ +Y E I A+ +++ +F C+ +S P V L F
Sbjct: 308 GTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETF------ACFSFTSNTDKAFPVVNLHF 361
Query: 366 PQNNSFVV--NNPVFVIYGTQVVTGF-CLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLK 421
+ V ++ +F + G+ + DG D+ +G ++ VV+D EN
Sbjct: 362 EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 421
Query: 422 LGWSHSNC 429
+GW+ NC
Sbjct: 422 IGWADHNC 429
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/464 (22%), Positives = 189/464 (40%), Gaps = 84/464 (18%)
Query: 58 SFEYYQVLLSSDVQKQK-----------------MKTGPQFQMLFPSQGSKTMSLGNDFG 100
S EYY+ L D ++ + TG + ++ + + D G
Sbjct: 9 SSEYYRTLREHDQRRLRRILPEVVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTG 68
Query: 101 CDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--CQN 157
D+ W+ C C C S ++ ++ + P S++ +SC+ C L ++ C
Sbjct: 69 SDVAWVNCVPCTNCKRAS-----NVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSF 123
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQASVIIGCGMKQSGGYL 215
CPY+ Y + +S++G L+ D+L + G N+ S A + GCG Q+G +L
Sbjct: 124 NSMSCPYST-LYGDGSSTAGYLINDVLSFNQVPSG-NSTATSGTARLTFGCGSNQTGTWL 181
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTS 273
DGL+G G E+S+PS L+K + N F+ C D+ SG + G T
Sbjct: 182 T----DGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGLVYTP 237
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSFK------AIVDSGSSFTFLPKEVYETIAAE 327
+ Y ++ + G++ T+F I+DSG++ T+L + Y+ +
Sbjct: 238 IVPKQSHYNVELLNIGVS--GTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYD----Q 291
Query: 328 FDRQVNDTITS------------FEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
F +V D + S EGY P+V L F + ++ +
Sbjct: 292 FQAKVRDCMRSGVLPVAFQFFCTIEGY---------------FPNVTLYFAGGAAMLL-S 335
Query: 376 PVFVIYGTQVVTG---FCLAIQPVDGDIGTI-----GQNFMTGYRVVFDRENLKLGWSHS 427
P +Y + TG +C + G + G N + VV+D N ++GW +
Sbjct: 336 PSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNF 395
Query: 428 NC-QDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVA 470
+C ++++ + + P PS P ++ H+ G + +
Sbjct: 396 DCTKEISVSSTATSMPVTVFPSKAGPPGAFVTTNNAHSNGASFS 439
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 161/371 (43%), Gaps = 47/371 (12%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS--TS 139
L+ Q K L D G DL W+ CD C +C Y + N+ P S
Sbjct: 61 LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLY----QPSNDLVPCKDPLCMS 116
Query: 140 KHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSV 198
H S HR C+NP Q C Y ++Y + SS G+LV D+ L ++ GD +
Sbjct: 117 LHSSMDHR-------CENPDQ-CDYEVEY-ADGGSSLGVLVRDVFPLNLTNGD-----PI 162
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
+ + +GCG Q G DG++GLG G +S+ S L G++RN CF+ G
Sbjct: 163 RPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGY 222
Query: 259 IFFGD--QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
+FFGD P T K+ + G E G S + F + DSGSS+T+
Sbjct: 223 LFFGDGIYDPYRLVWTPMSRDYPKHYSPGFG-ELIFNGRSTGLRNLF-VVFDSGSSYTYF 280
Query: 317 PKEVYETIAAEFDRQV--NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-PQNNSFV- 372
+ Y+ + + +R++ + + C++ + + L V+ F P SF
Sbjct: 281 NAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWR-GRKPIKSLRDVRKYFKPLALSFSS 339
Query: 373 --VNNPVFVI--YGTQVVTGF---CLAIQPVDG-DIG-----TIGQNFMTGYRVVFDREN 419
+ VF I G +++ CL I ++G D+G IG M VV++ E
Sbjct: 340 GGRSKAVFEIPTEGYMIISSMGNVCLGI--LNGTDVGLENSNIIGDISMQDKMVVYNNEK 397
Query: 420 LKLGWSHSNCQ 430
+GW+ +NC
Sbjct: 398 QAIGWATANCD 408
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/421 (23%), Positives = 174/421 (41%), Gaps = 56/421 (13%)
Query: 52 SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS----------KTMSLGN---- 97
++P+ E ++ ++ ++M + + FP +G+ + LG
Sbjct: 30 AFPSNDGVELSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVGLYYTKVKLGTPPRE 89
Query: 98 -----DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
D G D+LW+ C P ++ L LN + P +SSTS +SCS R C G
Sbjct: 90 FYVQIDTGSDVLWVSCGSCNGCPQTSG----LQIQLNYFDPRSSSTSSLISCSDRRCRSG 145
Query: 153 T-----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
SC + C YT Y + + +SG V D++H + L + ASV+ GC
Sbjct: 146 VQTSDASCSSQNNQCTYTFQY-GDGSGTSGYYVSDLMHFAGIFEGTLTTNSSASVVFGCS 204
Query: 208 MKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGD- 263
+ Q+G A DG+ G G +SV S L+ G+ FS C D+S G + G+
Sbjct: 205 ILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNSGGGVLVLGEI 264
Query: 264 ------QGPATQQSTSF------LASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGS 311
P Q + ++ NG+ I+ + +S + T IVDSG+
Sbjct: 265 VEPNIVYSPLVQSQPHYNLNLQSISVNGQ----IVPIAPAVFATSNNRGT----IVDSGT 316
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
+ +L +E Y V ++ S +C ++S + P V L F S
Sbjct: 317 TLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASL 376
Query: 372 VVNNPVFVIYGTQVVTG--FCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
V+ +++ + G +C+ Q + G I +G + V+D ++GW++ +
Sbjct: 377 VLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDKIFVYDLAGQRIGWANYD 436
Query: 429 C 429
C
Sbjct: 437 C 437
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/380 (24%), Positives = 161/380 (42%), Gaps = 41/380 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C N D + P SST + C ++ +C
Sbjct: 106 DSGSTVTYVPCASCEQCG-------NHQD---PRFQPDLSSTYSPVKC-----NVDCTCD 150
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K C Y Y E +SSSG+L EDI+ G ++ LK + GC ++G
Sbjct: 151 SDKNQCTYERQY-AEMSSSSGVLGEDIVSF--GTESELKPQ---RAVFGCENSETGDLFS 204
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG++GLG G++S+ L G+I +SFSMC+ D G + P T
Sbjct: 205 QHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPGMIYTH 263
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAE 327
A Y Y I ++ + L+ ++DSG+++ +LP++ +
Sbjct: 264 SNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQAFVAFKDA 321
Query: 328 FDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIY 381
QV+ I + C+ + + + +L P V ++F ++ ++
Sbjct: 322 VSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFR 381
Query: 382 GTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
++V +CL + D T +G + V +DR N K+G+ +NC +L + +S
Sbjct: 382 HSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSELWERLQSGG 441
Query: 441 TPGPGTPSNPLPANQEQSSP 460
P P ++P P +P
Sbjct: 442 APSPAPSNDPGPQADLSPAP 461
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 164/385 (42%), Gaps = 46/385 (11%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 142
LF + +L D G + ++PC C +C + P +SST K +
Sbjct: 92 LFIGTPPQEFALIVDTGSTVTYVPCSTCEQCG----------KHQDPRFQPESSSTYKPM 141
Query: 143 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C + +C + + C Y Y E +SSSGLL ED+L G ++ L
Sbjct: 142 QC-----NPSCNCDDEGKQCTYERRY-AEMSSSSGLLAEDVLSF--GNESEL---TPQRA 190
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIF 260
I GC ++G A DG++GLG G +SV L ++ NSFS+C+ D G +
Sbjct: 191 IFGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMV 249
Query: 261 FGDQGPATQQSTSFLASNGKYIT--YIIGVETCCIGSSCLK------QTSFKAIVDSGSS 312
G+ P A + Y + Y I ++ + LK ++DSG++
Sbjct: 250 LGNIPPPPDM---VFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTT 306
Query: 313 FTFLPKEVY----ETIAAE--FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP 366
+ +LP+E + + I E F +Q++ S+ + + SQ P V ++F
Sbjct: 307 YAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFG 366
Query: 367 QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWS 425
++ ++ T+V +CL I D T +G + V +DR+N K+G+
Sbjct: 367 NGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIGFW 426
Query: 426 HSNCQDLNDGTKSPLTPGPGTPSNP 450
+NC +L +S PG P+ P
Sbjct: 427 KTNCSELWKRLQS---QSPGIPAPP 448
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 150/379 (39%), Gaps = 47/379 (12%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
+G + D G DL W+ CD C C Y + LN + P TS H +
Sbjct: 63 KGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLC--TSLHPITN 120
Query: 146 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS--VI 203
H C++ C Y ++Y ++ SS G+LV D + L L N A+ +
Sbjct: 121 HH-------CKSADDQCQYEIEY-ADHGSSLGVLVNDHVPL------KLTNGSLAAPRIA 166
Query: 204 IGCGMKQSGGYLDGVAPD-GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GCG D P G++GLG GE+S S L+ G++RN C D+ G +FFG
Sbjct: 167 FGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL-SDEGGFLFFG 225
Query: 263 DQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVY 321
D+ P++ + + ++ Y G + DSGSS+T+ + Y
Sbjct: 226 DEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAY 285
Query: 322 ETIAAEFDRQVNDT--ITSFEGYPWKCCYKSS----------------SQRLPKLPSVKL 363
+I A + + E C+K + + R K + ++
Sbjct: 286 NSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALRFTKTKNAQI 345
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
P N ++ V +G ++ G + + GD+ IG + V++D E ++G
Sbjct: 346 QLPPENYLIITKYGNVCFG--ILNGTEVGL----GDLNIIGDISLKDKMVIYDNERRRIG 399
Query: 424 WSHSNCQDLNDGTKSPLTP 442
W +NC +S P
Sbjct: 400 WFPTNCNKFRKEGQSLCQP 418
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/400 (24%), Positives = 158/400 (39%), Gaps = 49/400 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T L D G ++PC C RC + YY+ DR + S C +
Sbjct: 49 QTYDLIVDTGSARTYVPCKGCARCGEHAHGYYD-YDRSMEFERLDCGEASDATLCEETM- 106
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
+CQ+ + C Y + Y E +SS G +V D + L G ++ A + GC
Sbjct: 107 --KGTCQSDGR-CSYVVSY-AEGSSSRGYVVRDRVRLGEG-------TLSAMLAFGCEEA 155
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-------GRIFFG 262
++ + A DGL G G G +V + LA AGLI N FS C + + GR FG
Sbjct: 156 ETNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFDFG 214
Query: 263 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-TSFKAIVDSGSSFTFLPKEVY 321
PA + T +A + + + +G S ++ S+ +DSG++FTF+P+ V+
Sbjct: 215 ADAPALAR-TPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVW 273
Query: 322 ETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPK----------LPSVKLMFPQ 367
+ D Q P CY S+ + P + + +
Sbjct: 274 VSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAYEG 333
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
S + ++ FC+ I + +GQ M + FD N ++G + +
Sbjct: 334 GVSLTLGPENYLFAHETNSAAFCVGIFANPNNQILLGQITMRDTLMEFDVANSRVGMAPA 393
Query: 428 NCQDLNDG--TKSPLTPGPGTPSNPLPANQEQSSPGGHAV 465
NC+ L + SP P P+N S GG A+
Sbjct: 394 NCRRLREKYTHDSP---------EPTPSNSSTPSGGGDAL 424
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 92/380 (24%), Positives = 161/380 (42%), Gaps = 41/380 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C N D + P SST + C ++ +C
Sbjct: 106 DSGSTVTYVPCASCEQCG-------NHQD---PRFQPDLSSTYSPVKC-----NVDCTCD 150
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K C Y Y E +SSSG+L EDI+ G ++ LK + GC ++G
Sbjct: 151 SDKNQCTYERQY-AEMSSSSGVLGEDIVSF--GTESELKPQ---RAVFGCENSETGDLFS 204
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG++GLG G++S+ L G+I +SFSMC+ D G + P T
Sbjct: 205 QHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPGMIYTH 263
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAE 327
A Y Y I ++ + L+ ++DSG+++ +LP++ +
Sbjct: 264 SNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQAFVAFKDA 321
Query: 328 FDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIY 381
QV+ I + C+ + + + +L P V ++F ++ ++
Sbjct: 322 VSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFR 381
Query: 382 GTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
++V +CL + D T +G + V +DR N K+G+ +NC +L + +S
Sbjct: 382 HSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSELWERLQSGG 441
Query: 441 TPGPGTPSNPLPANQEQSSP 460
P P ++P P +P
Sbjct: 442 APSPAPSNDPGPQADLSPAP 461
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 153/366 (41%), Gaps = 45/366 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC- 149
+ ++L D G DL+W +CAP + D+DL P+ASST L C C
Sbjct: 95 RPVALTLDTGSDLVW-----TQCAPCR----DCFDQDLPVLDPAASSTYAALPCGAARCR 145
Query: 150 -----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
G + C Y Y ++ + + + SGG ++ + +
Sbjct: 146 ALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRR--LTF 203
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFF 261
GCG G + G+ G G G S+PS L SFS CF + S +
Sbjct: 204 GCGHLNKGVFQSN--ETGIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFESKSSLVTL 256
Query: 262 GDQGPATQ--------QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA-IVDSG 310
G A ++T L + + Y + ++ +G + L +T F++ I+DSG
Sbjct: 257 GGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDSG 316
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---SSSQRLPKLPSVKLMFPQ 367
+S T LP+EVYE + AEF QV + EG C+ ++ R P +PS+ L
Sbjct: 317 ASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHLEG 376
Query: 368 NN-SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
+ +N VF G +V+ C+ + G+ IG VV+D EN +L ++
Sbjct: 377 ADWELPRSNYVFEDLGARVM---CIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRLSFAP 433
Query: 427 SNCQDL 432
+ C L
Sbjct: 434 ARCDRL 439
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 154/370 (41%), Gaps = 57/370 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCA-PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
K L D G DL W+ CD C C PL Y + N P ASS + +
Sbjct: 79 KAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLY-----KPKNNRVPCASSLCQAIQ---- 129
Query: 148 LCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
+C P + C Y ++Y + SS G+L+ D L + L Q + GCG
Sbjct: 130 ----NNNCDIPTEQCDYEVEY-ADLGSSLGVLLSDYFPLRLNNGSLL----QPRIAFGCG 180
Query: 208 MKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 264
Q YL +P G++GLG G+ S+ S L G+ +N CF + G +FFGD
Sbjct: 181 YDQK--YLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDH 238
Query: 265 --GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYE 322
P+ T L S+ + Y G G + I DSGSS+T+ +VY+
Sbjct: 239 LLPPSGITWTPMLRSSSDTL-YSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQ 297
Query: 323 TIAAEFDRQVNDTITSFEGYPWK---------CCYKSSSQRLPKLPSVKLMF-PQNNSFV 372
+I +N G P K C+K +++ + + +K F P +F+
Sbjct: 298 SI-------LNLVRKDLSGMPLKDAPEEKALAVCWK-TAKPIKSILDIKSFFKPLTINFI 349
Query: 373 VNNPVFVIYGTQ---VVT---GFCLAI----QPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
V + + ++T CL I + G++ IG FM VV+D E ++
Sbjct: 350 KAKNVQLQLAPEDYLIITKDGNVCLGILNGGEQGLGNLNVIGDIFMQDRVVVYDNERQQI 409
Query: 423 GWSHSNCQDL 432
GW +NC L
Sbjct: 410 GWFPTNCNRL 419
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 167/376 (44%), Gaps = 43/376 (11%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SK + D G D++W+ C R P ++S L +L Y S+T K +SC + C
Sbjct: 97 SKDYYVQVDTGSDIVWVNCIQCRECPRTSS----LGMELTPYDLEESTTGKLVSCDEQFC 152
Query: 150 ---DLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
+ G + C CPY + Y + +S++G V+D + + + S+
Sbjct: 153 LEVNGGPLSGC-TTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKF 210
Query: 205 GCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-F 261
GCG +QSG G A DG++G G S+ S LA ++ F+ C D + G IF
Sbjct: 211 GCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGGIFAM 270
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--FKA------IVDSGSSF 313
G T + + Y + GV+ +G L ++ F+A I+DSG++
Sbjct: 271 GHVVQPKVNMTPLVPNQPHYNVNMTGVQ---VGHIILNISADVFEAGDRKGTIIDSGTTL 327
Query: 314 TFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+LP+ +YE + A+ +Q N + + G +K C++ S + P V F +N+ +
Sbjct: 328 AYLPELIYEPLVAKILSQQHNLEVQTIHG-EYK-CFQYSERVDDGFPPVIFHF-ENSLLL 384
Query: 373 VNNPVFVIYGTQVVTGFCL-----AIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
P ++ Q +C+ +Q D ++ G ++ V++D EN +GW+
Sbjct: 385 KVYPHEYLF--QYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTE 442
Query: 427 SNC------QDLNDGT 436
NC QD GT
Sbjct: 443 YNCSSSIKVQDEQTGT 458
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 151/355 (42%), Gaps = 37/355 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--- 153
D G D++W+ C C C S SL +L Y S T K +SC C
Sbjct: 116 DTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESLTGKLVSCDQDFCYAINGGP 170
Query: 154 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
S C YT + Y + +SS G V DI+ + S SVI GC QSG
Sbjct: 171 PSYCIANMSCSYT-EIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSG 229
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST 272
A DG++G G S+ S LA +G +R F+ C D + G IF + +T
Sbjct: 230 DLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNT 289
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLKQTS--FKA------IVDSGSSFTFLPKEVYETI 324
+ L N + Y + ++ +G L + F I+DSG++ +LP+ VY+ +
Sbjct: 290 TPLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQL 347
Query: 325 AAEF-----DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP--- 376
++ D +V+ F C++ S P+V F +N+ ++ +P
Sbjct: 348 LSKIFSWQSDLKVHTIHDQFT------CFQYSESLDDGFPAVTFHF-ENSLYLKVHPHEY 400
Query: 377 VFVIYGTQVVTGFCLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+F G + +Q D +I +G ++ V++D EN +GW+ NC+
Sbjct: 401 LFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNCK 455
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 159/371 (42%), Gaps = 33/371 (8%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SK + D G D++W+ C R P ++S L +L Y+ S + K + C C
Sbjct: 96 SKDYYVQVDTGSDIMWVNCIQCRECPRTSS----LGMELTLYNIKDSVSGKLVPCDEEFC 151
Query: 150 DLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
S CPY ++ Y + +S++G V+D++ + S SVI G
Sbjct: 152 YEVNGGPLSGCTANMSCPY-LEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFG 210
Query: 206 CGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
CG +QSG G A DG++G G S+ S LA ++ F+ C D + G IF
Sbjct: 211 CGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGGIFAIG 270
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTF 315
+ + + L N + Y + + +G L + + AI+DSG++ +
Sbjct: 271 HVVQPKVNMTPLIPNQPH--YNVNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTTLAY 328
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
LP+ VYE + ++ Q D + C++ S P+V F +N+ F+ +
Sbjct: 329 LPEIVYEPLVSKIISQQPDLKVHIVRDEYT-CFQYSGSVDDGFPNVTFHF-ENSVFLKVH 386
Query: 376 P---VFVIYGTQVVTGFCLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC-- 429
P +F G + +Q D ++ +G ++ V++D EN +GW+ NC
Sbjct: 387 PHEYLFPFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCSS 446
Query: 430 ----QDLNDGT 436
QD GT
Sbjct: 447 SIKVQDERTGT 457
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 152/371 (40%), Gaps = 61/371 (16%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
+K L D G DL W+ CD C C N + L Y P+A+ + + C++
Sbjct: 5 AKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTAN---RLVPCANA 51
Query: 148 LCDLGTSCQ--NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
LC S Q N K P P DY YT++ SS G+L+ D L N ++ +
Sbjct: 52 LCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN-----IRPGL 106
Query: 203 IIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
GCG Q G V A DG++GLG G +S+ S L + G+ +N C + G +F
Sbjct: 107 TFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLF 166
Query: 261 FGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 319
FGD P+++ + +A Y G T L + + DSGS++T+ +
Sbjct: 167 FGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQ 226
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y+ + + ++ ++ C+K + K +F N F +F+
Sbjct: 227 PYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQK-------AFKSVFDVKNEF---KSMFL 276
Query: 380 IYGTQ-------------VVT---GFCLAIQPVDG-----DIGTIGQNFMTGYRVVFDRE 418
+ + +VT CL I +DG IG M V++D E
Sbjct: 277 SFASAKNAAMEIPPENYLIVTKNGNVCLGI--LDGTAAKLSFNVIGDITMQDQMVIYDNE 334
Query: 419 NLKLGWSHSNC 429
+LGW+ C
Sbjct: 335 KSQLGWARGAC 345
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 136/284 (47%), Gaps = 37/284 (13%)
Query: 169 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLG 227
Y + +S++G LV+D++HL N S ++I GCG KQSG + A DG++G G
Sbjct: 2 YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFG 61
Query: 228 LGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYII 286
S S LA G ++ SF+ C D ++ G IF G+ ++T L+ + Y +
Sbjct: 62 QSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLN 121
Query: 287 GVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPKEVY-----ETIAAEFDRQVN 333
+E +G+S L+ +S I+DSG++ +LP VY E +A+ + ++
Sbjct: 122 AIE---VGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLH 178
Query: 334 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
SF + + + +L + P+V F ++ S V P ++ + T +C
Sbjct: 179 TVQESFTCFHY-------TDKLDRFPTVTFQFDKSVSLAV-YPREYLFQVREDT-WCFGW 229
Query: 394 QPVDGDIGT--------IGQNFMTGYRVVFDRENLKLGWSHSNC 429
Q +G + T +G ++ VV+D EN +GW++ NC
Sbjct: 230 Q--NGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 271
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 91.7 bits (226), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 88/364 (24%), Positives = 158/364 (43%), Gaps = 37/364 (10%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 142
L+ S+ +L D G + ++PC C +C N D + P SST +
Sbjct: 95 LYIGTPSQEFALIVDSGSTVTYVPCATCEQCG-------NHQD---PRFQPDLSSTYSPV 144
Query: 143 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C ++ +C N + C Y Y E +SSSG+L EDI+ G ++ LK
Sbjct: 145 KC-----NVDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSF--GKESELK---PQRA 193
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
+ GC ++G A DG++GLG G++S+ L + G+I +SFS+C+ D G
Sbjct: 194 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 252
Query: 263 DQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTF 315
G F SN + Y I ++ + L+ + ++DSG+++ +
Sbjct: 253 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 312
Query: 316 LPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNN 369
LP++ + +VN I + C+ + + + +L P V ++F
Sbjct: 313 LPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 372
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSN 428
++ ++ ++V +CL + D T +G + V +DR N K+G+ +N
Sbjct: 373 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTN 432
Query: 429 CQDL 432
C +L
Sbjct: 433 CSEL 436
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 151/355 (42%), Gaps = 39/355 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++W+ C C C S SL +L Y S T K +SC C +
Sbjct: 116 DTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESLTGKLVSCDQDFC-YAINGG 169
Query: 157 NP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
P C YT + Y + +SS G V DI+ + S SVI GC QS
Sbjct: 170 PPSYCIANMSCSYT-EIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQS 228
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
G A DG++G G S+ S LA +G +R F+ C D + G IF + +
Sbjct: 229 GDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVN 288
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLKQTS--FKA------IVDSGSSFTFLPKEVYET 323
T+ L N + Y + ++ +G L + F I+DSG++ +LP+ VY+
Sbjct: 289 TTPLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQ 346
Query: 324 IAAEF-----DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP-- 376
+ ++ D +V+ F C++ S P+V F +N+ ++ +P
Sbjct: 347 LLSKIFSWQSDLKVHTIHDQFT------CFQYSESLDDGFPAVTFHF-ENSLYLKVHPHE 399
Query: 377 -VFVIYGTQVVTGFCLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+F G + +Q D +I +G ++ V++D EN +GW+ NC
Sbjct: 400 YLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 151/357 (42%), Gaps = 39/357 (10%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----DLG 152
D G D+LWI C+ P S+ L +LN + SST+ + CS +C
Sbjct: 102 DTGSDILWINCNTCSNCPKSSG----LGIELNFFDTVGSSTAALVPCSDPMCASAIQGAA 157
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH--LISGGDNALKNSVQASVIIGCGMKQ 210
C C YT Y + + +SG+ V D ++ +I G + A+++ GC Q
Sbjct: 158 AQCSPQVNQCSYTFQY-EDGSGTSGVYVSDAMYFDMILGQSTPANVASSATIVFGCSTYQ 216
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGD---- 263
SG A DG++G G GE+SV S L+ G+ FS C D + G + G+
Sbjct: 217 SGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGNGGGILVLGEILEP 276
Query: 264 ---------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
P + +A NG+ ++ + +S + T I+DSG++ +
Sbjct: 277 SIVYSPLVPSQPHYNLNLQSIAVNGQ----VLSINPAVFATSDKRGT----IIDSGTTLS 328
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
+L +E Y+ + D V+ TSF + CY + P+V F S +
Sbjct: 329 YLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-CYLVLTSIDDSFPTVSFNFEGGASMDLK 387
Query: 375 NPVFVI-YGTQV-VTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+++ G Q +C+ Q V + +G + VV+D ++GW++ +C
Sbjct: 388 PSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQQIGWTNYDC 444
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 108/424 (25%), Positives = 179/424 (42%), Gaps = 45/424 (10%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV----QKQKMKTGPQFQ 82
F K H+F+ K +N + + + + ++L S D+ + G F
Sbjct: 25 FVFKAQHKFA--------GKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFT 76
Query: 83 MLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 141
+ K + D G D+LWI C C +C + +L+ L+ + +ASSTSK
Sbjct: 77 KIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKT-----NLNFRLSLFDMNASSTSKK 131
Query: 142 LSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 198
+ C C SCQ P C Y + Y E+TS G + D+L L + +
Sbjct: 132 VGCDDDFCSFISQSDSCQ-PALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKTGPL 189
Query: 199 QASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
V+ GCG QSG +G A DG++G G SV S LA G + FS C D G
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 258 RIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC-LKQTSFK---AIVDSGSS 312
IF G ++T + + Y ++G++ G+S L ++ + IVDSG++
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV--DGTSLDLPRSIVRNGGTIVDSGTT 307
Query: 313 FTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
+ PK +Y ETI A +++ +F+ C+ S+ P V F +
Sbjct: 308 LAYFPKVLYDSLIETILARQPVKLHIVEETFQ------CFSFSTNVDEAFPPVSFEFEDS 361
Query: 369 NSFVV--NNPVFVIYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGW 424
V ++ +F + G+ D ++ +G ++ VV+D +N +GW
Sbjct: 362 VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421
Query: 425 SHSN 428
+ N
Sbjct: 422 ADHN 425
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 155/368 (42%), Gaps = 60/368 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGT- 153
D G D+ W+ C C C ++ + S+ L Y PS SST LSC C LG+
Sbjct: 55 DTGSDVTWLNCAPCTSC--VTETQLPSIK--LTTYDPSRSSTDGALSCRDSNCGAALGSN 110
Query: 154 --SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
SC + C Y+ Y + +S+ G ++D++ +N N ASV GCG QS
Sbjct: 111 EVSCTSAGY-CAYSTTY-GDGSSTQGYFIQDVMTFQEIHNNTQVNGT-ASVYFGCGTTQS 167
Query: 212 GGYL-DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPAT 268
G L A DGLIG G +S+PS LA G + N F+ C D+ G I G
Sbjct: 168 GNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGGTIVIGSVSEPN 227
Query: 269 QQSTSFLASNGKYITYIIGVETCCI-GSSCLKQTSFK--------AIVDSGSSFTFLPKE 319
T ++ N Y +G++ + G + SF I+DSG++ +L
Sbjct: 228 ISYTPIVSRN----HYAVGMQNIAVNGRNVTTPASFDTTSTSAGGVIMDSGTTLAYLVDP 283
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL--------PKLPSVKLMFPQNNSF 371
Y Q + +++FE + S SQ L P+VKL F +
Sbjct: 284 AYT--------QFVNAVSTFE----SSMFSSHSQCLQLAWCSLQADFPTVKLFF--DAGA 329
Query: 372 VVN-NPVFVIYGTQVVTG---FCLAIQPVDGDIG-----TIGQNFMTGYRVVFDRENLKL 422
V+N P +Y + G +C+ Q G +G + + VV+D +N +
Sbjct: 330 VMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRVV 389
Query: 423 GWSHSNCQ 430
GW +C+
Sbjct: 390 GWKSFDCK 397
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/356 (23%), Positives = 157/356 (44%), Gaps = 25/356 (7%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++ ++ D G D+LW+ C P S+ L +LN + + SS+++ L C+ +C
Sbjct: 94 AREFNVQIDTGSDILWVTCSPCDGCPDSSG----LGIELNLFDTTKSSSARVLPCTDPIC 149
Query: 150 DLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVII 204
++ C C Y+ +Y + + +SG V D +H I G++ + NS A+++
Sbjct: 150 AAVSTTTDQCLTQTDHCSYSF-HYRDRSGTSGFYVTDSMHFDILLGESTIANS-SATIVF 207
Query: 205 GCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFF 261
GC + Q G A DG+ G G GE SV S L+ G+ FS C ++ G +
Sbjct: 208 GCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVL 267
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF------KAIVDSGSSFTF 315
G+ + + + S Y + + G T F + I+DSG++ +
Sbjct: 268 GEILEPSIVYSPLIPSQPHYTLKLQSIALS--GQLFPNPTMFPISNAGETIIDSGTTLAY 325
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
L +EVY+ I + V+ + T + C++ S P ++ F S VV
Sbjct: 326 LVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIFPVLRFNFEGIASMVVTP 384
Query: 376 PVFVIYGTQVV--TGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ + + V +C+ Q + + +G + +V+D ++GW++ +C
Sbjct: 385 EEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDLARQRIGWANYDC 440
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 149/361 (41%), Gaps = 45/361 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T +L D G DLLW+ C C+ C S L + Y AS++S + CS C
Sbjct: 47 RTYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASASSSKVPCSDPSC 101
Query: 150 DLGTSCQ----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
L T N + C Y+ Y + + + G LVED+LH + + A+VI G
Sbjct: 102 TLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMV--------NATATVIFG 152
Query: 206 CGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFG 262
CG KQSG A DG+IG G ++S S LAK G N F+ C D + G + G
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLG 212
Query: 263 DQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSFKA-IVDSGSSFTFLP 317
+ Q T + Y + + I + I DSG++ +LP
Sbjct: 213 NVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAYLP 272
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNP 376
E Y+ F + V+ + P+ C S+ + KL P+V L F + S +
Sbjct: 273 DEAYQA----FTQAVSLVVA-----PFLLCDTRLSRFIYKLFPNVVLYF-EGASMTLTPA 322
Query: 377 VFVIYGTQVVTG--FCLAIQPV-----DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++I +C+ Q + + G + VV+D E ++GW +C
Sbjct: 323 EYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
Query: 430 Q 430
+
Sbjct: 383 K 383
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 156/372 (41%), Gaps = 59/372 (15%)
Query: 98 DFGCDLLWIPC-DCVR-CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT-- 153
D G + ++PC C R C P + P++SS+S + C C G
Sbjct: 80 DTGSTITYVPCASCGRNCGPHHKD---------AAFDPASSSSSAVIGCDSDKCICGRPP 130
Query: 154 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C + K+ C Y Y E +SS+GLLV D L L G V+ GC K++G
Sbjct: 131 CGC-SEKRECTYQRTY-AEQSSSAGLLVSDQLQLRDGA---------VEVVFGCETKETG 179
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DDSGRIFFGDQGPATQ-- 269
+ A DG++GLG E+S+ + LA +G+I + F++CF + G + GD A
Sbjct: 180 EIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGALMLGDVDAAEYDV 238
Query: 270 --QSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVY 321
Q T+ L+S Y + +E +G L + + ++DSG++FT+LP E +
Sbjct: 239 ALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGTVLDSGTTFTYLPSEAF 298
Query: 322 ETI-----AAEFDRQVNDTI--------------TSFEGYPWKCCYKSSSQRLPKLPSVK 362
+ A + +N F G P + S+ P +
Sbjct: 299 QLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAP-HAGHADQSKLEKVFPVFE 357
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLK 421
L F ++ T + +CL + +G GT+ G V +DR N +
Sbjct: 358 LQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFD-NGASGTLLGGISFRNILVQYDRRNRR 416
Query: 422 LGWSHSNCQDLN 433
+G+ ++CQ++
Sbjct: 417 VGFGAASCQEIG 428
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/359 (23%), Positives = 158/359 (44%), Gaps = 28/359 (7%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++ ++ D G D+LW+ C P S+ L +LN + + SS+++ L C+ +C
Sbjct: 94 AREFNVQIDTGSDILWVTCSPCDGCPDSSG----LGIELNLFDTTKSSSARVLPCTDPIC 149
Query: 150 DLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVII 204
++ C C Y+ +Y + + +SG V D +H I G++ + NS A+++
Sbjct: 150 AAVSTTTDQCLTQTDHCSYSF-HYRDRSGTSGFYVTDSMHFDILLGESTIANS-SATIVF 207
Query: 205 GCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFF 261
GC + Q G A DG+ G G GE SV S L+ G+ FS C ++ G +
Sbjct: 208 GCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVL 267
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF------KAIVDSGSSFTF 315
G+ + + + S Y + + G T F + I+DSG++ +
Sbjct: 268 GEILEPSIVYSPLIPSQPHYTLKLQSIALS--GQLFPNPTMFPISNAGETIIDSGTTLAY 325
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
L +EVY+ I + V+ + T + C++ S P ++ F S VV
Sbjct: 326 LVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIFPVLRFNFEGIASMVVTP 384
Query: 376 PVFVIYGTQV-----VTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ + + V + +C+ Q + + +G + +V+D ++GW++ +C
Sbjct: 385 EEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKIIVYDLAQQRIGWANYDC 443
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/398 (23%), Positives = 166/398 (41%), Gaps = 42/398 (10%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 142
LF + +L D G + ++PC C +C + P SST + +
Sbjct: 81 LFIGTPPQEFALIVDTGSTVTYVPCSSCEQCG----------KHQDPRFQPDLSSTYRPV 130
Query: 143 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C + +C + + C Y Y E +SSSG++ ED++ G ++ LK
Sbjct: 131 KC-----NPSCNCDDEGKQCTYERRY-AEMSSSSGVIAEDVVSF--GNESELK---PQRA 179
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIF 260
+ GC ++G A DG++GLG G +SV L G+I +SFS+C+ D G +
Sbjct: 180 VFGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMV 238
Query: 261 FGDQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSF 313
G P + F SN + Y I ++ + LK ++DSG+++
Sbjct: 239 LGQISPPP--NMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTY 296
Query: 314 TFLPKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQ 367
+ P+ + + +++ I + C+ + + + L P V ++F
Sbjct: 297 AYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGS 356
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSH 426
++ ++ T+V +CL I D+ T +G + V +DREN K+G+
Sbjct: 357 GQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWK 416
Query: 427 SNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHA 464
+NC +L + P P +P +N+ Q P A
Sbjct: 417 TNCSELWKSLQVPGVPASAPVLSP-SSNRSQEMPPAQA 453
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 155/368 (42%), Gaps = 53/368 (14%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T +L D G DLLW+ C C+ C S L + Y AS++S + CS C
Sbjct: 47 RTYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASASSSKVPCSDPSC 101
Query: 150 DLGTSCQ----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
L T N + C Y+ Y + + + G LVED+LH + + A+VI G
Sbjct: 102 TLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMV--------NATATVIFG 152
Query: 206 CGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFG 262
CG KQSG A DG+IG G ++S S LAK G N F+ C D + G + G
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLG 212
Query: 263 DQGPATQQSTSFLASNGKYITYI---------IGVETCCIGSSCLKQTSFKAIVDSGSSF 313
+ Q T + Y + + ++ + ++ T F DSG++
Sbjct: 213 NVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIF----DSGTTL 268
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFV 372
+LP E Y+ F + V+ + P+ C S+ + KL P+V L F + S
Sbjct: 269 AYLPDEAYQA----FTQAVSLVVA-----PFLLCDTRLSRFIYKLFPNVVLYF-EGASMT 318
Query: 373 VNNPVFVIYGTQVVTG--FCLAIQPV-----DGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ ++I +C+ Q + + G + VV+D E ++GW
Sbjct: 319 LTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWR 378
Query: 426 HSNCQDLN 433
+C+ L+
Sbjct: 379 PFDCKFLS 386
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 155/356 (43%), Gaps = 49/356 (13%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC +CV+C N D + P SST + + C + +C
Sbjct: 107 DTGSTVTYVPCSNCVQCG-------NHQDP---RFQPELSSTYQPVKC-----NADCNCD 151
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y Y E ++SSG+L ED++ G ++ L V + GC +SG
Sbjct: 152 ENGVQCTYERRY-AEMSTSSGVLAEDVMSF--GKESEL---VPQRAVFGCETMESGDLYT 205
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG++GLG G +SV L G++ NSFS+C+ D G + G P +
Sbjct: 206 QRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSH 264
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETI--- 324
S Y Y I ++ + LK + AI+DSG+++ + P++ Y
Sbjct: 265 SDPSRSPY--YNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDA 322
Query: 325 ---AAEFDRQVNDTITSFEGYPWKCCYKSSSQ---RLPKL-PSVKLMFPQNNSFVVNNPV 377
F +Q++ +F+ C+ + + LPK+ P V ++F ++
Sbjct: 323 IMKKISFLKQISGPDPNFK----DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPEN 378
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
++ T+V +CL I D T +G + V ++REN +G+ +NC +L
Sbjct: 379 YLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 155/369 (42%), Gaps = 59/369 (15%)
Query: 89 GSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G + M++ D G DL W+ C C RC YN D N PS S + + + CS
Sbjct: 142 GGRKMTVIVDTGSDLSWVQCQPCKRC-------YNQQDPVFN---PSTSPSYRTVLCSSP 191
Query: 148 LC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
C +LG NP C Y ++Y + + L E HL G A+ N
Sbjct: 192 TCQSLQSATGNLGVCGSNPPS-CNYVVNYGDGSYTRGELGTE---HLDLGNSTAVNN--- 244
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDS 256
I GCG + + G G + GL+GLG +S+ S + + FS C + + S
Sbjct: 245 --FIFGCG-RNNQGLFGGAS--GLVGLGRSSLSLIS--QTSAMFGGVFSYCLPITETEAS 297
Query: 257 GRIFFGDQGPATQQST----SFLASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDS 309
G + G + +T + + N + Y + + +GS ++ SF ++DS
Sbjct: 298 GSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDS 357
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVK 362
G+ T LP +Y+ + EF +Q F G+P C+ S + ++P++K
Sbjct: 358 GTVITRLPPSIYQALKDEFVKQ-------FSGFPSAPAFMILDTCFNLSGYQEVEIPNIK 410
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENL 420
+ F N V+ + + CLAI + + ++G IG RV++D +
Sbjct: 411 MHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGS 470
Query: 421 KLGWSHSNC 429
LG++ C
Sbjct: 471 MLGFAAEAC 479
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 154/383 (40%), Gaps = 56/383 (14%)
Query: 84 LFPS-QGSKTMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEY 131
+FP Q +M +GN D G DL WI CD C CA Y ++
Sbjct: 153 VFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNV--- 209
Query: 132 SPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 191
P S + L + D TS Q C Y + Y + +SS G+L D + LI+ D
Sbjct: 210 VPPRDSYCQELQGNQNYGD--TSKQ-----CDYEITY-ADRSSSMGILARDNMQLITA-D 260
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
+N + GCG Q G L A DG++GL IS+P+ LA G+I N F C
Sbjct: 261 GEREN---LDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHC 317
Query: 251 FDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS-----F 303
D S G +F GD T NG Y V+ G L
Sbjct: 318 IAADPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLT 377
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 363
+ I DSGSS+T+LP + Y + A + C K + + + VK
Sbjct: 378 QVIFDSGSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPFCMKPNFP-VRSMDDVKH 436
Query: 364 MFPQNNSFVVNNPVFVIYGTQVV-----------TGFCLAIQPVDG-DIG-----TIGQN 406
+F + S V +F++ T V+ CL + +DG +IG IG
Sbjct: 437 LF-KPLSLVFKKRLFILPRTFVIPPEDYLIISDKNNICLGV--LDGTEIGHDSAIVIGDV 493
Query: 407 FMTGYRVVFDRENLKLGWSHSNC 429
+ G VV++ + ++GW S+C
Sbjct: 494 SLRGKLVVYNNDEKQIGWVQSDC 516
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 154/383 (40%), Gaps = 56/383 (14%)
Query: 84 LFPS-QGSKTMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEY 131
+FP Q +M +GN D G DL WI CD C CA Y ++
Sbjct: 153 VFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNV--- 209
Query: 132 SPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 191
P S + L + D TS Q C Y + Y + +SS G+L D + LI+ D
Sbjct: 210 VPPRDSYCQELQGNQNYGD--TSKQ-----CDYEITY-ADRSSSMGILARDNMQLITA-D 260
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
+N + GCG Q G L A DG++GL IS+P+ LA G+I N F C
Sbjct: 261 GEREN---LDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHC 317
Query: 251 FDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS-----F 303
D S G +F GD T NG Y V+ G L
Sbjct: 318 IAADPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLT 377
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 363
+ I DSGSS+T+LP + Y + A + C K + + + VK
Sbjct: 378 QVIFDSGSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPFCMKPNFP-VRSMDDVKH 436
Query: 364 MFPQNNSFVVNNPVFVIYGTQVV-----------TGFCLAIQPVDG-DIG-----TIGQN 406
+F + S V +F++ T V+ CL + +DG +IG IG
Sbjct: 437 LF-KPLSLVFKKRLFILPRTFVIPPEDYLIISDKNNICLGV--LDGTEIGHDSAIVIGDV 493
Query: 407 FMTGYRVVFDRENLKLGWSHSNC 429
+ G VV++ + ++GW S+C
Sbjct: 494 SLRGKLVVYNNDEKQIGWVQSDC 516
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 155/356 (43%), Gaps = 49/356 (13%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC +CV+C N D + P SST + + C + +C
Sbjct: 107 DTGSTVTYVPCSNCVQCG-------NHQDP---RFQPELSSTYQPVKC-----NADCNCD 151
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y Y E ++SSG+L ED++ G ++ L V + GC +SG
Sbjct: 152 ENGVQCTYERRY-AEMSTSSGVLAEDVMSF--GKESEL---VPQRAVFGCETMESGDLYT 205
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG++GLG G +SV L G++ NSFS+C+ D G + G P +
Sbjct: 206 QRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSH 264
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETI--- 324
S Y Y I ++ + LK + AI+DSG+++ + P++ Y
Sbjct: 265 SDPSRSPY--YNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDA 322
Query: 325 ---AAEFDRQVNDTITSFEGYPWKCCYKSSSQ---RLPKL-PSVKLMFPQNNSFVVNNPV 377
F +Q++ +F+ C+ + + LPK+ P V ++F ++
Sbjct: 323 IMKKISFLKQISGPDPNFK----DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPEN 378
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
++ T+V +CL I D T +G + V ++REN +G+ +NC +L
Sbjct: 379 YLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 153/356 (42%), Gaps = 40/356 (11%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----L 151
D G D+LW+ C+ C C S L LN + S+SST+ + CS +C
Sbjct: 84 DTGSDVLWVCCNSCNNCPRTSG-----LGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTT 138
Query: 152 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
T C + C YT Y + + +SG V D L+ + +L ++ A ++ GC QS
Sbjct: 139 ATQCSSQTDQCSYTFQY-GDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQS 197
Query: 212 GGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGD----- 263
G A DG+ G G GE+SV S L+ G+ FS C D SG + G+
Sbjct: 198 GDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSGGGILVLGEILEPG 257
Query: 264 --------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTF 315
P + +A NG+ ++ ++ +S + T IVDSG++ +
Sbjct: 258 IVYSPLVPSQPHYNLNLLSIAVNGQ----LLPIDPAAFATSNSQGT----IVDSGTTLAY 309
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
L E Y+ + + V+ ++T + CY S+ P F S V+
Sbjct: 310 LVAEAYDPFVSAVNAIVSPSVTPITSKGNQ-CYLVSTSVSQMFPLASFNFAGGASMVLKP 368
Query: 376 PVFVI-YGTQVVTG-FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++I +G+ + +C+ Q V G + +G + V+D ++GW++ +C
Sbjct: 369 EDYLIPFGSSGGSAMWCIGFQKVQG-VTILGDLVLKDKIFVYDLVRQRIGWANYDC 423
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 100/399 (25%), Positives = 164/399 (41%), Gaps = 46/399 (11%)
Query: 70 VQKQKMKTGPQFQMLFPSQGSKT----------MSLGN---------DFGCDLLWIPCD- 109
V ++++ G + FP +GS + LGN D G D+LW+ C
Sbjct: 60 VSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSP 119
Query: 110 CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG-----TSCQNPKQ---P 161
C C P S+ L+ L ++P +SST+ ++CS C G CQ P
Sbjct: 120 CTGC-PTSSG----LNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSP 174
Query: 162 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAP 220
C YT Y + + +SG V D + + N + AS++ GC QSG A
Sbjct: 175 CGYTFTY-GDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAV 233
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFLASN 278
DG+ G G ++SV S L G+ FS C D+G + G+ T + S
Sbjct: 234 DGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQ 293
Query: 279 GKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVN 333
Y + + + I SS ++ + IVDSG++ +L Y+ + V+
Sbjct: 294 PHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS 353
Query: 334 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG--FCL 391
++ S +C SSS P+V L F + V +++ V +C+
Sbjct: 354 PSVRSLVSKGSQCFITSSSVD-SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCI 412
Query: 392 AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Q G +I +G + V+D N+++GW+ +C
Sbjct: 413 GWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 451
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 149/362 (41%), Gaps = 37/362 (10%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---- 152
D G D+LW+ C C C SA L+ L Y P SST+ +SCS LC G
Sbjct: 20 DTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFA 74
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C C Y Y + ++S G V D + N L N+ + V+ GC ++Q+
Sbjct: 75 EAQCSQATNNCEYIFSY-GDGSTSEGYYVRDAMQYNVISSNGLANTT-SQVLFGCSIRQT 132
Query: 212 GGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPAT-- 268
G A DG+IG G E+SVP+ LA I FS C + + G G A
Sbjct: 133 GDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVIGGIAEPG 192
Query: 269 QQSTSFLASNGKYITYIIGVETCC----IGSSCLKQTSFKAIV-DSGSSFTFLPKEVYET 323
T + + Y + G+ I + T+ ++ DSG++ + P Y
Sbjct: 193 MTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAYNV 252
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYG 382
+ T +G +C S RL L P+V L F + + + ++++G
Sbjct: 253 FVQAIREATSATPVRVQGMDTQCFLVSG--RLSDLFPNVTLNF-EGGAMELQPDNYLMWG 309
Query: 383 TQVVTG----FCLAIQ-------PVDGDIGTI-GQNFMTGYRVVFDRENLKLGWSHSNCQ 430
TG +C+ Q P DG TI G + VV+D +N ++GW NC+
Sbjct: 310 GTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNCK 369
Query: 431 DL 432
L
Sbjct: 370 FL 371
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 100/399 (25%), Positives = 164/399 (41%), Gaps = 46/399 (11%)
Query: 70 VQKQKMKTGPQFQMLFPSQGSKT----------MSLGN---------DFGCDLLWIPCD- 109
V ++++ G + FP +GS + LGN D G D+LW+ C
Sbjct: 62 VSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSP 121
Query: 110 CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG-----TSCQNPKQ---P 161
C C P S+ L+ L ++P +SST+ ++CS C G CQ P
Sbjct: 122 CTGC-PTSSG----LNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSP 176
Query: 162 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAP 220
C YT Y + + +SG V D + + N + AS++ GC QSG A
Sbjct: 177 CGYTFTY-GDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAV 235
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFLASN 278
DG+ G G ++SV S L G+ FS C D+G + G+ T + S
Sbjct: 236 DGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQ 295
Query: 279 GKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVN 333
Y + + + I SS ++ + IVDSG++ +L Y+ + V+
Sbjct: 296 PHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS 355
Query: 334 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG--FCL 391
++ S +C SSS P+V L F + V +++ V +C+
Sbjct: 356 PSVRSLVSKGSQCFITSSSVD-SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCI 414
Query: 392 AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Q G +I +G + V+D N+++GW+ +C
Sbjct: 415 GWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 453
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 148/363 (40%), Gaps = 40/363 (11%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C + Y N+ P L H
Sbjct: 77 KPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTK---NKLVPCVDQLCASL---HNG 130
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSVQASVIIGC 206
+ C +P + C Y + Y + SS+G+LV D L L +G + V+ S+ GC
Sbjct: 131 LNRKHKCDSPYEQCDYVIKY-ADQGSSTGVLVNDSFALRLANG------SVVRPSLAFGC 183
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGP 266
G Q + DG++GLG G +S+ S + G+ +N C G +FFGD
Sbjct: 184 GYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCLSLRGGGFLFFGDDLV 243
Query: 267 ATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 325
Q+ T + + + Y G + G L+ + + DSGSSFT+ + Y+ +
Sbjct: 244 PYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFDSGSSFTYFAAQPYQALV 303
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN----NPVFVIY 381
++ T+ C+K + + VK F S V+N N F+
Sbjct: 304 TALKGDLSRTLKEVSDPSLPLCWK-GKKPFKSVLDVKKEF---KSLVLNFGNGNKAFMEI 359
Query: 382 GTQ---VVTGF---CLAIQPVDG------DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Q +VT + CL I ++G D+ +G M V++D E ++GW + C
Sbjct: 360 PPQNYLIVTKYGNACLGI--LNGSEVGLKDLSILGDITMQDQMVIYDNEKGQIGWIRAPC 417
Query: 430 QDL 432
+
Sbjct: 418 DRI 420
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 140/356 (39%), Gaps = 41/356 (11%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 155
D G DL W+ CD C RC+ Y R N++ P C H LC
Sbjct: 95 DTGSDLTWLQCDAPCSRCSQTPHPLY----RPSNDFVP----------CRHSLCASLHHS 140
Query: 156 QNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASV--IIGCGMKQ 210
N P+ DY Y ++ SS G+L+ D+ L N VQ V +GCG Q
Sbjct: 141 DNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL------NFTNGVQLKVRMALGCGYDQ 194
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
DG++GLG G+ S+ S L GL+RN C G IFFGD +++
Sbjct: 195 IFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFFGDVYDSSRL 254
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFD- 329
+ + ++S G G S A+ D+GSS+T+ Y+ + +
Sbjct: 255 TWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSYTYFNPYAYQALISWLGK 314
Query: 330 -------RQVNDTIT---SFEG-YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
++ +D T + G P++ Y+ P + S F + +
Sbjct: 315 ESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEMPPEAY 374
Query: 379 VIYGT--QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+I V G + GD+ IG M +VFD + +GW+ ++C +
Sbjct: 375 LIISNMGNVCLGILNGSEVGMGDLNLIGDISMLNKVMVFDNDKQLIGWTPADCDQV 430
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 147/350 (42%), Gaps = 25/350 (7%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G D+LW+ C C C S L+ L ++P SSTS + CS C L TS
Sbjct: 109 DTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTS 163
Query: 155 ---CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
CQ + PC YT Y + + +SG V D ++ S N + AS++ GC Q
Sbjct: 164 EAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDSVMGNEQTANSSASIVFGCSNSQ 222
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPA 267
SG A DG+ G G ++SV S L G+ FS C D+G + G+
Sbjct: 223 SGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEP 282
Query: 268 TQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYE 322
T + S Y + ++ + I SS ++ + IVDSG++ +L Y+
Sbjct: 283 GLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYD 342
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
V+ ++ S C+ +SS P+V L F + V +++
Sbjct: 343 PFVNAITAAVSPSVRSLVS-KGNQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQ 401
Query: 383 TQVVTG--FCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +C+ Q G I +G + V+D N+++GW+ +C
Sbjct: 402 ASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 151/356 (42%), Gaps = 55/356 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS- 154
D G D WI C C C ++ + PS SST ++CS R C +LG+S
Sbjct: 152 DTGSDQSWIQCKPCPDC----------YEQHEALFDPSKSSTYSDITCSSRECQELGSSH 201
Query: 155 ---CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C + K+ CPY + Y +++ + G L D L L + GCG +
Sbjct: 202 KHNCSSDKK-CPYEITY-ADDSYTVGNLARDTLTLS-------PTDAVPGFVFGCGHNNA 252
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG-----P 266
G + + DGL+GLG G+ S+ S +A FS C S + G P
Sbjct: 253 GSFGE---IDGLLGLGRGKASLSSQVA--ARYGAGFSYCLPSSPSATGYLSFSGAAAAAP 307
Query: 267 ATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKE 319
Q T +A G++ + Y + + + +K T+ I+DSG++F+ LP
Sbjct: 308 TNAQFTEMVA--GQHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSGTAFSCLPPS 365
Query: 320 VYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
Y A V + ++ P + CY + ++PSV L+F + + V +
Sbjct: 366 AY----AALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVF-ADGATVHLH 420
Query: 376 PVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
P V+Y V+ CLA P D +G +G V++D +N K+G+ + C
Sbjct: 421 PSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 476
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 147/361 (40%), Gaps = 35/361 (9%)
Query: 91 KTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
K L D G D++W+ C C C S +L DL Y SS+ K + C C
Sbjct: 96 KNYYLQVDTGSDIMWVNCIQCKECPTRS-----NLGMDLTLYDIKESSSGKFVPCDQEFC 150
Query: 150 D-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
L T C CPY ++ Y + +S++G V+DI+ + +S S++
Sbjct: 151 KEINGGLLTGC-TANISCPY-LEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVF 208
Query: 205 GCGMKQSGGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-F 261
GCG +QSG + A G++G G S+ S LA +G ++ F+ C + + G IF
Sbjct: 209 GCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAI 268
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-----IVDSGSSFTFL 316
G T L Y + V+ S TS + I+DSG++ +L
Sbjct: 269 GHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGTIIDSGTTLAYL 328
Query: 317 PKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
P+ +YE + + Q D T + Y C++ S P+V F S V
Sbjct: 329 PEGIYEPLVYKIISQHPDLKVRTLHDEYT---CFQYSESVDDGFPAVTFYFENGLSLKVY 385
Query: 375 NPVFVIYGTQVVTGFCLAIQPV------DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
++ +C+ Q ++ +G ++ V +D EN +GW+ N
Sbjct: 386 PHDYLFPSGDF---WCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYN 442
Query: 429 C 429
C
Sbjct: 443 C 443
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 148/373 (39%), Gaps = 54/373 (14%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K L D G DL W+ CD P + ++ +D Y P+ K CS +C
Sbjct: 73 KPYELDIDTGSDLTWVQCD----GPDAPCKGCTMPKD-KLYKPNGKQVVK---CSDPICV 124
Query: 151 -------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
LG C PC Y + Y ++ S+ G+LV D +H I ++ K+ + V
Sbjct: 125 ATQSTHVLGQICSKQSPPCVYNVQY-ADHASTLGVLVRDYMH-IGSPSSSTKDPL---VA 179
Query: 204 IGCGMKQ--SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
GCG +Q SG P G++GLG G+ S+ S L G I N C + G +F
Sbjct: 180 FGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCLSAEGGGYLFL 239
Query: 262 GDQ---------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 312
GD+ P Q S + G + G T G + I DSGSS
Sbjct: 240 GDKFVPSSGIVWTPIIQSSLEKHYNTGPVDLFFNGKPTPAKG--------LQIIFDSGSS 291
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYP-----WKCC--YKSSSQRLPKLPSVKLMF 365
+T+ VY +A + + S P WK +KS ++ + L F
Sbjct: 292 YTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSLNEVNNYFKPLTLSF 351
Query: 366 PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG------YRVVFDREN 419
++ + P CL I ++G+ +G + G VV+D E
Sbjct: 352 TKSKNLQFQLPPVAYLIITKYGNVCLGI--LNGNEAGLGNRNVVGDISLQDKVVVYDNEK 409
Query: 420 LKLGWSHSNCQDL 432
++GW+ +NC+ +
Sbjct: 410 QQIGWASANCKQI 422
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 147/350 (42%), Gaps = 25/350 (7%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G D+LW+ C C C S L+ L ++P SSTS + CS C L TS
Sbjct: 135 DTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTS 189
Query: 155 ---CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
CQ + PC YT Y + + +SG V D ++ + N + AS++ GC Q
Sbjct: 190 EAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQ 248
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPA 267
SG A DG+ G G ++SV S L G+ FS C D+G + G+
Sbjct: 249 SGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEP 308
Query: 268 TQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYE 322
T + S Y + ++ + I SS ++ + IVDSG++ +L Y+
Sbjct: 309 GLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYD 368
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
V+ ++ S C+ +SS P+V L F + V +++
Sbjct: 369 PFVNAITAAVSPSVRSLVS-KGNQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQ 427
Query: 383 TQVVTG--FCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +C+ Q G I +G + V+D N+++GW+ +C
Sbjct: 428 ASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 477
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 143/357 (40%), Gaps = 53/357 (14%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 155
D G DL W+ CD C RC+ Y R N+ P C H LC
Sbjct: 103 DTGSDLTWLQCDAPCSRCSQTPHPLY----RPSNDLVP----------CRHPLCASVHQT 148
Query: 156 QNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASV--IIGCGMKQ 210
N + + DY Y ++ SS G+LV D+ L N VQ V +GCG Q
Sbjct: 149 DNYECEVEHQCDYEVEYADHYSSLGVLVNDVYVL------NFTNGVQLKVRMALGCGYDQ 202
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
DG++GLG G+ S+ S L GL+RN C G IFFGD +++
Sbjct: 203 IFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGGGYIFFGDVYDSSRL 262
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
+ + ++S Y Y G +G + A+ D+GSS+T+ Y+ +
Sbjct: 263 AWTPMSSR-DYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTYFNSNAYQLTKELAGK 321
Query: 331 QVNDT-------ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN----SFVVNNPVFV 379
+ + + + P++ Y+ P + L FP + F + ++
Sbjct: 322 PIKEAPEDQTLPLCWYGKRPFRSVYEVKKYFKP----IALSFPGSRRSKAQFEIPPEAYL 377
Query: 380 IYGTQVVTGFCLAIQPVDG------DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
I + CL I +DG D+ IG M +VFD E +GW+ ++C
Sbjct: 378 IISN--MGNVCLGI--LDGSEVGVEDLNLIGDISMLDKVMVFDNEKQLIGWTAADCN 430
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 156/369 (42%), Gaps = 39/369 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C N D + P SST + CS +C
Sbjct: 103 DSGSTVTYVPCASCEQCG-------NHQD---PRFQPDLSSTYSPVKCS-----ADCTCD 147
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K C Y Y E +SSSG+L EDI+ G ++ LK + GC ++G
Sbjct: 148 SDKSQCTYERQY-AEMSSSSGVLGEDIVSF--GTESELKPQ---RAVFGCENSETGDLFS 201
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++S+ L G+I +SFSMC+ D G + G PA
Sbjct: 202 QHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAM-PAPPDMVFS 259
Query: 275 LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
+ + Y I ++ + L+ + ++DSG+++ +LP++ +
Sbjct: 260 RSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLPEQAFVAFKDAV 319
Query: 329 DRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIYG 382
+V I + C+ + + + +L P V ++F ++ ++
Sbjct: 320 TSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRH 379
Query: 383 TQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLT 441
++V +CL + D T +G + V +DR N K+G+ +NC +L +
Sbjct: 380 SKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSELWERLHVSGA 439
Query: 442 PGPGTPSNP 450
P P S+P
Sbjct: 440 PSPAPSSDP 448
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 159/369 (43%), Gaps = 41/369 (11%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C N D + P SS S S C++ +C
Sbjct: 106 DSGSTVTYVPCSSCEQCG-------NHQD---PRFQPDLSS-----SYSPVKCNVDCTCD 150
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K+ C Y Y E +SSSG+L EDI+ G ++ LK I GC ++G
Sbjct: 151 SDKKQCTYERQY-AEMSSSSGVLGEDIVSF--GRESELK---PQHAIFGCENSETGDLFS 204
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG++GLG G++S+ L + G+I +SFS+C+ D G + G P ++
Sbjct: 205 QHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMIFSN 263
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAAE 327
Y Y I ++ + L+ S ++DSG+++ +LP++ +
Sbjct: 264 SDPLRSPY--YNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQAFVAFKEA 321
Query: 328 FDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIY 381
+V+ I + C+ + + + KL P V ++F + ++
Sbjct: 322 VTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFR 381
Query: 382 GTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
++V +CL + D T+ G + V +DR N K+G+ +NC +L +
Sbjct: 382 HSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCSELWERLHIGD 441
Query: 441 TPGPGTPSN 449
TP P S+
Sbjct: 442 TPSPAPSSD 450
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 125/275 (45%), Gaps = 25/275 (9%)
Query: 98 DFGCDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G W+ C +C + + + R L Y P +S +SK + C +C C
Sbjct: 101 DTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSVSSKEVKCDDTICTSRPPC- 154
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N CPY Y + + G+L D+LH N SV GCG++QSG +
Sbjct: 155 NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNN 213
Query: 217 -GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSF 274
VA DG+IG G + S LA AG + FS C D + G IF G+ ++T
Sbjct: 214 SAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPI 273
Query: 275 LASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IVDSGSSFTFLPKEVY-ETIA 325
+ +N Y +++ +++ + + L+ T K +DSGS+ +LP+ +Y E I
Sbjct: 274 VKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELIL 331
Query: 326 AEFDRQVNDTITSFEGYPWKCCY--KSSSQRLPKL 358
A F + + T+ + Y ++C + S + PK+
Sbjct: 332 AVFAKHPDITMGAM--YNFQCFHFLGSVDDKFPKI 364
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 147/350 (42%), Gaps = 25/350 (7%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G D+LW+ C C C S L+ L ++P SSTS + CS C L TS
Sbjct: 109 DTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTS 163
Query: 155 ---CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
CQ + PC YT Y + + +SG V D ++ + N + AS++ GC Q
Sbjct: 164 EAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQ 222
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPA 267
SG A DG+ G G ++SV S L G+ FS C D+G + G+
Sbjct: 223 SGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEP 282
Query: 268 TQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYE 322
T + S Y + ++ + I SS ++ + IVDSG++ +L Y+
Sbjct: 283 GLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYD 342
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
V+ ++ S C+ +SS P+V L F + V +++
Sbjct: 343 PFVNAITAAVSPSVRSLVS-KGNQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQ 401
Query: 383 TQVVTG--FCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +C+ Q G I +G + V+D N+++GW+ +C
Sbjct: 402 ASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 147/359 (40%), Gaps = 37/359 (10%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---- 152
D G D+LW+ C C C SA L+ L Y P SST+ +SCS LC G
Sbjct: 47 DTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFA 101
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C C Y Y + ++S G V D + N L N+ + V+ GC ++Q+
Sbjct: 102 EAQCSQTTNNCEYIFSY-GDGSTSEGYYVRDAMQYNVISSNGLANTT-SQVLFGCSIRQT 159
Query: 212 GGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPAT-- 268
G A DG+IG G E+SVP+ LA I FS C + + G G A
Sbjct: 160 GDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVIGGIAEPG 219
Query: 269 QQSTSFLASNGKYITYIIGVETCC----IGSSCLKQTSFKAIV-DSGSSFTFLPKEVYET 323
T + + Y + G+ I + T+ ++ DSG++ + P Y
Sbjct: 220 MTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAYNV 279
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIYG 382
+ T +G +C S RL L P+V L F + + + ++++G
Sbjct: 280 FVQAIREATSATPVRVQGMDTQCFLVSG--RLSDLFPNVTLNF-EGGAMELQPDNYLMWG 336
Query: 383 TQVVTG----FCLAIQ-------PVDGDIGTI-GQNFMTGYRVVFDRENLKLGWSHSNC 429
TG +C+ Q P DG TI G + VV+D +N ++GW NC
Sbjct: 337 GTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 130/311 (41%), Gaps = 42/311 (13%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
SK + D G D+LW+ C C RC S L DL Y AS+TS + C
Sbjct: 88 SKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKASTTSDAVGCDDNF 142
Query: 149 CDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C L C+ P C Y++ Y + +S++G V+D + N +V+
Sbjct: 143 CSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 200
Query: 205 GCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
GCG KQSG A DG++G G S+ S LA +G ++ FS C D D G IF
Sbjct: 201 GCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA-- 258
Query: 264 QGPATQQSTSFLASNGKYITYI---------------IGVETCCIGSSCLKQTSFKA-IV 307
G + FL N I + +G + + S + K I+
Sbjct: 259 IGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTII 318
Query: 308 DSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 362
DSG++ + P+EVY + ++ + D +++ +F C+ + P+V
Sbjct: 319 DSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT------CFDYTGNVDDGFPTVT 372
Query: 363 LMFPQNNSFVV 373
L F ++ S V
Sbjct: 373 LHFDKSISLTV 383
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 170/396 (42%), Gaps = 67/396 (16%)
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYY 121
Q+ L+S ++ Q + ++ G + M++ D G DL W+ C C RC Y
Sbjct: 52 QIPLTSGIRLQSLNYIVTVEL-----GGRKMTVIVDTGSDLSWVQCQPCNRC-------Y 99
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLC--------DLGTSCQNPKQPCPYTMDYYTENT 173
N D N PS S + + + C+ C + G NP C Y ++Y +
Sbjct: 100 NQQDPVFN---PSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPT-CNYVVNYGDGSY 155
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
+S + +E HL L N+ + I GCG K G L G A GL+GLG ++S+
Sbjct: 156 TSGEVGME---HL------NLGNTTVNNFIFGCGRKNQG--LFGGA-SGLVGLGRTDLSL 203
Query: 234 PSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQST----SFLASNGKYITYII 286
S ++ + FS C + + SG + G + +T + + N Y +
Sbjct: 204 ISQISP--MFGGVFSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPFYFL 261
Query: 287 GVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 343
+ +G ++ SF + I+DSG+ + LP +Y+ + AEF +Q F GYP
Sbjct: 262 NLTGITVGGVEVQAPSFGKDRMIIDSGTVISRLPPSIYQALKAEFVKQ-------FSGYP 314
Query: 344 -------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGFCLAIQ- 394
C+ S + K+P +K+ F + V + V Y + + CLAI
Sbjct: 315 SAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNV-DVTGVFYSVKTDASQVCLAIAS 373
Query: 395 -PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
P + ++G IG R+++D + LG++ C
Sbjct: 374 LPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 148/352 (42%), Gaps = 27/352 (7%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---- 152
D G D+LW+ C C C P S+ L+ L ++P +SST+ ++CS C G
Sbjct: 23 DTGSDILWVTCSPCTGC-PTSSG----LNIQLESFNPDSSSTASRITCSDDRCTAGFQTG 77
Query: 153 -TSCQNPKQ---PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
CQ PC YT Y + + +SG V D + + N + AS++ GC
Sbjct: 78 EAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSN 136
Query: 209 KQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQG 265
QSG A DG+ G G ++SV S L G+ FS C D+G + G+
Sbjct: 137 SQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIV 196
Query: 266 PATQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEV 320
T + S Y + + + I SS ++ + IVDSG++ +L
Sbjct: 197 EPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGA 256
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y+ + V+ ++ S +C SSS P+V L F + V +++
Sbjct: 257 YDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD-SSFPTVTLYFMGGVAMSVKPENYLL 315
Query: 381 YGTQVVTG--FCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V +C+ Q G +I +G + V+D N+++GW+ +C
Sbjct: 316 QQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 367
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 164/389 (42%), Gaps = 42/389 (10%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 135
+ G F + K + D G D+LW+ C P S+ L LN + P +
Sbjct: 79 RVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSG----LHIPLNFFDPGS 134
Query: 136 SSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
SST+ +SCS + C LG C + C YT Y + + +SG V D+L+ +
Sbjct: 135 SSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQ-YGDGSGTSGYYVSDLLNFDAIV 193
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
+++ NS AS++ GC + Q+G A DG+ G G ++SV S ++ G+ FS
Sbjct: 194 GSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSH 252
Query: 250 C----------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 293
C ++D Q P + ++ NGK + ++
Sbjct: 253 CLKGDGGGGGILVLGEIVEEDIVYSPLVPSQ-PHYNLNLQSISVNGKS----LAIDPEVF 307
Query: 294 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 353
+S + T IVDSG++ +L +E Y+ + V+ ++ +C +SS
Sbjct: 308 ATSTNRGT----IVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSV 363
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV--VTGFCLAIQPVDGD-IGTIGQNFMTG 410
+ P+V L F S + +++ + +C+ Q + G I +G +
Sbjct: 364 K-GIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKD 422
Query: 411 YRVVFDRENLKLGWSHSNC-QDLNDGTKS 438
V+D ++GW++ +C +N T+S
Sbjct: 423 KIFVYDLAGQRIGWANYDCSMSVNVSTRS 451
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/389 (21%), Positives = 174/389 (44%), Gaps = 42/389 (10%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C ++ P +SST + + C+ + +C
Sbjct: 130 DTGSTVTYVPCSTCEQCG----------RHQDPKFQPESSSTYQPVKCT-----IDCNCD 174
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ C Y Y E ++SSG+L ED++ + + A + +V GC ++G
Sbjct: 175 GDRMQCVYERQY-AEMSTSSGVLGEDVISFGNQSELAPQRAV-----FGCENVETGDLYS 228
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++S+ L +I +SFS+C+ D G + G P + + ++
Sbjct: 229 QHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGGISPPSDMTFAY 287
Query: 275 LASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVY----ETI 324
+ + Y I ++ + L + ++DSG+++ +LP+ + + I
Sbjct: 288 -SDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAI 346
Query: 325 AAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
E +Q++ ++ + SQ P V ++F + + ++ ++
Sbjct: 347 VKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENYMFRH 406
Query: 383 TQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLT 441
++V +CL I D T +G + V++DRE K+G+ +NC +L + ++ +
Sbjct: 407 SKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCAELWERLQTSIA 466
Query: 442 PGPGTPSNPLPANQEQSSPGGHAVGPAVA 470
P P P++ + + E P +V P+V+
Sbjct: 467 PPPLPPNSGVRNSSEALEP---SVAPSVS 492
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 156/356 (43%), Gaps = 39/356 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C C S L +L+ YSPS+SSTS ++C+ C TS
Sbjct: 92 DTGSDILWVNCAGCTNCPKKS-----DLGIELSLYSPSSSSTSNRVTCNQDFC---TSTY 143
Query: 157 N-------PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
+ P+ C Y + Y + +S++G V D + L N S S++ GCG +
Sbjct: 144 DGPIPGCTPELLCEYRV-AYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNGSIVFGCGAQ 202
Query: 210 QSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPA 267
QSG A DG++G G S+ S LA +G ++ F+ C D + G IF G+
Sbjct: 203 QSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGIFAIGEVVQP 262
Query: 268 TQQSTSFLASNGKYITYIIGVE---------TCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
++T + Y ++ +E T + K T I+DSG++ + P
Sbjct: 263 KVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGT----IIDSGTTLAYFPD 318
Query: 319 EVYE-TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NN 375
+YE I+ F RQ + + E C++ P+V F + S V +
Sbjct: 319 VIYEPLISKIFARQSTLKLHTVE--EQFTCFEYDGNVDDGFPTVTFHFEDSLSLTVYPHE 376
Query: 376 PVFVIYGTQVVTGF-CLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+F I + G+ Q DG D+ +G + V++D EN +GW+ NC
Sbjct: 377 YLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYDLENQTIGWTEYNC 432
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 117/272 (43%), Gaps = 33/272 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C N + L Y P+ SK + C HRL
Sbjct: 77 KPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT---KSKLVPCVHRL 123
Query: 149 CDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSVQ 199
C C +P + C Y + Y + SS+G+L+ D L L +G + +
Sbjct: 124 CASLHNGLTGKHRCDSPHEQCDYVIKY-ADQGSSTGVLINDSFALRLTNG------SVAR 176
Query: 200 ASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
SV GCG Q D +P DG++GLG G +S+ S L + G+ +N C G
Sbjct: 177 PSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGF 236
Query: 259 IFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+FFGD Q++T + +A + Y G + G L K + DSGSSFT+
Sbjct: 237 LFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFA 296
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 349
+ Y+ + ++ T+ C+K
Sbjct: 297 AKPYQALVTALKDGLSRTLEEEPDTSLPLCWK 328
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 164/389 (42%), Gaps = 42/389 (10%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 135
+ G F + K + D G D+LW+ C P S+ L LN + P +
Sbjct: 64 RVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSG----LHIPLNFFDPGS 119
Query: 136 SSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
SST+ +SCS + C LG C + C YT Y + + +SG V D+L+ +
Sbjct: 120 SSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQ-YGDGSGTSGYYVSDLLNFDAIV 178
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
+++ NS AS++ GC + Q+G A DG+ G G ++SV S ++ G+ FS
Sbjct: 179 GSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSH 237
Query: 250 C----------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 293
C ++D Q P + ++ NGK + ++
Sbjct: 238 CLKGDGGGGGILVLGEIVEEDIVYSPLVPSQ-PHYNLNLQSISVNGKS----LAIDPEVF 292
Query: 294 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 353
+S + T IVDSG++ +L +E Y+ + V+ ++ +C +SS
Sbjct: 293 ATSTNRGT----IVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSV 348
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV--VTGFCLAIQPVDGD-IGTIGQNFMTG 410
+ P+V L F S + +++ + +C+ Q + G I +G +
Sbjct: 349 K-GIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKD 407
Query: 411 YRVVFDRENLKLGWSHSNC-QDLNDGTKS 438
V+D ++GW++ +C +N T+S
Sbjct: 408 KIFVYDLAGQRIGWANYDCSMSVNVSTRS 436
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 121/279 (43%), Gaps = 25/279 (8%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C C RC S L +L Y P SST +SC C
Sbjct: 51 DTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSSTGSKVSCDQGFCAATYGGL 105
Query: 157 NP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
P PC Y++ Y + +S++G V D+L + ++V GCG +Q G
Sbjct: 106 LPGCTTSLPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGG 164
Query: 213 GY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
A DG+IG G S+ S L+ AG ++ F+ C D + G IF +
Sbjct: 165 DLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGIFAIGNVVQPKVK 224
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPKEVY-E 322
T+ L N + Y + +++ +G + LK S I+DSG++ T+LP+ VY E
Sbjct: 225 TTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKE 282
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
+ A F + + T + + + C L PSV
Sbjct: 283 IMLAVFAKHKDITFHNVQ--EFLCFQYVGRYTLQHTPSV 319
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 59/371 (15%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
Q K L D G DL W+ CD C++C P Y T+ + C
Sbjct: 75 QPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQP--------------TNDLVVCK 120
Query: 146 HRLCDL----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQ 199
+C C +P Q C Y ++Y + SS G+LV D+ ++L SG +
Sbjct: 121 DPICASLHPDNYRCDDPDQ-CDYEVEY-ADGGSSIGVLVNDLFPVNLTSG------MRAR 172
Query: 200 ASVIIGCGMKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 256
+ IGCG Q L G+A DG++GLG G S+ + L+ GL+RN CF +
Sbjct: 173 PRLTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGG 228
Query: 257 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
G +FFGD + + S Y G + + + DSGSS+T+
Sbjct: 229 GYLFFGDDIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYF 288
Query: 317 PKEVYETIAAEFDRQV----------NDTI-TSFEG-YPWKCCYKSSSQRLPKLPSVKLM 364
+ Y+T+ + + + +DT+ + G P+K + P S
Sbjct: 289 NTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSG 348
Query: 365 FPQNNSFVVNNPVFVIYGTQ------VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
+ + F + ++I ++ ++ G + +Q + IG M V++D E
Sbjct: 349 WKTKSQFEIQQESYLIISSKGSVCLGILNGTEVGLQ----NYNIIGDISMQEKLVIYDNE 404
Query: 419 NLKLGWSHSNC 429
+GW SNC
Sbjct: 405 KQVIGWQPSNC 415
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 118/278 (42%), Gaps = 40/278 (14%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
TM++GN D G DL W+ CD C C N + L Y P+A+S
Sbjct: 57 TMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTANSL-- 104
Query: 141 HLSCSHRLCDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
+ C++ LC C +PKQ C Y + YT++ SS G+L+ D L N
Sbjct: 105 -VPCANALCTALHSGHGSNNKCPSPKQ-CDYQIK-YTDSASSQGVLINDNFSLPMRSSN- 160
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
++ + GCG Q G V A DG++GLG G +S+ S L + G+ +N C
Sbjct: 161 ----IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216
Query: 252 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGS 311
+ G +FFGD T + T + Y G T L + + DSGS
Sbjct: 217 STNGGGFLFFGDDIVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 349
++T+ + Y+ + + ++ ++ C+K
Sbjct: 277 TYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSLPLCWK 314
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 107/415 (25%), Positives = 168/415 (40%), Gaps = 65/415 (15%)
Query: 46 KNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLW 105
K + TS ++S Q+ L+S ++ + + ++ G K MSL D G DL W
Sbjct: 56 KIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVEL-----GGKNMSLIVDTGSDLTW 110
Query: 106 IPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNP------ 158
V+C P + Y ++ Y PS SS+ K + C+ C DL + N
Sbjct: 111 -----VQCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGN 161
Query: 159 ----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
K PC Y + Y + + L E IL GD L+N + GCG G +
Sbjct: 162 NGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKLEN-----FVFGCGRNNKGLF 212
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQQS 271
GL +S+ S K FS C + SG + FG+ S
Sbjct: 213 GGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNS 267
Query: 272 TSF----LASNGKYIT-YIIGVETCCIGSSCLKQTSF--KAIVDSGSSFTFLPKEVYETI 324
TS L N + + YI+ + IG LK +SF ++DSG+ T LP +Y+ +
Sbjct: 268 TSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAV 327
Query: 325 AAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
EF +Q F G+P C+ +S +P +K++F N V+
Sbjct: 328 KIEFLKQ-------FSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTG 380
Query: 378 FVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + CLA+ + + ++G IG RV++D +LG NC+
Sbjct: 381 VFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENCR 435
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 150/374 (40%), Gaps = 64/374 (17%)
Query: 95 LGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
L D G DL WI CD C CA + Y +L +S+ + L
Sbjct: 215 LDIDTGSDLTWIQCDAPCTSCAKGANQLYKPRKDNL-------VRSSEPFCVEVQRNQLT 267
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--VQASVIIGCGMKQ 210
C++ Q C Y ++Y +++ S G+L +D HL L N ++ ++ GCG Q
Sbjct: 268 EHCESCHQ-CDYEIEY-ADHSYSMGVLTKDKFHL------KLHNGSLAESDIVFGCGYDQ 319
Query: 211 SGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFG-DQGP 266
G L+ + DG++GL +IS+PS LA G+I N C D + G IF G D P
Sbjct: 320 QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVP 379
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF-----KAIVDSGSSFTFLPKEVY 321
+ + + + Y + V G++ L K + D+GSS+T+ P + Y
Sbjct: 380 SHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVGKVLFDTGSSYTYFPNQAY 439
Query: 322 ETIAAEFDRQVNDTIT---SFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
+ + +T S E P C ++ + L VK F P+
Sbjct: 440 SQLVTSLQEVSDLELTRDDSDEALPI-CWRAKTNSPISSLSDVKKFF---------RPIT 489
Query: 379 VIYGTQ-VVTGFCLAIQPV----------------------DGDIGTIGQNFMTGYRVVF 415
+ G++ ++ L IQP DG IG M G +V+
Sbjct: 490 LQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIGDISMRGRLIVY 549
Query: 416 DRENLKLGWSHSNC 429
D ++GW S+C
Sbjct: 550 DNVKQRIGWMKSDC 563
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 153/370 (41%), Gaps = 55/370 (14%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K D G DL W+ CD AP S +L +L +Y P + + CS+ +C
Sbjct: 60 KAFQFDIDTGSDLTWVQCD----APCSGC---TLPPNL-QYKPKGNI----IPCSNPICT 107
Query: 151 L-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVII 204
C NP++ C Y + Y + +S L+ + L L++G + +Q V
Sbjct: 108 ALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALVTDQFPLKLVNG------SFMQPPVAF 161
Query: 205 GCGMKQSGGYLDGVAPD---GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
GCG QS Y P G++GLG G+I + + L AGL RN C G +FF
Sbjct: 162 GCGYDQS--YPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGFLFF 219
Query: 262 GDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEV 320
GD P+ + + L S + Y G K I D+GSS+T+ +
Sbjct: 220 GDNLVPSIGVAWTPLLSQDNH--YTTGPADLLFNGKPTGLKGLKLIFDTGSSYTYFNSKA 277
Query: 321 YETIAAEF--DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP----------QN 368
Y+TI D +V+ + E C+K ++ + VK F +N
Sbjct: 278 YQTIINLIGNDLKVSPLKVAKEDKTLPICWK-GAKPFKSVLEVKNFFKTITINFTNGRRN 336
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDG-DIG-----TIGQNFMTGYRVVFDRENLKL 422
+ +++I CL + ++G ++G IG M G +++D E +L
Sbjct: 337 TQLYLAPELYLI--VSKTGNVCLGL--LNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQL 392
Query: 423 GWSHSNCQDL 432
GW S+C L
Sbjct: 393 GWVSSDCNKL 402
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 149/374 (39%), Gaps = 63/374 (16%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K D G D+ W+ CD C C +L L +Y P ++ + CS +
Sbjct: 65 KAFEFDIDTGSDITWVQCDAPCTGC---------NLPPKL-QYKPKGNT----VPCSDPI 110
Query: 149 C-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASV 202
C C NPK+ C Y ++Y + +S L+++ L++G +++Q +
Sbjct: 111 CLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPFKLLNG------SAMQPRL 164
Query: 203 IIGCGMKQSGGYLDGVAPD---GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 259
GCG QS Y P G++GLG G+I + + L AGL RN C G +
Sbjct: 165 AFGCGYDQS--YPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGYL 222
Query: 260 FFGDQ-GPATQQS-TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
FFGD P+ + T L + Y T G K I D+GSS+T+
Sbjct: 223 FFGDTLIPSLGVAWTPLLPPDNHYTT---GPAELLFNGKPTGLKGLKLIFDTGSSYTYFN 279
Query: 318 KEVYETIAAEF--DRQVNDTITSFEGYPWKCCYKSSSQRLPKLP---------------- 359
+ Y+TI D +V+ + E C+K + L
Sbjct: 280 SKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFTNAR 339
Query: 360 -SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
+ +L P + +++ G ++ G + +Q + IG M G +++D E
Sbjct: 340 RNTQLQIPPESYLIISKTGNACLG--LLNGSEVGLQ----NSNVIGDISMQGLLIIYDNE 393
Query: 419 NLKLGWSHSNCQDL 432
+LGW SNC L
Sbjct: 394 KQQLGWVSSNCNKL 407
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 150/375 (40%), Gaps = 66/375 (17%)
Query: 95 LGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
L D G +L WI CD C CA + Y +L +S+ + L
Sbjct: 220 LDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNL-------VRSSEAFCVEVQRNQLT 272
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--VQASVIIGCGMKQ 210
C+N Q C Y ++Y +++ S G+L +D HL L N ++ ++ GCG Q
Sbjct: 273 EHCENCHQ-CDYEIEY-ADHSYSMGVLTKDKFHL------KLHNGSLAESDIVFGCGYDQ 324
Query: 211 SGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFG-DQGP 266
G L+ + DG++GL +IS+PS LA G+I N C D + G IF G D P
Sbjct: 325 QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVP 384
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF-----KAIVDSGSSFTFLPKEVY 321
+ + + + + Y + V G L K + D+GSS+T+ P + Y
Sbjct: 385 SHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAY 444
Query: 322 ETIAAEFDRQVNDTIT---SFEGYPWKCCYKSSSQ-RLPKLPSVKLMFPQNNSFVVNNPV 377
+ +T S E P C+++ + L VK F P+
Sbjct: 445 SQLVTSLQEVSGLELTRDDSDETLP--ICWRAKTNFPFSSLSDVKKFF---------RPI 493
Query: 378 FVIYGTQ-VVTGFCLAIQPV----------------------DGDIGTIGQNFMTGYRVV 414
+ G++ ++ L IQP DG +G M G+ +V
Sbjct: 494 TLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIV 553
Query: 415 FDRENLKLGWSHSNC 429
+D ++GW S+C
Sbjct: 554 YDNVKRRIGWMKSDC 568
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 107/415 (25%), Positives = 168/415 (40%), Gaps = 65/415 (15%)
Query: 46 KNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLW 105
K + TS ++S Q+ L+S ++ + + ++ G K MSL D G DL W
Sbjct: 104 KIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVEL-----GGKNMSLIVDTGSDLTW 158
Query: 106 IPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNP------ 158
V+C P + Y ++ Y PS SS+ K + C+ C DL + N
Sbjct: 159 -----VQCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGN 209
Query: 159 ----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
K PC Y + Y + + L E IL GD L+N + GCG G +
Sbjct: 210 NGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKLEN-----FVFGCGRNNKGLF 260
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQQS 271
GL +S+ S K FS C + SG + FG+ S
Sbjct: 261 GGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNS 315
Query: 272 TSF----LASNGKYIT-YIIGVETCCIGSSCLKQTSF--KAIVDSGSSFTFLPKEVYETI 324
TS L N + + YI+ + IG LK +SF ++DSG+ T LP +Y+ +
Sbjct: 316 TSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAV 375
Query: 325 AAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
EF +Q F G+P C+ +S +P +K++F N V+
Sbjct: 376 KIEFLKQ-------FSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTG 428
Query: 378 FVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + CLA+ + + ++G IG RV++D +LG NC+
Sbjct: 429 VFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENCR 483
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 137/326 (42%), Gaps = 27/326 (8%)
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGL 178
L DL Y P+ S TS + C C S C+ CPY++ Y + +++SG
Sbjct: 42 LGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITY-GDGSTTSGS 99
Query: 179 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSL 236
V D L N +SVI GCG KQSG A DG+IG G SV S
Sbjct: 100 FVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQ 159
Query: 237 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-----IGVETC 291
LA +G ++ FS C D G IF Q + +T+ L + I + E
Sbjct: 160 LAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPI 219
Query: 292 CIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYK 349
+ S + I+DSG++ +LP +Y + + RQ + E C+
Sbjct: 220 LLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVE--DQFTCFH 277
Query: 350 SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCL-----AIQPVDG-DIGTI 403
S + P VK F + V + +Y + +C+ + Q +G D+ I
Sbjct: 278 YSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKEDI---YCIGWQKSSTQTKEGRDLILI 334
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNC 429
G ++ VV+D EN+ +GW++ NC
Sbjct: 335 GDLVLSNKLVVYDLENMVIGWTNFNC 360
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 144/383 (37%), Gaps = 51/383 (13%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G F +F + L D G DL WI CD C CA Y +L S
Sbjct: 312 GLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSL 371
Query: 136 SSTSKHLSCSHRLCDLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 194
C +L T C+ +Q C Y ++Y +++SS G+L D LHL+ +
Sbjct: 372 --------CVEVQRNLKTGYCETCEQ-CDYEIEY-ADHSSSMGVLASDDLHLMLANGSLT 421
Query: 195 KNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 253
K ++ GC Q G L+ +A DG++GL ++S+PS LA +I N C
Sbjct: 422 K----LGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS 477
Query: 254 DDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKAI 306
D + G +F GD N Y + GS L + + +
Sbjct: 478 DATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVV 537
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-----WKCCYKSSSQRLPK---- 357
D+GSS+T+ PKE Y + A ++ + P W+ + S K
Sbjct: 538 FDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQ 597
Query: 358 -----------LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQN 406
+ S K P +++N V G DG +G
Sbjct: 598 PLTLQFRSKWWIVSTKFRIPPEGYLIISN------KGNVCLGILDGSNVHDGSTIILGDI 651
Query: 407 FMTGYRVVFDRENLKLGWSHSNC 429
+ G VV+D N K+GW+ S C
Sbjct: 652 SLRGKLVVYDNVNQKIGWAQSTC 674
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 152/354 (42%), Gaps = 40/354 (11%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K +L D G DL W +C P + + Y + L+ P+ S++ K++SCS C
Sbjct: 144 KEFTLIFDTGSDLTW-----TQCEPCAKTCYKQKEPRLD---PTKSTSYKNISCSSAFCK 195
Query: 151 L-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
L G SC +P C Y + Y + + S G + L L S N KN + G
Sbjct: 196 LLDTEGGESCSSPT--CLYQVQY-GDGSYSIGFFATETLTLSS--SNVFKN-----FLFG 245
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGD 263
CG +Q+ G G A GL+GLG ++S+PS A+ + FS C S G + FG
Sbjct: 246 CG-QQNSGLFRGAA--GLLGLGRTKLSLPSQTAQK--YKKLFSYCLPASSSSKGYLSFGG 300
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPK 318
Q T + T Y + + +G + L ++ ++DSG+ T LP
Sbjct: 301 QVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPS 360
Query: 319 EVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
Y +++ F + + D S +GY + CY S K+P V + F ++
Sbjct: 361 TAYSALSSAFQKLMTD-YPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDIDVSG 419
Query: 378 FVIYGTQVVTGFCLAIQPVDGDI--GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++Y + CLA D+ G Y+VV+D ++G++ S C
Sbjct: 420 -ILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 151/365 (41%), Gaps = 41/365 (11%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
Q K L D G DL W+ CD CVRC Y + + P +S
Sbjct: 75 QPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPMCAS-------- 126
Query: 146 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
L G C++P+Q C Y ++Y + SS G+LV+D+ L N L+ + + +G
Sbjct: 127 --LHPPGYKCEHPEQ-CDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGLR--LAPRLALG 178
Query: 206 CGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 264
CG Q G P DG++GLG G+ S+ S L G+IRN C G +FFGD
Sbjct: 179 CGYDQIPG--QSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDD 236
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV---DSGSSFTFLPKEVY 321
+ + ++ Y G +G K T FK ++ DSGSS+T+L Y
Sbjct: 237 LYDSSRVVWTPMLRDQHTHYSSGYAELILGG---KTTVFKNLLVTFDSGSSYTYLNSLAY 293
Query: 322 ETIAAEFDRQVND--TITSFEGYPWKCCYK-----SSSQRLPK-LPSVKLMFPQNN---- 369
+ + +++++ + + C++ S + + K + L FP
Sbjct: 294 QALVHLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKT 353
Query: 370 --SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
+ + + + V G + D IG M VV+D E ++GW+ +
Sbjct: 354 QYDIPLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPT 413
Query: 428 NCQDL 432
NC L
Sbjct: 414 NCDRL 418
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 107/415 (25%), Positives = 168/415 (40%), Gaps = 65/415 (15%)
Query: 46 KNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLW 105
K + TS ++S Q+ L+S ++ + + ++ G K MSL D G DL W
Sbjct: 104 KIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVEL-----GGKNMSLIVDTGSDLTW 158
Query: 106 IPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNP------ 158
V+C P + Y ++ Y PS SS+ K + C+ C DL + N
Sbjct: 159 -----VQCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGN 209
Query: 159 ----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
K PC Y + Y + + L E IL GD L+N + GCG G +
Sbjct: 210 NGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKLEN-----FVFGCGRNNKGLF 260
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQQS 271
GL +S+ S K FS C + SG + FG+ S
Sbjct: 261 GGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNS 315
Query: 272 TSF----LASNGKYIT-YIIGVETCCIGSSCLKQTSF--KAIVDSGSSFTFLPKEVYETI 324
TS L N + + YI+ + IG LK +SF ++DSG+ T LP +Y+ +
Sbjct: 316 TSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAV 375
Query: 325 AAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
EF +Q F G+P C+ +S +P +K++F N V+
Sbjct: 376 KIEFLKQ-------FSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTG 428
Query: 378 FVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + CLA+ + + ++G IG RV++D +LG NC+
Sbjct: 429 VFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENCR 483
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 88/369 (23%), Positives = 158/369 (42%), Gaps = 41/369 (11%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C N D + P SS S S C++ +C
Sbjct: 107 DSGSTVTYVPCASCEQCG-------NHQD---PRFQPDLSS-----SYSPVKCNVDCTCD 151
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K+ C Y Y E +SSSG+L EDI+ G ++ LK + GC ++G
Sbjct: 152 SDKKQCTYERQY-AEMSSSSGVLGEDIVSF--GRESELK---PQRAVFGCENSETGDLFS 205
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG++GLG G++S+ L + G+I +SFS+C+ D G + G P+ +
Sbjct: 206 QHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAPSDMVFSH 264
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAE 327
Y Y I ++ + L+ + ++DSG+++ +LP++ +
Sbjct: 265 SDPLRSPY--YNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQAFVAFKDA 322
Query: 328 FDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIY 381
+V+ I + C+ + + + KL P V ++F + ++
Sbjct: 323 VTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFR 382
Query: 382 GTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
++V +CL + D T +G + V +DR N K+G+ +NC +L +
Sbjct: 383 HSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCSELWERLHISD 442
Query: 441 TPGPGTPSN 449
P P S+
Sbjct: 443 APSPAPSSD 451
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 150/354 (42%), Gaps = 31/354 (8%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASY--YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS 154
D G + ++PC C C AS+ + RD + P SS+ + + C C G
Sbjct: 58 DTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRD-PRFKPENSSSYQKIGCRSSDCITGL- 115
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI-IGCGMKQSGG 213
C + C Y Y E ++S G+L +D+L D + +Q+ ++ GC +SG
Sbjct: 116 CDSNSHQCKYER-MYAEMSTSKGVLGKDLL------DFGPASRLQSQLLSFGCETAESGD 168
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
VA DG++GLG G +S+ L G I +SFS+C+ D G
Sbjct: 169 LYLQVA-DGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMVLGAIPAPSGMV 227
Query: 274 FLASNGKYITYI-IGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAA 326
F S+ + Y + + + + LK S F I+DSG+++ +LP +E
Sbjct: 228 FAKSDPRRSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSGTTYAYLPDRAFEAFTD 287
Query: 327 EFDRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVF 378
Q+ ++ + +G YP CY + +L P V +F +N + +
Sbjct: 288 AVVAQLG-SLQAVDGPDPNYP-DICYAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENY 345
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ T+V +CL +G + V +DR N ++G+ +NC +L
Sbjct: 346 LFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVTYDRYNHQIGFLKTNCTEL 399
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 113/273 (41%), Gaps = 28/273 (10%)
Query: 92 TMSLGN---------DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
MS+GN D G DL W+ CD CV C+ + Y N+ P
Sbjct: 61 AMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTK---NKLVPCVDQMCA 117
Query: 141 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--V 198
L H C +PKQ C Y + Y + SS G+LV D L L NS V
Sbjct: 118 AL---HGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL------RLANSSIV 167
Query: 199 QASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
+ + GCG Q G V A DG++GLG G +S+ S L + G+ +N C G
Sbjct: 168 RPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGG 227
Query: 258 RIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
+FFGD P ++ + + +A + Y G G L + + DSGSSFT+
Sbjct: 228 FLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYF 287
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 349
+ Y+ + ++ + + C+K
Sbjct: 288 SAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWK 320
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 152/368 (41%), Gaps = 51/368 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K D G DL W+ CD C C Y + N P ++S + +S
Sbjct: 65 KAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLY----KPKNNLVPCSNSLCQAVSTGENY 120
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVIIGCG 207
C P C Y ++Y + SS G+L+ D L +S G +Q + GCG
Sbjct: 121 -----HCDAPDDQCDYEIEY-ADLGSSIGVLLSDSFPLRLSNG-----TLLQPKMAFGCG 169
Query: 208 MKQSGGYLDGVAPD---GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 264
Q +L P G++GLG G++S+ S L G+ +N CF + G +FFGD
Sbjct: 170 YDQK--HLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDH 227
Query: 265 -GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYET 323
P+++ + + + + Y G G + I DSGSS+T+ +VY++
Sbjct: 228 LFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQS 287
Query: 324 IAAEFDRQVNDTITSFEGYPWK--------CCYKSSSQRLPKLPSVKLMF-PQNNSFVVN 374
I +N G P K C+K +++ + + +K F P SF+
Sbjct: 288 I-------LNLVRKDLAGKPLKDAPEKELAVCWK-TAKPIKSILDIKSYFKPLTISFMNA 339
Query: 375 NPVFVIYGTQ---VVT---GFCLAI----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
V + + ++T CL I + G+ IG FM V++D E ++GW
Sbjct: 340 KNVQLQLAPEDYLIITKDGNVCLGILNGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGW 399
Query: 425 SHSNCQDL 432
+NC L
Sbjct: 400 FPANCDRL 407
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 145/365 (39%), Gaps = 57/365 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---T 153
D G DL+W C CV C D+ L + S SST+ L C C L T
Sbjct: 53 DTGSDLIWTQCKPCVSC----------FDQPLPYFDTSRSSTNALLPCESTQCKLDPTVT 102
Query: 154 SC---QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C Q C Y Y +N+ + GLL D ++G + V GCG+
Sbjct: 103 VCVKLNQTVQTCAYYTSY-GDNSVTIGLLAADKFTFVAG-------TSLPGVTFGCGLNN 154
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGRI 259
+G + G+ G G G +S+PS L K G +FS CF D +
Sbjct: 155 TGVFNSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIPSTVLLDLPADL 207
Query: 260 FFGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK-------AIVDS 309
F QG T + + Y + ++ +GS+ L +++F I+DS
Sbjct: 208 FSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDS 267
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QN 368
G+S T LP +VY+ + EF Q+ + C+ + SQ P +P + L F
Sbjct: 268 GTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGAT 327
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLKLGWSHS 427
N VF + + CLAI GD TI NF V++D +N L + +
Sbjct: 328 MDLPRENYVFEVPDDAGNSIICLAIN--KGDETTIIGNFQQQNMHVLYDLQNNMLSFVAA 385
Query: 428 NCQDL 432
C L
Sbjct: 386 QCDKL 390
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 144/355 (40%), Gaps = 47/355 (13%)
Query: 100 GCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL----GT 153
G DL W+ CD CVRC Y + + C +C G
Sbjct: 87 GSDLSWLQCDAPCVRCTKAXHXLYRP--------------NNNLVICKDPMCAXLHPPGY 132
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C++P+Q C Y ++Y + SS G+LV+D+ L N L+ + + +GCG Q G
Sbjct: 133 KCEHPEQ-CDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGLR--LAPRLALGCGYDQIPG 186
Query: 214 YLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQST 272
P DG++GLG G+ S+ S L G+IRN C G +FFGD + +
Sbjct: 187 --XSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYDSSRVV 244
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV---DSGSSFTFLPKEVYETIAAEFD 329
++ Y G +G K T FK ++ DSGSS+T+L Y+ +
Sbjct: 245 WTPMLRDQHTHYSSGYAELILGG---KTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVR 301
Query: 330 RQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV--------VNNPV-- 377
+++++ + + C++ K P SF + P+
Sbjct: 302 KELSEKPVREALDDQTLPLCWRGKRPFKSVRDVRKFFKPLALSFAGGGRTKTQYDIPLES 361
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
++I V G + D IG M VV+D E ++GW+ +NC L
Sbjct: 362 YLIISGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRL 416
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 141/353 (39%), Gaps = 35/353 (9%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 155
D G DL W+ CD C RC+ Y R N+ P +H C+ C
Sbjct: 97 DTGSDLTWLQCDAPCSRCSQTPHPLY----RPSNDLVPC-----RHALCASLHLSDNYDC 147
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV--IIGCGMKQSGG 213
+ P Q C Y + Y ++ SS G+L+ D+ L N VQ V +GCG Q
Sbjct: 148 EVPHQ-CDYEVQY-ADHYSSLGVLLHDVYTL------NFTNGVQLKVRMALGCGYDQIFP 199
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
DG++GLG G+ S+ S L GL+RN C G IFFGD + + + +
Sbjct: 200 DPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFFGDVYDSFRLTWT 259
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFD---- 329
++S + G G + A+ D+GSS+T+ Y+ + +
Sbjct: 260 PMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSYTYFNSYAYQVLISWLKKESG 319
Query: 330 ----RQVNDTIT---SFEG-YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
++ +D T + G P++ Y+ P + S F + ++I
Sbjct: 320 GKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEMLPEAYLIV 379
Query: 382 GT--QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
V G + GD+ IG M +VFD + +GW+ ++C +
Sbjct: 380 SNMGNVCLGILNGSEVGMGDLNLIGDISMLNKVMVFDNDKQLIGWAPADCDQV 432
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 141/382 (36%), Gaps = 49/382 (12%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G F +F + L D G DL WI CD C CA Y +L S
Sbjct: 99 GLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSL 158
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
C +L T + C Y ++Y +++SS G+L D LHL+ + K
Sbjct: 159 --------CVEVQRNLKTGYCETCEQCDYEIEY-ADHSSSMGVLASDDLHLMLANGSLTK 209
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 254
++ GC Q G L+ +A DG++GL ++S+PS LA +I N C D
Sbjct: 210 ----LGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSD 265
Query: 255 DS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKAIV 307
+ G +F GD N Y + GS L + + +
Sbjct: 266 ATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVVF 325
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-----WKCCYKSSSQRLPK----- 357
D+GSS+T+ PKE Y + A ++ + P W+ + S K
Sbjct: 326 DTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQP 385
Query: 358 ----------LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF 407
+ S K P +++N V G DG +G
Sbjct: 386 LTLQFRSKWWIVSTKFRIPPEGYLIISNK------GNVCLGILDGSNVHDGSTIILGDIS 439
Query: 408 MTGYRVVFDRENLKLGWSHSNC 429
+ G VV+D N K+GW+ S C
Sbjct: 440 LRGKLVVYDNVNQKIGWAQSTC 461
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 153/373 (41%), Gaps = 58/373 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL------ 151
D G DL+W+ C P R + S S+T + CS C L
Sbjct: 72 DTGSDLIWLQCSTTAAPPAFCPKKACSRRP--AFVASKSATLSVVPCSAAQCLLVPAPRG 129
Query: 152 -GTSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISG--GDNALKNSVQASVIIGCG 207
G SC PC Y DY + +S++G L D + +G G A++ V GCG
Sbjct: 130 HGPSCSPAAPVPCGYAYDY-ADGSSTTGFLARDTATISNGTSGGAAVRG-----VAFGCG 183
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG-LIRNSFSMCFDKDDSGR-------I 259
+ GG G G+IGLG G++S P A++G L +FS C + GR +
Sbjct: 184 TRNQGGSFSGTG--GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFL 238
Query: 260 FFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK----------AIVD 308
F G + + L SN T Y +GV +G+ L + ++D
Sbjct: 239 FLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVID 298
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVN-----DTITSFEGYPWKCCYK--SSSQRLPK---L 358
SGS+ T+L Y + + F V+ + T F+G + CY SSS P
Sbjct: 299 SGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG--LELCYNVSSSSSLAPANGGF 356
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--TIGQNFMTGYRVVFD 416
P + + F Q S + +++ V CLAI+P +G GY V FD
Sbjct: 357 PRLTIDFAQGLSLELPTGNYLVDVADDVK--CLAIRPTLSPFAFNVLGNLMQQGYHVEFD 414
Query: 417 RENLKLGWSHSNC 429
R + ++G++ + C
Sbjct: 415 RASARIGFARTEC 427
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 154/373 (41%), Gaps = 63/373 (16%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+T S+ D G + +IPC DC C +A +++ P S+T+K L+C L
Sbjct: 23 ERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFD----------PDKSTTAKKLACGDPL 72
Query: 149 CDLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
C+ GT SC C Y+ Y E +SS G ++ED D+ ++ ++ GC
Sbjct: 73 CNCGTPSCTCNNDRCYYSRT-YAERSSSEGWMIEDTFGF-PDSDSPVR------LVFGCE 124
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPA 267
++G +A DG++G+G + S L + +I + FS+CF G + GD
Sbjct: 125 NGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLGDVTLP 183
Query: 268 TQQSTSF--LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKE 319
+T + L ++ Y + ++ + L + ++DSG++FT+LP +
Sbjct: 184 EGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFTYLPTD 243
Query: 320 VYETIAAEF---------------DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV--- 361
++ +A D Q ND ++G P + +K + P V
Sbjct: 244 AFKAMAKAVGDYVEKKGLQSTPGADPQYNDIC--WKGAPDQ--FKDLDKYFPPAEFVFGG 299
Query: 362 --KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
KL P ++ P +CL I +G + V +DR N
Sbjct: 300 GAKLTLPPLRYLFLSKPA----------EYCLGIFDNGNSGALVGGVSVRDVVVTYDRRN 349
Query: 420 LKLGWSHSNCQDL 432
K+G++ C D+
Sbjct: 350 SKVGFTTMACADV 362
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 92/396 (23%), Positives = 172/396 (43%), Gaps = 56/396 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C ++ P +SST + + C+ + +C
Sbjct: 102 DTGSTVTYVPCSTCEQCG----------RHQDPKFQPESSSTYQPVKCT-----IDCNCD 146
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ + C Y Y E ++SSG+L ED LIS G+ + +A + GC ++G
Sbjct: 147 SDRMQCVYERQY-AEMSTSSGVLGED---LISFGNQSELAPQRA--VFGCENVETGDLYS 200
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++S+ L +I +SFS+C+ D G + G P + + ++
Sbjct: 201 QHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLGGISPPSDMAFAY 259
Query: 275 LASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVY----ETI 324
+ + Y I ++ + L + ++DSG+++ +LP+ + + I
Sbjct: 260 -SDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAI 318
Query: 325 AAEF---------DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
E D ND S G SQ P V ++F + ++
Sbjct: 319 VKELQSLKKISGPDPNYNDICFSGAGI-------DVSQLSKSFPVVDMVFENGQKYTLSP 371
Query: 376 PVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
++ ++V +CL + D T +G + VV+DRE K+G+ +NC +L +
Sbjct: 372 ENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNCAELWE 431
Query: 435 GTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVA 470
+ + P P P++ + + E P +V P+V+
Sbjct: 432 RLQISVAPPPLPPNSGVRNSSEALEP---SVAPSVS 464
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 148/374 (39%), Gaps = 64/374 (17%)
Query: 95 LGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
L D G +L WI CD C CA + Y +L +S+ + L
Sbjct: 47 LDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNL-------VRSSEAFCVEVQRNQLT 99
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--VQASVIIGCGMKQ 210
C+N Q C Y ++Y +++ S G+L +D HL L N ++ ++ GCG Q
Sbjct: 100 EHCENCHQ-CDYEIEY-ADHSYSMGVLTKDKFHL------KLHNGSLAESDIVFGCGYDQ 151
Query: 211 SGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFG-DQGP 266
G L+ + DG++GL +IS+PS LA G+I N C D + G IF G D P
Sbjct: 152 QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVP 211
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF-----KAIVDSGSSFTFLPKEVY 321
+ + + + + Y + V G L K + D+GSS+T+ P + Y
Sbjct: 212 SHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAY 271
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN---NPVF 378
+ +T S + LP K FP ++ V P+
Sbjct: 272 SQLVTSLQEVSGLELTR----------DDSDETLPICWRAKTNFPFSSLSDVKKFFRPIT 321
Query: 379 VIYGTQ-VVTGFCLAIQPV----------------------DGDIGTIGQNFMTGYRVVF 415
+ G++ ++ L IQP DG +G M G+ +V+
Sbjct: 322 LQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVY 381
Query: 416 DRENLKLGWSHSNC 429
D ++GW S+C
Sbjct: 382 DNVKRRIGWMKSDC 395
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 159/353 (45%), Gaps = 43/353 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR-LCDL-GTS 154
D G + ++PC C +C ++ P +SST K + C+ +CD G
Sbjct: 101 DTGSTVTYVPCSTCEQCG----------RHQDPKFDPESSSTYKPIKCNIDCICDSDGVQ 150
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C +Q Y E ++SSG+L ED+ IS G+ + + + GC ++G
Sbjct: 151 CVYERQ--------YAEMSTSSGVLGEDV---ISFGNQS--ELIPQRAVFGCENMETGDL 197
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQST 272
A DG++GLG G++S+ L + G I +SFS+C+ D G + G P +
Sbjct: 198 FSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMIF 256
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVY----E 322
++ + + Y + ++ + L +S + A++DSG+++ +LP E + +
Sbjct: 257 TY-SDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKD 315
Query: 323 TIAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
I E ++++ +F+ + +++ K P+V ++F + +
Sbjct: 316 AIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSLTPENYFF 375
Query: 381 YGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
++V +CL I D T +G + V++DR N K+G+ +NC +L
Sbjct: 376 RHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNCSEL 428
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 159/353 (45%), Gaps = 43/353 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR-LCDL-GTS 154
D G + ++PC C +C ++ P +SST K + C+ +CD G
Sbjct: 101 DTGSTVTYVPCSTCEQCG----------RHQDPKFDPESSSTYKPIKCNIDCICDSDGVQ 150
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C +Q Y E ++SSG+L ED+ IS G+ + + + GC ++G
Sbjct: 151 CVYERQ--------YAEMSTSSGVLGEDV---ISFGNQS--ELIPQRAVFGCENMETGDL 197
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQST 272
A DG++GLG G++S+ L + G I +SFS+C+ D G + G P +
Sbjct: 198 FSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMIF 256
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVY----E 322
++ + + Y + ++ + L +S + A++DSG+++ +LP E + +
Sbjct: 257 TY-SDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKD 315
Query: 323 TIAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
I E ++++ +F+ + +++ K P+V ++F + +
Sbjct: 316 AIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSLTPENYFF 375
Query: 381 YGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
++V +CL I D T +G + V++DR N K+G+ +NC +L
Sbjct: 376 RHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNCSEL 428
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 155/364 (42%), Gaps = 57/364 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
SK + D G D+LW+ C C +C S L L Y P++S ++ +SC
Sbjct: 37 SKDYYVQVDTGSDILWVNCIGCDKCPTKS-----DLGIKLTLYDPASSVSATRVSCDDDF 91
Query: 149 CDLGTSCQN-------PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
C TS N + PC Y + Y + +S++G V D + N +
Sbjct: 92 C---TSTYNGLLPDCKKELPCQYNV-VYGDGSSTAGYFVSDAVQFERVTGNLQTGLSNGT 147
Query: 202 VIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
V GCG +QSGG G A DG++G +F+ C D + G IF
Sbjct: 148 VTFGCGAQQSGGLGTSGEALDGILG--------------------AFAHCLDNVNGGGIF 187
Query: 261 -FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--FKA------IVDSGS 311
G+ +T + + Y Y+ +E +G + L+ + F + I+DSG+
Sbjct: 188 AIGELVSPKVNTTPMVPNQAHYNVYMKEIE---VGGTVLELPTDVFDSGDRRGTIIDSGT 244
Query: 312 SFTFLPKEVYETIAAEF-DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+ +LP+ VY+++ E +Q ++ + E C+K S P +K F + +
Sbjct: 245 TLAYLPEVVYDSMMNEIRSQQPGLSLHTVE--EQFICFKYSGNVDDGFPDIKFHFKDSLT 302
Query: 371 FVVNNPVFVIYGTQVVTGFCL---AIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSH 426
V ++ ++ + F +Q DG D+ +G ++ V++D EN +GW+
Sbjct: 303 LTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTE 362
Query: 427 SNCQ 430
NC+
Sbjct: 363 YNCK 366
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 89/385 (23%), Positives = 155/385 (40%), Gaps = 45/385 (11%)
Query: 91 KTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ +L D G + ++PC DC C + P SST + C
Sbjct: 99 QEFALIVDTGSTVTYVPCSDCEHCG----------KHQDPRFQPDESSTYHPVKC----- 143
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
++ +C + C Y Y E +SSSG+L EDI IS G+ + V + GC
Sbjct: 144 NMDCNCDHDGVNCVYERRY-AEMSSSSGVLGEDI---ISFGNQS--EVVPQRAVFGCENV 197
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFFGDQGPA 267
++G A DG++GLG G++S+ L +I +SFS+C+ G + G P
Sbjct: 198 ETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVLGGIPPP 256
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVY 321
S + + Y I ++ + LK ++DSG+++ +LP+E +
Sbjct: 257 PDMVFS-RSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAYLPEEAF 315
Query: 322 ETI------AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
+ +Q++ ++ + + SQ P V ++F +
Sbjct: 316 VAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQKLSLTP 375
Query: 376 PVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL--- 432
++ T+V +CL I +G + V +DREN K+G+ +NC +L
Sbjct: 376 ENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWKTNCSELWKR 435
Query: 433 ----NDGTKSPLTPGPGTPSNPLPA 453
+P+ P P + S P P
Sbjct: 436 LHIPGAPAAAPIVPTPKSVSAPAPV 460
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 149/369 (40%), Gaps = 51/369 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C A +Y P+ ++ L CSH L
Sbjct: 78 KLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK----------QYKPNHNT----LPCSHIL 123
Query: 149 C---DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQAS 201
C DL C +P+ C Y + Y +++ SS G LV D L L +G L+
Sbjct: 124 CSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKLANGSIMNLR------ 176
Query: 202 VIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
+ GCG +Q+ G G++GLG G++ + + L G+ +N C G +
Sbjct: 177 LTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLS 236
Query: 261 FGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 319
GD+ P++ + + LA+N Y+ G + DSGSS+T+ E
Sbjct: 237 IGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAE 296
Query: 320 VYETIAAEFDRQVNDT--ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP--------QNN 369
Y+ I + +N + + C+K + L L VK F Q N
Sbjct: 297 AYQAILDLIRKDLNGKPLTDTKDDKSLPVCWK-GKKPLKSLDEVKKYFKTITLRFGNQKN 355
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFM-----TGYRVVFDRENLKLG 423
+ P CL I ++G +IG G N + G V++D E ++G
Sbjct: 356 GQLFQVPPESYLIITEKGRVCLGI--LNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIG 413
Query: 424 WSHSNCQDL 432
W S+C L
Sbjct: 414 WISSDCDKL 422
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 149/369 (40%), Gaps = 51/369 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C A +Y P+ ++ L CSH L
Sbjct: 78 KLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK----------QYKPNHNT----LPCSHIL 123
Query: 149 C---DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQAS 201
C DL C +P+ C Y + Y +++ SS G LV D L L +G L+
Sbjct: 124 CSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKLANGSIMNLR------ 176
Query: 202 VIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
+ GCG +Q+ G G++GLG G++ + + L G+ +N C G +
Sbjct: 177 LTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLS 236
Query: 261 FGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 319
GD+ P++ + + LA+N Y+ G + DSGSS+T+ E
Sbjct: 237 IGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAE 296
Query: 320 VYETIAAEFDRQVNDT--ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP--------QNN 369
Y+ I + +N + + C+K + L L VK F Q N
Sbjct: 297 AYQAILDLIRKDLNGKPLTDTKDDKSLPVCWK-GKKPLKSLDEVKKYFKTITLRFGNQKN 355
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFM-----TGYRVVFDRENLKLG 423
+ P CL I ++G +IG G N + G V++D E ++G
Sbjct: 356 GQLFQVPPESYLIITEKGRVCLGI--LNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIG 413
Query: 424 WSHSNCQDL 432
W S+C L
Sbjct: 414 WISSDCDKL 422
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 146/370 (39%), Gaps = 46/370 (12%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 155
D G D WI CD C C Y + + + L + C+ +C
Sbjct: 34 DTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVH---PRDPLCEELQGNQNYCE---TC 87
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + +SS G+L D + L + D +KN + GC Q G L
Sbjct: 88 KQ----CDYEITY-ADRSSSKGVLARDNMQLTTA-DGEMKN---VDFVFGCAHNQQGKLL 138
Query: 216 DG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQST 272
D + DG++GL G IS+ + LA +G+I N F C D S G +F GD T
Sbjct: 139 DSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGYMFLGDDYVPRWGMT 198
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLKQTS-----FKAIVDSGSSFTFLPKEVYETIAA- 326
NG Y V G+ L + I DSGSS+T+ P E+Y + A
Sbjct: 199 WVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFDSGSSYTYFPHEIYTNLIAL 258
Query: 327 ------EFDRQVNDTITSF---EGYPWKCCYKSSSQRLPKLPSV-KLMFPQNNSFVVNNP 376
F R +D F P + P + + K F +F ++
Sbjct: 259 LEDASPGFVRDESDQTLPFCMKPNVPVRSVGDVEQLFNPLILQLRKRWFVIPTTFAISPE 318
Query: 377 VFVIYGTQVVTGFCLAIQPVDG-DIG-----TIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
++I + CL + +DG +IG IG + G VV+D + ++GW S+C
Sbjct: 319 NYLIISDK--GNVCLGV--LDGTEIGHSSTIIIGDASLRGKFVVYDNDENRIGWVQSDCT 374
Query: 431 DLNDGTKSPL 440
++ P
Sbjct: 375 RPQKQSRVPF 384
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 141/355 (39%), Gaps = 43/355 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---T 153
D G DL+W C C C S YY++ S SST SC C L T
Sbjct: 109 DTGSDLVWTQCQPCAVCFNQSLPYYDA----------SRSSTFALPSCDSTQCKLDPSVT 158
Query: 154 SCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C N Q C ++ Y + +++ G L + + ++G V+ GCG+ +G
Sbjct: 159 MCVNQTVQTCAFSYSY-GDKSATIGFLDVETVSFVAGAS-------VPGVVFGCGLNNTG 210
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-----FGDQGPA 267
+ G+ G G G +S+PS L K G + F+ + S +F G
Sbjct: 211 IFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRG 267
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFKAIVDSGSSFTFLPK 318
T Q+T + + Y + ++ +GS+ LK + I+DSG++FT LP
Sbjct: 268 TVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPP 327
Query: 319 EVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
VY + EF V + S E P C + P +P + L F +
Sbjct: 328 RVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGATMHLPRENY 387
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
CLAI ++G++ IG V++D +N KL + + C L
Sbjct: 388 VFEAKDGGNCSICLAI--IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 95/373 (25%), Positives = 145/373 (38%), Gaps = 59/373 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+K S+ D G DL+WI C C C +N D + P SS+ +SC L
Sbjct: 50 AKVFSVIADTGSDLIWIQCKPCQAC-------FNQKDP---IFDPEGSSSYTTMSCGDTL 99
Query: 149 CDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLIS--GGDNALKNSVQASVI 203
CD P++ C DY Y + + + G L + + L S G A KN +
Sbjct: 100 CD-----SLPRKSCSPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKN-----IA 149
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGR 258
GCG G + D GL+GLG G +S S L L + FS C +
Sbjct: 150 FGCGHLNRGSFNDA---SGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSP 204
Query: 259 IFFGDQGPATQQS-------TSFLASNGKYITYIIGVETCCIGSSCLK----------QT 301
+FFGD+ + T + + Y + ++ I L+
Sbjct: 205 MFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDG 264
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL---PKL 358
S I DSG++ T LP Y+ + +V+ CY S + K+
Sbjct: 265 SGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKI 324
Query: 359 PSVKLMFP-QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDR 417
P++ F ++ V N + I T CLA+ + DIG G +RV++D
Sbjct: 325 PAMVFHFEGADHQLPVEN--YFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDI 382
Query: 418 ENLKLGWSHSNCQ 430
+ K+GW+ S C
Sbjct: 383 GSSKIGWAPSQCD 395
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 144/362 (39%), Gaps = 52/362 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-------- 149
D G DL W+ CD P + +L +D Y P+ + K CS +C
Sbjct: 80 DTGSDLTWVQCD----GPDAPCKGCTLPKD-KLYKPNGNQLVK---CSDPICAAVQPPFS 131
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNSVQASVIIGCGM 208
G C P PC Y ++Y +N S+G L D +H+ S G N V+ GCG
Sbjct: 132 TFGQKCAKPIPPCVYKVEY-ADNAESTGALARDYMHIGSPSGSNV------PLVVFGCGY 184
Query: 209 KQS-GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ--- 264
+Q G + G++GLG G+IS+ S L G I N C + G +F GD+
Sbjct: 185 EQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSAEGGGYLFLGDKFIP 244
Query: 265 ------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
P Q S S G + G T G + I DSGSS+T+
Sbjct: 245 SSGIFWTPIIQSSLEKHYSTGPVDLFFNGKPTPAKG--------LQIIFDSGSSYTYFSP 296
Query: 319 EVYETIAAEFDRQVNDTITSFEGYP------WKCC--YKSSSQRLPKLPSVKLMFPQNNS 370
VY +A + + E WK +KS ++ + L F ++ +
Sbjct: 297 RVYTIVANMVNNDLKGKPLRRETKDPSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKN 356
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
P V +G V G + G+ +G + VV+D E ++GW+ +NC+
Sbjct: 357 LQFQLPP-VKFG-NVCLGILNGNEAGLGNRNVVGDISLQDKVVVYDNEKQQIGWASANCK 414
Query: 431 DL 432
+
Sbjct: 415 QI 416
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 143/367 (38%), Gaps = 47/367 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C A +Y P+ ++ L CSH L
Sbjct: 79 KLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK----------QYKPNHNT----LPCSHLL 124
Query: 149 C---DLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS--VQAS 201
C DL + C +P+ C Y + Y +++ SS G LV D L L N +
Sbjct: 125 CSGLDLTQNRPCDDPEDQCDYEIGY-SDHASSIGALVTDEFPL------KLANGSIMNPH 177
Query: 202 VIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
+ GCG +Q+ G G++GLG G++ + + L G+ +N C G +
Sbjct: 178 LTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTGKGFLS 237
Query: 261 FGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 319
GD+ P++ + + LA+N Y+ G + DSGSS+T+ E
Sbjct: 238 IGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAE 297
Query: 320 VYETIAAEFDRQVNDT--ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP--------QNN 369
Y+ I + +N + + C+K + L L VK F Q N
Sbjct: 298 AYQAILDLIRKDLNGKPLTDTKDDKSLPVCWK-GKKPLKSLDEVKKYFKTITLRFGYQKN 356
Query: 370 SFVVNNP----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ P + + V G + +G G V++D E ++GW
Sbjct: 357 GQLFQVPPESYLIITEKGNVCLGILNGTEVGLDSYNIVGDISFQGIMVIYDNEKQRIGWI 416
Query: 426 HSNCQDL 432
S+C +
Sbjct: 417 SSDCDKI 423
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 171/401 (42%), Gaps = 64/401 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC+ C +C N D ++ P S T + C+ +C
Sbjct: 14 DTGSTVTYVPCNSCDQCG-------NHQDP---KFQPDLSDTYHPVKCNPD-----CTCD 58
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y Y E +SSSG+L ED L+S G+ + +A + GC ++G
Sbjct: 59 TENDQCTYERQY-AEMSSSSGILGED---LVSFGNMSELKPQRA--VFGCENAETGDLFS 112
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++S+ L + G+I +SFS+C+ + G + G P + S
Sbjct: 113 QHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171
Query: 275 LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVY----ETI 324
+ + Y I + + L I+DSG+++ +LP+ + + I
Sbjct: 172 -SDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAFLPFIQAI 230
Query: 325 AAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVF 378
+E +Q+ ++ C+ + +P+L PSV ++F + ++ +
Sbjct: 231 TSELHGLKQIRGPDPNYN----DVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPENY 286
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNC----QDLN 433
+ ++V +CL + D T +G + V +DRE+ K+G+ +NC + LN
Sbjct: 287 LFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCSVLWERLN 346
Query: 434 DGTKSP-------------LTPGPGTPSNPLPANQEQSSPG 461
+ SP ++P P T +P P E S G
Sbjct: 347 ASSISPAPAPLGGEVAATDMSPAPATDMSPAPLGGEISDTG 387
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 171/401 (42%), Gaps = 64/401 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC+ C +C N D ++ P S T + C+ +C
Sbjct: 14 DTGSTVTYVPCNSCDQCG-------NHQDP---KFQPDLSDTYHPVKCNPD-----CTCD 58
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y Y E +SSSG+L ED L+S G+ + +A + GC ++G
Sbjct: 59 TENDQCTYERQY-AEMSSSSGILGED---LVSFGNMSELKPQRA--VFGCENAETGDLFS 112
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++S+ L + G+I +SFS+C+ + G + G P + S
Sbjct: 113 QHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171
Query: 275 LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVY----ETI 324
+ + Y I + + L I+DSG+++ +LP+ + + I
Sbjct: 172 -SDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAFLPFIQAI 230
Query: 325 AAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVF 378
+E +Q+ ++ C+ + +P+L PSV ++F + ++ +
Sbjct: 231 TSELHGLKQIRGPDPNYN----DVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPENY 286
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNC----QDLN 433
+ ++V +CL + D T +G + V +DRE+ K+G+ +NC + LN
Sbjct: 287 LFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCSVLWERLN 346
Query: 434 DGTKSP-------------LTPGPGTPSNPLPANQEQSSPG 461
+ SP ++P P T +P P E S G
Sbjct: 347 ASSISPAPAPLGGEVAATDMSPAPATDMSPAPLGGEISDTG 387
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 156/367 (42%), Gaps = 50/367 (13%)
Query: 95 LGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
L D G DL W+ CD C C + Y ++ + S + + D
Sbjct: 214 LDIDTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVSFKDSLCMEVQR----NYDGDQC 269
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVIIGCGMKQS 211
+CQ C Y + Y + +SS G+LV+D L S G + + + I GC Q
Sbjct: 270 AACQQ----CNYEVQY-ADQSSSLGVLVKDEFTLRFSNG-----SLTKLNAIFGCAYDQQ 319
Query: 212 GGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPAT 268
G L+ ++ DG++GL ++S+PS LA G+I N C D + G +F GD
Sbjct: 320 GLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTGDPAGGGYLFLGDDF-VP 378
Query: 269 QQSTSFLA-----SNGKYITYIIGVETCCIGSSCLKQTSFK--AIVDSGSSFTFLPKEVY 321
Q +++A S Y T ++ ++ I S S + + DSGSS+T+ KE Y
Sbjct: 379 QWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQVVFDSGSSYTYFTKEAY 438
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-PQNNSFVVNNPVFVI 380
+ A + +V+ + C+K + Q + + VK F P F F +
Sbjct: 439 YQLVANLE-EVSAFGLILQDSSDTICWK-TEQSIRSVKDVKHFFKPLTLQF---GSRFWL 493
Query: 381 YGTQVVT------------GFCLAI----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
T++V CL I Q DG +G N + G VV+D N ++GW
Sbjct: 494 VSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTIILGDNALRGKLVVYDNVNQRIGW 553
Query: 425 SHSNCQD 431
+ S+C +
Sbjct: 554 TSSDCHN 560
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 148/377 (39%), Gaps = 54/377 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCA----PLSASYYNSLDRDLNEYSPSASSTSKHLS 143
SK L D G +L WI CD C+ CA PL SL + + + S H
Sbjct: 89 SKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSLVPSKDPLCAAVQAGSGHYH 148
Query: 144 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
+H+ Q C Y + Y ++ S G LV D + + K + A+ +
Sbjct: 149 -NHK---------EASQRCDYDVAY-ADHGYSEGFLVRDSVRALLTN----KTVLTANSV 193
Query: 204 IGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIF 260
GCG Q + DG++GLG G S+PS AK GLI+N C D G +F
Sbjct: 194 FGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGYMF 253
Query: 261 FGDQGPATQQSTSF-LASNGKYITYIIGVETCCIGSSCLKQTSFKA-----IVDSGSSFT 314
FGD +T T + Y +G G+ L + I DSGS++T
Sbjct: 254 FGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKLGGIIFDSGSTYT 313
Query: 315 FLPKEVYETIAAEFDRQVN------DTITSFEGYPW--KCCYKSSSQRLPKLPSVKLMFP 366
+ + Y + ++ D+ SF W K ++S ++ + L F
Sbjct: 314 YFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFR 373
Query: 367 QNNS----------FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
+ VVN V G T + V GDI GQ VV+D
Sbjct: 374 STKTKQMEIFPEGYLVVNKKGNVCLGILNGTAIGIVDTNVLGDISFQGQ------LVVYD 427
Query: 417 RENLKLGWSHSNCQDLN 433
E ++GW+ S+CQ+++
Sbjct: 428 NEKNQIGWARSDCQEIS 444
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 154/388 (39%), Gaps = 60/388 (15%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G F + + L D DL WI CD C CA + + Y R N +P
Sbjct: 206 GLYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKP--RRDNIVTPKD 263
Query: 136 S-STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 194
S H + C+ +CQ C Y ++Y +++SS G+L D LHL A
Sbjct: 264 SLCVELHRNQKAGYCE---TCQQ----CDYEIEY-ADHSSSMGVLARDELHLTM----AN 311
Query: 195 KNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 253
+S GC Q G L+ V DG++GL ++S+PS LA G+I N C
Sbjct: 312 GSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLAN 371
Query: 254 D--DSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKA 305
D G +F GD P S + + +Y + GS L ++ +
Sbjct: 372 DVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRI 431
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQ-----VNDTITSFEGYPWKCCYKSSS-----QRL 355
+ DSGSS+T+ KE Y + A + + DT + W+ + S Q
Sbjct: 432 VFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFCWRAKFPIRSVIDVKQYF 491
Query: 356 PKLP----------SVKLMFPQNNSFVVNNP----VFVIYGTQVVTGFCLAIQPVDGDIG 401
L S K P +++N + ++ G+ V G + + GDI
Sbjct: 492 KTLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGSDVHDGSSIIL----GDIS 547
Query: 402 TIGQNFMTGYRVVFDRENLKLGWSHSNC 429
GQ +++D N K+GW+ S+C
Sbjct: 548 LRGQ------LIIYDNVNNKIGWTQSDC 569
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/393 (24%), Positives = 166/393 (42%), Gaps = 48/393 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C C ++ P S T + + C+ C+ C
Sbjct: 107 DTGSTVTYVPCSTCEHCG----------RHQDPKFQPDLSETYQPVKCTPD-CN----CD 151
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y Y E +SSSG+L ED+ +S G+ L + GC ++G
Sbjct: 152 GDTNQCMYDRQY-AEMSSSSGVLGEDV---VSFGN--LSELAPQRAVFGCENDETGDLYS 205
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTS 273
A DG++GLG G++S+ L +I +SFS+C+ D G I G P T
Sbjct: 206 QRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGISPPEDMVFTH 264
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVY----ET 323
Y Y I ++ + L+ ++DSG+++ +LP+ +
Sbjct: 265 SDPDRSPY--YNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLPETAFLAFKRA 322
Query: 324 IAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
I E + +Q+N +++ + SQ P V ++F + ++ ++
Sbjct: 323 IMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLSLSPENYLFR 382
Query: 382 GTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
++V +CL + D T +G F+ V++DREN K+G+ +NC +L + +
Sbjct: 383 HSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNCSELWETLHTSD 442
Query: 441 TPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRA 473
P +PLP+N E ++ A P+VA A
Sbjct: 443 AP------SPLPSNSEVTNL-TKAFAPSVAPSA 468
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 154/361 (42%), Gaps = 48/361 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LWI +C+ C+ + + + L +L+ + + SST+ +SC +C
Sbjct: 101 DTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTAT 156
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQASVIIGCGMKQ 210
+ C + C YT Y + + ++G V D ++ + G + + NS +++I GC Q
Sbjct: 157 SECSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSVVANS-SSTIIFGCSTYQ 214
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFFGD---- 263
SG A DG+ G G G +SV S L+ G+ FS C ++ G + G+
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274
Query: 264 ---------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSF 313
P + +A NG+ + I S+ T+ + IVDSG++
Sbjct: 275 SIVYSPLVPSQPHYNLNLQSIAVNGQLLP---------IDSNVFATTNNQGTIVDSGTTL 325
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEG---YPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+L +E Y F + + ++ F CY S+ P V L F S
Sbjct: 326 AYLVQEAYN----PFVKAITAAVSQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFMGGAS 381
Query: 371 FVVNNPVFVI-YG-TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
V+N +++ YG +C+ Q V+ +G + V+D N ++GW+ +
Sbjct: 382 MVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLANQRIGWADYD 441
Query: 429 C 429
C
Sbjct: 442 C 442
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 147/350 (42%), Gaps = 25/350 (7%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--- 154
D G D+LW+ C P S+ L+ L ++P +SSTS + CS C
Sbjct: 107 DTGSDILWVACSPCTGCPTSSG----LNIQLEFFNPDSSSTSSRIPCSDDRCTAALQTGE 162
Query: 155 --CQN---PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
CQ+ P PC YT Y + + +SG V D ++ + N + ASV+ GC
Sbjct: 163 AVCQSSDSPSSPCGYTFTY-GDGSGTSGFYVSDTMYFDTVMGNEQTANSSASVVFGCSNS 221
Query: 210 QSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGP 266
QSG + A DG+ G G ++SV S L G+ +FS C D+G + G+
Sbjct: 222 QSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKGSDNGGGILVLGEIVE 281
Query: 267 ATQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVY 321
T + S Y + + + I SS ++ + IVDSG++ +L Y
Sbjct: 282 PGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNTQGTIVDSGTTLVYLVDGAY 341
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ V+ ++ S + C+ ++S P+ L F S V +++
Sbjct: 342 DPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSSVDSSFPTATLYFKGGVSMTVKPENYLLQ 400
Query: 382 GTQVVTG--FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V +C+ Q G I +G + V+D N+++GW+ +C
Sbjct: 401 QGSVDNNVLWCIGWQRSQG-ITILGDLVLKDKIFVYDLANMRMGWADYDC 449
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 154/361 (42%), Gaps = 48/361 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LWI +C+ C+ + + + L +L+ + + SST+ +SC+ +C
Sbjct: 101 DTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTAT 156
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQASVIIGCGMKQ 210
+ C + C YT Y + + ++G V D ++ + G + + NS ++++ GC Q
Sbjct: 157 SGCSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSMVANS-SSTIVFGCSTYQ 214
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFFGD---- 263
SG A DG+ G G G +SV S L+ G+ FS C ++ G + G+
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274
Query: 264 ---------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSF 313
P + +A NG+ + I S+ T+ + IVDSG++
Sbjct: 275 SIVYSPLVPSLPHYNLNLQSIAVNGQLLP---------IDSNVFATTNNQGTIVDSGTTL 325
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEG---YPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+L +E Y F + ++ F CY S+ P V L F S
Sbjct: 326 AYLVQEAYN----PFVDAITAAVSQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFMGGAS 381
Query: 371 FVVNNPVFVI-YG-TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
V+N +++ YG +C+ Q V+ +G + V+D N ++GW+ N
Sbjct: 382 MVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDLANQRIGWADYN 441
Query: 429 C 429
C
Sbjct: 442 C 442
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/348 (23%), Positives = 142/348 (40%), Gaps = 22/348 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C P ++ L LN + P +S T+ +SCS + C G
Sbjct: 99 DTGSDVLWVSCASCNGCPQTSG----LQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSSD 154
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+ C C YT Y + + +SG V D+L ++L + A V+ GC Q+G
Sbjct: 155 SGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ 269
+ A DG+ G G +SV S LA GL FS C ++ G + G+
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENGGGGILVLGEIVEPNM 273
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
T + S Y ++ + + I S ++ + I+D+G++ +L + Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
V+ ++ + CY ++ P V L F S +N ++I
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ-CYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNN 392
Query: 385 V--VTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V +C+ Q + I +G + V+D ++GW++ +C
Sbjct: 393 VGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 146/379 (38%), Gaps = 71/379 (18%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+K S+ D G DL+WI C C C +N D + P SS+ +SC L
Sbjct: 50 AKVFSVIADTGSDLIWIQCKPCQAC-------FNQKDP---IFDPEGSSSYTTMSCGDTL 99
Query: 149 CDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLIS--GGDNALKNSVQASVI 203
CD P++ C DY Y + + + G L + + L S G A KN +
Sbjct: 100 CD-----SLPRKSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKN-----IA 149
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGR 258
GCG G + D GL+GLG G +S S L L + FS C +
Sbjct: 150 FGCGHLNRGSFNDA---SGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSP 204
Query: 259 IFFGDQGPATQQS-------TSFLASNGKYITYIIGVETCCIGSSCLK----------QT 301
+FFGD+ + T + + Y + ++ I L+
Sbjct: 205 MFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDG 264
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP---KL 358
S I DSG++ T LP Y+ + +++ CY S + K+
Sbjct: 265 SGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKMKI 324
Query: 359 PSVKLMF-------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
P++ F P N F+ N GT V CLA+ + DIG G +
Sbjct: 325 PAMVFHFEGADYQLPVENYFIAANDA----GTIV----CLAMVSSNMDIGIYGNMMQQNF 376
Query: 412 RVVFDRENLKLGWSHSNCQ 430
RV++D + K+GW+ S C
Sbjct: 377 RVMYDIGSSKIGWAPSQCD 395
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/390 (23%), Positives = 170/390 (43%), Gaps = 48/390 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C ++ P SST + + C+ L +C
Sbjct: 99 DTGSTVTYVPCSTCEQCG----------RHQDPKFQPDLSSTYQPVKCT-----LDCNCD 143
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N + C Y Y E ++SSG+L ED++ + + A + +V GC ++G
Sbjct: 144 NDRMQCVYERQY-AEMSTSSGVLGEDVVSFGNQSELAPQRAV-----FGCENVETGDLYS 197
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSF 274
A DG++GLG G++S+ L ++ +SFS+C+ D G + G P + F
Sbjct: 198 QHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPSDM--VF 254
Query: 275 LASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVY----ET 323
S+ + Y I ++ + L +++DSG+++ +LP+E + E
Sbjct: 255 AQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYLPEEAFLAFKEA 314
Query: 324 IAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
I E Q++ ++ + SQ P V ++F + + ++ ++
Sbjct: 315 IVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFR 374
Query: 382 GTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPL 440
++V +CL I D T +G + V++DRE K+G+ +NC +L + +
Sbjct: 375 HSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNCAELWERLQISS 434
Query: 441 TPGPGTPSNPLPANQEQSSPGGHAVGPAVA 470
P P+P N E ++ +V P+VA
Sbjct: 435 APP------PMPPNTEATN-STKSVDPSVA 457
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 150/373 (40%), Gaps = 60/373 (16%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCA-PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
K L D G DL W+ CD C C PL Y + LSC
Sbjct: 78 KLFELDIDTGSDLTWVQCDAPCTGCTKPLHHLY---------------KPRNNLLSCIDP 122
Query: 148 LC----DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQA 200
LC + GT CQ+ C Y + Y E SS G+LV D L L++G + ++
Sbjct: 123 LCSAVQNSGTYQCQSATDQCDYEIQYADEG-SSLGVLVTDYFPLRLMNG------SFLRP 175
Query: 201 SVIIGCGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 259
+ GCG Q S G + G++GLG G+ S+ S L G++ N C + G +
Sbjct: 176 KMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKGGGFL 235
Query: 260 FFGDQGPATQQSTSFLASNGKYIT--YIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
FFG Q P S+ + K + Y G G + + I DSGSS+T+
Sbjct: 236 FFG-QDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSSYTYFN 294
Query: 318 KEVYETIAAEFDRQVNDT--ITSFEGYPWKCCYKSSSQ----------------RLPKLP 359
+VY++ ++++ + E C+K + + K
Sbjct: 295 AQVYQSTLNLIRKELSGKPLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALSFTKAK 354
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
SV+L P + +V N V G ++ G + + G+ IG N V++D +
Sbjct: 355 SVQLQIPPEDYLIVTNDGNVCLG--ILNGSEVGL----GNFNVIGDNLFQDKLVIYDSDK 408
Query: 420 LKLGWSHSNCQDL 432
++GW +NC L
Sbjct: 409 HQIGWIPANCDRL 421
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 100/412 (24%), Positives = 160/412 (38%), Gaps = 79/412 (19%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C + RD Y P+ + + C +L
Sbjct: 75 KLYDLDIDSGSDLTWVQCDAPCKGC---------TKPRD-QLYKPNHNL----VQCVDQL 120
Query: 149 CD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASV 202
C + +C +P PC Y ++Y ++ SS G+LV D + + G + V+ V
Sbjct: 121 CSEVHLSMAYNCPSPDDPCDYEVEY-ADHGSSLGVLVRDYIPFQFTNG-----SVVRPRV 174
Query: 203 IIGCGMKQS-GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
GCG Q G A G++GLG G S+ S L GLIRN C G +FF
Sbjct: 175 AFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQGGGFLFF 234
Query: 262 GDQGPATQQSTSFLASNGKYITYII----------GVETCCIGSSCLKQTSFKAIVDSGS 311
GD F+ S+G T ++ G + I DSGS
Sbjct: 235 GDD---------FIPSSGIVWTSMLSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGS 285
Query: 312 SFTFLPKEVYETI---------AAEFDRQVNDT--------ITSFEGY-PWKCCYKSSSQ 353
S+T+ + Y+ + + R +D SFE K +K +
Sbjct: 286 SYTYFNSQAYQAVVDLVTKDLKGKQLKRATDDPSLPICWKGAKSFESLSDVKKYFKPLAL 345
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRV 413
K ++++ P + ++ V G ++ G + ++ ++ IG + V
Sbjct: 346 SFKKSXNLQMHLPPESYLIITKHGNVCLG--ILDGTEVGLE----NLNIIGDITLQDKMV 399
Query: 414 VFDRENLKLGWSHSNC-------QDLNDGTKSPLTPGPGTPSNPLPANQEQS 458
++D E ++GW SNC +DL P G + PA+ E++
Sbjct: 400 IYDNEKQQIGWVSSNCDRLPNVDRDLEGDFPHPYATNLGIFGDRCPASYEET 451
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 145/373 (38%), Gaps = 68/373 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST-SKHLSCSHRLCDL-GTS 154
D G DL+WI C C +C S Y+ PSASST +K + L +
Sbjct: 22 DTGSDLVWIQCKPCSQCYSQSDPIYD----------PSASSTFAKTSCSTSSCQSLPASG 71
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C + + C Y Y +++ +E + SGG + + Q GCG SG +
Sbjct: 72 CSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQ----FGCGRLNSGSF 127
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGR--IFFGDQGPATQ 269
G A G++GLG G+IS+ + L A I N FS C FD D S + FG
Sbjct: 128 -GGAA--GIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSKTSPLIFGSSASTGS 182
Query: 270 Q--STSFLASNGKYITYIIGVETCCIGSS-----------------------CLKQTSFK 304
ST + ++G+ Y +G+E +G L+ S
Sbjct: 183 GAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNSGG 242
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
I DSG++ T L VY + + F V+ + CY S + K P++ L
Sbjct: 243 TIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKFPALTLA 302
Query: 365 F-------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFM-TGYRVVFD 416
F PQ N FV+ + + CLA+ I N M Y VV+D
Sbjct: 303 FKGTKFSPPQKNYFVIVDTAETVA--------CLAMGGSGSLGLGIIGNLMQQNYHVVYD 354
Query: 417 RENLKLGWSHSNC 429
R + S + C
Sbjct: 355 RGTSTISMSPAQC 367
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 104/441 (23%), Positives = 174/441 (39%), Gaps = 68/441 (15%)
Query: 35 FSEEVKALGVSKNRN-----ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQG 89
+ +EVK + + + N S + K E+ ++++ + TG F +F
Sbjct: 121 WKQEVKVITIQQQNNLANAVVASLKSSKD-EFSGNIMATLESGASLGTGEYFIDMFVGTP 179
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K + L D G DL WI CD C C + +YN P+ SS+ +++SC
Sbjct: 180 PKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYN----------PNESSSYRNISCYDPR 229
Query: 149 CDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C L +S C+ Q CPY DY + ++ +E ++ + K V
Sbjct: 230 CQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDV 289
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSG 257
+ GCG G + L+GLG G +S PS L + +SFS C + S
Sbjct: 290 MFGCGHWNKGFFHGAGG---LLGLGRGPLSFPSQLQ--SIYGHSFSYCLTDLFSNTSVSS 344
Query: 258 RIFFGDQGPATQQS----TSFLASNG--KYITYIIGVETCCIGSSCLK--QTSFK----- 304
++ FG+ T LA Y + +++ +G L + ++
Sbjct: 345 KLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEG 404
Query: 305 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
I+DSGS+ TF P Y+ I F++++ + + + CY S +LP
Sbjct: 405 VGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDY 464
Query: 362 KLM--------FPQNNSFVVNNPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGY 411
+ FP N F P VI CLAI P + IG +
Sbjct: 465 GIHFADGAVWNFPAENYFYQYEPDEVI---------CLAILKTPNHSHLTIIGNLLQQNF 515
Query: 412 RVVFDRENLKLGWSHSNCQDL 432
+++D + +LG+S C ++
Sbjct: 516 HILYDVKRSRLGYSPRRCAEV 536
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 159/376 (42%), Gaps = 46/376 (12%)
Query: 93 MSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL 151
SL D G + ++PC C C N D +SP+ SS+ K L C C
Sbjct: 48 FSLIVDTGSTVTYVPCSSCTHCG-------NHQD---PRFSPALSSSYKPLECGSE-CST 96
Query: 152 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
G C ++ Y E ++SSG+L +D++ + D + ++ GC ++
Sbjct: 97 GF-CDGSRK----YQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQR-----LVFGCETAET 146
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPAT 268
G D A DG+IGLG G +S+ L + + + FS+C+ D G I G Q P
Sbjct: 147 GDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKD 205
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYE 322
T+ Y Y + ++ +G S L+ + ++DSG+++ + P ++
Sbjct: 206 MVFTASDPHRSPY--YNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQ 263
Query: 323 TIAAEFDRQVNDTITSFEGYPWK---CCYKSSSQRLPKL----PSVKLMFPQNNSFVVNN 375
+ QV ++ G K CY + + L PSV +F S ++
Sbjct: 264 AFKSAVKEQVG-SLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSP 322
Query: 376 PVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
++ T++ +CL + +GD T +G + V ++R +G+ + C DL
Sbjct: 323 ENYLFRHTKISGAYCLGVFE-NGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKCNDL-- 379
Query: 435 GTKSPLTPGPGTPSNP 450
++ P T PG + P
Sbjct: 380 WSRLPETNEPGHSTQP 395
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 141/302 (46%), Gaps = 29/302 (9%)
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+K+ + D G D++W+ C C +C S +L +L Y+ S + K +SC
Sbjct: 90 AKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESDSGKLVSCDDDF 144
Query: 149 CDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C + S CPY ++ Y + +S++G V+D++ S + + SVI
Sbjct: 145 CYQISGGPLSGCKANMSCPY-LEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIF 203
Query: 205 GCGMKQSGGYLDG---VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
GCG +QSG LD A DG++G G S+ S LA +G ++ F+ C D + G IF
Sbjct: 204 GCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFA 262
Query: 262 GDQGPATQQSTSFLASNGKY----ITYI-IGVETCCIGSSCLKQTSFK-AIVDSGSSFTF 315
+ + + + L N + +T + +G E I + + K AI+DSG++ +
Sbjct: 263 IGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAY 322
Query: 316 LPKEVYE-TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
LP+ +YE + E +V+ ++ C++ S + P+V F +N+ F+
Sbjct: 323 LPEIIYEPLVKKEPALKVHIVDKDYK------CFQYSGRVDEGFPNVTFHF-ENSVFLRV 375
Query: 375 NP 376
P
Sbjct: 376 YP 377
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 149/367 (40%), Gaps = 52/367 (14%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC- 149
K L D G DL W+ CD AP + +Y P+ ++ L CSH LC
Sbjct: 78 KLFDLDIDTGSDLTWVQCD----APCNGC---------TKYKPNHNT----LPCSHILCS 120
Query: 150 --DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQASVI 203
DL C +P+ C Y + Y +++ SS G LV D L L +G L+ +
Sbjct: 121 GLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKLANGSIMNLR------LT 173
Query: 204 IGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GCG +Q+ G G++GLG G++ + + L G+ +N C G + G
Sbjct: 174 FGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIG 233
Query: 263 DQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVY 321
D+ P++ + + LA+N Y+ G + DSGSS+T+ E Y
Sbjct: 234 DELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAY 293
Query: 322 ETIAAEFDRQVNDT--ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP--------QNNSF 371
+ I + +N + + C+K + L L VK F Q N
Sbjct: 294 QAILDLIRKDLNGKPLTDTKDDKSLPVCWK-GKKPLKSLDEVKKYFKTITLRFGNQKNGQ 352
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFM-----TGYRVVFDRENLKLGWS 425
+ P CL I ++G +IG G N + G V++D E ++GW
Sbjct: 353 LFQVPPESYLIITEKGRVCLGI--LNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWI 410
Query: 426 HSNCQDL 432
S+C L
Sbjct: 411 SSDCDKL 417
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 140/355 (39%), Gaps = 43/355 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---T 153
D G L+W C C C S YY++ S SST SC C L T
Sbjct: 109 DTGSVLVWTQCQPCAVCFNQSLPYYDA----------SRSSTFALPSCDSTQCKLDPSVT 158
Query: 154 SCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C N Q C Y+ Y + +++ G L + + ++G V+ GCG+ +G
Sbjct: 159 MCVNQTVQTCAYSYSY-GDKSATIGFLDVETVSFVAGAS-------VPGVVFGCGLNNTG 210
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-----FGDQGPA 267
+ G+ G G G +S+PS L K G + F+ + S +F G
Sbjct: 211 IFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRG 267
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFKAIVDSGSSFTFLPK 318
T Q+T + + Y + ++ +GS+ LK + I+DSG++FT LP
Sbjct: 268 TVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPP 327
Query: 319 EVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
VY + EF V + S E P C + P +P + L F +
Sbjct: 328 RVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGATMHLPRENY 387
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
CLAI ++G++ IG V++D +N KL + + C L
Sbjct: 388 VFEAKDGGNCSICLAI--IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 140/355 (39%), Gaps = 43/355 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---T 153
D G L+W C C C S YY++ S SST SC C L T
Sbjct: 53 DTGSVLVWTQCQPCAVCFNQSLPYYDA----------SRSSTFALPSCDSTQCKLDPSVT 102
Query: 154 SCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C N Q C Y+ Y + +++ G L + + ++G V+ GCG+ +G
Sbjct: 103 MCVNQTVQTCAYSYSY-GDKSATIGFLDVETVSFVAGAS-------VPGVVFGCGLNNTG 154
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-----FGDQGPA 267
+ G+ G G G +S+PS L K G + F+ + S +F G
Sbjct: 155 IFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRG 211
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFKAIVDSGSSFTFLPK 318
T Q+T + + Y + ++ +GS+ LK + I+DSG++FT LP
Sbjct: 212 TVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPP 271
Query: 319 EVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
VY + EF V + S E P C + P +P + L F +
Sbjct: 272 RVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGATMHLPRENY 331
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
CLAI ++G++ IG V++D +N KL + + C L
Sbjct: 332 VFEAKDGGNCSICLAI--IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 384
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 150/359 (41%), Gaps = 44/359 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G D+LW+ C+ P S+ L +LN + SST+ + CS +C G
Sbjct: 86 DTGSDILWVNCNTCSNCPQSSQ----LGIELNFFDTVGSSTAALIPCSDLICTSGVQGAA 141
Query: 154 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILH--LISGGDNALKNSVQASVIIGCGMKQ 210
C C YT Y + + +SG V D ++ LI G A+ ++ A+++ GC + Q
Sbjct: 142 AECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFNLIMGQPPAVNST--ATIVFGCSISQ 198
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGD---- 263
SG A DG+ G G G +SV S L+ G+ FS C D + G + G+
Sbjct: 199 SGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGILVLGEILEP 258
Query: 264 ---------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
P + +A NG+ + V + + IVD G++
Sbjct: 259 SIVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFS-------ISNNRGGTIVDCGTTLA 311
Query: 315 FLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+L +E Y+ + + V+ + T+ +G CY S+ P V L F S V
Sbjct: 312 YLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTSIGDIFPLVSLNFEGGASMV 368
Query: 373 VNNPVFVIYGTQV--VTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++++ + +C+ Q + +G + VV+D ++GW++ +C
Sbjct: 369 LKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDC 427
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 107/454 (23%), Positives = 190/454 (41%), Gaps = 57/454 (12%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-NATSWPAKKSFEYY 62
+ + I+L +++ ++G + F+ +LIHR S + +N + + ++S +
Sbjct: 8 VIVIIFLISTAVVSAATGPD-YGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHN 66
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSL--GNDFGCDLLWIPCD-CVRCAPLSAS 119
L+++ V+ ++ M S G+ + D G D++W C+ C C
Sbjct: 67 TGLVTNTVEAPIYNNRGEYLMKL-SVGTPPFPIIAVADTGSDIIWTQCEPCTNC------ 119
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSS 176
+DL ++PS S+T + +SCS +C SC K C Y++ Y +N+ S
Sbjct: 120 ----YQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSF-KPDCTYSISY-GDNSHSQ 173
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
G D L + G + + IGCG +G + V+ G++GLGLG S+
Sbjct: 174 GDFAVDTLTM---GSTSGRVVAFPRTAIGCGHDNAGSFDANVS--GIVGLGLGPASLIKQ 228
Query: 237 LAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGV 288
+ A + FS C D S ++ FG + ST S+ Y + +
Sbjct: 229 MGSA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKL 286
Query: 289 ETCCIG--------SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+ +G ++ + I+DSG++ T LP ++Y A +N T
Sbjct: 287 KAVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDP 346
Query: 341 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGD 399
+ C+++++ K+P + + F N + V + V+ CLA D D
Sbjct: 347 NQFLEYCFETTTDDY-KVPFIAMHFEGANLRLQRENVLIRVSDNVI---CLAFAGAQDND 402
Query: 400 I---GTIGQ-NFMTGYRVVFDRENLKLGWSHSNC 429
I G I Q NF+ GY D N+ L + NC
Sbjct: 403 ISIYGNIAQINFLVGY----DVTNMSLSFKPMNC 432
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/348 (23%), Positives = 142/348 (40%), Gaps = 22/348 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C P ++ L LN + P +S T+ +SCS + C G
Sbjct: 99 DTGSDVLWVSCASCNGCPQTSG----LQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSD 154
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+ C C YT Y + + +SG V D+L ++L + A V+ GC Q+G
Sbjct: 155 SGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ 269
+ A DG+ G G +SV S LA G+ FS C ++ G + G+
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNM 273
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
T + S Y ++ + + I S ++ + I+D+G++ +L + Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
V+ ++ + CY ++ P V L F S +N ++I
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ-CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNN 392
Query: 385 V--VTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V +C+ Q + I +G + V+D ++GW++ +C
Sbjct: 393 VGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 149/359 (41%), Gaps = 45/359 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG---- 152
D G D+LW+ C+ C C S L LN + S+SST+ + CS +C
Sbjct: 84 DTGSDVLWVCCNSCNNCPRTSG-----LGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTT 138
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
T C C YT Y + + +SG V D L+ + +L + A ++ GC QS
Sbjct: 139 VTQCSPQTNQCSYTFQY-EDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQS 197
Query: 212 GGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD-------------DSG 257
G + A DG+ G G GE+SV S L+ G+ FS C + + G
Sbjct: 198 GDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIGGGILVLGEILEPG 257
Query: 258 RIF--FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTF 315
++ P + +A NGK ++ ++ +S S IVDSG++ +
Sbjct: 258 MVYSPLVPSQPHYNLNLQSIAVNGK----LLPIDPSVFATS----NSQGTIVDSGTTLAY 309
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
L E Y+ + + V+ ++T + CY S+ P F S V+
Sbjct: 310 LVAEAYDPFVSAVNVIVSPSVTPIISKGNQ-CYLVSTSVSQMFPLASFNFAGGASMVLKP 368
Query: 376 PVFVI-----YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++I G V+ +C+ Q V G + +G + V+D ++GW++ +C
Sbjct: 369 EDYLIPFGPSQGGSVM--WCIGFQKVQG-VTILGDLVLKDKIFVYDLVRQRIGWANYDC 424
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 132/293 (45%), Gaps = 28/293 (9%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SK + D G D++W+ C R P ++S L +L Y S+T K +SC + C
Sbjct: 97 SKDYYVQVDTGSDIVWVNCIQCRECPRTSS----LGMELTPYDLEESTTGKLVSCDEQFC 152
Query: 150 ---DLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
+ G + C CPY + Y + +S++G V+D + + + S+
Sbjct: 153 LEVNGGPLSGCTT-NMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKF 210
Query: 205 GCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-F 261
GCG +QSG G A DG++G G S+ S LA ++ F+ C D + G IF
Sbjct: 211 GCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGGIFAM 270
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--FKA------IVDSGSSF 313
G T + + Y + GV+ +G L ++ F+A I+DSG++
Sbjct: 271 GHVVQPKVNMTPLVPNQPHYNVNMTGVQ---VGHIILNISADVFEAGDRKGTIIDSGTTL 327
Query: 314 TFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
+LP+ +YE + A+ +Q N + + G +K C++ S + P V F
Sbjct: 328 AYLPELIYEPLVAKILSQQHNLEVQTIHG-EYK-CFQYSERVDDGFPPVIFHF 378
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 81/348 (23%), Positives = 142/348 (40%), Gaps = 22/348 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C P ++ L LN + P +S T+ +SCS + C G
Sbjct: 99 DTGSDVLWVSCASCNGCPQTSG----LQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSD 154
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+ C C YT Y + + +SG V D+L ++L + A V+ GC Q+G
Sbjct: 155 SGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ 269
+ A DG+ G G +SV S LA G+ FS C ++ G + G+
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNM 273
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
T + S Y ++ + + I S ++ + I+D+G++ +L + Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
V+ ++ + CY ++ P V L F S +N ++I
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ-CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNN 392
Query: 385 V--VTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V +C+ Q + I +G + V+D ++GW++ +C
Sbjct: 393 VGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 159/389 (40%), Gaps = 62/389 (15%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSAS 136
TG + + + +K L D G +L WI C P N + L Y P
Sbjct: 37 TGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKC---HATPGPCKTCNKVPHPL--YRPK-- 89
Query: 137 STSKHLSCSHRLCD-----LGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 189
K + C+ LCD LGT+ C+ C Y ++Y + T+S G+L+ D L +G
Sbjct: 90 ---KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINY-ADGTTSLGVLLLDKFSLPTG 145
Query: 190 GDNALKNSVQASVIIGCGMKQSGG----YLDGVAPDGLIGLGLGEISVPSLLAKAGLI-R 244
++ GCG Q G + V DG++GLG G + + S L +G + +
Sbjct: 146 S--------ARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSK 197
Query: 245 NSFSMCFDKDDSGRIFFGDQG-PATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTS 302
N C G +F G++ P++ ++ + Y G T +G + +
Sbjct: 198 NVIGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKP 257
Query: 303 FKAIVDSGSSFTFLPKEVYETIAAEFD--------RQVNDTITSFEGYPWKCCYK----- 349
FKAI DSGS++T+LP+ ++ + + + V+DT T C+K
Sbjct: 258 FKAIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLH-----LCWKGPKPF 312
Query: 350 SSSQRLPK--LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF---CLAIQPVDG-DIGTI 403
+ LPK V L F + + ++I +TG C I + G D+ I
Sbjct: 313 KTVHDLPKEFKSLVTLKFDHGVTMTIPPENYLI-----ITGHGNACFGILELPGYDLFVI 367
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
G M V+ D E +L W S C +
Sbjct: 368 GGISMQEQLVIHDNEKGRLAWMPSPCDKM 396
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 108/453 (23%), Positives = 190/453 (41%), Gaps = 55/453 (12%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-NATSWPAKKSFEYY 62
+ + I+L +++ ++G + F+ +LIHR S + +N + + ++S +
Sbjct: 8 VIVIIFLISTAVVSAATGPD-YGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHN 66
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSL--GNDFGCDLLWIPCDCVRCAPLSASY 120
L+++ V+ ++ M S G+ + D G D++W CV C
Sbjct: 67 TGLVTNTVEAPIYNNRGEYLMKL-SVGTPPFPIIAVADTGSDIIWT--QCVPCT------ 117
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSSG 177
N +DL ++PS S+T + +SCS +C SC K C Y++ Y +N+ S G
Sbjct: 118 -NCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSF-KPDCTYSISY-GDNSHSQG 174
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
D L + G + + IGCG +G + V+ G++GLGLG S+ +
Sbjct: 175 DFAVDTLTM---GSTSGRVVAFPRTAIGCGHDNAGSFDANVS--GIVGLGLGPASLIKQM 229
Query: 238 AKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVE 289
A + FS C D S ++ FG + ST S+ Y + ++
Sbjct: 230 GSA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLK 287
Query: 290 TCCIG--------SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 341
+G ++ + I+DSG++ T LP ++Y A +N T
Sbjct: 288 AVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPN 347
Query: 342 YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDI 400
+ C+++++ K+P + + F N + V + V+ CLA D DI
Sbjct: 348 QFLEYCFETTTDDY-KVPFIAMHFEGANLRLQRENVLIRVSDNVI---CLAFAGAQDNDI 403
Query: 401 ---GTIGQ-NFMTGYRVVFDRENLKLGWSHSNC 429
G I Q NF+ GY D N+ L + NC
Sbjct: 404 SIYGNIAQINFLVGY----DVTNMSLSFKPMNC 432
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 152/357 (42%), Gaps = 49/357 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G DL WI + C C ++ + PS SST ++CS C LGT
Sbjct: 43 DTGSDLTWIQSEPCRAC----------FEQADPIFDPSKSSTYNKIACSSSACADLLGTQ 92
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
+ C Y Y + + E I + G+ V G + +G +
Sbjct: 93 TCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEE---------VKFGASVYNTGTF 143
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQG-PAT 268
D +G++GLG G +S+PS L ++ N FS C ++ ++FGD P+
Sbjct: 144 GD-TGGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSG 200
Query: 269 QQSTSFLASNGKYITYI-IGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLP 317
+ + + N + TY I V+ +G S L Q+ ++ I+DSG++ T+L
Sbjct: 201 EVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQ 260
Query: 318 KEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
+EV+ + A + QV T TS G C+ + P P++ + + +
Sbjct: 261 QEVFNALVAAYTSQVRYPTTTSATG--LDLCFNTRGTGSPVFPAMTIHLDGVHLELPTAN 318
Query: 377 VFVIYGTQVVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
F+ T ++ CLA +D I G + +V+D +N+++G++ ++C L
Sbjct: 319 TFISLETNII---CLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCASL 372
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 140/344 (40%), Gaps = 38/344 (11%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL W+ C C C Y D + PS SST ++C C +L S
Sbjct: 167 DTGSDLSWVQCKPCADC-------YEQQD---PLFDPSLSSTYAAVACGAPECQELDASG 216
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + + + G LV D L L + + + GCG Q+ G
Sbjct: 217 CSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSA-------SDTLPGFVFGCG-DQNAGLF 267
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQQSTS 273
V DGL GLG ++S+PS A + F+ C SGR + G PA Q T+
Sbjct: 268 GQV--DGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGYLSLGGAPPANAQFTA 323
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAE 327
LA Y I + +G ++ + ++DSG+ T LP Y + A
Sbjct: 324 -LADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAA 382
Query: 328 FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVT 387
F R + + CY + R ++P+V+L F + V + V+Y ++V
Sbjct: 383 FARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAF-AGGATVSLDFTGVLYVSKVSQ 441
Query: 388 GFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
CLA P D I +G + V +D N ++G+ C
Sbjct: 442 A-CLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 140/344 (40%), Gaps = 38/344 (11%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL W+ C C C Y D + PS SST ++C C +L S
Sbjct: 167 DTGSDLSWVQCKPCADC-------YEQQD---PLFDPSLSSTYAAVACGAPECQELDASG 216
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + + + G LV D L L + + + GCG Q+ G
Sbjct: 217 CSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSA-------SDTLPGFVFGCG-DQNAGLF 267
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQQSTS 273
V DGL GLG ++S+PS A + F+ C SGR + G PA Q T+
Sbjct: 268 GQV--DGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGYLSLGGAPPANAQFTA 323
Query: 274 FLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAE 327
LA Y I + +G ++ + ++DSG+ T LP Y + A
Sbjct: 324 -LADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAA 382
Query: 328 FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVT 387
F R + + CY + R ++P+V+L F + V + V+Y ++V
Sbjct: 383 FARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAF-AGGATVSLDFTGVLYVSKVSQ 441
Query: 388 GFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
CLA P D I +G + V +D N ++G+ C
Sbjct: 442 A-CLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 154/360 (42%), Gaps = 48/360 (13%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK--HLSCSHRLCDLGT 153
D G DL W+ CD CV C L A + L + N+ P K H + +HR
Sbjct: 75 DTGSDLTWLQCDAPCVHC--LEAPH--PLYQPSNDLIPCNDPLCKALHFNGNHR------ 124
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN-SVQASVIIGCGMKQSG 212
C+ P+Q C Y ++Y + SS G+LV D+ L N K + + +GCG Q
Sbjct: 125 -CETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSL-----NYTKGLRLTPRLALGCGYDQIP 176
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG-DQGPATQQS 271
G DG++GLG G++S+ S L G ++N C G +FFG D +++ S
Sbjct: 177 GASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLGGGILFFGNDLYDSSRVS 236
Query: 272 TSFLA-SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
+ +A N K+ + +G E G + + DSGSS+T+ + Y+ + R
Sbjct: 237 WTPMARENSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKR 295
Query: 331 QVNDT--ITSFEGYPWKCCYKSSSQRL----------PKLPSVKLMFPQNNSFVVNNPVF 378
+++ + + + C++ + P S K + F + +
Sbjct: 296 ELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAY 355
Query: 379 VIYGTQ------VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+I + ++ G + +Q ++ IG M +++D E +GW ++C ++
Sbjct: 356 LIISMKGNVCLGILNGTEIGLQ----NLNLIGDISMQDQMIIYDNEKQSIGWIPADCDEI 411
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 83/363 (22%), Positives = 145/363 (39%), Gaps = 54/363 (14%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----D 150
D G DL W+ CD CVRC L P +S + C+ LC +
Sbjct: 78 DTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDLIPCNDPLCKALHLN 123
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN-SVQASVIIGCGMK 209
C+ P+Q C Y ++Y + SS G+LV D+ + N K + + +GCG
Sbjct: 124 SNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM-----NYTKGLRLTPRLALGCGYD 176
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ--GPA 267
Q G DG++GLG G++S+ S L G ++N C G +FFGD +
Sbjct: 177 QIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSS 236
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAE 327
T K+ + +G E G + + DSGSS+T+ + Y+ +
Sbjct: 237 RVSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYL 295
Query: 328 FDRQVNDT--ITSFEGYPWKCCYKSSSQRL----------PKLPSVKLMFPQNNSFVVNN 375
R+++ + + + C++ + P S K + F +
Sbjct: 296 LKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPP 355
Query: 376 PVFVIYGTQ------VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++I + ++ G + +Q ++ IG M +++D E +GW ++C
Sbjct: 356 EAYLIISMKGNVCLGILNGTEIGLQ----NLNLIGDISMQDQMIIYDNEKQSIGWMPADC 411
Query: 430 QDL 432
+L
Sbjct: 412 DEL 414
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 145/368 (39%), Gaps = 66/368 (17%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G DL+W C C C P + SP ASS+ + + C+ LC+ L S
Sbjct: 122 DTGSDLIWTQCAPCASCLPQPDPIF----------SPGASSSYEPMRCAGELCNDILHHS 171
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
CQ P C Y Y + T++ G+ + S + A + GCG G
Sbjct: 172 CQRPDT-CTYRYSY-GDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSL 229
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---IFFG-------DQ 264
+G G++G G +S+ S LA IR FS C SGR + FG D
Sbjct: 230 NNG---SGIVGFGRAPLSLVSQLA----IRR-FSYCLTPYASGRKSTLLFGSLRGGVYDA 281
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFT 314
AT Q+T L S Y + +G+ L+ S AIVDSG++ T
Sbjct: 282 ATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALT 341
Query: 315 FLPKEVYETIAAEFDRQVNDTITS--FEGYPWKCCYKSSSQRLPK----------LPSVK 362
P V + F Q+ + G C+ +++ R+P+ L
Sbjct: 342 LFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGAD 401
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFM-TGYRVVFDRENLK 421
L P+ N +V+++ Q CL + GD GT NF+ RV++D E
Sbjct: 402 LDLPRRN-YVLDD--------QRKGNLCLLLAD-SGDSGTTIGNFVQQDMRVLYDLEADT 451
Query: 422 LGWSHSNC 429
L ++ + C
Sbjct: 452 LSFAPAQC 459
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 151/374 (40%), Gaps = 54/374 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYN-SLDRDLNEYSPSASSTSKHLSCSH 146
+K L D G DL W+ CD C CA Y+ R ++ P+ + + +
Sbjct: 41 AKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRARVVDCRRPTCAQVQRGGQFT- 99
Query: 147 RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV--QASVII 204
C + C Y +DY + +S+ G+LVED + L+ L N Q +I
Sbjct: 100 --------CSGDVRQCDYEVDY-VDGSSTMGILVEDTITLV------LTNGTRFQTRAVI 144
Query: 205 GCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFF 261
GCG Q G A DG+IGL +IS+PS LA G+ N C + G +FF
Sbjct: 145 GCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGGGYLFF 204
Query: 262 GDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-----AIVDSGSSFTF 315
GD PA + + + Y + + G L+ A+ DSG+SFT+
Sbjct: 205 GDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGAMFDSGTSFTY 264
Query: 316 LPKEVYETIAAEFDRQVN----DTITSFEGYP--WK--CCYKSSSQRLPKLPSVKLMFPQ 367
L Y + + RQ + I + P W+ ++S + +V L F
Sbjct: 265 LVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYFKTVTLDFGG 324
Query: 368 NNSFVVNNPV------FVIYGTQVVTGFCLAIQPVDGDIGT------IGQNFMTGYRVVF 415
+ + + ++I TQ CL + +D + + +G M GY VV+
Sbjct: 325 STWWSSGKLLELSPEGYLIVSTQ--GNVCLGV--LDASVASLEVTNILGDISMRGYLVVY 380
Query: 416 DRENLKLGWSHSNC 429
D ++GW NC
Sbjct: 381 DNMREQIGWVRRNC 394
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 160/377 (42%), Gaps = 51/377 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ +L D G + ++PC C +C ++ P SST + + C
Sbjct: 24 QRFALIVDTGSSVTYVPCSSCEQCG----------RHQDPKFQPDLSSTYQSVKC----- 68
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
++ +C + KQ C Y Y E ++SSG+L EDI IS G+ L + GC
Sbjct: 69 NIDCNCDDEKQQCVYERQY-AEMSTSSGVLGEDI---ISFGN--LSALAPQRAVFGCENM 122
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
++G A DG++G+G G++S+ L G+I +SFS+C+ G G +
Sbjct: 123 ETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPP 181
Query: 270 QSTSFLASNG-KYITYIIGVETCCIGSS--CLKQTSFKA----IVDSGSSFTFLPKEVY- 321
+ F S+ + Y I ++ + L T F I+DSG+++ +LP+ +
Sbjct: 182 SNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYLPEAAFV 241
Query: 322 ---ETIAAEF---------DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 369
+ I E D ND S G SQ P+V+++F
Sbjct: 242 SFKDAIMKELHSLKPIRGPDPNYNDICFSGAG-------SDISQLSSSFPAVEMVFGNGQ 294
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSN 428
+++ ++ ++V +CL I D T +G + V++DREN K+G+ +N
Sbjct: 295 KLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKIGFWKTN 354
Query: 429 CQDLNDGTKSPLTPGPG 445
C +L + P P
Sbjct: 355 CSELWERLNVDGAPPPA 371
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 148/364 (40%), Gaps = 54/364 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D G DL+W +C P A + D+ L + PS SST SC LC SC
Sbjct: 53 DTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASC 103
Query: 156 QNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+PK Q C YT Y + + ++G L D + G + V GCG+ +
Sbjct: 104 GSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFVGAGASV------PGVAFGCGLFNN 156
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGRIF 260
G + G+ G G G +S+PS L K G +FS CF D +F
Sbjct: 157 GVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIPSTVLLDLPADLF 209
Query: 261 FGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK-------AIVDSG 310
QG T + + Y + ++ +GS+ L +++F I+DSG
Sbjct: 210 SNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSG 269
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNN 369
+S T LP +VY+ + EF Q+ + C+ + SQ P +P + L F
Sbjct: 270 TSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATM 329
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLKLGWSHSN 428
N VF + + CLAI GD TI NF V++D +N L + +
Sbjct: 330 DLPRENYVFEVPDDAGNSIICLAIN--KGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQ 387
Query: 429 CQDL 432
C L
Sbjct: 388 CDKL 391
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 154/387 (39%), Gaps = 52/387 (13%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F L Q +++ L D G DL+W+ C C C+ S + + P
Sbjct: 81 SGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA---------TVFFPRH 131
Query: 136 SSTSKHLSCSHRLC------DLGTSCQNPKQPCPYTMDY-YTENTSSSGLLVEDI--LHL 186
SST C +C D C + + +Y Y + + +SGL + L
Sbjct: 132 SSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKT 191
Query: 187 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLI 243
SG + LK SV GCG + SG + G + +G++GLG G IS S L +
Sbjct: 192 SSGKEARLK-----SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--F 244
Query: 244 RNSFSMC-----FDKDDSGRIFFGDQGPATQQ--STSFLASNGKYITYIIGVETCCIGSS 296
N FS C + + G+ G + T L + Y + +++ + +
Sbjct: 245 GNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGA 304
Query: 297 CLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 346
L+ + +VDSG++ FL + Y ++ A R+V I +
Sbjct: 305 KLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDL 364
Query: 347 CYKSSSQRLPK--LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--T 402
C S P+ LP +K F FV + I + + CLAIQ VD +G
Sbjct: 365 CVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ--CLAIQSVDPKVGFSV 422
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNC 429
IG G+ FDR+ +LG+S C
Sbjct: 423 IGNLMQQGFLFEFDRDRSRLGFSRRGC 449
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 147/338 (43%), Gaps = 35/338 (10%)
Query: 130 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 189
+SP+ SS+ K L C + C G C ++ Y E ++SSG+L +D++ +
Sbjct: 74 RFSPALSSSYKPLECGNE-CSTGF-CDGSRK----YQRQYAEKSTSSGVLGKDVISFSNS 127
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
D + ++ GC ++G D A DG+IGLG G +S+ L + + + FS+
Sbjct: 128 SDLGGQR-----LVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAMEDVFSL 181
Query: 250 CFDKDDSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------Q 300
C+ D G I G Q P TS Y Y + ++ +G S L+
Sbjct: 182 CYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPY--YNLMLKGIRVGGSPLRLKPEVFD 239
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK---CCYKSSSQRLPK 357
+ ++DSG+++ + P ++ + QV ++ G K CY + +
Sbjct: 240 GKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVG-SLKEVPGPDEKFKDICYAGAGTNVSN 298
Query: 358 L----PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYR 412
L PSV +F S ++ ++ T++ +CL + +GD T +G +
Sbjct: 299 LSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFE-NGDPTTLLGGIIVRNML 357
Query: 413 VVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNP 450
V ++R +G+ + C DL ++ P T PG + P
Sbjct: 358 VTYNRGKASIGFLKTKCNDL--WSRLPETNEPGHSTQP 393
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/349 (23%), Positives = 146/349 (41%), Gaps = 24/349 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LG 152
D G D+LW+ C P S+ L DL+ + S T+ ++CS +C
Sbjct: 118 DTGSDILWVTCSSCSNCPHSSG----LGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTA 173
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Q C Y+ Y + + +SG + D + + +L + A ++ GC QSG
Sbjct: 174 AQCSENNQ-CGYSFRY-GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQ 269
A DG+ G G G++SV S L+ G+ FS C D SG F G+
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGM 291
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
+ + S Y ++ + + + ++ + ++ + IVD+G++ T+L KE Y+
Sbjct: 292 VYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLF 351
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
V+ +T + CY S+ PSV L F S ++ P ++
Sbjct: 352 LNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNFAGGASMML-RPQDYLFHYG 409
Query: 385 VVTG---FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ G +C+ Q + +G + V+D ++GW+ +C+
Sbjct: 410 IYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCK 458
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 151/373 (40%), Gaps = 58/373 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL------ 151
D G DL+W+ C P R + S S+T + CS C L
Sbjct: 71 DTGSDLIWLQCSTTAAPPAFCPKKACSRRP--AFVASKSATLSVVPCSAAQCLLVPAPRG 128
Query: 152 -GTSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISG--GDNALKNSVQASVIIGCG 207
G +C PC Y DY + +S++G L D + +G G A++ V GCG
Sbjct: 129 HGPACSPAAPVPCGYAYDY-ADGSSTTGFLARDTATISNGTSGGAAVRG-----VAFGCG 182
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG-LIRNSFSMCFDKDDSGR-------I 259
+ GG G G+IGLG G++S P A++G L +FS C + GR +
Sbjct: 183 TRNQGGSFSGTG--GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFL 237
Query: 260 FFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK----------AIVD 308
F G + + L SN T Y +GV +G+ L + ++D
Sbjct: 238 FLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVID 297
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVN-----DTITSFEGYPWKCCYKSSSQRLPK-----L 358
SGS+ T+L Y + + F V+ + T F+G + CY SS
Sbjct: 298 SGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG--LELCYNVSSSSSSAPANGGF 355
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--TIGQNFMTGYRVVFD 416
P + + F Q S + +++ V CLAI+P +G GY V FD
Sbjct: 356 PRLTIDFAQGLSLELPTGNYLVDVADDVK--CLAIRPTLSPFAFNVLGNLMQQGYHVEFD 413
Query: 417 RENLKLGWSHSNC 429
R + ++G++ + C
Sbjct: 414 RASARIGFARTEC 426
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 147/348 (42%), Gaps = 36/348 (10%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQ 156
D G DL+WI +CAP Y + + P SST ++SC LC L T
Sbjct: 86 DTGSDLIWI-----QCAPCLGCY----KQIKPMFDPLKSSTYNNISCDSPLCHKLDTGVC 136
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+P++ C YT Y +N+ + G+L +D S N K + + GCG +GG+ D
Sbjct: 137 SPEKRCNYTYGY-GDNSLTKGVLAQDTATFTS---NTGKPVSLSRFLFGCGHNNTGGFND 192
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLI--RNSFSMCF-----DKDDSGRIFFGD--QGPA 267
GLIGLG G S L+++ G + FS C D S R+ FG Q
Sbjct: 193 HEM--GLIGLGGGPTS---LISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQVLG 247
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEVYET 323
T+ L K +Y + + + + S +VDSG+ LP+++Y+
Sbjct: 248 NGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKANMLVDSGTPPILLPQQLYDK 307
Query: 324 IAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
+ AE +V IT + CY++ + K P++ F N + F+
Sbjct: 308 VFAEVRNKVALKPITDDPSLGTQLCYRTQTNL--KGPTLTFHFVGANVLLTPIQTFIPPT 365
Query: 383 TQVVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Q FCLAI + D G G + Y + FD + + + ++C
Sbjct: 366 PQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 156/373 (41%), Gaps = 73/373 (19%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL------ 151
D +L W V+CAP + + D+ + PS+S + + C+ CD
Sbjct: 169 DTASELTW-----VQCAPCESCH----DQQDPLFDPSSSPSYAAVPCNSSSCDALQLATG 219
Query: 152 GTS-----CQNPKQ---PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
GTS CQ Q C YT+ Y + + S G+L D L +L V +
Sbjct: 220 GTSGGAAACQGQDQSAAACSYTLSY-RDGSYSRGVLAHDRL--------SLAGEVIDGFV 270
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPSLLAKAGLIRNSFSMCF---DKDDSGRI 259
GCG G G + GL+GLG ++S V + + G + FS C + D SG +
Sbjct: 271 FGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTMDQFGGV---FSYCLPLKESDSSGSL 325
Query: 260 FFGDQGPATQQSTSFLASN-------GKYITYIIGVETCCIGSSCLKQTSF-------KA 305
GD + ST + ++ G + Y + + +G ++ + F KA
Sbjct: 326 VIGDDSSVYRNSTPIVYASMVSDPLQGPF--YFVNLTGITVGGQEVESSGFSSGGGGGKA 383
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKL 358
I+DSG+ T L +Y + AEF ++ F YP C+ + R ++
Sbjct: 384 IIDSGTVITSLVPSIYNAVKAEF-------LSQFAEYPQAPGFSILDTCFNMTGLREVQV 436
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFD 416
PS+KL+F V++ + + + + CLA+ P+ + T IG RV+FD
Sbjct: 437 PSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFD 496
Query: 417 RENLKLGWSHSNC 429
++G++ C
Sbjct: 497 TSGSQVGFAQETC 509
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 156/392 (39%), Gaps = 62/392 (15%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F L Q +++ L D G DL+W+ C C C+ S + + P
Sbjct: 80 SGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA---------TVFFPRH 130
Query: 136 SSTSKHLSCSHRLCDL------GTSCQNPK--QPCPYTMDYYTENTSSSGLLVEDI--LH 185
SST C +C L C + + CPY Y + + +SGL + L
Sbjct: 131 SSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYG-YADGSLTSGLFARETTSLK 189
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGL 242
SG + LK SV GCG + SG + G + +G++GLG G IS S L +
Sbjct: 190 TSSGKEAKLK-----SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR-- 242
Query: 243 IRNSFSMC-----FDKDDSGRIFFGDQGPATQQ--STSFLASNGKYITYIIGVETCCIGS 295
N FS C + + GD G A + T L + Y + +++ +
Sbjct: 243 FGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNG 302
Query: 296 SCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN----DTITSFEG 341
+ L+ + ++DSG++ FL Y + A +++ D +T
Sbjct: 303 AKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTP--- 359
Query: 342 YPWKCCYKSSSQRLPK--LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD 399
+ C S P+ LP +K F FV + I + + CLAIQ VD
Sbjct: 360 -GFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ--CLAIQSVDPK 416
Query: 400 IG--TIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G IG G+ FDR+ +LG+S C
Sbjct: 417 VGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 145/347 (41%), Gaps = 22/347 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LG 152
D G D+LW+ C P S+ L DL+ + S T+ ++CS +C
Sbjct: 118 DTGSDILWVTCSSCSNCPHSSG----LGIDLHFFDAPGSFTAGSVTCSDPICSSVFQTTA 173
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Q C Y+ Y + + +SG + D + + +L + A ++ GC QSG
Sbjct: 174 AQCSENNQ-CGYSFRY-GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQ 269
A DG+ G G G++SV S L+ G+ FS C D SG F G+
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGM 291
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
+ L S Y ++ + + I ++ + ++ + IVD+G++ T+L KE Y+
Sbjct: 292 VYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPF 351
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI-YG- 382
V+ +T + CY S+ P V L F S ++ ++ YG
Sbjct: 352 LNAISNSVSQLVTLIISNGEQ-CYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYGF 410
Query: 383 TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +C+ Q + +G + V+D ++GW++ +C
Sbjct: 411 YDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDC 457
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/372 (22%), Positives = 146/372 (39%), Gaps = 52/372 (13%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
Q + L D G DL W+ CD CVRC L P +S + C+
Sbjct: 56 QPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDLIPCN 101
Query: 146 HRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 200
LC + C+ P+Q C Y ++Y + SS G+LV D+ + + +
Sbjct: 102 DPLCKALHLNSNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGLRLTP 155
Query: 201 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
+ +GCG Q G DG++GLG G++S+ S L G ++N C G +F
Sbjct: 156 RLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILF 215
Query: 261 FGDQ--GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
FGD + T K+ + +G E G + + DSGSS+T+
Sbjct: 216 FGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYTYFNS 274
Query: 319 EVYETIAAEFDRQVNDT--ITSFEGYPWKCCYKSSSQRL----------PKLPSVKLMFP 366
+ Y+ + R+++ + + + C++ + P S K +
Sbjct: 275 KAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWR 334
Query: 367 QNNSFVVNNPVFVIYGTQ------VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
F + ++I + ++ G + +Q ++ IG M +++D E
Sbjct: 335 SKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQ----NLNLIGDISMQDQMIIYDNEKQ 390
Query: 421 KLGWSHSNCQDL 432
+GW +C +L
Sbjct: 391 SIGWMPVDCDEL 402
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/348 (23%), Positives = 145/348 (41%), Gaps = 24/348 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LG 152
D G D+LW+ C P S+ L DL+ + S T+ ++CS +C
Sbjct: 123 DTGSDILWVTCSSCSNCPHSSG----LGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTA 178
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Q C Y+ Y + + +SG + D + + +L + A ++ GC QSG
Sbjct: 179 AQCSENNQ-CGYSFRY-GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 236
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQ 269
A DG+ G G G++SV S L+ G+ FS C D SG F G+
Sbjct: 237 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGM 296
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
+ + S Y ++ + + + ++ + ++ + IVD+G++ T+L KE Y+
Sbjct: 297 VYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLF 356
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
V+ +T + CY S+ PSV L F S ++ P ++
Sbjct: 357 LNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNFAGGASMML-RPQDYLFHYG 414
Query: 385 VVTG---FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G +C+ Q + +G + V+D ++GW+ +C
Sbjct: 415 IYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 462
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 111/420 (26%), Positives = 169/420 (40%), Gaps = 69/420 (16%)
Query: 46 KNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN--DFGCDL 103
+ RN + A +V L+S ++ Q + + S GS +L D G DL
Sbjct: 154 RIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDL 213
Query: 104 LWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL------GT--SC 155
W V+C P SA Y RD + P+ S+T + C+ C GT SC
Sbjct: 214 TW-----VQCKPCSACYAQ---RD-PLFDPAGSATYAAVRCNASACAASLKAATGTPGSC 264
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + + S G+L D + AL + + GCG+ G
Sbjct: 265 GGGNERCYYALAY-GDGSFSRGVLATDTV--------ALGGASLDGFVFGCGLSNRG-LF 314
Query: 216 DGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQQ 270
G A GL+GLG E+S+ S A + G + FS C D SG + G + +
Sbjct: 315 GGTA--GLMGLGRTELSLVSQTALRYGGV---FSYCLPATTSGDASGSLSLGGDASSYRN 369
Query: 271 ST-----SFLASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYE 322
+T +A + Y + V +G + L A ++DSG+ T L VY
Sbjct: 370 TTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYR 429
Query: 323 TIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
+ AEF RQ + GYP CY + K+P + L V+
Sbjct: 430 GVRAEFTRQF-----AAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDA 484
Query: 376 P--VFVIY--GTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSNC 429
+FV+ G+QV CLA+ + + T IG RVV+D +LG++ +C
Sbjct: 485 AGMLFVVRKDGSQV----CLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 157/372 (42%), Gaps = 46/372 (12%)
Query: 85 FPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 143
F G++T L D G ++PC C C A Y Y AS+ +
Sbjct: 39 FELAGAQTFELIVDTGSSRTYLPCKGCASCGAHEAGRY---------YDYDASADFSRVE 89
Query: 144 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
CS +G C C Y + +Y E + S G LV D++ L GG A+V+
Sbjct: 90 CS-ACAGIGGKC-GTSGVCRYDV-HYLEGSGSEGYLVRDVVSL--GGSVG-----NATVV 139
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS------- 256
GC ++ G + + DGL G G ++ + LA A +I + FSMC + +
Sbjct: 140 FGCEERELGS-IKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVG 198
Query: 257 -----GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT-SFKAIVDSG 310
G FG PA + + S+ Y Y + + +G+S ++ + I+DSG
Sbjct: 199 GLLTLGNFDFGADAPALVYTP--MVSSAMY--YQVTTTSWTLGNSVVEGSRGVLTIIDSG 254
Query: 311 SSFTFLPKEVYET---IAAEFDRQVN-DTITSFEGYPWKCCYKSS----SQRLPKLPSVK 362
+S+T++P ++ +A + R+ + + E YP C S S P++K
Sbjct: 255 TSYTYVPGNMHARFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGLGWSTVSEYFPALK 314
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
+ + + ++ ++ + + + FC+ I D + +GQ M FD ++
Sbjct: 315 IEYHGSARLTLSPETYLYWHQKNASAFCVGILEHDDNRILLGQITMRNTFTEFDVARSQV 374
Query: 423 GWSHSNCQDLND 434
G + +NC+ L +
Sbjct: 375 GMASANCEMLRE 386
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/362 (22%), Positives = 143/362 (39%), Gaps = 52/362 (14%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----D 150
D G DL W+ CD CVRC L P +S + C+ LC +
Sbjct: 78 DTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDLIPCNDPLCKALHLN 123
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C+ P+Q C Y ++Y + SS G+LV D+ + + + + +GCG Q
Sbjct: 124 SNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGLRLTPRLALGCGYDQ 177
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ--GPAT 268
G DG++GLG G++S+ S L G ++N C G +FFGD +
Sbjct: 178 IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSR 237
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
T K+ + +G E G + + DSGSS+T+ + Y+ +
Sbjct: 238 VSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 296
Query: 329 DRQVNDT--ITSFEGYPWKCCYKSSSQRL----------PKLPSVKLMFPQNNSFVVNNP 376
R+++ + + + C++ + P S K + F +
Sbjct: 297 KRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPE 356
Query: 377 VFVIYGTQ------VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
++I + ++ G + +Q ++ IG M +++D E +GW +C
Sbjct: 357 AYLIISMKGNVCLGILNGTEIGLQ----NLNLIGDISMQDQMIIYDNEKQSIGWMPVDCD 412
Query: 431 DL 432
+L
Sbjct: 413 EL 414
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 148/360 (41%), Gaps = 45/360 (12%)
Query: 95 LGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG 152
+G D G DLLW+ C C C S ++ PS SST LS +C +
Sbjct: 74 VGIDTGSDLLWVQCRPCADCFRQSTPIFD----------PSKSSTYVDLSYDSPICPNSP 123
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
N C Y Y +TSS L EDI+ S +SV+ GCG G
Sbjct: 124 QKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV----TVSSVVFGCGHSNRG 179
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKD-DSGRIFFGDQGPA 267
+ DG G++GL G+ S+ S L + FS C FD ++ GD
Sbjct: 180 RF-DG-QQSGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQLVLGDGVKM 231
Query: 268 TQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSFK---AIVDSGSSFTFLPKEV 320
ST F NG Y + G+ I ++T ++DSG++ TFL K+
Sbjct: 232 EGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDG 291
Query: 321 YETIAAEFDRQVNDTITS--FEGYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVVN-NP 376
++ ++ E R V + P CYK ++ L P + F + V++ N
Sbjct: 292 FDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANS 351
Query: 377 VFVIYGTQVVTGFCLAIQPVD-GDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
+FV V FCLA+ + +IG+ IG Y V +D ++ + ++C+ L D
Sbjct: 352 LFVQKNQDV---FCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 408
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/348 (23%), Positives = 145/348 (41%), Gaps = 24/348 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LG 152
D G D+LW+ C P S+ L DL+ + S T+ ++CS +C
Sbjct: 118 DTGSDILWVTCSSCSNCPHSSG----LGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTA 173
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Q C Y+ Y + + +SG + D + + +L + A ++ GC QSG
Sbjct: 174 AQCSENNQ-CGYSFRY-GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQ 269
A DG+ G G G++SV S L+ G+ FS C D SG F G+
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGM 291
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
+ + S Y ++ + + + ++ + ++ + IVD+G++ T+L KE Y+
Sbjct: 292 VYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLF 351
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
V+ +T + CY S+ PSV L F S ++ P ++
Sbjct: 352 LNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNFAGGASMML-RPQDYLFHYG 409
Query: 385 VVTG---FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G +C+ Q + +G + V+D ++GW+ +C
Sbjct: 410 IYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 148/360 (41%), Gaps = 45/360 (12%)
Query: 95 LGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG 152
+G D G DLLW+ C C C S ++ PS SST LS +C +
Sbjct: 74 VGIDTGSDLLWVQCRPCADCFRQSTPIFD----------PSKSSTYVDLSYDSPICPNSP 123
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
N C Y Y +TSS L EDI+ S +SV+ GCG G
Sbjct: 124 QKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV----TVSSVVFGCGHSNRG 179
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKD-DSGRIFFGDQGPA 267
+ DG G++GL G+ S+ S L + FS C FD ++ GD
Sbjct: 180 RF-DG-QQSGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQLVLGDGVKM 231
Query: 268 TQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSFK---AIVDSGSSFTFLPKEV 320
ST F NG Y + G+ I ++T ++DSG++ TFL K+
Sbjct: 232 EGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDG 291
Query: 321 YETIAAEFDRQVNDTITS--FEGYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVVN-NP 376
++ ++ E R V + P CYK ++ L P + F + V++ N
Sbjct: 292 FDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANS 351
Query: 377 VFVIYGTQVVTGFCLAIQPVD-GDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
+FV V FCLA+ + +IG+ IG Y V +D ++ + ++C+ L D
Sbjct: 352 LFVQKNQDV---FCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 408
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 129/314 (41%), Gaps = 40/314 (12%)
Query: 144 CSHRLCD--LGTSCQN----PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 197
C LC L SC N P Q C YT YY + + ++GLL D +G
Sbjct: 190 CDSTLCQGLLVASCGNTKFWPNQTCVYTY-YYNDKSVTTGLLEVDKFTFGAGAS------ 242
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
V GCG+ +G + G+ G G G +S+PS L K G +FS CF +
Sbjct: 243 -VPGVAFGCGLFNNGVFKSN--ETGIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGL 294
Query: 258 RI------FFGD---QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK-- 304
+ D G QST + ++ Y + ++ +GS+ L +++F
Sbjct: 295 KQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALT 354
Query: 305 -----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
I+DSG+S T LP +VY+ + EF Q+ + C+ + SQ P +P
Sbjct: 355 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVP 414
Query: 360 SVKLMFP-QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
+ L F N VF + + CLAI + + TIG V++D +
Sbjct: 415 KLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQ 474
Query: 419 NLKLGWSHSNCQDL 432
N L + + C L
Sbjct: 475 NNMLSFVAAQCDKL 488
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 46/103 (44%), Gaps = 3/103 (2%)
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
I+DSG+S T LP +VY+ + EF Q+ + C+ + SQ P +P + L F
Sbjct: 66 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHF 125
Query: 366 P-QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF 407
N VF + + CLAI GD TI NF
Sbjct: 126 EGATMDLPRENYVFEVPDDAGNSIICLAIN--KGDETTIIGNF 166
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 147/373 (39%), Gaps = 54/373 (14%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ ++L D G DL+W C C+ C A+ LD P+ASST L C LC
Sbjct: 101 RPVALTLDTGSDLVWTQCAPCLDCFEQGAA--PVLD-------PAASSTHAALPCDAPLC 151
Query: 150 DL--GTSCQNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
TSC + C Y +Y + + + G L D GGD+ V
Sbjct: 152 RALPFTSCGGRSWGDRSCVYVY-HYGDRSLTVGQLATDSFTF--GGDDNAGGLAARRVTF 208
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIF 260
GCG G + G+ G G G S+PS L SFS CF D S +
Sbjct: 209 GCGHINKGIF--QANETGIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFDTKSSSVVT 261
Query: 261 FGDQGPATQQSTSFLASNGKYIT------------YIIGVETCCIGSS--CLKQTSFKA- 305
G A T A G T Y + + +G + + ++ ++
Sbjct: 262 LG-AAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSS 320
Query: 306 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---SSSQRLPKLPSV 361
I+DSG+S T LP++VYE + AEF QV + C+ ++ R P +P++
Sbjct: 321 TIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRPAVPAL 380
Query: 362 KLMFPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
L + + N VF Y +V C+ + G+ IG VV+D EN
Sbjct: 381 TLHLDGGADWELPRGNYVFEDYAARV---LCVVLDAAAGEQVVIGNYQQQNTHVVYDLEN 437
Query: 420 LKLGWSHSNCQDL 432
L ++ + C L
Sbjct: 438 DVLSFAPARCDKL 450
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 100/393 (25%), Positives = 155/393 (39%), Gaps = 97/393 (24%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASST--SKHLS 143
Q SK L D G DL W+ CD CV+C YY R N P S H +
Sbjct: 42 QPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYY----RPRNNLVPCMDPICQSLHSN 97
Query: 144 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
HR C+NP Q C Y ++Y + SS G+LV D +L N + ++
Sbjct: 98 GDHR-------CENPGQ-CDYEVEY-ADGGSSFGVLVTDTFNL-----NFTSEKRHSPLL 143
Query: 204 -IGCGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD------- 254
+GCG Q GG + DG++GLG G+ S+ S L+ GL+RN C
Sbjct: 144 ALGCGYDQFPGGSHHPI--DGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLFF 201
Query: 255 -----DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV-- 307
DS R+ + P + + LA E G K T FK ++
Sbjct: 202 GDDLYDSSRVAWTPMSPDAKHYSPGLA------------ELTFDG----KTTGFKNLLTT 245
Query: 308 -DSGSSFTFLPKEVYETIAAEFDRQVN-----------------------DTITSFEGYP 343
DSG+S+T+L + Y+ + + ++++ +I + Y
Sbjct: 246 FDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYF 305
Query: 344 WKCCYKSSSQRLPKLPSVKLMFPQNNSFVV----NNPVFVIYGTQVVTGFCLAIQPVDGD 399
+++R K +L FP ++ N + ++ GT+V D
Sbjct: 306 KTFALSFTNERKSK---TELEFPPEAYLIISSKGNACLGILNGTEVGL----------ND 352
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ IG M V++D E ++GW+ NC L
Sbjct: 353 LNVIGDISMQDRVVIYDNEKERIGWAPGNCNRL 385
>gi|388513215|gb|AFK44669.1| unknown [Lotus japonicus]
Length = 101
Score = 79.7 bits (195), Expect = 3e-12, Method: Composition-based stats.
Identities = 35/81 (43%), Positives = 55/81 (67%), Gaps = 2/81 (2%)
Query: 21 GAETVMFSTKLIHRFSEEVKALGVSKNRNAT--SWPAKKSFEYYQVLLSSDVQKQKMKTG 78
G V FS++L+HRFSEE K S+ A SWP K + EY+++LL+SD+ +Q+MK G
Sbjct: 19 GEAAVTFSSRLVHRFSEEAKVHLASRGNGAALQSWPNKSTSEYFRLLLNSDLTRQRMKLG 78
Query: 79 PQFQMLFPSQGSKTMSLGNDF 99
Q++ ++PS+G +T GN++
Sbjct: 79 SQYESMYPSKGGQTFFFGNEW 99
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 155/356 (43%), Gaps = 44/356 (12%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++ M++ D G DL W V+C P S Y ++ + P+ SST + C+ C
Sbjct: 156 ARDMTVVFDTGSDLSW-----VQCTPCSDCY----EQKDPLFDPARSSTYSAVPCASPEC 206
Query: 150 D--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
SC K+ C Y + Y + + + G L D L L ++ V + GCG
Sbjct: 207 QGLDSRSCSRDKK-CRYEV-VYGDQSQTDGALARDTLTLT-------QSDVLPGFVFGCG 257
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCFDKDDS--GRIFFGDQ 264
+ +G L G A DGL+GLG ++S+ S A K G FS C S G + G
Sbjct: 258 EQDTG--LFGRA-DGLVGLGREKVSLSSQAASKYG---AGFSYCLPSSPSAAGYLSLGGP 311
Query: 265 GPATQQSTSFLASNGK---YITYIIGVETC--CIGSSCLKQTSFKAIVDSGSSFTFLPKE 319
PA + T+ + Y ++GV+ + S + ++ ++DSG+ T LP
Sbjct: 312 APANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVITRLPPR 371
Query: 320 VYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
VY + + F R + ++ P CY + ++PSV L+F + V +
Sbjct: 372 VYAALRSAFARSMGR--YGYKRAPALSILDTCYDFTGHTTVRIPSVALVF-AGGAAVGLD 428
Query: 376 PVFVIYGTQVVTGFCLAIQP-VDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V+Y + V+ CLA P DG D G IG VV+D K+G+ + C
Sbjct: 429 FSGVLYVAK-VSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGC 483
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 148/360 (41%), Gaps = 45/360 (12%)
Query: 95 LGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG 152
+G D G DLLW+ C C C S ++ PS SST LS +C +
Sbjct: 106 VGIDTGSDLLWVQCRPCADCFRQSTPIFD----------PSKSSTYVDLSYDSPICPNSP 155
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
N C Y Y +TSS L EDI+ S +SV+ GCG G
Sbjct: 156 QKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV----TVSSVVFGCGHSNRG 211
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKD-DSGRIFFGDQGPA 267
+ DG G++GL G+ S+ S L + FS C FD ++ GD
Sbjct: 212 RF-DG-QQSGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQLVLGDGVKM 263
Query: 268 TQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSFK---AIVDSGSSFTFLPKEV 320
ST F NG Y + G+ I ++T ++DSG++ TFL K+
Sbjct: 264 EGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDG 323
Query: 321 YETIAAEFDRQVNDTITS--FEGYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVVN-NP 376
++ ++ E R V + P CYK ++ L P + F + V++ N
Sbjct: 324 FDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANS 383
Query: 377 VFVIYGTQVVTGFCLAIQPVD-GDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
+FV V FCLA+ + +IG+ IG Y V +D ++ + ++C+ L D
Sbjct: 384 LFVQKNQDV---FCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 440
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 94/396 (23%), Positives = 165/396 (41%), Gaps = 66/396 (16%)
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYY 121
Q+ LSS V+ Q + ++ G + M++ D G DL W+ C C C Y
Sbjct: 53 QIPLSSGVRLQTLNYIVTVEI-----GGRNMTVIVDTGSDLTWVQCQPCRLC-------Y 100
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLC--------DLGTSCQNPKQPCPYTMDYYTENT 173
N D N PS S + + + C+ C +LG C + C Y ++Y +
Sbjct: 101 NQQDPLFN---PSGSPSYQTILCNSSTCQSLQYATGNLGV-CGSNTPTCNYVVNYGDGSY 156
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
+ L +E + L + ++ I GCG + + G G + GL+GLG ++S+
Sbjct: 157 TRGDLGMEQL---------NLGTTHVSNFIFGCG-RNNKGLFGGAS--GLMGLGKSDLSL 204
Query: 234 PSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQST----SFLASNGKYIT-YI 285
S + + FS C D SG + G + +T + + +N + T Y
Sbjct: 205 VS--QTSAIFEGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYF 262
Query: 286 IGVETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 342
+ + IG L+ +++ ++DSG+ T LP VY + AEF +Q F G+
Sbjct: 263 LNLTGISIGGVALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQ-------FSGF 315
Query: 343 P-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQP 395
P C+ + +P++++ F N V+ + + CLA+
Sbjct: 316 PSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALAS 375
Query: 396 V--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ D +I IG RV+++ + KLG++ C
Sbjct: 376 LSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEAC 411
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 152/360 (42%), Gaps = 46/360 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL W +CAP + + + + Y P+ SST L C+ LC S
Sbjct: 114 DTGSDLTW-----TQCAPCTTACFA---QPTPLYDPARSSTFSKLPCASPLCQALPSAFR 165
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
DY ++G L D L + G + +S A V GC +GG +DG
Sbjct: 166 ACNATGCVYDYRYAVGFTAGYLAADTLAIGDGDGDGDASSSFAGVAFGCS-TANGGDMDG 224
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD-DSGR--IFFGDQGPATQ---QS 271
+ G++GLG +S LL++ G+ R FS C D D+G I FG T QS
Sbjct: 225 AS--GIVGLGRSALS---LLSQIGVGR--FSYCLRSDADAGASPILFGALANVTGDKVQS 277
Query: 272 TSFL----ASNGKYITYIIGVETCCIGSSCLKQTS----FKA------IVDSGSSFTFLP 317
T+ L A+ + Y + + +GS+ L TS F A IVDSG++FT+L
Sbjct: 278 TALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLA 337
Query: 318 KEVYETIAAEFDRQVNDTITSFEG--YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
+ Y + F Q +T G + + C+++ + P +P + F + V
Sbjct: 338 EAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEAGAADTP-VPRLVFRFAGGAEYAVPR 396
Query: 376 PVF---VIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ V G +V CL + P G + IG V++D + ++ ++C L
Sbjct: 397 QSYFDAVDEGGRVA---CLLVLPTRG-VSVIGNVMQMDLHVLYDLDGATFSFAPADCASL 452
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/353 (23%), Positives = 143/353 (40%), Gaps = 35/353 (9%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 155
D G +L W+ CD C +C+ Y + N++ P L + +C
Sbjct: 92 DTGSELTWLQCDAPCSQCSETPHPLY----KPSNDFIPCKDPLCASLQPTDDY-----TC 142
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV--IIGCGMKQSGG 213
++P Q C Y + Y + S+ G+L+ D+ L N VQ V +GCG Q
Sbjct: 143 EDPNQ-CDYEIKY-ADQYSTLGVLLNDVYLL------NFTNGVQLKVRMALGCGYDQIFS 194
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
DG++GLG G+ S+ S L GL+RN C G IFFG+ +++ S +
Sbjct: 195 PSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFFGNVYDSSRMSWT 254
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 333
++S Y G G S I D+GSS+T+ + Y+ + + +++++
Sbjct: 255 PISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFNSQAYQAMISLLNKELH 314
Query: 334 --------DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN----SFVVNNPVFVIY 381
D T + K ++S ++ + L F F + ++I
Sbjct: 315 RKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKPLTLSFTNGGRVKPQFEIPPEAYLII 374
Query: 382 GT--QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
V G + G++ IG M +VFD E +GW ++C +
Sbjct: 375 SNMGNVCLGILNGPEVGLGELNLIGDISMLDKVMVFDNEKQLIGWGPADCNSV 427
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 151/359 (42%), Gaps = 57/359 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
K M L D G L+W C C C P + + P+ S++ K L CS +LC
Sbjct: 143 KEMPLIFDTGSGLIWTQCKPCKACYP-----------KVPVFDPTKSASFKGLPCSSKLC 191
Query: 150 D-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
+ C +PK C Y + Y +N+SS+G L + + + LK + +++IGC
Sbjct: 192 QSIRQGCSSPK--CTY-LTAYVDNSSSTGTLATETISF-----SHLKYDFK-NILIGCSD 242
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--DSGRIFFGDQGP 266
+ SG + + G++GL IS+ S A + FS C +G + FG + P
Sbjct: 243 QVSG---ESLGESGIMGLNRSPISLAS--QTANIYDKLFSYCIPSTPGSTGHLTFGGKVP 297
Query: 267 ATQQ--STSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEV 320
+ S A + Y + G+ I +S K S +DSG+ T LP +
Sbjct: 298 NDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAS---TIDSGAVLTRLPPKA 354
Query: 321 YETIAAEFDRQVNDTITSFEGYPW-------KCCYKSSSQRLPKLPSVKLMFPQ--NNSF 371
Y + + F + +GYP CY S+ +PS+ + F
Sbjct: 355 YSALRSVFREMM-------KGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDI 407
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
V+ ++ + G++V +CLA +D ++ G Y VVFD ++G++ C
Sbjct: 408 DVSGIMWQVPGSKV---YCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGCD 463
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 152/357 (42%), Gaps = 41/357 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LG 152
D G D+LW+ C+ P S+ L LN + S+SS+S +SCS +C+
Sbjct: 97 DTGSDILWVNCNSCNGCPRSSG----LGIQLNFFDASSSSSSSLVSCSDPICNSAFQTTA 152
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVIIGCGMKQS 211
T C C YT Y + + +SG V + ++ + G + + NS ASV+ GC QS
Sbjct: 153 TQCLTQSNQCSYTFQY-GDGSGTSGYYVSESMYFDMVMGQSMIANS-SASVVFGCSTYQS 210
Query: 212 GGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGDQGPAT 268
G A DG+ G G G++SV S L+ G+ FS C + + G + G+
Sbjct: 211 GDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEGNGGGILVLGEVLEPG 270
Query: 269 QQSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVY-- 321
+ + S Y Y+ + +T I S + + I+DSG++ +L +E Y
Sbjct: 271 IVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINRGTIIDSGTTLAYLVEEAYTP 330
Query: 322 --ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
I A + V TI+ CY S+ P V L F + S V+ ++
Sbjct: 331 FVSAITAAVSQSVTPTISK-----GNQCYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYL 385
Query: 380 IYGTQVVTGF-------CLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ GF C+ Q V + +G M V+D ++GW+ +C
Sbjct: 386 MH-----LGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQRIGWASYDC 437
>gi|306015413|gb|ADM76760.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015419|gb|ADM76763.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015425|gb|ADM76766.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015431|gb|ADM76769.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015433|gb|ADM76770.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015435|gb|ADM76771.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015437|gb|ADM76772.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015439|gb|ADM76773.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015441|gb|ADM76774.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015443|gb|ADM76775.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015447|gb|ADM76777.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015451|gb|ADM76779.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015453|gb|ADM76780.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015459|gb|ADM76783.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015461|gb|ADM76784.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015463|gb|ADM76785.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015465|gb|ADM76786.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015467|gb|ADM76787.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015471|gb|ADM76789.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015473|gb|ADM76790.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015477|gb|ADM76792.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015481|gb|ADM76794.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015483|gb|ADM76795.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015493|gb|ADM76800.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015495|gb|ADM76801.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015497|gb|ADM76802.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015499|gb|ADM76803.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015501|gb|ADM76804.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015503|gb|ADM76805.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015507|gb|ADM76807.1| aspartyl protease-like protein, partial [Picea sitchensis]
Length = 114
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/76 (56%), Positives = 51/76 (67%), Gaps = 7/76 (9%)
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSN----PLPANQEQS 458
IGQNFMT YR+VFDRENLKLGWS S+C L D + + P P +P N P Q+Q+
Sbjct: 2 IGQNFMTSYRLVFDRENLKLGWSPSDCYQL-DENEGAVAPAP-SPQNGWKTRTPLQQQQT 59
Query: 459 SPGGHAVGPAVAGRAP 474
SP G AV PA+AGR P
Sbjct: 60 SP-GRAVAPAIAGRTP 74
>gi|306015415|gb|ADM76761.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015421|gb|ADM76764.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015423|gb|ADM76765.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015427|gb|ADM76767.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015429|gb|ADM76768.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015445|gb|ADM76776.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015449|gb|ADM76778.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015455|gb|ADM76781.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015457|gb|ADM76782.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015469|gb|ADM76788.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015475|gb|ADM76791.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015479|gb|ADM76793.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015485|gb|ADM76796.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015487|gb|ADM76797.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015489|gb|ADM76798.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015491|gb|ADM76799.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015505|gb|ADM76806.1| aspartyl protease-like protein, partial [Picea sitchensis]
Length = 114
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/76 (56%), Positives = 51/76 (67%), Gaps = 7/76 (9%)
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSN----PLPANQEQS 458
IGQNFMT YR+VFDRENLKLGWS S+C L D + + P P +P N P Q+Q+
Sbjct: 2 IGQNFMTSYRLVFDRENLKLGWSPSDCYQL-DENEGAVAPAP-SPQNGWRTRTPLQQQQT 59
Query: 459 SPGGHAVGPAVAGRAP 474
SP G AV PA+AGR P
Sbjct: 60 SP-GRAVAPAIAGRTP 74
>gi|306015417|gb|ADM76762.1| aspartyl protease-like protein, partial [Picea sitchensis]
Length = 114
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/76 (56%), Positives = 51/76 (67%), Gaps = 7/76 (9%)
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSN----PLPANQEQS 458
IGQNFMT YR+VFDRENLKLGWS S+C L D + + P P +P N P Q+Q+
Sbjct: 2 IGQNFMTSYRLVFDRENLKLGWSPSDCYQL-DENEGAVAPAP-SPQNGWRTRTPLQQQQT 59
Query: 459 SPGGHAVGPAVAGRAP 474
SP G AV PA+AGR P
Sbjct: 60 SP-GRAVAPAIAGRTP 74
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 151/366 (41%), Gaps = 44/366 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K ++ D G D+LW+ C+ P S+ L +LN + SST+ + CS +C
Sbjct: 89 KEFNVQIDTGSDILWVNCNTCSNCPQSSQ----LGIELNFFDTVGSSTAALIPCSDPICT 144
Query: 151 -----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH--LISGGDNALKNSVQASVI 203
C C YT Y + + +SG V D ++ LI G A+ +S A+++
Sbjct: 145 SRVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFSLIMGQPPAVNSS--ATIV 201
Query: 204 IGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG----- 257
GC + QSG A DG+ G G G +SV S L+ G+ FS C D G
Sbjct: 202 FGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGGGVLV 261
Query: 258 -------RIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 307
I + P+ + +A NG+ + V + + IV
Sbjct: 262 LGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFS-------ISNNRGGTIV 314
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
D G++ +L +E Y+ + + V+ + T+ +G CY S+ PSV L F
Sbjct: 315 DCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTSIGDIFPSVSLNF 371
Query: 366 PQNNSFVVNNPVFVIYGTQV--VTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
S V+ ++++ + +C+ Q +G + VV+D ++G
Sbjct: 372 EGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIG 431
Query: 424 WSHSNC 429
W++ +C
Sbjct: 432 WANYDC 437
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 145/355 (40%), Gaps = 38/355 (10%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++T ++ D G D+ WI +C P S Y D + P+ S+T + C H C
Sbjct: 145 AQTYTVIFDTGSDVSWI-----QCLPCSGHCYKQHDP---IFDPTKSATYSVVPCGHPQC 196
Query: 150 DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
G+ C N C Y ++Y + +SS+G+L + L L S GCG
Sbjct: 197 AAADGSKCSNGT--CLYKVEY-GDGSSSAGVLSHETLSLTS-------TRALPGFAFGCG 246
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQG 265
G + D DGLIGLG G++S+ S A + +FS C D++ G + G
Sbjct: 247 QTNLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTIGPTT 301
Query: 266 PATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLP 317
PA+ Q T+ + Y + + + IG L T +DSG+ T+LP
Sbjct: 302 PASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTILTYLP 361
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
E Y + F + + P+ CY + Q +P+V F + F ++
Sbjct: 362 PEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFG 421
Query: 378 FVIYGTQVVTGF-CLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+I+ CL +P +G V++D K+G++ ++C
Sbjct: 422 ILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 147/358 (41%), Gaps = 45/358 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LG 152
D +L W V+CAP + + D+ + PS+S + + C CD L
Sbjct: 159 DTASELTW-----VQCAPCESCH----DQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLA 209
Query: 153 TSCQNPKQPC----PYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
T PC P Y Y + + S G+L D L +L V + G
Sbjct: 210 TGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRL--------SLAGEVIDGFVFG 261
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK--AGLIRNSFSMCFDKDDSGRIFFGD 263
CG G G + GL+GLG ++S+ S G+ + + D SG + GD
Sbjct: 262 CGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGD 319
Query: 264 QGPATQQST----SFLASNGKYIT----YIIGVETCCIGSSCLKQTSF--KAIVDSGSSF 313
A + ST + + SN + Y++ + +G ++ T F +AIVDSG+
Sbjct: 320 DPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAIVDSGTVI 379
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T L VY + AEF Q+ + + C+ + + ++PS+ L+F V
Sbjct: 380 TSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVEV 439
Query: 374 NNPVFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ + + + + CLA+ + + + IG RVVFD ++G++ C
Sbjct: 440 DSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 160/370 (43%), Gaps = 47/370 (12%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS--TS 139
L+ Q K L D G DL W+ CD C +C Y + N+ P S
Sbjct: 61 LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLY----QPSNDLVPCKDPLCMS 116
Query: 140 KHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSV 198
H S HR C+NP Q C Y ++Y + SS G+LV D+ L ++ GD +
Sbjct: 117 LHSSMDHR-------CENPDQ-CDYEVEY-ADGGSSLGVLVRDVFPLNLTNGD-----PI 162
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
+ + +GCG Q G DG++GLG G +S+ S L G++RN CF+ G
Sbjct: 163 RPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGY 222
Query: 259 IFFGDQ--GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
FFGD P T K+ + G E G S + F + DSGSS+T+
Sbjct: 223 XFFGDGIYDPYRLVWTPMSRDYPKHYSPGFG-ELIFNGRSTGLRNLF-VVFDSGSSYTYF 280
Query: 317 PKEVYETIAAEFDRQV--NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-PQNNSFV- 372
+ Y+ + + +R++ + + C++ + + L V+ F P SF
Sbjct: 281 NAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWR-GRKPIKSLRDVRKYFKPLALSFSS 339
Query: 373 --VNNPVFVI--YGTQVVTGF---CLAIQPVDG-DIG-----TIGQNFMTGYRVVFDREN 419
+ VF I G +++ CL I ++G D+G IG M VV++ E
Sbjct: 340 GGRSKAVFEIPTEGYMIISSMGNVCLGI--LNGTDVGLENSNIIGDISMQDKMVVYNNEK 397
Query: 420 LKLGWSHSNC 429
+GW+ +NC
Sbjct: 398 QAIGWATANC 407
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 150/359 (41%), Gaps = 53/359 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL------ 151
D +L W V+C P A + D+ + PS+S + + C+ CD
Sbjct: 129 DTASELTW-----VQCEPCDACH----DQQEPLFDPSSSPSYAAVPCNSSSCDALRVATG 179
Query: 152 --GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
G +C + C YT+ Y + + S G+L D L L +G D +Q + GCG
Sbjct: 180 MSGQACDDQPAACSYTLSY-RDGSYSRGVLAHDRLSL-AGED------IQG-FVFGCGTS 230
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPS-LLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG 265
G + GL+GLG ++S+ S + + G + FS C + SG + GD
Sbjct: 231 NQGPF---GGTSGLMGLGRSQLSLISQTMDQFGGV---FSYCLPPKESGSSGSLVLGDDA 284
Query: 266 PATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLKQTSF------KAIVDSGSS 312
+ ST + + G + Y+ + +G ++ F KAIVDSG+
Sbjct: 285 SVYRNSTPIVYTAMVSDPLQGPF--YLANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTI 342
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
T L VY + AEF Q+ + + C+ + R ++PS+KL+F
Sbjct: 343 ITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSLKLVFDGGAEVE 402
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSNC 429
V++ + T + CLA+ + + T IG RV+FD ++G++ C
Sbjct: 403 VDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQIGFAQETC 461
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 153/361 (42%), Gaps = 62/361 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD------- 150
D +L W V+CAP ++ + D+ + P++S + L C+ CD
Sbjct: 143 DTASELTW-----VQCAPCASCH----DQQGPLFDPASSPSYAVLPCNSSSCDALQVATG 193
Query: 151 -LGTSCQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
+C +QP C YT+ Y + + S G+L D L +L V + GCG
Sbjct: 194 SAAGACGGGEQPSCSYTLSY-RDGSYSQGVLAHDKL--------SLAGEVIDGFVFGCGT 244
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPS-LLAKAGLIRNSFSMCF---DKDDSGRIFFGDQ 264
G + GL+GLG ++S+ S + + G + FS C + + SG + GD
Sbjct: 245 SNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGGV---FSYCLPLKESESSGSLVLGDD 298
Query: 265 GPATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+ ST + + G + Y + + IG ++ ++ K IVDSG+ T L
Sbjct: 299 TSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGITIGGQEVESSAGKVIVDSGTIITSLV 356
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNS 370
VY + AEF ++ F YP C+ + R ++PS+K +F N
Sbjct: 357 PSVYNAVKAEF-------LSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVE 409
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSN 428
V++ + + + + CLA+ + + T IG RV+FD ++G++
Sbjct: 410 VEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQET 469
Query: 429 C 429
C
Sbjct: 470 C 470
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 186/430 (43%), Gaps = 53/430 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C C ++ P AS T + + C+ + C+ C
Sbjct: 111 DTGSTVTYVPCSTCKHCG----------SHQDPKFRPEASETYQPVKCTWQ-CN----CD 155
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ ++ C Y Y E ++SSG+L ED+ +S G+ + + +A I GC ++G +
Sbjct: 156 DDRKQCTYERRY-AEMSTSSGVLGEDV---VSFGNQSELSPQRA--IFGCENDETGDIYN 209
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 276
A DG++GLG G++S+ L + +I ++FS+C+ G G + F
Sbjct: 210 QRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISPPADMVFTH 268
Query: 277 SNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEFD 329
S+ + Y I ++ + L ++DSG+++ +LP+ +
Sbjct: 269 SDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIM 328
Query: 330 RQVN--DTITSFEGYPWKCCYKSS----SQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGT 383
++ + I+ + + C+ + SQ P V+++F + ++ ++ +
Sbjct: 329 KETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHS 388
Query: 384 QVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTP 442
+V +CL + D T +G + V++DRE+ K+G+ +NC +L + P
Sbjct: 389 KVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTNCSELWERLHVSNAP 448
Query: 443 GPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPSTASTQL------ISSRSSSLKVLP 496
P P N + A P+V APS PS + QL IS S + + P
Sbjct: 449 PPLMPPKSEGTNLTK------AFKPSV---APS-PSQYNLQLGIMSFVISFNISYMDIKP 498
Query: 497 FLLLLRLLVS 506
++ L L++
Sbjct: 499 YITELTGLIA 508
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 159/396 (40%), Gaps = 85/396 (21%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+T+ L D G DL+W PC C C+ +++ + N + P +SS+SK L C +
Sbjct: 101 QTLPLIMDTGSDLVWFPCTHRYVCRNCS------FSTSNPSSNIFIPKSSSSSKVLGCVN 154
Query: 147 RLC-------------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
C D + N Q CP + +Y + G+++ + L L G
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITG-GIMLSETLDLPGKG--- 210
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF------ 247
+ I+GC + L P G+ G G G S+PS L GL + S+
Sbjct: 211 -----VPNFIVGCSV------LSTSQPAGISGFGRGPPSLPSQL---GLKKFSYCLLSRR 256
Query: 248 --------SMCFDKD-DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
S+ D + DSG G Q+ + + Y +G+ +G +
Sbjct: 257 YDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHV 316
Query: 299 KQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYP-WK 345
K +K I+DSG++FT++ E++E +AAEF++QV + T EG +
Sbjct: 317 K-IPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLR 375
Query: 346 CCYKSSSQRLPKLPSVKLMFP--QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG-- 401
C+ S P P + L F + N V + G VV CL I DG G
Sbjct: 376 PCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVV---CLTIV-TDGAAGKE 431
Query: 402 -------TIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+G + V +D N +LG+ +C+
Sbjct: 432 FSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSCK 467
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 153/361 (42%), Gaps = 62/361 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD------- 150
D +L W V+CAP ++ + D+ + P++S + L C+ CD
Sbjct: 142 DTASELTW-----VQCAPCASCH----DQQGPLFDPASSPSYAVLPCNSSSCDALQVATG 192
Query: 151 -LGTSCQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
+C +QP C YT+ Y + + S G+L D L +L V + GCG
Sbjct: 193 SAAGACGGGEQPSCSYTLSY-RDGSYSQGVLAHDKL--------SLAGEVIDGFVFGCGT 243
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPS-LLAKAGLIRNSFSMCF---DKDDSGRIFFGDQ 264
G + GL+GLG ++S+ S + + G + FS C + + SG + GD
Sbjct: 244 SNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGGV---FSYCLPLKESESSGSLVLGDD 297
Query: 265 GPATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+ ST + + G + Y + + IG ++ ++ K IVDSG+ T L
Sbjct: 298 TSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGITIGGQEVESSAGKVIVDSGTIITSLV 355
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNS 370
VY + AEF ++ F YP C+ + R ++PS+K +F N
Sbjct: 356 PSVYNAVKAEF-------LSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVE 408
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSN 428
V++ + + + + CLA+ + + T IG RV+FD ++G++
Sbjct: 409 VEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQET 468
Query: 429 C 429
C
Sbjct: 469 C 469
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/259 (30%), Positives = 112/259 (43%), Gaps = 26/259 (10%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 141
L+P + L D G DL WI CD C CA + ++Y R N P K
Sbjct: 194 LYPDGPPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWYKP--RRGNIVPP------KD 245
Query: 142 LSCSH-RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 200
L C + C+ Q C Y ++Y +++SS G+L D L L+ + K
Sbjct: 246 LLCMEVQRNQKAGYCETCDQ-CDYEIEY-ADHSSSMGVLATDKLLLMVANGSLTK----L 299
Query: 201 SVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--DSG 257
+ I GC Q G L V DG++GL ++S+PS LA G+I N C D G
Sbjct: 300 NFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGG 359
Query: 258 RIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----QTSFKAIV-DSGS 311
+F GD P + + + Y V GSS L ++ K I+ DSGS
Sbjct: 360 YMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGS 419
Query: 312 SFTFLPKEVYETIAAEFDR 330
S+T+ PKE Y + A +
Sbjct: 420 SYTYFPKEAYSELVASLNE 438
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 150/377 (39%), Gaps = 57/377 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ ++L D G DL+W +CAP ++ L P+ASST L C C
Sbjct: 103 RPVALTLDTGSDLVW-----TQCAPCRDCFHQGLPL----LDPAASSTYAALPCGAPRCR 153
Query: 151 L--GTSCQ--------NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 200
TSC N + C Y + +Y + + + G + D GGDN +S
Sbjct: 154 ALPFTSCGGGGRSSWGNGNRSCAY-IYHYGDKSVTVGEIATD--RFTFGGDNGDGDSRLP 210
Query: 201 S--VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDD 255
+ + GCG G + G+ G G G S+PS L +FS CF +
Sbjct: 211 TRRLTFGCGHFNKGVFQSN--ETGIAGFGRGRWSLPSQLNV-----TTFSYCFTSMFESK 263
Query: 256 SGRIFFGDQGPAT------------QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 303
S + G A ++T L + + Y + ++ +G + L
Sbjct: 264 SSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEA 323
Query: 304 K---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS-FEGYPWKCCYK---SSSQRLP 356
K I+DSG+S T LP+ VYE + AEF QV T EG C+ ++ R P
Sbjct: 324 KLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWRRP 383
Query: 357 KLPSVKLMFPQNN-SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVF 415
+PS+ L + N VF +V+ C+ + GD IG VV+
Sbjct: 384 PVPSLTLHLDGADWELPRGNYVFEDLAARVM---CVVLDAAPGDQTVIGNFQQQNTHVVY 440
Query: 416 DRENLKLGWSHSNCQDL 432
D EN L ++ + C L
Sbjct: 441 DLENDWLSFAPARCDSL 457
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 152/387 (39%), Gaps = 69/387 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ ++L D G DL+W C C+ C A + P+ASST + C +C
Sbjct: 105 RPVALTLDTGSDLVWTQCAPCLNCFDQGA---------IPVLDPAASSTHAAVRCDAPVC 155
Query: 150 DL--GTSCQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV-QAS 201
TSC ++ C Y +Y + + + G L D GDNA V +
Sbjct: 156 RALPFTSCGRGGSSWGERSCVYVY-HYGDKSITVGKLASDRFTF-GPGDNADGGGVSERR 213
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGR 258
+ GCG G + G+ G G G S+PS L SFS CF + S
Sbjct: 214 LTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV-----TSFSYCFTSMFESTSSL 266
Query: 259 IFFGDQGPAT------QQSTSFLASNGKYITYIIGVETCCIGSSCL-------KQTSFKA 305
+ G PA QST L + Y + ++ +G++ + + A
Sbjct: 267 VTLG-VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASA 325
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK-------- 357
I+DSG+S T LP++VYE + AEF QV +++ EG C+ S PK
Sbjct: 326 IIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWR 385
Query: 358 ---------LPSVKLMFPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDG---DIGTI 403
+P + + + N VF YG +V+ CL + G I
Sbjct: 386 GRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVM---CLVLDAATGGGDQTVVI 442
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNCQ 430
G VV+D EN L ++ + C+
Sbjct: 443 GNYQQQNTHVVYDLENDVLSFAPARCE 469
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 143/364 (39%), Gaps = 48/364 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C ++ R+ Y P+ + + C L
Sbjct: 75 KVYDLDIDTGSDLTWVQCDAPCQGC---------TIPRN-RLYKPNGNL----VKCGDPL 120
Query: 149 CDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
C S C P + C Y ++Y + +S LL ++I + G A + +
Sbjct: 121 CKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLA-----RPILA 175
Query: 204 IGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GCG Q G+ + G++GLG G+ S+ S L GLIRN C + G +FFG
Sbjct: 176 FGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERGGGFLFFG 235
Query: 263 DQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVY 321
DQ P + + L + Y G + I DSGSS+T+ + +
Sbjct: 236 DQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYTYFNSKAH 295
Query: 322 ETI---------AAEFDRQVNDT---ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 369
+ + R D+ I P+K + +S P L L F ++
Sbjct: 296 KALVNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPLL----LSFTKSK 351
Query: 370 SFVVNNP----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ ++ P + V V G + G+ IG + V++D E ++GW+
Sbjct: 352 NSLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQQIGWA 411
Query: 426 HSNC 429
+NC
Sbjct: 412 SANC 415
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 151/403 (37%), Gaps = 61/403 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C RD Y P+ + + C +L
Sbjct: 75 KLYDLDIDSGSDLTWVQCDAPCKGCTK---------PRD-QLYKPNHNL----VQCVDQL 120
Query: 149 CD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL--HLISGGDNALKNSVQAS 201
C + +C +P C Y ++Y ++ SS G+LV D + +G + V+
Sbjct: 121 CSEVQLSMEYTCASPDDQCDYEVEY-ADHGSSLGVLVRDYIPFQFTNG------SVVRPR 173
Query: 202 VIIGCGMKQS-GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
V GCG Q G A G++GLG G S+ S L GLI N C G +F
Sbjct: 174 VAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSARGGGFLF 233
Query: 261 FGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 319
FGD P++ + + + Y G + I DSGSS+T+ +
Sbjct: 234 FGDDFIPSSGIVWTSMLPSSSEKHYSSGPAELVFNGKATVVKGLELIFDSGSSYTYFNSQ 293
Query: 320 VYETI---------AAEFDRQVNDTITSFEGYPWKCC--YKSSSQRLPKLPSVKLMFPQN 368
Y+ + + R +D WK +KS S + L F +
Sbjct: 294 AYQAVVDLVTQDLKGKQLKRATDDPSLPI---CWKGAKSFKSLSDVKKYFKPLALSFTKT 350
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDG------DIGTIGQNFMTGYRVVFDRENLKL 422
++ P CL I +DG ++ IG + V++D E ++
Sbjct: 351 KILQMHLPPEAYLIITKHGNVCLGI--LDGTEVGLENLNIIGDISLQDKMVIYDNEKQQI 408
Query: 423 GWSHSNC-------QDLNDGTKSPLTPGPGTPSNPLPANQEQS 458
GW SNC +DL P G + PA+ E++
Sbjct: 409 GWVSSNCDRLPNVDRDLEGDFPHPYATNLGIFGDRCPASYEET 451
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 113/459 (24%), Positives = 169/459 (36%), Gaps = 70/459 (15%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
+ + + L E A FS LIHR S SK R +A A + +
Sbjct: 13 VVVGFLFHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSASRVGRFR 72
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLG-----------NDFGCDLLWIPCD-C 110
Q ++SD + + L PS G M+L D G DL W C C
Sbjct: 73 QSAMTSDGIQSR---------LVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPC 123
Query: 111 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT--SCQNPKQPCPYTMD 167
C +++ P SST + SC C LG SC+N K+ C +
Sbjct: 124 THCYKQVVPFFD----------PKNSSTYRDSSCGTSFCLALGNDRSCRNGKK-CTFMYS 172
Query: 168 YYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLG 227
Y + + L VE + + G K GC + +SGG D + G++GLG
Sbjct: 173 YADGSFTGGNLAVETLTVASTAG----KPVSFPGFAFGC-VHRSGGIFDEHS-SGIVGLG 226
Query: 228 LGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQG---PATQQSTSFLASNG 279
+ E+S+ S L I FS C D S RI FG G A ST +
Sbjct: 227 VAELSMISQLKST--INGRFSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGP 284
Query: 280 KYITYIIGVETCCIGSSCLKQTSF---------KAIVDSGSSFTFLPKEVYETIAAEFDR 330
Y+I +E +G L F IVDSG+++T+LP E Y +
Sbjct: 285 DTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAH 344
Query: 331 QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFC 390
+ CY ++ ++ P + F N + F+ +V C
Sbjct: 345 SIKGKRVRDPNGISSLCYNTTVDQI-DAPIITAHFKDANVELQPWNTFLRMQEDLV---C 400
Query: 391 LAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ P DIG +G + V FD ++ + ++C
Sbjct: 401 FTVLPTS-DIGILGNLAQVNFLVGFDLRKKRVSFKAADC 438
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 150/352 (42%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL T C
Sbjct: 200 DTGSDTTW-----VQCQPCVVVCYKQQEK---LFDPARSSTYANVSCAAPACSDLYTRGC 251
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y++ Y + + S G D L L S +A+K GCG + G +
Sbjct: 252 SGGH--CLYSVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNEGLFG 301
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPAT---Q 269
+ GL+GLG G+ S+P K G + F+ C SG + FG PA +
Sbjct: 302 EAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLDFGPGSPAAVGAR 355
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA---IVDSGSSFTFLPKEVYETI 324
Q+T L NG Y +G+ +G L Q+ F IVDSG+ T LP Y ++
Sbjct: 356 QTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAAYSSL 414
Query: 325 AAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
+ F + ++ P CY + +P V L+F Q +++ N ++
Sbjct: 415 RSAFASAM--AARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLF-QGGAYLDVNASGIM 471
Query: 381 YG---TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Y +QV GF A D D+G +G + + VV+D +G+S C
Sbjct: 472 YAASLSQVCLGF--AANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|195658449|gb|ACG48692.1| hypothetical protein [Zea mays]
gi|413938915|gb|AFW73466.1| hypothetical protein ZEAMMB73_105703 [Zea mays]
Length = 149
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 62/121 (51%), Gaps = 27/121 (22%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
FS++++HR S+E + + WP + S YY+ LL SD+Q+QK + + Q+L
Sbjct: 27 FSSRMVHRLSDEAR---LEAGPRMGLWPQRGSGGYYRALLRSDLQRQKRRLAGKNQLLSL 83
Query: 87 SQGSKTMSLGNDFGC-----------------------DLLWIPCDCVRCAPLSASYYNS 123
S+G T S GND G DL W+PCDC++CAPLS SY +
Sbjct: 84 SKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQCAPLS-SYRGN 142
Query: 124 L 124
L
Sbjct: 143 L 143
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 112/453 (24%), Positives = 181/453 (39%), Gaps = 73/453 (16%)
Query: 30 KLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQV-----LLSSDVQKQKMKTG-----P 79
KL HRFSE + S R + ++ ++ + LL D+ T
Sbjct: 31 KLKHRFSELEGSSKQSGKRGMSEEHFRQLMDHTRARSRRFLLEVDLMLNGSSTSDATYYA 90
Query: 80 QFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNS---LDRDLNEYSPSA 135
Q + P Q + D G D+LW C C C+ S + + Y P
Sbjct: 91 QIGVGHPVQFLNAIV---DTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPEL 147
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
S T+ +CS LC G SC+ C Y + Y + +SS+G+ D++HL K
Sbjct: 148 SITASPATCSDPLCSEGGSCRGNNNSCAYDIS-YEDTSSSTGIYFRDVVHL------GHK 200
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DK 253
S+ ++ +GC SG + DG++G G ++SVP+ LA N F C +K
Sbjct: 201 ASLNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSGEK 256
Query: 254 DDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK----- 304
+ G + G D+ P T LA++ I Y + + + + S L + + F+
Sbjct: 257 EGGGILVLGKNDEFPEMVY-TPMLAND---IVYNVKLVSLSVNSKALPIEASEFEYNATV 312
Query: 305 ----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC----CYKSSSQR-- 354
I+DSG+S P + A F + V+ T+ P + C+ S S R
Sbjct: 313 GNGGTIIDSGTSSATFPSKAL----ALFVKAVSKFTTAIPTAPLESSGSPCFISISDRNS 368
Query: 355 -LPKLPSVKLMFPQNNSF----------VVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTI 403
P+V L F + VV+ + Q V C++ G+ +
Sbjct: 369 VEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVCISWSV--GNSTIL 426
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGT 436
G + VV+D E ++GW QDL+ G+
Sbjct: 427 GDAILKDKVVVYDMEKSRIGWVK---QDLSHGS 456
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 103/245 (42%), Gaps = 30/245 (12%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----D 150
D G DL W+ CD CVRC L P +S + C+ LC +
Sbjct: 75 DTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDLIPCNDPLCKALHLN 120
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C+ P+Q C Y ++Y + SS G+LV D+ + + + + +GCG Q
Sbjct: 121 SNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGLRLTPRLALGCGYDQ 174
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ--GPAT 268
G DG++GLG G++S+ S L G ++N C G +FFGD +
Sbjct: 175 IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSR 234
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
T K+ + +G E G + + DSGSS+T+ + Y+ +
Sbjct: 235 VSWTPMSREYSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 293
Query: 329 DRQVN 333
R+++
Sbjct: 294 KRELS 298
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 148/359 (41%), Gaps = 61/359 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G DL W+ C C RC ++ + P ASS+ + SC+ LCD L
Sbjct: 26 DTGSDLCWVQCAPCARC----------FEQPDPLFIPLASSSYSNASCTDSLCDALPRPT 75
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ + C Y+ Y + + E + L S A + GCG Q G +
Sbjct: 76 CSMRNTCTYSYSYGDGSNTRGDFAFETV---------TLNGSTLARIGFGCGHNQEGTF- 125
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR---IFFGDQGPATQQS 271
DGLIGLG G +S+PS L + + FS C D+ +G I FG+ ++ S
Sbjct: 126 --AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPITFGNAAENSRAS 181
Query: 272 -TSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFK--------AIVDSGSSFTFLPKEV 320
T L + Y +GVE+ +G+ + ++F+ I+DSG++ T+
Sbjct: 182 FTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRLAA 241
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYK-----SSSQRLP----KLPSVKLMFPQNNSF 371
+ I AE RQ++ Y CY +SS LP L +V P +N +
Sbjct: 242 FIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVDFEIPVSNLW 301
Query: 372 V-VNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V V+N +G V T + Q IG +V D N ++G+ ++C
Sbjct: 302 VLVDN-----FGETVCTAMSTSDQ-----FSIIGNVQQQNNLIVTDVANSRVGFLATDC 350
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 145/364 (39%), Gaps = 59/364 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL-----G 152
D G DL + V+CAP Y ++D Y PS SST + C C L G
Sbjct: 52 DTGSDLAF-----VQCAPCDLCY----EQDGPLYQPSNSSTFTPVPCDSAECLLIPAPVG 102
Query: 153 TSCQN------PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C + P+ C Y Y +N+S+ G+ + + GG V GC
Sbjct: 103 APCSSSYPESPPQGACSYEYRY-GDNSSTVGVFAYETATV--GGIRV------NHVAFGC 153
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-----GRIFF 261
G + G + V+ G++GLG G +S S A N F+ C S + F
Sbjct: 154 GNRNQGSF---VSAGGVLGLGQGALSFTSQAGYA--FENKFAYCLTSYLSPTSVFSSLIF 208
Query: 262 GDQGPATQQSTSF--LASN----GKYITYII----GVETCCIGSSCLKQTSFK---AIVD 308
GD +T F L SN Y I+ G ET I S K S I D
Sbjct: 209 GDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGNGGTIFD 268
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP 366
SG++ T+ + Y I A F++ V S +G P C S P PS + F
Sbjct: 269 SGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPL--CVNVSGIDHPIYPSFTIEFD 326
Query: 367 QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWS 425
Q ++ N + I + + CLA+ D IG Y V +DRE ++G++
Sbjct: 327 QGATYRPNQGNYFIEVSPNID--CLAMLESSSDGFNVIGNIIQQNYLVQYDREEHRIGFA 384
Query: 426 HSNC 429
H+NC
Sbjct: 385 HANC 388
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/351 (25%), Positives = 153/351 (43%), Gaps = 43/351 (12%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTS- 154
D G DL+W C C +C ++ P +SS+ +++C C+ L +S
Sbjct: 78 DTGSDLVWFQCIPCTKCYKQQNPMFD----------PRSSSSYTNITCGTESCNKLDSSL 127
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C ++ C YT Y +N+ + G+L ++ L L S + +I GCG SG +
Sbjct: 128 CSTDQKTCNYTYSY-ADNSITQGVLAQETLTLTSTTGEPV---AFQGIIFGCGHNNSG-F 182
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKA-GLIRNSFSMC---FDKDDS--GRIFFGDQGPAT 268
D GLIGLG G +S+ S + + G N FS C F+ D S ++ FG
Sbjct: 183 NDREM--GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEVL 240
Query: 269 QQ---STSFLASNGK-YITYIIGVETCCI------GSSCLKQTSFKAIVDSGSSFTFLPK 318
ST ++ +G Y ++G+ I GSS T ++DSG++ T+LP+
Sbjct: 241 GNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTITKGNILIDSGTTITYLPE 300
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
E Y + + +V +GY + CY++ + P++ + F + + +F
Sbjct: 301 EFYHRLIEQVRNKVALEPFRIDGY--ELCYQTPTNL--NGPTLTIHFEGGDVLLTPAQMF 356
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ FC A+ + + T G + Y + FD E + + ++C
Sbjct: 357 IPVQDD---NFCFAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDC 404
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 148/359 (41%), Gaps = 66/359 (18%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL W C C++C Y L N P S++ H+ C+ + C
Sbjct: 98 DTGSDLTWAQCLPCLKC-------YQQLRPIFN---PLKSTSFSHVPCNTQTCHAVDDGH 147
Query: 157 NPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
Q C Y+ Y S L E I + G +++K+ +IGCG SGG+
Sbjct: 148 CGVQGVCDYSYTYGDRTYSKGDLGFEKI----TIGSSSVKS------VIGCGHASSGGF- 196
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGDQ----GPAT 268
G A G+IGLG G++S+ S +++ I FS C +G+I FG GP
Sbjct: 197 -GFA-SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGV 254
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----KAIVDSGSSFTFLPKEVYETI 324
+ L S Y I +E IG+ + +F I+DSG++ +FLPKE+Y+ +
Sbjct: 255 VSTP--LISKNTVTYYYITLEAISIGNE--RHMAFAKQGNVIIDSGTTLSFLPKELYDGV 310
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYK-----SSSQRLPKLPS-------VKLMFPQNNSFV 372
+ + V G W C+ ++S +P + + V L+ P N
Sbjct: 311 VSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLL-PVNTFQK 369
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V N V CL + P + G IG + + + +D E +L + + C
Sbjct: 370 VANNV-----------NCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/334 (25%), Positives = 150/334 (44%), Gaps = 53/334 (15%)
Query: 27 FSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
FS +LIHR S + ++N+ NA ++ ++ LS+ + G ++
Sbjct: 28 FSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLFKDSLSNTPESTVYVNGGEY 87
Query: 82 QMLFPSQGSKTMSLGN--DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 138
M + S G+ ++ D G D++W+ C C +C + +N PS SS+
Sbjct: 88 LMTY-SVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFN----------PSKSSS 136
Query: 139 SKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 196
K++ CS LC TSC N + C YT+++ ++ S L VE + D+ +
Sbjct: 137 YKNIPCSSNLCQSVRYTSC-NKQNSCEYTINFSDQSYSQGELSVETLTL-----DSTTGH 190
Query: 197 SVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---- 251
SV +IGCG G + + G++GLG+G +S+ + L + I FS C
Sbjct: 191 SVSFPKTVIGCGHNNRGMFQGETS--GIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLL 246
Query: 252 -DKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-- 305
D + + ++ FGD + ST F+ + + Y + +E +G+ K+ F+
Sbjct: 247 VDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAF-YYLTLEAFSVGN---KRIEFEVLD 302
Query: 306 -------IVDSGSSFTFLPKEVYETIAAEFDRQV 332
I+DSG++ T LP VY + + + V
Sbjct: 303 DSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLV 336
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 103/245 (42%), Gaps = 30/245 (12%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----D 150
D G DL W+ CD CVRC L P +S + C+ LC +
Sbjct: 56 DTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDLIPCNDPLCKALHLN 101
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C+ P+Q C Y ++Y + SS G+LV D+ + + + + +GCG Q
Sbjct: 102 SNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGLRLTPRLALGCGYDQ 155
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ--GPAT 268
G DG++GLG G++S+ S L G ++N C G +FFGD +
Sbjct: 156 IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSR 215
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF 328
T K+ + +G E G + + DSGSS+T+ + Y+ +
Sbjct: 216 VSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 274
Query: 329 DRQVN 333
R+++
Sbjct: 275 KRELS 279
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 148/364 (40%), Gaps = 58/364 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D G DL+W +C P A + D+ L + PS SST SC LC SC
Sbjct: 100 DTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASC 150
Query: 156 QNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+PK Q C YT Y + + ++G L D + G + V GCG+ +
Sbjct: 151 GSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFVGAGASV------PGVAFGCGLFNN 203
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGRIF 260
G + G+ G G G +S+PS L K G +FS CF D ++
Sbjct: 204 GVFKSN--ETGIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGLKPSTVLLDLPADLY 256
Query: 261 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFKAIVDSGS 311
+G QST + + Y + ++ +GS+ LK + I+DSG+
Sbjct: 257 KSGRGAV--QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGT 314
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNNS 370
+ T LP VY + F QV + S C + + P +P + L F
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMD 374
Query: 371 FVVNNPVFVI--YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
N VF + G+ ++ CLAI G++ TIG V++D +N KL + +
Sbjct: 375 LPRENYVFEVEDAGSSIL---CLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430
Query: 429 CQDL 432
C L
Sbjct: 431 CDKL 434
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 124/307 (40%), Gaps = 19/307 (6%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C P ++ L LN + P +S T+ +SCS + C G
Sbjct: 99 DTGSDVLWVSCASCNGCPQTSG----LQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSD 154
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+ C C YT Y + + +SG V D+L ++L + A V+ GC Q+G
Sbjct: 155 SGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 213 GYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ 269
+ A DG+ G G +SV S LA G+ FS C ++ G + G+
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNM 273
Query: 270 QSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETI 324
T + S Y ++ + + I S ++ + I+D+G++ +L + Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
V+ ++ + CY ++ P V L F S +N ++I
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ-CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNN 392
Query: 385 VVTGFCL 391
V + C
Sbjct: 393 VASALCF 399
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 160/357 (44%), Gaps = 40/357 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----DLG 152
D G D+LW+ C+ P ++ L +L+ + PS+SST+ +SCSH +C
Sbjct: 104 DTGSDILWVTCNSCNDCPRTSG----LGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTA 159
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG-GDNALKNSVQASVIIGCGMKQS 211
C C Y+ +Y + + ++G V D+L+ + GD+ + NS AS++ GC QS
Sbjct: 160 AECSPQSNQCSYSF-HYGDGSGTTGYYVSDMLYFDTVLGDSLIANS-SASIVFGCSTYQS 217
Query: 212 GGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGD----- 263
G A DG+ G G ++SV S L+ G+ FS C + D G++ G+
Sbjct: 218 GDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGGGKLVLGEILEPN 277
Query: 264 --QGPATQQSTSF------LASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTF 315
P + + ++ NG+ ++ ++ +S + T IVDSG++ T+
Sbjct: 278 IIYSPLVPSQSHYNLNLQSISVNGQ----LLPIDPAVFATSNNQGT----IVDSGTTLTY 329
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
L + Y+ + V+ + T CY S+ P V L F S V+
Sbjct: 330 LVETAYDPFVSAITATVSSSTTPVLS-KGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKP 388
Query: 376 PVFVIY--GTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++++ + +C+ Q V + I +G + V+D + ++GW++ +C
Sbjct: 389 GEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDC 445
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 150/367 (40%), Gaps = 67/367 (18%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS- 154
D G DL+W CD C RC P A Y +P+ S+T ++SC +C S
Sbjct: 110 DTGSDLIWTQCDAPCRRCFPQPAPLY----------APARSATYANVSCRSPMCQALQSP 159
Query: 155 ---CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C P C Y Y + TS+ G+L + L G D A++ V GCG +
Sbjct: 160 WSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTL--GSDTAVRG-----VAFGCGTENL 211
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG--P 266
G + GL+G+G G +S L+++ G+ R FS CF + + +F G
Sbjct: 212 GSTDNS---SGLVGMGRGPLS---LVSQLGVTR--FSYCFTPFNATAASPLFLGSSARLS 263
Query: 267 ATQQSTSFLAS-----NGKYITYIIGVETCCIGSSCL--KQTSFK--------AIVDSGS 311
+ ++T F+ S + Y + +E +G + L F+ I+DSG+
Sbjct: 264 SAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 323
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP----- 366
+FT L + + +A +V + S C+ ++S ++P + L F
Sbjct: 324 TFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADME 383
Query: 367 -QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ S+VV + + CL + G + +G +++D E L +
Sbjct: 384 LRRESYVVED--------RSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGILSFE 434
Query: 426 HSNCQDL 432
+ C +L
Sbjct: 435 PAKCGEL 441
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 148/364 (40%), Gaps = 58/364 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D G DL+W +C P A + D+ L + PS SST SC LC SC
Sbjct: 100 DTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASC 150
Query: 156 QNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+PK Q C YT Y + + ++G L D + G + V GCG+ +
Sbjct: 151 GSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFVGAGASV------PGVAFGCGLFNN 203
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGRIF 260
G + G+ G G G +S+PS L K G +FS CF D ++
Sbjct: 204 GVFKSN--ETGIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGLKPSTVLLDLPADLY 256
Query: 261 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFKAIVDSGS 311
+G QST + + Y + ++ +GS+ LK + I+DSG+
Sbjct: 257 KSGRGAV--QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGTIIDSGT 314
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNNS 370
+ T LP VY + F QV + S C + + P +P + L F
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMD 374
Query: 371 FVVNNPVFVI--YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
N VF + G+ ++ CLAI G++ TIG V++D +N KL + +
Sbjct: 375 LPRENYVFEVEDAGSSIL---CLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430
Query: 429 CQDL 432
C L
Sbjct: 431 CDKL 434
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 112/454 (24%), Positives = 179/454 (39%), Gaps = 73/454 (16%)
Query: 6 LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVL 65
L I L VF S A F+ KLI R S +V NR P S +Y L
Sbjct: 10 LAILLLVFIF--PSIEAHNGRFTVKLIPRNSSQVLF-----NRITAQTPV--SVHHYDYL 60
Query: 66 LSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLD 125
+ + +KT Q D G DL+W+ +C P + Y
Sbjct: 61 MELSIGTPPVKTYAQV----------------DTGSDLIWL-----QCIPCTNCY----- 94
Query: 126 RDLNE-YSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 182
+ LN + P +SST +++ C TSC + C YT Y +++ + G+L ++
Sbjct: 95 KQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSY-EDDSITEGVLAQE 153
Query: 183 ILHLIS--GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
L L S G ALK VI GCG +G + D G+IGLG G +S+ S + +
Sbjct: 154 TLTLTSTTGKPVALK-----GVIFGCGHNNNGVFNDKEM--GIIGLGRGPLSLVSQIGSS 206
Query: 241 GLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ---STSFLASNGK---YITYIIGVE 289
FS C + + + FG ST ++ N Y ++G+
Sbjct: 207 -FGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGIS 265
Query: 290 TCCI------GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGY 342
I GSS T ++DSG+ T LP++ Y + E +V D I
Sbjct: 266 VEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTL 325
Query: 343 PWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIG 401
++ CY++ + K ++ F + + +F+ + FC A + G
Sbjct: 326 GYQLCYRTPTNL--KGTTLTAHFEGADVLLTPTQIFIPVQDGI---FCFAFTSTFSNEYG 380
Query: 402 TIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDG 435
G + + Y + FD E + + ++C +L D
Sbjct: 381 IYGNHAQSNYLIGFDLEKQLVSFKATDCTNLQDA 414
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 127/295 (43%), Gaps = 46/295 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ ++L D G DL+W +CAP + D+ + P+ASST L C C
Sbjct: 97 RPVALTLDTGSDLVW-----TQCAPCR----DCFDQGIPLLDPAASSTYAALPCGAPRCR 147
Query: 151 L--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SVQAS--VI 203
TSC + C Y +Y + + + G + D GDN +N S+ A+ +
Sbjct: 148 ALPFTSCGG--RSCVYVY-HYGDKSVTVGKIATDRFTF---GDNGRRNGDGSLPATRRLT 201
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK--DDSGRIFF 261
GCG G + G+ G G G S+PS L SFS CF D I
Sbjct: 202 FGCGHFNKGVFQSN--ETGIAGFGRGRWSLPSQLNA-----TSFSYCFTSMFDSKSSIVT 254
Query: 262 GDQGPAT---------QQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA-IVDS 309
PA ++T + + Y + ++ +G + L +T F++ I+DS
Sbjct: 255 LGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDS 314
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---SSSQRLPKLPSV 361
G+S T LP+EVYE + AEF QV + EG C+ S+ R P +PS+
Sbjct: 315 GASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPAVPSL 369
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 150/367 (40%), Gaps = 67/367 (18%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS- 154
D G DL+W CD C RC P A Y +P+ S+T ++SC +C S
Sbjct: 110 DTGSDLIWTQCDAPCRRCFPQPAPLY----------APARSATYANVSCRSPMCQALQSP 159
Query: 155 ---CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C P C Y Y + TS+ G+L + L G D A++ V GCG +
Sbjct: 160 WSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTL--GSDTAVRG-----VAFGCGTENL 211
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG--P 266
G + GL+G+G G +S L+++ G+ R FS CF + + +F G
Sbjct: 212 GSTDNS---SGLVGMGRGPLS---LVSQLGVTR--FSYCFTPFNATAASPLFLGSSARLS 263
Query: 267 ATQQSTSFLAS-----NGKYITYIIGVETCCIGSSCL--KQTSFK--------AIVDSGS 311
+ ++T F+ S + Y + +E +G + L F+ I+DSG+
Sbjct: 264 SAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 323
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP----- 366
+FT L + + +A +V + S C+ ++S ++P + L F
Sbjct: 324 TFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADME 383
Query: 367 -QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ S+VV + + CL + G + +G +++D E L +
Sbjct: 384 LRRESYVVED--------RSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGILSFE 434
Query: 426 HSNCQDL 432
+ C +L
Sbjct: 435 PAKCGEL 441
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 117/446 (26%), Positives = 172/446 (38%), Gaps = 76/446 (17%)
Query: 30 KLIH--RFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
+L+H F+ +AL +R + + A Q L S V +G F L
Sbjct: 40 RLLHIKPFTTPSQALSFDSHRLSFFFSA---LHTPQSLKSPVVSGASTGSGQYFVDLRLG 96
Query: 88 QGSKTMSLGNDFGCDLLWIPC----DCVRCAPLSASYYNSLDRDLNEYSPSASSTS---- 139
+ + L D G DL+W+ C +C R P SA L R +SP+ S
Sbjct: 97 TPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSA----FLARHSTTFSPNHCYDSACQL 152
Query: 140 ----KHLSCSH-RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDN 192
KH C+H RL PC Y Y + + +SG ++ L+ SG +
Sbjct: 153 VPLPKHHRCNHARL----------HSPCRYEYS-YGDGSKTSGFFSKETTTLNTSSGREA 201
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
LK + GC + SG + G + G++GLG G IS+ S L N FS
Sbjct: 202 KLKG-----IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR--FGNKFSY 254
Query: 250 CFDKDD-----SGRIFFG----DQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCL 298
C D + + G D P ++ T + Y IG+E+ + L
Sbjct: 255 CLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKL 314
Query: 299 ----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 348
+ + IVDSG++ TFLP+ Y I R+V + + C
Sbjct: 315 PINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCV 374
Query: 349 KSSSQRLPKLPSVKLMFPQNNSFVVNNP---VFVIYGTQVVTGFCLAIQPVDGDIG--TI 403
S P+LP KL F V + P FV V CLA+Q V G I
Sbjct: 375 NVSEIEHPRLP--KLSFKLGGDSVFSPPPRNYFVDTDEDVK---CLALQAVMTPSGFSVI 429
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNC 429
G G+ + FD++ +LG+S C
Sbjct: 430 GNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 144/359 (40%), Gaps = 41/359 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC----DLGT 153
D G D+LW+ C PL++ L LN + P SST+ LSC C +
Sbjct: 59 DTGSDILWVNCKPCNACPLTSG----LGVALNFFDPRGSSTASPLSCIDSKCVSSNQISE 114
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
S + C Y+ +Y + + + G V D + + N+ A + GC QSG
Sbjct: 115 SVCTTDRYCGYSFEY-GDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKITFGCSYNQSGD 173
Query: 214 YLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQ 270
A DG+ G G ++SV S L GL FS C + D G + G+
Sbjct: 174 LTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGILVLGEITEPGMV 233
Query: 271 STSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIA 325
T + S Y + G+ + I T+ + I+D G++ +L +E YE
Sbjct: 234 YTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLAYLAEEAYEPFV 293
Query: 326 AEFDRQVNDTITSF--EGYPWKCCYKSSSQRLPKLPSVKLMFP------QNNSFVV---- 373
V+ + F +G P C+ + PSV L F + +++
Sbjct: 294 NTIIAAVSQSTQPFMLKGNP---CFLTVHSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLS 350
Query: 374 --NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRV-VFDRENLKLGWSHSNC 429
++PV+ I G Q Q D TI + + +V V+D EN ++GW+ +C
Sbjct: 351 PDSSPVWCI-GWQKS-----GQQATDSSKMTILGDLVLKDKVFVYDLENQRIGWTSFDC 403
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 99/395 (25%), Positives = 154/395 (38%), Gaps = 100/395 (25%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASST--SKHLS 143
Q SK L D G DL W+ CD CV+C YY R N P S H +
Sbjct: 28 QPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYY----RPRNNLVPCMDPICQSLHSN 83
Query: 144 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
HR C+NP Q C Y ++Y + SS G+LV D +L + + S +
Sbjct: 84 GDHR-------CENPGQ-CDYEVEY-ADGGSSFGVLVRDTFNL------NFTSEKRHSPL 128
Query: 204 IG---CGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD----- 254
+ CG Q GG + DG++GLG G+ S+ S L+ GL+RN C
Sbjct: 129 LALGLCGYDQFPGGSHHPI--DGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFL 186
Query: 255 -------DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 307
DS R+ + P + + LA E G K T FK ++
Sbjct: 187 FFGDDLYDSSRVAWTPMSPDAKHYSPGLA------------ELTFDG----KTTGFKNLL 230
Query: 308 ---DSGSSFTFLPKEVYETIAAEFDRQVN-----------------------DTITSFEG 341
DSG+S+T+L + Y+ + + ++++ +I +
Sbjct: 231 TTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKK 290
Query: 342 YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV----NNPVFVIYGTQVVTGFCLAIQPVD 397
Y +++R K +L FP ++ N + ++ GT+V
Sbjct: 291 YFKTFALSFTNERKSK---TELEFPPEAYLIISSKGNACLGILNGTEVGL---------- 337
Query: 398 GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
D+ IG M V++D E ++GW+ NC L
Sbjct: 338 NDLNVIGDISMQDRVVIYDNEKERIGWAPGNCNRL 372
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 152/365 (41%), Gaps = 63/365 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG--TS 154
D G D +W C C C ++ +N PS SST K++ CS +C G T
Sbjct: 108 DTGSDGIWFQCKPCKPCLNQTSPIFN----------PSKSSTYKNIRCSSPICKRGEKTR 157
Query: 155 CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C N K+ C Y + Y + + S G + +D L L S + + ++IGCG K S
Sbjct: 158 CSSNRKRKCEYEITY-LDRSGSQGDISKDTLTLNSNDGSPIS---FPKIVIGCGHKNSLT 213
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDD-SGRIFFGDQGPAT 268
+G+A G+IG G G S+ S L + I FS C F K + S +++FGD +
Sbjct: 214 -TEGLA-SGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISSKLYFGDMAVVS 269
Query: 269 QQST-------SFLASNGKYITYIIGVETCCIG--------SSCLKQTSFKAIVDSGSSF 313
SF N Y +E +G SS + A++DSGS+
Sbjct: 270 GHGVVSTPLIQSFYVGN-----YFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVIDSGSTI 324
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T LP +VY + V CYK++ ++ ++P + F + +
Sbjct: 325 TQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKY-EVPIITAHFRGADVKLN 383
Query: 374 NNPVFVIYGTQVVTGFCLAIQP------VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
F+ +V+ C A V G+I QNF+ GY + +N+ + + +
Sbjct: 384 AFNTFIQMNHEVM---CFAFNSSAFPWVVYGNIAQ--QNFLVGYDTL---KNI-ISFKPT 434
Query: 428 NCQDL 432
NC L
Sbjct: 435 NCTKL 439
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 95/211 (45%), Gaps = 24/211 (11%)
Query: 53 WPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVR 112
+P K +F+ QV L K K+ T P + + + D G D+LW+ C
Sbjct: 63 FPVKGTFDPSQVGLY--YTKVKLGTPP-----------RELYVQIDTGSDVLWVSCGSCN 109
Query: 113 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT-----SCQNPKQPCPYTMD 167
P ++ L LN + P +SSTS +SC R C G SC C YT
Sbjct: 110 GCPQTSG----LQIQLNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQ 165
Query: 168 YYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGL 226
Y + + +SG V D++H S + L + ASV+ GC + Q+G A DG+ G
Sbjct: 166 Y-GDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGF 224
Query: 227 GLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
G +SV S L+ G+ FS C D+SG
Sbjct: 225 GQQGMSVISQLSSQGIAPRVFSHCLKGDNSG 255
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 146/352 (41%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL T C
Sbjct: 204 DTGSDTTW-----VQCEPCVVVCYEQQEK---LFDPARSSTDANISCAAPACSDLYTKGC 255
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 256 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAIKG-----FRFGCGERNEGLFG 305
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGP------AT 268
+ GL+GLG G+ S+P K G + F+ CF SG + D GP +T
Sbjct: 306 EAA---GLLGLGRGKTSLPVQAYDKYGGV---FAHCFPARSSGTGYL-DFGPGSSPAVST 358
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEVYET 323
+ +T L NG Y +G+ +G L T+ IVDSG+ T LP Y +
Sbjct: 359 KLTTPMLVDNGLTF-YYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRLPPAAYSS 417
Query: 324 IAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP--V 377
+ + F + ++ P CY + +P+V L+F S V+ +
Sbjct: 418 LRSAFASAI--AARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGII 475
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +Q GF A D D+G +G + + VV+D +G+S C
Sbjct: 476 YAASVSQACLGF--AANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 153/387 (39%), Gaps = 67/387 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T+S D G D++W PC + +S + + P SS+SK L C + C
Sbjct: 78 QTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLGCKNPKCS 137
Query: 151 LG-------------TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 197
SC N Q CP M +Y T+ G+ + + LHL S
Sbjct: 138 WIHHSNINCDQDCSIKSCLN--QTCPPYMIFYGSGTTG-GVALSETLHLHSLS------- 187
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--- 254
+ + ++GC + S P G+ G G G S+PS L S FD D
Sbjct: 188 -KPNFLVGCSVFSSH------QPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHRFDDDTKK 240
Query: 255 DSGRIFFGDQGPATQQSTSFL----ASNGKY-------ITYIIGVETCCIGSSCLKQTSF 303
S + +Q + +++ + + N K + Y +G+ +G +K +
Sbjct: 241 SSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVK-VPY 299
Query: 304 K-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND--TITSFE-GYPWKCCYK 349
K I+DSG++FTF+ +E +E ++ EF RQ+ D + E + C+
Sbjct: 300 KYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFN 359
Query: 350 SSSQRLPKLPSVKLMFP--QNNSFVVNNPVFVIYGTQ-----VVTGFCLAIQPVDGDIGT 402
S + P ++L F + + V N F G + VVT + V G
Sbjct: 360 VSDAKTVSFPELRLYFKGGADVALPVEN-YFAFVGGEVACLTVVTDGVAGPERVGGPGMI 418
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G M + V +D N +LG+ C
Sbjct: 419 LGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 113/448 (25%), Positives = 178/448 (39%), Gaps = 68/448 (15%)
Query: 32 IHRFSEEVKALGVSKN--RNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQG 89
I R + K SK + A S A S EY L+++ + +G F +F
Sbjct: 142 ISRLQKSTKKQTNSKQSYKPAVSPVAAASPEYSSQLVATLESGVSLGSGEYFMDVFIGTP 201
Query: 90 SKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K SL D G DL WI C C+ C S YY+ P SS+ ++++C
Sbjct: 202 PKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYD----------PKESSSFENITCHDPR 251
Query: 149 CDLGTS------CQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
C L +S C++ Q CPY Y + NT+ L ++L + + + V+ +
Sbjct: 252 CKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVE-N 310
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 256
V+ GCG G + L+GLG G +S S L + +SFS C D S
Sbjct: 311 VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQL--QSIYGHSFSYCLVDRNSDTSVS 365
Query: 257 GRIFFGDQGPATQQS----TSFLA--SNGKYITYIIGVETCCIGSSCL----------KQ 300
++ FG+ TSF+ N Y +G+++ + L K+
Sbjct: 366 SKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKE 425
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCYKSSSQRLPKLP 359
I+DSG++ T+ + YE I F +++ EG+ P K CY S +LP
Sbjct: 426 GGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKG-YELVEGFPPLKPCYNVSGIEKMELP 484
Query: 360 SVKLM--------FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMTG 410
++ FP N F+ P V CLAI + IG
Sbjct: 485 DFGILFSDGAMWDFPVENYFIQIEPDLV----------CLAILGTPKSALSIIGNYQQQN 534
Query: 411 YRVVFDRENLKLGWSHSNCQDLNDGTKS 438
+ +++D + +LG++ C G S
Sbjct: 535 FHILYDMKKSRLGYAPMKCTATTSGGDS 562
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 150/359 (41%), Gaps = 53/359 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++TM L D D WIPC CV C S++ +N++ S+T K + C
Sbjct: 106 AQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVK----------STTFKTVGCEAPQ 152
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C + + C + M Y + + +++ L +D++ L + +S+ S GC
Sbjct: 153 CKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLAT-------DSI-PSYTFGCLT 202
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQ 264
+ +G + P GL+GLG G +S+ L L +++FS C + SG + G
Sbjct: 203 EATG---SSIPPQGLLGLGRGPMSL--LSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPV 257
Query: 265 G-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSF 313
G P ++T L + + Y + + +G + T I DSG+ F
Sbjct: 258 GQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVF 317
Query: 314 TFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
T L Y + F ++V N T+TS G+ CY S P++ MF N +
Sbjct: 318 TRLVAPAYTAVRDAFRKRVGNATVTSLGGF--DTCYTSPI----VAPTITFMFSGMNVTL 371
Query: 373 VNNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + + +T +A P V+ + I +R++FD N +LG + C
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPC 430
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 156/365 (42%), Gaps = 37/365 (10%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C C ++ P S T + + C+ + C+ C
Sbjct: 111 DTGSTVTYVPCSTCRHCG----------SHQDPKFRPEDSETYQPVKCTWQ-CN----CD 155
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N ++ C Y Y E ++SSG L ED+ +S G+ + +A I GC ++G +
Sbjct: 156 NDRKQCTYERRY-AEMSTSSGALGEDV---VSFGNQTELSPQRA--IFGCENDETGDIYN 209
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 276
A DG++GLG G++S+ L + +I +SFS+C+ G G + F
Sbjct: 210 QRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMVFTR 268
Query: 277 SNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEFD 329
S+ + Y I ++ + L ++DSG+++ +LP+ +
Sbjct: 269 SDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIM 328
Query: 330 RQVN--DTITSFEGYPWKCCYKSS----SQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGT 383
++ + I+ + C+ + SQ P V+++F + ++ ++ +
Sbjct: 329 KETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHS 388
Query: 384 QVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTP 442
+V +CL + D T +G + V++DRE+ K+G+ +NC +L + P
Sbjct: 389 KVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFWKTNCSELWERLHVSDAP 448
Query: 443 GPGTP 447
P P
Sbjct: 449 PPLLP 453
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 148/357 (41%), Gaps = 43/357 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC----DLGT 153
D G D+LW+ C P ++ L L+ + P SS++ +SCS R C +
Sbjct: 102 DTGSDVLWVSCTSCNGCPKTSE----LQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES 157
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C +P C Y+ Y + + +SG + D + + + L + A + GC QSG
Sbjct: 158 GC-SPNNLCSYSFKY-GDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGD 215
Query: 214 YLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGD------- 263
A DG+ GLG G +SV S LA GL FS C DK G + G
Sbjct: 216 LQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTV 275
Query: 264 ------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
P + +A NG+ + V T G I+D+G++ +LP
Sbjct: 276 YTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDG--------TIIDTGTTLAYLP 327
Query: 318 KEVYETIAAEFDRQVNDTITSFE---GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
E Y + F + V + ++ + Y C++ ++ + P V L F S V+
Sbjct: 328 DEAY----SPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASMVLG 383
Query: 375 NPVFV-IYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ I+ + + +C+ Q + I +G + VV+D ++GW+ +C
Sbjct: 384 PRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 144/355 (40%), Gaps = 39/355 (10%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G D+LW+ C P ++ L L+ + P SS++ +SCS R C ++
Sbjct: 102 DTGSDVLWVSCTSCNGCPKTSE----LQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES 157
Query: 158 ---PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
P C Y+ Y + + +SG + D + + + L + A + GC Q+G
Sbjct: 158 GCSPNNLCSYSFKY-GDGSGTSGFYISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDL 216
Query: 215 LD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGD-------- 263
A DG+ GLG G +SV S LA GL FS C DK G + G
Sbjct: 217 QRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVY 276
Query: 264 -----QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
P + +A NG+ + V T G I+D+G++ +LP
Sbjct: 277 TPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDG--------TIIDTGTTLAYLPD 328
Query: 319 EVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
E Y V+ ++E Y C++ ++ + P V L F S V+
Sbjct: 329 EAYSPFIQAIANAVSQYGRPITYESYQ---CFEITAGDVDVFPEVSLSFAGGASMVLRPH 385
Query: 377 VFV-IYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ I+ + + +C+ Q + I +G + VV+D ++GW+ +C
Sbjct: 386 AYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 159/366 (43%), Gaps = 53/366 (14%)
Query: 89 GSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G++ M++ D G DL W+ CD C+ C +N + SST ++L +
Sbjct: 140 GNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFTTG 199
Query: 148 LCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
+ +C+ N C +T+ Y + + L VE HL GG + ++ + GC
Sbjct: 200 NTE---ACESNNPSSCNHTVSYGDGSFTDGELGVE---HLSFGGISV------SNFVFGC 247
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGD 263
G + + G GV+ G++GLG +S+ S FS C D SG + G+
Sbjct: 248 G-RNNKGLFGGVS--GIMGLGRSNLSMISQTNTT--FGGVFSYCLPTTDSGASGSLVIGN 302
Query: 264 QGPATQQSTSF----LASNGK----YITYIIGVETCCIGSSCLKQTSFK---AIVDSGSS 312
+ + T + SN + Y+ + G++ +G ++ TSF ++DSG+
Sbjct: 303 ESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGID---VGGVAIQDTSFGNGGILIDSGTV 359
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMF 365
T L +Y + AEF +Q F GYP C+ + +P++ + F
Sbjct: 360 ITRLAPSLYNALKAEFLKQ-------FSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHF 412
Query: 366 PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLG 423
N V+ V ++Y + + CLA+ + + D+ IG RV++D + K+G
Sbjct: 413 ENNVDLNVD-AVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIG 471
Query: 424 WSHSNC 429
++ +C
Sbjct: 472 FAREDC 477
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 138/352 (39%), Gaps = 55/352 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++W+ C C +C Y D + P+ASS+ +SC +C +
Sbjct: 148 DSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSSFSGVSCGSAICRTLSGTG 197
Query: 157 NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
DY Y + + + G L + L L G A++ V IGCG + SG
Sbjct: 198 CGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQG-----VAIGCGHRNSGL 249
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFG--DQGPAT 268
+ V GL+GLG G +S+ L G FS C +G + G + P
Sbjct: 250 F---VGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVLGRTEAVPRG 304
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPK 318
++++SF Y +G+ +G L + + ++D+G++ T LP+
Sbjct: 305 RRASSF---------YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPR 355
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV-NNPV 377
E Y + FD + S CY S ++P+V F Q + +
Sbjct: 356 EAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNL 415
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V G V FCLA P I +G G ++ D N +G+ + C
Sbjct: 416 LVEVGGAV---FCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 158/391 (40%), Gaps = 53/391 (13%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F +F K SL D G DL WI C C C + YY+ P
Sbjct: 190 LGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYD----------P 239
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
SS+ K+++C C L +S C+ Q CPY Y + ++ +E +
Sbjct: 240 KDSSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNL 299
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
+ + + + +V+ GCG G + L+GLG G +S + L L +SF
Sbjct: 300 TTPEGKPELKIVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQLQ--SLYGHSF 354
Query: 248 SMCF-DKDD----SGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIGSS 296
S C D++ S ++ FG+ TSF+ + Y + +++ +G
Sbjct: 355 SYCLVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGE 414
Query: 297 CLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND--TITSFEGYPW 344
LK Q I+DSG++ T+ + YE I F R++ + +F P
Sbjct: 415 VLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFP--PL 472
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNN--SFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIG 401
K CY S +LP ++F F V N I VV CLAI +
Sbjct: 473 KPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVV---CLAILGTPRSALS 529
Query: 402 TIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
IG + +++D + +LG++ C D+
Sbjct: 530 IIGNYQQQNFHILYDLKKSRLGYAPMKCADV 560
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 144/349 (41%), Gaps = 38/349 (10%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL----G 152
D G DL+W+ C C +C P +A ++ P SST K + C + C L
Sbjct: 110 DTGSDLIWVQCAPCEKCVPQNAPLFD----------PRKSSTFKTVPCDSQPCTLLPPSQ 159
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+C C Y Y ++T SG+L + ++ S +NA+K + GC +
Sbjct: 160 RACVGKSGQC-YYQYIYGDHTLVSGILGFESINFGSK-NNAIKF---PKLTFGCTFSNND 214
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQ 269
+ GL+GLG+G +S+ S L I FS CF + + ++ FG+ Q
Sbjct: 215 TVDESKRNMGLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQ 272
Query: 270 ----QSTSFLASNGKYITYIIGVETCCIGSSCLK----QTSFKAIVDSGSSFTFLPKEVY 321
ST + + Y + +E IG+ +K QT ++DSG+SFT L + Y
Sbjct: 273 IKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFY 332
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
A + C+++ +R + P V +F V + +F
Sbjct: 333 NKFVALVKEVYGVEAVKIPPLVYNFCFENKGKR-KRFPDVVFLFTGAKVRVDASNLFEAE 391
Query: 382 GTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ C+ P D D G + GY+V +D + + ++ ++C
Sbjct: 392 DNNLL---CMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADC 437
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/308 (27%), Positives = 127/308 (41%), Gaps = 47/308 (15%)
Query: 144 CSHRLCD--LGTSCQN----PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 197
C LC L SC N P Q C YT YY + + ++GL+ D +G
Sbjct: 38 CDSTLCQGLLVASCGNTKFWPNQTCVYTY-YYNDKSVTTGLIEVDKFTFGAGAS------ 90
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---- 253
V GCG+ +G + G+ G G G +S+PS L K G +FS CF
Sbjct: 91 -VPGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGL 142
Query: 254 -------DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK 304
D ++ G QST + ++ Y + ++ +GS+ L +++F
Sbjct: 143 KQSTVLLDLPADLY--KNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFA 200
Query: 305 -------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 357
I+DSG+S T LP +VY+ + EF Q+ + C+ + SQ P
Sbjct: 201 LTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPD 260
Query: 358 LPSVKLMFP-QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVF 415
+P + L F N VF + + CLAI GD TI NF V++
Sbjct: 261 VPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAIN--KGDETTIIGNFQQQNMHVLY 318
Query: 416 DRENLKLG 423
D +N+ G
Sbjct: 319 DLQNMHRG 326
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 154/367 (41%), Gaps = 63/367 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G L W+ C C C+ S ++ PS SST +LSCS C+ C
Sbjct: 111 DTGSSLTWVMCHPCSSCSQQSVPIFD----------PSKSSTYSNLSCSE--CN---KCD 155
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK---QSGG 213
CPY+++Y + SS G+ + L L + ++ +K S+I GCG K S G
Sbjct: 156 VVNGECPYSVEY-VGSGSSQGIYAREQLTLETIDESIIK---VPSLIFGCGRKFSISSNG 211
Query: 214 Y-LDGVAPDGLIGLGLGEISV-PSLLAK----AGLIRNSFSMCFDKDDSGRIFFGDQGPA 267
Y G+ +G+ GLG G S+ PS K G +RN+ R+ GD+
Sbjct: 212 YPYQGI--NGVFGLGSGRFSLLPSFGKKFSYCIGNLRNT------NYKFNRLVLGDKANM 263
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK---------AIVDSGSSFTFL 316
ST+ NG Y + +E IG L T F+ I+DSG+ T+L
Sbjct: 264 QGDSTTLNVING---LYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWL 320
Query: 317 PKEVYETIAAEFDRQVNDTITSFE---GYPWKCCYKS-SSQRLPKLPSVKLMFPQNNSFV 372
K +E ++ E + + + + P+ CY SQ L P V F +
Sbjct: 321 TKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLD 380
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPVD--GD----IGTIGQNFMTGYRVVFDRENLKLGWSH 426
++ I T+ FC+A+ P + GD +IG Y V +D +++ +
Sbjct: 381 LDVTSMFIQTTE--NEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQR 438
Query: 427 SNCQDLN 433
+C+ L+
Sbjct: 439 IDCELLD 445
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/377 (23%), Positives = 150/377 (39%), Gaps = 55/377 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-----CVRCAPLSAS-YYNSLDRDLNEY--SPSASSTSKH 141
+K L D G +L W+ C C C P YY D +L SP + +
Sbjct: 48 AKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPADGNLKVVCGSPLCVAVRRD 107
Query: 142 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
+ + +N C Y + Y T S G L DI+ ++G D +
Sbjct: 108 VP------GIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIIS-VNGRD-------KKR 151
Query: 202 VIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKDDSGRI 259
+ GCG KQ +P DG++GLG+G+ + + L +I+ N C G +
Sbjct: 152 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVL 211
Query: 260 FFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-QTSFKAIVDSGSSFTFLPK 318
+ GD P T+ T + Y G+ I ++ +F+A+ DSGS++T +P
Sbjct: 212 YVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPA 270
Query: 319 EVYETIAAEFDRQVND-TITSFEGYPWKCCYKSSS--------QRLPKLPSVKLMF---- 365
++Y I ++ +++ ++ +G C+K + K S+K+
Sbjct: 271 QIYNEIVSKVRVTLSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALSLKITHARGT 330
Query: 366 ------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--TIGQNFMTGYRVVFDR 417
PQN FV + G + ++ PV ++ IG M V++D
Sbjct: 331 SNLDIPPQNYLFVKED------GETCLAILDASLDPVLKELNFILIGAVTMQDLFVIYDN 384
Query: 418 ENLKLGWSHSNCQDLND 434
E +LGW + C + +
Sbjct: 385 EKKQLGWVRAQCDRVQE 401
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 145/367 (39%), Gaps = 60/367 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLCD---LGT 153
D G DL+W+ +C P Y R + Y P +SST + + C+ C
Sbjct: 106 DTGSDLIWL-----QCVPCRHCY-----RQVTPLYDPRSSSTHRRIPCASPRCRDVLRYP 155
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C C Y M Y + ++SSG L D L+ D + N V +GCG G
Sbjct: 156 GCDARTGGCVY-MVVYGDGSASSGDLATD--RLVFPDDTHVHN-----VTLGCGHDNVG- 206
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD------KDDSGRIFFGDQGPA 267
L+ A GL+G+G G++S P+ LA A + FS C ++ S + FG
Sbjct: 207 LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVFSYCLGDRLSRAQNGSSYLVFGRTPEP 262
Query: 268 TQQSTSFLASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSGSSFT 314
+ + L +N + Y ++G + S +VDSG++ +
Sbjct: 263 PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVDSGTAIS 322
Query: 315 FLPKEVYETIAAEFDRQ------VNDTITSFEGYPWKCCYKSSSQRLP----KLPSVKLM 364
++ Y + FD + T F + CY P ++PS+ L
Sbjct: 323 RFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVF--DACYDLRGNGAPAAAVRVPSIVLH 380
Query: 365 FPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
F + N + + G T FCL +Q D + +G G+ +VFD E ++
Sbjct: 381 FAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVERGRI 440
Query: 423 GWSHSNC 429
G++ + C
Sbjct: 441 GFTPNGC 447
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 103/239 (43%), Gaps = 41/239 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G W+ CD CA + + Y P+ T+ L S LC+ G +N
Sbjct: 178 DTGSHTTWVQCDAPPCASCAKGAHPL-------YRPA--RTADALPASDPLCE-GAQHEN 227
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
P Q C Y + Y + +SS G+ V D + + G D +N A ++ GCG Q G L+
Sbjct: 228 PNQ-CDYEISY-ADGSSSMGVYVRDSMQFV-GEDGEREN---ADIVFGCGYDQQGVLLNA 281
Query: 218 V-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQQSTS 273
+ DG++GL +S+P+ LA G+I N+F C D SG +F GD
Sbjct: 282 LETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGD---------D 332
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSF------------KAIVDSGSSFTFLPKEV 320
++ G I + + +KQ + + + D+GS++T+ P E
Sbjct: 333 YIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEA 391
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 104/242 (42%), Gaps = 41/242 (16%)
Query: 95 LGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS 154
L D G W+ CD CA + + Y P+ T+ L S LC+ G
Sbjct: 175 LDVDTGSHTTWVQCDAPPCASCAKGAHPL-------YRPA--RTADALPASDPLCE-GAQ 224
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
+NP Q C Y + Y + +SS G+ V D + + G D +N A ++ GCG Q G
Sbjct: 225 HENPNQ-CDYEISY-ADGSSSMGVYVRDSMQFV-GEDGEREN---ADIVFGCGYDQQGVL 278
Query: 215 LDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQQ 270
L+ + DG++GL +S+P+ LA G+I N+F C D SG +F GD
Sbjct: 279 LNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGD------- 331
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSF------------KAIVDSGSSFTFLPK 318
++ G I + + +KQ + + + D+GS++T+ P
Sbjct: 332 --DYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPD 389
Query: 319 EV 320
E
Sbjct: 390 EA 391
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 145/361 (40%), Gaps = 55/361 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ M + D D W+PC CV CA S+ ++ PS SS+S++L C
Sbjct: 101 AQPMLVALDTSNDAAWVPCSGCVGCA--SSVLFD----------PSKSSSSRNLQCDAPQ 148
Query: 149 CDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C +C K C + M Y +S L +D L L N V S GC
Sbjct: 149 CKQAPNPTCTAGKS-CGFNMTYGGSTIEAS--LTQDTL--------TLANDVIKSYTFGC 197
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIFFG 262
K +G L GL+GLG G +S+ S L ++FS C SG + G
Sbjct: 198 ISKATGTSLPA---QGLMGLGRGPLSLIS--QTQNLYMSTFSYCLPNSKSSNFSGSLRLG 252
Query: 263 DQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCL---KQTSFKAIVDSGS 311
+ + T+ L N + Y+ + +G + I +S L T I DSG+
Sbjct: 253 PKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGT 312
Query: 312 SFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
FT L + Y + EF R++ N TS G+ CY S PSV MF N
Sbjct: 313 VFTRLVEPAYVAVRNEFRRRIKNANATSLGGF--DTCYSGSV----VYPSVTFMFAGMNV 366
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ + + + + + +A P V+ + I +RV+ D N +LG S
Sbjct: 367 TLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRET 426
Query: 429 C 429
C
Sbjct: 427 C 427
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 145/365 (39%), Gaps = 61/365 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--- 153
D G DL W C C C P +D Y PSASST + CS C L T
Sbjct: 84 DTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPVPCSSATC-LPTWRS 132
Query: 154 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQS 211
+C NP PC Y Y++ S G+L + L + G + +V SV GCG
Sbjct: 133 RNCSNPSSPCRYIYS-YSDGAYSVGILGTETLTI---GSSVPGQTVSVGSVAFGCGTDNG 188
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDDSGRIFFGD-----Q 264
G D + G +GLG G + SLLA+ G+ + S+ + F+ F G
Sbjct: 189 G---DSLNSTGTVGLGRGTL---SLLAQLGVGKFSYCLTDFFNSTMDSPFFLGTLAELAP 242
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK--------AIVDSGSSFT 314
GP T QST L S Y + ++ +G L +F +VDSG++FT
Sbjct: 243 GPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFT 302
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPW-------KCCYKSSSQRLPKLPSVKLMFPQ 367
L K + R+V D + G P C+ S P +P + L F
Sbjct: 303 ILAKSGF--------REVVDRVAQLLGQPPVNASSLDSPCFPSPDGE-PFMPDLVLHFAG 353
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
++ ++ Y + + FCL I +G +++FD +L + +
Sbjct: 354 GADMRLHRDNYMSY-NEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPT 412
Query: 428 NCQDL 432
+C L
Sbjct: 413 DCSKL 417
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 153/339 (45%), Gaps = 50/339 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G D++W+ C C +C YN R + PS S+T K L S C TS
Sbjct: 104 DTGSDMIWLQCKPCEKC-------YNQTTR---IFDPSKSNTYKILPFSSTTCQSVEDTS 153
Query: 155 CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + ++ C YT+ YY + + S G L + L L S +++K +IGCG +
Sbjct: 154 CSSDNRKMCEYTI-YYGDGSYSQGDLSVETLTLGSTNGSSVKFR---RTVIGCGRNNTVS 209
Query: 214 YLDGVAPDGLIGLGLGEIS-VPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQ 270
+ +G G++GLG G +S + L ++ I FS C + S ++ FGD +
Sbjct: 210 F-EG-KSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNFGDAAVVSGD 267
Query: 271 ST--SFLASNGKYITYIIGVETCCIGSSCLKQTS--FK------AIVDSGSSFTFLPKEV 320
T + + ++ + Y + +E +G++ ++ TS F+ I+DSG++ T LP ++
Sbjct: 268 GTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSGTTLTLLPNDI 327
Query: 321 YETIAA------EFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV-- 372
Y + + E DR V D + CY+S+ L P + F + +
Sbjct: 328 YSKLESAVADLVELDR-VKDPLKQLS-----LCYRSTFDEL-NAPVIMAHFSGADVKLNA 380
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
VN + V G + I P+ G++ QNF+ GY
Sbjct: 381 VNTFIEVEQGVTCLAFISSKIGPIFGNMAQ--QNFLVGY 417
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 153/371 (41%), Gaps = 56/371 (15%)
Query: 89 GSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G+ ++ D +L W+ C C C D+ + PS+S + + C+
Sbjct: 127 GAAEATVVVDTASELTWVQCQPCESCH----------DQQDPLFDPSSSPSYAAVPCNSS 176
Query: 148 LCDL-------GTS-C--QNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 196
CD GTS C N +QP C Y + Y + + S G+L D L L +G D
Sbjct: 177 SCDALRVAMAAGTSPCADDNEQQPACSYALSY-RDGSYSRGVLARDKLRL-AGQD----- 229
Query: 197 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPSLLAKAGLIRNSFSMCF---D 252
+ GCG G G + GL+GLG +S V + + G + FS C +
Sbjct: 230 --IEGFVFGCGTSNQGAPFGGTS--GLMGLGRSHVSLVSQTMDQFGGV---FSYCLPMRE 282
Query: 253 KDDSGRIFFGDQGPATQQSTSFLAS---------NGKYITYIIGVETCCIGSSCLKQTSF 303
SG + GD A + ST + + G + Y + + +G ++ F
Sbjct: 283 SGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPF--YFLNLTGITVGGQEVESPWF 340
Query: 304 KA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 360
A I+DSG+ T L VY + AEF Q+ + + C+ + + ++PS
Sbjct: 341 SAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCFNLTGLKEVQVPS 400
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRE 418
+K +F + V++ + + + + CLA+ + + D IG RV+FD
Sbjct: 401 LKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDTL 460
Query: 419 NLKLGWSHSNC 429
++G++ C
Sbjct: 461 GSQIGFAQETC 471
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 134/310 (43%), Gaps = 36/310 (11%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC C +C N D + P SS S S C++ +C
Sbjct: 107 DSGSTVTYVPCASCEQCG-------NHQD---PRFQPDLSS-----SYSPVKCNVDCTCD 151
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K+ C Y Y E +SSSG+L EDI+ G ++ LK + GC ++G
Sbjct: 152 SDKKQCTYERQY-AEMSSSSGVLGEDIVSF--GRESELK---AQRAVFGCENSETGDLFS 205
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 276
A DG++GLG G++S+ L + G+I +SFS+C+ D G G T F
Sbjct: 206 QHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSR 264
Query: 277 SNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEFD 329
S+ + Y I ++ + L+ + ++DSG+++ +LP++ +
Sbjct: 265 SDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAFMAFKDAVT 324
Query: 330 RQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIYGT 383
+V+ I + C+ + + + KL P V ++F + ++ +
Sbjct: 325 SKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHS 384
Query: 384 QVVTGFCLAI 393
+V +CL +
Sbjct: 385 KVDGAYCLGV 394
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 106/448 (23%), Positives = 186/448 (41%), Gaps = 64/448 (14%)
Query: 1 MNRIS-LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWP 54
MN +S LT+ L + S A + FS +LIHR S + ++N+ +A
Sbjct: 1 MNTLSFLTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRS 60
Query: 55 AKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLG-NDFGCDLLWIPCD-CVR 112
++ +++ +S + + + M + T G D G D++W+ C+ C +
Sbjct: 61 INRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQ 120
Query: 113 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYT 170
C + +N PS SS+ K++ CS +LC TSC + + C Y + Y
Sbjct: 121 CYNQTTPIFN----------PSKSSSYKNIPCSSKLCHSVRDTSCSD-QNSCQYKIS-YG 168
Query: 171 ENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 230
+++ S G L D L L S + + ++IGCG +G + G A G++GLG G
Sbjct: 169 DSSHSQGDLSVDTLSLESTSGSPVS---FPKIVIGCGTDNAGTF--GGASSGIVGLGGGP 223
Query: 231 ISVPSLLAKAGLIRNSFSMCF------DKDDSGRIFFGDQGPATQQ---STSFLASNGKY 281
+S+ + L + I FS C + + S + FGD + ST + + +
Sbjct: 224 VSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVF 281
Query: 282 ITYIIGVETCCIGSSCLKQTSF-----------KAIVDSGSSFTFLPKEVYETIAAEFDR 330
Y + ++ +G+ K+ F I+DSG++ T +P +VY + +
Sbjct: 282 --YFLTLQAFSVGN---KRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVD 336
Query: 331 QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFC 390
V + CY S P + + F + + + FV +V C
Sbjct: 337 LVKLDRVDDPNQQFSLCYSLKSNEY-DFPIITVHFKGADVELHSISTFVPITDGIV---C 392
Query: 391 LAIQPVDGDIGTI-----GQNFMTGYRV 413
A QP +G+I QN + GY +
Sbjct: 393 FAFQP-SPQLGSIFGNLAQQNLLVGYDL 419
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 149/362 (41%), Gaps = 57/362 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G D++W+ C C+ C Y D + P+ S+T +SC +C + ++
Sbjct: 189 DSGSDVMWVQCKPCLEC-------YVQAD---PLFDPATSATFSGVSCGSAICRILPTSA 238
Query: 155 CQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + + C Y + Y + + + G L + L L G A++ V+IGCG + G
Sbjct: 239 CGDGELGGCEYEVSY-ADGSYTKGALALETLTL---GGTAVEG-----VVIGCGHRNRGL 289
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD----------KDDSGRIFFGD 263
+ V GL+GLG G +S+ L G + +FS C DD+G + G
Sbjct: 290 F---VGAAGLMGLGWGPMSLVGQLG--GEVGGAFSYCLASRGGYGSGAADDDAGWLVLGR 344
Query: 264 QGPATQQSTSF-LASNGKYIT-YIIGVETCCIGSSCL----------KQTSFKAIVDSGS 311
+ + L N + + Y +G+ +G L + + ++D+G+
Sbjct: 345 SEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGT 404
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYKSSSQRLPKLPSVKLMFPQN 368
+ T LP+E Y + F + + +G CY S ++P+V F +
Sbjct: 405 TVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFCFDGD 464
Query: 369 NSFVVNNPVFVIYGTQVVTG-FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
++ ++ +V G +CLA P + +G G ++ D N +G+ +
Sbjct: 465 ARLILAARNVLL---EVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITVDSANGYIGFGPA 521
Query: 428 NC 429
NC
Sbjct: 522 NC 523
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/368 (23%), Positives = 155/368 (42%), Gaps = 39/368 (10%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 142
+F ++ +L D G + ++PC C C A + + P SS+ + +
Sbjct: 103 VFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDP-------RFKPDNSSSYQTV 155
Query: 143 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS- 201
SC+ C + C C Y Y E +SS G+L +D+L +G + +Q
Sbjct: 156 SCNSPDC-ITKMCDARVHQCKYER-VYAEMSSSKGVLGKDLLGFGNG------SRLQPHP 207
Query: 202 VIIGCGMKQSGG-YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GR 258
++ GC ++G YL DG++GLG G +S+ L G + +SFS+C+ D G
Sbjct: 208 LLFGCETAETGDLYLQ--HADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGS 265
Query: 259 IFFGDQGPATQQSTSFLASNGKYITYI------IGVETCCIG-SSCLKQTSFKAIVDSGS 311
+ G P + F S+ Y I V+ + S + ++DSG+
Sbjct: 266 MVLGAIPPPP--AMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGT 323
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEG----YPWKCC--YKSSSQRLPK-LPSVKLM 364
++ +LP + ++ +Q+ ++ + G YP C S S+ L K P V +
Sbjct: 324 TYAYLPDKAFDAFKDAITQQLG-SLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFV 382
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
F N + ++ T+V +CL +G + V +DR N ++G+
Sbjct: 383 FSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQIGF 442
Query: 425 SHSNCQDL 432
+NC +L
Sbjct: 443 FKTNCTNL 450
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 88/383 (22%), Positives = 148/383 (38%), Gaps = 67/383 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-----CVRCAPLSAS-YYNSLDRDLNEYSPSASSTSKHLS 143
+K L D G +L W+ C C C P YY D +L +
Sbjct: 48 AKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPADGNLK------------VV 95
Query: 144 CSHRLC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
C LC + +N C Y + Y T S G L DI+ ++G D
Sbjct: 96 CGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIIS-VNGRD---- 148
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIR-NSFSMCFDK 253
+ + GCG KQ +P DG++GLG+G+ + L +I+ N C
Sbjct: 149 ---KKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSS 205
Query: 254 DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-QTSFKAIVDSGSS 312
G ++ GD P T+ T + Y G+ I ++ +F+A+ DSGS+
Sbjct: 206 KGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGST 264
Query: 313 FTFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYKSSS--------QRLPKLPSVKL 363
+T +P ++Y I ++ +++ ++ +G C+K + K S+K+
Sbjct: 265 YTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALSLKI 324
Query: 364 MF----------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--TIGQNFMTGY 411
PQN FV + G + ++ PV ++ IG M
Sbjct: 325 THARGTNNLDIPPQNYLFVKED------GETCLAILDASLDPVLKELNFILIGAVTMQDL 378
Query: 412 RVVFDRENLKLGWSHSNCQDLND 434
V++D E +LGW + C + +
Sbjct: 379 FVIYDNEKKQLGWVRAQCDRVQE 401
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 112/437 (25%), Positives = 173/437 (39%), Gaps = 76/437 (17%)
Query: 36 SEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSL 95
++E +A RN A +V L+S ++ Q + L S GS +L
Sbjct: 103 ADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRLQTLNYVTTIS-LGGSSGSPAANL 161
Query: 96 GN--DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC--DL 151
D G DL W V+C P SA Y RD + P+ S+T + C+ C L
Sbjct: 162 TVIVDTGSDLTW-----VQCKPCSACYAQ---RD-PLFDPAGSATYAAVRCNASACADSL 212
Query: 152 GTSCQNP---------KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
+ P + C Y + Y + + S G+L D + AL +
Sbjct: 213 RAATGTPGSCGSTGAGSEKCYYAL-AYGDGSFSRGVLATDTV--------ALGGASLGGF 263
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCF----DKDDSG 257
+ GCG+ G G A GL+GLG E+S+ S A + G + FS C D SG
Sbjct: 264 VFGCGLSNRG-LFGGTA--GLMGLGRTELSLVSQTASRYGGV---FSYCLPAATSGDASG 317
Query: 258 RIFFG--DQGPATQQSTS------FLASNGKYITYIIGVETCCIGSSCLKQTSFKA---I 306
+ G D ++ ++T+ +A + Y + V +G + L A +
Sbjct: 318 SLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVL 377
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLP 359
+DSG+ T L VY + AEF RQ GYP CY + K+P
Sbjct: 378 IDSGTVITRLAPSVYRAVRAEFMRQFGAA-----GYPAAPGFSILDTCYDLTGHDEVKVP 432
Query: 360 SVKLMFPQNNSFVVNNP--VFVIY--GTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRV 413
+ L V+ +FV+ G+QV CLA+ + + + IG RV
Sbjct: 433 LLTLRLEGGADVTVDAAGMLFVVRKDGSQV----CLAMASLSYEDETPIIGNYQQKNKRV 488
Query: 414 VFDRENLKLGWSHSNCQ 430
V+D +LG++ +C
Sbjct: 489 VYDTLGSRLGFADEDCN 505
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 146/369 (39%), Gaps = 56/369 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C +L RD +Y P + + C L
Sbjct: 59 KAYELDIDTGSDLTWVQCDAPCKGC---------TLPRD-RQYKPHGNL----VKCVDPL 104
Query: 149 CDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASV 202
C S C NP + C Y ++Y + SS G+LV DI+ L ++ G L +S+ A
Sbjct: 105 CAAIQSAPNPPCVNPNEQCDYEVEY-ADQGSSLGVLVRDIIPLKLTNG--TLTHSMLA-- 159
Query: 203 IIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
GCG Q+ G+ + G++GLG G S+ S L GLIRN C G +FF
Sbjct: 160 -FGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVVGHCLSGTGGGFLFF 218
Query: 262 GDQ---------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 312
GDQ P Q S+S L Y G + DSGSS
Sbjct: 219 GDQLIPQSGVVWTPILQSSSSLLKH------YKTGPADMFFNGKATSVKGLELTFDSGSS 272
Query: 313 FTFL----PKEVYETIAAEFDRQVNDTITSFEGYP--WKC--CYKSSSQRLPKLPSVKLM 364
+T+ K + + I + + T P WK +KS + L
Sbjct: 273 YTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICWKGPKPFKSLHDVTSNFKPLVLS 332
Query: 365 FPQNNSFVVNNP----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
F ++ + + P + V V G + G+ IG + V++D E
Sbjct: 333 FTKSKNSLFQVPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQ 392
Query: 421 KLGWSHSNC 429
++GW+ +NC
Sbjct: 393 RIGWASANC 401
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/359 (23%), Positives = 138/359 (38%), Gaps = 89/359 (24%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K D G DL W+ CD C C + +Y P ++ + C +
Sbjct: 65 KAFEFDIDTGSDLTWVQCDAPCTGCT----------LPPIRQYKPKGNT----VPCLDPI 110
Query: 149 CDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASV 202
C C NPK+ C Y ++Y + +S L+++ L L++G +++Q +
Sbjct: 111 CLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLNG------SAMQPRL 164
Query: 203 IIGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
GCG Q L P G++GLG G+I V L AGL RN C G
Sbjct: 165 AFGCGYDQ---ILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSKGGGY 221
Query: 259 IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSF 313
+FFGD + + + G T ++ E C + T FK++++
Sbjct: 222 LFFGD---------TLIPTLGVAWTPLLSPEYTFFFHICRDRLQRDYTFFKSVLEF---- 268
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
K ++TI F ++++R+ +L P + ++
Sbjct: 269 ----KNFFKTITINF---------------------TNARRI-----TQLQIPPESYLII 298
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ G ++ G + +Q + IG M G V++D E +LGW SNC L
Sbjct: 299 SKTGNACLG--LLNGSEVGLQ----NSNVIGDISMQGLMVIYDNEKQQLGWVSSNCNKL 351
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 133/343 (38%), Gaps = 50/343 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++W+ C C +C Y D + P+ASS+ +SC +C +
Sbjct: 148 DSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSSFSGVSCGSAICRTLSGTG 197
Query: 157 NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
DY Y + + + G L + L L G A++ V IGCG + SG
Sbjct: 198 CGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQG-----VAIGCGHRNSGL 249
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
+ V GL+GLG G +S+ L G FS C +G
Sbjct: 250 F---VGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG-------------GAG 291
Query: 274 FLASNGKYITYI---IGVETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYETIAAE 327
LAS+ Y+ +G E + S + T A ++D+G++ T LP+E Y +
Sbjct: 292 SLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGA 351
Query: 328 FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV-NNPVFVIYGTQVV 386
FD + S CY S ++P+V F Q + + V G V
Sbjct: 352 FDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAV- 410
Query: 387 TGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
FCLA P I +G G ++ D N +G+ + C
Sbjct: 411 --FCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 148/358 (41%), Gaps = 52/358 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++TMS+ D G D+ W+ C C +C S +SL + PSASST SCS
Sbjct: 143 TQTMSM--DTGSDVSWVQCKPCSQCH----SEVDSL------FDPSASSTYSPFSCSSAA 190
Query: 149 C------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C G C + + C Y + Y + +S++G D L L G NA+K
Sbjct: 191 CVQLSQSQQGNGCSSSQ--CQYIVS-YVDGSSTTGTYSSDTLTL---GSNAIKG-----F 239
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIF 260
GC +SGG+ D DGL+GLG S+ S AG +FS C SG +
Sbjct: 240 QFGCSQSESGGFSDQT--DGLMGLGGDAQSLVS--QTAGTFGKAFSYCLPPTPGSSGFLT 295
Query: 261 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFL 316
G + T L S Y + +E +G L + F A ++DSG+ T L
Sbjct: 296 LGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMDSGTVITRL 355
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN-- 374
P Y +++ F + + C+ S Q +PSV L+F + VVN
Sbjct: 356 PPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVF--SGGAVVNLD 413
Query: 375 -NPVFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
N + + + +CLA D +G IG + V++D +G+ C
Sbjct: 414 FNGIML-----ELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 141/358 (39%), Gaps = 44/358 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G DL W C C C P +D Y PSASST + CS C
Sbjct: 95 DTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPVPCSSATCLPVLRSR 144
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-VIIGCGMKQSG 212
+C P C Y Y++ S+G+L + L L G + +V S V GCG G
Sbjct: 145 NCSTPSSLCRYGYS-YSDGAYSAGILGTETLTL---GSSVPGQAVSVSDVAFGCGTDNGG 200
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDDSGRIFFGD-----QG 265
D + G +GLG G + SLLA+ G+ + S+ + F+ G G
Sbjct: 201 ---DSLNSTGTVGLGRGTL---SLLAQLGVGKFSYCLTDFFNSTLDSPFLLGTLAELAPG 254
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTF 315
P QST L S Y++ ++ +G L ++ +VDSG++F+
Sbjct: 255 PGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSI 314
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR-LPKLPSVKLMFPQNNSFVVN 374
LP+ + + + + + C + +R LP +P + L F ++
Sbjct: 315 LPESGFRVVVDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRLH 374
Query: 375 NPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
++ Y Q + FCL I +G +++FD +L + ++C L
Sbjct: 375 RDNYMSY-NQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDCSKL 431
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 135/352 (38%), Gaps = 46/352 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++W+ C C +C Y D + P+ASS+ +SC +C +
Sbjct: 148 DSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSSFSGVSCGSAICRTLSGTG 197
Query: 157 NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
DY Y + + + G L + L L G A++ V IGCG + SG
Sbjct: 198 CGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQG-----VAIGCGHRNSGL 249
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFG--DQGPAT 268
+ V GL+GLG G +S+ L G FS C +G + G + P
Sbjct: 250 F---VGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVLGRTEAVPVG 304
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPK 318
+ +N Y +G+ +G L + + ++D+G++ T LP+
Sbjct: 305 AVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPR 364
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV-NNPV 377
E Y + FD + S CY S ++P+V F Q + +
Sbjct: 365 EAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNL 424
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V G V FCLA P I +G G ++ D N +G+ + C
Sbjct: 425 LVEVGGAV---FCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 135/352 (38%), Gaps = 46/352 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++W+ C C +C Y D + P+ASS+ +SC +C +
Sbjct: 148 DSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSSFSGVSCGSAICRTLSGTG 197
Query: 157 NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
DY Y + + + G L + L L G A++ V IGCG + SG
Sbjct: 198 CGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQG-----VAIGCGHRNSGL 249
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFG--DQGPAT 268
+ V GL+GLG G +S+ L G FS C +G + G + P
Sbjct: 250 F---VGAAGLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAGGAGSLVLGRTEAVPVG 304
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPK 318
+ +N Y +G+ +G L + + ++D+G++ T LP+
Sbjct: 305 AVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTAVTRLPR 364
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV-NNPV 377
E Y + FD + S CY S ++P+V F Q + +
Sbjct: 365 EAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNL 424
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V G V FCLA P I +G G ++ D N +G+ + C
Sbjct: 425 LVEVGGAV---FCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 91/394 (23%), Positives = 156/394 (39%), Gaps = 88/394 (22%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+T+S D G +W PC C C S ++ + P SS+SK + C +
Sbjct: 88 QTLSFVMDTGSSFVWFPCTLRYLCNNC---------SFTSRISPFLPKHSSSSKIIGCKN 138
Query: 147 RLC-----------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
C D + +N Q CP + Y T+ G+ + + LHL
Sbjct: 139 PKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTG-GVALSETLHL--------H 189
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---- 251
+ + ++GC + S P G+ G G G S+PS L GL + FS C
Sbjct: 190 GLIVPNFLVGCSVFSSR------QPAGIAGFGRGPSSLPSQL---GLTK--FSYCLLSHK 238
Query: 252 --DKDDSGRIFFGDQGPATQQSTSF----LASNGKY-------ITYIIGVETCCIGSSCL 298
D +S + Q + +++ + L N K + Y + + IG +
Sbjct: 239 FDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSV 298
Query: 299 KQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND-----TITSFEGY 342
K +K I+DSG++FT++ E +E ++ EF QV + + + G
Sbjct: 299 K-IPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSG- 356
Query: 343 PWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV---FVIYGTQVVTGFCLAIQPVDGD 399
K C+ S + +LP ++L F V P+ F G++ V F + +
Sbjct: 357 -LKPCFNVSGAKELELPQLRLHFKGGAD--VELPLENYFAFLGSREVACFTVVTDGAEKA 413
Query: 400 IG---TIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
G +G M + V +D +N +LG+ +C+
Sbjct: 414 SGPGMILGNFQMQNFYVEYDLQNERLGFKKESCK 447
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 145/361 (40%), Gaps = 55/361 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ M + D D WIPC CV C S+S + PS SS+S+ L C
Sbjct: 98 AQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQCEAPQ 145
Query: 149 CDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C SC K C + M Y ++ L +D L L S V + GC
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTY--GGSTIEAYLTQDTLTLAS--------DVIPNYTFGC 194
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIFFG 262
K SG L GL+GLG G +S+ S L +++FS C SG + G
Sbjct: 195 INKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSGSLRLG 249
Query: 263 DQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVDSGS 311
+ + T+ L N + Y+ + +G + I +S L T I DSG+
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGT 309
Query: 312 SFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+T L + Y + EF R+V N TS G+ CY S PSV MF N
Sbjct: 310 VYTRLVEPAYVAVRNEFRRRVKNANATSLGGF--DTCYSGSV----VFPSVTFMFAGMNV 363
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ + + + ++ +A PV+ + + I +RV+ D N +LG S
Sbjct: 364 TLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRET 423
Query: 429 C 429
C
Sbjct: 424 C 424
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 162/395 (41%), Gaps = 86/395 (21%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLD-RDLNEYSPSASSTSKHLSCS 145
+T+ L D G L+W PC C C+ + +D + + P SS+SK + C
Sbjct: 92 QTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRFVPKLSSSSKLVGCQ 145
Query: 146 HRLC------DLGTSCQ--NPK-----QPCP-YTMDYYTENTSSSGLLVEDILHLISGGD 191
+ C D+ + C+ NPK Q CP Y + Y + S++GLL+ + L D
Sbjct: 146 NPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY--GSGSTAGLLLSETLDF---PD 200
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
+ N ++GC +L P G+ G G G S+PS + GL + ++ +
Sbjct: 201 KKIPN-----FVVGCS------FLSIHQPSGIAGFGRGSESLPS---QMGLKKFAYCLAS 246
Query: 252 DKDD----SGRIFFGDQG---------PATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
K D SG++ G P Q + +++N Y + + +G+ +
Sbjct: 247 RKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPS--VSNNAYKEYYYLNIRKIIVGNQAV 304
Query: 299 KQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND-----TITSFEGY 342
K +K +I+DSGS+FTF+ K V E +A EF++Q+ + + + G
Sbjct: 305 K-VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTG- 362
Query: 343 PWKCCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQ 394
+ C+ S ++ K P + F P NN F + + V T V
Sbjct: 363 -LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGG 421
Query: 395 PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G +G + V +D N +LG+ C
Sbjct: 422 GGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 151/354 (42%), Gaps = 48/354 (13%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+KT ++ D G D+ W+ C C++C ++ +D + PS SST SCS
Sbjct: 141 AKTQTVLIDSGSDVSWVQCKPCLQC-------HSQVD---PLFDPSLSSTYSPFSCSSAA 190
Query: 149 CDL----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C G C + Q C Y + Y + +S++G D L L G N + N
Sbjct: 191 CAQLGQDGNGCSSSSQ-CQYIVRY-ADGSSTTGTYSSDTLAL---GSNTISN-----FQF 240
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMCF--DKDDSGRIFF 261
GC +SG + D DGL+GLG G PSL ++ AG +FS C SG +
Sbjct: 241 GCSHVESG-FND--LTDGLMGLGGG---APSLASQTAGTFGTAFSYCLPPTPSSSGFLTL 294
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLP 317
G G + T L S+ Y + +E +G + L + F A ++DSG+ T LP
Sbjct: 295 G-AGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLP 353
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
+ Y +++ F + + C+ S Q +LPSV L+F + VVN
Sbjct: 354 RTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVF--SGGAVVN--- 408
Query: 378 FVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++ G CLA D G +G + V++D +G+ C
Sbjct: 409 --LDANGIILGNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 147/351 (41%), Gaps = 48/351 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G D+ WI C C C Y+ +D + P SS+ KHLSC C +L T
Sbjct: 156 DTGSDVTWIQCKPCSDC-------YSQVDP---IFEPQQSSSYKHLSCLSSACTELTTMN 205
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y ++ Y + + S G ++ L L G D+ S GCG + G
Sbjct: 206 HCRLGGCVYEIN-YGDGSRSQGDFSQETLTL--GSDSF------PSFAFGCGHTNT-GLF 255
Query: 216 DGVAPDGLIGLGLGEISVPSLL-AKAGLIRNSFSMC---FDKDDSGRIFFGDQG--PATQ 269
G A GL+GLG +S PS +K G FS C F S F QG PAT
Sbjct: 256 KGSA--GLLGLGRTALSFPSQTKSKYG---GQFSYCLPDFVSSTSTGSFSVGQGSIPATA 310
Query: 270 QSTSFLASNGKYIT-YIIGVETCCIGSSCLK-----QTSFKAIVDSGSSFTFLPKEVYET 323
L SN Y + Y +G+ +G L IVDSG+ T L + Y+
Sbjct: 311 TFVP-LVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRLVPQAYDA 369
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY-- 381
+ F + + ++ CY SS ++P++ F QNN+ V + V +++
Sbjct: 370 LKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHF-QNNADVAVSAVGILFTI 428
Query: 382 ---GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G+QV F A Q + +I IG RV FD ++G++ +C
Sbjct: 429 QSDGSQVCLAFASASQSISTNI--IGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 145/361 (40%), Gaps = 55/361 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ M + D D WIPC CV C S+S + PS SS+S+ L C
Sbjct: 98 AQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQCEAPQ 145
Query: 149 CDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C SC K C + M Y ++ L +D L L S V + GC
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTY--GGSTIEAYLTQDTLTLAS--------DVIPNYTFGC 194
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIFFG 262
K SG L GL+GLG G +S+ S L +++FS C SG + G
Sbjct: 195 INKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSGSLRLG 249
Query: 263 DQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVDSGS 311
+ + T+ L N + Y+ + +G + I +S L T I DSG+
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGT 309
Query: 312 SFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+T L + Y + EF R+V N TS G+ CY S PSV MF N
Sbjct: 310 VYTRLVEPAYVAVRNEFRRRVKNANATSLGGF--DTCYSGSV----VFPSVTFMFAGMNV 363
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ + + + ++ +A PV+ + + I +RV+ D N +LG S
Sbjct: 364 TLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRET 423
Query: 429 C 429
C
Sbjct: 424 C 424
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 118/485 (24%), Positives = 196/485 (40%), Gaps = 83/485 (17%)
Query: 5 SLTIYLAVFWLLTESSGAETVMFST-KLIHRFSEEVKALGVSKNRNATSWPAKK------ 57
+L ++L W+ +S+ E+ + ST + + R K + KN+NA S K+
Sbjct: 99 TLKLHLKHRWINRDSTHKESFVASTTRDLTRIQTLHKRILEKKNQNALSRLNKEEPKQPV 158
Query: 58 -----SFEYYQV------LLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWI 106
S E Y L+++ + +G F +F + SL D G DL WI
Sbjct: 159 VAPAASPESYPANGLSGQLMATLESGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWI 218
Query: 107 PC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS------CQNPK 159
C C C + YY+ P SS+ K++ C C L +S C+
Sbjct: 219 QCVPCYDCFVQNGPYYD----------PKESSSFKNIGCHDPRCHLVSSPDPPQPCKAEN 268
Query: 160 QPCPYTMDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 218
Q CPY Y + NT+ L ++L S + V+ +V+ GCG G +
Sbjct: 269 QTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVE-NVMFGCGHWNRGLFHGAA 327
Query: 219 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQS-- 271
L+GLG G +S S L L +SFS C D + S ++ FG+
Sbjct: 328 G---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEV 382
Query: 272 --TSFLASNGKYIT--YIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLP 317
TS +A + Y + +++ +G LK + + IVDSG++ ++
Sbjct: 383 NFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFA 442
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNN- 369
+ YE I F ++V +GYP CY S +LP +++F
Sbjct: 443 EPSYEIIKDAFVKKV-------KGYPVIKDFPILDPCYNVSGVEKMELPEFRILFEDGAV 495
Query: 370 -SFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
+F V N + ++V CLAI + IG + +++D + +LG++
Sbjct: 496 WNFPVENYFIKLEPEEIV---CLAILGTPRSALSIIGNYQQQNFHILYDTKKSRLGYAPM 552
Query: 428 NCQDL 432
C D+
Sbjct: 553 KCADV 557
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 153/382 (40%), Gaps = 74/382 (19%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCDCVR--CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
Q SK L D G DL W+ CD R C YY + + P S H
Sbjct: 28 QPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHPYYKPSNNLVACKDPICQSL--HTGGD 85
Query: 146 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI-I 204
R C+NP Q C Y ++Y + SS G+LV+D +L N Q+ ++ +
Sbjct: 86 QR-------CENPGQ-CDYEVEY-ADGGSSLGVLVKDAFNL-----NFTSEKRQSPLLAL 131
Query: 205 G-CGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
G CG Q GG + DG++GLG G+ S+ S L+ GL+RN C SGR
Sbjct: 132 GLCGYDQLPGGTYHPI--DGVLGLGRGKPSIVSQLSGLGLVRNVIGHCL----SGRGGGF 185
Query: 263 DQGPATQQSTSFLA-----SNGKYITYIIGVETCCIGSSCLKQTSFKAIV---DSGSSFT 314
+S +A N K+ Y G K T FK ++ DSG+S+T
Sbjct: 186 LFFGDDLYDSSRVAWTPMSPNAKH--YSPGFAELTFDG---KTTGFKNLIVAFDSGASYT 240
Query: 315 FLPKEVYETIAAEFDRQVNDT--ITSFEGYPWKCCYK-----SSSQRLPKL--------- 358
+L +VY+ + + R+++ + + C+K S + + K
Sbjct: 241 YLNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVRDVKKYFKTFALSFA 300
Query: 359 ----PSVKLMFPQNNSFVV----NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG 410
+L FP +V N + V+ GT+V D+ IG M
Sbjct: 301 NDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGL----------NDLNVIGDISMQD 350
Query: 411 YRVVFDRENLKLGWSHSNCQDL 432
V++D E +GW+ NC +
Sbjct: 351 RVVIYDNEKQLIGWAPRNCDRI 372
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 131/308 (42%), Gaps = 50/308 (16%)
Query: 58 SFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSK----------TMSLGN---------D 98
S ++Y L D Q++ + P+ + FP G +SLG D
Sbjct: 2 SLDHYHTLRKHD-QRRLRRMLPEV-VSFPISGDNDIFAMGLYYTRISLGTPPQQFYVDVD 59
Query: 99 FGCDLLWIPCDCVRCAPLSA-SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTS 154
G ++ W V+CAP + + + ++ + P S+T +SC+ C +
Sbjct: 60 TGSNVAW-----VKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQ 114
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNSVQASVIIGCGMKQSGG 213
C + CPY++ Y + +S++G + D+ DN+ S A ++ GCG Q+G
Sbjct: 115 CSPERLSCPYSL-LYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGS 173
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQS 271
+ + DGL+G G +S+P+ LA+ + N F+ C D SGR + G
Sbjct: 174 W----SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVY 229
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------AIVDSGSSFTFLPKEVYETIA 325
T + Y ++ + G + SF I+DSG++ T+L + Y+
Sbjct: 230 TPMVFGEDHYNVQLLNIGIS--GRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYD--- 284
Query: 326 AEFDRQVN 333
EF R V+
Sbjct: 285 -EFRRGVS 291
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 156/384 (40%), Gaps = 56/384 (14%)
Query: 79 PQFQMLFPSQGSK--TMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
PQ ++ S GS T L D DLLW+ C C+ C S L + PS
Sbjct: 82 PQAFLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQS----------LPIFDPSR 131
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 194
S T ++ SC + + N K + C Y+M Y + T S G+L +++L + D +
Sbjct: 132 SYTHRNESCRTSQYSMPSLRFNAKTRSCEYSMRY-MDGTGSKGILAKEMLMFNTIYDESS 190
Query: 195 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 254
++ V+ GCG G L G G++GLG GE S L+ + G FS CF
Sbjct: 191 SAALH-DVVFGCGHDNYGEPLVGT---GILGLGYGEFS---LVHRFG---TKFSYCFGSL 240
Query: 255 DS-----GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------K 299
D + GD G T+ L + Y + +E + L
Sbjct: 241 DDPSYPHNVLVLGDDGANILGDTTPLEIYNGF--YYVTIEAISVDGIILPIDPWVFNRNH 298
Query: 300 QTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF---EGYPWKC-CYKSSSQR 354
QT I+D+G+S T L +E Y+ + + + T+ + +K CY + +R
Sbjct: 299 QTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLER 358
Query: 355 ---LPKLPSVKLMFPQNNSFVVN-NPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG 410
P V F ++ VF+ V FCLA+ P G++ +IG
Sbjct: 359 DLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPNV---FCLAVTP--GNMNSIGATAQQS 413
Query: 411 YRVVFDRENLKLGWSHSNCQDLND 434
Y + +D E K+ + +C L D
Sbjct: 414 YNIGYDLEAKKISFERIDCGVLFD 437
>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 654
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 161/377 (42%), Gaps = 50/377 (13%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSAS 136
G + ++ + S+ D G L+ PC C C + + + S
Sbjct: 63 GTHYTWVYAGTPPQRASVIADTGSGLMAFPCSGCDGCGSHTDQPFQA----------DNS 112
Query: 137 STSKHLSCSHRLCDLG-TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG----- 190
ST H++CS + C C + Y E +S +VED+++L GG
Sbjct: 113 STLIHVTCSQQQSHFQCKECTEKSDTCAISQSYM-EGSSWKASVVEDVVYL--GGESSFH 169
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSM 249
D A+++ GC ++G ++ VA DG++GL + + + L + I N FS+
Sbjct: 170 DEAMRDRYGTHFQFGCQSSETGLFVTQVA-DGIMGLSNSDTHIVAKLHRENKIPSNLFSL 228
Query: 250 CFDKDDSGRIFFGDQGPATQQSTSFLA------SNGKYITYIIGVETCCIGSSCL--KQT 301
CF ++ G + G+ + A S G + Y + ++ IG + K+
Sbjct: 229 CF-TENGGTMSVGEPNTKAHRGEISYAKVIKDRSAGHF--YNVNMKDIRIGGKSINAKEE 285
Query: 302 SFKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
++ IVDSG++ ++LP+ + EF QV + + C+ +++ L L
Sbjct: 286 AYTRGHYIVDSGTTDSYLPR----AMKNEF-LQVFKEVAGRDYQVGTSCHGYTNEDLASL 340
Query: 359 PSVKLMFP----QNNSFVVNNP--VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYR 412
P ++L+ +N +++ P ++++ +C +I + G IG N M
Sbjct: 341 PKIQLVMEAYGDENGEVIIDIPPEQYLLHND---NSYCGSIYLSENAGGVIGANLMMNRD 397
Query: 413 VVFDRENLKLGWSHSNC 429
V+FD N ++G+ ++C
Sbjct: 398 VIFDNGNQRVGFVDADC 414
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 114/269 (42%), Gaps = 21/269 (7%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G D+LW+ C C C S L+ L ++P SSTS + CS C L TS
Sbjct: 109 DTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTS 163
Query: 155 ---CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
CQ + PC YT Y + + +SG V D ++ + N + AS++ GC Q
Sbjct: 164 EAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQ 222
Query: 211 SGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPA 267
SG A DG+ G G ++SV S L G+ FS C D+G + G+
Sbjct: 223 SGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEP 282
Query: 268 TQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYE 322
T + S Y + ++ + I SS ++ + IVDSG++ +L Y+
Sbjct: 283 GLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYD 342
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSS 351
V+ ++ S +C SS
Sbjct: 343 PFVNAITAAVSPSVRSLVSKGNQCFVTSS 371
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 103/433 (23%), Positives = 185/433 (42%), Gaps = 58/433 (13%)
Query: 27 FSTKLIHRFSEEVKALGVSKNR-NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
F+ +LIHR S + S+ + ++S V+L SD + + ++
Sbjct: 27 FTVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHRNTVVLESDTAEAPIFNNGGEYLVE 86
Query: 86 PSQGSKTMSLGN--DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 143
S G+ S+ D G D++W +C P S Y ++ + PS S+T K+++
Sbjct: 87 ISVGTPPFSIVAVADTGSDVIW-----TQCKPCSNCY----QQNAPMFDPSKSTTYKNVA 137
Query: 144 CSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQ 199
CS +C G+SC + + C Y++ Y ++ S L V+ + + SG A +V
Sbjct: 138 CSSPVCSYSGDGSSCSDDSE-CLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTV- 195
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF------DK 253
IGCG +G + V+ G++GLG G S+ + L A FS C
Sbjct: 196 ----IGCGHDNAGTFNANVS--GIVGLGRGPASLVTQLGPA--TGGKFSYCLIPIGTGST 247
Query: 254 DDSGRIFFGDQGPATQQST--SFLASNGKYIT-YIIGVETCCI---------GSSCLKQT 301
+DS ++ FG + T + + S+ +Y T Y + +E + G+S L
Sbjct: 248 NDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGE 307
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
S I+DSG++ T+LP + + + + ++ C+ +++ ++P V
Sbjct: 308 S-NIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDY-EMPPV 365
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD----IGTIGQ-NFMTGYRVVFD 416
+ F + + +FV + CLA D G I Q NF+ GY D
Sbjct: 366 TMHFEGADVPLQRENLFVRLSDDTI---CLAFGSFPDDNIFIYGNIAQSNFLVGY----D 418
Query: 417 RENLKLGWSHSNC 429
+NL + + ++C
Sbjct: 419 IKNLAVSFQPAHC 431
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 162/395 (41%), Gaps = 86/395 (21%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLD-RDLNEYSPSASSTSKHLSCS 145
+T+ L D G L+W PC C C+ + +D + + P SS+SK + C
Sbjct: 92 QTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRFVPKLSSSSKLVGCQ 145
Query: 146 HRLC------DLGTSCQ--NPK-----QPCP-YTMDYYTENTSSSGLLVEDILHLISGGD 191
+ C D+ + C+ NPK Q CP Y + Y + S++GLL+ + L D
Sbjct: 146 NPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY--GSGSTAGLLLSETLDF---PD 200
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
+ N ++GC +L P G+ G G G S+PS + GL + ++ +
Sbjct: 201 KXIPN-----FVVGCS------FLSIHQPSGIAGFGRGSESLPS---QMGLKKFAYCLAS 246
Query: 252 DKDD----SGRIFFGDQG---------PATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
K D SG++ G P Q + +++N Y + + +G+ +
Sbjct: 247 RKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPS--VSNNAYKEYYYLNIRKIIVGNQAV 304
Query: 299 KQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND-----TITSFEGY 342
K +K +I+DSGS+FTF+ K V E +A EF++Q+ + + + G
Sbjct: 305 K-VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTG- 362
Query: 343 PWKCCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQ 394
+ C+ S ++ K P + F P NN F + + V T V
Sbjct: 363 -LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGG 421
Query: 395 PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G +G + V +D N +LG+ C
Sbjct: 422 GGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 144/352 (40%), Gaps = 53/352 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-C 155
D G D W V+C P + Y ++ + P++SST ++SC+ C DL S C
Sbjct: 197 DTGSDTTW-----VQCQPCVVACYEQREK---LFDPASSSTYANVSCAAPACSDLDVSGC 248
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 249 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNDGLFG 298
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPATQQSTS 273
+ GL+GLG G+ S+P + G F+ C +G + FG P +T
Sbjct: 299 EAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTP 353
Query: 274 FLASNGKYITYIIGVETCCIGSSCL--KQTSFKA---IVDSGSSFTFLPKEVYETIAAEF 328
L NG Y +G+ +G L + F A IVDSG+ T LP Y ++
Sbjct: 354 MLTGNGPTF-YYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL---- 408
Query: 329 DRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP--VFV 379
R + GY CY + +P+V L+F + V+ ++
Sbjct: 409 -RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 467
Query: 380 IYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +QV CLA + GD+G +G + + V +D +G+S C
Sbjct: 468 VSASQV----CLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 153/374 (40%), Gaps = 39/374 (10%)
Query: 80 QFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNS--LDRDLNEYS----P 133
QF++ P+Q L D G DL W+ C R + AS S + R N S P
Sbjct: 113 QFRVGTPAQ---PFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIP 169
Query: 134 SASSTSK-HLSCSHRLCDLGTSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGD 191
+S T K ++ S C GT+ P PC Y DY Y + +S+ G++ D + G
Sbjct: 170 CSSDTCKSYVPFSLANCSAGTT---PPAPCGY--DYRYKDKSSARGVVGTDAATIALSGS 224
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
+ + + V++GC G + DG++ LG IS S A FS C
Sbjct: 225 GSDRKAKLQEVVLGCTTSYDGQSFQ--SSDGVLSLGNSNISFASR--AAARFGGRFSYCL 280
Query: 252 -----DKDDSGRIFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK------ 299
++ + + FG G A S + L + + Y + V+ + L
Sbjct: 281 VDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVW 340
Query: 300 --QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLP 356
+ + AI+DSG+S T L Y+ + A +Q+ + P++ CY ++++R P
Sbjct: 341 DVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLA-RVPRVTMDPFEYCYNWTATRRPP 399
Query: 357 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQP-VDGDIGTIGQNFMTGYRVVF 415
+P +++ F + +VI V C+ +Q V + IG + F
Sbjct: 400 AVPRLEVRFAGSARLRPPTKSYVIDAAPGVK--CIGLQEGVWPGVSVIGNILQQEHLWEF 457
Query: 416 DRENLKLGWSHSNC 429
D N L + S C
Sbjct: 458 DLANRWLRFQESRC 471
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 100/435 (22%), Positives = 162/435 (37%), Gaps = 72/435 (16%)
Query: 14 WLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQ 73
W L + +T L+ + + S N ++ F Y V L++ Q
Sbjct: 40 WELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTPPQ 99
Query: 74 KMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 133
+++ L D G D+ W C RC P SA + ++ L + P
Sbjct: 100 EVQ------------------LTLDTGSDITWT--QCKRC-PASACF----NQTLPLFDP 134
Query: 134 SASSTSKHLSCSHRLCDLGTSC----QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 189
SASS+ L CS C+ C +PC Y++ Y + + S G + ++ SG
Sbjct: 135 SASSSFASLPCSSPACETTPPCGGGNDATSRPCNYSIS-YGDGSVSRGEIGREVFTFASG 193
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
+V ++ GCG G + G+ G G G +S+PS L K G +FS
Sbjct: 194 TGEGSSAAVPG-LVFGCGHANRGVFTSNET--GIAGFGRGSLSLPSQL-KVG----NFSH 245
Query: 250 CFDK---DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI 306
CF + + G G A ++ G Y + S
Sbjct: 246 CFTTITGSKTSAVLLGLPGVAPPSASPLGRRRGSY-----------------RCRSTPRS 288
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCYKSSSQRLPKLPSVKLMF 365
+SG+S T LP Y + EF QV + P+ C P +P++ L F
Sbjct: 289 SNSGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHF 348
Query: 366 -------PQNN-SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDR 417
PQ N F V + ++++ CLA+ ++G +G V++D
Sbjct: 349 EGATMRLPQENYVFEVVDDDDAGNSSRII---CLAV--IEGGEIILGNIQQQNMHVLYDL 403
Query: 418 ENLKLGWSHSNCQDL 432
+N KL + + C L
Sbjct: 404 QNSKLSFVPAQCDQL 418
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 144/352 (40%), Gaps = 53/352 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-C 155
D G D W V+C P + Y ++ + P++SST ++SC+ C DL S C
Sbjct: 198 DTGSDTTW-----VQCQPCVVACYEQREK---LFDPASSSTYANVSCAAPACSDLDVSGC 249
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 250 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNDGLFG 299
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPATQQSTS 273
+ GL+GLG G+ S+P + G F+ C +G + FG P +T
Sbjct: 300 EAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPATTTTP 354
Query: 274 FLASNGKYITYIIGVETCCIGSSCL--KQTSFKA---IVDSGSSFTFLPKEVYETIAAEF 328
L NG Y +G+ +G L + F A IVDSG+ T LP Y ++
Sbjct: 355 MLTGNGPTF-YYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL---- 409
Query: 329 DRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP--VFV 379
R + GY CY + +P+V L+F + V+ ++
Sbjct: 410 -RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 468
Query: 380 IYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +QV CLA + GD+G +G + + V +D +G+S C
Sbjct: 469 VSASQV----CLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 141/349 (40%), Gaps = 38/349 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G DL+W C C+ C ++ PS S++ K +SC + C L S
Sbjct: 109 DTGSDLMWTQCLPCLSCYKQKNPMFD----------PSKSTSFKEVSCESQQCRLLDTVS 158
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C P++ C ++ Y + + + G++ + L L S N+ + + +++ GCG SG +
Sbjct: 159 CSQPQKLCDFSYGY-GDGSLAQGVIATETLTLNS---NSGQPTSILNIVFGCGHNNSGTF 214
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQ 269
+ GL G G +S+ S + FS C D + +I FG + +
Sbjct: 215 NENEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSG 272
Query: 270 QS--TSFLASNGKYITYIIGVETCCIG-------SSCLKQTSFKAIVDSGSSFTFLPKEV 320
++ L + Y + ++ +G SS T +D+G+ T LP++
Sbjct: 273 SDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDF 332
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + + + CY+S++ L P + F + + F+
Sbjct: 333 YNRLVQGVKEAIPMEPVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFIS 390
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V +C A+QP+DGD G G + + FD + K+ + +C
Sbjct: 391 PKEGV---YCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 160/392 (40%), Gaps = 85/392 (21%)
Query: 98 DFGCDLLWIPC----DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--- 150
D G DL+W+PC C+ C SAS N + + P SS+ ++C+ C
Sbjct: 2 DTGSDLVWVPCTRNYSCINCPEDSAS--NGV------FLPRMSSSLHLVTCADSNCKTLY 53
Query: 151 ------LGTSCQNPKQPC-----PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
L SC + C PY + Y S++GLL+ + L+L L+N
Sbjct: 54 GNNTELLCQSCAGSLKNCSETCPPYGIQY--GRGSTAGLLLTETLNL------PLENGEG 105
Query: 200 ASVI----IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----- 250
A I +GC + S P G+ G G G +S+PS L + + ++ F+ C
Sbjct: 106 ARAITHFAVGCSIVSS------QQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHR 158
Query: 251 FDKDDSGRIF-FGDQGPATQ---QSTSFLASN-----GKY-ITYIIGVETCCIGSSCLKQ 300
FD+++ + GD+ T FL ++ +Y + Y IG+ IG LKQ
Sbjct: 159 FDEENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQ 218
Query: 301 TSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVN----DTITSFEGYPWK 345
K I+DSG++FT E+++ IAA F Q+ + G
Sbjct: 219 LPSKLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTG--MG 276
Query: 346 CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG----DIG 401
CY + LP F + V+ + Y + + CL + G D G
Sbjct: 277 LCYDVTGLENIVLPEFAFHFKGGSDMVLPVANYFSYFSSFDS-ICLTMISSRGLLEVDSG 335
Query: 402 ---TIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+G + + +++DRE +LG++ C+
Sbjct: 336 PAVILGNDQQQDFYLLYDREKNRLGFTQQTCK 367
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 150/365 (41%), Gaps = 47/365 (12%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
SK L D G DL W+ CD C+ C +L RD+ Y P ++ S+
Sbjct: 63 SKVFELDIDTGSDLTWVQCDVECIGC---------TLPRDM-LYRPHNNAVSREDPLCAA 112
Query: 148 LCDLGTSC-QNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQASVII 204
L LG +NP C Y ++Y ++ SS G+LV+D+ + L +G + ++
Sbjct: 113 LSSLGKFIFKNPNDQCAYEVEY-ADHGSSVGVLVKDLVPMRLTNG------KRISPNLGF 165
Query: 205 GCGMKQSGGYLD---GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIF 260
GCG Q G L +A G++GL + ++ S L+ G + N C + F
Sbjct: 166 GCGYDQENGDLQQPPSIA--GVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGGFLFF 223
Query: 261 FGDQGPATQQSTSFLASN--GKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
GD P++ S + + N GKY + G + DSGSS+T+
Sbjct: 224 GGDVVPSSGMSWTPILRNSEGKYSS---GPAEVYFNGRAVGIGGLTLTFDSGSSYTYFNS 280
Query: 319 EVYETIAA--EFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP---------Q 367
+VY I + D + N + + + C+K + + V+ F +
Sbjct: 281 QVYRAIEKLLKNDLKGNPLKLASDDKTLELCWKGP-KPFESVVDVRNFFKPLAMSFKNSK 339
Query: 368 NNSFVVNNPVFVIYGT--QVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
N F + ++I V G + G++ IG M VV+D E ++GW+
Sbjct: 340 NVQFQIPPEAYLIISEFGNVCLGILDGSKEGMGNVNIIGDISMLNKIVVYDNERERIGWA 399
Query: 426 HSNCQ 430
SNC
Sbjct: 400 SSNCN 404
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 142/350 (40%), Gaps = 49/350 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-C 155
D G D W V+C P + Y ++ + P++SST ++SC+ C DL S C
Sbjct: 201 DTGSDTTW-----VQCQPCVVACYEQREK---LFDPASSSTYANVSCAAPACSDLDVSGC 252
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 253 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNDGLFG 302
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPATQQSTS 273
+ GL+GLG G+ S+P + G F+ C +G + FG P +T
Sbjct: 303 EAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTP 357
Query: 274 FLASNGKYITYIIGVETCCIGSSCL--KQTSFKA---IVDSGSSFTFLPKEVYETIAAEF 328
L NG Y +G+ +G L + F A IVDSG+ T LP Y ++
Sbjct: 358 MLTGNGPTF-YYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL---- 412
Query: 329 DRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP--VFV 379
R + GY CY + +P+V L+F + V+ ++
Sbjct: 413 -RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 471
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +QV F A GD+G +G + + V +D +G+S C
Sbjct: 472 VSASQVCLAF--AGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 148/365 (40%), Gaps = 61/365 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G ++LW+ C C RC + + PS SST L C++ +C S
Sbjct: 117 DTGSNILWVRCAPCKRCTQQNGPLLD----------PSKSSTYASLPCTNTMCHYAPSAY 166
Query: 157 -NPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
N C Y + Y T SS+G+L + I H G NA+ SV+ GC ++G
Sbjct: 167 CNRLNQCGYNLSYAT-GLSSAGVLATEQLIFHSSDEGVNAVP-----SVVFGCS-HENGD 219
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-----GRIFFGDQGPAT 268
Y D G+ GLG G + S + + G + FS C ++ FG++
Sbjct: 220 YKDRRFT-GVFGLGKG---ITSFVTRMG---SKFSYCLGNIADPHYGYNQLVFGEKANFE 272
Query: 269 QQSTSFLASNGKYITYI----IGVETCCIGSSC--LKQTSFKAIVDSGSSFTFLPKEVYE 322
ST NG Y + +G + I S+ +K A++DSG++ T+L + +
Sbjct: 273 GYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTALTWLAESAFR 332
Query: 323 TIAAEFDRQVNDTITSFEGYPWK---CCYKSS-SQRLPKLPSVKLMFPQNNSFVVNNPVF 378
+ E + ++ + F W+ CYK + SQ L P V F ++
Sbjct: 333 ALDNEVRQLLDGVLMPF----WRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESM 388
Query: 379 VIYGTQVVTGFCLAIQPVDGD---------IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T + C+A++ IG + Q + Y + +D + KL + +C
Sbjct: 389 FYQATPDI--LCIAVRQASAYGNDFKSFSVIGLMAQQY---YNMAYDLNSNKLFFQRIDC 443
Query: 430 QDLND 434
Q L D
Sbjct: 444 QLLVD 448
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 103/438 (23%), Positives = 175/438 (39%), Gaps = 64/438 (14%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
++ +F + S A F+ +LIHR S + ++N+ NA + +Y
Sbjct: 10 LFFTIFCFIISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFY 69
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN--DFGCDLLWIPCD-CVRCAPLSAS 119
+ L+S Q ++ M + S G+ + D G DL+W+ C+ C +C P
Sbjct: 70 KYSLTSTPQSTVNSDKGEYLMSY-SIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITP 128
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 179
++ PS SS+ +++ C L +C + + T + G L
Sbjct: 129 IFD----------PSLSSSYQNIPC------LSDTCHSMR----------TTSCDVRGYL 162
Query: 180 VEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 238
+ L L D+ SV +IGCG + +G + G++GLG G +S+PS L
Sbjct: 163 SVETLTL----DSTTGYSVSFPKTMIGCGYRNTGTFHG--PSSGIVGLGSGPMSLPSQLG 216
Query: 239 KAGLIRNSFSMCFDK---DDSGRIFFGD------QGPATQQSTSFLASNGKYIT---YII 286
+ I FS C + + ++ FGD G T A +G Y+T + +
Sbjct: 217 TS--IGGKFSYCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSV 274
Query: 287 GVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 346
G + G ++DSG++FTFLP +VY + +N +K
Sbjct: 275 GNKLIEFGGPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKL 334
Query: 347 CYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDI-GTIG 404
CY + + P + F + + F+ +V G CLA P I G +
Sbjct: 335 CYNVAYHGF-EAPLITAHFKGADIKLYYISTFI----KVSDGIACLAFIPSQTAIFGNVA 389
Query: 405 -QNFMTGYRVVFDRENLK 421
QN + GY +V + K
Sbjct: 390 QQNLLVGYNLVQNTVTFK 407
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 147/365 (40%), Gaps = 49/365 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG-TSC 155
D G DL W +CAP + + SL R ++PS S T L C R+C DL +SC
Sbjct: 103 DTGSDLTW-----TQCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLTWSSC 153
Query: 156 QNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Y Y +++ ++G L D S D+A+ + + GCG+ +G
Sbjct: 154 GEQSWGNGICVYAY-AYADHSITTGHLDSDTFSFASA-DHAIGGASVPDLTFGCGLFNNG 211
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSGRIFFG------- 262
++ G+ G G +S+P A L ++FS CF + +F G
Sbjct: 212 IFVSN--ETGIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYS 264
Query: 263 ---DQGPATQQSTSFLASNGKYI-TYIIGVETCCIGSSCL----------KQTSFKAIVD 308
G QST+ + + + Y I ++ +G++ L + + IVD
Sbjct: 265 DAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVD 324
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
SG+ T LP+ VY + F Q T+ + + C+ P +P++ L F
Sbjct: 325 SGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGA 384
Query: 369 N-SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
N +F I + CLAI + D+ IG V++D N L + +
Sbjct: 385 TLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPA 443
Query: 428 NCQDL 432
C +
Sbjct: 444 RCNKI 448
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 147/365 (40%), Gaps = 49/365 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG-TSC 155
D G DL W +CAP + + SL R ++PS S T L C R+C DL +SC
Sbjct: 129 DTGSDLTW-----TQCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLTWSSC 179
Query: 156 QNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Y Y +++ ++G L D S D+A+ + + GCG+ +G
Sbjct: 180 GEQSWGNGICVYAY-AYADHSITTGHLDSDTFSFASA-DHAIGGASVPDLTFGCGLFNNG 237
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSGRIFFG------- 262
++ G+ G G +S+P A L ++FS CF + +F G
Sbjct: 238 IFVSN--ETGIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYS 290
Query: 263 ---DQGPATQQSTSFLASNGKYI-TYIIGVETCCIGSSCL----------KQTSFKAIVD 308
G QST+ + + + Y I ++ +G++ L + + IVD
Sbjct: 291 DAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVD 350
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
SG+ T LP+ VY + F Q T+ + + C+ P +P++ L F
Sbjct: 351 SGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGA 410
Query: 369 N-SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
N +F I + CLAI + D+ IG V++D N L + +
Sbjct: 411 TLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPA 469
Query: 428 NCQDL 432
C +
Sbjct: 470 RCNKI 474
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 162/395 (41%), Gaps = 62/395 (15%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F +F K SL D G DL WI C C+ C S YY+ P
Sbjct: 192 LGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYD----------P 241
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL- 186
SS+ +++SC C L ++ C+ Q CPY +Y + ++++G + +
Sbjct: 242 KDSSSFRNISCHDPRCQLVSAPDPPKPCKAENQSCPYFY-WYGDGSNTTGDFALETFTVN 300
Query: 187 --ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
G + LK+ +V+ GCG G + GL L S L
Sbjct: 301 LTTPNGTSELKHV--ENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYG 353
Query: 245 NSFSMCF-DKDD----SGRIFFG-DQGPATQQSTSFLASNGKY-----ITYIIGVETCCI 293
SFS C D++ S ++ FG D+ + + +F + G Y + +++ +
Sbjct: 354 QSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMV 413
Query: 294 GSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY- 342
LK + + I+DSG++ T+ + YE I F R++ EG
Sbjct: 414 DDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKG-YQLVEGLP 472
Query: 343 PWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV---FVIYGTQVVTGFCLAI--QPVD 397
P K CY S +LP ++F + V N PV F+ +VV CLAI P
Sbjct: 473 PLKPCYNVSGIEKMELPDFGILFA--DEAVWNFPVENYFIWIDPEVV---CLAILGNPRS 527
Query: 398 GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ IG + +++D + +LG++ C D+
Sbjct: 528 A-LSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 561
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 147/365 (40%), Gaps = 49/365 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG-TSC 155
D G DL W +CAP + + SL R ++PS S T L C R+C DL +SC
Sbjct: 129 DTGSDLTW-----TQCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLTWSSC 179
Query: 156 QNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Y Y +++ ++G L D S D+A+ + + GCG+ +G
Sbjct: 180 GEQSWGNGICVYAY-AYADHSITTGHLDSDTFSFASA-DHAIGGASVPDLTFGCGLFNNG 237
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSGRIFFG------- 262
++ G+ G G +S+P A L ++FS CF + +F G
Sbjct: 238 IFVSN--ETGIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYS 290
Query: 263 ---DQGPATQQSTSFLASNGKYI-TYIIGVETCCIGSSCL----------KQTSFKAIVD 308
G QST+ + + + Y I ++ +G++ L + + IVD
Sbjct: 291 DAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVD 350
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
SG+ T LP+ VY + F Q T+ + + C+ P +P++ L F
Sbjct: 351 SGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGA 410
Query: 369 N-SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
N +F I + CLAI + D+ IG V++D N L + +
Sbjct: 411 TLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPA 469
Query: 428 NCQDL 432
C +
Sbjct: 470 RCNKI 474
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 148/352 (42%), Gaps = 41/352 (11%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD------ 150
D G D W+ C C C Y D + P+ASST + C R C
Sbjct: 157 DTGSDQSWVQCKPCADC-------YEQRD---PVFDPTASSTYSAVPCGARECQELASSS 206
Query: 151 -LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
+ + CPY + Y +++ + G L D L L + ++V + GCG
Sbjct: 207 SSRNCSSDNNKNCPYEVS-YDDDSHTVGDLARDTLTLSPSPSPSPADTVPG-FVFGCGHS 264
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
+G + + DGL+GLGLG+ S+PS + A +FS C S + G A +
Sbjct: 265 NAGTFGE---VDGLLGLGLGKASLPSQV--AARYGAAFSYCLPSSPSAAGYLSFGGAAAR 319
Query: 270 QSTSF--LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEVY 321
+ F + + +Y + + + +K T+ I+DSG++F+ LP Y
Sbjct: 320 ANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFSRLPPSAY 379
Query: 322 ETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
+ + F + ++ P + CY + ++P+V+L+F + + V +P
Sbjct: 380 AALRSSFRSAMGR--YRYKRAPSSPIFDTCYDFTGHETVRIPAVELVF-ADGATVHLHPS 436
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V+Y V CLA P + D+G +G V++D + ++G+ C
Sbjct: 437 GVLYTWNDVAQTCLAFVP-NHDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 140/349 (40%), Gaps = 38/349 (10%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G DL+W C C+ C ++ PS S++ K +SC + C L S
Sbjct: 109 DTGSDLMWTQCLPCLSCYKQKNPMFD----------PSKSTSFKEVSCESQQCRLLDTVS 158
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C P++ C ++ Y + + + G++ + L L S N+ + +++ GCG SG +
Sbjct: 159 CSQPQKLCDFSYGY-GDGSLAQGVIATETLTLNS---NSGQPXSIXNIVFGCGHNNSGTF 214
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQ 269
+ GL G G +S+ S + FS C D + +I FG + +
Sbjct: 215 NENEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSG 272
Query: 270 QS--TSFLASNGKYITYIIGVETCCIG-------SSCLKQTSFKAIVDSGSSFTFLPKEV 320
++ L + Y + ++ +G SS T +D+G+ T LP++
Sbjct: 273 SXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDF 332
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + + + CY+S++ L P + F + + F+
Sbjct: 333 YNRLVQGVKEAIPMEPVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFIS 390
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V +C A+QP+DGD G G + + FD + K+ + +C
Sbjct: 391 PKEGV---YCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 109/247 (44%), Gaps = 32/247 (12%)
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRI 259
I GCG + + G GV+ GL+GLG ++S+ S +G+ FS C ++ SG +
Sbjct: 165 IFGCG-RNNKGLFGGVS--GLMGLGRSDLSLIS--QTSGIFGGVFSYCLPSTERKGSGSL 219
Query: 260 FFGDQGPATQQST-----SFLASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGS 311
G + S+ + + Y Y I + IG L+ S + +VDSG+
Sbjct: 220 ILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGT 279
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLM 364
T LP +Y+ + AEF +Q F G+P C+ S+ + +P++K+
Sbjct: 280 VITRLPPTIYKALKAEFLKQ-------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMH 332
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKL 422
F N V+ + + CLA+ ++ ++ +G RV++D + K+
Sbjct: 333 FEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKV 392
Query: 423 GWSHSNC 429
G++ C
Sbjct: 393 GFALETC 399
>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
Length = 656
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 98/416 (23%), Positives = 168/416 (40%), Gaps = 52/416 (12%)
Query: 61 YYQVLLSSD---VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPL 116
Y + L +SD V + G + ++ + +S+ D G + PC C +C
Sbjct: 73 YRRSLFTSDQNEVVPLNLGMGTHYAWIYVGTPPQRVSIIIDTGSGMTAFPCSGCDQCGNH 132
Query: 117 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 176
+ +N+ + SS+ + +SC+HR C NP +PC Y E +S S
Sbjct: 133 TDIPFNT----------NLSSSIQPISCNHRTYFSCAYCTNPTEPCR----TYMEGSSWS 178
Query: 177 GLLVEDILHL---ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGL-GLGEIS 232
++EDI++L S D L +S + GC K++G ++ VA DG++G+ G
Sbjct: 179 AKVMEDIVYLGDVASAKDTNLHHSYSTRYMFGCQNKETGLFIPQVA-DGIMGIHNNGNDI 237
Query: 233 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFG------DQGPATQQSTSFLASNGKYITYI- 285
V L + + N+F++CF G G G T + Y ++
Sbjct: 238 VTKLFREKKIPSNTFTLCFSP-RGGYFALGAMDTSRHAGEVTYARINDAYGENYYAVFMT 296
Query: 286 -IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 344
I V I S++ IVDSG++ + + + + D N T
Sbjct: 297 DIRVGGHSIDIDMKATNSYRYIVDSGTTNSIISGRAGQAL---MDLYRNLTHLKNPLNDN 353
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVV-----TGFCLAIQPVDGD 399
C S SQ + +LP+++ + N + + I +Q + C I
Sbjct: 354 DCILLSPSQ-IEQLPTLQFVMEGVNG---DRAILEILASQYLQKGENNKTCFNILVDTRK 409
Query: 400 I-GTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPAN 454
I G IG + M + V+FDR K+G+ +NC D P + N +P++
Sbjct: 410 IGGVIGASMMMNHDVIFDRSQNKVGFVPANCTFAGDTE-------PNSHKNAIPSD 458
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 140/364 (38%), Gaps = 53/364 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--- 154
D G DL+W CD A S + Y P+ASST L CS RLC S
Sbjct: 118 DTGSDLIWTKCDAGGGAAWGGS---------SSYHPNASSTFTRLPCSDRLCAALRSYSL 168
Query: 155 --CQNPKQPCPYTMDYYTENTS--SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C C Y Y + + G L + L GGD V GC
Sbjct: 169 ARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTL--GGDAV------PGVGFGCTTAL 220
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPAT 268
G Y +G GL+GLG G +S+ S L AG +F C D S + FG T
Sbjct: 221 EGDYGEGA---GLVGLGRGPLSLVSQL-DAG----TFMYCLTADASKASPLLFGALATMT 272
Query: 269 Q-----QSTSFLASNGKYITYIIGVETCCIGS--SCLKQTSFKAIVDSGSSFTFLPKEVY 321
QST LAS Y + + + IGS + + DSG++ T+L + Y
Sbjct: 273 GAGAGVQSTGLLAST---TFYAVNLRSITIGSATTAGVGGPGGVVFDSGTTLTYLAEPAY 329
Query: 322 ETIAAEFDRQVNDTITSFEG-YPWKCCY-KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
A F Q ++T EG Y ++ CY K S RL +P++ L F + +V
Sbjct: 330 TEAKAAFLSQTT-SLTPVEGRYGFEACYEKPDSARL--IPAMVLHFDGGADMALPVANYV 386
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLN-DGTKS 438
+ +V G + + IG Y V+ D L + +NC +G
Sbjct: 387 V---EVDDGVVCWVVQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANCDSYKANGASG 443
Query: 439 PLTP 442
L P
Sbjct: 444 SLPP 447
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 146/360 (40%), Gaps = 53/360 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G D++W+ C C RC D+ + P S + + CS LC D G
Sbjct: 160 DTGSDVVWLQCAPCRRC----------YDQSGQVFDPRRSRSYGAVGCSAPLCRRLDSG- 208
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C ++ C Y + Y + + ++G + L G + A + +GCG G
Sbjct: 209 GCDLRRKACLYQV-AYGDGSVTAGDFATETLTFAGG-------ARVARIALGCGHDNEGL 260
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR-------IFFGDQG 265
+ VA GL+GLG G +S P+ +++ SFS C D+ S + FG
Sbjct: 261 F---VAAAGLLGLGRGSLSFPAQISR--RYGRSFSYCLVDRTSSANPASHSSTVTFGSGA 315
Query: 266 PATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSG 310
+ + SF + N + Y ++G+ S + + + IVDSG
Sbjct: 316 VGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSG 375
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNN 369
+S T L + Y + F S G+ + CY S +++ K+P+V + F
Sbjct: 376 TSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGA 435
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++I T FC A DG + IG G+RVVFD + ++G+ C
Sbjct: 436 EAALPPENYLIPVDSKGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 142/353 (40%), Gaps = 33/353 (9%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++T +L D G D+ WI +C P S Y D + P+ S+T + C H C
Sbjct: 130 AQTYTLMFDTGSDVSWI-----QCLPCSGHCYKQHD---PIFDPTKSATYSAVPCGHPQC 181
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
+ C Y + Y + +S++G+L + L L S AL GCG
Sbjct: 182 AAAGGKCSSNGTCLYKVQY-GDGSSTAGVLSHETLSLTSA--RALPG-----FAFGCGET 233
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
G + D DGLIGLG G++S+ S A + S+ + G + G PA+
Sbjct: 234 NLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASG 290
Query: 270 ----QSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEV 320
+ T+ + Y + + + +G L T ++DSG+ T+LP E
Sbjct: 291 SDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPPEA 350
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + F + + P+ CY + Q +P V F +SF ++ +I
Sbjct: 351 YTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFGVLI 410
Query: 381 Y--GTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ T TG CLA +P +G +++D K+G+ +C
Sbjct: 411 FPDDTAPATG-CLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 154/385 (40%), Gaps = 69/385 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVR--CA--PLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+T+ L D G DL W+ C + C+ P +++ L R +SP+ C
Sbjct: 94 QTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTF---LARHSTTFSPT--------HCFS 142
Query: 147 RLCDLGTSCQNPKQPCPYTMDY--------YTENTSSSGLLVEDI--LHLISGGDNALKN 196
LC L NP PC +T + Y++ + +SG ++ L+ SG + LK
Sbjct: 143 SLCQL-VPQPNP-NPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLK- 199
Query: 197 SVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 251
S+ GCG SG L G + G++GLG G IS S L + SFS C
Sbjct: 200 ----SIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRR--FGRSFSYCLLD 253
Query: 252 ---DKDDSGRIFFGDQGPATQQSTSFLASNGKYIT------YIIGVETCCIGSSCL---- 298
+ + GD + + S ++ I Y I ++ + L
Sbjct: 254 YTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDP 313
Query: 299 ------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCY 348
+ + ++DSG++ TFL + Y I + F R+V + G + C
Sbjct: 314 SVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCV 373
Query: 349 KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDIG---TIG 404
+ P+ P + L + + +P Y + G CLAIQPV+ + G IG
Sbjct: 374 NVTGVSRPRFPRLSLELGGESLY---SPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIG 430
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNC 429
G+ + FDR +LG+S C
Sbjct: 431 NLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 142/357 (39%), Gaps = 52/357 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNE---YSPSASSTSKHLSCSHRLCD-LGT 153
D G DL+W+ C S+S D D + P+ SST LSC C L
Sbjct: 121 DTGSDLVWVNC--------SSSGGGLADADAGGNVVFQPTRSSTYSQLSCQSNACQALSQ 172
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA-SVIIGCGMKQSG 212
+ + C Y Y + + + G+L + + GG K V+ V GC +G
Sbjct: 173 ASCDADSECQYQYSY-GDGSRTIGVLSTETFSFVDGGG---KGQVRVPRVNFGCSTASAG 228
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSGRIFFGDQGPAT 268
+ DGL+GLG G S+ S L I S C +D + S + FG + +
Sbjct: 229 TFRS----DGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLNFGSRAVVS 284
Query: 269 Q---QSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 325
+ ST + S+ Y + +E+ +G + + IVDSG++ TFL + +
Sbjct: 285 EPGAASTPLVPSDVDSY-YTVALESVAVGGQEVATHDSRIIVDSGTTLTFLDPALLGPLV 343
Query: 326 AEFDRQVNDTITSFEGYPWKCCY----KSSSQRLPKLPSVKLMFPQNNSFVV---NNPVF 378
E +R++ + CY KS + +P V L F + + N
Sbjct: 344 TELERRIKLQRVQPPEQLLQLCYDVQGKSETDNF-GIPDVTLRFGGGAAVTLRPENTFSL 402
Query: 379 VIYGTQVVTGFCLAIQPVDGD-----IGTIG-QNFMTGYRVVFDRENLKLGWSHSNC 429
+ GT CL + PV +G I QNF GY D + + ++ ++C
Sbjct: 403 LQEGT-----LCLVLVPVSESQPVSILGNIAQQNFHVGY----DLDARTVTFAAADC 450
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 109/247 (44%), Gaps = 36/247 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G D+LW+ C+ P S+ L DLN + ++SST+ +SCS +C
Sbjct: 89 DTGSDILWLNCNTCNNCPKSSG----LGIDLNYFDTASSSTAALVSCSDPVCSYAVQTAT 144
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVIIGCGMKQS 211
+ C + C YT Y + + +SG V D ++ + G + NS ++V+ GC QS
Sbjct: 145 SQCSSQANQCSYTFQY-GDGSGTSGYYVYDAMYFDVIMGQSVFSNS-SSTVVFGCSTYQS 202
Query: 212 GGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGD----- 263
G A DG+ G G G +SV S ++ G+ FS C SG + G+
Sbjct: 203 GDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQGSGGGILVLGEILEPN 262
Query: 264 --------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTF 315
P + +A NG+ I+ ++ + + T IVDSG++ +
Sbjct: 263 IVYTPLVPLQPHYNLNLQSIAVNGQ----ILPIDQDVFATGNNRGT----IVDSGTTLAY 314
Query: 316 LPKEVYE 322
L +E Y+
Sbjct: 315 LVQEAYD 321
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 136/350 (38%), Gaps = 43/350 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C RC Y +D + P +S T + SC R C L
Sbjct: 113 DTGSDLIWTQCKPCERC-------YKQVDP---LFDPKSSKTYRDFSCDARQCSLLDQST 162
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYL 215
C Y Y + + + G + D + L D+ + V +IGCG + G +
Sbjct: 163 CSGNICQYQYSY-GDRSYTMGNVASDTITL----DSTTGSPVSFPKTVIGCGHENDGTFS 217
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQ----GP 266
D G++GLG G +S+ S + + + FS C +S ++ FG GP
Sbjct: 218 D--KGSGIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGP 273
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGS-------SCLKQTSFKAIVDSGSSFTFLPKE 319
Q ST L+S Y + +E +G+ S L I+DSG++ T +P +
Sbjct: 274 GVQ-STPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSGTTLTIVPDD 332
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
+ ++ QV CY ++S K+P++ F + + FV
Sbjct: 333 FFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDL--KVPAITAHFTGADVKLKPINTFV 390
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
VV CLA I G + V ++ + L + ++C
Sbjct: 391 QVSDDVV---CLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDC 437
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/168 (32%), Positives = 75/168 (44%), Gaps = 21/168 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT-SC 155
D G L PC C RC P + P SSTS CS C G SC
Sbjct: 99 DTGSTLPAFPCSGCTRCGPSKTGMFK----------PELSSTSSTFGCSDARCFCGANSC 148
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y++ Y E +S+SG L ED+L + GG A+ + GC +SG
Sbjct: 149 SCNNEQCGYSIRYL-EGSSTSGFLAEDMLAVGDGGP-------AANFVFGCAQSESGLLY 200
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
+A DG+ G+G S+ L + G+I ++FSMCF G + G+
Sbjct: 201 SQIA-DGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREGVLLLGN 247
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 99/395 (25%), Positives = 160/395 (40%), Gaps = 62/395 (15%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F +F K SL D G DL WI C C+ C S YY+ P
Sbjct: 190 LGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYD----------P 239
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL- 186
SS+ +++SC C L +S C+ Q CPY +Y + ++++G + +
Sbjct: 240 KDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFY-WYGDGSNTTGDFALETFTVN 298
Query: 187 --ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
G + LK+ +V+ GCG G + GL L S L
Sbjct: 299 LTTPNGKSELKHV--ENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYG 351
Query: 245 NSFSMCF-DKDD----SGRIFFG-DQGPATQQSTSFLASNGKY-----ITYIIGVETCCI 293
SFS C D++ S ++ FG D+ + + +F + G Y + + + +
Sbjct: 352 QSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMV 411
Query: 294 GSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY- 342
LK + + I+DSG++ T+ + YE I F R++ EG
Sbjct: 412 DDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKG-YELVEGLP 470
Query: 343 PWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV---FVIYGTQVVTGFCLAI--QPVD 397
P K CY S +LP ++F + V N PV F+ VV CLAI P
Sbjct: 471 PLKPCYNVSGIEKMELPDFGILFA--DGAVWNFPVENYFIQIDPDVV---CLAILGNPRS 525
Query: 398 GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ IG + +++D + +LG++ C D+
Sbjct: 526 A-LSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 559
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/352 (24%), Positives = 150/352 (42%), Gaps = 43/352 (12%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---LGT 153
D G DL+W+ C C+ C YN ++ + P SST ++SC LC +G
Sbjct: 82 DTGSDLIWVQCVPCLGC-------YNQINP---MFDPLKSSTYTNISCDSPLCYKPYIGE 131
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
+P++ C YT Y +++ + G+L ++ + L S N K ++ GCG +G
Sbjct: 132 C--SPEKRCDYTYGY-ADSSLTKGVLAQETVTLTS---NTGKPISLQGILFGCGHNNTGN 185
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLI--RNSFSMCF-----DKDDSGRIFFGDQGP 266
+ D GLIGLG G S L+++ G + FS C D S ++ FG
Sbjct: 186 FNDHEM--GLIGLGGGPTS---LVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSE 240
Query: 267 ATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSF----KAIVDSGSSFTFLPKE 319
+ +T + +Y + + + + L S +VDSG+ LP++
Sbjct: 241 VLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILPQQ 300
Query: 320 VYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
+Y+ + E +V + IT + CY++ + K P++ F N + F
Sbjct: 301 LYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNL--KGPTLTYHFEGANLLLTPIQTF 358
Query: 379 VIYGTQVVTGFCLAIQP-VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + FCLAI + D G G T Y + FD + + + ++C
Sbjct: 359 IPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDC 410
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 133/341 (39%), Gaps = 34/341 (9%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 155
D G ++ WI C V C P ++ P+ SST +++SC+ C +S
Sbjct: 34 DTGSNVNWIQCKPCVVSCYPQQEPLFD----------PTLSSTYRNISCTSAACTGLSSR 83
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + +S+ G L + L +G N N I GCG G
Sbjct: 84 GCSGSTCVYGVT-YGDGSSTVGFLATETFTLAAG--NVFNN-----FIFGCGQNNQ-GLF 134
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFL 275
G A GLIGLG S+ S LA + + N FS C S + P + +
Sbjct: 135 TGAA--GLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNIGNPLRTPGYTAM 190
Query: 276 ASNGKYIT-YIIGVETCCIGSS--CLKQTSFKA---IVDSGSSFTFLPKEVYETIAAEFD 329
+N + T Y I + +G + L T F++ I+DSG+ T LP Y + F
Sbjct: 191 LTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLPPTAYGALRTAFR 250
Query: 330 RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF-VIYGTQVVTG 388
+ + CY S P++KL + + + VF VI +QV
Sbjct: 251 AAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVFYVISSSQVCLA 310
Query: 389 FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F A IG IG V +D ++G++ C
Sbjct: 311 F--AGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 103/405 (25%), Positives = 164/405 (40%), Gaps = 78/405 (19%)
Query: 87 SQGSKTMSLGNDFGCDLLWIPC---DCVRCA-------PLSASYYNSLDRDLNEYSPSAS 136
S S++++L D G DL+W PC +C+ C PL+ + + + S + S
Sbjct: 27 SHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSHRVSCQSPACSTAHS 86
Query: 137 STSKHLSCSHRLCDLG----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
S S H C+ C L + C + P Y Y + S L D L S
Sbjct: 87 SVSSHDLCAIARCPLDNIETSDCSSATCPPFY---YAYGDGSFIAHLHRDTL---SMSQL 140
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC- 250
LKN GC + P G+ G G G +S+P+ LA + + N FS C
Sbjct: 141 FLKN-----FTFGCA------HTALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCL 189
Query: 251 ----FDKDDSGR---IFFGDQGPATQQSTSFLAS----NGKY-ITYIIGVETCCIGSSCL 298
FDK+ + + G + + F+ + N K+ Y +G+ +G +
Sbjct: 190 VSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTI 249
Query: 299 ----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC-- 346
++ +VDSG++FT LP +Y ++ AEFDR+V K
Sbjct: 250 LAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGL 309
Query: 347 --CYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF--------CLAIQ-- 394
CY + L ++P+V F NNS V+ + Y + + G CL +
Sbjct: 310 GPCY--FLEGLVEVPTVTWHFLGNNSNVMLPRMNYFY--EFLDGEDEARRKVGCLMLMNG 365
Query: 395 ----PVDGDIGTIGQNF-MTGYRVVFDRENLKLGWSHSNCQDLND 434
+ G G I N+ G+ VV+D EN ++G++ C L D
Sbjct: 366 GDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCASLWD 410
>gi|413924528|gb|AFW64460.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 146
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 30/114 (26%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
S++++HR S+E + + WP + S EYY+ L+ SD+Q+QK + +L S
Sbjct: 28 SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80
Query: 88 QGSKTMSLGNDFGC-----------------------DLLWIPCDCVRCAPLSA 118
+G T S GND G DL W+PCDC++CAPLS
Sbjct: 81 KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG 134
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 100/421 (23%), Positives = 164/421 (38%), Gaps = 77/421 (18%)
Query: 37 EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKT-GPQFQMLFPSQGSKTMSL 95
E +K L ++ T+ P + ++ ++ V + K+ T G Q M+
Sbjct: 68 ERLKYLSTLADQKTTAVPIAPGQQVLKI--ANYVVRVKLGTPGQQMFMVL---------- 115
Query: 96 GNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LG 152
D D W+PC C C+ + + P+AS+T L CS C G
Sbjct: 116 --DTSNDAAWVPCSGCTGCSSTT-------------FLPNASTTLGSLDCSGAQCSQVRG 160
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
SC Y ++S + LV+D + L N V GC SG
Sbjct: 161 FSCPATGSSACLFNQSYGGDSSLTATLVQDAI--------TLANDVIPGFTFGCINAVSG 212
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDKDDS----GRIFFGDQG-P 266
G + P GL+GLG G IS L+++AG + + FS C S G + G G P
Sbjct: 213 G---SIPPQGLLGLGRGPIS---LISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQP 266
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFL 316
+ ++T L + + Y + + +G + T I+DSG+ T
Sbjct: 267 KSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRF 326
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNN 369
+ VY I EF +QVN I+S + C+ ++++ + P++ L F P N
Sbjct: 327 VQPVYFAIRDEFRKQVNGPISSLGAF--DTCFAATNEA--EAPAITLHFEGLNLVLPMEN 382
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
S + ++ G+ A V+ + I R++FD N +LG + C
Sbjct: 383 SLIHSS-----SGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
Query: 430 Q 430
Sbjct: 438 N 438
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 152/348 (43%), Gaps = 46/348 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTS 154
D G D+ W V+CAP + Y ++ + P++S++ LSC C D+ +
Sbjct: 169 DTGSDVSW-----VQCAPCAECY----EQTDPXFEPTSSASFTSLSCETEQCKSLDV-SE 218
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + G V + + L G +L N + IGCG G +
Sbjct: 219 CRNGT--CLYEVSY-GDGSYTVGDFVTETVTL---GSTSLGN-----IAIGCGHNNEGLF 267
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFGDQGPATQQS-T 272
+ GL+GLG G +S PS L + SFS C D+D P T + T
Sbjct: 268 ---IGAAGLLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDSTSTLDFNSPITPDAVT 319
Query: 273 SFLASNGKYITYI-IGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVY 321
+ L N T+ +G+ +G + L +TSF+ IVDSG++ T L VY
Sbjct: 320 APLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVY 379
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ F + +D T+ + CY SS+ ++P+V F N + ++I
Sbjct: 380 NVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIP 439
Query: 382 GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T FC A P D + +G G RV FD N +G+S + C
Sbjct: 440 VDSEGT-FCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 143/361 (39%), Gaps = 55/361 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ M + D D WIPC CV C S+S + PS SS+S+ L C
Sbjct: 98 AQAMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQCEAPQ 145
Query: 149 CDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C SC K C + M Y ++ L +D L L V + GC
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTY--GGSAIEAYLTQDTL--------TLATDVIPNYTFGC 194
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIFFG 262
K SG L GL+GLG G +S+ S L +++FS C SG + G
Sbjct: 195 INKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSGSLRLG 249
Query: 263 DQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVDSGS 311
+ + T+ L N + Y+ + +G + I +S L T I DSG+
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGT 309
Query: 312 SFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+T L + Y + EF R+V N TS G+ CY S PSV MF N
Sbjct: 310 VYTRLVEPAYVAMRNEFRRRVKNANATSLGGF--DTCYSGSV----VFPSVTFMFAGMNV 363
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ + + + ++ +A P V+ + I +RV+ D N +LG S
Sbjct: 364 TLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRET 423
Query: 429 C 429
C
Sbjct: 424 C 424
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 110/248 (44%), Gaps = 40/248 (16%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL W C C++C Y L N P S++ H+ C+ + C
Sbjct: 110 DTGSDLTWAQCLPCLKC-------YQQLRPIFN---PLKSTSFSHVPCNTQTCHAVDDGH 159
Query: 157 NPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
Q C Y+ Y S L E I + G +++K+ +IGCG SGG+
Sbjct: 160 CGVQGVCDYSYTYGDRTYSKGDLGFEKI----TIGSSSVKS------VIGCGHASSGGF- 208
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGDQ----GPAT 268
G A G+IGLG G++S+ S +++ I FS C +G+I FG+ GP
Sbjct: 209 -GFA-SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGV 266
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----KAIVDSGSSFTFLPKEVYETI 324
+ L S Y I +E IG+ + +F I+DSG++ T LPKE+Y+ +
Sbjct: 267 VSTP--LISKNTVTYYYITLEAISIGNE--RHMAFAKQGNVIIDSGTTLTILPKELYDGV 322
Query: 325 AAEFDRQV 332
+ + V
Sbjct: 323 VSSLLKVV 330
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 87/191 (45%), Gaps = 12/191 (6%)
Query: 76 KTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 135
+TG F + +K + D G D+LW+ +CV C ++L +L Y P
Sbjct: 86 ETGLYFTRIGIGTPAKRYYVQVDTGSDILWV--NCVSCD--GCPRKSNLGIELTMYDPRG 141
Query: 136 SSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S + + ++C + C + SC + PC Y++ Y + +S++G V D L
Sbjct: 142 SQSGELVTCDQQFCVANYGGVLPSCTS-TSPCEYSISY-GDGSSTAGFFVTDFLQYNQVS 199
Query: 191 DNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
+ ASV GCG K G +A DG++G G S+ S LA AG +R F+
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259
Query: 250 CFDKDDSGRIF 260
C D + G IF
Sbjct: 260 CLDTVNGGGIF 270
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 105/456 (23%), Positives = 174/456 (38%), Gaps = 50/456 (10%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
M+ LT++ V +L ++S + + FS LI R S + N T KS
Sbjct: 1 MHHFVLTLFFLVSTMLVDASKS-LMGFSIDLIPRHS----PISPLYNSQMTQTELVKSAA 55
Query: 61 YYQVLLSSDVQKQKMKTGPQFQML--FPSQGSKTM--SLGN---------DFGCDLLWIP 107
+ S V + P ++ P G M SLG D G DL W+
Sbjct: 56 LRSITRSKRVNFIGQISPPLSPIITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSWLQ 115
Query: 108 CD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL----GTSCQNPKQPC 162
C C C P A ++ P+ SST + C + C L C + KQ C
Sbjct: 116 CTPCKTCYPQEAPLFD----------PTQSSTYVDVPCESQPCTLFPQNQRECGSSKQ-C 164
Query: 163 PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDG 222
Y Y T+ + + G L D + S G + SV GC + + +G
Sbjct: 165 IYLHQYGTD-SFTIGRLGYDTISFSSTGMGQGGATFPKSVF-GCAFYSNFTFKISTKANG 222
Query: 223 LIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQ-QSTSFLASN 278
+GLG G +S+ S L I + FS C F +G++ FG P + ST F+ +
Sbjct: 223 FVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMINP 280
Query: 279 GKYITYIIGVETCCIGSSCL--KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 336
Y++ +E +G + Q I+DS T L + +Y + +N +
Sbjct: 281 SYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVEV 340
Query: 337 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
P++ C ++ + P F + + +F+ +V C+ + P
Sbjct: 341 AEDAPTPFEYCVRNPTNL--NFPEFVFHFTGADVVLGPKNMFIALDNNLV---CMTVVPS 395
Query: 397 DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
G I G ++V +D K+ ++ +NC +
Sbjct: 396 KG-ISIFGNWAQVNFQVEYDLGEKKVSFAPTNCSTI 430
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 146/352 (41%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL T C
Sbjct: 198 DTGSDTTW-----VQCQPCVVVCYEQREK---LFDPARSSTYANISCAAPACSDLDTRGC 249
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 250 SGGN--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNEGLFG 299
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPAT---Q 269
+ GL+GLG G+ S+P K G + F+ C SG + FG PA +
Sbjct: 300 EAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLDFGPGSPAAAGAR 353
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEVYETI 324
+T L NG Y +G+ +G L T+ IVDSG+ T LP Y ++
Sbjct: 354 LTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAAYSSL 412
Query: 325 AAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
+ F + ++ P CY + +P+V L+F Q + + + ++
Sbjct: 413 RSAFASAM--AARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLF-QGGARLDVDASGIM 469
Query: 381 YG---TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Y +QV GF A GD+G +G + + V +D +G+S C
Sbjct: 470 YAASVSQVCLGF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 137/309 (44%), Gaps = 42/309 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T +L D G + ++PC C +C ++ P SST + +SC
Sbjct: 101 QTFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFEPELSSTYQPVSC----- 145
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
++ +C N ++ C Y Y E +SSSG+L EDI IS G+ + V I GC +
Sbjct: 146 NIDCTCDNERKQCVYERQY-AEMSSSSGVLGEDI---ISFGNQS--ELVPQRAIFGCENQ 199
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGPA 267
++G A DG++GLG G++S+ L + G+I +SFS+C+ D G + G P
Sbjct: 200 ETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISPP 258
Query: 268 TQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLPKEV 320
+ F S+ + Y I ++ + L ++DSG+++ +LP+
Sbjct: 259 S--GMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTTYAYLPEAA 316
Query: 321 Y----ETIAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP--QNNSFV 372
+ + + E +Q++ ++ + SQ P+V+++F Q S
Sbjct: 317 FTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEMVFSNGQKLSLS 376
Query: 373 VNNPVFVIY 381
N +F Y
Sbjct: 377 PENYLFQYY 385
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 142/352 (40%), Gaps = 45/352 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LG 152
D G DL W +C P + Y+ + N PS S++ ++SCS CD G
Sbjct: 156 DTGSDLTW-----TQCEPCARYCYHQQEPIFN---PSKSTSYTNISCSSPTCDELKSGTG 207
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
S C Y + Y + + S G +D L L S V + + GCG G
Sbjct: 208 NSPSCSASTCVYGIQY-GDQSYSVGFFAQDKLALTS-------TDVFNNFLFGCGQNNRG 259
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCFDKDDS--GRIFFGDQG---P 266
++ GVA GLIGLG +S+ S A K G + FS C S G + FG G
Sbjct: 260 LFV-GVA--GLIGLGRNALSLVSQTAQKYGKL---FSYCLPSTSSSTGYLTFGSGGGTSK 313
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT-----SFKAIVDSGSSFTFLPKEVY 321
A + + S + S G Y + + +G L + + I+DSG+ + LP Y
Sbjct: 314 AVKFTPSLVNSQGPSF-YFLNLIAISVGGRKLSTSASVFSTAGTIIDSGTVISRLPPTAY 372
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN-NPVFVI 380
+ A F +Q++ + CY S +P + L F ++ + +F I
Sbjct: 373 SDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYI 432
Query: 381 YGTQVVTGFCLAIQPVDG--DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
V CLA DI +G + VV+D ++G++ C+
Sbjct: 433 LNISQV---CLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGCE 481
>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
Length = 947
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 155/375 (41%), Gaps = 48/375 (12%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSAS 136
G F ++ + +S+ D G PC +C C + +++ S S
Sbjct: 124 GTHFAYVYAGTPPQRVSVIIDTGSHFTAFPCSECENCGSHTDPHWDQ----------SKS 173
Query: 137 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL------ISGG 190
++S ++C C CQ K+ C ++ Y+E +S VED+L + S
Sbjct: 174 TSSHIVTCED--CHGSFRCQKDKR-CGFSQ-RYSEGSSWRAYQVEDVLWVGELTLQQSEK 229
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN-SFSM 249
N +++ + GC Q+G + +A DG++G+ ++ LAKAG I+ +FS+
Sbjct: 230 INHDESAYSVEFMFGCIESQTGLFKTQLA-DGIMGMSADSHTLVWQLAKAGKIKERTFSL 288
Query: 250 CFDKDDSGRIFFG-----DQGPATQQSTSFLASNGKYITYI--IGVETCCIGSS-CLKQT 301
CF K+ + G ++ T +NG + + I V I + Q
Sbjct: 289 CFGKNGGTMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQDPAIFQR 348
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKS------SSQRL 355
IVDSG++ T+LP+ V + +A ++R G P+ C + +S L
Sbjct: 349 GKGIIVDSGTTDTYLPRSVAKGFSAAWERAT--------GSPYANCKDNHFCMILTSAEL 400
Query: 356 PKLPSVKLMFPQNNSFVVN-NPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVV 414
LP+V + + VN P + + I + G +G N M + VV
Sbjct: 401 EALPTVTIHM--DGGLEVNVRPSGYMDALGKDNAYAPRIYLTESMGGVLGANVMLDHNVV 458
Query: 415 FDRENLKLGWSHSNC 429
FD EN +G++ C
Sbjct: 459 FDYENHLVGFAEGVC 473
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 142/351 (40%), Gaps = 42/351 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G DL W V+C P Y + ++ PS S + + +C+ LC++
Sbjct: 57 DTGSDLNW-----VQCLPCRVCY----QQPGPKFDPSKSRSFRKAACTDNLCNVSALPLK 107
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
+C C Y Y ++ ++ L E I G ++ N GCG Q+ G
Sbjct: 108 AC--AANVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPN-----FAFGCG-TQNLG 159
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS---GRIFFGDQGPATQQ 270
G A GL+GLG G +S+ S L+ N FS C +S + FG A
Sbjct: 160 TFAGAA--GLVGLGQGPLSLNSQLSHT--FANKFSYCLVSLNSLSASPLTFGSIAAAANI 215
Query: 271 STSFLASNGKYITYI-IGVETCCIGSS---------CLKQTSFKA--IVDSGSSFTFLPK 318
+ + N ++ TY + + + +G + Q++ + I+DSG++ T L
Sbjct: 216 QYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTL 275
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
Y + ++ VN Y C+ + P +P + F + + +F
Sbjct: 276 PAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLF 335
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V+ T T CLA+ G IG + VV+D E K+G++ ++C
Sbjct: 336 VLVDTSATT-LCLAMGGSQG-FSIIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 143/368 (38%), Gaps = 64/368 (17%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---LGT 153
D G DL+W+ C C RC Y + Y P S T + + C+ C
Sbjct: 110 DTGSDLIWLQCLPCRRC-------YRQV---TPLYDPRNSKTHRRIPCASPQCRGVLRYP 159
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C C Y M Y + ++SSG L D L L D + N V +GCG G
Sbjct: 160 GCDARTGGCVY-MVVYGDGSASSGDLATDTLVLPD--DTRVHN-----VTLGCGHDNEG- 210
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD------KDDSGRIFFGDQGPA 267
L A GL+G G G++S P+ LA A + FS C ++ S + FG
Sbjct: 211 LLASAA--GLLGAGRGQLSFPTQLAPA--YGHVFSYCLGDRMSRARNSSSYLVFGRTPEL 266
Query: 268 TQQSTSFLASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSGSSFT 314
+ + L +N + Y ++G + S +VDSG++ +
Sbjct: 267 PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVDSGTAIS 326
Query: 315 FLPKEVYETI--------AAEFDRQVNDTITSFEGYPWKCCYKSSSQ---RLPKLPSVKL 363
++ Y + AA R++ + + F+ CY ++PS+ L
Sbjct: 327 RFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFD-----TCYDVHGNGPGTGVRVPSIVL 381
Query: 364 MFPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
F + N + + G T FCL +Q D + +G G+ VVFD E +
Sbjct: 382 HFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVERGR 441
Query: 422 LGWSHSNC 429
+G++ + C
Sbjct: 442 IGFTPNGC 449
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 108/444 (24%), Positives = 167/444 (37%), Gaps = 60/444 (13%)
Query: 18 ESSGAETVMFSTKLIHR------FSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQ 71
E S + TKLIHR + + R + A+ S+ Y ++ D+
Sbjct: 28 EFSSIQPTRLVTKLIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDIN 87
Query: 72 KQKMK-----TGPQFQMLFPSQGSKTMSLG-NDFGCDLLWIPCDCVRCAPLSASYYNSLD 125
+ + P F + F L D G LLWI C C S +
Sbjct: 88 DLWLNLHPSASEPLFLVNFSMGQPPVPQLAIMDTGSSLLWI--QCAPCKSCSQQIIGPM- 144
Query: 126 RDLNEYSPSASSTSKHLSCSHRLCDLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
+ PS SST LSC + +C S C + Q C Y Y E S G++ +
Sbjct: 145 -----FDPSISSTYDSLSCKNIICRYAPSGECDSSSQ-CVYNQTY-VEGLPSVGVIATE- 196
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 243
LI G + +N+V +V+ GC + +G Y D G+ GLG G SV + +
Sbjct: 197 -QLIFGSSDEGRNAVN-NVLFGCSHR-NGNYKDRRFT-GVFGLGSGITSVVNQMG----- 247
Query: 244 RNSFSMCF----DKDDSGRIFFGDQGPATQ-QSTSFLASNGKYITYIIGVET----CCIG 294
+ FS C D D S +G + ST +G Y + G+ I
Sbjct: 248 -SKFSYCIGNIADPDYSYNQLVLSEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVID 306
Query: 295 SSCLKQTS--FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 352
S K+T + I+DSG++ T+L + Y + E ++ +T F + C
Sbjct: 307 PSAFKRTEKQRRVIIDSGTAPTWLAENEYRALEREVRNLLDRFLTPFMRESFLCYKGKVG 366
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNPV--FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG 410
Q L P+V F + VV+ + +YG D IG
Sbjct: 367 QDLVGFPAVTFHFAEGADLVVDTEMRQASVYGKDF------------KDFSVIGLMAQQY 414
Query: 411 YRVVFDRENLKLGWSHSNCQDLND 434
Y V +D KL + +C+ L++
Sbjct: 415 YNVAYDLNKHKLFFQRIDCELLDE 438
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 174/449 (38%), Gaps = 52/449 (11%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
+ + + L E + A FS LIHR S SK + +A + +
Sbjct: 13 VVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFR 72
Query: 63 QVLLSSD-VQKQKMKTGPQFQM-LFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSAS 119
++SD +Q + + + ++ M L+ + D G DL W C C C
Sbjct: 73 PTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHC------ 126
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT--SCQNPKQPCPYTMDYYTENTSSS 176
Y + + + P SST + SC C LG SC K+ C + Y + + +
Sbjct: 127 -YKQV---VPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKK-CTFRYSY-ADGSFTG 180
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
G L + L + S A K GCG SGG D + G++GLG GE+S+ S
Sbjct: 181 GNLASETLTVDS---TAGKPVSFPGFAFGCG-HSSGGIFDK-SSSGIVGLGGGELSLISQ 235
Query: 237 LAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQST--SFLASNGKYITYIIGVE 289
L I FS C D S RI FG G + T + L Y + +E
Sbjct: 236 LKST--INGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLE 293
Query: 290 TCCIGSSCL------KQTSFKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+G L K+T + IVDSG+++TFLP+E Y + +
Sbjct: 294 GISVGKKRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDP 353
Query: 341 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDI 400
+ CY ++++ P + F N + F+ +V C + P DI
Sbjct: 354 NGIFSLCYNTTAE--INAPIITAHFKDANVELQPLNTFMRMQEDLV---CFTVAPTS-DI 407
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G +G + V FD ++ + ++C
Sbjct: 408 GVLGNLAQVNFLVGFDLRKKRVSFKAADC 436
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 147/379 (38%), Gaps = 71/379 (18%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----- 152
D G DL+W C C C D+ + + S S T + CS LC
Sbjct: 113 DTGSDLVWTQCACTVC----------FDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPL 162
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+ C + C Y Y +++ ++G + ED D A + ++ GCGM G
Sbjct: 163 SGCAARDRSCFYAYGYM-DHSITTGKMAEDTF-TFKAPDRADTAAAVPNIRFGCGMMNYG 220
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI---FFGDQ----- 264
+ + G+ G G G +S+PS L +R FS CF + R+ G +
Sbjct: 221 LFTPNQS--GIAGFGTGPLSLPSQLK----VRR-FSYCFTAMEESRVSPVILGGEPENIE 273
Query: 265 ----GPATQQSTSFL-----ASNGKYITYIIGVETCCIGSSCL--KQTSFK--------A 305
GP QST F A G Y + + +G + L ++F
Sbjct: 274 AHATGPI--QSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGT 331
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSV 361
+DSG++ TF P+ V+ ++ F QV + +GY C + ++ P +P +
Sbjct: 332 FIDSGTAITFFPQAVFRSLREAFVAQVPLPVA--KGYTDPDNLLCFSVPAKKKAPAVPKL 389
Query: 362 KLM-------FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRV 413
L P+ N + N+ G+ C+ I GTI NF +
Sbjct: 390 ILHLEGADWELPRENYVLDNDD----DGSGAGRKLCVVILSAGNSNGTIIGNFQQQNMHI 445
Query: 414 VFDRENLKLGWSHSNCQDL 432
V+D E+ K+ ++ + C L
Sbjct: 446 VYDLESNKMVFAPARCDKL 464
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 148/342 (43%), Gaps = 51/342 (14%)
Query: 131 YSPSASSTSKHLSCSHRLC-------DLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVE 181
+ P+AS + + + C +LC G+S C N C Y++ Y ++ +S+G +
Sbjct: 35 FDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSAACTYSLSY-GDSRNSTGDFSQ 93
Query: 182 DILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
D++ L S N+ +VQ V GC G +D + G++G G +S+PS L K
Sbjct: 94 DVIFLNS--TNSSSQAVQFRDVAFGCAHSPQGFLVD-LGSLGIVGFNRGNLSLPSQL-KD 149
Query: 241 GLIRNSFSMCFDKD-----DSGRIFFGDQG-PATQQSTSFLASN----GKYITYIIGVET 290
L + FS CF +G IF GD G ++ S + L N + Y +G+ +
Sbjct: 150 RLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTS 209
Query: 291 CCIGSSCLK--QTSFK---------AIVDSGSSFTFLPKEVY----ETIAAEFDRQVNDT 335
+ L +++FK ++DSG++FT + + Y AA +
Sbjct: 210 ISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKK 269
Query: 336 ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN-NPVFV---IYGTQVVTGFCL 391
+ + G+ C S+ LP +P V+L N + +FV G +V CL
Sbjct: 270 VGAAAGFD-DCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTV--CL 326
Query: 392 AIQPVD----GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
AI G I +G + Y V +D E ++G+ ++C
Sbjct: 327 AILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 147/352 (41%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL T C
Sbjct: 197 DTGSDTTW-----VQCQPCVVVCYEQQEK---LFDPARSSTYANVSCAAPACFDLDTRGC 248
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 249 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNEGLFG 298
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPAT---Q 269
+ GL+GLG G+ S+P K G + F+ C SG + FG PA +
Sbjct: 299 EAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLDFGPGSPAAAGAR 352
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA---IVDSGSSFTFLPKEVYETI 324
+T L NG Y +G+ +G L Q+ F IVDSG+ T LP Y ++
Sbjct: 353 LTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAYSSL 411
Query: 325 AAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
+ F + ++ P CY + +P+V L+F Q + + + ++
Sbjct: 412 RSAFVSAM--AARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLF-QGGAILDVDASGIM 468
Query: 381 YG---TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
Y +QV GF A GD+G +G + + V +D +G+S C
Sbjct: 469 YAASVSQVCLGF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 124/306 (40%), Gaps = 43/306 (14%)
Query: 154 SCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
SC +PK Q C YT Y + + ++G L D + G + V GCG+
Sbjct: 50 SCGSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFVGAGASV------PGVAFGCGLF 102
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGR 258
+G + G+ G G G +S+PS L K G +FS CF D
Sbjct: 103 NNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIPSTVLLDLPAD 155
Query: 259 IFFGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK-------AIVD 308
+F QG T + + Y + ++ +GS+ L +++F I+D
Sbjct: 156 LFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIID 215
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-Q 367
SG+S T LP +VY+ + EF Q+ + C+ + SQ P +P + L F
Sbjct: 216 SGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGA 275
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLKLGWSH 426
N VF + + CLAI GD TI NF V++D +N L +
Sbjct: 276 TMDLPRENYVFEVPDDAGNSIICLAIN--KGDETTIIGNFQQQNMHVLYDLQNNMLSFVA 333
Query: 427 SNCQDL 432
+ C L
Sbjct: 334 AQCDKL 339
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 145/352 (41%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL C
Sbjct: 179 DTGSDTTW-----VQCEPCVVVCYKQQEK---LFDPARSSTYANISCAAPACSDLYIKGC 230
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G Y
Sbjct: 231 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAIKG-----FRFGCGERNEGLYG 280
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPAT------ 268
+ GL+GLG G+ S+P K G + F+ CF SG + D GP +
Sbjct: 281 EAA---GLLGLGRGKTSLPVQAYDKYGGV---FAHCFPARSSGTGYL-DFGPGSLPAVSA 333
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA---IVDSGSSFTFLPKEVYET 323
+ +T L NG Y +G+ +G L Q+ F IVDSG+ T LP Y +
Sbjct: 334 KLTTPMLVDNGPTF-YYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITRLPPAAYSS 392
Query: 324 IAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP--V 377
+ + F + + ++ P CY + +P+V L+F S V+ +
Sbjct: 393 LRSAFASAMAE--RGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHASGII 450
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +Q GF A D D+G +G + + VV+D +G+ C
Sbjct: 451 YAASVSQACLGF--AGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
Length = 681
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 85/378 (22%), Positives = 159/378 (42%), Gaps = 53/378 (14%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSAS 136
G + ++ + S+ D G L+ PC C C + + + + S
Sbjct: 65 GTHYTWVYAGTPPQRASVIADTGSALMAFPCSGCDGCGHHTDQPFQAAN----------S 114
Query: 137 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG-----D 191
ST H++C+ + C C + Y E +S +VEDI++L GG D
Sbjct: 115 STLVHITCAQKSLFQCKECHVQSDTCGISQSYM-EGSSWKASVVEDIVYL--GGESSFDD 171
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMC 250
++N GC + G ++ VA DG++GL E + + L + I N FS+C
Sbjct: 172 KEMRNRYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNLFSLC 230
Query: 251 FDKDDSGRIFFGDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK 304
F ++ G + G A + +A Y + ++ IG + K+ ++
Sbjct: 231 F-TENGGTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKEEAYT 289
Query: 305 A---IVDSGSSFTFLPK-------EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR 354
IVDSG++ ++LP+ ++++ IA D QV ++ F +++
Sbjct: 290 RGHYIVDSGTTDSYLPRALKTEFLQMFKEIAGR-DYQVGNSCKGF-----------TNKD 337
Query: 355 LPKLPSVKLM---FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
L LP+++L+ + N+ V+ + Y + +C I + G IG N M
Sbjct: 338 LASLPTIQLVMEAYGDENAEVILDVPPEQYLLESNGAYCGGIYLSENSGGVIGANLMMNR 397
Query: 412 RVVFDRENLKLGWSHSNC 429
V+FD + ++G+ ++C
Sbjct: 398 DVIFDLGDQRVGFVDADC 415
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 146/359 (40%), Gaps = 50/359 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSC 155
D G D++W+ +CAP Y S + P S + + C +C C
Sbjct: 140 DTGSDVVWL-----QCAPCRHCYAQSG----RVFDPRRSRSYAAVDCVAPICRRLDSAGC 190
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + + ++G + L G VQ V IGCG G +
Sbjct: 191 DRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGA------RVQ-RVAIGCGHDNEGLF- 241
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR--------IFFGDQGP 266
+A GL+GLG G +S PS +A++ SFS C D+ S R + FG
Sbjct: 242 --IAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAV 297
Query: 267 ATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSGS 311
A SF + N + Y +++G + Q+ + I+DSG+
Sbjct: 298 AAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGT 357
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNS 370
S T L + VYE + F S G+ + CY S +R+ K+P+V + S
Sbjct: 358 SVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGAS 417
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++I FC A+ DG + IG G+RVVFD + ++G+ +C
Sbjct: 418 VALPPENYLI-PVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 104/251 (41%), Gaps = 36/251 (14%)
Query: 91 KTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T + D G L ++PC C +C + + P T K L+C + C
Sbjct: 124 RTFQVIVDTGSTLTYVPCATCAKCGTHTGG---------TRFDP----TGKWLTCQEKQC 170
Query: 150 DLGTS---CQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C + C Y+ Y E + SG LV D +H GGD A + V
Sbjct: 171 KAAGGPGICAGGRGAAANRCTYSRTY-AEGSGVSGDLVRDKMHF--GGDIAPATNGTLDV 227
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEI-SVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
+ GC +SG D A DGLIGLG + S+P+ LA + FS+CF + G
Sbjct: 228 VFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALS 286
Query: 262 GDQGPATQQS-----TSFLASNGKYITYIIGVETCCIGSSCLKQTS-----FKAIVDSGS 311
+ PAT + T + Y++ IG + S + ++DSG+
Sbjct: 287 FGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSDLAVGYGTVMDSGT 346
Query: 312 SFTFLPKEVYE 322
+FT++P +V+
Sbjct: 347 TFTYVPTKVFH 357
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 162/401 (40%), Gaps = 88/401 (21%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
S+T+ L D G L+W PC R S ++ N+ + ++ P SS+SK + C + C
Sbjct: 94 SQTVKLIMDTGSSLVWFPCTS-RYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCKNPKC 152
Query: 150 D--LGTSCQ------NPK-----QPCP-YTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
G+S Q NP+ Q CP Y + Y +T+ GLL+ + ++
Sbjct: 153 AWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTA--GLLLSETINF--------P 202
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FD 252
N + + GC + L P+G+ G G + S+P L GL + S+ + FD
Sbjct: 203 NKTISDFLAGCSL------LSTRQPEGIAGFGRSQESLPLQL---GLKKFSYCLVSRRFD 253
Query: 253 KDDSGRIFFGDQGPATQQS-------TSF---LASNGKYI---TYIIGVETCCIGSSCLK 299
D GP+T S T F LAS Y + + +G + +K
Sbjct: 254 DSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVK 313
Query: 300 -QTSF---------KAIVDSGSSFTFLPKEVYETIAAEFDRQ-----VNDTITSFEGYPW 344
SF IVDSGS+FTF+ V+E +A EF++Q V + G
Sbjct: 314 VPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTG--L 371
Query: 345 KCCYKSSSQRLPKLPSV--------KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
+ C+ S ++ +P + K+ P +N F FV G +T +
Sbjct: 372 RPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYF-----AFVDMGVVCLTIVSDNAAAL 426
Query: 397 DGDIGT--------IGQNFMTGYRVVFDRENLKLGWSHSNC 429
GD G +G + + +D EN + G+ +C
Sbjct: 427 GGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 152/348 (43%), Gaps = 46/348 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTS 154
D G D+ W V+CAP + Y ++ + P++S++ LSC C D+ +
Sbjct: 169 DTGSDVSW-----VQCAPCAECY----EQTDPIFEPTSSASFTSLSCETEQCKSLDV-SE 218
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + G V + + L G +L N + IGCG G +
Sbjct: 219 CRNGT--CLYEVSY-GDGSYTVGDFVTETVTL---GSTSLGN-----IAIGCGHNNEGLF 267
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFGDQGPATQQS-T 272
+ GL+GLG G +S PS L + SFS C D+D P T + T
Sbjct: 268 ---IGAAGLLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDSTSTLDFNSPITPDAVT 319
Query: 273 SFLASNGKYITYI-IGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVY 321
+ L N T+ +G+ +G + L +TSF+ IVDSG++ T L VY
Sbjct: 320 APLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVY 379
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ F + +D T+ + CY SS+ ++P+V F N + ++I
Sbjct: 380 NVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIP 439
Query: 382 GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T FC A P D + +G G RV FD N +G+S + C
Sbjct: 440 VDSEGT-FCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 138/338 (40%), Gaps = 44/338 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D D +W C+ C C ++ ++ PS SST K + CS C T
Sbjct: 107 DTANDNIWFQCNPCKPCFNTTSPMFD----------PSKSSTYKTIPCSSPKCKNVENTH 156
Query: 155 CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + K+ C Y+ Y E S G L D L L S D + +++IGCG + G
Sbjct: 157 CSSDDKKVCEYSFTYGGE-AYSQGDLSIDTLTLNSNNDTPIS---FKNIVIGCGHRNKGP 212
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPAT 268
L+G G IGLG G +S S L + I FS C ++ SG++ FGD+ +
Sbjct: 213 -LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISGKLHFGDKSVVS 268
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA--------IVDSGSSFTFLPKEV 320
T I Y + +G +K + + I+DSG++ T LP+ V
Sbjct: 269 GVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGTTLTILPENV 328
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + + V +K CYK++ + L +P + F + + + F
Sbjct: 329 YSRLESIVTSMVKLERAKSPNQQFKLCYKATLKNL-DVPIITAHFNGADVHLNSLNTFYP 387
Query: 381 YGTQVVTGFCLAIQPVDGDIGTI-----GQNFMTGYRV 413
+VV C A V GTI QNF+ G+ +
Sbjct: 388 IDHEVV---CFAFVSVGNFPGTIIGNIAQQNFLVGFDL 422
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 157/373 (42%), Gaps = 70/373 (18%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ M L D G D+LW+ C CV C Y+ D + P SST L C+ R C
Sbjct: 48 RGMYLVMDTGSDILWLQCAPCVSC-------YHQCDE---VFDPYKSSTYSTLGCNSRQC 97
Query: 150 ---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQASVII 204
D+G N C Y +DY + + S+G D + L SGG + N + +
Sbjct: 98 LNLDVGGCVGNK---CLYQVDY-GDGSFSTGEFATDAVSLNSTSGGGQVVLNKIP----L 149
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGR--I 259
GCG G + V GL+GLG G +S P+ + R FS C D D + R +
Sbjct: 150 GCGHDNEGYF---VGAAGLLGLGKGPLSFPNQINSENGGR--FSYCLTGRDTDSTERSSL 204
Query: 260 FFGDQG--PA----TQQSTSFLASNGKYITYI---IGVETCCIGSSCLKQTSFK---AIV 307
FGD PA T Q+++ S Y+ +G I +S + S I+
Sbjct: 205 IFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVII 264
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-- 365
DSG+S T L Y ++ F +D + + E + CY S +P+V L F
Sbjct: 265 DSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQG 324
Query: 366 ------PQNNSFV-VNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFD 416
P +N V V+N + FCLA G IG I Q G+RV++D
Sbjct: 325 GADLKLPASNYLVPVDNS----------STFCLAFAGTTGPSIIGNIQQQ---GFRVIYD 371
Query: 417 RENLKLGWSHSNC 429
+ ++G+ S C
Sbjct: 372 NLHNQVGFVPSQC 384
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 153/396 (38%), Gaps = 71/396 (17%)
Query: 70 VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLN 129
V + +G F F + SL D G DLLW V+CAP Y +D
Sbjct: 55 VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLW-----VQCAPCLQCY----AQDTP 105
Query: 130 EYSPSASSTSKHLSCSHRLCDL-----GTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDI 183
Y+PS SST + C C L G C + C Y Y + + S G+ +
Sbjct: 106 LYAPSNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRY-ADTSLSKGVFAYE- 163
Query: 184 LHLISGGDNALKNSVQAS-VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 242
+A + V+ V GCG G + A G++GLG G +S S + A
Sbjct: 164 --------SATVDDVRIDKVAFGCGRDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA-- 210
Query: 243 IRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSF--LASNGKYIT-YIIGVETCCIG 294
N F+ C S + FGD+ +T F + SN + T Y + +E +G
Sbjct: 211 YGNKFAYCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVG 270
Query: 295 SSCLKQTSFK----------AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYP 343
L + +I DSG++ T+ Y I A FD+ V S +G
Sbjct: 271 GESLPISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQG-- 328
Query: 344 WKCCYKSSSQRLPKLPSVKLMF-------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
C + P PS ++ PQ ++ V+ V Q CLA+ +
Sbjct: 329 LDLCVDVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVD----VAPNVQ-----CLAMAGL 379
Query: 397 DGDIG---TIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G TIG + V +DRE ++G++ + C
Sbjct: 380 PSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKC 415
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 146/359 (40%), Gaps = 50/359 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSC 155
D G D++W+ +CAP Y S + P S + + C +C C
Sbjct: 146 DTGSDVVWL-----QCAPCRHCYAQSG----RVFDPRRSRSYAAVDCVAPICRRLDSAGC 196
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + + ++G + L G VQ V IGCG G +
Sbjct: 197 DRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGA------RVQ-RVAIGCGHDNEGLF- 247
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR--------IFFGDQGP 266
+A GL+GLG G +S PS +A++ SFS C D+ S R + FG
Sbjct: 248 --IAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAV 303
Query: 267 ATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSGS 311
A SF + N + Y +++G + Q+ + I+DSG+
Sbjct: 304 AAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGT 363
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNS 370
S T L + VYE + F S G+ + CY S +R+ K+P+V + S
Sbjct: 364 SVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGAS 423
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++I FC A+ DG + IG G+RVVFD + ++G+ +C
Sbjct: 424 VALPPENYLI-PVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 95/356 (26%), Positives = 156/356 (43%), Gaps = 53/356 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTS 154
D G DL+W +C P Y ++D + P +SST + +SCS + CDL G S
Sbjct: 110 DTGSDLIW-----TQCKPCDQCY----EQDAPLFDPKSSSTYRDISCSTKQCDLLKEGAS 160
Query: 155 CQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + C Y+ Y + + +SG + D + L G + + + IIGCG G
Sbjct: 161 CSGEGNKTCHYSYS-YGDRSFTSGNVAADTITL---GSTSGRPVLLPKAIIGCGHNNGGS 216
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPAT 268
+ + + G++GLG G IS+ S L I FS C + +S ++ FG G +
Sbjct: 217 FTEKGS--GIVGLGGGPISLISQLGST--IDGKFSYCLVPLSSNATNSSKLNFGSNGIVS 272
Query: 269 Q---QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSF-----KAIVDSGSSFTFLPK 318
QST ++ + Y + +E +GS +K +SF I+DSG++ T P+
Sbjct: 273 GGGVQSTPLISKDPDTF-YFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPE 331
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV- 377
+ + +++ V T CY + K PS+ F + + V NP+
Sbjct: 332 DFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDADL--KFPSITAHF--DGADVKLNPLN 387
Query: 378 -FVIYGTQVVTGFCLAIQPVDGD--IGTIGQ-NFMTGYRVVFDRENLKLGWSHSNC 429
FV V+ C A P++ G + Q NF+ GY D E + + ++C
Sbjct: 388 TFVQVSDTVL---CFAFNPINSGAIFGNLAQMNFLVGY----DLEGKTVSFKPTDC 436
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 150/374 (40%), Gaps = 56/374 (14%)
Query: 79 PQFQMLFPSQGSK--TMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSA 135
PQ ++ S GS T L D DLLWI C C+ C S L + PS
Sbjct: 82 PQAFLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQS----------LPIFDPSR 131
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 194
S T ++ +C + + N + C Y+M Y ++T S G+L ++L + D +
Sbjct: 132 SYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRY-VDDTGSKGILAREMLLFNTIYDESS 190
Query: 195 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 254
++ V+ GCG G L G G++GLG GE S+ K FS CF
Sbjct: 191 SAALH-DVVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGK------KFSYCFGSL 240
Query: 255 DS-----GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------K 299
D + GD G T+ L + + Y + +E + L
Sbjct: 241 DDPSYPHNVLVLGDDGANILGDTTPLEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNH 298
Query: 300 QTSFKA-IVDSGSSFTFLPKEVYE----TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR 354
QT I+D+G+S T L +E Y+ I F+ + S + CY + +R
Sbjct: 299 QTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFER 358
Query: 355 ---LPKLPSVKLMFPQNNSFVVN-NPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG 410
P V F + ++ +F+ V FCLA+ P G++ +IG
Sbjct: 359 DLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNV---FCLAVTP--GNLNSIGATAQQS 413
Query: 411 YRVVFDRENLKLGW 424
Y + +D E +++ +
Sbjct: 414 YNIGYDLEAMEVSF 427
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 95/372 (25%), Positives = 149/372 (40%), Gaps = 58/372 (15%)
Query: 98 DFGCDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC----DLG 152
D +L W+ C C+P +N P SS+ C+ +C LG
Sbjct: 17 DTASELTWVQGTSCTNCSPTKVPPFN----------PGLSSSFISEPCTSSVCLGRSKLG 66
Query: 153 --TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
++C C + + Y + + + G++ +I L S A S VI GC K
Sbjct: 67 FQSACNRSTGSCSFQVAYL-DGSEAYGVIAREIFSLQSWDGAA---STLGDVIFGCASKD 122
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLL---AKAGLIRNSFSMCFDK-----DDSGRIFFG 262
+D G +GL G S P+ + +K+GL + FS CF + SG I FG
Sbjct: 123 LQRPVD--FSSGTLGLNRGSFSFPAQIGSRSKSGL-SDRFSYCFPNRAEHLNSSGVIIFG 179
Query: 263 DQG-PATQQSTSFLASNGKYIT----YIIGVETCCIGSSCLK--QTSFK--------AIV 307
D G PA L + Y +G++ +G L +++FK
Sbjct: 180 DSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYF 239
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSS--QRLPKLPSVKLM 364
DSG++ +FL + + + F R+V + TS + + CY ++ RLP P V L
Sbjct: 240 DSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLH 299
Query: 365 FPQNNSFVVNNP---VFVIYGTQVVTGFCLAI----QPVDGDIGTIGQNFMTGYRVVFDR 417
F N + V + QVVT CLA G + IG Y + D
Sbjct: 300 FKNNVDMELREASVWVPLARTPQVVT-ICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDL 358
Query: 418 ENLKLGWSHSNC 429
E ++G++ +NC
Sbjct: 359 ERSRIGFAPANC 370
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 100/420 (23%), Positives = 164/420 (39%), Gaps = 75/420 (17%)
Query: 37 EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKT-GPQFQMLFPSQGSKTMSL 95
E +K L ++ T+ P + ++ ++ V + K+ T G Q M+
Sbjct: 68 ERLKYLSTLADQKTTAVPIAPGQQVLKI--ANYVVRVKLGTPGQQMFMVL---------- 115
Query: 96 GNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGT 153
D D W+PC C S++ + P+AS+T L CS C G
Sbjct: 116 --DTSNDAAWVPCS--GCTGFSST----------TFLPNASTTLGSLDCSGAQCSQVRGF 161
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
SC Y ++S + LV+D + L N V GC SGG
Sbjct: 162 SCPATGSSACLFNQSYGGDSSLTATLVQDAI--------TLANDVIPGFTFGCINAVSGG 213
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDKDDS----GRIFFGDQG-PA 267
+ P GL+GLG G IS L+++AG + + FS C S G + G G P
Sbjct: 214 ---SIPPQGLLGLGRGPIS---LISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPK 267
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLP 317
+ ++T L + + Y + + +G + T I+DSG+ T
Sbjct: 268 SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFV 327
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNNS 370
+ VY I EF +QVN I+S + C+ ++++ + P++ L F P NS
Sbjct: 328 QPVYFAIRDEFRKQVNGPISSLGAF--DTCFAATNEA--EAPAITLHFEGLNLVLPMENS 383
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ ++ G+ A V+ + I R++FD N +LG + C
Sbjct: 384 LIHSS-----SGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELCN 438
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 99/451 (21%), Positives = 177/451 (39%), Gaps = 91/451 (20%)
Query: 30 KLIHRFS--------EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
KL HR+S E LG+SK+ + Q L+ + ++ + G
Sbjct: 25 KLQHRYSGLEGSSKQNEKLGLGMSKH-------------HLQHLVEHNDRRGRFLQG--- 68
Query: 82 QMLFPSQGSKT--------MSLGN---------DFGCDLLWIPCD-CVRCA-------PL 116
+ FP +G+ + + LGN D G D+LW+ C C C PL
Sbjct: 69 -ISFPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPL 127
Query: 117 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 176
S ++ T + CS C Y + Y ++TS
Sbjct: 128 SIYNLSASSTSSVSSCSDPLCTGEQAVCSR---------SGSNSACAYGISYQDKSTSIG 178
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
+ +D+ +++ GG N+ + + GC + +G + DG++G G +VP+
Sbjct: 179 AYVKDDMHYVLQGG-----NATTSHIFFGCAINITGSW----PADGIMGFGQISKTVPNQ 229
Query: 237 LAKAGLIRNSFSMCF--DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 294
+A + FS C +K G + FG++ T+ + L + + Y + + + +
Sbjct: 230 IATQRNMSRVFSHCLGGEKHGGGILEFGEEPNTTEMVFTPLLNVTTH--YNVDLLSISVN 287
Query: 295 SSCL----KQTSFKA--------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT-SFEG 341
S L K+ S+ + I+DSG+SF L + + +E + EG
Sbjct: 288 SKVLPIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEG 347
Query: 342 YPWKCCY-KSSSQRLPKLPSVKLMFPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDG 398
+C Y KS P+V L F ++ + +N + ++ + G+C A DG
Sbjct: 348 L--QCFYLKSGLTVETSFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSADG 405
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G+ + V +D EN ++GW NC
Sbjct: 406 -LTIFGEIVLKDKLVFYDVENRRIGWKGQNC 435
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 141/367 (38%), Gaps = 53/367 (14%)
Query: 95 LGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LG 152
L D DL W+ C C RC P S ++ P S++ ++ C LG
Sbjct: 149 LALDTASDLTWLQCQPCRRCYPQSGPVFD----------PRHSTSYGEMNYDAPDCQALG 198
Query: 153 TSC--QNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
S + C YT+ Y + ++S G LVE+ L G QA + IGCG
Sbjct: 199 RSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG-------VRQAYLSIGCG 251
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG------RIFF 261
G L G G++GLG G+IS+P +A G SFS C SG + F
Sbjct: 252 HDNKG--LFGAPAAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDFISGPGSPSSTLTF 308
Query: 262 G----DQGPATQQSTSFLASNGKYITYI--IGVETCCIGSSCLKQTSFK---------AI 306
G D P + + L N Y+ IGV + + + + I
Sbjct: 309 GAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVI 368
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYKSSSQRLPKLPSVKL 363
+DSG++ T L + Y F G P + CY + K+P+V +
Sbjct: 369 LDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSM 428
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKL 422
F + ++I T C A D + IG G+RVV+D ++
Sbjct: 429 HFAGGVEVSLQPKNYLIPVDSRGT-VCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQRV 487
Query: 423 GWSHSNC 429
G++ +NC
Sbjct: 488 GFAPNNC 494
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 140/352 (39%), Gaps = 48/352 (13%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS 154
D G L W+ C V C P + Y+ P ASST + CS C +L +
Sbjct: 126 DSGSSLTWLQCAPCAVSCHPQAGPLYD----------PRASSTYAAVPCSAPQCAELQAA 175
Query: 155 CQNPKQ-----PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
NP C Y Y + + S G L +D + L S G GCG
Sbjct: 176 TLNPSSCSGSGVCQYQASY-GDGSFSFGYLSKDTVSLSSSGSFP-------GFYYGCGQD 227
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD---DSGRIFFG---- 262
G L G A GLIGL ++S+ S LA + + NSF+ C +G + FG
Sbjct: 228 NVG--LFGRA-AGLIGLARNKLSLLSQLAPS--VGNSFAYCLPTSAAASAGYLSFGSNSD 282
Query: 263 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKAIVDSGSSFTFLP 317
++ P TS ++S+ Y + + + S L + S I+DSG+ T LP
Sbjct: 283 NKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLPTIIDSGTVITRLP 342
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
VY ++ + + C+K +LP +P+V + F + +
Sbjct: 343 TPVYTALSKAVGAALAAPSAPAYSI-LQTCFKGQVAKLP-VPAVNMAFAGGATLRLTPGN 400
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ + T CLA P D IG + VV+D + ++G++ C
Sbjct: 401 VLVDVNETTT--CLAFAPTD-STAIIGNTQQQTFSVVYDVKGSRIGFAAGGC 449
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 149/369 (40%), Gaps = 53/369 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSAS-YYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ +++ D G DL W V+C P S+ Y D ++PS SST + C R
Sbjct: 164 ARDLTVVFDTGSDLSW-----VQCGPCSSGGCYKQQD---PLFAPSDSSTFSAVRCGARE 215
Query: 149 CDLGTSCQNP--KQPCPYTMDYYTENTSSSGLLVEDILHL---ISGGDNALKNSVQASVI 203
C SC CPY + Y + + + G L D L L +A ++ +
Sbjct: 216 CRARQSCGGSPGDDRCPYEV-VYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFV 274
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS---GRIF 260
GCG +G L G A DGL GLG G++S+ S AG FS C S G +
Sbjct: 275 FGCGENNTG--LFGQA-DGLFGLGRGKVSLSS--QAAGKFGEGFSYCLPSSSSSAPGYLS 329
Query: 261 FGD--QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA----IVDSGSSFT 314
G PA Q T L Y + + + ++ +S + IVDSG+ T
Sbjct: 330 LGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVIT 389
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWK---------CCYK--SSSQRLPKLPSVKL 363
L Y + A F +++ Y +K CY + + +P+V L
Sbjct: 390 RLAPRAYRALRAAF-------LSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVAL 442
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD---IGTIGQNFMTGYRVVFDRENL 420
+F + V+ V+Y +V CLA P +GD G +G VV+D
Sbjct: 443 VFAGGATISVDFS-GVLYVAKVAQA-CLAFAP-NGDGRSAGILGNTQQRTLAVVYDVARQ 499
Query: 421 KLGWSHSNC 429
K+G++ C
Sbjct: 500 KIGFAAKGC 508
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 95/412 (23%), Positives = 156/412 (37%), Gaps = 76/412 (18%)
Query: 87 SQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLN--EYSPSASSTSKHLSC 144
S + +SL D G DL+W PC C Y + L+ + SAS + K +C
Sbjct: 81 SHPPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPAC 140
Query: 145 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS--------------GLLVEDILHLISGG 190
S L +S CP + ++ +S S L D L + +
Sbjct: 141 SAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLYRDSLSMPASS 200
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSM 249
L N GC G P G+ G G G +S+P+ LA + + N FS
Sbjct: 201 PLVLHN-----FTFGCAHTALG------EPVGVAGFGRGVLSLPAQLASFSPHLGNQFSY 249
Query: 250 C-----FDKDDSGR---IFFGDQGPATQQSTSFLASNGKYIT------------YIIGVE 289
C FD D R + G ++ G+++ Y +G+E
Sbjct: 250 CLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLE 309
Query: 290 TCCIGS------SCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+G+ LK+ + +VDSG++FT LP +YE++ EF+ ++
Sbjct: 310 GITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRA 369
Query: 340 EGYPWKC----CYKSSSQRLPKLPSVKLMFPQNNSFVV--NNPVFVIYG------TQVVT 387
+ CY S K+P+V L F N++ ++ NN + + +
Sbjct: 370 TQIEERTGLGPCYYSDDS-AAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKV 428
Query: 388 GFCLAIQPVD-----GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
G + + D G T+G G+ VV+D E ++G++ C L D
Sbjct: 429 GCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCALLWD 480
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 133/346 (38%), Gaps = 34/346 (9%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL----G 152
D G L+W+ C C C P ++ + P SST K+ +C + C L
Sbjct: 107 DTGSSLIWLQCSPCHNCFP----------QETPLFEPLKSSTYKYATCDSQPCTLLQPSQ 156
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
C Q C Y + Y + + S G+L + L G + + I GCG+ +
Sbjct: 157 RDCGKLGQ-CIYGI-MYGDKSFSVGILGTETLSF--GSTGGAQTVSFPNTIFGCGVDNNF 212
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQ 269
G+ GLG G +S+ S L I + FS C +D + ++ FG + T
Sbjct: 213 TIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKLKFGSEAIITT 270
Query: 270 Q---STSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKAIVDSGSSFTFLPKEVYETI 324
ST + Y + +E IG + QT ++DSG+ T+L Y
Sbjct: 271 NGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTYLENTFYNNF 330
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
A + + P K C+ + + +P + F + V P V+
Sbjct: 331 VASLQETLGVKLLQDLPSPLKTCFPNRANL--AIPDIAFQF--TGASVALRPKNVLIPLT 386
Query: 385 VVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
CLA+ P G I G ++V +D E K+ ++ ++C
Sbjct: 387 DSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 139/361 (38%), Gaps = 58/361 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++W+ C CV C Y L Y P SST CS C +C
Sbjct: 117 DTGSDVVWLQCKPCVHC-------YRQLS---PLYDPRGSSTYAQTPCSPPQCRNPQTCD 166
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y + Y + +S+SG L D L+ D ++ N V +GCG G L
Sbjct: 167 GTTGGCGYRI-VYGDASSTSGNLATD--RLVFSNDTSVGN-----VTLGCGHDNEG--LF 216
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR----IFFGDQGPATQQS 271
G A GL+G+ G S + +A + F+ C D+ SG + FG P S
Sbjct: 217 GSAA-GLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRSGSSSSYLVFGRTAPEPPSS 273
Query: 272 T-SFLASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSGSSFTFLP 317
+ L SN + Y ++G + S +VDSG+S T
Sbjct: 274 VFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDSGTSITRFA 333
Query: 318 KEVYETIAAEFD--------RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 369
++ Y + FD R+V I+ F+ CY + P V L F
Sbjct: 334 RDAYGALRDAFDARAAKVGMRKVGRGISVFD-----ACYDLRGVAVADAPGVVLHF-AGG 387
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ V P + + C A++ D + IG +RVVFD EN ++G+ +
Sbjct: 388 ADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERVGFEPNG 447
Query: 429 C 429
C
Sbjct: 448 C 448
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 146/359 (40%), Gaps = 50/359 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSC 155
D G D++W+ +CAP Y S + P S + + C +C C
Sbjct: 140 DTGSDVVWL-----QCAPCRHCYAQSG----RVFDPRRSRSYAAVDCVAPICRRLDSAGC 190
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + + ++G + L G VQ V IGCG G +
Sbjct: 191 DRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGA------RVQ-RVAIGCGHDNEGLF- 241
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR--------IFFGDQGP 266
+A GL+GLG G +S P+ +A++ SFS C D+ S R + FG
Sbjct: 242 --IAASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAV 297
Query: 267 ATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSGS 311
A SF + N + Y +++G + Q+ + I+DSG+
Sbjct: 298 AAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGT 357
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNS 370
S T L + VYE + F S G+ + CY S +R+ K+P+V + S
Sbjct: 358 SVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGAS 417
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++I FC A+ DG + IG G+RVVFD + ++G+ +C
Sbjct: 418 VALPPENYLI-PVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 145/361 (40%), Gaps = 54/361 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G D++W+ C C RC D+ + P AS + + C+ LC D G
Sbjct: 165 DTGSDVVWLQCAPCRRC----------YDQSGQMFDPRASHSYGAVDCAAPLCRRLDSG- 213
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C ++ C Y + Y + + ++G + L SG + V +GCG G
Sbjct: 214 GCDLRRKACLYQV-AYGDGSVTAGDFATETLTFASG-------ARVPRVALGCGHDNEGL 265
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---------DKDDSGRIFFGDQ 264
+ VA GL+GLG G +S PS +++ SFS C S + FG
Sbjct: 266 F---VAAAGLLGLGRGSLSFPSQISR--RFGRSFSYCLVDRTSSSASATSRSSTVTFGSG 320
Query: 265 --GPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK------------AIVDS 309
GP+ S + + N + T Y + + +G + + + IVDS
Sbjct: 321 AVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDS 380
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQN 368
G+S T L + Y + F S G+ + CY S ++ K+P+V + F
Sbjct: 381 GTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGG 440
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ ++I T FC A DG + IG G+RVVFD + +LG+
Sbjct: 441 AEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKG 499
Query: 429 C 429
C
Sbjct: 500 C 500
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 104/444 (23%), Positives = 172/444 (38%), Gaps = 78/444 (17%)
Query: 28 STKLIHRFSEEVKALGVSKNRNATSWPAKKSF--------EYYQVLLS----SDVQKQKM 75
S K+++++ + G K N P+ F + +QV LS S V K+
Sbjct: 70 SLKVVNKYGPCIPVTGAPKTINV---PSTAEFLLQDQLRVKSFQVRLSMNPSSGVFKEMQ 126
Query: 76 KTGPQFQMLFPSQGS-----------KTMSLGNDFGCDLLWIPCD-CVR-CAPLSASYYN 122
T P + P+ G+ K +L D G DL W C+ C+ C P
Sbjct: 127 TTIPA--SIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFP------- 177
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS---GLL 179
++ ++ P+ S++ K++SCS C L P Q C Y S G L
Sbjct: 178 ---QNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSGYTIGFL 234
Query: 180 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 239
+ L + S + KN + GC ++S G +G GL+GLG I++PS
Sbjct: 235 ATETLAIAS--SDVFKN-----FLFGCS-EESRGTFNGTT--GLLGLGRSPIALPSQTTN 284
Query: 240 AGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 297
+N FS C S G + FG + +ST + + G+ T I
Sbjct: 285 K--YKNLFSYCLPASPSSTGHLSFGVEVSQAAKSTPI----SPKLKQLYGLNTVGISVRG 338
Query: 298 ----LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS- 352
+ + + I+DSG++FTFLP Y + + F + + + ++ CY S+
Sbjct: 339 RELPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNI 398
Query: 353 -QRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGF---CLAIQPV--DGDIGTIGQ 405
+P + + F ++ + G + V G CLA D D G
Sbjct: 399 GNGTLTIPGISIFFEGGVEVEID-----VSGIMIPVNGLKEVCLAFADTGSDSDFAIFGN 453
Query: 406 NFMTGYRVVFDRENLKLGWSHSNC 429
Y V++D +G++ C
Sbjct: 454 YQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 146/369 (39%), Gaps = 71/369 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ C Y+ P+ S++ L CS +C+ S
Sbjct: 103 DTGSDLIWTQCAPCLLCVEQPTPYFE----------PAKSTSYASLPCSSAMCNALYSPL 152
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ C Y +Y ++ SS+G+L + G N+ + +V V GCG +G +
Sbjct: 153 CFQNACVY-QAFYGDSASSAGVLANETFTF---GTNSTRVAVP-RVSFGCGNMNAGTLFN 207
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG-----------DQ 264
G G++G G G +S L+++ G R S+ + F + R++FG
Sbjct: 208 G---SGMVGFGRGALS---LVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSS 261
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----------KQTSFKAIVDSGSSF 313
GP QST F+ + Y + + + L + I+DSG++
Sbjct: 262 GPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTV 319
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYK--SSSQRLPKLPSVKLMF--- 365
TFL + Y + F V + P + C+K +R+ LP + L F
Sbjct: 320 TFLAQPAYAMVQGAFVAWVG--LPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGA 377
Query: 366 ----PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
P N V++ GT CLA+ P D D IG + +++D EN
Sbjct: 378 DMELPLENYMVMDG------GTG---NLCLAMLPSD-DGSIIGSFQHQNFHMLYDLENSL 427
Query: 422 LGWSHSNCQ 430
L + + C
Sbjct: 428 LSFVPAPCN 436
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 146/369 (39%), Gaps = 71/369 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ C Y+ P+ S++ L CS +C+ S
Sbjct: 106 DTGSDLIWTQCAPCLLCVEQPTPYFE----------PAKSTSYASLPCSSAMCNALYSPL 155
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ C Y +Y ++ SS+G+L + G N+ + +V V GCG +G +
Sbjct: 156 CFQNACVY-QAFYGDSASSAGVLANETFTF---GTNSTRVAVP-RVSFGCGNMNAGTLFN 210
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG-----------DQ 264
G G++G G G +S L+++ G R S+ + F + R++FG
Sbjct: 211 G---SGMVGFGRGALS---LVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSS 264
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----------KQTSFKAIVDSGSSF 313
GP QST F+ + Y + + + L + I+DSG++
Sbjct: 265 GPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTV 322
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYK--SSSQRLPKLPSVKLMF--- 365
TFL + Y + F V + P + C+K +R+ LP + L F
Sbjct: 323 TFLAQPAYAMVQGAFVAWVG--LPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGA 380
Query: 366 ----PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
P N V++ GT CLA+ P D D IG + +++D EN
Sbjct: 381 DMELPLENYMVMDG------GTG---NLCLAMLPSD-DGSIIGSFQHQNFHMLYDLENSL 430
Query: 422 LGWSHSNCQ 430
L + + C
Sbjct: 431 LSFVPAPCN 439
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 147/345 (42%), Gaps = 40/345 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G D+ W+ +C P + Y+ + + PS+SS+ + LSC C+ +
Sbjct: 169 DTGSDVNWL-----QCTPCADCYHQTEPI----FEPSSSSSYEPLSCDTPQCNALEVSEC 219
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
C Y + Y + + + G + L + G ++N V +GCG G +
Sbjct: 220 RNATCLYEVSY-GDGSYTVGDFATETLTI---GSTLVQN-----VAVGCGHSNEGLF--- 267
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQSTSF 274
V GL+GLG G +++PS L SFS C D D + + FG P
Sbjct: 268 VGAAGLLGLGGGLLALPSQLNTT-----SFSYCLVDRDSDSASTVEFGTSLPPDAVVAPL 322
Query: 275 LASNGKYITYIIGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVYETI 324
L ++ Y +G+ +G L+ Q+SF+ I+DSG++ T L +Y ++
Sbjct: 323 LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGIYNSL 382
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
F + +D + + CY S++ ++P+V FP + ++I
Sbjct: 383 RDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKNYMIPVDS 442
Query: 385 VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V T FCLA P + IG G RV FD N +G+S + C
Sbjct: 443 VGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 145/341 (42%), Gaps = 49/341 (14%)
Query: 131 YSPSASSTSKHLSCSHRLC-------DLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVE 181
+ P+AS + + + C +LC G+S C N C Y++ Y ++ +S+G +
Sbjct: 136 FDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSY-GDSRNSTGDFSQ 194
Query: 182 DILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
D++ L S N+ +VQ V GC G +D + G++G G +S+PS L K
Sbjct: 195 DVIFLNS--TNSSGQAVQFRDVAFGCAHSPQGFLVD-LGSLGIVGFNRGNLSLPSQL-KD 250
Query: 241 GLIRNSFSMCFDKD-----DSGRIFFGDQGPATQQS--TSFL---ASNGKYITYIIGVET 290
L + FS CF +G IF GD G + + T L + + Y +G+ +
Sbjct: 251 RLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTS 310
Query: 291 CCIGSSCLK--QTSFK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+ L +++FK ++DSG++FT + + Y F +
Sbjct: 311 ISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKK 370
Query: 340 EGYP--WKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVN-NPVFV---IYGTQVVTGFCLA 392
G + CY S+ LP +P V+L N + +FV G +V CLA
Sbjct: 371 VGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTV--CLA 428
Query: 393 IQPVD----GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I G I +G + Y V +D E ++G+ ++C
Sbjct: 429 ILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 146/370 (39%), Gaps = 72/370 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS- 154
D G DL+W C CV C ++ + PS+SST L CS LC DL TS
Sbjct: 136 DTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYSTLPCSSSLCSDLPTST 185
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-G 213
C + + C YT Y + +S+ G+L + L + V GCG G G
Sbjct: 186 CTSAAKDCGYTYTY-GDASSTQGVLAAETF--------TLAKTKLPGVAFGCGDTNEGDG 236
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DDSGR--IFFGD------- 263
+ G GL+GLG G +S L+++ GL FS C DD+ + + G
Sbjct: 237 FTQGA---GLVGLGRGPLS---LVSQLGL--GKFSYCLTSLDDTSKSPLLLGSLAAISTD 288
Query: 264 -QGPATQQSTSFLASNGKYITYIIGVETCCIGSSC--LKQTSFKA--------IVDSGSS 312
A Q+T + + + Y + ++ +GS+ L ++F IVDSG+S
Sbjct: 289 TASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTS 348
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ-----RLPKLP-----SVK 362
T+L + Y + F Q+ + C+K+ + +PKL
Sbjct: 349 ITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGAD 408
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
L P N V+++ CL + G + IG + V+D + L
Sbjct: 409 LDLPAENYMVLDS---------ASGALCLTVMGSRG-LSIIGNFQQQNIQFVYDVDKDTL 458
Query: 423 GWSHSNCQDL 432
++ C L
Sbjct: 459 SFAPVQCAKL 468
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/382 (22%), Positives = 147/382 (38%), Gaps = 65/382 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 144
+K L D G +L W+ C C C P Y Y+P+ + C
Sbjct: 48 AKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPY---------YTPADGKLK--VVC 96
Query: 145 SHRLC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 196
LC + +N C Y + Y T S G L DI+ ++G D
Sbjct: 97 GSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIIS-VNGRD----- 148
Query: 197 SVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKD 254
+ + GCG KQ +P +G++GLG+G+ + L +I+ N C
Sbjct: 149 --KKRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHCLSSK 206
Query: 255 DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-QTSFKAIVDSGSSF 313
G ++ GD P T+ T + Y G+ I ++ +F+A+ DSGS++
Sbjct: 207 GKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265
Query: 314 TFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYKSSS--------QRLPKLPSVKLM 364
T +P ++Y I ++ ++ ++ +G C+K + K S+K+
Sbjct: 266 THVPAQIYNEIVSKVRGTFSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALSLKIT 325
Query: 365 F----------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--TIGQNFMTGYR 412
PQN FV + G + ++ PV ++ IG M
Sbjct: 326 HARGTNNLDIPPQNYLFVKED------GETCLAILDASLDPVLKELNFILIGAVTMQDLF 379
Query: 413 VVFDRENLKLGWSHSNCQDLND 434
V++D E +LGW + C + +
Sbjct: 380 VIYDNEKKQLGWVRAQCDRVQE 401
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 79/171 (46%), Gaps = 16/171 (9%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--- 154
D G D+LW+ +C+RC + L +L +Y P+ S T+ + C C ++
Sbjct: 102 DTGSDILWV--NCIRCD--GCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGV 155
Query: 155 ---CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C + PC + + Y + ++++G V D + N + AS+ GCG Q
Sbjct: 156 PPTCPSTSSPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQL 213
Query: 212 GGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 260
GG L A DG++G G + S+ S LA A +R F+ C D G IF
Sbjct: 214 GGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIF 264
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 150/358 (41%), Gaps = 45/358 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC- 149
K +SL D G DL W +C P + YN D + PS S+T ++SCS C
Sbjct: 142 KYLSLIFDTGSDLTW-----TQCQPCARYCYNQKDP---VFVPSQSTTYSNISCSSPDCS 193
Query: 150 --DLGTSCQ---NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
+ GT Q + + C Y + Y + + S G ++ L L S V + +
Sbjct: 194 QLESGTGNQPGCSAARACIYGIQY-GDQSFSVGYFAKETLTLTS-------TDVIENFLF 245
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCFDKDDSGR---IF 260
GCG G L G A GLIGLG +IS+ A K G + FS C K S F
Sbjct: 246 GCGQNNRG--LFGSAA-GLIGLGQDKISIVKQTAQKYGQV---FSYCLPKTSSSTGYLTF 299
Query: 261 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIG------SSCLKQTSFKAIVDSGSSFT 314
G G + T ++G Y + + +G SS + TS AI+DSG+ T
Sbjct: 300 GGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTS-GAIIDSGTVIT 358
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
LP + Y + + F++ + + E CY S ++P V +F ++
Sbjct: 359 RLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEELDLD 418
Query: 375 NPVFVIYG---TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++YG +QV F P + IG +VV+D K+G+ ++ C
Sbjct: 419 G-IGIMYGASTSQVCLAFAGNQDP--STVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 143/375 (38%), Gaps = 66/375 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K +L D G DL W+ CD CV C P ++C+ +
Sbjct: 79 KPYTLDIDSGSDLTWLQCDAPCVSCT--------------KAPHPPYKPNKGPITCNDPM 124
Query: 149 CDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-- 201
C C+ + C Y + Y ++ SS G+LV DI L L N A+
Sbjct: 125 CSALHWPSKPPCKASHEQCDYEVSY-ADHGSSLGVLVHDIFSL------QLTNGTLAAPR 177
Query: 202 VIIGCGMKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
+ GCG QS Y AP DG++GLG G+ S+ + L GLIR+ C G
Sbjct: 178 LAFGCGYDQS--YPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGF 235
Query: 259 IFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+F GD T + ++ Y +G + + DSGSS+T+
Sbjct: 236 LFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFN 295
Query: 318 KEVYETIAAEFDRQVNDTI--TSFEGYP--W------------KCCYKSSSQRLPKLPSV 361
+ Y+T + + +N + T+ E P W K +K + K S
Sbjct: 296 AQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFTKAKSA 355
Query: 362 KLMFPQNNSFVV----NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDR 417
+L P + ++ N + ++ G++V GD IG V++D
Sbjct: 356 QLQLPPESYLIISKHGNACLGILNGSEVGL----------GDSNVIGDIAFQDKMVIYDN 405
Query: 418 ENLKLGWSHSNCQDL 432
E ++GW +C L
Sbjct: 406 ERQQIGWVPKDCNKL 420
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 147/361 (40%), Gaps = 48/361 (13%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+K L D G D+ WI +C+P + Y ++ + P ASS+ + LSCS C
Sbjct: 24 TKLQYLVMDTGSDVPWI-----QCSPCKSCY----KQNDAVFDPRASSSFRRLSCSTPQC 74
Query: 150 DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
L +C + C Y + Y + + + G L D L+S G + V+ GCG
Sbjct: 75 KLLDVKACASTDNRCLYQVS-YGDGSFTVGDLASDSF-LVSRGRT-------SPVVFGCG 125
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG-----RIFFG 262
G + V GL+GLG G++S PS L+ FS C D+G + FG
Sbjct: 126 HDNEGLF---VGAAGLLGLGAGKLSFPSQLSS-----RKFSYCLVSRDNGVRASSALLFG 177
Query: 263 DQGPATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLK--QTSFK---------AIVD 308
D T S ++ L N K T Y G+ IG + L T+FK I+D
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
SG+S T LP Y + F + + + CY S+ +P+V F +
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHF-EG 296
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ V P + FC A D+ IG RV D ++ ++G++
Sbjct: 297 GASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQ 356
Query: 429 C 429
C
Sbjct: 357 C 357
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 146/350 (41%), Gaps = 41/350 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQ 156
D G DL W CV C N + + P S+T +++SC +LC L T
Sbjct: 90 DTGSDLTWT--SCVPCN-------NCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVC 140
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALKNSVQASVIIGCGMKQSGGY 214
+P++ C YT Y + + G+L ++ + L S G LK ++ GCG +GG+
Sbjct: 141 SPQKRCNYTYAYASAAITR-GVLAQETITLSSTKGKSVPLKG-----IVFGCGHNNTGGF 194
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQ 269
D G+IGLG G +S+ S + + FS C D S ++ FG +
Sbjct: 195 NDHEM--GIIGLGGGPVSLISQMGSS-FGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSG 251
Query: 270 Q---STSFLASNGK---YITYI-IGVETCCIGSSCLKQTSFKA--IVDSGSSFTFLPKEV 320
+ ST +A K ++T + I VE + + Q K +DSG+ T LP ++
Sbjct: 252 KGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPPTILPTQL 311
Query: 321 YETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y+ + A+ +V +T + CY++ + + P + F + + F+
Sbjct: 312 YDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNL--RGPVLTAHFEGADVKLSPTQTFI 369
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V FCL D G G + Y + FD + + + +C
Sbjct: 370 SPKDGV---FCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDC 416
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 131/337 (38%), Gaps = 41/337 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D D W+ C C++C D+ + + PS SS+ LSC + C+L +S
Sbjct: 205 DLQTDFTWLQCQPCIKC----------YDQPDSIFDPSQSSSYTLLSCETKHCNLLPNSS 254
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C + C Y + Y + T++ G+L+ + + S G V +GC K G +
Sbjct: 255 CSDDGY-CRYNITY-KDGTNTEGVLINETVSFESSG-------WVDRVSLGCSNKNQGPF 305
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD-SGRIFFGDQGPATQQSTS 273
V DG GLG G +S PS + + + S+ + KD S + P + +
Sbjct: 306 ---VGSDGTFGLGRGSLSFPSRINASSM---SYCLVESKDGYSSSTLEFNSPPCSGSVKA 359
Query: 274 FLASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLPKEVYE 322
L N K Y +G++ +G + ++F IV S S T L + Y
Sbjct: 360 KLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITMLENDTYN 419
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
+ F + + CY SS +LP ++ S+++ + +Y
Sbjct: 420 VVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVNDGKSWLLPKESY-LYA 478
Query: 383 TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
FC A P G +G G RV FD N
Sbjct: 479 VDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVN 515
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 108/440 (24%), Positives = 181/440 (41%), Gaps = 88/440 (20%)
Query: 22 AETVMFSTKLIHRFSEEVKALGVSKNRNATSW--PAKKSF---EYYQVLLSS----DVQK 72
A F+T+L+HR S + L S+ + W ++S ++Q ++ +V+
Sbjct: 26 AHNAGFTTELVHRDSPK-SPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVES 84
Query: 73 QKMKTGPQFQMLFPSQGSKTMSLGN---------DFGCDLLWIPCD-CVRC----APLSA 118
+ + G ++ M ++SLG D G DL+W C C +C APL
Sbjct: 85 EIIANGGEYLM--------SLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPL-- 134
Query: 119 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG--TSCQNPKQPCPYTMDYYTENTSS 175
+ P +S T + LSC R C +LG +SC + +Q C Y+ YY + + +
Sbjct: 135 ------------FDPKSSKTYRDLSCDTRQCQNLGESSSCSS-EQLCQYSY-YYGDRSFT 180
Query: 176 SGLLVEDILHLIS--GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
+G L D + L S GG +V IGCG + +G + G+IGLG G +S+
Sbjct: 181 NGNLAVDTVTLPSTNGGPVYFPKTV-----IGCGRRNNGTF--DKKDSGIIGLGGGPMSL 233
Query: 234 PSLLAKAGLIRNSFSMC---FDKDDSG---RIFFGDQGPATQ---QSTSFLASNGKYITY 284
S + + + FS C F + +G ++ FG + QST ++ N Y
Sbjct: 234 ISQMGSS--VGGKFSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYY 291
Query: 285 IIGVETCCIGSSCLKQTSF-------KAIVDSGSSFTFLPKEVYETIAAEFDRQV-NDTI 336
+ +E +G ++ I+DSG+S T P + A + V N
Sbjct: 292 LT-LEAMSVGDKKIEFGGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGER 350
Query: 337 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
T CY+ + K+P + F + + F++ V+ CLA
Sbjct: 351 TQDASGLLSHCYRPTPDL--KVPVITAHFNGADVVLQTLNTFILISDDVL---CLAFNST 405
Query: 397 DGD--IGTIGQ-NFMTGYRV 413
G + Q NF+ GY +
Sbjct: 406 QSGAIFGNVAQMNFLIGYDI 425
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 138/358 (38%), Gaps = 41/358 (11%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++ +L D G D+ WI +C P S Y D + P+ S+T + C H C
Sbjct: 171 AQNYTLSIDTGSDVSWI-----QCLPCSGHCYKQHD---PVFDPTKSATYSAVPCGHPQC 222
Query: 150 -DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
G C N C Y + Y + +S++G+L + L L S D GCG
Sbjct: 223 AAAGGKCSNSGT-CLYKVT-YGDGSSTAGVLSHETLSLSSTRD-------LPGFAFGCGQ 273
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGP 266
G + L+GLG G +S+PS A +FS C D+ G + G P
Sbjct: 274 TNLGEFGGVDG---LVGLGRGALSLPS--QAAATFGATFSYCLPSYDTTHGYLTMGSTTP 328
Query: 267 ATQ------QSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTF 315
A Q T+ + Y + V + IG L T + DSG+ T+
Sbjct: 329 AASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLFDSGTILTY 388
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
LP E Y ++ F + + P+ CY + +P+V F F ++
Sbjct: 389 LPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDGAVFDLSP 448
Query: 376 PVFVIY--GTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+IY T TG CLA +P IG G V++D K+G+ C
Sbjct: 449 VAILIYPDDTAPATG-CLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 143/349 (40%), Gaps = 37/349 (10%)
Query: 93 MSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD- 150
M L D G D+ WI CD C +C Y D + + P+ S+T K L C+ +C
Sbjct: 1 MFLLIDTGSDITWIQCDPCPQC-------YKQQD---SLFQPAGSATYKPLPCNSTMCQQ 50
Query: 151 ---LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
SC N C Y + Y ++T+ +E L D+ + SV + GCG
Sbjct: 51 LQSFSHSCLNSS--CNYMVSYGDKSTTRGDFALET---LTLRSDDTILVSV-PNFAFGCG 104
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----GRIFFGD 263
+ G +G A GL+GLG I P+ + A FS C S G + FG+
Sbjct: 105 -HANKGLFNGAA--GLMGLGKSSIGFPAQTSVA--FGKVFSYCLPSVSSTIPSGILHFGE 159
Query: 264 QGPATQ--QSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVY 321
+ T + S+ Y + + +G L S +VDSG+ + + Y
Sbjct: 160 AAMLDYDVRFTPLVDSSSGPSQYFVSMTGINVGDELLP-ISATVMVDSGTVISRFEQSAY 218
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
E + F + + T+ P+ C++ S+ +P + L F ++++ + +PV ++Y
Sbjct: 219 ERLRDAFTQILPGLQTAVSVAPFDTCFRVSTVDDINIPLITLHF-RDDAELRLSPVHILY 277
Query: 382 GTQVVTG-FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V G C A P +G R V+D +LG S C
Sbjct: 278 --PVDDGVMCFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 143/375 (38%), Gaps = 66/375 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K +L D G DL W+ CD CV C P ++C+ +
Sbjct: 46 KPYTLDIDSGSDLTWLQCDAPCVSCT--------------KAPHPPYKPNKGPITCNDPM 91
Query: 149 CDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-- 201
C C+ + C Y + Y ++ SS G+LV DI L L N A+
Sbjct: 92 CSALHWPSKPPCKASHEQCDYEVSY-ADHGSSLGVLVHDIFSL------QLTNGTLAAPR 144
Query: 202 VIIGCGMKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 258
+ GCG QS Y AP DG++GLG G+ S+ + L GLIR+ C G
Sbjct: 145 LAFGCGYDQS--YPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGF 202
Query: 259 IFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+F GD T + ++ Y +G + + DSGSS+T+
Sbjct: 203 LFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFN 262
Query: 318 KEVYETIAAEFDRQVNDTI--TSFEGYP--W------------KCCYKSSSQRLPKLPSV 361
+ Y+T + + +N + T+ E P W K +K + K S
Sbjct: 263 AQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFTKAKSA 322
Query: 362 KLMFPQNNSFVV----NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDR 417
+L P + ++ N + ++ G++V GD IG V++D
Sbjct: 323 QLQLPPESYLIISKHGNACLGILNGSEVGL----------GDSNVIGDIAFQDKMVIYDN 372
Query: 418 ENLKLGWSHSNCQDL 432
E ++GW +C L
Sbjct: 373 ERQQIGWVPKDCNKL 387
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 154/395 (38%), Gaps = 60/395 (15%)
Query: 63 QVLLS-SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASY 120
Q+L + S V + ++ T PQ T+ + D D W+PC C+ CAP ++S
Sbjct: 93 QILRTPSYVARARLGTPPQ-----------TLLVAIDPSNDAAWVPCSACLGCAPGASS- 140
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLC----DLGTSC-QNPKQPCPYTMDYYTENTSS 175
+ P+ SST + + C C SC P C + + Y + +
Sbjct: 141 --------PSFDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA 192
Query: 176 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 235
+L +D L L A+ + GC ++ G V P GL+G G G +S
Sbjct: 193 --VLGQDALSLSDSNGAAVPDD---HYTFGC-LRVVTGSGGSVPPQGLVGFGRGPLS--- 243
Query: 236 LLAKAGLIRNS-FSMCF----DKDDSGRIFFGDQGPATQQSTSFLASNGK----YITYII 286
L++ S FS C + SG + G G + T+ L SN Y ++
Sbjct: 244 FLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMV 303
Query: 287 GV----ETCCIGSSCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 338
GV + I +S L + IVD+G+ FT L Y + F R V+
Sbjct: 304 GVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAP 363
Query: 339 FEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV-VTGFCLAIQPVD 397
G C Y + ++ +P+V +F + VI T V +A P D
Sbjct: 364 ALGGFDTCYYVNGTK---SVPAVAFVFAGGARVTLPEENVVISSTSGGVACLAMAAGPSD 420
Query: 398 G---DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G + + +RVVFD N ++G+S C
Sbjct: 421 GVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELC 455
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 138/359 (38%), Gaps = 43/359 (11%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ +SL D G DL W +C P + S Y D + PS SS+ +++C+ LC
Sbjct: 147 RDLSLVFDTGSDLTW-----TQCEPCAGSCYKQQDA---IFDPSKSSSYINITCTSSLCT 198
Query: 151 LGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
TS C + C Y + Y + ++S G L ++ L + + + +
Sbjct: 199 QLTSAGIKSRCSSSTTACIYGIQY-GDKSTSVGFLSQERLTITA-------TDIVDDFLF 250
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFG 262
GCG G G A GLIGLG IS + + + FS C S G + FG
Sbjct: 251 GCGQDNEG-LFSGSA--GLIGLGRHPISF--VQQTSSIYNKIFSYCLPSTSSSLGHLTFG 305
Query: 263 DQGPATQQSTSFL------ASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSF 313
AT + + N Y I+G+ + ++F A I+DSG+
Sbjct: 306 -ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVI 364
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T L Y + + F + + + E + CY S + +P + F V
Sbjct: 365 TRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFA--GGVTV 422
Query: 374 NNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
P+ I + CLA D DI G VV+D E ++G+ + C
Sbjct: 423 ELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGCN 481
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 149/381 (39%), Gaps = 64/381 (16%)
Query: 68 SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDR 126
S + K K+ T PQ + M+L N + D WIPC CV C S++ +N++
Sbjct: 34 SYIVKAKVGTPPQTLL---------MALDNSY--DAAWIPCKGCVGC---SSTVFNTVK- 78
Query: 127 DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 186
S+T K L C C Q P C + + SS IL
Sbjct: 79 ---------STTFKTLGCGAPQCK-----QVPNPICGGSTCTWNTTYGSS-----TILSN 119
Query: 187 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 246
++ AL GC K +G V P GL+G G G +S L L +++
Sbjct: 120 LTRDTIALSMDPVPYYAFGCIQKATG---SSVPPQGLLGFGRGPLSF--LSQTQNLYKST 174
Query: 247 FSMCFDK----DDSGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 299
FS C + SG + G G P ++T L + + Y + + +G +
Sbjct: 175 FSYCLPSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIP 234
Query: 300 --------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKS 350
T I DSG+ FT L Y + EF ++V N T++S G+ CY
Sbjct: 235 RSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGGF--DTCY-- 290
Query: 351 SSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFM 408
S +P P++ MF N + + + V + +A P V+ + I
Sbjct: 291 SVPIVP--PTITFMFSGMNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQ 348
Query: 409 TGYRVVFDRENLKLGWSHSNC 429
+R++FD N +LG + C
Sbjct: 349 QNHRILFDVPNSRLGVAREQC 369
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 163/398 (40%), Gaps = 67/398 (16%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPS 134
+ +G F +F K SL D G DL WI CV C Y +++ Y P
Sbjct: 176 LGSGEYFIDVFVGTPPKHFSLILDTGSDLNWI--QCVPC-------YECFEQNGPHYDPG 226
Query: 135 ASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-- 186
SS+ +++ C C L +S C+ Q CPY +Y ++++++G + +
Sbjct: 227 QSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYY-WYGDSSNTTGDFALETFTVNL 285
Query: 187 -ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 245
+S G L+ +V+ GCG G + L+GLG G +S S L L +
Sbjct: 286 TMSSGKPELRRV--ENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGH 338
Query: 246 SFSMCF-----DKDDSGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIG 294
SFS C D + S ++ FG+ T+ +A + Y + +++ +G
Sbjct: 339 SFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVG 398
Query: 295 SSCLKQTSFK----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP- 343
+ K I+DSG++ ++ + Y+ I F +V +GYP
Sbjct: 399 GEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKV-------KGYPV 451
Query: 344 ------WKCCYKSSSQRLPKLPSVKLMFPQNN--SFVVNNPVFVIYGTQVVTGFCLAIQP 395
+ CY + P LP ++F +F V N I +VV CLAI
Sbjct: 452 VKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVV---CLAILG 508
Query: 396 V-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ IG + +++D + +LG++ + C D+
Sbjct: 509 TPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCADV 546
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 97/400 (24%), Positives = 169/400 (42%), Gaps = 65/400 (16%)
Query: 63 QVLLSSDVQKQKMKTGPQFQML----FPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSA 118
Q+ SS+ Q + +G +FQ L GS+ MS+ D G DL W+ C+ R
Sbjct: 100 QIADSSETQV-PLTSGIKFQTLNYIVTMGLGSQNMSVIVDTGSDLTWVQCEPCR------ 152
Query: 119 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTSCQNP--KQPCPYTMDYYTENT 173
S YN ++ + PS S + + + C+ C +LG +P C Y ++Y +
Sbjct: 153 SCYN---QNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSY 209
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
+S L +E L GG + ++ + GCG + + G G + GL+GLG E+S+
Sbjct: 210 TSGELGIE---KLGFGGISV------SNFVFGCG-RNNKGLFGGAS--GLMGLGRSELSM 257
Query: 234 PSLLAKAGLIRNSFSMCFDKDD----SGRIFFGDQGPATQQSTSF--------LASNGKY 281
S FS C D SG + G+Q + T L + Y
Sbjct: 258 IS--QTNATFGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFY 315
Query: 282 ITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 338
I + G++ + S ++ +SF I+DSG+ + L VY+ + A+F Q
Sbjct: 316 ILNLTGIDVGGV-SLHVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQ------- 367
Query: 339 FEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCL 391
F G+P C+ + +P++ + F N V+ + + CL
Sbjct: 368 FSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCL 427
Query: 392 AIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
A+ + + ++G IG RV++D + ++G++ C
Sbjct: 428 ALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEPC 467
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 152/382 (39%), Gaps = 67/382 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCA--PLSASYYNSLDRDLNEYSP--------SASSTS 139
+++ L D G DL+W+ C C C+ P S+++ L R + +SP +
Sbjct: 99 QSLLLVADTGSDLVWVKCSACRNCSHHPPSSAF---LPRHSSSFSPFHCFDPHCRLLPHA 155
Query: 140 KHLSCSH-RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKN 196
H C+H RL PC + Y + + SSG ++ L +SG + LK
Sbjct: 156 PHHLCNHTRL----------HSPCRFLYSY-ADGSLSSGFFSKETTTLKSLSGSEIHLKG 204
Query: 197 SVQASVIIGCGMKQSGGYLDGV---APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 251
+ GCG + SG + G G++GLG G IS S L + N FS C
Sbjct: 205 -----LSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR--FGNKFSYCLMD 257
Query: 252 ---DKDDSGRIFFGDQ------GPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL--- 298
+ + G AT+ S + L N T Y I + + I L
Sbjct: 258 YTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPIN 317
Query: 299 -------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYK 349
+Q + +VDSG++ T+L K YE + R+V + G+ C
Sbjct: 318 PAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDL-CVNA 376
Query: 350 SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--TIGQNF 407
S R P LP ++ F + + + V CLAI+ V+ G IG
Sbjct: 377 SGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGV--MCLAIRAVESGNGFSVIGNLM 434
Query: 408 MTGYRVVFDRENLKLGWSHSNC 429
G+ + FD+E +LG++ C
Sbjct: 435 QQGFLLEFDKEESRLGFTRRGC 456
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 90/397 (22%), Positives = 163/397 (41%), Gaps = 65/397 (16%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F + K SL D G DL WI C C C + ++Y+ P
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYD----------P 214
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHL 186
AS++ K+++C+ + C+L +S C++ Q CPY Y + ++ VE ++L
Sbjct: 215 KASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNL 274
Query: 187 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 246
+ G ++ +V+ +++ GCG G + L+GLG G +S S L L +S
Sbjct: 275 TTNGGSSELYNVE-NMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHS 328
Query: 247 FSMCF-----DKDDSGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIGS 295
FS C D + S ++ FG+ TSF+A + Y + +++ +
Sbjct: 329 FSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAG 388
Query: 296 SCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-W 344
L + I+DSG++ ++ + YE I + + + +P
Sbjct: 389 EVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPIL 448
Query: 345 KCCYKSSSQRLPKLPSVKLM--------FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
C+ S +LP + + FP NSF+ N V CLA+
Sbjct: 449 DPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLV----------CLAMLGT 498
Query: 397 -DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
IG + +++D + +LG++ + C D+
Sbjct: 499 PKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCADI 535
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 143/356 (40%), Gaps = 44/356 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC- 149
K SL D G DL W +C P S + D ++ P+ S++ K+LSCS C
Sbjct: 143 KDFSLLFDTGSDLTW-----TQCEPCSGGCFPQNDE---KFDPTKSTSYKNLSCSSEPCK 194
Query: 150 ----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
+ C + C Y + Y T T G L + L + + V + +IG
Sbjct: 195 SIGKESAQGCSS-SNSCLYGVKYGTGYTV--GFLATETLTIT-------PSDVFENFVIG 244
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGD 263
CG +++GG G A GL+GLG +++PS + +N FS C S G + FG
Sbjct: 245 CG-ERNGGRFSGTA--GLLGLGRSPVALPSQTSST--YKNLFSYCLPASSSSTGHLSFGG 299
Query: 264 QGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL--KQTSFKA---IVDSGSSFTFLP 317
Q+ F K Y + V +G L + F+ I+DSG++ T+LP
Sbjct: 300 ---GVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLP 356
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK--LPSVKLMFPQNNSFVVNN 375
+ +++ F + + + + CY S +P + + F +++
Sbjct: 357 STAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDD 416
Query: 376 PVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I + CLA + D D+ G Y VV+D +G++ C
Sbjct: 417 SGIFI-AANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/391 (23%), Positives = 153/391 (39%), Gaps = 78/391 (19%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+T D G L+W PC C C ++ N + + P SS+SK + C +
Sbjct: 94 QTTKFVMDTGSSLVWFPCTSRYLCSEC-----NFPNIKKTGIPTFLPKLSSSSKLIGCKN 148
Query: 147 RLCDL-------------GTSCQNPKQPCP-YTMDYYTENTSSSGLLVEDILHLISGGDN 192
C + ++ QN Q CP Y + Y + S++GLL+ + L D
Sbjct: 149 PRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQY--GSGSTAGLLLSETL------DF 200
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
K ++ ++GC + P+G+ G G S+PS L S FD
Sbjct: 201 PNKKTI-PDFLVGCSI------FSIKQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFD 253
Query: 253 KDDSGRIFFGDQGPATQQSTSFLASNGKYIT---------YIIGVETCCIGSSCLKQTSF 303
+ D G + + + S+ ++ Y + + IG + +K +
Sbjct: 254 DTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVK-VPY 312
Query: 304 K-----------AIVDSGSSFTFLPKEVYETIAAEFDRQ-----VNDTITSFEGYPWKCC 347
K IVDSG++FTF+ VYE +A EF++Q V I + G + C
Sbjct: 313 KFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTG--LRPC 370
Query: 348 YKSSSQRLPKLPSV--------KLMFPQNNSF-VVNNPVFVIYGTQVVTGFCLAIQPVDG 398
Y S ++ +P + K+ P +N F +V++ V + +V+ G
Sbjct: 371 YNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVICL---TIVSDNVAGPGLGGG 427
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G + V FD EN K G+ +C
Sbjct: 428 PAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 150/385 (38%), Gaps = 61/385 (15%)
Query: 83 MLFPSQGSKTMSLGN-----------DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNE 130
++ S+G MS+G D G DL+W C C+ C +D+
Sbjct: 81 LVLASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLC----------VDQPTPF 130
Query: 131 YSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
+ P+ S + L C+ +C+ + C Y +Y ++ +++G+L + G
Sbjct: 131 FDPAQSPSYAKLPCNSPMCNALYYPLCYRNVCVYQY-FYGDSANTAGVLSNETFTF---G 186
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
N + +V + GCG +G +G G++G G G +S L+++ G R S+ +
Sbjct: 187 TNDTRVTVP-RIAFGCGNLNAGSLFNG---SGMVGFGRGPLS---LVSQLGSPRFSYCLT 239
Query: 251 -FDKDDSGRIFFGDQGPATQ---------QSTSFLASNGKYITYIIGVETCCIGSSCL-- 298
F R++FG QST F+ + G Y + + +G L
Sbjct: 240 SFMSPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPI 299
Query: 299 ---------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN---DTITSFEGYPWKC 346
+ I+DSGS+ T+L + Y+ + F QV TS C
Sbjct: 300 DPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTC 359
Query: 347 -CYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQ 405
+ +++ +P + F N + +I G CLAI D D IG
Sbjct: 360 FVWPPPPRKIVTMPELAFHFEGANMELPLENYMLIDGD--TGNLCLAIAASD-DGSIIGS 416
Query: 406 NFMTGYRVVFDRENLKLGWSHSNCQ 430
+ V++D EN L ++ + C
Sbjct: 417 FQHQNFHVLYDNENSLLSFTPATCN 441
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 102/447 (22%), Positives = 180/447 (40%), Gaps = 82/447 (18%)
Query: 40 KALGVSKNRNATSWP-AKKSFEYYQVLLSSDVQKQK------------MKTGPQFQMLFP 86
K + KN+N S KK+ E ++S V++Q + +G F +
Sbjct: 102 KRVLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLV 161
Query: 87 SQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
K SL D G DL WI C C C + ++Y+ P AS++ K+++C+
Sbjct: 162 GSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYD----------PKASASYKNITCN 211
Query: 146 HRLCDLGT------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL---ISGGDNALKN 196
C+L + C++ Q CPY +Y ++++++G + + SGG + L N
Sbjct: 212 DPRCNLVSPPDPPKPCKSDNQSCPYYY-WYGDSSNTTGDFAVETFTVNLTTSGGSSELYN 270
Query: 197 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----- 251
+++ GCG G + L+GLG G +S S L L +SFS C
Sbjct: 271 V--ENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNS 323
Query: 252 DKDDSGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIGSSCLK------ 299
D + S ++ FG+ TSF+A + Y + +++ + L
Sbjct: 324 DTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETW 383
Query: 300 ----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQR 354
+ I+DSG++ ++ + YE I + + + +P C+ S
Sbjct: 384 NISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGID 443
Query: 355 LPKLPSVKLM--------FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQ 405
+LP + + FP NSF+ N V CLAI IG
Sbjct: 444 SIQLPELGIAFADGAVWNFPTENSFIWLNEDLV----------CLAILGTPKSAFSIIGN 493
Query: 406 NFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ +++D + +LG++ + C D+
Sbjct: 494 YQQQNFHILYDTKRSRLGYAPTKCADI 520
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 115/426 (26%), Positives = 170/426 (39%), Gaps = 78/426 (18%)
Query: 49 NATSWP--AKKSFEYYQVLLSSDVQKQKMKTGPQFQML-FPSQGSKTMSLGN-------- 97
N++SW +SFE L++ K +GP M P Q T+ GN
Sbjct: 88 NSSSWIDLVSQSFERDNARLNTIRSK---NSGPYTTMSNLPLQSGTTVGTGNYIVTAGFG 144
Query: 98 ----------DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
D G DL WI C C C Y+ +D + P SS+ K L C
Sbjct: 145 TPAKNSLLIIDTGSDLTWIQCKPCADC-------YSQVDA---IFEPKQSSSYKTLPCLS 194
Query: 147 RLC-DLGTSCQNPKQ----PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
C +L TS NP C Y ++Y + +SS G ++ L L G ++ +N
Sbjct: 195 ATCTELITSESNPTPCLLGGCVYEINY-GDGSSSQGDFSQETLTL---GSDSFQN----- 245
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL-LAKAGLIRNSFSMCF-DKDDSGRI 259
GCG +G + GL+GLG +S PS +K G F+ C D S
Sbjct: 246 FAFGCGHTNTGLF---KGSSGLLGLGQNSLSFPSQSKSKYG---GQFAYCLPDFGSSTST 299
Query: 260 FFGDQGPATQQSTSF---LASNGKYIT-YIIGVETCCIGSSCLK-----QTSFKAIVDSG 310
G + +++ L SN Y T Y +G+ +G L IVDSG
Sbjct: 300 GSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSG 359
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 370
+ T L + Y + F + D ++ CY S ++P++ F QNN+
Sbjct: 360 TVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHF-QNNA 418
Query: 371 FVVNNPVFVIY-----GTQVVTGFCLAIQPVDG--DIGTIGQNFMTGYRVVFDRENLKLG 423
V + V ++ G+QV F A Q +DG IG Q M RV FD ++G
Sbjct: 419 DVAVSDVGILVPVQNGGSQVCLAFASASQ-MDGFNIIGNFQQQRM---RVAFDTGAGRIG 474
Query: 424 WSHSNC 429
++ +C
Sbjct: 475 FASGSC 480
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 145/360 (40%), Gaps = 53/360 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G D++W+ C C RC S ++ P S + + C+ LC D G
Sbjct: 158 DTGSDVVWLQCAPCRRCYEQSGQVFD----------PRRSRSYNAVGCAAPLCRRLDSG- 206
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + C Y + Y + + ++G + L G + A V +GCG G
Sbjct: 207 GCDLRRSACLYQV-AYGDGSVTAGDFATETLTFAGG-------ARVARVALGCGHDNEGL 258
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR-------IFFGDQG 265
+ VA GL+GLG G +S P+ +++ SFS C D+ S + FG
Sbjct: 259 F---VAAAGLLGLGRGSLSFPTQISR--RYGRSFSYCLVDRTSSANTASRSSTVTFGSGA 313
Query: 266 PATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDSG 310
+ ++SF + N + Y +IG+ + + + IVDSG
Sbjct: 314 VGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSG 373
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNN 369
+S T L + Y + F S G+ + CY S +++ K+P+V + F
Sbjct: 374 TSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGA 433
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++I T FC A DG + IG G+RVVFD + ++ ++ C
Sbjct: 434 EAALPPENYLIPVDSKGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 147/358 (41%), Gaps = 57/358 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--S 154
D G DL+W C+ C C+ S PS+SST + C LC + S
Sbjct: 60 DTGSDLVWTKCNPCTDCSTSSIY------------DPSSSSTYSKVLCQSSLCQPPSIFS 107
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C N C Y Y + +S+SG+L ++ + S +L N + GCG G
Sbjct: 108 CNNDGD-CEYVYPY-GDRSSTSGILSDETFSISS---QSLPN-----ITFGCGHDNQG-- 155
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQG--PAT 268
D V GL+G G G +S+ S L + + N FS C D + +F G+ AT
Sbjct: 156 FDKVG--GLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSPLFIGNTASLEAT 211
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPK 318
++ L + Y + +E +G L S I+DSG++ TFL +
Sbjct: 212 TVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQ 271
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV-VNNPV 377
Y+ + +N + +G C+ P PS+ F + V N +
Sbjct: 272 TAYDAVKEAMVSSIN--LPQADGQ-LDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYL 328
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTI---GQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
F + +V CLA+ P + ++G + G Y++++D EN L ++ + C L
Sbjct: 329 FPDSTSDIV---CLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTACDTL 383
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 92/382 (24%), Positives = 148/382 (38%), Gaps = 69/382 (18%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T+ L D G DL+W V+C+P R+ + SP ++ ++H + +
Sbjct: 97 QTLLLVADTGSDLIW-----VKCSPC---------RNCSHRSPGSAFFARHSTTYSAIHC 142
Query: 151 LGTSCQ-------NP------KQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKN 196
CQ NP PC Y Y ++++++G ++ L L S G N
Sbjct: 143 YSPQCQLVPHPHPNPCNRTRLHSPCRYQYTY-ADSSTTTGFFSKEALTLNTSTGKVKKLN 201
Query: 197 SVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 251
+ GCG + SG L G + G++GLG IS S L + + FS C
Sbjct: 202 GLS----FGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRR--FGSKFSYCLMD 255
Query: 252 ----DKDDSGRIFFGDQGPATQQS-----TSFLASNGKYITYIIGVETCCIGSSCLKQT- 301
S G Q A + T L + Y I ++ + L
Sbjct: 256 YTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINP 315
Query: 302 ---------SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 352
+ I+DSG++ TF+ + Y I F ++V + + C S
Sbjct: 316 SVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSG 375
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNP---VFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNF 407
P LP ++ F V + P F+ G Q+ CLA+QPV DG +G
Sbjct: 376 VTRPALP--RMSFNLAGGSVFSPPPRNYFIETGDQIK---CLAVQPVSQDGGFSVLGNLM 430
Query: 408 MTGYRVVFDRENLKLGWSHSNC 429
G+ + FDR+ +LG++ C
Sbjct: 431 QQGFLLEFDRDKSRLGFTRRGC 452
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 141/362 (38%), Gaps = 48/362 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K +SL D G DL W +C P S Y + + PSAS T ++SC+ C
Sbjct: 165 KDLSLIFDTGSDLTW-----TQCQPCVKSCY---AQQQPIFDPSASKTYSNISCTSTACS 216
Query: 151 -----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
G S C Y + Y +++ + G +D L L +N V + G
Sbjct: 217 GLKSATGNSPGCSSSNCVYGIQY-GDSSFTVGFFAKDTLTLT-------QNDVFDGFMFG 268
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGD 263
CG G + GLIGLG +S+ A+ FS C + +G + FG+
Sbjct: 269 CGQNNRGLF---GKTAGLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRGSNGHLTFGN 323
Query: 264 -QGPATQQS-------TSFLASNGKYITYIIGVETCCIGSSCLKQTSF-----KAIVDSG 310
G T ++ T F +S G Y I V +G L + I+DSG
Sbjct: 324 GNGVKTSKAVKNGITFTPFASSQGATF-YFIDVLGISVGGKALSISPMLFQNAGTIIDSG 382
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN-N 369
+ T LP VY ++ + F + ++ T+ CY S+ +P + F N N
Sbjct: 383 TVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNAN 442
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
+ N + + G V CLA D IG G VV+D +LG+ +
Sbjct: 443 VDLEPNGILITNGASQV---CLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGFGYK 499
Query: 428 NC 429
C
Sbjct: 500 GC 501
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 102/442 (23%), Positives = 179/442 (40%), Gaps = 63/442 (14%)
Query: 6 LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFE 60
LT+ L + S A + FS +LIHR S + ++N+ +A ++
Sbjct: 7 LTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRSINRANH 66
Query: 61 YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLG-NDFGCDLLWIPCD-CVRCAPLSA 118
+++ +S + + + M + T G D G D++W+ C+ C +C +
Sbjct: 67 FFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTT 126
Query: 119 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSS 176
+N PS SS+ K++ C +LC TSC + + C Y + Y +++ S
Sbjct: 127 PIFN----------PSKSSSYKNIPCLSKLCHSVRDTSCSD-QNSCQYKISY-GDSSHSQ 174
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
G L D L L S + + +IGCG +G + G A G++GLG G +S+ +
Sbjct: 175 GDLSVDTLSLESTSGSPVS---FPKTVIGCGTDNAGTF--GGASSGIVGLGGGPVSLITQ 229
Query: 237 LAKAGLIRNSFSMCF------DKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIG 287
L + I FS C + + S + FGD + ST + + + Y +
Sbjct: 230 LGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVF--YFLT 285
Query: 288 VETCCIGSSCLKQTSF-----------KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 336
++ +G+ K+ F I+DSG++ T +P +VY + + V
Sbjct: 286 LQAFSVGN---KRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDR 342
Query: 337 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV 396
+ CY S P + F + + + FV +V C A QP
Sbjct: 343 VDDPNQQFSLCYSLKSNEY-DFPIITAHFKGADIELHSISTFVPITDGIV---CFAFQP- 397
Query: 397 DGDIGTI-----GQNFMTGYRV 413
+G+I QN + GY +
Sbjct: 398 SPQLGSIFGNLAQQNLLVGYDL 419
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 143/352 (40%), Gaps = 52/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL T C
Sbjct: 197 DTGSDTTW-----VQCQPCVVVCYEQREK---LFDPARSSTYANVSCAAPACSDLDTRGC 248
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 249 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNEGLFG 298
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPATQQST 272
+ GL+GLG G+ S+P K G + F+ C +G + FG PA + +T
Sbjct: 299 EAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLDFGAGSPAARLTT 352
Query: 273 S-FLASNGKYITYIIGVETCCIGSSCL--KQTSFKA---IVDSGSSFTFLPKEVYETIAA 326
+ L NG Y +G+ +G L Q+ F IVDSG+ T LP Y ++ +
Sbjct: 353 TPMLVDNGPTF-YYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAAYSSLRS 411
Query: 327 EFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP--V 377
F + S GY CY + +P+V L+F V+ +
Sbjct: 412 AFAAAM-----SARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIM 466
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +QV F A GD+G +G + + V +D + +S C
Sbjct: 467 YAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 150/362 (41%), Gaps = 57/362 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL+W C CV C S ++ PS+SST + CS C DL TS
Sbjct: 185 DTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATVPCSSASCSDLPTSK 234
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GY 214
C YT Y +++S+ G+L + L S V+ GCG G G+
Sbjct: 235 CTSASKCGYTYTY-GDSSSTQGVLATETF--------TLAKSKLPGVVFGCGDTNEGDGF 285
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGD-------- 263
G GL+GLG G +S L+++ GL + FS C D ++ + G
Sbjct: 286 SQGA---GLVGLGRGPLS---LVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEAS 337
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA--------IVDSGSSF 313
++ Q+T + + + Y + ++ +GS+ L ++F IVDSG+S
Sbjct: 338 AAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSI 397
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T+L + Y + F Q+ G C+++ ++ + ++ +L+F + +
Sbjct: 398 TYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL 457
Query: 374 NNPV---FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ P V+ G CL + G + IG ++ V+D + L ++ C
Sbjct: 458 DLPAENYMVLDGGS--GALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCN 514
Query: 431 DL 432
L
Sbjct: 515 KL 516
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/354 (23%), Positives = 142/354 (40%), Gaps = 52/354 (14%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT 153
D G DL W+ PCD +C + Y+ L+ P S L S +C D G
Sbjct: 114 DTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGD 173
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C Y Y +N+ S G L D + L+ L+ + + GCG +
Sbjct: 174 --------CIYAYTY-GDNSYSYGGLSSDSIRLM-----LLQLHYNSKICFGCGFQNKFT 219
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGD----QGP 266
G++GLG G +S+ S L I + FS C F + + ++ FG+ QG
Sbjct: 220 ADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSNSKLKFGEAAIVQGN 277
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKAIVDSGSSFTFLPKEVY--- 321
+ + + + Y + +E +G+ +K QT I+DSGS+ T+L + Y
Sbjct: 278 GVVSTPLIIKPDLPF--YYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYLEESFYNEF 335
Query: 322 -----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
ET+A E D+ + YP+ C+ + + + P V F + +
Sbjct: 336 VSLVKETVAVEEDQYI--------PYPFDFCF-TYKEGMSTPPDVVFHFTGGDVVLKPMN 386
Query: 377 VFVIYGTQVVTGFCLAIQPVDGD-IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V+ ++ C + P D I G + V +D + K+ ++ ++C
Sbjct: 387 TLVLIEDNLI---CSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDC 437
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 145/361 (40%), Gaps = 48/361 (13%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+K L D G D+ WI +C+P + Y ++ + P ASS+ + LSCS C
Sbjct: 24 TKLQYLVMDTGSDVPWI-----QCSPCKSCY----KQNDAVFDPRASSSFRRLSCSTPQC 74
Query: 150 DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
L +C + C Y + Y + + + G L D + G + V+ GCG
Sbjct: 75 KLLDVKACASTDNRCLYQVS-YGDGSFTVGDLASDSFSVSRG--------RTSPVVFGCG 125
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG-----RIFFG 262
G + V GL+GLG G++S PS L+ FS C D+G + FG
Sbjct: 126 HDNEGLF---VGAAGLLGLGAGKLSFPSQLSS-----RKFSYCLVSRDNGVRASSALLFG 177
Query: 263 DQGPATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLK--QTSFK---------AIVD 308
D T S ++ L N K T Y G+ IG + L T+FK I+D
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
SG+S T LP Y + F + + + CY S+ +P+V F +
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHF-EG 296
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ V P + FC A D+ IG RV D ++ ++G++
Sbjct: 297 GASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQ 356
Query: 429 C 429
C
Sbjct: 357 C 357
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 140/355 (39%), Gaps = 43/355 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL W C C C P +D Y PSASST L CS C + +
Sbjct: 89 DTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPLPCSSATCLPIWSRN 138
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
P C Y Y + S+G+L + L L G ++ SV V GCG G
Sbjct: 139 CTPSSLCRYRYA-YGDGAYSAGILGTETLTL---GPSSAPVSV-GGVAFGCGTDNGG--- 190
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDDSGRIFFGD-----QGPAT 268
D + G +GLG G + SLLA+ G+ + S+ + F+ G GP+T
Sbjct: 191 DSLNSTGTVGLGRGTL---SLLAQLGVGKFSYCLTDFFNSALDSPFLLGTLAELAPGPST 247
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLPK 318
QST L S Y + ++ +G L + IVDSG++FT L +
Sbjct: 248 VQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTILAE 307
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
+ + R + + C+ + + P +P + L F + +
Sbjct: 308 SGFREVVGRVARVLGQPPVNASSLDAP-CFPAPAGEPPYMPDLVLHFAGGADMRLYRDNY 366
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLKLGWSHSNCQDL 432
+ Y + + FCL I + ++ NF +++FD +L + ++C L
Sbjct: 367 MSYNEE-DSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDCSKL 420
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 148/378 (39%), Gaps = 65/378 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T+++ D G +L W+ C + AP S ++ L + YSP ++ +C R D
Sbjct: 67 QTVTMVLDTGSELSWLHC---KKAPNLHSVFDPLRS--SSYSPIPCTSP---TCRTRTRD 118
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
K+ + + Y + +S G L D H+ NS + I GC
Sbjct: 119 FSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI--------GNSAIPATIFGC---M 167
Query: 211 SGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSGRIFFGDQG 265
G+ D GLIG+ G +S + + GL FS C +D SG + FG+
Sbjct: 168 DSGFSSNSDEDSKTTGLIGMNRGSLS---FVTQMGL--QKFSYCISGQDSSGILLFGESS 222
Query: 266 ----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKA 305
P Q ST + + Y + +E + +S L+ + +
Sbjct: 223 FSWLKALKYTPLVQISTPLPYFD--RVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 280
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSSQR--LPK 357
+VDSG+ FTFL VY + EF RQ ++ E + CY+ R LP
Sbjct: 281 MVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPP 340
Query: 358 LPSVKLMFPQNNSFVVNNPVF-----VIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGY 411
LP+V LMF V + VI G+ V F + G + IG +
Sbjct: 341 LPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNV 400
Query: 412 RVVFDRENLKLGWSHSNC 429
+ FD ++G++ C
Sbjct: 401 WMEFDLAKSRVGFAEVRC 418
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 148/378 (39%), Gaps = 65/378 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+T+++ D G +L W+ C + AP S ++ L + YSP ++ +C R D
Sbjct: 74 QTVTMVLDTGSELSWLHC---KKAPNLHSVFDPLRS--SSYSPIPCTSP---TCRTRTRD 125
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
K+ + + Y + +S G L D H+ NS + I GC
Sbjct: 126 FSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI--------GNSAIPATIFGC---M 174
Query: 211 SGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSGRIFFGDQG 265
G+ D GLIG+ G +S + + GL FS C +D SG + FG+
Sbjct: 175 DSGFSSNSDEDSKTTGLIGMNRGSLS---FVTQMGL--QKFSYCISGQDSSGILLFGESS 229
Query: 266 ----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKA 305
P Q ST + + Y + +E + +S L+ + +
Sbjct: 230 FSWLKALKYTPLVQISTPLPYFD--RVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 287
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSSQR--LPK 357
+VDSG+ FTFL VY + EF RQ ++ E + CY+ R LP
Sbjct: 288 MVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPP 347
Query: 358 LPSVKLMFPQNNSFVVNNPVF-----VIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGY 411
LP+V LMF V + VI G+ V F + G + IG +
Sbjct: 348 LPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNV 407
Query: 412 RVVFDRENLKLGWSHSNC 429
+ FD ++G++ C
Sbjct: 408 WMEFDLAKSRVGFAEVRC 425
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 94/424 (22%), Positives = 165/424 (38%), Gaps = 83/424 (19%)
Query: 60 EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSK--------TMSLGN---------DFGCD 102
+ + +LS ++ M ++FP G+ T+S+G D G D
Sbjct: 34 RWRKAVLSGEITSSMMINRAGSSLVFPLHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSD 93
Query: 103 LLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL-----GTSC 155
L W+ CD C +C + P ++ + C LC +C
Sbjct: 94 LTWLQCDAPCRQC--------------IEAPHPLYRPSNNLVICEDPLCASLQPPGVHNC 139
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQASVIIGCGMKQSGG 213
Q+P Q C Y ++Y + SS G+LV+D+ L+ +G + + +GCG Q G
Sbjct: 140 QDPDQ-CDYEVEY-ADGGSSLGVLVKDVFVLNFTNG------KRLNPLLALGCGYDQLPG 191
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 273
+ DG++GLG G S+PS L+ GL+ N C G +FFG+ + T
Sbjct: 192 RSNHPL-DGILGLGRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGEDIYDSSGVTW 250
Query: 274 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 333
S Y G + + DSGSS+T+L + Y+ + R+++
Sbjct: 251 TPMSRDHLKHYSPGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQHLVFSLKRELS 310
Query: 334 -----------------------DTITSFEGY--PWKCCYKSSSQRLPKLPSVKLMFPQN 368
+I + Y P+ +K+SS R K + F
Sbjct: 311 RKPISEALDDQTLPLCWKGKRPFKSIRDVKKYFKPFALVFKTSSGRSSK---TQFEFSPE 367
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
++++ G ++ G + ++ D+ IG M V+++ E +GW+ ++
Sbjct: 368 AYLIISSKGNACLG--ILNGTEVGLR----DLNVIGDVSMLDRLVIYNNEKQMIGWAAAS 421
Query: 429 CQDL 432
C L
Sbjct: 422 CDRL 425
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 103/410 (25%), Positives = 152/410 (37%), Gaps = 110/410 (26%)
Query: 98 DFGCDLLWIPC-----DCVRCAPLSASYYNSLDRDLNE---YSPSASSTSKHLSCSHRLC 149
D G DL W+PC DC+ C Y+ + DL +SP SSTS SC+ C
Sbjct: 101 DTGSDLTWVPCGNLSFDCIEC-------YDLKNNDLKSPSVFSPLHSSTSFRDSCASSFC 153
Query: 150 DLGTSCQNP-------------------KQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S NP +PCP Y E SG+L DIL
Sbjct: 154 VEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDIL------ 207
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
+ GC + Y + P G+ G G G +S+PS L G + FS C
Sbjct: 208 --KARTRDVPRFSFGC---VTSTYRE---PIGIAGFGRGLLSLPSQL---GFLEKGFSHC 256
Query: 251 F-------DKDDSGRIFFGDQGPATQ-----QSTSFLASNGKYITYIIGVETCCIGSSC- 297
F + + S + G + Q T L + +Y IG+E+ IG++
Sbjct: 257 FLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNIT 316
Query: 298 -------LKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP--- 343
L+Q + +VDSG+++T LP+ Y Q+ T+ S YP
Sbjct: 317 PTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYS--------QLLTTLQSTITYPRAT 368
Query: 344 -------WKCCYK--SSSQRLPKLPS-VKLMFPQNNSFVVNNPVFVIYGTQVVTGF---- 389
+ CYK + L L + V ++FP +NN ++
Sbjct: 369 ETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPS 428
Query: 390 ------CLAIQPV-DGDI---GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
CL Q + DGD G G +VV+D E ++G+ +C
Sbjct: 429 DGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 127/293 (43%), Gaps = 54/293 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS- 154
D G DL+W C CV C ++ + PS+SST L CS LC DL +S
Sbjct: 120 DTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYAALPCSSTLCSDLPSSK 169
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-G 213
C + K C YT Y +++S+ G+L + L + V GCG G G
Sbjct: 170 CTSAK--CGYTYT-YGDSSSTQGVLAAETF--------TLAKTKLPDVAFGCGDTNEGDG 218
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DDSGR----------IFFG 262
+ G GL+GLG G + SL+++ GL N FS C DD+ + I
Sbjct: 219 FTQGA---GLVGLGRGPL---SLVSQLGL--NKFSYCLTSLDDTSKSPLLLGSLATISES 270
Query: 263 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK--------AIVDSGSS 312
++ Q+T + + + Y + ++ +GS+ L ++F IVDSG+S
Sbjct: 271 AAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTS 330
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
T+L + Y + F Q+ G C+++ + + ++ KL+F
Sbjct: 331 ITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQVEVPKLVF 383
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 152/371 (40%), Gaps = 44/371 (11%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F + ++ + + D G D+ W+ C C C Y D + P
Sbjct: 158 LGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDP 207
Query: 134 SASSTSKHLSCSHRLC-DL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 191
S S++ ++C + C DL +C+N C Y + Y + + + G + L L GD
Sbjct: 208 SLSTSYASVACDNPRCHDLDAAACRNSTGACLYEV-AYGDGSYTVGDFATETLTL---GD 263
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
+A +SV IGCG G + V GL+ LG G +S PS ++ +FS C
Sbjct: 264 SAPVSSVA----IGCGHDNEGLF---VGAAGLLALGGGPLSFPSQISA-----TTFSYCL 311
Query: 252 -DKDD--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK-- 304
D+D S + FGD A + + + S Y +G+ +G L ++F
Sbjct: 312 VDRDSPSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMD 370
Query: 305 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
IVDSG++ T L Y + F R + + CY S + ++
Sbjct: 371 GTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 430
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
P+V L F + ++I T +CLA P + + IG G RV FD
Sbjct: 431 PAVSLRFAGGGELRLPAKNYLIPVDGAGT-YCLAFAPTNAAVSIIGNVQQQGTRVSFDTA 489
Query: 419 NLKLGWSHSNC 429
+G++ + C
Sbjct: 490 KSTVGFTSNKC 500
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 97/394 (24%), Positives = 162/394 (41%), Gaps = 64/394 (16%)
Query: 70 VQKQKMKTGPQFQMLFPSQGSKTMSLGN---------DFGCDLLWIPCD-CVRC-APLSA 118
+KQ+ TG + + P T+ LGN G D++W+PC C C P
Sbjct: 58 AKKQQGVTGFVLEAM-PGLYCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDCPTP--- 113
Query: 119 SYYNSLDRDLNEYSPSASST-----------SKHLSCSHRLCDLGTSCQNPKQPCPYTMD 167
+ + L+ Y P SST + L H +C + + C Y
Sbjct: 114 ---DDIGFSLDLYDPKNSSTSSEISCSDDRCADALKTGHAICH---TSHSSGDQCGYNQI 167
Query: 168 YYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGL 226
Y +++G V D +H I G+ + +S ASVI GC +SG + DG+IG
Sbjct: 168 YADGVLATTGYYVSDDIHFDIFMGNESFASS-SASVIFGCSKSRSG----HLQADGVIGF 222
Query: 227 GLGEISVPSLLAKAGLIRNSFSMCFDK-DDSGRIFFGDQ-GPATQQSTSFLAS----NGK 280
G S+ S L G + ++FS C D DD G + D+ G + TS +AS N
Sbjct: 223 GKDAPSLISQLNSQG-VSHAFSRCLDDSDDGGGVLILDEVGEPGLEFTSLVASRPCYNLN 281
Query: 281 YITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+ + + I SS +S + +DSG+S + P VY+ + + + SF
Sbjct: 282 MKSIAVNNQNVPIDSSLFTTSSTQGTFLDSGTSLAYFPDGVYDPVIRAI-LFIYFSTRSF 340
Query: 340 EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD 399
+P Y + P L+ + S+ +N ++ C+A Q +GD
Sbjct: 341 SSFPTVTXYFEGGAAMKVGPENYLL--RRGSY--DNDSYM----------CIAFQRSEGD 386
Query: 400 IG---TIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+G + V++ + +++GW + NC+
Sbjct: 387 YKQTTILGDLILHDKIFVYNLKKMQIGWVNYNCK 420
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 141/364 (38%), Gaps = 56/364 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T+ + D D W+PC C CA S S+ SP+ SST + + C
Sbjct: 112 AQTLLVAIDPSNDAAWVPCSACAGCAASSPSF-----------SPTQSSTYRTVPCGSPQ 160
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGL---LVEDILHLISGGDN-ALKNSVQASVII 204
C Q P CP + SS G + G D+ AL+N+V S
Sbjct: 161 C-----AQVPSPSCPAGVG------SSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTF 209
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIF 260
GC SG + V P GLIG G G +S L + FS C + SG +
Sbjct: 210 GCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLK 264
Query: 261 FGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDS 309
G G P ++T L + + Y + + +GS ++ T I+D+
Sbjct: 265 LGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDA 324
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 369
G+ FT L VY + F +V + G + CY + +P+V MF
Sbjct: 325 GTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVTV----SVPTVTFMFAGAV 379
Query: 370 SFVVNNPVFVIYGTQ-VVTGFCLAIQPVDG---DIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ + +I+ + V +A P DG + + RV+FD N ++G+S
Sbjct: 380 AVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFS 439
Query: 426 HSNC 429
C
Sbjct: 440 RELC 443
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 153/387 (39%), Gaps = 69/387 (17%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSAS 136
TG F L + +L D G DL W+ C +P + P S
Sbjct: 113 TGQYFVKLRVGTPVQEFTLVADTGSDLTWV--KCAGASPPG-----------RVFRPKTS 159
Query: 137 STSKHLSCSHRLCDLGT-----SCQNPKQPCPYTMDYYTENTSSSGLL-VEDILHLISGG 190
+ + CS C L +C +P PC Y Y + + G++ E + GG
Sbjct: 160 RSWAPIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGG 219
Query: 191 DNA-LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 249
A LK+ V++GC G DG++ LG +IS + A SFS
Sbjct: 220 KVAQLKD-----VVLGCSSSHDGQSFRSA--DGVLSLGNAKISFAT--QAAARFGGSFSY 270
Query: 250 CF-----DKDDSGRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----- 298
C ++ +G + FG Q P T + + L + + Y + V+ + L
Sbjct: 271 CLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAE 330
Query: 299 --KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR-- 354
S I+DSG++ T L Y+ + A + + D + P++ CY +++R
Sbjct: 331 VWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHL-DGVPKVSFPPFEHCYNWTARRPG 389
Query: 355 ----LPKLP-----SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD---IGT 402
+PKL S +L P S+V++ V G + C+ +Q +G+ +
Sbjct: 390 APEIIPKLAVQFAGSARLE-PPAKSYVID----VKPGVK-----CIGVQ--EGEWPGLSV 437
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNC 429
IG + FD +N+++ + SNC
Sbjct: 438 IGNIMQQEHLWEFDLKNMQVRFKQSNC 464
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 97/347 (27%), Positives = 148/347 (42%), Gaps = 43/347 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC-SHRLCDLGTS-C 155
D G D+ W V+CAP A Y D + PS SS+ L+C +H+ L S C
Sbjct: 173 DTGSDVNW-----VQCAPC-ADCYQQADP---IFEPSFSSSYAPLTCETHQCKSLDVSEC 223
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+N C Y + Y + + + G + + L G +L N V IGCG G +
Sbjct: 224 RN--DSCLYEVSY-GDGSYTVGDFATETITL--DGSASLNN-----VAIGCGHDNEGLF- 272
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQST 272
V GL+GLG G +S PS + + SFS C D D + + F P+ +
Sbjct: 273 --VGAAGLLGLGGGSLSFPSQINAS-----SFSYCLVNRDTDSASTLEFNSPIPSHSVTA 325
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVYE 322
L +N Y +G+ +G L ++SF+ IVDSG++ T L +VY
Sbjct: 326 PLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYN 385
Query: 323 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
++ F R ++ + CY SS+ ++P+V FP + ++I
Sbjct: 386 SLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPV 445
Query: 383 TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T FC A P + IG G RV +D N +G+S + C
Sbjct: 446 DSAGT-FCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 147/359 (40%), Gaps = 48/359 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG----T 153
D G DL W V+C P + S Y + + PS SST + C C +G
Sbjct: 144 DTGSDLTW-----VQCKPCTDSCYQQQE---PLFDPSKSSTYVDVPCGTPQCKIGGGQDL 195
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
+C C Y++ Y + + + G L ++ L A A V+ GC + S G
Sbjct: 196 TCGGTT--CEYSVKY-GDQSVTRGNLAQEAFTLSPSAPPA------AGVVFGCSHEYSSG 246
Query: 214 YL---DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
+ ++ GL+GLG G+ S+ S + G + FS C S + A Q
Sbjct: 247 VKGAEEEMSVAGLLGLGRGDSSILSQ-TRRGNSGDVFSYCLPPRGSSAGYLTIGAAAPPQ 305
Query: 271 S----TSFLASNGK----YITYIIGVETCCIGSSC-LKQTSF--KAIVDSGSSFTFLPKE 319
S T + N + Y+ ++G+ G++ + ++F ++DSG+ T +P
Sbjct: 306 SNLSFTPLVTDNSQLSSVYVVNLVGISVS--GAALPIDASAFYIGTVIDSGTVITHMPAA 363
Query: 320 VYETIAAEFDRQVNDTITSFEGY--PWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN-NP 376
Y + EF R + EG+ CY + + P V L F V+ +
Sbjct: 364 AYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGARIDVDASG 423
Query: 377 VFVIYGT----QVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +++ Q +T CLA P + G + IG Y VVFD E ++G+ + C
Sbjct: 424 ILLVFAVDASGQSLTLACLAFVPTNLPGFV-IIGNMQQRAYNVVFDVEGRRIGFGANGC 481
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 113/454 (24%), Positives = 182/454 (40%), Gaps = 77/454 (16%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVM--FSTKLIHRFSEEVKALGVSKNRNATSWPAKKS 58
MN +S + L+ F+L S ++ V FS +LIHR S + ++N+ A
Sbjct: 1 MNTVSF-LTLSFFFLCFSISFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDAV-- 57
Query: 59 FEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMS--LGN---------DFGCDLLWIP 107
+ + + K + + P+ + +G MS +G D G D++W+
Sbjct: 58 --HRSINRVNHSNKNSLASTPE-STVISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQ 114
Query: 108 CD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPY 164
C+ C +C YN N PS SS+ K++SCS +LC TSC N K+ C Y
Sbjct: 115 CEPCEQC-------YNQTTPKFN---PSKSSSYKNISCSSKLCQSVRDTSC-NDKKNCEY 163
Query: 165 TMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGMKQSGGY--------L 215
+++Y ++ S L +E + L +G + +V IGCG G +
Sbjct: 164 SINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTV-----IGCGTNNIGSFKRVSSGVVG 218
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAG--LIRNSFSMCFDKDDSGRIFFGDQGPATQQ--- 270
G P LI LG PS+ K L+R S ++ S ++ FGD +
Sbjct: 219 LGGGPASLI-TQLG----PSIGGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVL 273
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA----------IVDSGSSFTFLPKEV 320
ST + + + Y + +E +G K+ F I+DS + TF+P +V
Sbjct: 274 STPIVKKDHSFF-YYLTIEAFSVGD---KRVEFAGSSKGVEEGNIIIDSSTIVTFVPSDV 329
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + + V + CY SS P + F + + FV
Sbjct: 330 YTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSDEEYDFPYMTAHFKGADILLYATNTFVE 389
Query: 381 YGTQVVTGFCLAIQPVDGD--IGTIG-QNFMTGY 411
V+ C A P +G G+ Q+FM GY
Sbjct: 390 VARDVL---CFAFAPSNGGAIFGSFSQQDFMVGY 420
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 97/389 (24%), Positives = 157/389 (40%), Gaps = 53/389 (13%)
Query: 66 LSSDVQKQKMKT----------GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAP 115
L +++Q Q + T G F + +K+ + D G D+ WI +C P
Sbjct: 135 LQTEIQPQDLSTPVSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWI-----QCQP 189
Query: 116 LSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENT 173
S Y S ++P+ASS+ L+C + C+ +SC+N + C Y ++Y + +
Sbjct: 190 CSDCYQQSDPI----FTPAASSSYSPLTCDSQQCNSLQMSSCRNGQ--CRYQVNY-GDGS 242
Query: 174 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
+ G V + + GG + S+ +GCG G ++ GL G L
Sbjct: 243 FTFGDFVTETMSF--GGSGTVN-----SIALGCGHDNEGLFVGAAGLLGLGGGPL----- 290
Query: 234 PSLLAKAGLIRNSFSMCFDKDDSG--RIFFGDQGPATQQSTSFLASNGKYIT-YIIGVET 290
SL ++ L SFS C DS + P + L + K T Y +G+
Sbjct: 291 -SLTSQ--LKATSFSYCLVNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSG 347
Query: 291 CCIGSSCLK--QTSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+G L+ Q FK IVD G++ T L E Y ++ F ++
Sbjct: 348 MSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSG 407
Query: 341 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDI 400
+ CY S Q K+P+V F S+ + ++I T +C A P +
Sbjct: 408 VALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLPAANYLIPVDSAGT-YCFAFAPTTSSL 466
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
IG G RV FD N ++G+S + C
Sbjct: 467 SIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 152/371 (40%), Gaps = 44/371 (11%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F + ++ + + D G D+ W+ C C C Y D + P
Sbjct: 162 LGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDP 211
Query: 134 SASSTSKHLSCSHRLC-DL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 191
S S++ ++C + C DL +C+N C Y + Y + + + G + L L GD
Sbjct: 212 SLSTSYASVACDNPRCHDLDAAACRNSTGACLYEV-AYGDGSYTVGDFATETLTL---GD 267
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 251
+A +SV IGCG G + V GL+ LG G +S PS ++ +FS C
Sbjct: 268 SAPVSSVA----IGCGHDNEGLF---VGAAGLLALGGGPLSFPSQISA-----TTFSYCL 315
Query: 252 -DKDD--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK-- 304
D+D S + FGD A + + + S Y +G+ +G L ++F
Sbjct: 316 VDRDSPSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMD 374
Query: 305 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
IVDSG++ T L Y + F R + + CY S + ++
Sbjct: 375 STGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 434
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
P+V L F + ++I T +CLA P + + IG G RV FD
Sbjct: 435 PAVSLRFAGGGELRLPAKNYLIPVDGAGT-YCLAFAPTNAAVSIIGNVQQQGTRVSFDTA 493
Query: 419 NLKLGWSHSNC 429
+G++ + C
Sbjct: 494 KSTVGFTTNKC 504
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 141/364 (38%), Gaps = 56/364 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T+ + D D W+PC C CA S S+ SP+ SST + + C
Sbjct: 93 AQTLLVAIDPSNDAAWVPCSACAGCAASSPSF-----------SPTQSSTYRTVPCGSPQ 141
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGL---LVEDILHLISGGDN-ALKNSVQASVII 204
C Q P CP + SS G + G D+ AL+N+V S
Sbjct: 142 C-----AQVPSPSCPAGVG------SSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTF 190
Query: 205 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIF 260
GC SG + V P GLIG G G +S L + FS C + SG +
Sbjct: 191 GCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLK 245
Query: 261 FGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDS 309
G G P ++T L + + Y + + +GS ++ T I+D+
Sbjct: 246 LGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDA 305
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 369
G+ FT L VY + F +V + G + CY + +P+V MF
Sbjct: 306 GTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVTV----SVPTVTFMFAGAV 360
Query: 370 SFVVNNPVFVIYGTQ-VVTGFCLAIQPVDG---DIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ + +I+ + V +A P DG + + RV+FD N ++G+S
Sbjct: 361 AVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFS 420
Query: 426 HSNC 429
C
Sbjct: 421 RELC 424
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 109/447 (24%), Positives = 178/447 (39%), Gaps = 81/447 (18%)
Query: 6 LTIYLAVF-WLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQV 64
+ I+L + + L ++ + F+ LIHR S + N S A F+ Y+
Sbjct: 8 IAIFLQIITYFLITTTASSPQGFTIDLIHRRSNASSSRVF--NTQLGSPYADTVFDTYEY 65
Query: 65 LLSSDVQKQKMKTG-PQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYN 122
L+ K++ G P F++ D G + +W C CV C +A ++
Sbjct: 66 LM-------KLQIGTPPFEI----------EAVLDTGSEHIWTQCLPCVHCYNQTAPIFD 108
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 182
PS SST K + C CPY + Y ++ + L+ E
Sbjct: 109 ----------PSKSSTFKEIRCDTH-----------DHSCPYELVYGGKSYTKGTLVTET 147
Query: 183 I-LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
+ +H SG + V IIGCG SG + G A G++GL G S+ + G
Sbjct: 148 VTIHSTSG-----QPFVMPETIIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGG 197
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCL 298
S CF + +I FG ST+ K Y + ++ +G++ +
Sbjct: 198 EYPGLMSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRI 257
Query: 299 KQ--TSFKA-----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSS 351
+ T F A ++DSGS+ T+ P+ + ++ V T F C Y
Sbjct: 258 ETVGTPFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVV--TAVRFPRSDILCYY--- 312
Query: 352 SQRLPKLPSVKLMFPQNNSFVVNN-PVFVIYGTQVVTGFCLAI---QPVDGDI-GTIGQ- 405
S+ + P + + F V++ ++V T V FCLAI P++ I G Q
Sbjct: 313 SKTIDIFPVITMHFSGGADLVLDKYNMYVASNTGGV--FCLAIICNSPIEEAIFGNRAQN 370
Query: 406 NFMTGYRVVFDRENLKLGWSHSNCQDL 432
NF+ GY D +L + + +NC L
Sbjct: 371 NFLVGY----DSSSLLVSFKPTNCSAL 393
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 146/345 (42%), Gaps = 40/345 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G D+ W+ +C P + Y+ + + PS+SS+ + LSC C+ +
Sbjct: 166 DTGSDVNWL-----QCTPCADCYHQTEPI----FEPSSSSSYEPLSCDTPQCNALEVSEC 216
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
C Y + Y + + + G + L + G ++N V +GCG G +
Sbjct: 217 RNATCLYEVSY-GDGSYTVGDFATETLTI---GSTLVQN-----VAVGCGHSNEGLF--- 264
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQSTSF 274
V GL+GLG G +++PS L SFS C D D + + FG
Sbjct: 265 VGAAGLLGLGGGLLALPSQLNTT-----SFSYCLVDRDSDSASTVDFGTSLSPDAVVAPL 319
Query: 275 LASNGKYITYIIGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVYETI 324
L ++ Y +G+ +G L+ Q+SF+ I+DSG++ T L E+Y ++
Sbjct: 320 LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSL 379
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
F + D + + CY S++ ++P+V FP + ++I
Sbjct: 380 RDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDS 439
Query: 385 VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V T FCLA P + IG G RV FD N +G+S + C
Sbjct: 440 VGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 148/334 (44%), Gaps = 45/334 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D D++W+ C C C YN + PS S T K+L CS C GTS
Sbjct: 106 DTASDIIWVQCQLCETC-------YNDTSP---MFDPSYSKTYKNLPCSSTTCKSVQGTS 155
Query: 155 CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + ++ C +T++Y + + S G L+ + + L S D + +IGC ++ +
Sbjct: 156 CSSDERKICEHTVNY-KDGSHSQGDLIVETVTLGSYNDPFVHF---PRTVIGC-IRNTNV 210
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFFGDQGPATQQ- 270
D + G++GLG G +S+ L+ + I FS C D S ++ FGD +
Sbjct: 211 SFDSI---GIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLKFGDAAMVSGDG 265
Query: 271 --STSFLASNGKYITYIIGVETCCIGSSCLKQTSF--------KAIVDSGSSFTFLPKEV 320
ST + + K Y + +E +G++ ++ S I+DSG++FT LP +V
Sbjct: 266 TVSTRIVFKDWKKF-YYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSGTTFTVLPDDV 324
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + + V + CYKS+ ++ +P + F + + F++
Sbjct: 325 YSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKV-DVPVITAHFSGADVKLNALNTFIV 383
Query: 381 YGTQVVTGFCLA-IQPVDGDI-GTIG-QNFMTGY 411
+VV CLA + G I G + QNF+ GY
Sbjct: 384 ASHRVV---CLAFLSSQSGAIFGNLAQQNFLVGY 414
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 142/358 (39%), Gaps = 40/358 (11%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ +SL D G DL W +C P + S Y D + PS SS+ +++C+ LC
Sbjct: 57 RDLSLVFDTGSDLTW-----TQCEPCAGSCYKQQDA---IFDPSKSSSYTNITCTSSLCT 108
Query: 151 LGTSCQNPKQPCPYTMDY-------YTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
TS K C + D Y +N++S G L ++ L + + + +
Sbjct: 109 QLTS-DGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITA-------TDIVDDFL 160
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFF 261
GCG G +G A GL+GLG IS+ + + FS C S G + F
Sbjct: 161 FGCGQDNEG-LFNGSA--GLMGLGRHPISI--VQQTSSNYNKIFSYCLPATSSSLGHLTF 215
Query: 262 GDQGPATQQS---TSFLASNGKYITYIIGVETCCIGSSCL---KQTSFKA---IVDSGSS 312
G AT S T +G Y + + + +G + L ++F A I+DSG+
Sbjct: 216 G-ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTV 274
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
T L VY + + F R + + E CY S + +P + F +
Sbjct: 275 ITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVE 334
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + + ++ A D DI G VV+D + ++G+ + C+
Sbjct: 335 LXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGCK 392
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 144/381 (37%), Gaps = 67/381 (17%)
Query: 74 KMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYS 132
++K G Q++F M L D D W+PC DC C+ + +S
Sbjct: 102 RVKLGTPGQLMF-------MVL--DTSRDAAWVPCADCAGCSSPT-------------FS 139
Query: 133 PSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
P+ SST L CS C G SC + Y ++S S +L +D L
Sbjct: 140 PNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSL------ 193
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG-LIRNSFSM 249
L S GC SG L P GL+GLG G +S LL+++G L FS
Sbjct: 194 --GLAVDTLPSYSFGCVNAVSGSTLP---PQGLLGLGRGPMS---LLSQSGSLYSGVFSY 245
Query: 250 CFDKDDS----GRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----- 299
CF S G + G G P ++T L + + Y + + +G +
Sbjct: 246 CFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPEL 305
Query: 300 -----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR 354
T I+DSG+ T + VY I EF +QV + + C+ ++++
Sbjct: 306 LAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATIGAF--DTCFAATNED 363
Query: 355 LP-----KLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMT 409
+ + L P N+ + ++ G+ A V+ + I
Sbjct: 364 IAPPVTFHFTGMDLKLPLENTLIHSSA-----GSLACLAMAAAPNNVNSVLNVIANLQQQ 418
Query: 410 GYRVVFDRENLKLGWSHSNCQ 430
R++FD N +LG + C
Sbjct: 419 NLRIMFDVTNSRLGIARELCN 439
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 138/345 (40%), Gaps = 38/345 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQ 156
D G L W+ +C+P S + + + P SS+ +SCS CD L T+
Sbjct: 135 DTGSSLTWL-----QCSPCRVSCHR---QSGPVFDPKTSSSYAAVSCSSPQCDGLSTATL 186
Query: 157 NPK-----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
NP C Y Y +++ S G L +D +S G N++ N GCG
Sbjct: 187 NPAVCSPSNVCIYQASY-GDSSFSVGYLSKDT---VSFGANSVPN-----FYYGCGQDNE 237
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFGDQGPATQQ 270
G + GL+GL ++S+ L A + SFS C SG + G P
Sbjct: 238 GLFGRSA---GLMGLARNKLSL--LYQLAPTLGYSFSYCLPSTSSSGYLSIGSYNPGGYS 292
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKAIVDSGSSFTFLPKEVYETIA 325
T +++ Y I + + L + TS I+DSG+ T LP VY ++
Sbjct: 293 YTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLPTSVYTALS 352
Query: 326 AEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
+ + Y C++ + +L +P+V + F + ++ ++
Sbjct: 353 KAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLVDVDG 412
Query: 385 VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T CLA P IG + VV+D ++ ++G++ + C
Sbjct: 413 ATT--CLAFAPAR-SAAIIGNTQQQTFSVVYDVKSNRIGFAAAGC 454
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 150/362 (41%), Gaps = 57/362 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL+W C CV C S ++ PS+SST + CS C DL TS
Sbjct: 113 DTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATVPCSSASCSDLPTSK 162
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GY 214
C YT Y +++S+ G+L + L S V+ GCG G G+
Sbjct: 163 CTSASKCGYTYTY-GDSSSTQGVLATETF--------TLAKSKLPGVVFGCGDTNEGDGF 213
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG------ 265
G GL+GLG G +S L+++ GL + FS C D ++ + G
Sbjct: 214 SQGA---GLVGLGRGPLS---LVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEAS 265
Query: 266 --PATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA--------IVDSGSSF 313
++ Q+T + + + Y + ++ +GS+ L ++F IVDSG+S
Sbjct: 266 AAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSI 325
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T+L + Y + F Q+ G C+++ ++ + ++ +L+F + +
Sbjct: 326 TYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL 385
Query: 374 NNPV---FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ P V+ G CL + G + IG ++ V+D + L ++ C
Sbjct: 386 DLPAENYMVLDGGS--GALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCN 442
Query: 431 DL 432
L
Sbjct: 443 KL 444
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 109/447 (24%), Positives = 178/447 (39%), Gaps = 81/447 (18%)
Query: 6 LTIYLAVF-WLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQV 64
+ I+L + + L ++ + F+ LIHR S + N S A F+ Y+
Sbjct: 2 IAIFLQIITYFLITTTASSPQGFTIDLIHRRSNASSSRVF--NTQLGSPYADTVFDTYEY 59
Query: 65 LLSSDVQKQKMKTG-PQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYN 122
L+ K++ G P F++ D G + +W C CV C +A ++
Sbjct: 60 LM-------KLQIGTPPFEI----------EAVLDTGSEHIWTQCLPCVHCYNQTAPIFD 102
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 182
PS SST K + C CPY + Y ++ + L+ E
Sbjct: 103 ----------PSKSSTFKEIRCDTH-----------DHSCPYELVYGGKSYTKGTLVTET 141
Query: 183 I-LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
+ +H SG + V IIGCG SG + G A G++GL G S+ + G
Sbjct: 142 VTIHSTSG-----QPFVMPETIIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGG 191
Query: 242 LIRNSFSMCFDKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCL 298
S CF + +I FG ST+ K Y + ++ +G++ +
Sbjct: 192 EYPGLMSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRI 251
Query: 299 KQ--TSFKA-----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSS 351
+ T F A ++DSGS+ T+ P+ + ++ V T F C Y
Sbjct: 252 ETVGTPFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVV--TAVRFPRSDILCYY--- 306
Query: 352 SQRLPKLPSVKLMFPQNNSFVVNN-PVFVIYGTQVVTGFCLAI---QPVDGDI-GTIGQ- 405
S+ + P + + F V++ ++V T V FCLAI P++ I G Q
Sbjct: 307 SKTIDIFPVITMHFSGGADLVLDKYNMYVASNTGGV--FCLAIICNSPIEEAIFGNRAQN 364
Query: 406 NFMTGYRVVFDRENLKLGWSHSNCQDL 432
NF+ GY D +L + + +NC L
Sbjct: 365 NFLVGY----DSSSLLVSFKPTNCSAL 387
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 109/443 (24%), Positives = 174/443 (39%), Gaps = 57/443 (12%)
Query: 3 RISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKK 57
R LT+ + S A+ FS +LIHR S + ++N+ +A +
Sbjct: 4 RSFLTLLFFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINR 63
Query: 58 SFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN--DFGCDLLWIPCD-CVRCA 114
+ +Y+ L++ Q + ++ M + S G+ L D G D++W+ C+ C C
Sbjct: 64 ANHFYKYSLANIPQSTVIPDIGEYLMTY-SVGTPPFKLYGIVDTGSDIVWLQCEPCQECY 122
Query: 115 PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTEN 172
+ +N PS SS+ K++ C +LC TSC N K C Y+ YY +N
Sbjct: 123 NQTTPMFN----------PSKSSSYKNIPCPSKLCQSMEDTSC-NDKNYCEYST-YYGDN 170
Query: 173 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 232
+ S G L D L L S N L S +++IGCG Y +G A G++G G G S
Sbjct: 171 SHSGGDLSVDTLTLES--TNGLTVSF-PNIVIGCGTNNILSY-EG-ASSGIVGFGSGPAS 225
Query: 233 VPSLLAKAGLIRNSFSMCF---------DKDDSGRIFFGDQGPATQQ---STSFLASNGK 280
+ L + FS C + + ++ FGD + +T L + +
Sbjct: 226 FITQLGSS--TGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPE 283
Query: 281 YITYI------IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND 334
Y+ +G IG I+DSG++ T L K+ Y + + V
Sbjct: 284 TFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKL 343
Query: 335 TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQ 394
CY ++ P + + F + + FV V FCLA +
Sbjct: 344 ERVDDPTQTLNLCYSVKAEGY-DFPIITMHFKGADVDLHPISTFVSVADGV---FCLAFE 399
Query: 395 PVDGDIGTIG----QNFMTGYRV 413
D G QN M GY +
Sbjct: 400 SSQ-DHAIFGNLAQQNLMVGYDL 421
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 150/362 (41%), Gaps = 57/362 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL+W C CV C S ++ PS+SST + CS C DL TS
Sbjct: 92 DTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATVPCSSASCSDLPTSK 141
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GY 214
C YT Y +++S+ G+L + L S V+ GCG G G+
Sbjct: 142 CTSASKCGYTYTY-GDSSSTQGVLATETF--------TLAKSKLPGVVFGCGDTNEGDGF 192
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG------ 265
G GL+GLG G +S L+++ GL + FS C D ++ + G
Sbjct: 193 SQGA---GLVGLGRGPLS---LVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEAS 244
Query: 266 --PATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA--------IVDSGSSF 313
++ Q+T + + + Y + ++ +GS+ L ++F IVDSG+S
Sbjct: 245 AAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSI 304
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T+L + Y + F Q+ G C+++ ++ + ++ +L+F + +
Sbjct: 305 TYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL 364
Query: 374 NNPV---FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ P V+ G CL + G + IG ++ V+D + L ++ C
Sbjct: 365 DLPAENYMVLDGGS--GALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCN 421
Query: 431 DL 432
L
Sbjct: 422 KL 423
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 150/362 (41%), Gaps = 57/362 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL+W C CV C S ++ PS+SST + CS C DL TS
Sbjct: 123 DTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATVPCSSASCSDLPTSK 172
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GY 214
C YT Y +++S+ G+L + L S V+ GCG G G+
Sbjct: 173 CTSASKCGYTYTY-GDSSSTQGVLATETF--------TLAKSKLPGVVFGCGDTNEGDGF 223
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG------ 265
G GL+GLG G +S L+++ GL + FS C D ++ + G
Sbjct: 224 SQGA---GLVGLGRGPLS---LVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEAS 275
Query: 266 --PATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA--------IVDSGSSF 313
++ Q+T + + + Y + ++ +GS+ L ++F IVDSG+S
Sbjct: 276 AAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSI 335
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T+L + Y + F Q+ G C+++ ++ + ++ +L+F + +
Sbjct: 336 TYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL 395
Query: 374 NNPV---FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ P V+ G CL + G + IG ++ V+D + L ++ C
Sbjct: 396 DLPAENYMVLDGGS--GALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCN 452
Query: 431 DL 432
L
Sbjct: 453 KL 454
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 80/161 (49%), Gaps = 23/161 (14%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + ++PC DC +C ++ P SST + + C ++ +C
Sbjct: 111 DSGSTVTYVPCSDCEQCGKHQDP----------KFQPEMSSTYQPVKC-----NMDCNCD 155
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ ++ C Y +Y E++SS G+L ED LIS G+ + +A + GC ++G
Sbjct: 156 DDREQCVYEREY-AEHSSSKGVLGED---LISFGNESQLTPQRA--VFGCETVETGDLYS 209
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
A DG+IGLG G++S+ L GLI NSF +C+ D G
Sbjct: 210 QRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVG 249
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 148/348 (42%), Gaps = 46/348 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTS 154
D G D+ W V+CAP A Y D + P++S++ LSC+ R C D+ +
Sbjct: 167 DTGSDVNW-----VQCAPC-ADCYQQADP---IFEPASSASFSTLSCNTRQCRSLDV-SE 216
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + E I L ++ +V IGCG G +
Sbjct: 217 CRN--DTCLYEVSYGDGSYTVGDFVTETI---------TLGSAPVDNVAIGCGHNNEGLF 265
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQS 271
V GL+GLG G +S PS + SFS C D + + + F P S
Sbjct: 266 ---VGAAGLLGLGGGSLSFPSQINAT-----SFSYCLVDRDSESASTLEFNSTLPPNAVS 317
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLPKEVY 321
L ++ Y +G+ +G + +++F+ IVDSG++ T L +VY
Sbjct: 318 APLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVY 377
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
++ F ++ D ++ + CY SS+ ++P+V FP + +++
Sbjct: 378 NSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVP 437
Query: 382 GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T FC A P + IG G RVV+D N +G+ + C
Sbjct: 438 LDSEGT-FCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 150/373 (40%), Gaps = 66/373 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT- 153
D G +L W+ CAP A S + P ASST + C+ C DL +
Sbjct: 103 DTGSELSWL-----LCAPAGARNKFSA----MSFRPRASSTFAAVPCASAQCRSRDLPSP 153
Query: 154 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+C C ++ Y + +SS G L D+ + SG ++A+ GC
Sbjct: 154 PACDGASSRCSVSLSY-ADGSSSDGALATDVFAVGSG------PPLRAA--FGCMSSAFD 204
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFG--------- 262
DGVA GL+G+ G + S +++A R FS C D+DD+G + G
Sbjct: 205 SSPDGVASAGLLGMNRGAL---SFVSQASTRR--FSYCISDRDDAGVLLLGHSDLPTFLP 259
Query: 263 -DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSF 313
+ P Q + +A + + + +G + I +S L A +VDSG+ F
Sbjct: 260 LNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQF 319
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCYKSSSQRLP---KLPSVKLM 364
TFL + Y + AEF RQ + + + + C++ R P +LP V L+
Sbjct: 320 TFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLL 379
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTG---FCLA-----IQPVDGDIGTIGQNFMTGYRVVFD 416
F V + + + G +CL + P+ + IG + V +D
Sbjct: 380 FNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYV--IGHHHQMNVWVEYD 437
Query: 417 RENLKLGWSHSNC 429
E ++G + C
Sbjct: 438 LERGRVGLAPVRC 450
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 150/369 (40%), Gaps = 54/369 (14%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+ WI C C C P +N P ASST C++ + C
Sbjct: 156 DTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST-----CTNVYQGVKPFCS 210
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ---ASVIIGCGMKQSGG 213
+ C +++ Y + + SSGLL + I+G + +++ +GC G
Sbjct: 211 PSGRTCLFSIQY-GDGSLSSGLLA---METIAGNTPNFGDGEPVKLSNITLGCADIDREG 266
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DK----DDSGRIFFGDQG--- 265
G + GL+G+ IS PS L+ FS CF DK + SG +FFG+
Sbjct: 267 LPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCFPDKIAHLNSSGLVFFGESDIIS 322
Query: 266 ------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT-----------SFKAIVD 308
P Q AS Y ++G+ + S L + S I+D
Sbjct: 323 PYLRYTPLVQNPAVPSASLDYYYVGLVGIS---VDESRLPLSHKNFDIDKVTGSGGTIID 379
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKLPSVKLM 364
SG++FT+L K ++ + EF + + + + CY +++ LPS+ L
Sbjct: 380 SGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALESTILPSITLH 439
Query: 365 FPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDGDI--GTIGQNFMTGYRVVFDRENL 420
F V+ N+ + + ++ T CLA Q + GDI IG V +D E L
Sbjct: 440 FRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ-MSGDIPFNIIGNYQQQNLWVEYDLEKL 498
Query: 421 KLGWSHSNC 429
+LG + + C
Sbjct: 499 RLGIAPAQC 507
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 147/369 (39%), Gaps = 46/369 (12%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSAS 136
+G F + Q SK + D G D+ W+ +C P S Y S + P+AS
Sbjct: 154 SGEYFSRVGVGQPSKPFYMVLDTGSDVNWL-----QCKPCSDCYQQSDPI----FDPTAS 204
Query: 137 STSKHLSCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 194
S+ L+C + C DL S C+N K C Y + Y + + + G V + + +G N
Sbjct: 205 SSYNPLTCDAQQCQDLEMSACRNGK--CLYQVSY-GDGSFTVGEYVTETVSFGAGSVN-- 259
Query: 195 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 254
V IGCG G ++ L + L + + SFS C
Sbjct: 260 ------RVAIGCGHDNEGLFVGSAG--------LLGLGGGPLSLTSQIKATSFSYCLVDR 305
Query: 255 DSGR---IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQT 301
DSG+ + F P L + Y + + +G + +
Sbjct: 306 DSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSG 365
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPS 360
+ IVDSG++ T L + Y ++ F R+ ++ + EG + CY SS + ++P+
Sbjct: 366 AGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSN-LRPAEGVALFDTCYDLSSLQSVRVPT 424
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
V F + ++ + ++I T +C A P + IG G RV FD N
Sbjct: 425 VSFHFSGDRAWALPAKNYLIPVDGAGT-YCFAFAPTTSSMSIIGNVQQQGTRVSFDLANS 483
Query: 421 KLGWSHSNC 429
+G+S + C
Sbjct: 484 LVGFSPNKC 492
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 139/358 (38%), Gaps = 49/358 (13%)
Query: 98 DFGCDLLWIPCD-CVRCA-------PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
D G D+LW+ C C C PLS ++ T + + CS
Sbjct: 101 DTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSSCSDPLCTGEEVVCSR--- 157
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
C Y + Y + ++S G V D +H + G NA + + GC
Sbjct: 158 ------SGNNSACAY-VSSYQDKSASVGAYVRDDMHYVLHGGNA----TTSRIFFGCATN 206
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGDQGPA 267
+G + DG++G GL +VP+ +A + FS C +K G + FG+
Sbjct: 207 ITGSW----PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEAPNT 262
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCL----KQTSF--------KAIVDSGSSFTF 315
T+ + L + + Y + + + + S L K+ S+ I+DSG++F
Sbjct: 263 TEMVFTPLLNVTTH--YNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSGTTFVL 320
Query: 316 LPKEVYETIAAEFDRQVNDTIT-SFEGYPWKCCY-KSSSQRLPKLPSVKLMFPQNNSFVV 373
L + + E + EG +C Y KS P+V L F ++ +
Sbjct: 321 LTTKANRMLFQEIKSLTTAKLGPKLEGL--ECFYLKSGLTMETSFPNVTLTFSGGSTMKL 378
Query: 374 --NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+N + + + G+C A DG + G+ + V +D EN ++GW NC
Sbjct: 379 KPDNYLVMAEYKKKRNGYCYAWSSADG-LTIFGEIVLKDKLVFYDVENRRIGWKGQNC 435
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 151/390 (38%), Gaps = 59/390 (15%)
Query: 70 VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLN 129
V + +G F F + SL D G DLLW V+C+P Y +D
Sbjct: 54 VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLW-----VQCSPCRQCY----AQDSP 104
Query: 130 EYSPSASSTSKHLSCSHRLCDL-----GTSCQNPKQPCPYTMDY-YTENTSSSGLLVEDI 183
Y PS SST + C C L G C + + P +Y Y + +SS G+
Sbjct: 105 LYVPSNSSTFSPVPCLSSDCLLIPATEGFPC-DFRYPGACAYEYLYADTSSSKGVFAY-- 161
Query: 184 LHLISGGDNALKNSVQAS-VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 242
++A + V+ V GCG G + A G++GLG G +S S + A
Sbjct: 162 -------ESATVDGVRIDKVAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA-- 209
Query: 243 IRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSF--LASNGKYIT-YIIGVETCCIG 294
N F+ C S + FGD+ +T + + SN K T Y + +E +G
Sbjct: 210 YGNKFAYCLVNYLDPTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVG 269
Query: 295 SSCLKQTSFK----------AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYP 343
L + +I DSG++ T+ Y I A FD V+ S +G
Sbjct: 270 GKSLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQG-- 327
Query: 344 WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDIG- 401
C + + P PS + F F P Y V CLA+ + +G
Sbjct: 328 LDLCVELTGVDQPSFPSFTIEFDDGAVF---QPEAENYFVDVAPNVRCLAMAGLASPLGG 384
Query: 402 --TIGQNFMTGYRVVFDRENLKLGWSHSNC 429
TIG + V +DRE +G++ + C
Sbjct: 385 FNTIGNLLQQNFFVQYDREENLIGFAPAKC 414
>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
Length = 642
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 89/416 (21%), Positives = 173/416 (41%), Gaps = 61/416 (14%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSAS 136
G + ++ ++ S+ D G L +PC C C + ++ S S
Sbjct: 93 GTHYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGCGQHTDPLFDV----------SKS 142
Query: 137 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 196
+T+K+L+C H SC++ +Q Y Y E + ++V++++ + GG ++ +
Sbjct: 143 TTAKYLAC-HDF----DSCRSCEQDRCYISQSYMEGSMWEAVMVDELVWV--GGFSSPAD 195
Query: 197 SVQASVI-------IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFS 248
++ + +GC K++G ++ +G++GLG +V S + AG + +N F+
Sbjct: 196 EMEGVLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQNLFT 254
Query: 249 MCFDKDDSGRIFFGDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCLK----- 299
+CF D G + FG + S T L+ Y Y + V+ + L
Sbjct: 255 LCF-AGDGGELVFGGVDYSHHTSDVGYTPLLSDKSAY--YPVHVKDILLNGVSLGIDTGT 311
Query: 300 -QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
+ IVDSG++ TF + + F + + + K +S+ L L
Sbjct: 312 INSGRGVIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYS-------ESRMKLTSEELAAL 364
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVT------GFCLAIQPVDGDIGTIGQNFMTGYR 412
P + ++ ++ + +Q +T + + G +G + M G+
Sbjct: 365 PVISIILSGMKGDGTDDVQLDVPASQYLTPADDGKSYYGNFHFSERSGGVLGASAMVGFD 424
Query: 413 VVFDRENLKLGWSHSNCQD--LNDGTKSPLT------PGPGTPSNPLPANQEQSSP 460
V+FD EN ++G++ S+C N T +P+ P P TP + EQ +P
Sbjct: 425 VIFDVENKRVGFAESDCGRSYSNATTAAPIASDSTNQPAPATPVSVDSNATEQPAP 480
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 105/445 (23%), Positives = 180/445 (40%), Gaps = 60/445 (13%)
Query: 12 VFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYYQVLL 66
+F+L + G FS ++IHR S ++ + NA ++ Q +
Sbjct: 19 IFYLEAFNGG-----FSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFV 73
Query: 67 SSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN--DFGCDLLWIPCD-CVRCAPLSASYYNS 123
S + + + + ++ S G+ ++ + D G D++W+ C C +C + ++S
Sbjct: 74 SPNSPETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDS 133
Query: 124 LDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
S S T K L C C GT C + K C Y++ Y + S L VE
Sbjct: 134 ----------SKSQTYKTLPCPSNTCQSVQGTFCSSRKH-CLYSIHYVDGSQSLGDLSVE 182
Query: 182 DILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
L G N + VQ +IGCG + G + G++GLG G +S+ + L+ +
Sbjct: 183 T---LTLGSTNG--SPVQFPGTVIGCGRYNAIGIEE--KNSGIVGLGRGPMSLITQLSPS 235
Query: 241 GLIRNSFSMCFD---KDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIG 294
FS C S ++ FG+ + + ST + NG + Y + +E +G
Sbjct: 236 --TGGKFSYCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNG-LVFYFLTLEAFSVG 292
Query: 295 SSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 348
+ ++ S I+DSG++ T LP VY + A + V CY
Sbjct: 293 RNRIEFGSPGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCY 352
Query: 349 KSSSQRL-PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVD--GDIGTIG- 404
K + +L +P + F + + FV VV C A QP + G +
Sbjct: 353 KVTPDKLDASVPVITAHFSGADVTLNAINTFVQVADDVV---CFAFQPTETGAVFGNLAQ 409
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNC 429
QN + GY D + + + H++C
Sbjct: 410 QNLLVGY----DLQMNTVSFKHTDC 430
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 147/357 (41%), Gaps = 62/357 (17%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G DL+W C C++C S ++ P S++ H+ C+ + C + S
Sbjct: 110 DTGSDLMWAQCLPCLKCYKQSRPIFD----------PLKSTSFSHVPCNSQNCKAIDDSH 159
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y+ Y + + L E I + G +++K+ +IGCG +
Sbjct: 160 CGAQGVCDYSYTYGDQTYTKGDLGFEKI----TIGSSSVKS------VIGCGHESG---G 206
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGDQ----GPAT 268
G+IGLG G++S+ S +++ I FS C +G+I FG GP
Sbjct: 207 GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGV 266
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--AIVDSGSSFTFLPKEVYETIAA 326
+ L S Y + +E IG+ ++ + I+DSG++ +FLPKE+Y+ + +
Sbjct: 267 VSTP--LISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKELYDGVVS 324
Query: 327 EFDRQVNDTITSFEGYPWKCCYK-----SSSQRLPKLPS-------VKLMFPQNNSFVVN 374
+ V G W C+ ++S +P + + V L+ P N V
Sbjct: 325 SLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLL-PVNTFQKVA 383
Query: 375 NPVFVIYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
N V CL + P + G IG + + + +D E +L + + C
Sbjct: 384 NNV-----------NCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 144/363 (39%), Gaps = 72/363 (19%)
Query: 37 EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKT-GPQFQMLFPSQGSKTMSL 95
E +K L ++ T+ P + ++ ++ V + K+ T G Q M+
Sbjct: 15 ERLKYLSTLADQKTTAVPIAPGQQVLKI--ANYVVRVKLGTPGQQMFMVL---------- 62
Query: 96 GNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LG 152
D D W+PC C C+ + + P+AS+T L CS C G
Sbjct: 63 --DTSNDAAWVPCSGCTGCSSTT-------------FLPNASTTLGSLDCSEAQCSQVRG 107
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
SC Y ++S + LV+D + L N V GC SG
Sbjct: 108 FSCPATGSSACLFNQSYGGDSSLAATLVQDAI--------TLANDVIPGFTFGCINAVSG 159
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDKDDS----GRIFFGDQG-P 266
G + P GL+GLG G IS L+++AG + + FS C S G + G G P
Sbjct: 160 G---SIPPQGLLGLGRGPIS---LISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQP 213
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFL 316
+ ++T L + + Y + + +G + T I+DSG+ T
Sbjct: 214 KSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRF 273
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNN 369
+ VY I EF +QVN I+S + C+ ++++ + P+V L F P N
Sbjct: 274 VQPVYFAIRDEFRKQVNGPISSLGAF--DTCFAATNEA--EAPAVTLHFEGLNLVLPMEN 329
Query: 370 SFV 372
S +
Sbjct: 330 SLI 332
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 147/348 (42%), Gaps = 42/348 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G D+ W+ C+ CA + Y ++ + P SS+ +SC C L
Sbjct: 15 DTGSDVTWL--QCLPCAGKNGCY----EQITPIFDPELSSSYNPVSCDSEQCQLLDEAGC 68
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
C Y ++Y + + + G L + L + N++ N + IGCG G +
Sbjct: 69 NVNSCIYKVEY-GDGSFTIGELATETLTFVHS--NSIPN-----ISIGCGHDNEGLF--- 117
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD--QGPATQQSTSFL 275
V DGLIGLG G IS+ S L + SFS C DS D P + S L
Sbjct: 118 VGADGLIGLGGGAISISSQLKAS-----SFSYCLVDIDSPSFSTLDFNTDPPSDSLISPL 172
Query: 276 ASNGKYITY----IIGVETCCIGSSCLKQTSFK----------AIVDSGSSFTFLPKEVY 321
N ++ ++ +IG+ +G L +S + IVDSG++ T LP +VY
Sbjct: 173 VKNDRFPSFRYVKVIGMS---VGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVY 229
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
E + F + + E P+ CY SSQ ++P++ + P NS + +I
Sbjct: 230 EVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQ 289
Query: 382 GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T FCLA + IG G RV +D N +G+S + C
Sbjct: 290 VDSAGT-FCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 85/369 (23%), Positives = 153/369 (41%), Gaps = 63/369 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--- 153
D G DL W+ C C+ C S ++ P+AS + ++++C C L +
Sbjct: 167 DTGSDLNWLQCAPCLDCFEQSGPIFD----------PAASISYRNVTCGDDRCRLVSPPA 216
Query: 154 -----SCQNPK-QPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGC 206
C+ P+ PCPY Y ++ ++ L +E ++L G + V GC
Sbjct: 217 ESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDG-----VAFGC 271
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGD 263
G + G + L+GLG G +S S L + ++FS C + S +I FG
Sbjct: 272 GHRNRGLFHGAAG---LLGLGRGPLSFASQL-RGVYGGHAFSYCLVEHGSAAGSKIIFGH 327
Query: 264 QGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-----AIVDSGSSFT 314
T+F + Y + +++ +G + +S I+DSG++ +
Sbjct: 328 DDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSGTTLS 387
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLM--------F 365
+ P+ Y+ I F +++ + G+P CY S ++P + L+ F
Sbjct: 388 YFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEF 447
Query: 366 PQNNSFVVNNPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
P N F+ P ++ CLA+ P G + IG + V++D E+ +LG
Sbjct: 448 PAENYFIRLEPEGIM---------CLAVLGTPRSG-MSIIGNYQQQNFHVLYDLEHNRLG 497
Query: 424 WSHSNCQDL 432
++ C D+
Sbjct: 498 FAPRRCADV 506
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 89/385 (23%), Positives = 142/385 (36%), Gaps = 62/385 (16%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
+K L D G L W+ CD C+ C + +Y L + + C+ +
Sbjct: 48 AKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFYPRLIGSFVPHGLYKPELKYAVKCTEQ 107
Query: 148 LC-DLGTSCQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQA 200
C DL + P K C Y + Y SS G+L+ D L S G N
Sbjct: 108 RCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTNP------T 159
Query: 201 SVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGR 258
S+ GCG Q + P +G++GLG G++++ S L G+I ++ C G
Sbjct: 160 SIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGF 219
Query: 259 IFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+FFGD + P + + S + K+ + G S + + I DSG+++T+
Sbjct: 220 LFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYFA 279
Query: 318 KEVYE-----------------TIAAEFDRQV------NDTITSFEGYPWKCCYKSSSQR 354
+ Y T E DR + D I + + K C++S S +
Sbjct: 280 LQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEV--KKCFRSLSLK 337
Query: 355 LPK-LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI------QPVDGDIGTIGQNF 407
L P + +++ V CL I P IG
Sbjct: 338 FADGDKKATLEIPPEHYLIISQEGHV----------CLGILDGSKEHPSLAGTNLIGGIT 387
Query: 408 MTGYRVVFDRENLKLGWSHSNCQDL 432
M V++D E LGW + C +
Sbjct: 388 MLDQMVIYDSERSLLGWVNYQCDRI 412
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 157/398 (39%), Gaps = 87/398 (21%)
Query: 90 SKTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
S+T+S D G L+W PC C RC S+ N + + P SS++K + C
Sbjct: 100 SQTLSFVMDTGSSLVWFPCTSRYVCTRC-----SFPNIDPAKIPTFIPKLSSSAKIVGCL 154
Query: 146 HRLC------DLGTSC-------QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
+ C ++ T C N + CP Y T+ LL+E ++
Sbjct: 155 NPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLV-------- 206
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
+ ++GC + L P G+ G G G S+P + GL + S+ +
Sbjct: 207 -FAERTEPDFVVGCSI------LSSRQPSGIAGFGRGPSSLPK---QMGLKKFSYCLLSH 256
Query: 253 K-DDSGR-----IFFG----DQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSC 297
+ DDS + ++ G D T F ++SN + Y + + +G
Sbjct: 257 RFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKR 316
Query: 298 LKQT-SFKA---------IVDSGSSFTFLPKEVYETIAAEFDRQVND-----TITSFEGY 342
+K SF IVDSGS+FTF+ K V+E +A EFDRQ+ + + + G
Sbjct: 317 VKXPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGL 376
Query: 343 PWKCCYKSSSQRLPKLPSV--------KLMFPQNNSFVVNNPVFVIYGTQV---VTGFCL 391
K C+ S LPS+ K+ P N F + + V+ T V G L
Sbjct: 377 --KPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTL 434
Query: 392 AIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ P QNF T Y D EN + G+ C
Sbjct: 435 SSGPSIILGNYQSQNFYTEY----DLENERFGFRRQRC 468
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 118/285 (41%), Gaps = 38/285 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D DL W+PC C C +D + PS SST +C C + G
Sbjct: 115 DITGDLTWLPCKTCQDCT-----------KDGFTFFPSESSTYTSAACESYQCQITNGAV 163
Query: 155 CQNPKQPCPYTMDYYTENTSS---SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
CQ + C Y + SS GL+ D + S AL S + I CG
Sbjct: 164 CQT--KMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQAL--SYPNTNFI-CGTFID 218
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPAT 268
+ G G++GLG G S+ S + LI +FS C + S +I FG +G +
Sbjct: 219 NWHYIGA---GIVGLGRGLFSMTSQMKH--LINGTFSQCLVPYSSKQSSKINFGLKGVVS 273
Query: 269 QQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-----IVDSGSSFTFLPKEVY 321
+ ++ +A +G+ Y + +E +G + + + A +D ++FT LP + Y
Sbjct: 274 GEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSNIYIDWRTTFTSLPHDFY 333
Query: 322 ETIAAEFDRQVNDTITSFEG-YPWKCCYKSSSQRLPKLPSVKLMF 365
E + AE + +N T ++ CYKS S P + + F
Sbjct: 334 ENVEAEVRKAINLTPINYNNERKLSLCYKSESDHDFDAPPITMHF 378
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 118/294 (40%), Gaps = 51/294 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D G DL+W +C P A + D+ L + PS SST SC LC SC
Sbjct: 100 DTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASC 150
Query: 156 QNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+PK Q C YT Y + + ++G L D + G + V GCG+ +
Sbjct: 151 GSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFVGAGASV------PGVAFGCGLFNN 203
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGRIF 260
G + G+ G G G +S+PS L K G +FS CF D ++
Sbjct: 204 GVFKSN--ETGIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGLKPSTVLLDLPADLY 256
Query: 261 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFKAIVDSGS 311
+G QST + + Y + ++ +GS+ LK + I+DSG+
Sbjct: 257 KSGRGAV--QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGT 314
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
+ T LP VY + F QV + S C + + P +P + L F
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF 368
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 102/399 (25%), Positives = 158/399 (39%), Gaps = 87/399 (21%)
Query: 90 SKTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
S+T+S D G L+W PC C RC S+ N + + P SS++K + C
Sbjct: 100 SQTLSFVMDTGSSLVWFPCTSRYVCTRC-----SFPNIDPAKIPTFIPKLSSSAKIVGCL 154
Query: 146 HRLC------DLGTSC-------QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
+ C ++ T C N + CP Y T+ LL+E ++
Sbjct: 155 NPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLV-------- 206
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
+ ++GC + L P G+ G G G S+P + GL + S+ +
Sbjct: 207 -FAERTEPDFVVGCSI------LSSRQPSGIAGFGRGPSSLPK---QMGLKKFSYCLLSH 256
Query: 253 K-DDSGR-----IFFG----DQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSC 297
+ DDS + ++ G D T F ++SN + Y + + +G
Sbjct: 257 RFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKR 316
Query: 298 LK-QTSFKA---------IVDSGSSFTFLPKEVYETIAAEFDRQVND-----TITSFEGY 342
+K SF IVDSGS+FTF+ K V+E +A EFDRQ+ + + + G
Sbjct: 317 VKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSG- 375
Query: 343 PWKCCYKSSSQRLPKLPSV--------KLMFPQNNSFVVNNPVFVIYGTQV---VTGFCL 391
K C+ S LPS+ K+ P N F + + V+ T V G L
Sbjct: 376 -LKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTL 434
Query: 392 AIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ P QNF T Y D EN + G+ C+
Sbjct: 435 SSGPSIILGNYQSQNFYTEY----DLENERFGFRRQRCK 469
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 144/352 (40%), Gaps = 44/352 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLCD-LGTSC 155
D G DL W CV C N + N + P S++ +++SC +LC L T
Sbjct: 43 DTGSDLTWT--SCVPC--------NKCYKQRNPIFDPQKSTSYRNISCDSKLCHKLDTGV 92
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALKNSVQASVIIGCGMKQSGG 213
+P++ C YT Y + + G+L ++ + L S G LK ++ GCG +GG
Sbjct: 93 CSPQKHCNYTYAYASAAITQ-GVLAQETITLSSTKGESVPLKG-----IVFGCGHNNTGG 146
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPAT 268
+ D G+IGLG G +S S + + FS C D S ++ G +
Sbjct: 147 FND--REMGIIGLGGGPVSFISQIGSS-FGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVS 203
Query: 269 QQ---STSFLASNGK--YITYIIGVETCCI-----GSSCLKQTSFKAIVDSGSSFTFLPK 318
+ ST +A K Y ++G+ GSS +DSG+ T LP
Sbjct: 204 GKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGNVFLDSGTPPTILPT 263
Query: 319 EVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
++Y+ + A+ +V +T+ + CY++ + + P + F + ++
Sbjct: 264 QLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNL--RGPVLTAHFEGGDVKLLPTQT 321
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
FV V FCL D G G + Y + FD + + + +C
Sbjct: 322 FVSPKDGV---FCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDC 370
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 111/423 (26%), Positives = 173/423 (40%), Gaps = 57/423 (13%)
Query: 32 IH-RFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS 90
IH R ++ V L S++R+ + + F+ + V + +G F +
Sbjct: 15 IHGRINQTVNGLTRSRSRDRQTKVPSQDFQ------APVVSGLSLGSGEYFIRISVGTPP 68
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ M L D G D+LW+ C CV C S + ++ P SST L CS R C
Sbjct: 69 RRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFD----------PYKSSTYSTLGCSTRQC 118
Query: 150 ---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIG 205
D+GT CQ K C Y +DY + ++ +D+ L+ SG + N + +G
Sbjct: 119 LNLDIGT-CQANK--CLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIP----LG 171
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIF 260
CG G + V GL+GLG G +S P+ + R FS C D + +
Sbjct: 172 CGHDNEGYF---VGAAGLLGLGKGPLSFPNQVDPQNGGR--FSYCLTDRETDSTEGSSLV 226
Query: 261 FGDQG--PA----TQQSTSFLASNGKYITYI---IGVETCCIGSSCLKQTSFK---AIVD 308
FG+ PA T Q ++ Y+ +G I +S + S I+D
Sbjct: 227 FGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIID 286
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
SG+S T L Y ++ F +D + + CY S +P+V L F
Sbjct: 287 SGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGG 346
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSH 426
+ ++I T FCLA G IG I Q G+RV++D + ++G+
Sbjct: 347 TDLKLPASNYLIPVDNSNT-FCLAFAGTTGPSIIGNIQQQ---GFRVIYDNLHNQVGFVP 402
Query: 427 SNC 429
S C
Sbjct: 403 SQC 405
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 91/351 (25%), Positives = 140/351 (39%), Gaps = 39/351 (11%)
Query: 92 TMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-- 149
T ++ D G D+ W+ C+ P A D P+ SST + +SC+ C
Sbjct: 139 TQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFD-------PAKSSTYRAVSCAAAECAQ 191
Query: 150 --DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
G C C Y + Y + ++++G D L L SG +A+K GC
Sbjct: 192 LEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL-SGASDAVKG-----FQFGCS 244
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPA 267
+SG + D DGL+GLG G S+ S A A NSFS C F G
Sbjct: 245 HLESG-FSDQT--DGLMGLGGGAQSLVSQTAAA--YGNSFSYCLPPTSGSSGFLTLGGGG 299
Query: 268 TQQ---STSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA--IVDSGSSFTFLPKEV 320
+T L S Y ++ +G L + F A +VDSG+ T LP
Sbjct: 300 GASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVFAAGSVVDSGTIITRLPPTA 359
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y +++ F + ++ C+ + Q +P+V L+F + + +P ++
Sbjct: 360 YSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVF-SGGAAIDLDPNGIM 418
Query: 381 YGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
YG CLA DG G IG + V++D + LG+ C
Sbjct: 419 YGN------CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 152/365 (41%), Gaps = 46/365 (12%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
Q ++ L D G DL W+ CD C C+ + L R N++ P L +
Sbjct: 79 QPARPYFLDVDTGSDLTWLQCDAPCTHCSETP----HPLHRPSNDFVPCRDPLCASLQPT 134
Query: 146 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
+C++P Q C Y ++ Y + S+ G+L+ D+ L S LK + +G
Sbjct: 135 EDY-----NCEHPDQ-CDYEIN-YADQYSTYGVLLNDVYLLNSSNGVQLK----VRMALG 183
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG 265
CG Q DGL+GLG G+ S+ S L GL+RN C G IFFG+
Sbjct: 184 CGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCLSSQGGGYIFFGNAY 243
Query: 266 PATQQSTSFLAS-NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI 324
+ + + + ++S + K+ Y G G S A+ D+GSS+T+ Y+ +
Sbjct: 244 DSARVTWTPISSVDSKH--YSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQAL 301
Query: 325 AAEFDRQVN--------DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF----------- 365
+ +++++ D T + K + S + V L F
Sbjct: 302 LSWLNKELSGKPLKVAPDDQTLSLCWHGKRPFTSLREVRKYFKPVALSFTNGGRVKAQFE 361
Query: 366 -PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
P +++N V G ++ GF + ++ ++ +G M +VF+ E +GW
Sbjct: 362 IPPEAYLIISNLGNVCLG--ILNGFEVGLE----ELNLVGDISMQDKVMVFENEKQLIGW 415
Query: 425 SHSNC 429
++C
Sbjct: 416 GPADC 420
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 104/239 (43%), Gaps = 32/239 (13%)
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRI 259
I GCG + + G GV+ GL+GLG ++S+ S +G+ FS C ++ SG +
Sbjct: 108 IFGCG-RNNKGLFGGVS--GLMGLGRSDLSLIS--QTSGIFGGVFSYCLPSTERKGSGSL 162
Query: 260 FFGDQGPATQQST-----SFLASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGS 311
G + S+ + + Y Y I + IG L+ S + +VDSG+
Sbjct: 163 ILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGT 222
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLM 364
T LP +Y+ + AEF +Q F G+P C+ S+ + +P++K+
Sbjct: 223 VITRLPPTIYKALKAEFLKQ-------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMH 275
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVD--GDIGTIGQNFMTGYRVVFDRENLK 421
F N V+ + + CLA+ ++ ++ +G RV++D + K
Sbjct: 276 FEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETK 334
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 143/363 (39%), Gaps = 72/363 (19%)
Query: 37 EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKT-GPQFQMLFPSQGSKTMSL 95
E +K L ++ T+ P + ++ ++ V + K+ T G Q M+
Sbjct: 15 ERLKYLSTLADQKTTAVPIAPGQQVLKI--ANYVVRVKLGTPGQQMFMVL---------- 62
Query: 96 GNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LG 152
D D W+PC C C+ + + P+AS+T L CS C G
Sbjct: 63 --DTSNDAAWVPCSGCTGCSSTT-------------FLPNASTTLGSLDCSEAQCSQVRG 107
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
SC Y ++S + LV+D + L N V GC SG
Sbjct: 108 FSCPATGSSACLFNQSYGGDSSLAATLVQDAI--------TLANDVIPGFTFGCINAVSG 159
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDKDDS----GRIFFGDQG-P 266
G + P GL+GLG G IS L+++AG + + FS C S G + G G P
Sbjct: 160 G---SIPPQGLLGLGRGPIS---LISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQP 213
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFL 316
+ ++T L + + Y + + +G + T I+DSG+ T
Sbjct: 214 KSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRF 273
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNN 369
+ VY I EF +QVN I+S + C+ +++ + P+V L F P N
Sbjct: 274 VQPVYFAIRDEFRKQVNGPISSLGAF--DTCFAETNEA--EAPAVTLHFEGLNLVLPMEN 329
Query: 370 SFV 372
S +
Sbjct: 330 SLI 332
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 143/366 (39%), Gaps = 55/366 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ S+ D G DL W V+C+P Y ++ + + P+ S++ L+C LC+
Sbjct: 14 RVFSVIVDTGSDLTW-----VQCSPCGTCY----SQNDSLFIPNTSTSFTKLACGTELCN 64
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
+ C Y Y + + S+G V D + + G N K V + GCG
Sbjct: 65 GLPYPMCNQTTCVYWYSY-GDGSLSTGDFVYDTITM--DGINGQKQQV-PNFAFGCGHDN 120
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQG 265
G + DG++GLG G +S PS L + FS C + + FGD
Sbjct: 121 EGSF---AGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSPLLFGDAA 175
Query: 266 PATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--------AIVDSGSS 312
T + L +N K T Y + + +G L T+F I DSG++
Sbjct: 176 VPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTT 235
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK---------CCYKSSSQRLPKLPSVKL 363
T L EV++ + A + D YP K C + +LP +PS+
Sbjct: 236 VTQLAGEVHQEVLAAMNASTMD-------YPRKSDDSSGLDLCLGGFAEGQLPTVPSMTF 288
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + + + F+ + F + P D+ IG ++V +D K+G
Sbjct: 289 HFEGGDMELPPSNYFIFLESSQSYCFSMVSSP---DVTIIGSIQQQNFQVYYDTVGRKIG 345
Query: 424 WSHSNC 429
+ +C
Sbjct: 346 FVPKSC 351
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 152/368 (41%), Gaps = 45/368 (12%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP--S 134
TG F + + T + D G D++W P VR P L R + + S +
Sbjct: 119 TGEYFAQVGVGTPATTALMVLDTGSDVVWAP---VRALP-------PLLRAVRQGSSTGA 168
Query: 135 ASSTSKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
A + + +C +C C + C Y + Y + + ++G + L G
Sbjct: 169 APAPTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGA-- 225
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 251
VQ V IGCG G + +A GL+GLG G +S PS +A++ SFS C
Sbjct: 226 ----RVQ-RVAIGCGHDNEGLF---IAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLV 275
Query: 252 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------- 304
D+ S R + T + +F Y +++G + Q+ +
Sbjct: 276 DRTSSRRARPSRRWGGTPRMATF------YYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 329
Query: 305 --AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSV 361
I+DSG+S T L + VYE + F S G+ + CY S +R+ K+P+V
Sbjct: 330 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTV 389
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
+ S + ++I T FC A+ DG + IG G+RVVFD + +
Sbjct: 390 SMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQR 448
Query: 422 LGWSHSNC 429
+G+ +C
Sbjct: 449 VGFVPKSC 456
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 159/392 (40%), Gaps = 65/392 (16%)
Query: 67 SSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLD 125
SS + +G F L ++ + + D G D++WI C C++C Y+ D
Sbjct: 132 SSVISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKC-------YSQTD 184
Query: 126 RDLNEYSPSASSTSKHLSCSHRLCDLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
+ P+ S + ++ C LC C KQ C Y + Y + + + G +
Sbjct: 185 ---PVFDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQICLYQVSY-GDGSFTVGEFSTET 240
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 243
L + + V++GCG G + V GL+GLG G +S PS + +
Sbjct: 241 L--------TFRGTRVGRVVLGCGHDNEGLF---VGAAGLLGLGRGRLSFPSQIGRR--F 287
Query: 244 RNSFSMCF-DKDDSGR---IFFGDQGPATQQSTSF--LASNGK----YITYIIGVETCCI 293
+ FS C D+ S R I FGD A ++T F L SN K Y ++G+
Sbjct: 288 NSKFSYCLGDRSASSRPSSIVFGDS--AISRTTRFTPLLSNPKLDTFYYVELLGISVGGT 345
Query: 294 GSSCLKQTSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK 345
S + + FK I+DSG+S T L + Y + F ++ + E +
Sbjct: 346 RVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFD 405
Query: 346 CCYKSSSQRLPKLPSVKLMF-------PQNNSFV-VNNPVFVIYGTQVVTGFCLAIQPVD 397
C+ S + K+P+V L F P +N + V+N FC A
Sbjct: 406 TCFDLSGKTEVKVPTVVLHFRGADVPLPASNYLIPVDNS----------GSFCFAFAGTA 455
Query: 398 GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ IG G+RVV+D ++G++ C
Sbjct: 456 SGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 152/377 (40%), Gaps = 70/377 (18%)
Query: 89 GSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G+K +++ D G DL W+ C+ C P S+ Y RD + P+AS T + C
Sbjct: 190 GAKNLTVIVDTGSDLTWVQCEPC----PGSSCYAQ---RD-PLFDPAASPTFAAVPCGSP 241
Query: 148 LC------------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
C S N +Q C Y + Y + + S G+L +D L L G L
Sbjct: 242 ACAASLKDATGAPGSCARSAGNSEQRCYYALSY-GDGSFSRGVLAQDTLGL--GTTTKLD 298
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DK 253
+ GCG+ G G A GL+GLG ++S+ S A FS C
Sbjct: 299 G-----FVFGCGLSNRG-LFGGTA--GLMGLGRTDLSLVS--QTAARFGGVFSYCLPATT 348
Query: 254 DDSGRIFFGDQGPAT----QQSTSFLASNGKYITYIIGV-ETCCIGSSCLKQTSFKA--- 305
+G + G GP++ T +A + Y I + G + L F A
Sbjct: 349 TSTGSLSLG-PGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNV 407
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKL 358
+VDSG+ T L VY+ + AEF R+ FE YP CY + + +
Sbjct: 408 LVDSGTVITRLAPSVYKAVRAEFARR-------FE-YPAAPGFSILDACYDLTGRDEVNV 459
Query: 359 PSVKLMFPQNNSFVVNNP--VFVIY--GTQVVTGFCLAIQ--PVDGDIGTIGQNFMTGYR 412
P + L V+ +FV+ G+QV CLA+ P + IG R
Sbjct: 460 PLLTLTLEGGAQVTVDAAGMLFVVRKDGSQV----CLAMASLPYEDQTPIIGNYQQRNKR 515
Query: 413 VVFDRENLKLGWSHSNC 429
VV+D +LG++ +C
Sbjct: 516 VVYDTVGSRLGFADEDC 532
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 153/382 (40%), Gaps = 56/382 (14%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
TG F ++ + M L D G D+ W+ C C C + +N PS+
Sbjct: 13 TGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFN----------PSS 62
Query: 136 SSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
SS+ K L CS LC D+ C + K C Y D Y + + + G LV D + L D+
Sbjct: 63 SSSFKVLDCSSSLCLNLDV-MGCLSNK--CLYQAD-YGDGSFTMGELVTDNVVL----DD 114
Query: 193 AL--KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
A V ++ +GCG G + G A G++GLG G +S P+ L + RN FS C
Sbjct: 115 AFGPGQVVLTNIPLGCGHDNEGTF--GTAA-GILGLGRGPLSFPNNLDAS--TRNIFSYC 169
Query: 251 F-----DKDDSGRIFFGDQG-PATQQ-STSFLAS--NGKYIT-YIIGVETCCIGSSCLKQ 300
D + + FGD P T S F+ N + T Y + + +G + L
Sbjct: 170 LPDRESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTN 229
Query: 301 ---TSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 349
+ F+ I DSG++ T L Y + F ++ + + CY
Sbjct: 230 IPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYD 289
Query: 350 SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNF 407
+ +P+V F Q + + P I FC A G IG + Q
Sbjct: 290 FTGMNSISVPTVTFHF-QGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGPSVIGNVQQQ- 347
Query: 408 MTGYRVVFDRENLKLGWSHSNC 429
+RV++D + ++G C
Sbjct: 348 --SFRVIYDNVHKQIGLLPDQC 367
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 146/348 (41%), Gaps = 46/348 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTS 154
D G D+ WI +CAP S Y S + P +S++ + C C DL +
Sbjct: 167 DTGSDVSWI-----QCAPCSECYQQSDPI----FDPISSNSYSPIRCDEPQCKSLDL-SE 216
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + G + + L G A++N V IGCG G +
Sbjct: 217 CRNGT--CLYEVSY-GDGSYTVGEFATETVTL---GSAAVEN-----VAIGCGHNNEGLF 265
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQS 271
V GL+GLG G++S P A + SFS C D D + F P +
Sbjct: 266 ---VGAAGLLGLGGGKLSFP-----AQVNATSFSYCLVNRDSDAVSTLEFNSPLPRNAAT 317
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVY 321
+ + Y +G++ +G L ++SF+ I+DSG++ T L EVY
Sbjct: 318 APLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVY 377
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ + F + + + CY SS+ ++P+V FP+ + ++I
Sbjct: 378 DALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIP 437
Query: 382 GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V T FC A P + IG G RV FD N +G+S +C
Sbjct: 438 VDSVGT-FCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/351 (25%), Positives = 147/351 (41%), Gaps = 53/351 (15%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQ---KMKTGPQFQM 83
FS KLIH+ S S + ++ K +YQV S VQK ++ + +
Sbjct: 30 FSFKLIHKNSPN------SPFYKSNNFHKNKLRSFYQVPKKSFVQKSPYTRVTSNNGDYL 83
Query: 84 LFPSQGSKTMSLGN--DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 141
+ + GS + + D G DL+W +C P Y + P S T
Sbjct: 84 MKLTLGSPPVDIYGLVDTGSDLVW-----AQCTPCGGCYRQKSPM----FEPLRSKTYSP 134
Query: 142 LSCSHRLCDL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 200
+ C C G SC +P++ C Y+ Y + + L E I + GD V
Sbjct: 135 IPCESEQCSFFGYSC-SPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPV----VVG 189
Query: 201 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS--FSMCF-----D 252
+I GCG SG + + +G P SL+++ G + S FS C D
Sbjct: 190 DIIFGCGHSNSGTFNENDM------GIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTD 243
Query: 253 KDDSGRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI---- 306
SG I FG++ + + T+ LAS +Y++ +E +G + ++ S + +
Sbjct: 244 AHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGN 303
Query: 307 --VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYKSSS 352
+DSG+ T++P+E YE + E +V ++ E P + CY+S +
Sbjct: 304 IMIDSGTPATYIPQEFYERLVEEL--KVQSSLLPIEDDPDLGTQLCYRSET 352
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 143/358 (39%), Gaps = 43/358 (12%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ S+ D G L W+ C CV Y + D + PSAS T K LSC+
Sbjct: 23 ARYYSMIVDTGSSLSWLQCKPCV--------VYCHVQAD-PLFDPSASKTYKSLSCTSSQ 73
Query: 149 CDLGTS-------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
C C+ C YT Y +++ S G L +D+L L +
Sbjct: 74 CSSLVDATLNNPLCETSSNVCVYTAS-YGDSSYSMGYLSQDLLTLA-------PSQTLPG 125
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIF 260
+ GCG G L G A G++GLG ++S+ ++ +FS C + G +
Sbjct: 126 FVYGCGQDSEG--LFGRAA-GILGLGRNKLSMLGQVSSK--FGYAFSYCLPTRGGGGFLS 180
Query: 261 FGDQGPA--TQQSTSFLASNGKYITYIIGVETCCIGSSCL----KQTSFKAIVDSGSSFT 314
G A + T G Y + + +G L Q I+DSG+ T
Sbjct: 181 IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVIT 240
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
LP VY F + ++ G+ C+K + + + +P V+L+F Q + +
Sbjct: 241 RLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIF-QGGADLN 299
Query: 374 NNPVFVIYGTQVVTGF-CLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
PV V+ QV G CLA +G + IG + ++V D ++G++ C
Sbjct: 300 LRPVNVL--LQVDEGLTCLAFAGNNG-VAIIGNHQQQTFKVAHDISTARIGFATGGCN 354
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 91/351 (25%), Positives = 141/351 (40%), Gaps = 39/351 (11%)
Query: 92 TMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-- 149
T ++ D G D+ W+ C+ P A D P+ SST + +SC+ C
Sbjct: 139 TQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFD-------PAKSSTYRAVSCAAAECAQ 191
Query: 150 --DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
G C C Y + Y + ++++G D L L SG +A+K GC
Sbjct: 192 LEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL-SGASDAVKG-----FQFGCS 244
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQ 264
+SG + D DGL+GLG G S+ S A A NSFS C G
Sbjct: 245 HVESG-FSDQT--DGLMGLGGGAQSLVSQTAAA--YGNSFSYCLPPTSGSSGFLTLGGGG 299
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA--IVDSGSSFTFLPKEV 320
G + +T L S Y ++ +G L + F A +VDSG+ T LP
Sbjct: 300 GVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVFAAGSVVDSGTIITRLPPTA 359
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y +++ F + ++ C+ + Q +P+V L+F + + +P ++
Sbjct: 360 YSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVF-SGGAAIDLDPNGIM 418
Query: 381 YGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
YG CLA DG G IG + V++D + LG+ C
Sbjct: 419 YGN------CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 148/372 (39%), Gaps = 68/372 (18%)
Query: 98 DFGCDLLWIPCD-----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-- 150
D G W+ C C C + Y + L + C+ LCD
Sbjct: 57 DTGSSFTWLECHAKDGPCKTCNKVPHPLYRLTRKKL-------------VPCADPLCDAL 103
Query: 151 ---LGTS--CQN-PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
LGT+ C + K C Y + Y + SS G+L+ D L +GG ++
Sbjct: 104 HKDLGTTKKCTDVRKNQCDYKVKY-QDGLSSLGVLLLDKFSLPTGG--------ARNIAF 154
Query: 205 GCGMKQSGGYL----DGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRI 259
GCG Q G + V DG++GLG G + + S L +G + +N C G +
Sbjct: 155 GCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNVIGHCLSSKGGGYL 214
Query: 260 FFGDQG-PATQQSTSFLA--SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 316
F G++ P++ + +A + G+ Y G T + S+ + KAI DSGS++T+L
Sbjct: 215 FIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKPLKAIFDSGSTYTYL 274
Query: 317 PKEVYETIAAEFD--------RQVNDTITS--FEG-YPWKCCYKSSSQ-----RLPKLPS 360
P+ ++ + + +QV+D ++G P+K + + + L
Sbjct: 275 PENLHAQLVSALKASLSKSSLKQVSDPALPLCWKGPKPFKTVHDTPKEFKSLVTLKFDLG 334
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
V ++ P N ++ +G + G D IG M V++D E
Sbjct: 335 VTMIIPPENYLIITGHGNACFGILDMPGL---------DQYIIGDITMQEQLVIYDNEKG 385
Query: 421 KLGWSHSNCQDL 432
+L W S C +
Sbjct: 386 RLAWMPSPCDKI 397
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 135/351 (38%), Gaps = 40/351 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--- 154
D G L W+ C CA + + L Y PS S T K LSC+ C +
Sbjct: 143 DTGSSLSWLQCQ--PCAVYCHAQADPL------YDPSVSKTYKKLSCASVECSRLKAATL 194
Query: 155 ----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C+ C YT Y + + S G L +D+L L S + GCG
Sbjct: 195 NDPLCETDSNACLYTAS-YGDTSFSIGYLSQDLLTLTS-------SQTLPQFTYGCGQDN 246
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCFDKDDSGRIFFGDQ----- 264
G L G A G+IGL ++S+ + L+ K G ++FS C +SG G
Sbjct: 247 QG--LFGRAA-GIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSSGGGFLSIGSI 300
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEV 320
P + + T L + Y + + + L + ++DSG+ T LP +
Sbjct: 301 SPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSM 360
Query: 321 YETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + F + ++ Y C+K S + + +P +K++F + P +
Sbjct: 361 YAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSIL 420
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
I + +T A I IG Y + +D ++G++ +C
Sbjct: 421 IEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSCH 471
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 127/337 (37%), Gaps = 42/337 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQ 156
D G DL+W+ C S + PS S+T LSC C L +
Sbjct: 118 DTGSDLVWVNCS-------SNGGGGGASDGAVVFHPSRSTTYSLLSCQSAACQALSQASC 170
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ C Y Y + + + G+L + + G V GC +G +
Sbjct: 171 DADSECQYQY-AYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRS 229
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQ-- 269
DGL+GLG G +S+ S L A I FS C + S + FG + +
Sbjct: 230 ----DGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSSSTLSFGARAVVSDPG 285
Query: 270 -QSTSFLASNGKYITYIIGVETCCI-GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAE 327
ST + S Y + +E+ + G S + IVDSG++ TFL + + AE
Sbjct: 286 AASTPLVPSEVDSY-YTVALESVAVAGQDVASANSSRIIVDSGTTLTFLDPALLRPLVAE 344
Query: 328 FDRQVNDTITSFEGYPWKCCY----KSSSQRLPKLPSVKLMFPQNNSFVV---NNPVFVI 380
+R++ + CY KS ++ +P V L F S + N +
Sbjct: 345 LERRIRLPRAQPPEQLLQLCYDVQGKSQAEDF-GIPDVTLRFGGGASVTLRPENTFSLLE 403
Query: 381 YGTQVVTGFCLAIQPVDGD-----IGTIG-QNFMTGY 411
GT CL + PV +G I QNF GY
Sbjct: 404 EGT-----LCLVLVPVSESQPVSILGNIAQQNFHVGY 435
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 142/361 (39%), Gaps = 52/361 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G DL+W C CV C S ++ PS+SST + CS LC DL TS
Sbjct: 118 DTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATVPCSSALCSDLPTST 167
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GY 214
C YT Y + +S+ G+L + L + V GCG G G+
Sbjct: 168 CTSASKCGYTYTY-GDASSTQGVLASETFTL------GKEKKKLPGVAFGCGDTNEGDGF 220
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQ- 269
G GL+GLG G +S L+++ GL + FS C D D + G A
Sbjct: 221 TQGA---GLVGLGRGPLS---LVSQLGL--DKFSYCLTSLDDGDGKSPLLLGGSAAAISE 272
Query: 270 -------QSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK--------AIVDSGSS 312
Q+T + + + Y + + +GS+ L ++F IVDSG+S
Sbjct: 273 SAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTS 332
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
T+L + Y + F Q+ C++ ++ + ++ KL+ +
Sbjct: 333 ITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGAD 392
Query: 373 VNNPVFVIYGTQVVTG-FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQD 431
++ P +G CL + P G + IG ++ V+D L ++ C
Sbjct: 393 LDLPAENYMVLDSASGALCLTVAPSRG-LSIIGNFQQQNFQFVYDVAGDTLSFAPVQCNK 451
Query: 432 L 432
L
Sbjct: 452 L 452
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 142/357 (39%), Gaps = 50/357 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 92 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 140
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 141 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 190
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 191 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFS 246
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 306
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 307 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 365
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + ++ VFV Q +CLA P + + IG T VV+D + +G
Sbjct: 366 GARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIG 421
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 152/377 (40%), Gaps = 60/377 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+T+ L D G DL+W PC C C+ +++ + N + P +SS+SK L C +
Sbjct: 101 QTLPLIMDTGSDLVWFPCTHRYVCRNCS------FSTSNPSSNIFIPKSSSSSKVLGCVN 154
Query: 147 RLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 204
C G+ Q+ + C T T+ + L+ + D+ Q +
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQ-------ICPPYLNFLRFWDH---RRSQFHRRM 204
Query: 205 GCGMKQS-----GGYLDGVAPDGLIG-LGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSG 257
C + QS G+ G P L LGL + S L + S S+ D + DSG
Sbjct: 205 LCPLHQSTRREISGF--GRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESDSG 262
Query: 258 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-----------AI 306
G Q+ + + Y +G+ +G +K +K I
Sbjct: 263 EKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVK-IPYKYLIPGADGDGGTI 321
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLM 364
+DSG++FT++ E++E +AAEF++QV + T EG + C+ S P P + L
Sbjct: 322 IDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLK 381
Query: 365 FP--QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG---------TIGQNFMTGYRV 413
F + N V + G VV CL I DG G +G + V
Sbjct: 382 FRGGAEMELPLANYVAFLGGDDVV---CLTIV-TDGAAGKEFSGGPAIILGNFQQQNFYV 437
Query: 414 VFDRENLKLGWSHSNCQ 430
+D N +LG+ +C+
Sbjct: 438 EYDLRNERLGFRQQSCK 454
>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
Length = 310
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/249 (23%), Positives = 96/249 (38%), Gaps = 11/249 (4%)
Query: 192 NALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
N +AS ++G Q G L A G++GL IS+PS LA G+I N F C
Sbjct: 4 NRYNGGRKASFVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFGHC 63
Query: 251 FDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-TSFKAIV 307
++ + G +F GD T G Y + G L + I
Sbjct: 64 ITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGIPVQVIS 123
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-- 365
G+S+T+LP+E+Y+ + + C+K+ + L F
Sbjct: 124 RCGTSYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGR 183
Query: 366 -----PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
P+ + V ++ + + V G + G +G + G VV+D E
Sbjct: 184 RWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERR 243
Query: 421 KLGWSHSNC 429
++GW++S C
Sbjct: 244 QIGWANSEC 252
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 148/365 (40%), Gaps = 59/365 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G D++W+ C C RC S ++ P SS+ + C LC D G
Sbjct: 147 DTGSDVVWVQCAPCRRCYEQSGPVFD----------PRRSSSYGAVGCGAALCRRLDSG- 195
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + C Y + Y + + ++G V + L G + A V +GCG G
Sbjct: 196 GCDLRRGACMYQV-AYGDGSVTAGDFVTETLTFAGG-------ARVARVALGCGHDNEGL 247
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR-----------IFF 261
+ VA GL+GLG G +S P+ +++ SFS C D+ SG + F
Sbjct: 248 F---VAAAGLLGLGRGGLSFPTQISR--RYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSF 302
Query: 262 GDQGPATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AI 306
G G S SF + N + Y ++G+ + ++ + I
Sbjct: 303 G-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVI 361
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTIT-SFEGYP-WKCCYKSSSQRLPKLPSVKLM 364
VDSG+S T L + Y + F + S G+ + CY +R+ K+P+V +
Sbjct: 362 VDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMH 421
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
F + ++I T FC A DG + IG G+RVVFD + ++G+
Sbjct: 422 FAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 480
Query: 425 SHSNC 429
+ C
Sbjct: 481 APKGC 485
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 147/361 (40%), Gaps = 54/361 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G D++W+ C C RC D+ + P SS+ + C+ LC D G
Sbjct: 158 DTGSDVVWLQCAPCRRC----------YDQSGPVFDPRRSSSYGAVDCAAPLCRRLDSG- 206
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C ++ C Y + Y + + ++G + L G + A V +GCG G
Sbjct: 207 GCDLRRRACLYQV-AYGDGSVTAGDFATETLTFAGG-------ARVARVALGCGHDNEGL 258
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFGDQ-------- 264
+ VA GL+GLG G +S P+ +++ SFS C D+ S +
Sbjct: 259 F---VAAAGLLGLGRGSLSFPTQISR--RYGKSFSYCLVDRTSSSSSGAASRSRSSTVTF 313
Query: 265 GPATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AIVDS 309
GP + + SF + N + Y ++G+ + ++ + IVDS
Sbjct: 314 GPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDS 373
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQN 368
G+S T L + Y + F S G+ + CY +++ K+P+V + F
Sbjct: 374 GTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTVSMHFAGG 433
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ ++I T FC A DG + IG G+RVVFD + ++G++
Sbjct: 434 AEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKG 492
Query: 429 C 429
C
Sbjct: 493 C 493
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/352 (25%), Positives = 141/352 (40%), Gaps = 56/352 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSA-SYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG--- 152
D G DL W V+C P +A S Y D + P+ SS+ + C C LG
Sbjct: 155 DTGSDLSW-----VQCKPCAAPSCYRQKD---PLFDPAQSSSYAAVPCGRSACAGLGIYA 206
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
++C + C Y + Y + ++++G+ D L L + N+ + GCG QSG
Sbjct: 207 SACSAAQ--CGYVVSY-GDGSNTTGVYSSDTLTLAA-------NATVQGFLFGCGHAQSG 256
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMCFDKDDSGRIFFGDQGPATQ-- 269
G G+ DGL+G G + PSL+ + AG FS C S + GP+
Sbjct: 257 GLFTGI--DGLLGFGREQ---PSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAP 311
Query: 270 --QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPKEVYET 323
+T L S Y++ + +G L ++F A +VD+G+ T LP Y
Sbjct: 312 GFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRLPPAAYAA 371
Query: 324 IAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
+ + F + S+ P CY + L SV L F + + +
Sbjct: 372 LRSAF----RSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM 427
Query: 380 IYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G CLA DG + +G + V D + +G+ S+C
Sbjct: 428 SFG-------CLAFASSGSDGSMAILGNVQQRSFEVRIDGSS--VGFRPSSC 470
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 148/364 (40%), Gaps = 58/364 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ + L D D+ WIPC CV C +A +SP+ S++ K++SCS
Sbjct: 125 AQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSCSAPQ 172
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C + + C + + Y + + +++ L +D + L + A GC
Sbjct: 173 CKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKAFT--------FGCVN 222
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQ 264
K +GG G P LGLG + + + +++FS C SG + G
Sbjct: 223 KVAGG---GTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPT 279
Query: 265 G-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSF 313
P + T L + + Y + + +G + T I DSG+ +
Sbjct: 280 SQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVY 339
Query: 314 TFLPKEVYETIAAEFDRQVNDT---ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNN 369
T L K VYE + EF ++V T +TS G+ CY K+P++ MF N
Sbjct: 340 TRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGF--DTCYSGQV----KVPTITFMFKGVNM 393
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAI----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ +N +++ T T CLA+ + V+ + I +RV+ D N +LG +
Sbjct: 394 TMPADN--LMLHSTAGSTS-CLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLA 450
Query: 426 HSNC 429
C
Sbjct: 451 RERC 454
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 149/370 (40%), Gaps = 60/370 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT- 153
D G +L W+ CAP R + P AS T + C C DL +
Sbjct: 84 DTGSELSWL-----LCAPGGGGGGGG--RSALSFRPRASLTFASVPCDSAQCRSRDLPSP 136
Query: 154 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+C + C ++ Y + +SS G L ++ + G ++A+ GC
Sbjct: 137 PACDGASKQCRVSLSY-ADGSSSDGALATEVFTVGQG------PPLRAA--FGCMATAFD 187
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFG--------- 262
DGVA GL+G+ G + S +++A R FS C D+DD+G + G
Sbjct: 188 TSPDGVATAGLLGMNRGAL---SFVSQASTRR--FSYCISDRDDAGVLLLGHSDLPFLPL 242
Query: 263 DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSFT 314
+ P Q + +A + + + +G + I +S L A +VDSG+ FT
Sbjct: 243 NYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFT 302
Query: 315 FLPKEVYETIAAEFDRQ-------VNDTITSFEGYPWKCCYKSSSQRLP--KLPSVKLMF 365
FL + Y + AEF RQ +ND +F+ + C++ R P +LP+V L+F
Sbjct: 303 FLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEA-FDTCFRVPQGRAPPARLPAVTLLF 361
Query: 366 PQNNSFVVNNPVFVIYGTQVVTG---FCLAIQPVDGDIGT---IGQNFMTGYRVVFDREN 419
V + + + G +CL D T IG + V +D E
Sbjct: 362 NGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLER 421
Query: 420 LKLGWSHSNC 429
++G + C
Sbjct: 422 GRVGLAPIRC 431
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 150/347 (43%), Gaps = 63/347 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G DL+W C+ C C ++ ++ P SST + +SCS C S
Sbjct: 104 DTGSDLIWTQCNPCEDCYQQTSPLFD----------PKESSTYRKVSCSSSQCRALEDAS 153
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN--ALKNSVQASVIIGCGMKQSG 212
C + C YT+ Y +N+ + G + D + + S G +L+N +IIGCG + +G
Sbjct: 154 CSTDENTCSYTITY-GDNSYTKGDVAVDTVTMGSSGRRPVSLRN-----MIIGCGHENTG 207
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPA 267
+ A G+IGLG G S+ S L K+ I FS C + + +I FG G
Sbjct: 208 TF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETGLTSKINFGTNGIV 263
Query: 268 TQQ---STSFLASN-GKYITYIIGVETCCIGSSCLKQTSF-------KAIVDSGSSFTFL 316
+ STS + + Y Y + +E +GS ++ TS ++DSG++ T L
Sbjct: 264 SGDGVVSTSMVKKDPATY--YFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTLL 321
Query: 317 PKEVY--------ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
P Y TI AE Q D I S CY+ SS K+P + + F
Sbjct: 322 PSNFYYELESVVASTIKAE-RVQDPDGILSL-------CYRDSSSF--KVPDITVHFKGG 371
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQ-NFMTGYRVV 414
+ + N FV ++ V+ F A G + Q NF+ GY V
Sbjct: 372 DVKLGNLNTFVAV-SEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTV 417
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 155/389 (39%), Gaps = 50/389 (12%)
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYY 121
Q+ LSS + Q + ++ GSK M++ D G DL W+ C+ C+ C +
Sbjct: 51 QIPLSSGINLQTLN-----YIVTMGLGSKNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIF 105
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
+ SST + L + + G + C Y ++Y + ++ L VE
Sbjct: 106 KPSTSSSYQSVSCNSSTCQSLQFATG--NTGACGSSNPSTCNYVVNYGDGSYTNGELGVE 163
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
L GG + + + GCG + + G GV+ GL+GLG +S+ S
Sbjct: 164 ---ALSFGGVSV------SDFVFGCG-RNNKGLFGGVS--GLMGLGRSYLSLVS--QTNA 209
Query: 242 LIRNSFSMCF---DKDDSGRIFFGDQGPATQQS-----TSFLASNGKYITYIIGVETCCI 293
FS C + SG + G++ + + T L++ YI+ + +
Sbjct: 210 TFGGVFSYCLPTTEAGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDV 269
Query: 294 GSSCLKQ-TSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP------ 343
G LK SF ++DSG+ T LP VY+ + AEF + F G+P
Sbjct: 270 GGVALKAPLSFGNGGILIDSGTVITRLPSSVYKALKAEF-------LKKFTGFPSAPGFS 322
Query: 344 -WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG--DI 400
C+ + +P++ L F N V+ + + CLA+ + D
Sbjct: 323 ILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDT 382
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
IG RV++D + K+G++ C
Sbjct: 383 AIIGNYQQRNQRVIYDTKQSKVGFAEEPC 411
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 149/370 (40%), Gaps = 60/370 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT- 153
D G +L W+ CAP R + P AS T + C C DL +
Sbjct: 83 DTGSELSWL-----LCAPGGGGGGGG--RSALSFRPRASLTFASVPCGSAQCRSRDLPSP 135
Query: 154 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+C + C ++ Y + +SS G L ++ + G ++A+ GC
Sbjct: 136 PACDGASKQCRVSLSY-ADGSSSDGALATEVFTVGQG------PPLRAA--FGCMATAFD 186
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFG--------- 262
DGVA GL+G+ G + S +++A R FS C D+DD+G + G
Sbjct: 187 TSPDGVATAGLLGMNRGAL---SFVSQASTRR--FSYCISDRDDAGVLLLGHSDLPFLPL 241
Query: 263 DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSFT 314
+ P Q + +A + + + +G + I +S L A +VDSG+ FT
Sbjct: 242 NYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFT 301
Query: 315 FLPKEVYETIAAEFDRQ-------VNDTITSFEGYPWKCCYKSSSQRLP--KLPSVKLMF 365
FL + Y + AEF RQ +ND +F+ + C++ R P +LP+V L+F
Sbjct: 302 FLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEA-FDTCFRVPQGRAPPARLPAVTLLF 360
Query: 366 PQNNSFVVNNPVFVIYGTQVVTG---FCLAIQPVDGDIGT---IGQNFMTGYRVVFDREN 419
V + + + G +CL D T IG + V +D E
Sbjct: 361 NGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLER 420
Query: 420 LKLGWSHSNC 429
++G + C
Sbjct: 421 GRVGLAPIRC 430
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 142/350 (40%), Gaps = 46/350 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G D+ W+ C C C Y D + P+ASST ++C + C +S
Sbjct: 179 DTGSDINWLQCQPCTDC-------YQQTDP---IFDPTASSTYAPVTCQSQQCSSLEMSS 228
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C++ + C Y ++Y + + E + G ++KN V +GCG G +
Sbjct: 229 CRSGQ--CLYQVNYGDGSYTFGDFATESVSF---GNSGSVKN-----VALGCGHDNEGLF 278
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR--IFFGDQGPATQQS 271
+ GL G L SL + L SFS C ++D +G + F
Sbjct: 279 VGAAGLLGLGGGPL------SLTNQ--LKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSV 330
Query: 272 TSFLASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLPKEV 320
T+ L N K T Y +G+ +G + +++F+ IVD G++ T L +
Sbjct: 331 TAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQA 390
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + F R + + + CY S Q ++P+V F S+ + ++I
Sbjct: 391 YNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLI 450
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
T +C A P + IG G RV FD N ++G+S + CQ
Sbjct: 451 PVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKCQ 499
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 80/349 (22%), Positives = 130/349 (37%), Gaps = 33/349 (9%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W+ C C C P S + P SST +C + C L Q
Sbjct: 108 DTGSDLIWVQCSPCASCFPQSTPLFQ----------PLKSSTFMPTTCRSQPCTLLLPEQ 157
Query: 157 N---PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C YT Y + + S GLL + L S G ++ + GCG+ +
Sbjct: 158 KGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQG--GVQTVAFPNSFFGCGLYNNIT 215
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQ 270
G++GLG G +S+ S + I + FS C + ++ FG++ T +
Sbjct: 216 VFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLKFGNESIITGE 273
Query: 271 ---STSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKAIVDSGSSFTFLPKEVYETIA 325
ST + Y + +E + + T I+DSG+ T+L + Y A
Sbjct: 274 GVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTYLGESFYYNFA 333
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV 385
A + + P C+ + P + F + V P + T+
Sbjct: 334 ASLQESLAVELVQDVLSPLPFCFPYRDNFV--FPEIAFQF--TGARVSLKPANLFVMTED 389
Query: 386 VTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
CL I P V G I G ++V +D E K+ + ++C +
Sbjct: 390 RNTVCLMIAPSSVSG-ISIFGSFSQIDFQVEYDLEGKKVSFQPTDCSKV 437
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 155/402 (38%), Gaps = 71/402 (17%)
Query: 62 YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASY 120
+Q LL + V M +L T S+ D G DL+W C C +C
Sbjct: 75 FQALLENGVGGYNMNISVGTPLL-------TFSVVADTGSDLIWTQCAPCTKC------- 120
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYT---MDYYTENTSSSG 177
+ + P++SST L C+ C N + C T +Y + ++G
Sbjct: 121 ---FQQPAPPFQPASSSTFSKLPCTSSFCQF---LPNSIRTCNATGCVYNYKYGSGYTAG 174
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
L + L + GD + SV GC + G + G+ GLG G +S L+
Sbjct: 175 YLATETLKV---GDASFP-----SVAFGCSTENG----VGNSTSGIAGLGRGALS---LI 219
Query: 238 AKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQ---QSTSFLASNGKYITYI-IGVET 290
+ G+ R FS C + I FG T QST F+ + + +Y + +
Sbjct: 220 PQLGVGR--FSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTG 277
Query: 291 CCIGSSCLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+G + L T+ IVDSG++ T+L K+ YE + F Q D T
Sbjct: 278 ITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVN 337
Query: 340 EGYPWKCCYKSSSQRLPKL--PSVKLMFPQNNSFVVNNPVFVIYGTQV-----VTGFCLA 392
C+KS+ + PS+ L F + V P + G + VT CL
Sbjct: 338 GTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAV--PTY-FAGVETDSQGSVTVACLM 394
Query: 393 IQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ P GD + IG +++D + ++ ++C +
Sbjct: 395 MLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADCAKV 436
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 83/354 (23%), Positives = 138/354 (38%), Gaps = 54/354 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DL-GTS 154
D G DL+W C+ C +C +N P SS+ L C + C DL +
Sbjct: 114 DTGSDLIWTQCEPCTQCFSQPTPIFN----------PQDSSSFSTLPCESQYCQDLPSET 163
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-G 213
C N + C YT Y +T+ + E + S ++ GCG G G
Sbjct: 164 CNNNE--CQYTYGYGDGSTTQGYMATETF---------TFETSSVPNIAFGCGEDNQGFG 212
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQG---PA 267
+G GLIG+G G +S+PS L FS C + + G P
Sbjct: 213 QGNGA---GLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTLALGSAASGVPE 264
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLP 317
ST+ + S+ Y I ++ +G L ++F+ I+DSG++ T+LP
Sbjct: 265 GSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLP 324
Query: 318 KEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
++ Y +A F Q+N T+ C + S ++P + + F +
Sbjct: 325 QDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQN 384
Query: 377 VFVIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + V+ CLA+ I G +V++D +NL + + + C
Sbjct: 385 ILISPAEGVI---CLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 91/367 (24%), Positives = 143/367 (38%), Gaps = 52/367 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+K S+ D G L W+ C CV Y + D ++PS S T K L CS
Sbjct: 123 AKYFSMIVDTGSSLSWLQCQPCV--------IYCHVQVD-PIFTPSTSKTYKALPCSSSQ 173
Query: 149 CDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
C C N C Y Y + + S G L +D+L L + +
Sbjct: 174 CSSLKSSTLNAPGCSNATGACVYKASY-GDTSFSIGYLSQDVLTLTP------SEAPSSG 226
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
+ GCG G L G + G+IGL +IS+ L+K N+FS C S
Sbjct: 227 FVYGCGQDNQG--LFGRS-SGIIGLANDKISMLGQLSKK--YGNAFSYCLPSSFSAPNSS 281
Query: 262 GDQGPATQQSTSFLASNGKYI----------TYIIGVETCCIGSSCLKQTS----FKAIV 307
G + ++S +S K+ Y + + T + L ++ I+
Sbjct: 282 SLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTII 341
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFP 366
DSG+ T LP VY + F ++ G+ C+K S + + +P ++++F
Sbjct: 342 DSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFR 401
Query: 367 QNNSFVV---NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
+ N+ V + GT CLAI I IG ++V +D N K+G
Sbjct: 402 GGAGLELKAHNSLVEIEKGTT-----CLAIAASSNPISIIGNYQQQTFKVAYDVANFKIG 456
Query: 424 WSHSNCQ 430
++ CQ
Sbjct: 457 FAPGGCQ 463
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 148/364 (40%), Gaps = 58/364 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ + L D D+ WIPC CV C +A +SP+ S++ K++SCS
Sbjct: 109 AQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSCSAPQ 156
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C + + C + + Y + + +++ L +D + L + A GC
Sbjct: 157 CKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKAFT--------FGCVN 206
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQ 264
K +GG G P LGLG + + + +++FS C SG + G
Sbjct: 207 KVAGG---GTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPT 263
Query: 265 G-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSF 313
P + T L + + Y + + +G + T I DSG+ +
Sbjct: 264 SQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVY 323
Query: 314 TFLPKEVYETIAAEFDRQVNDT---ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNN 369
T L K VYE + EF ++V T +TS G+ CY K+P++ MF N
Sbjct: 324 TRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGF--DTCYSGQV----KVPTITFMFKGVNM 377
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAI----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ +N +++ T T CLA+ + V+ + I +RV+ D N +LG +
Sbjct: 378 TMPADN--LMLHSTAGSTS-CLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLA 434
Query: 426 HSNC 429
C
Sbjct: 435 RERC 438
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 146/390 (37%), Gaps = 76/390 (19%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+T D G L+W PC C RC + N + + P SS+S + C +
Sbjct: 103 QTTKFVMDTGSSLVWFPCTSRYLCSRC-----DFPNIEVTGIPTFIPKQSSSSNLIGCKN 157
Query: 147 RLCD--LGTSCQNPKQPC------------PYTMDYYTENTSSSGLLVEDILHLISGGDN 192
C G Q+ Q C PY + Y +T +GLL+ + L D
Sbjct: 158 HKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGST--AGLLLSETL------DF 209
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 252
K ++ ++GC + P+G+ G G S+PS L S FD
Sbjct: 210 PHKKTIPG-FLVGCSL------FSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFD 262
Query: 253 KDDSGRIFFGDQGPATQQSTSFLASNGKYIT---------YIIGVETCCIGSSCLKQTSF 303
+ D G + + + S + Y + + IG + +K +
Sbjct: 263 DTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVK-VPY 321
Query: 304 K-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE---GYPWKCCYK 349
K IVDSG++FTF+ K VYE +A EF++QV + E + C+
Sbjct: 322 KFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFN 381
Query: 350 SSSQRLPKLPS--------VKLMFPQNN--SFVVNNPVFVIYGTQVVTGFCLAIQPVDGD 399
S ++ +P K+ P N SFV + + + + ++G + P
Sbjct: 382 ISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAI-- 439
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G + V FD +N + G+ NC
Sbjct: 440 --ILGNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 116/277 (41%), Gaps = 55/277 (19%)
Query: 90 SKTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
S+T+S D G L+W PC C RC S+ N + + P SS++K + C
Sbjct: 116 SQTLSFVMDTGSSLVWFPCTSRYVCTRC-----SFPNIDPAKIPTFIPKLSSSAKIVGCL 170
Query: 146 HRLCDLGTSCQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
+ C +N + CP Y T+ LL+E ++ +
Sbjct: 171 NPKCGFVMDSENSANCTKACPTYAIQYGLGTTVGLLLLESLV---------FAERTEPDF 221
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DDSGR--- 258
++GC + L P G+ G G G S+P + GL + S+ + + DDS +
Sbjct: 222 VVGCSI------LSSRQPSGIAGFGRGPSSLPK---QMGLKKFSYCLLSHRFDDSPKSSK 272
Query: 259 --IFFG----DQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSCLK-QTSFKA- 305
++ G D T F ++SN + Y + + +G +K SF
Sbjct: 273 MTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVA 332
Query: 306 --------IVDSGSSFTFLPKEVYETIAAEFDRQVND 334
IVDSGS+FTF+ K V+E +A EFDRQ+ +
Sbjct: 333 GSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMAN 369
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 135/351 (38%), Gaps = 40/351 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--- 154
D G L W+ C CA + + L Y PS S T K LSC+ C +
Sbjct: 4 DTGSSLSWLQCQ--PCAVYCHAQADPL------YDPSVSKTYKKLSCASVECSRLKAATL 55
Query: 155 ----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C+ C YT Y + + S G L +D+L L S + GCG
Sbjct: 56 NDPLCETDSNACLYTAS-YGDTSFSIGYLSQDLLTLTS-------SQTLPQFTYGCGQDN 107
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCFDKDDSGRIFFGDQ----- 264
G L G A G+IGL ++S+ + L+ K G ++FS C +SG G
Sbjct: 108 QG--LFGRAA-GIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSSGGGFLSIGSI 161
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEV 320
P + + T L + Y + + + L + ++DSG+ T LP +
Sbjct: 162 SPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSM 221
Query: 321 YETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + F + ++ Y C+K S + + +P +K++F + P +
Sbjct: 222 YAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSIL 281
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
I + +T A I IG Y + +D ++G++ +C
Sbjct: 282 IEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSCH 332
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 72/137 (52%), Gaps = 13/137 (9%)
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 180
Y D D ++ P SST + + C ++ +C + K+ C Y +Y E++SS G+L
Sbjct: 155 YGLFDED-PKFQPELSSTYQPVKC-----NMDCNCDDDKEQCVYEREY-AEHSSSKGVLG 207
Query: 181 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 240
ED LIS G+ + +A + GC ++G A DG+IGLG G++S+ L
Sbjct: 208 ED---LISFGNESHLTPQRA--VFGCKTVETGDLYSQRA-DGIIGLGQGDLSLVGQLVDK 261
Query: 241 GLIRNSFSMCFDKDDSG 257
GLI NSF +C+ D G
Sbjct: 262 GLISNSFGLCYGGLDVG 278
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/378 (23%), Positives = 144/378 (38%), Gaps = 61/378 (16%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
+K+ L D G L W+ CD C C + Y + L ++C+
Sbjct: 413 AKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKL-------------VTCADS 459
Query: 148 LC-DLGTSCQNPK-----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
LC DL T PK + C Y + Y ++SS G+LV D L +A + +
Sbjct: 460 LCTDLYTDLGKPKRCGSQKQCDYVIQYV--DSSSMGVLVIDRFSL-----SASNGTNPTT 512
Query: 202 VIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRI 259
+ GCG Q + P D ++GL G++++ S L G+I ++ C G +
Sbjct: 513 IAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFL 572
Query: 260 FFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
FFGD Q P + + + + KY + G S + I DSG+++T+
Sbjct: 573 FFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFAA 632
Query: 319 EVYE-----------------TIAAEFDRQV------NDTITSFEGYPWKCCYKSSSQRL 355
+ Y+ T E DR + D I + + K C++S S
Sbjct: 633 QPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTID--EVKKCFRSLSLEF 690
Query: 356 PK-LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVV 414
L P + +++ V G + L++ + IG M V+
Sbjct: 691 ADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTN----LIGGITMLDQMVI 746
Query: 415 FDRENLKLGWSHSNCQDL 432
+D E LGW + C +
Sbjct: 747 YDSERSLLGWVNYQCDRI 764
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 112/297 (37%), Gaps = 39/297 (13%)
Query: 162 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS-GGYLDGVAP 220
C Y + Y + S+ G L+ D L + + + ++ GCG Q G +P
Sbjct: 29 CDYEIKY-ADGASTIGALIVDQFSLP-------RIATRPNLPFGCGYNQGIGENFQQTSP 80
Query: 221 -DGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASN 278
+G++GL G++S S L G+I ++ C G +F GD + L +N
Sbjct: 81 VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGD----GDGNLVLLHAN 136
Query: 279 GKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 338
Y G T L + DSGS++T+ + Y+ ++ T
Sbjct: 137 ----YYSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSLE 192
Query: 339 FEGYP-----WKC--CYKSSSQRLPKLPSVKLMFPQNNSFVV---NNPVFVIYGTQVVTG 388
P WK ++S + S++L F N + N + YG
Sbjct: 193 QVSDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGNNAVMEIPPENYLIVTEYGN----- 247
Query: 389 FCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGP 444
CL I + IG M V++D E +LGW +C DG++ T P
Sbjct: 248 VCLGILHGCRLNFNIIGDITMQDQMVIYDNEREQLGWIRGSC----DGSQEAPTQAP 300
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 95/402 (23%), Positives = 158/402 (39%), Gaps = 95/402 (23%)
Query: 90 SKTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
S+T D G L+W+PC C +C S + ++ P SS+SK + C+
Sbjct: 96 SQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFS---------NTPKFIPKNSSSSKFVGCT 146
Query: 146 HRLC------DLGTSC--------QNPKQPCP-YTMDYYTENTSSSGLLVEDILHLISGG 190
+ C D+ + C N Q CP YT+ Y +T+ G L+ + L+
Sbjct: 147 NPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGSTA--GFLLSENLNF---- 200
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
+ ++GC + + P G+ G G GE S+PS + L R S+ +
Sbjct: 201 ----PTKKYSDFLLGCSV------VSVYQPAGIAGFGRGEESLPS---QMNLTRFSYCLL 247
Query: 251 FDK-DDSGRI-----------------------FFGDQGPATQQSTSFLASNGKYITY-- 284
+ DDS I F + P T+++ +F A YIT
Sbjct: 248 SHQFDDSATITSNLVLETASSRDGKTNGVSYTPFL--KNPTTKKNPAFGAY--YYITLKR 303
Query: 285 -IIGVETCCIGSSCLKQT---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
++G + + L+ IVDSGS+FTF+ + +++ +A EF +QV+ T
Sbjct: 304 IVVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREA 363
Query: 341 GYPW---KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV---FVIYGTQVVTGFCLAIQ 394
+ C + P ++ F + PV F + G V +
Sbjct: 364 EKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRL--PVANYFSLVGKGDVACLTIVSD 421
Query: 395 PVDGDIGTIGQNFMTG------YRVVFDRENLKLGWSHSNCQ 430
V G GT+G + G + V +D EN + G+ +CQ
Sbjct: 422 DVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQ 463
>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 436
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 100/406 (24%), Positives = 160/406 (39%), Gaps = 99/406 (24%)
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL- 151
++L D G W+ CD SY SST K + CS C L
Sbjct: 60 INLTIDLGGGYFWVNCD--------KSY--------------VSSTLKPILCSSSQCSLF 97
Query: 152 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS---VIIGCGM 208
G+ + K+ C + S+SG + DI+ + S N V I G +
Sbjct: 98 GSHGCSDKKICGRSPYNIVTGVSTSGDIQSDIVSVQSTNGNYSGRFVSVPNFLFICGSNV 157
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD----- 263
Q+G GV G+ GLG ++S+PS + A +N F++C + G +FFGD
Sbjct: 158 VQNG-LAKGV--KGMAGLGRTKVSLPSQFSSAFSFKNKFAICLGTQN-GVLFFGDGPYLF 213
Query: 264 --------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-QTSFKAIVD 308
P + +SFL K + Y IGV++ + S +K T+ +I
Sbjct: 214 NFDESKNLIYTPLITNPVSTSPSSFLGE--KSVEYFIGVKSIRVSSKNVKLNTTLLSIDQ 271
Query: 309 SG---------SSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCYKS---SSQRL 355
+G + +T + +Y+ +A F + +N +++ E P+ C+ S SS R+
Sbjct: 272 NGFGGTKISTVNPYTIMETSIYKAVADAFVKALN--VSTVEPVAPFGTCFASQSISSSRM 329
Query: 356 -PKLPSVKLMFPQNNSFVVN----NPVFVIYGTQVVTGFCLAIQPVDGDIG--------- 401
P +PS+ L+ QN + V N N + I V+ CL D
Sbjct: 330 GPDVPSIDLVL-QNENVVWNIIGANAMVRINDKDVI---CLGFVDAGSDFAKTSQVGFVV 385
Query: 402 ---------TIGQNFMTGYRVVFDRENLKLGW-----SHSNCQDLN 433
TIG + + + FD +LG+ H NC + N
Sbjct: 386 GGSKPMTSITIGAHQLENNLLQFDLATSRLGFRSLFLEHDNCGNFN 431
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 144/379 (37%), Gaps = 63/379 (16%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
+K+ L D G L W+ CD C C + Y + L ++C+
Sbjct: 48 AKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKL-------------VTCADS 94
Query: 148 LC-DLGTSCQNPK-----QPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQA 200
LC DL T PK + C Y + Y ++SS G+LV D L S G N
Sbjct: 95 LCTDLYTDLGKPKRCGSQKQCDYVIQYV--DSSSMGVLVIDRFSLSASNGTNP------T 146
Query: 201 SVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGR 258
++ GCG Q + P D ++GL G++++ S L G+I ++ C G
Sbjct: 147 TIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGF 206
Query: 259 IFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
+FFGD Q P + + + + KY + G S + I DSG+++T+
Sbjct: 207 LFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFA 266
Query: 318 KEVYE-----------------TIAAEFDRQV------NDTITSFEGYPWKCCYKSSSQR 354
+ Y+ T E DR + D I + + K C++S S
Sbjct: 267 AQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEV--KKCFRSLSLE 324
Query: 355 LPK-LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRV 413
L P + +++ V G + L++ + IG M V
Sbjct: 325 FADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTN----LIGGITMLDQMV 380
Query: 414 VFDRENLKLGWSHSNCQDL 432
++D E LGW + C +
Sbjct: 381 IYDSERSLLGWVNYQCDRI 399
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 137/340 (40%), Gaps = 47/340 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTS 154
D G DLLW +CAP Y +D + P SST K +SCS C + S
Sbjct: 108 DTGSDLLW-----TQCAPCDDCY-TQVDP---LFDPKTSSTYKDVSCSSSQCTALENQAS 158
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C C Y++ Y +N+ + G + D L L S ++ ++IIGCG +G +
Sbjct: 159 CSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF 214
Query: 215 LDGVAPDGLIGLGLGEISVP-SLLAKAG-LIRNSFSMCF-----DKDDSGRIFFGDQGPA 267
+ +G P SL+ + G I FS C KD + +I FG
Sbjct: 215 ------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268
Query: 268 TQQ---STSFLASNGKYITYIIGVETCCIGSSCLK-------QTSFKAIVDSGSSFTFLP 317
+ ST +A + Y + +++ +GS ++ + I+DSG++ T LP
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLP 328
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
E Y + ++ CY ++ K+P + + F + + ++
Sbjct: 329 TEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDL--KVPVITMHFDGADVKLDSSNA 386
Query: 378 FVIYGTQVVTGFCLAIQ--PVDGDIGTIGQ-NFMTGYRVV 414
FV +V C A + P G + Q NF+ GY V
Sbjct: 387 FVQVSEDLV---CFAFRGSPSFSIYGNVAQMNFLVGYDTV 423
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 153/372 (41%), Gaps = 56/372 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSAS-YYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ +++ D G DL W V+C P S+ Y+ D ++PS+SST + C
Sbjct: 95 ARDLTVVFDTGSDLSW-----VQCGPCSSGGCYHQQD---PLFAPSSSSTFSAVRCGEPE 146
Query: 149 CDLGT-SCQNP--KQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQA--SV 202
C SC + CPY + Y + + + G L D L L + NA +N+
Sbjct: 147 CPRARQSCSSSPGDDRCPYEV-VYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGF 205
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS---GRI 259
+ GCG +G L G A DGL GLG G++S+ S AG FS C S G +
Sbjct: 206 VFGCGENNTG--LFGKA-DGLFGLGRGKVSLSS--QAAGKYGEGFSYCLPSSSSNAHGYL 260
Query: 260 FFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA------IVDSGS 311
G PA + T L + Y + + + +K +S A IVDSG+
Sbjct: 261 SLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDSGT 320
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK---------CCYK--SSSQRLPKLPS 360
T L Y + F +++ Y +K CY + + +P+
Sbjct: 321 VITRLAPRAYSALRTAF-------LSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPA 373
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD---IGTIGQNFMTGYRVVFDR 417
V L+F + V+ V+Y +V CLA P +G+ G +G VV+D
Sbjct: 374 VALVFAGGATISVDFS-GVLYVAKVAQA-CLAFAP-NGNGRSAGILGNTQQRTVAVVYDV 430
Query: 418 ENLKLGWSHSNC 429
K+G++ C
Sbjct: 431 GRQKIGFAAKGC 442
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/291 (24%), Positives = 118/291 (40%), Gaps = 29/291 (9%)
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+NP Q C Y + Y SS G+L+ D L G D + ++ GCG Q GG
Sbjct: 73 ENPNQ-CDYDVRY-AGGESSLGVLIADKFSL-PGRD------ARPTLTFGCGYDQEGGKA 123
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFG-DQGPATQQSTS 273
+ + DG++G+G G + S L + G I N C G +FFG ++ P++ +
Sbjct: 124 E-MPVDGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHEKVPSSVVTWV 182
Query: 274 FLASNGKYITYIIGVETCCIGSSC---LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 330
+ N Y Y G+ + + + ++DSGS++T++P E Y +
Sbjct: 183 PMVPNNHY--YSPGLAALHFNGNLGNPISVAPMEVVIDSGSTYTYMPTETYRRLVFVVIA 240
Query: 331 QVNDTITSFEGYP-----W--KCCYKSSSQRLPKLPSVKLMFPQNNSFVV-----NNPVF 378
++ + + P W K +K K ++L F Q S + N +
Sbjct: 241 SLSKSSLTLVRDPALPVCWAGKEPFKXIGDVKDKFKPLELAFIQGTSQAIMEIPPENYLI 300
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ V G Q + IG M V++D E ++GW + C
Sbjct: 301 ISGEGNVCMGILDGTQAGLRKLNVIGDISMQNQLVIYDNERARIGWVRAPC 351
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 99/404 (24%), Positives = 160/404 (39%), Gaps = 79/404 (19%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F +F K SL D G DL WI C C C + YY+ P
Sbjct: 85 LGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYD----------P 134
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDI--LH 185
SS+ +++ C C L +S C+ Q CPY +Y ++++++G + ++
Sbjct: 135 KESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFY-WYGDSSNTTGDFATETFTVN 193
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 245
L S + V+ +V+ GCG + G G + +G G S S L L +
Sbjct: 194 LTSPTGKSEFKRVE-NVMFGCG-HWNRGLFHGASGLLGLGRGPLSFS--SQLQS--LYGH 247
Query: 246 SFSMCF-----DKDDSGRIFFG-DQGPATQQSTSFLASNG------------KYITYIIG 287
SFS C D + S ++ FG D+ +F G + + ++G
Sbjct: 248 SFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVG 307
Query: 288 VETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP- 343
E I S TS IVDSG++ ++ + Y+ I F ++V +GYP
Sbjct: 308 GEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKV-------KGYPI 360
Query: 344 ------WKCCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGF 389
CY S LP ++F P N F+ +P V+
Sbjct: 361 VQDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVV--------- 411
Query: 390 CLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
CLAI + IG + V++D + +LG++ NC D+
Sbjct: 412 CLAILGTPRSALSIIGNYQQQNFHVLYDTKKSRLGYAPMNCADV 455
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 144/350 (41%), Gaps = 45/350 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G D+ WI C+ C C S YN P+ SS+ K + C LC L S
Sbjct: 163 DTGSDVTWIQCEPCSDCYQQSDPIYN----------PALSSSYKLVGCQANLCQQLDVSG 212
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y + Y + + + G + L L G L+N V IGCG G +
Sbjct: 213 CSRNGSCLYQVSY-GDGSYTQGNFATETLTL---GGAPLQN-----VAIGCGHDNEGLF- 262
Query: 216 DGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQS 271
V GL+GLG G +S PS L + G I FS C D + S + FG
Sbjct: 263 --VGAAGLLGLGGGSLSFPSQLTDENGKI---FSYCLVDRDSESSSTLQFGRAAVPNGAV 317
Query: 272 TSFLASNGKYIT-YIIGVETCCIGSSCLKQT----------SFKAIVDSGSSFTFLPKEV 320
+ + N + T Y + + +G L + + IVDSG++ T L
Sbjct: 318 LAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRLQTAA 377
Query: 321 YETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y+++ F R + S +G + CY SS+ +P+V F S + ++
Sbjct: 378 YDSLRDAF-RAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAKNYL 436
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + T FC A P + +G G RV FDR N ++G++ + C
Sbjct: 437 VPVDSMGT-FCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 137/340 (40%), Gaps = 47/340 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTS 154
D G DLLW +CAP Y +D + P SST K +SCS C + S
Sbjct: 108 DTGSDLLW-----TQCAPCDDCY-TQVDP---LFDPKTSSTYKDVSCSSSQCTALENQAS 158
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C C Y++ Y +N+ + G + D L L S ++ ++IIGCG +G +
Sbjct: 159 CSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF 214
Query: 215 LDGVAPDGLIGLGLGEISVP-SLLAKAG-LIRNSFSMCF-----DKDDSGRIFFGDQGPA 267
+ +G P SL+ + G I FS C KD + +I FG
Sbjct: 215 ------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268
Query: 268 TQQ---STSFLASNGKYITYIIGVETCCIGSSCLK-------QTSFKAIVDSGSSFTFLP 317
+ ST +A + Y + +++ +GS ++ + I+DSG++ T LP
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLP 328
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
E Y + ++ CY ++ K+P + + F + + ++
Sbjct: 329 TEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDL--KVPVITMHFDGADVKLDSSNA 386
Query: 378 FVIYGTQVVTGFCLAIQ--PVDGDIGTIGQ-NFMTGYRVV 414
FV +V C A + P G + Q NF+ GY V
Sbjct: 387 FVQVSEDLV---CFAFRGSPSFSIYGNVAQMNFLVGYDTV 423
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 85/373 (22%), Positives = 155/373 (41%), Gaps = 67/373 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--- 153
D G DL W+ C C+ C ++ + P+ASS+ ++++C + C L
Sbjct: 167 DTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPE 216
Query: 154 ---SCQNPKQP-CPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGM 208
+C+ P + CPY Y ++ ++ L +E ++L + G + + V+ GCG
Sbjct: 217 APRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD----GVVFGCGH 272
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSGRIFFGDQ- 264
+ G + GL L S L A G ++FS C + D ++ FG+
Sbjct: 273 RNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVEHGSDAGSKVVFGEDY 327
Query: 265 ---GPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL----------KQTSFKAIVDSG 310
+ T+F ++ T Y + ++ +G L K S I+DSG
Sbjct: 328 LVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSG 387
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLM----- 364
++ ++ + Y+ I F ++ +P CY S P++P + L+
Sbjct: 388 TTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVERPEVPELSLLFADGA 447
Query: 365 ---FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQ--PVDGDIGTIGQNFMTGYRVVFDREN 419
FP N FV +P ++ CLA++ P G + IG + VV+D +N
Sbjct: 448 VWDFPAENYFVRLDPDGIM---------CLAVRGTPRTG-MSIIGNFQQQNFHVVYDLQN 497
Query: 420 LKLGWSHSNCQDL 432
+LG++ C ++
Sbjct: 498 NRLGFAPRRCAEV 510
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 138/360 (38%), Gaps = 38/360 (10%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL W C C C P Y++ P AS+T + S R C T+
Sbjct: 113 DTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASATCLPIWRSSRNCTATTT-- 170
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYL 215
PC Y Y + S+G+L + L A V V GCG+ G
Sbjct: 171 ---SPCRYRYAY-DDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSY 226
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDDSGRIFFGDQ--------- 264
+ G +GLG G +S L+A+ G+ + S+ + F+ + FG
Sbjct: 227 NST---GTVGLGRGSLS---LVAQLGVGKFSYCLTDFFNTSLGSPVLFGSLAELAAPSTI 280
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFT 314
G A QST + Y + +E +G + L S IVDSG+ FT
Sbjct: 281 GGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFT 340
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVV 373
L + + + +N + + C ++ Q+LP +P + L F +
Sbjct: 341 VLVESAFRVVVNHVAGVLNQPVVNASSLDSPCFPATAGEQQLPDMPDMLLHFAGGADMRL 400
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLKLGWSHSNCQDL 432
+ ++ + Q + FCL I G+I NF +++FD +L + ++C L
Sbjct: 401 HRDNYMSF-NQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDITVGQLSFVPTDCSKL 459
>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
Length = 245
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 94/226 (41%), Gaps = 19/226 (8%)
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGK 280
DG++GLG G+ S+ S L GL+RN C G IFFGD +++ + + ++S
Sbjct: 13 DGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGDVYDSSRLTWTPMSSR-D 71
Query: 281 YITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN------- 333
Y+ G G + D+GSS+T+ Y+ + + +++
Sbjct: 72 LKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAGKPLKEA 131
Query: 334 -DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ----NNSFVVNNPVFVIYGTQVVTG 388
D T + K ++S + S+ L F N F + ++I +
Sbjct: 132 PDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVSN--MGN 189
Query: 389 FCLAI----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
CL I + GD+ IG M +VFD E +GW+ ++C
Sbjct: 190 VCLGILDGSEVGMGDLNLIGDISMLDKVMVFDNEKRLIGWAPADCN 235
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 139/361 (38%), Gaps = 46/361 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K +SL D G DL W +C P S Y + + PS S T ++SC+ C
Sbjct: 165 KDLSLIFDTGSDLTW-----TQCQPCVKSCY---AQQQPIFDPSTSKTYSNISCTSAACS 216
Query: 151 -----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
G S C Y + Y +++ + G +D L L +N V + G
Sbjct: 217 SLKSATGNSPGCSSSNCVYGIQY-GDSSFTIGFFAKDKLTLT-------QNDVFDGFMFG 268
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGD 263
CG G L G GLIGLG +S+ A+ FS C + +G + FG+
Sbjct: 269 CGQNNKG--LFGKTA-GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRGSNGHLTFGN 323
Query: 264 -----QGPATQQSTSF--LASNGKYITYIIGVETCCIGSSCLKQTSF-----KAIVDSGS 311
A + +F AS+ Y I V +G L + I+DSG+
Sbjct: 324 GNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTIIDSGT 383
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
T LP Y ++ + F + ++ T+ CY S+ +P + F N +
Sbjct: 384 VITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNANV 443
Query: 372 VVN-NPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
++ N + + G V CLA D IG G VV+D +LG+ +
Sbjct: 444 ELDPNGILITNGASQV---CLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGFGYKG 500
Query: 429 C 429
C
Sbjct: 501 C 501
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 141/357 (39%), Gaps = 50/357 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 92 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 140
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 141 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 190
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 191 SFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSERGFFS 246
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF 306
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 307 DSGSELSYIPDRALSVLSQRI-RELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 365
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F + ++ VFV Q +CLA P + + IG T VV+D + +G
Sbjct: 366 GARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIG 421
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/306 (25%), Positives = 127/306 (41%), Gaps = 28/306 (9%)
Query: 141 HLSCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
H SC LC L T +P++ C YT Y +N+ + G+L +D S N K
Sbjct: 18 HNSCDSPLCHKLDTGVCSPEKRCNYTYGY-GDNSLTKGVLAQDTATFTS---NTGKLVSL 73
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI--RNSFSMCF-----D 252
+ + GCG +GG+ D GLIGLG G S L+++ G + FS C D
Sbjct: 74 SRFLFGCGHNNTGGFNDHEM--GLIGLGGGPTS---LISQIGPLFGGKKFSQCLVPFLTD 128
Query: 253 KDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSF----KA 305
S R+ FG +T + +Y + + + + L S
Sbjct: 129 IKISSRMSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNM 188
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 364
+VDSG+ LP+++Y+ + E V + IT+ + CY++ + K P++
Sbjct: 189 LVDSGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNL--KGPTLTYH 246
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLKLG 423
F N + F+ + FCLAI G + NF + Y + FD + +
Sbjct: 247 FEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVS 306
Query: 424 WSHSNC 429
+ ++C
Sbjct: 307 FKATDC 312
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 148/365 (40%), Gaps = 59/365 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G D++W+ C C RC S ++ P SS+ + C LC D G
Sbjct: 4 DTGSDVVWVQCAPCRRCYEQSGPVFD----------PRRSSSYGAVGCGAALCRRLDSG- 52
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + C Y + Y + + ++G V + L G + A V +GCG G
Sbjct: 53 GCDLRRGACMYQV-AYGDGSVTAGDFVTETLTFAGG-------ARVARVALGCGHDNEGL 104
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR-----------IFF 261
+ VA GL+GLG G +S P+ +++ SFS C D+ SG + F
Sbjct: 105 F---VAAAGLLGLGRGGLSFPTQISR--RYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSF 159
Query: 262 GDQGPATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTSFK---------AI 306
G G S SF + N + Y ++G+ + ++ + I
Sbjct: 160 G-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVI 218
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTIT-SFEGYP-WKCCYKSSSQRLPKLPSVKLM 364
VDSG+S T L + Y + F + S G+ + CY +R+ K+P+V +
Sbjct: 219 VDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMH 278
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
F + ++I T FC A DG + IG G+RVVFD + ++G+
Sbjct: 279 FAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 337
Query: 425 SHSNC 429
+ C
Sbjct: 338 APKGC 342
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 109/454 (24%), Positives = 186/454 (40%), Gaps = 68/454 (14%)
Query: 3 RISLTIYLAVFWLLTESSG-----AETVMFSTKLIHRFSEEVKALGVSKNR-----NATS 52
R L+ L++ +L SG AE + F+T+LIHR S S+ NA
Sbjct: 8 RTLLSFALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNSPLFNASETTDIRLANAVE 67
Query: 53 WPAKKSFEYYQVLLSSDVQKQKMKT---GPQFQMLFPSQGSKTMSLGN-DFGCDLLWIPC 108
A + + L+S+ + + + F M T L N G DL+WIPC
Sbjct: 68 RSADR-VNRFNDLISNSITAAEFPSILDNGDFLMKISIGIPPTELLVNVATGSDLVWIPC 126
Query: 109 DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY 168
+ P + + DL + P SST K++ C C + + C Y+ D
Sbjct: 127 --LSFKPCTH------NCDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDP 178
Query: 169 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 228
+++ G L D L L S K+ + + CG + G Y GV G++GLG
Sbjct: 179 RHQDSCPDGDLAMDTLTLNS---TTGKSFMLPNTGFICGNRIGGDY-PGV---GILGLGH 231
Query: 229 GEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQQ---STSFLASNGKYI 282
G +S+ + ++ LI FS C + + + ++ FGD+ + ST + G Y
Sbjct: 232 GSLSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPY- 288
Query: 283 TYIIGVETCCIGSSCLKQTSFKAI-------VDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
+Y + +G+ + + +DSG+ FT+ P+ Y + E+D V
Sbjct: 289 SYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYFPEYFYSQL--EYD--VRYA 344
Query: 336 ITSFEGYP-----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFC 390
I YP + CY+ S P P++ + F + + ++ F+ +V C
Sbjct: 345 IQQEPLYPDPTRRLRLCYRYSPDFSP--PTITMHFEGGSVELSSSNSFIRMTEDIV---C 399
Query: 391 LAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
LA + Q+ + GY + + NL +G+
Sbjct: 400 LAFATSSSE-----QDAVFGY---WQQTNLLIGY 425
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 173/401 (43%), Gaps = 74/401 (18%)
Query: 74 KMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 133
++ G F +F + L D G DL W+ +C P A + D+ + P
Sbjct: 165 ELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWL-----QCKPCKACF----DQSGPVFDP 215
Query: 134 SASSTSKHLSCSHRLCDL--GTSCQ-NPKQPCPYTMDY---YTENTSSSGLLVEDILHLI 187
S S++ K + C+ CDL C+ N + P T Y Y +++ +SG L + L +
Sbjct: 216 SQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLS-V 274
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
S D+ ++ ++IGCG G + L+GLG G +S PS L ++ I SF
Sbjct: 275 SLSDHPSSLEIR-DMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSPIGQSF 329
Query: 248 SMCF-DKDD----SGRIFFGDQGPATQ-----QSTSFLASNGKYIT-YIIGVETCCIGSS 296
S C D+ + S I FG ++ + T F+ +N T Y +G++ I
Sbjct: 330 SYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQE 389
Query: 297 CLKQTSFK----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK- 345
L + + I+DSG++ T+L ++ Y + + F +++ YP
Sbjct: 390 LLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS--------YPRAD 441
Query: 346 ------CCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCL 391
CY ++ + P++ ++F PQ N F+ +P + CL
Sbjct: 442 PFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKH--------CL 493
Query: 392 AIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
AI P DG + IG ++D ++ +LG+++++C L
Sbjct: 494 AILPTDG-MSIIGNFQQQNIHFLYDVQHARLGFANTDCSAL 533
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 144/392 (36%), Gaps = 65/392 (16%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G F + S +K L D G L W+ CD C+ C + Y E +
Sbjct: 36 GHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKP------ELKYAV 89
Query: 136 SSTSKHLSCSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNA 193
T + C+ DL + PK C Y + Y SS G+L+ D L S G N
Sbjct: 90 KCTEQR--CADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTNP 145
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RNSFSMCF 251
S+ GCG Q + P +G++GLG G++++ S L G+I ++ C
Sbjct: 146 ------TSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCI 199
Query: 252 DKDDSGRIFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSG 310
G +FFGD + P + + S + K+ + G S + + I DSG
Sbjct: 200 SSKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSG 259
Query: 311 SSFTFLPKEVYE-----------------TIAAEFDRQV------NDTITSFEGYPWKCC 347
+++T+ + Y T E DR + D I + + K C
Sbjct: 260 ATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEV--KKC 317
Query: 348 YKSSSQRLPK-LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI------QPVDGDI 400
++S S + L P + +++ V CL I P
Sbjct: 318 FRSLSLKFADGDKKATLEIPPEHYLIISQEGHV----------CLGILDGSKEHPSLAGT 367
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
IG M V++D E LGW + C +
Sbjct: 368 NLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 148/353 (41%), Gaps = 50/353 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++WI C+ C +C Y+ +D N PS S++ L C+ +C +
Sbjct: 215 DTGSDVVWIQCEPCSKC-------YSQVDPIFN---PSLSASFSTLGCNSAVCSYLDAYN 264
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y + Y + + E +++ G +++N V IGCG +G +
Sbjct: 265 CHGGGCLYKVSYGDGSYTIGSFATE----MLTFGTTSVRN-----VAIGCGHDNAGLF-- 313
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DK--DDSGRIFFGDQG-PATQQST 272
V GL+GLG G +S PS L +FS C D+ + SG + FG + P T
Sbjct: 314 -VGAAGLLGLGAGLLSFPSQLGTQ--TGRAFSYCLVDRFSESSGTLEFGPESVPLGSILT 370
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKA--IVDSGSSFTFLPKEV 320
L + Y + + + +G + L +TS + IVDSG++ T L V
Sbjct: 371 PLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPV 430
Query: 321 YETIAAEF---DRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
Y+ + F RQ+ EG + CY S L +P+V F S ++
Sbjct: 431 YDAVRDAFVAGTRQLPKA----EGVSIFDTCYDLSGLPLVNVPTVVFHFSNGASLILPAK 486
Query: 377 VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++I + FC A P D+ +G G RV FD N +G++ C
Sbjct: 487 NYMI-PMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGFALRQC 538
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 151/396 (38%), Gaps = 73/396 (18%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSP 133
++G F ++ S L D G DL+W+ C C RC ++ P
Sbjct: 81 FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFD----------P 130
Query: 134 SASSTSKHLSCSH------RL--CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 185
SST + + CS R CD G + C Y M Y + +SS+G L D L
Sbjct: 131 RRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRY-MVAYGDGSSSTGDLATDKLA 186
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 245
+ D + N V +GCG + + G D A GL+G+G G+IS+ + +A A +
Sbjct: 187 FAN--DTYVNN-----VTLGCG-RDNEGLFDSAA--GLLGVGRGKISISTQVAPA--YGS 234
Query: 246 SFSMCFDKDDSGR-------IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
F C D + R +F P + T+ L++ + Y + + +G
Sbjct: 235 VFEYCL-GDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGE-- 291
Query: 299 KQTSFK--------------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF---EG 341
+ T F +VDSG++ + ++ Y + FD + E
Sbjct: 292 RVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEH 351
Query: 342 YPWKCCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
+ CY + P + L F P N F+ PV CL
Sbjct: 352 SVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFL---PVDGGRRRAASYRRCLGF 408
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ D + IG G+RVVFD E ++G++ C
Sbjct: 409 EAADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 172/401 (42%), Gaps = 74/401 (18%)
Query: 74 KMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 133
++ G F +F + L D G DL W+ +C P A + D+ + P
Sbjct: 81 ELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWL-----QCKPCKACF----DQSGPVFDP 131
Query: 134 SASSTSKHLSCSHRLCDL--GTSCQ-NPKQPCPYTMDY---YTENTSSSGLLVEDILHLI 187
S S++ K + C+ CDL C+ N + P T Y Y +++ +SG L + L +
Sbjct: 132 SQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLS-V 190
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
S D+ ++ ++IGCG G + L+GLG G +S PS L ++ I SF
Sbjct: 191 SLSDHPSSLEIR-DMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSPIGQSF 245
Query: 248 SMCF-DKDD----SGRIFFGDQGPATQ-----QSTSFLASNGKYIT-YIIGVETCCIGSS 296
S C D+ + S I FG ++ + T F+ +N T Y +G++ I
Sbjct: 246 SYCLVDRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQE 305
Query: 297 CL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK- 345
L S I+DSG++ T+L ++ Y + + F +++ YP
Sbjct: 306 LLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS--------YPRAD 357
Query: 346 ------CCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCL 391
CY ++ + P++ ++F PQ N F+ +P + CL
Sbjct: 358 PFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKH--------CL 409
Query: 392 AIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
AI P DG + IG ++D ++ +LG+++++C L
Sbjct: 410 AILPTDG-MSIIGNFQQQNIHFLYDVQHARLGFANTDCSAL 449
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 81/366 (22%), Positives = 142/366 (38%), Gaps = 68/366 (18%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL WI C +C P + +++ PS SST ++ SC + ++
Sbjct: 96 DTGSDLTWIHCLPCKCYPQTIPFFH----------PSRSSTYRNASCVSAPHAMPQIFRD 145
Query: 158 PKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
K C Y + Y + +++ G+L E+ L + D + + +++ GCG SG
Sbjct: 146 EKTGNCQYHLRY-RDFSNTRGILAEEKLTFETSDDGLIS---KQNIVFGCGQDNSGF--- 198
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTS 273
G++GLG G S+ + RN FS CF G T
Sbjct: 199 -TKYSGVLGLGPGTFSI--------VTRNFGSKFSYCF----------GSLTNPTYPHNI 239
Query: 274 FLASNGKYIT------------YIIGVETCCIGSSCLK---------QTSFKAIVDSGSS 312
+ NG I Y + ++ G L ++ ++D+G S
Sbjct: 240 LILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCS 299
Query: 313 FTFLPKEVYETIAAEFDR---QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 369
T L +E YET++ E D +V + ++ Y C + L P V F
Sbjct: 300 PTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGA 359
Query: 370 SFVVN-NPVFVIYGTQVVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
++ +FV ++ FCLA+ D+ IG Y V ++ +K+ + +
Sbjct: 360 ELALDVESLFV--SSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRT 417
Query: 428 NCQDLN 433
+C+ ++
Sbjct: 418 DCEIID 423
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 81/366 (22%), Positives = 143/366 (39%), Gaps = 68/366 (18%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G DL WI C +C P + +++ PS SST ++ SC + ++
Sbjct: 106 DTGSDLTWIQCLPCKCYPQTIPFFH----------PSRSSTYRNASCESAPHAMPQIFRD 155
Query: 158 PKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
K C Y + Y + +++ G+L ++ L + + + + +++ GCG SG
Sbjct: 156 EKTGNCRYHLRY-RDFSNTRGILAKEKLTFQTSDEGLIS---KPNIVFGCGQDNSGF--- 208
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTS 273
G++GLG G S+ + RN FS CF G T
Sbjct: 209 -TQYSGVLGLGPGTFSI--------VTRNFGSKFSYCF----------GSLIDPTYPHNF 249
Query: 274 FLASNGKYIT------------YIIGVETCCIGSSCLK---------QTSFKAIVDSGSS 312
+ NG I Y + ++ +G L ++ ++D+G S
Sbjct: 250 LILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCS 309
Query: 313 FTFLPKEVYETIAAEFDR---QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 369
T L +E YET++ E D +V + +E Y C + L P V F
Sbjct: 310 PTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGA 369
Query: 370 SFVVN-NPVFVIYGTQVVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
++ +FV ++ FCLA+ D+ IG Y V ++ +K+ + +
Sbjct: 370 ELALDVESLFV--SSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRT 427
Query: 428 NCQDLN 433
+C+ L+
Sbjct: 428 DCEILD 433
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 149/369 (40%), Gaps = 54/369 (14%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+ WI C C C P +N P ASST C++ + C
Sbjct: 157 DTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST-----CTNVYQGVKPFCS 211
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ---ASVIIGCGMKQSGG 213
+ C +++ Y + + SSGLL + I+G + +++ +GC G
Sbjct: 212 PSGRTCLFSIQY-GDGSLSSGLLA---METIAGNTPNFGDGEPVKLSNITLGCADIDREG 267
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DK----DDSGRIFFGDQG--- 265
G + GL+G+ IS PS L+ FS CF DK + SG +FFG+
Sbjct: 268 LPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCFPDKIAHLNSSGLVFFGESDIIS 323
Query: 266 ------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT-----------SFKAIVD 308
P Q AS Y ++G+ + S L + S I+D
Sbjct: 324 PYLRYTPLVQNPAVPSASLDYYYVGLVGIS---VDESRLPLSHKNFDIDKVTGSGGTIID 380
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKLPSVKLM 364
SG++FT+L K ++ + EF + + + + CY +++ LPS+ L
Sbjct: 381 SGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALESTILPSITLH 440
Query: 365 FPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDGDI--GTIGQNFMTGYRVVFDRENL 420
F V+ N+ + + ++ T CLA + GDI IG V +D E L
Sbjct: 441 FRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL-MSGDIPFNIIGNYQQQNLWVEYDLEKL 499
Query: 421 KLGWSHSNC 429
+LG + + C
Sbjct: 500 RLGIAPAQC 508
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 110/418 (26%), Positives = 165/418 (39%), Gaps = 58/418 (13%)
Query: 38 EVKALGVSKNR----NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTM 93
++ A+GVSK N +S A+ + + SS + +G F L +
Sbjct: 110 QLAAMGVSKAEMKPLNGSSIDARFDAKDFS---SSIISGLAQGSGEYFTRLGVGTPPRYT 166
Query: 94 SLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC--- 149
+ D G D++WI C C +C Y D N P+ASST + + C+ LC
Sbjct: 167 YMVLDTGSDIMWIQCLPCAKC-------YGQTDPLFN---PAASSTYRKVPCATPLCKKL 216
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
D+ + C+N K+ C Y + Y + + E + + V V +GCG
Sbjct: 217 DI-SGCRN-KRYCEYQVSYGDGSFTVGDFSTETL---------TFRGQVIRRVALGCGHD 265
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG---RIFFGDQG 265
G + + GL+GLG G +S PS FS C D+ SG + FG
Sbjct: 266 NEGLF---IGAAGLLGLGRGSLSFPS--QTGAQFSKRFSYCLVDRSASGTASSLIFGKAA 320
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-------------IVDSGSS 312
+ L SN K T+ VE I + TS A I+DSG+S
Sbjct: 321 IPKSAIFTPLLSNPKLDTFYY-VELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTS 379
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
T L Y T+ F R + S G+ + CY S + K+P++ F
Sbjct: 380 VTRLVDSAYSTMRDAF-RVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQGGAHI 438
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++I T FC A G + IG GYRVVFD ++G+ +C
Sbjct: 439 SLPATNYLIPVDSSAT-FCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 139/358 (38%), Gaps = 59/358 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-- 154
D G D++WI C C RC S ++ P S + ++C LC S
Sbjct: 144 DTGSDIVWIQCAPCKRCYAQSDPVFD----------PRKSRSFASIACRSPLCHRLDSPG 193
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C KQ C Y + Y + + E + + + A V +GCG G +
Sbjct: 194 CNTQKQTCMYQVSYGDGSFTFGDFSTETL---------TFRRTRVARVALGCGHDNEGLF 244
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR---IFFGDQGPATQQ 270
V GL+GLG G +S PS + + FS C D+ S + + FGD +
Sbjct: 245 ---VGAAGLLGLGRGRLSFPSQTGRR--FNHKFSYCLVDRSASSKPSSMVFGDSAVSRTA 299
Query: 271 STSFLASNGK----YITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPK 318
+ L SN K Y ++G+ + + FK I+DSG+S T L +
Sbjct: 300 RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTR 359
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNNSF 371
Y F ++ + + + C+ S + K+P+V L F P +N
Sbjct: 360 PAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYL 419
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ PV FCLA G + IG G+RVV+D ++G++ C
Sbjct: 420 I---PV------DTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGC 468
>gi|154311375|ref|XP_001555017.1| hypothetical protein BC1G_06540 [Botryotinia fuckeliana B05.10]
gi|114149215|gb|AAR87747.3| aspartic proteinase precursor [Botryotinia fuckeliana]
gi|347829155|emb|CCD44852.1| similar to aspartic-type endopeptidase opsB [Botryotinia
fuckeliana]
Length = 482
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 142/343 (41%), Gaps = 58/343 (16%)
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVIIGCG---- 207
T C PC T YT N+SS+ V ++ G A + V + IG
Sbjct: 105 TLCSRKTNPCQ-TAGTYTANSSSTYAYVASDFNISYVDGSGASGDYVTDTFTIGSATLDK 163
Query: 208 MKQSGGYLDGVAPDGLIGLG--LGEISV-----------PSLLAKAGLIR-NSFSMCFDK 253
++ GY +P+G++G+G + E+ V P+ + GLI N+FS+ +
Sbjct: 164 LQFGIGYTSS-SPEGILGIGYEINEVQVGRAGKKAYNNLPAQMVADGLINSNAFSLWLND 222
Query: 254 DD--SGRIFFGDQGPATQQSTSFLAS------NGKYITYIIGVETCCIGSSCLKQ-TSFK 304
D +G I FG G T Q L + +G Y ++I + +G + + Q +
Sbjct: 223 LDASTGSILFG--GVDTAQFHGQLETLPIEKESGYYAEFLITLTEVMLGDTVIAQDQALA 280
Query: 305 AIVDSGSSFTFLP----KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-- 358
++DSGSS T+LP + +YE + A++D EG + C +++
Sbjct: 281 VLLDSGSSLTYLPDAMAEAIYEQVEAQYDAS--------EGAAYVPCSLATNTSALNFTF 332
Query: 359 --PSVKLMFPQNNSFVVNNPVFVIYGTQVV----TGFCL-AIQPVDGDIGTIGQNFMTGY 411
P++++ N V+ PV G Q+ T CL I P +G F+
Sbjct: 333 TSPTIQVTM---NELVI--PVTSTTGQQLQFTDGTAACLFGIAPAGDSTSVLGDTFIRSA 387
Query: 412 RVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPAN 454
+V+D +N ++ + +N + T PS L AN
Sbjct: 388 YIVYDLDNNEISLAQTNFNATSTSVVEITTGTTAVPSATLVAN 430
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 84/373 (22%), Positives = 149/373 (39%), Gaps = 74/373 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G DL+W C C C D+ + P SS+ + CS LC+ ++
Sbjct: 126 DTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSN 175
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-G 213
C K C Y + Y + +S+ GLL + +NS+ + + GCG++ G G
Sbjct: 176 CNEDKDSCEY-LYTYGDYSSTRGLLATETFTFED------ENSI-SGIGFGCGVENEGDG 227
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQ 269
+ G GL+GLG G +S+ S L + FS C D + S +F G
Sbjct: 228 FSQG---SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEASSSLFIGSLASGIV 279
Query: 270 QST------------SFLASNGKYITYIIGVETCCIGSSCL--KQTSFK--------AIV 307
T S L + + Y + ++ +G+ L ++++F+ I+
Sbjct: 280 NKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMII 339
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKL----P 359
DSG++ T+L + ++ + EF +++ + C+K + + +PKL
Sbjct: 340 DSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFK 399
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
L P N V ++ V+ CLA+ +G + G + V+ D E
Sbjct: 400 GADLELPGENYMVADSSTGVL---------CLAMGSSNG-MSIFGNVQQQNFNVLHDLEK 449
Query: 420 LKLGWSHSNCQDL 432
+ + + C L
Sbjct: 450 ETVTFVPTECGKL 462
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 144/368 (39%), Gaps = 67/368 (18%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D G DL+W +CAP + + L + ++P S++ + + C+ +LC L C
Sbjct: 120 DTGSDLIW-----TQCAPCA----SCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGC 170
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ P C Y +Y + E SGGD + + GCG G
Sbjct: 171 EMPDT-CTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMT----VPLGFGCGSMNVGSLN 225
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---IFFGD-----QGPA 267
+G G++G G +S+ S L+ IR FS C SGR + FG G A
Sbjct: 226 NG---SGIVGFGRNPLSLVSQLS----IRR-FSYCLTSYGSGRKSTLLFGSLSGGVYGDA 277
Query: 268 TQ--QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTF 315
T Q+T L S Y + + +G+ L+ +++F IVDSG++ T
Sbjct: 278 TGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTL 337
Query: 316 LPKEVYETIAAEFDRQVN----------DTITSFEGYPWKCCYKSSSQRLPKL----PSV 361
LP V + F +Q+ D + W+ +S +P++
Sbjct: 338 LPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDA 397
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
L P+ N +V+++ CL + D TIG RV++D E
Sbjct: 398 DLDLPRRN-YVLDD--------HRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAET 448
Query: 422 LGWSHSNC 429
L ++ + C
Sbjct: 449 LSFAPAQC 456
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 126/335 (37%), Gaps = 38/335 (11%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC--SHRL 148
K L D G L W +C P S Y + +Y P+AS T + C SH
Sbjct: 69 KKQFLHMDTGSSLTW-----TQCFPCSDCYAQKI---YPKYRPAASITYRDAMCEDSHPK 120
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
+ + + C Y +Y + T+ G L ++++ + D K V GC
Sbjct: 121 SNPHFAFDPLTRICTY-QQHYLDETNIKGTLAQEMI-TVDTHDGGFKRV--HGVYFGCNT 176
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQ 264
G Y G G++GLG+G+ S+ G + FS C + S + GD
Sbjct: 177 LSDGSYFTGT---GILGLGVGKYSI------IGEFGSKFSFCLGEISEPKASHNLILGDG 227
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI 324
T + G I +E+ +G + VD+GS+ + L +Y
Sbjct: 228 ANVQGHPTVINITEGHTI---FQLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKF 284
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN-NPVFVIYGT 383
FD + S+E P C + +RL K+ V F VN + +F+ G
Sbjct: 285 VDAFDDLIGSRPLSYE--PTLCYKADTIERLEKM-DVGFKFDVGAELSVNIHNIFIQQGP 341
Query: 384 QVVTGFCLAIQPVDGDIG--TIGQNFMTGYRVVFD 416
+ CLAIQ IG M GY V +D
Sbjct: 342 PEIR--CLAIQNNKESFSHVIIGVIAMQGYNVGYD 374
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 139/354 (39%), Gaps = 55/354 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DL-GTS 154
D G DL+W C+ C +C +N P SS+ L C + C DL S
Sbjct: 114 DTGSDLIWTQCEPCTQCFSQPTPIFN----------PQDSSSFSTLPCESQYCQDLPSES 163
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-G 213
C N C YT Y + +S+ G + + + S ++ GCG G G
Sbjct: 164 CYND---CQYTYGY-GDGSSTQGYMATETF--------TFETSSVPNIAFGCGEDNQGFG 211
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQG---PA 267
+G GLIG+G G +S+PS L FS C S + G P
Sbjct: 212 QGNGA---GLIGMGWGPLSLPSQLGVG-----QFSYCMTSSGSSSPSTLALGSAASGVPE 263
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLP 317
ST+ + S+ Y I ++ +G L ++F+ I+DSG++ T+LP
Sbjct: 264 GSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLP 323
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNP 376
++ Y +A F Q+N + C++ S ++P + + F +
Sbjct: 324 QDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEEN 383
Query: 377 VFVIYGTQVVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V + V+ CLA+ I G +V++D +NL + + + C
Sbjct: 384 VLISPAEGVI---CLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 139/374 (37%), Gaps = 71/374 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--S 154
D G DL+W C C C R L PS SST L CS +CD T S
Sbjct: 433 DTGSDLVWTQCRPCPVC----------FSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSS 482
Query: 155 CQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C Q C Y Y + ++ L E + G + + GCG+ +
Sbjct: 483 CGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTG---QATVPDLAFGCGLFNN 539
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSGRIFFG------ 262
G + G+ G G G +S+PS L ++FS CF + + G
Sbjct: 540 GIFTSN--ETGIAGFGRGALSLPSQLKV-----DNFSHCFTAITGSEPSSVLLGLPANLY 592
Query: 263 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--------AIVDSGSS 312
QST + + Y + ++ +GS+ L +++F I+DSG+
Sbjct: 593 SDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTG 652
Query: 313 FTFLPKEVYETIAAEFDRQV-----NDTITSFEGYPWKCCYKSSSQRL--PKLPSVKLMF 365
T LP++ Y+ + F QV N T +S + C+ S R P +P + L F
Sbjct: 653 MTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLS----RLCFSFSVPRRAKPDVPKLVLHF 708
Query: 366 -------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
P+ N F G V CLAI D D+ IG V++D
Sbjct: 709 EGATLDLPRENYMF----EFEDAGGSVT---CLAINAGD-DLTIIGNYQQQNLHVLYDLV 760
Query: 419 NLKLGWSHSNCQDL 432
L + + C L
Sbjct: 761 RNMLSFVPAQCNRL 774
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/373 (22%), Positives = 149/373 (39%), Gaps = 74/373 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G DL+W C C C D+ + P SS+ + CS LC+ ++
Sbjct: 125 DTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSN 174
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-G 213
C K C Y + Y + +S+ GLL + +NS+ + + GCG++ G G
Sbjct: 175 CNEDKDACEY-LYTYGDYSSTRGLLATETFTFED------ENSI-SGIGFGCGVENEGDG 226
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQ 269
+ G GL+GLG G +S+ S L + FS C D + S +F G
Sbjct: 227 FSQG---SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEASSSLFIGSLASGIV 278
Query: 270 QST------------SFLASNGKYITYIIGVETCCIGSSCL--KQTSFK--------AIV 307
T S L + + Y + ++ +G+ L ++++F+ I+
Sbjct: 279 NKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMII 338
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKL----P 359
DSG++ T+L + ++ + EF +++ + C+K + + +PK+
Sbjct: 339 DSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK 398
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
L P N V ++ V+ CLA+ +G + G + V+ D E
Sbjct: 399 GADLELPGENYMVADSSTGVL---------CLAMGSSNG-MSIFGNVQQQNFNVLHDLEK 448
Query: 420 LKLGWSHSNCQDL 432
+ + + C L
Sbjct: 449 ETVSFVPTECGKL 461
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 147/365 (40%), Gaps = 64/365 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ C +D+ + P+ SST + L CS C+
Sbjct: 110 DTGSDLIWTQCAPCLLC----------VDQPTPYFDPANSSTYRSLGCSAPACNALYYPL 159
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
++ C Y +Y ++ S++G+L + G N + ++ + GCG +G +
Sbjct: 160 CYQKTCVYQY-FYGDSASTAGVLANETFTF---GTNDTRVTLP-RISFGCGNLNAGSLAN 214
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG------DQGPATQ 269
G G++G G G +S L+++ G R S+ + F R++FG +T
Sbjct: 215 G---SGMVGFGRGSLS---LVSQLGSPRFSYCLTSFLSPVRSRLYFGAYATLNSTNASTV 268
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCL-----------KQTSFKAIVDSGSSFTFLPK 318
QST F+ + Y + + +G + L + I+DSG++ T+L +
Sbjct: 269 QSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAE 328
Query: 319 EVYETIAAEFDRQVNDTITSF---EGYPWKCCYK--SSSQRLPKLPSVKLMF-------P 366
Y + F +N T+ E C++ ++ LP + L F P
Sbjct: 329 PAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELP 388
Query: 367 QNNSFVVNNPVFVIYGTQVVTGFCLAIQP-VDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
N +V+ G CLA+ DG I IG + V++D EN L +
Sbjct: 389 LQNYMLVD---------PSTGGLCLAMATSSDGSI--IGSYQHQNFNVLYDLENSLLSFV 437
Query: 426 HSNCQ 430
+ C
Sbjct: 438 PAPCN 442
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 140/357 (39%), Gaps = 48/357 (13%)
Query: 93 MSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD- 150
+SL D G DL W C CVR D+ ++PS S++ ++SCS C
Sbjct: 146 LSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSYYNVSCSSAACGS 196
Query: 151 ----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
G + C Y + Y + + S G L +D L S + V V GC
Sbjct: 197 LSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKDKFTLTS-------SDVFDGVYFGC 248
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQ 264
G + + G GVA GL+GLG ++S PS A A FS C S G + FG
Sbjct: 249 G-ENNQGLFTGVA--GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSASYTGHLTFGSA 303
Query: 265 G----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
G TSF N IT +G + I S+ A++DSG+ T
Sbjct: 304 GISRSVKFTPISTITDGTSFYGLNIVAIT--VGGQKLPIPSTVFSTPG--ALIDSGTVIT 359
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
LP + Y + + F +++ T+ C+ S + +P V F + VV
Sbjct: 360 RLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF--SGGAVVE 417
Query: 375 NPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I+ ++ CLA D + G VV+D ++G++ + C
Sbjct: 418 LGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 474
>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 873
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 84/381 (22%), Positives = 153/381 (40%), Gaps = 62/381 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G L PCD CV C + +++ +K S + C C
Sbjct: 64 DTGSGLTAFPCDKCVDCGTHTDPKFDA---------------TKSTSINFVQCKYEEGCD 108
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDIL---HLISGGDNALKNSVQASVIIGCGMKQSGG 213
+ Y+E + ++++D++ ++ S + GC +++G
Sbjct: 109 TCRDNLCVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMIMRRYGIRFKFGCQTRETGL 168
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKDDSGRIFFGDQ--GPATQQ 270
++ V +G++GLG+G ++ + + KA + + F++CF + + G T+
Sbjct: 169 FITQV-ENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQKGGSFVIGGVDYSHHTTKI 227
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK----AIVDSGSSFTFLPKEVYETI 324
+ + LA +G Y I V+ IG L+ FK AIVDSG++ T+ P
Sbjct: 228 AYTPLAKHGTS-NYPIEVKDVRIGGISLQVDAEHFKSGRGAIVDSGTTDTYFPSAAATPF 286
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN-----------SFVV 373
F R IT E K + + + LP+V L+ + +++
Sbjct: 287 QEAFKR-----ITGVEYNENKMNL--TPEMVETLPNVSLIIAGEDGEDFEISLNASDYIL 339
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLN 433
N+ +GT L G + +G + M GY V+FD E ++G++ + C
Sbjct: 340 NDSNHHFFGT-------LHFSERRGAV--LGASIMMGYDVIFDLEKKRVGFAEATC---- 386
Query: 434 DGTKSPLTPGPGTPSNPLPAN 454
DG P+T P P P+ +
Sbjct: 387 DGKGHPITL-PLKPLAPIAKD 406
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 140/380 (36%), Gaps = 65/380 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
+K L D G L W+ CD C+ C + Y E + T + C+
Sbjct: 48 AKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKP------ELKYAVKCTEQR--CADL 99
Query: 148 LCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQASVIIG 205
DL + PK C Y + Y SS G+L+ D L S G N S+ G
Sbjct: 100 YADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTNP------TSIAFG 151
Query: 206 CGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGD 263
CG Q + P +G++GLG G++++ S L G+I ++ C G +FFGD
Sbjct: 152 CGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGD 211
Query: 264 -QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYE 322
+ P + + S + K+ + G S + + I DSG+++T+ + Y
Sbjct: 212 AKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYFALQPYH 271
Query: 323 -----------------TIAAEFDRQV------NDTITSFEGYPWKCCYKSSSQRLPK-L 358
T E DR + D I + + K C++S S +
Sbjct: 272 ATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEV--KKCFRSLSLKFADGD 329
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI------QPVDGDIGTIGQNFMTGYR 412
L P + +++ V CL I P IG M
Sbjct: 330 KKATLEIPPEHYLIISQEGHV----------CLGILDGSKEHPSLAGTNLIGGITMLDQM 379
Query: 413 VVFDRENLKLGWSHSNCQDL 432
V++D E LGW + C +
Sbjct: 380 VIYDSERSLLGWVNYQCDRI 399
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 89/391 (22%), Positives = 155/391 (39%), Gaps = 56/391 (14%)
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYY 121
Q+ LSS + Q + ++ GS M++ D G DL W+ C+ C+ C +
Sbjct: 51 QIPLSSGINLQTLN-----YIVTMGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIF 105
Query: 122 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 181
+ SST + L + + G NP C Y ++Y + ++ L VE
Sbjct: 106 KPSTSSSYQSVSCNSSTCQSLQFATG--NTGACGSNPS-TCNYVVNYGDGSYTNGELGVE 162
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
L GG + + + GCG + + G GV+ GL+GLG +S+ S
Sbjct: 163 ---QLSFGGVSV------SDFVFGCG-RNNKGLFGGVS--GLMGLGRSYLSLVS--QTNA 208
Query: 242 LIRNSFSMCF---DKDDSGRIFFGDQGPATQQSTSFLAS--------NGKYITYIIGVET 290
FS C + SG + G++ + T + + YI + G++
Sbjct: 209 TFGGVFSYCLPTTESGASGSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGID- 267
Query: 291 CCIGSSCLKQTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---- 343
+ L+ SF ++DSG+ T LP VY+ + A F +Q F G+P
Sbjct: 268 --VDGVALQVPSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQ-------FTGFPSAPG 318
Query: 344 ---WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG-- 398
C+ + +P++ + F N V+ + + CLA+ +
Sbjct: 319 FSILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAY 378
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
D IG RV++D + K+G++ +C
Sbjct: 379 DTAIIGNYQQRNQRVIYDTKQSKVGFAEESC 409
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 148/352 (42%), Gaps = 47/352 (13%)
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
+SL D G DL W +C P S Y+ + N PS+SST +++SCS +C+
Sbjct: 145 LSLVFDTGSDLTW-----TQCEPCLGSCYSQKEPKFN---PSSSSTYQNVSCSSPMCEDA 196
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
SC C Y++ Y + + + G L ++ L + + V V GCG G
Sbjct: 197 ESCS--ASNCVYSI-VYGDKSFTQGFLAKEKFTLTN-------SDVLEDVYFGCGENNQG 246
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMC---FDKDDSGRIFFGDQGPAT 268
+ DG+ GL SL A+ N+ FS C F + +G + FG G +
Sbjct: 247 LF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISE 300
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQT--SFK---AIVDSGSSFTFLPKEVYET 323
+ ++S Y I + +G L T SF AI+DSG+ FT LP +VY
Sbjct: 301 SVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAE 360
Query: 324 IAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
+ + F +++ + S GY + CY + P++ F + V + G
Sbjct: 361 LRSVFKEKMS-SYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSF-------AGSTVVELDG 412
Query: 383 TQV-----VTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + ++ CLA D G T VV+D ++G++ + C
Sbjct: 413 SGISLPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 137/364 (37%), Gaps = 49/364 (13%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS 154
D G DL W+ PC C P ++ PS SST + CS C +G
Sbjct: 140 DTGSDLTWVQCLPCPDSSCYPQQEPLFD----------PSKSSTYVDVPCSAPECHIGGV 189
Query: 155 CQNP--KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
Q C Y++ Y E + + G L E+ L A V+ GC +
Sbjct: 190 QQTRCGATSCEYSVKYGDE-SETHGSLAEETFTLSPPSPLA---PAATGVVFGCSHEYIS 245
Query: 213 GYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNS----FSMCFDKDDS--GRIFFGDQG 265
+ D G+ GL+GLG G+ S+L++ NS FS C S G + G
Sbjct: 246 VFNDTGMGVAGLLGLGRGD---SSILSQTRRSINSGGGVFSYCLPPRGSSTGYLTIGGGA 302
Query: 266 PATQQSTSFLASNGKYIT-------YIIGVETCCIGSSCLK----QTSFKAIVDSGSSFT 314
A QQ S L+ T Y++ + + + + S A++DSG+ T
Sbjct: 303 AAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAVIDSGTVVT 362
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYP--WKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
+P Y + EF + EG CY + Q + P V L F
Sbjct: 363 HMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARID 422
Query: 373 VNNPVFVIY------GTQVVTGFCLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKLGWS 425
V+ ++ Q +T CLA P + + +G Y VVFD + ++G+
Sbjct: 423 VDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFG 482
Query: 426 HSNC 429
+ C
Sbjct: 483 PNGC 486
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 148/353 (41%), Gaps = 71/353 (20%)
Query: 1 MNRISLTI--YLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSW 53
MN SL I Y ++ ++++ S FS +LIHR S + ++N+ NA
Sbjct: 1 MNTCSLLILFYFSLCFIISLSHALNN-GFSVELIHRDSSKSPLYQPTQNKYQHIVNAARR 59
Query: 54 PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN--DFGCDLLWIPCD-C 110
++ +Y+ L++ Q + ++ M + S G+ L D G D++W+ C+ C
Sbjct: 60 SINRANHFYKTALTNTPQSTVIPDHGEYLMTY-SVGTPPFKLYGIADTGSDIVWLQCEPC 118
Query: 111 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT 170
C YN + ++ PS SST K++ CS LC G
Sbjct: 119 KEC-------YN---QTTPKFKPSKSSTYKNIPCSSDLCKSG------------------ 150
Query: 171 ENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 230
G L D L L S + + +IGCG + + +G A G++GLG G
Sbjct: 151 ----QQGNLSVDTLTLESSTGHPIS---FPKTVIGCGTDNTVSF-EG-ASSGIVGLGGGP 201
Query: 231 ISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQS--TSFLASNGKYIT 283
S+ + L + I FS C + + + ++ FGD + ++ + +
Sbjct: 202 ASLITQLGSS--IDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVF 259
Query: 284 YIIGVETCCIGSSCLKQTSFKA----------IVDSGSSFTFLPKEVYETIAA 326
Y + +E +G+ K+ F+ I+DSG++ T +P +VY + +
Sbjct: 260 YYLTLEAFSVGN---KRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLES 309
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 101/440 (22%), Positives = 165/440 (37%), Gaps = 72/440 (16%)
Query: 15 LLTESSGAETVMFSTKLIHRFSEEVKALGVSKN-----RNATSWPAKKSFEYYQVLLSSD 69
L+ ++ + F+ LIHR S + ++ RNA + F + +
Sbjct: 19 FLSNANAKSKLGFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDI----- 73
Query: 70 VQKQKMKTGPQFQMLFPS-QGSKTMSLGN---------DFGCDLLWIPCD-CVRCAPLSA 118
QK PQ + S + +SLG D G DLLW C C C
Sbjct: 74 SQKDASDNAPQIDLTSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDC----- 128
Query: 119 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSS 175
Y +D + P ASST K +SCS C + SC C Y+ Y + + +
Sbjct: 129 --YTQVDP---LFDPKASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTS-YGDRSYT 182
Query: 176 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 235
G + D L L G + ++IIGCG +G + G+ S
Sbjct: 183 KGNIAVDTLTL---GSTDTRPVQLKNIIIGCGHNNAGTF-----NKKGSGIVGLGGGAVS 234
Query: 236 LLAKAG-LIRNSFSMCF-----DKDDSGRIFFGDQG--PATQQSTSFLASNGKYITYIIG 287
L+ + G I FS C + D + +I FG T ++ L + + Y +
Sbjct: 235 LITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLT 294
Query: 288 VETCCIGSSCLKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 337
+++ +GS K+ + I+DSG++ T LP E Y + ++
Sbjct: 295 LKSISVGS---KEVQYPGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK 351
Query: 338 SFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQ--P 395
CY ++ K+P++ + F + + + FV +V C A + P
Sbjct: 352 QDPQTGLSLCYSATGDL--KVPAITMHFDGADVNLKPSNCFVQISEDLV---CFAFRGSP 406
Query: 396 VDGDIGTIGQ-NFMTGYRVV 414
G + Q NF+ GY V
Sbjct: 407 SFSIYGNVAQMNFLVGYDTV 426
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 138/365 (37%), Gaps = 50/365 (13%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCA-PLSASY--YNSLDRDLNEYSPSASSTSKHLSCS 145
K L D G DL W+ CD C C P + Y + L + ++ + S H
Sbjct: 75 KVYDLDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPHGDLVKCVDPLCAAIQSAPNH---- 130
Query: 146 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
C P + C Y ++Y + +S LL ++I + G A + + G
Sbjct: 131 --------HCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLA-----RPMLAFG 177
Query: 206 CGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 264
CG Q+ G + G++GLG G S+ S L GLIRN C G +FFGDQ
Sbjct: 178 CGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGFLFFGDQ 237
Query: 265 --GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYE 322
P+ T L S+ Y G + I DSGSS+T+ + ++
Sbjct: 238 LIPPSGVVWTPLLQSSSAQ-HYKTGPADLFFDRKTTSVKGLELIFDSGSSYTYFNSQAHK 296
Query: 323 TI---------AAEFDRQVND---TITSFEGYPWKCCYKSSSQRLPKLPSVK------LM 364
+ R D I P+K + +S P L S L
Sbjct: 297 ALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFKPLLLSFTKSKNSPLQ 356
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
P +V V G ++ G + + G+ IG + V++D E ++GW
Sbjct: 357 LPPEAYLIVTKHGNVCLG--ILDGTEIGL----GNTNIIGDISLQDKLVIYDNEKQQIGW 410
Query: 425 SHSNC 429
+ +NC
Sbjct: 411 ASANC 415
>gi|281200780|gb|EFA74998.1| putative aspartyl protease [Polysphondylium pallidum PN500]
Length = 394
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 157/371 (42%), Gaps = 74/371 (19%)
Query: 89 GSKTMSLGNDFGCDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G+ T ++ D G L+ IP +C C D Y P+ S SK +SC
Sbjct: 48 GNHTFTVQVDTGSSLMAIPMVNCNTC------------HDRPSYDPTHSQYSKVVSCFSE 95
Query: 148 LCDLGT-----SCQN-PKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQ 199
C LG+ C+N + C + + Y + + SG + +D+++L +SG N N ++
Sbjct: 96 HC-LGSGSAPPQCKNRAEDDCDFVI-LYGDGSRVSGKIYQDVVNLSGLSGIANFGANRIE 153
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLG-EISVPSL---LAKAGLIRNSFSMCFDKDD 255
G + DG++G G + VP++ L +A ++N F+M D +
Sbjct: 154 T------------GDFEYPRADGIVGFGRSCKTCVPTVFESLVQAHGLKNIFAMSMDYEG 201
Query: 256 SGRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKAIVDS 309
G + G+ P+ Q T L +G + Y I + + + + + IVDS
Sbjct: 202 RGTLSLGELNPSNHIGEIQYTP-LFEDGPF--YNIKPTNFKVDDTVILPRLLGRQVIVDS 258
Query: 310 GSSFTFLPKEVYETIAAEFDRQ------VNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 363
GSS L Y+ + F + + D+ + +G CY S+S L LP++ L
Sbjct: 259 GSSALSLASGAYDALVHHFRKNYCHVAGICDSPSILDG---SICYNSASS-LDLLPTIYL 314
Query: 364 MF---------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVV 414
F P+N ++ P+ T +G+C I D +G FM GY V
Sbjct: 315 TFEGGVKVAVPPKN--YLTKAPL-----TNGASGYCWMIDRADPSTTILGDVFMRGYYTV 367
Query: 415 FDRENLKLGWS 425
FD E ++G++
Sbjct: 368 FDNEEKRIGFA 378
>gi|328865865|gb|EGG14251.1| hypothetical protein DFA_12021 [Dictyostelium fasciculatum]
Length = 698
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 135/320 (42%), Gaps = 34/320 (10%)
Query: 137 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNA 193
S+++ LSC C G S P P T + Y + + G LV D + + A
Sbjct: 164 SSAETLSCRSSQCKRGCSFITPYASHPSTCGFKISYQDGSFIGGDLVTDYVTVAGLTVKA 223
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP----DGLIGLGLGEIS------VPSLLAKAGLI 243
+ ++QA + QS D A DG++GL + + SLL K I
Sbjct: 224 IFGNMQAQSL---NFSQSSCPADPFAAPRKRDGIMGLSYQSLDPNNGDDIFSLLVKTHEI 280
Query: 244 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA--SNGKYITYIIGVETCCIGSSCLKQT 301
NSFSMC D+ G + G P + +N +Y Y + I + L
Sbjct: 281 HNSFSMCL-SDEGGMLVLGGVDPKMNSTLMKYTPITNERY--YSVNCTGLRIDGNNLNSK 337
Query: 302 SFK--AIVDSGSSFTFLPKEVYETIAAEFDRQVNDT--ITSFEGYPWKC-CYKSSSQRLP 356
SF+ +IVDSG++ FL +++ + + + IT+ W C+ S ++L
Sbjct: 338 SFQSISIVDSGTTIMFLKLDIFNDLIYYLVQHYSHLPGITTQSESLWNHQCFTLSDRQLE 397
Query: 357 KLPSVKLMFPQNNS--FVVNNPVFVIYGTQVVTGFCLAIQ--PVDGDIGT-IGQNFMTGY 411
K P++ ++FP F V P +Y ++ +C + P+ IG + GY
Sbjct: 398 KYPTISMVFPNTEGGLFEVAIPP-NLYMIKIDDMYCFGFEKLPIKSPYSVLIGDVALQGY 456
Query: 412 RVVFDRENLKLGWSH--SNC 429
V ++RE+ +G++ NC
Sbjct: 457 NVHYNREDGSIGFAKVTDNC 476
>gi|145348493|ref|XP_001418682.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578912|gb|ABO96975.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 464
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 93/412 (22%), Positives = 162/412 (39%), Gaps = 77/412 (18%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC---DCVRCAPLSASYYN-SLDRDLNEYS 132
+G +F + G+++ L D G L + PC D C YY+ L D +
Sbjct: 35 SGHEFSLTVTLPGAQSFDLIVDTGSPLTYFPCVGCDAELCGYHEHQYYDWRLSNDFRLLN 94
Query: 133 PSASSTSKHLSCSHRLCDLGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
S ++ CD N C + + Y + G ++ED+ +S G
Sbjct: 95 ASMNAADA------AFCDAMPVAHNVSADGECLFGLGYL-DGARGGGSMIEDV---VSVG 144
Query: 191 DNALKNSVQASVIIGCG--MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR-NSF 247
D A +I GCG ++ GG+ DG+ G G + + LAKAG+I + F
Sbjct: 145 DEL----SPAKMIFGCGGVVEADGGF---DRQDGMAGFSRGNTAFHTQLAKAGVINAHVF 197
Query: 248 SMCFDKDDS-------GRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCC--IGSSC 297
C + + GR FG D P + T L ++ + V T +G +
Sbjct: 198 GFCSEGSGTDTAMLSLGRYDFGRDLAPLSY--TRILGADD------LAVRTMSWKLGEAI 249
Query: 298 LKQTS-FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSS 352
+ +S ++DSG++ LP + + + Q+ T E + + C+ S++
Sbjct: 250 IASSSNVYTVLDSGTTLVLLPPAMRDDFITKLVAQMAATHPELELFDDEDLGQMCFSSAT 309
Query: 353 ---------QRLPKL-----PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG 398
+ PKL P + L+ P N +N+ +++ + +CL I D
Sbjct: 310 PVLTAKLRDEWFPKLAITYDPDITLILPSEN--YLNSHLYIPHT------YCLGIDESDD 361
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNP 450
+GQ + + +D EN ++G + C++L P TP NP
Sbjct: 362 GTILLGQQALRNTFIEYDLENDRVGVVVAQCENLRK------KFAPDTPHNP 407
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 147/364 (40%), Gaps = 58/364 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ + L D D+ WIPC CV C +A +SP+ S++ K++SCS
Sbjct: 109 AQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSCSAPQ 156
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C + + C + + Y + + +++ L +D + L + A GC
Sbjct: 157 CKQVPNPACGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKAFT--------FGCVN 206
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQ 264
K +GG G P LGLG + + + +++FS C SG + G
Sbjct: 207 KVAGG---GTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGPT 263
Query: 265 G-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSF 313
P + T L + + Y + + +G + T I DSG+ +
Sbjct: 264 SQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVY 323
Query: 314 TFLPKEVYETIAAEFDRQVND---TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNN 369
T L K VYE + EF ++V +TS G+ CY K+P++ MF N
Sbjct: 324 TRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGF--DTCYSGQV----KVPTITFMFKGVNM 377
Query: 370 SFVVNNPVFVIYGTQVVTGFCLAI----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ +N +++ T T CLA+ + V+ + I +RV+ D N +LG +
Sbjct: 378 TMPADN--LMLHSTAGSTS-CLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLA 434
Query: 426 HSNC 429
C
Sbjct: 435 RERC 438
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 142/350 (40%), Gaps = 45/350 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS- 154
D G D+ W+ C C C Y D + PS S++ +SC C DL T+
Sbjct: 187 DTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSASYAAVSCDSPRCRDLDTAA 236
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + G + L L G + N V IGCG G +
Sbjct: 237 CRNATGACLYEV-AYGDGSYTVGDFATETLTL--GDSTPVTN-----VAIGCGHDNEGLF 288
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD--SGRIFFGDQGPATQQS 271
V GL+ LG G +S PS ++ ++FS C D+D + + FG G
Sbjct: 289 ---VGAAGLLALGGGPLSFPSQISA-----STFSYCLVDRDSPAASTLQFGADGAEADTV 340
Query: 272 TSFLASNGKYIT-YIIGVETCCIGSSCLK--QTSFK---------AIVDSGSSFTFLPKE 319
T+ L + + T Y + + +G L ++F IVDSG++ T L
Sbjct: 341 TAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVTRLQSS 400
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + F R + + CY S + ++P+V L F + + ++
Sbjct: 401 AYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYL 460
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I T +CLA P + + IG G RV FD +G++ + C
Sbjct: 461 IPVDGAGT-YCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 142/358 (39%), Gaps = 63/358 (17%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLG- 152
D G D+ W+ PC+ +C P ++ PS SST ++C+ C LG
Sbjct: 149 DTGSDVSWVQCTPCNSTKCYPQKDPLFD----------PSKSSTYAPIACNTDACRKLGD 198
Query: 153 ---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
C + C Y+++Y + + S G+ + L L G GCG
Sbjct: 199 HYHNGCTSGGTQCGYSVEY-ADGSHSRGVYSNETLTLAPG-------ITVEDFHFGCGRD 250
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
Q G DGL+GLG +S+ ++ + + +FS C +S F P +
Sbjct: 251 QRG---PSDKYDGLLGLGGAPVSL--VVQTSSVYGGAFSYCLPALNSEAGFLVLGSPPSG 305
Query: 270 QSTSFLASNGKYIT-----YIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPKEV 320
++F+ + +++ Y++ + +G L Q++F+ I+DSG+ T LP+
Sbjct: 306 NKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGTVDTELPETA 365
Query: 321 YETIAAEFDRQVNDTITSFEGYP------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
Y + A + + + YP + CY + +P V F + ++
Sbjct: 366 YNALEAALRK-------ALKAYPLVPSDDFDTCYNFTGYSNITVPRVAFTFSGGATIDLD 418
Query: 375 NPVFVIYGTQVVTGFCLAIQ---PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
P ++ CLA Q P DG +G IG V++D +G+ C
Sbjct: 419 VP------NGILVNDCLAFQESGPDDG-LGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 147/352 (41%), Gaps = 47/352 (13%)
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
+SL D G DL W +C P S Y+ + N PS+SST +++SCS +C+
Sbjct: 145 LSLVFDTGSDLTW-----TQCEPCLGSCYSQKEPKFN---PSSSSTYQNVSCSSPMCEDA 196
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
SC C Y++ Y + + + G L ++ L + + V V GCG G
Sbjct: 197 ESCS--ASNCVYSIG-YGDKSFTQGFLAKEKFTLTN-------SDVLEDVYFGCGENNQG 246
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMC---FDKDDSGRIFFGDQGPAT 268
+ DG+ GL SL A+ N+ FS C F + +G + FG G +
Sbjct: 247 LF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISE 300
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLKQT--SFK---AIVDSGSSFTFLPKEVYET 323
+ ++S Y I + +G L T SF AI+DSG+ FT LP +VY
Sbjct: 301 SVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAE 360
Query: 324 IAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
+ + F +++ + S GY + CY + P++ F V + G
Sbjct: 361 LRSVFKEKMS-SYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSF-------AGGTVVELDG 412
Query: 383 TQV-----VTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + ++ CLA D G T VV+D ++G++ + C
Sbjct: 413 SGISLPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 138/365 (37%), Gaps = 72/365 (19%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ + L D G DL+W +C P A + D+ L + PS SST SC LC
Sbjct: 100 QPVQLTLDTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTLSLTSCDSTLC- 149
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
+ + L D + G + V GCG+
Sbjct: 150 --------------------QGLPVASLPRSDKFTFVGAGASV------PGVAFGCGLFN 183
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGRI 259
+G + G+ G G G +S+PS L K G +FS CF D +
Sbjct: 184 NGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIPSTVLLDLPADL 236
Query: 260 FFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFKAIVDSG 310
F QG Q+T + + Y + ++ +GS+ LK + I+DSG
Sbjct: 237 FSNGQGAV--QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSG 294
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNN 369
++ T LP VY + F QV + S C + + P +P + L F
Sbjct: 295 TAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATM 354
Query: 370 SFVVNNPVFVI--YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
N VF + G+ ++ CLAI G++ TIG V++D +N KL + +
Sbjct: 355 DLPRENYVFEVEDAGSSIL---CLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 410
Query: 428 NCQDL 432
C L
Sbjct: 411 QCDKL 415
>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
Length = 107
Score = 61.2 bits (147), Expect = 1e-06, Method: Composition-based stats.
Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 1/79 (1%)
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGR 258
+V C +G +LDG A +GL+GLG ++SV +L +GL+ +SFSMCF +D GR
Sbjct: 12 GAVAKACRCGPTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGR 71
Query: 259 IFFGDQGPATQQSTSFLAS 277
I FGD G Q F+++
Sbjct: 72 INFGDAGIRGQGEMPFIST 90
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 135/366 (36%), Gaps = 84/366 (22%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPS-----ASSTSKHLSCSHRLCD- 150
D G DL+W+ CD C C DL+ + + ASS+ K L C+ C
Sbjct: 23 DTGSDLVWLKCDNCDHC-------------DLDHHGETIFFSDASSSYKKLPCNSTHCSG 69
Query: 151 -----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
+G C+ + C Y +Y + + +SG + D + S G S + G
Sbjct: 70 MSSAGIGPRCE---ETCKYKYEY-GDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFG 125
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG 265
CG K G D GLIGLG S+ L + FS C DS
Sbjct: 126 CGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDS--------- 171
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF---------------------- 303
P + +S FL S+ + + V T + L QT +
Sbjct: 172 PPSAKSFLFLGSSAALRGHDV-VSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESG 230
Query: 304 -----------KAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSS 351
K ++DSG+++T L VYE + + QV T+ + G C+ SS
Sbjct: 231 HNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAG--LDLCFNSS 288
Query: 352 SQRLPKLPSVKLMFPQNNSFVVN-NPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG 410
PSV F V+ +F + VV CL++ GD+ IG
Sbjct: 289 GDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRDVV---CLSMDSSGGDLSIIGNMQQQN 345
Query: 411 YRVVFD 416
+ +++D
Sbjct: 346 FHILYD 351
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 88/379 (23%), Positives = 157/379 (41%), Gaps = 74/379 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC------- 149
D G DL W+ C C+ C ++ + P+ASS+ ++L+C C
Sbjct: 164 DTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNLTCGDPRCGHVAPPE 213
Query: 150 -DLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGC 206
+C+ P + PCPY Y ++ S+ L +E ++L + G +S V+ GC
Sbjct: 214 APAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPG----ASSRVDGVVFGC 269
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGD 263
G + G + L+GLG G +S S L +A ++FS C D + ++ FG+
Sbjct: 270 GHRNRGLFHGAAG---LLGLGRGPLSFASQL-RAVYGGHTFSYCLVDHGSDVASKVVFGE 325
Query: 264 Q----------------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT---SFK 304
PA+ + +F + ++G E I S + S
Sbjct: 326 DDALALAAHPRLKYTAFAPASSPADTFYYV--RLTGVLVGGELLNISSDTWDASEGGSGG 383
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKL 363
I+DSG++ ++ + Y+ I F +++ + +P CY S P++P + L
Sbjct: 384 TIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVERPEVPELSL 443
Query: 364 M--------FPQNNSFVVNNPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRV 413
+ FP N F+ +P ++ CLA+ P G + IG + V
Sbjct: 444 LFADGAVWDFPAENYFIRLDPDGIM---------CLAVLGTPRTG-MSIIGNFQQQNFHV 493
Query: 414 VFDRENLKLGWSHSNCQDL 432
+D N +LG++ C ++
Sbjct: 494 AYDLHNNRLGFAPRRCAEV 512
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 139/366 (37%), Gaps = 73/366 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G D+ W+ C+ C +P A + +L + P+ASST +CS C LG S
Sbjct: 153 DTGSDVSWVQCEPCPAPSPCHA-HAGAL------FDPAASSTYAAFNCSAAACAQLGDSG 205
Query: 156 Q----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+ + K C Y + Y + ++++G D+L L SG D V GC +
Sbjct: 206 EANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTL-SGSD------VVRGFQFGCSHAEL 257
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
G +D DGLIGLG S+ S A SFS C PAT S
Sbjct: 258 GAGMDD-KTDGLIGLGGDAQSLVS--QTAARYGKSFSYCL--------------PATPAS 300
Query: 272 TSFL----------ASNGKYIT------------YIIGVETCCIGSS--CLKQTSFKA-- 305
+ FL ++ T Y +E +G L + F A
Sbjct: 301 SGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGS 360
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
+VDSG+ T LP Y +++ F + + C+ + +P+V L+F
Sbjct: 361 LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVF 420
Query: 366 PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLG 423
V + +V+G CLA P D GTIG + V++D G
Sbjct: 421 -------AGGAVVDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGVFG 473
Query: 424 WSHSNC 429
+ C
Sbjct: 474 FRAGAC 479
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 146/376 (38%), Gaps = 70/376 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G +L+W C C RC P P+ SST L C+ C L TS
Sbjct: 109 DTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRLPCNGSFCQYLPTSS 160
Query: 156 Q----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+ N C Y Y + T+ G L + L + GD V GC +
Sbjct: 161 RPRTCNATAACAYNYTYGSGYTA--GYLATETLTV---GDGTFPK-----VAFGCSTE-- 208
Query: 212 GGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPAT 268
+GV G++GLG G +S+ S LA R S+ + D D G I FG T
Sbjct: 209 ----NGVDNSSGIVGLGRGPLSLVSQLAVG---RFSYCLRSDMADGGASPILFGSLAKLT 261
Query: 269 QQST---------SFLASNGKYITYIIGV-----ETCCIGSSC-LKQTSFKA--IVDSGS 311
++S +L + Y + G+ E GS+ QT IVDSG+
Sbjct: 262 ERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGT 321
Query: 312 SFTFLPKEVYETIAAEFDRQVND----TITSFEGYPWKCCYKSSS---QRLPKLPSVKLM 364
+ T+L K+ Y + F Q+ + T S Y CYK S+ + ++P + L
Sbjct: 322 TLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALR 381
Query: 365 FPQNNSFVVNNPVFVIY-GTQV-----VTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFD 416
F + N PV + G + VT CL + P D I IG +++D
Sbjct: 382 FAGGAKY--NVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYD 439
Query: 417 RENLKLGWSHSNCQDL 432
+ ++ ++C L
Sbjct: 440 IDGGMFSFAPADCAKL 455
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 138/354 (38%), Gaps = 61/354 (17%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G ++ W C CV C +A ++ PS SST K C
Sbjct: 398 DTGSEITWTQCLPCVHCYKQNAPIFD----------PSKSSTFKEKRCH----------- 436
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
CPY +DY+ + T + G L D + + S V A IIGCG S
Sbjct: 437 --DHSCPYEVDYF-DKTYTKGTLATDTVTIHSTSGEPF---VMAETIIGCGRNNS----- 485
Query: 217 GVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ---GPATQQS 271
P +G +GL G +S+ + G S CF + + +I FG G S
Sbjct: 486 WFRPSFEGFVGLNWGPLSL--ITQMGGEYPGLMSYCFAGNGTSKINFGTNAIVGGGGVVS 543
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGSSFTFLPKEVYETI 324
T+ + + Y + ++ +G + ++ T F A ++DSG++ T+ P+ +
Sbjct: 544 TTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTYFPESYCNLV 603
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
+ V + CY S++ + P + + F V++ + ++
Sbjct: 604 RQAVEHVVPAVPAADPTGNDLLCYYSNTTEI--FPVITMHFSGGADLVLDK--YNMFMES 659
Query: 385 VVTG-FCLAI---QPVDGDI-GTIGQ-NFMTGYRVVFDRENLKLGWSHSNCQDL 432
G FCLAI P I G Q NF+ GY D +L + + +NC L
Sbjct: 660 YSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGY----DSSSLLVSFKPTNCSAL 709
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 101/425 (23%), Positives = 158/425 (37%), Gaps = 94/425 (22%)
Query: 6 LTIYLAVF-WLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQV 64
+ I+L + + L ++ + F+ LIHR S + N A S A F+ Y+
Sbjct: 8 IAIFLQIITYFLFTTTASSPHGFTIDLIHRRSNASSSR--VSNTQAGSPYADTVFDTYEY 65
Query: 65 LLSSDVQKQKMKTG-PQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYN 122
L+ K++ G P F++ D G +L+W C C+ C
Sbjct: 66 LM-------KLQIGTPPFEV----------EAVLDTGSELIWTQCLPCLHC--------- 99
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 182
D+ + PS SST K T C P CPY + Y ++ + L E
Sbjct: 100 -YDQKAPIFDPSKSSTFKE-----------TRCNTPDHSCPYKLVYDDKSYTQGTLATET 147
Query: 183 I-LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAK 239
+ +H SG V IIGC SG G P G++GL G +S+ S +
Sbjct: 148 VTIHSTSG-----VPFVMPETIIGCSRNNSG---SGFRPSSSGIVGLSRGSLSLISQMGG 199
Query: 240 AGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 299
A + GD ST+ A K Y + ++ +G + ++
Sbjct: 200 A-------------------YPGDG----VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIE 236
Query: 300 Q--TSFKA-----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 352
T F A ++DSG+ T+ P + +R V CY S++
Sbjct: 237 TVGTPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNT 296
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG-FCLAI---QPVDGDI-GTIGQ-N 406
+ P + + F V++ + +Y G FCLAI P I G Q N
Sbjct: 297 IEI--FPVITVHFSGGADLVLDK--YNMYMELNRGGVFCLAIICNNPTQVAIFGNRAQNN 352
Query: 407 FMTGY 411
F+ GY
Sbjct: 353 FLVGY 357
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/344 (25%), Positives = 138/344 (40%), Gaps = 43/344 (12%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC- 149
+ +SL D G DL W +C P + S Y D + PS S++ +++C+ LC
Sbjct: 156 RDLSLIFDTGSDLTW-----TQCEPCARSCYKQQDAIFD---PSKSTSYSNITCTSTLCT 207
Query: 150 DLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
L T+ C + C Y + Y +++ S G + L + + + + +
Sbjct: 208 QLSTATGNEPGCSASTKACIYGIQY-GDSSFSVGYFSRERLSVTA-------TDIVDNFL 259
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFF 261
GCG + + G G A GLIGLG IS + A + R FS C S GR+ F
Sbjct: 260 FGCG-QNNQGLFGGSA--GLIGLGRHPISF--VQQTAAVYRKIFSYCLPATSSSTGRLSF 314
Query: 262 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-----AIVDSGSSFTFL 316
G + + T F + Y + + +G + L +S AI+DSG+ T L
Sbjct: 315 GTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVITRL 374
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
P Y + + F + ++ ++ E CY S + +P + F V P
Sbjct: 375 PPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFA--GGVTVQLP 432
Query: 377 ----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
++V QV F A D D+ G VV+D
Sbjct: 433 PQGILYVASAKQVCLAF--AANGDDSDVTIYGNVQQKTIEVVYD 474
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 139/357 (38%), Gaps = 53/357 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G ++ WIPC+ C C+ + PS SST +L+C+ + C L C
Sbjct: 142 DTGSNIAWIPCNPCSGCS-----------SKQQPFEPSKSSTYNYLTCASQQCQLLRVCT 190
Query: 157 NPKQP--CPYTMDYYTENTSSSGLLVEDIL--HLISGGDNALKNSVQASVIIGCGMKQSG 212
C T Y ++ V++IL +S G ++N + GC G
Sbjct: 191 KSDNSVNCSLTQRYGDQSE------VDEILSSETLSVGSQQVEN-----FVFGCSNAARG 239
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSGRIFFGDQGPAT 268
L P L+G G +S S A L ++FS C F +G + G + +
Sbjct: 240 --LIQRTP-SLVGFGRNPLSFVS--QTATLYDSTFSYCLPSLFSSAFTGSLLLGKEALSA 294
Query: 269 QQ-STSFLASNGKYIT-YIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFL 316
Q + L SN +Y + Y +G+ +G + + T I+DSG+ T L
Sbjct: 295 QGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRL 354
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
+ Y + F Q+++ + + CY S + + P + L F N +
Sbjct: 355 VEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDV-EFPLITLHFDDNLDLTLPLD 413
Query: 377 VFVIYGTQVVTGFCLA--IQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G + CLA + P GD + T G R+V D +LG + NC
Sbjct: 414 NILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/374 (22%), Positives = 152/374 (40%), Gaps = 76/374 (20%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G DL+W C C C D+ + P SS+ + CS LC+ ++
Sbjct: 17 DTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSN 66
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C K C Y + Y + +S+ GLL + +NS+ + + GCG++ G
Sbjct: 67 CNEDKDACEY-LYTYGDYSSTRGLLATETFTFED------ENSI-SGIGFGCGVENEG-- 116
Query: 215 LDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGD------ 263
DG + GL+GLG G +S+ S L + FS C D + S +F G
Sbjct: 117 -DGFSQGSGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEASSSLFIGSLASGIV 170
Query: 264 -------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK--------AI 306
G T+ + S L + + Y + ++ +G+ L ++++F+ I
Sbjct: 171 NKTGASLDGEVTK-TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMI 229
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKL---- 358
+DSG++ T+L + ++ + EF +++ + C+K + + +PK+
Sbjct: 230 IDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHF 289
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
L P N V ++ V+ CLA+ +G + G + V+ D E
Sbjct: 290 KGADLELPGENYMVADSSTGVL---------CLAMGSSNG-MSIFGNVQQQNFNVLHDLE 339
Query: 419 NLKLGWSHSNCQDL 432
+ + + C L
Sbjct: 340 KETVSFVPTECGKL 353
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 91/373 (24%), Positives = 147/373 (39%), Gaps = 68/373 (18%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 155
D G DL+W CD C RC P A Y +P+ S T ++SC RLCD S
Sbjct: 118 DTGSDLIWTQCDAPCRRCFPQPAPLY----------APARSVTYANVSCGSRLCDALPSL 167
Query: 156 Q-------------NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
+ + C Y Y + +S+ G+L + +G + +
Sbjct: 168 RPSSRCSASASAPAPERGGCTYYYS-YGDGSSTDGVLATETFTFGAG-------TTVHDL 219
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGR 258
GCG GG + GL+G+G G + SL+++ G+ + FS CF D S
Sbjct: 220 AFGCGTDNLGGTDNS---SGLVGMGRGPL---SLVSQLGVTK--FSYCFTPFNDTTTSSP 271
Query: 259 IFFGDQG---PATQQSTSFLASNG---KYITYIIGVETCCIGSSCL--KQTSFK------ 304
+F G PA +ST F+ S + Y + +E +G + L F+
Sbjct: 272 LFLGSSASLSPAA-KSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASGR 330
Query: 305 --AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK---LP 359
I+DSG++FT L + + +A +V + S C+ + R P+ +P
Sbjct: 331 GGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDVP 390
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
+ L F + + + V +V CL I G + +G V +D
Sbjct: 391 RLVLHFDGADMELPRSSAVVE--DRVAGVACLGIVSARG-MSVLGSMQQQNMHVRYDVGR 447
Query: 420 LKLGWSHSNCQDL 432
L + +NC +L
Sbjct: 448 DVLSFEPANCGEL 460
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 145/363 (39%), Gaps = 59/363 (16%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++TM + D G DL W+ C AP S + L + P+ SS+ + C +C
Sbjct: 152 AQTMEV--DTGSDLSWVQCKPCSAAPSCYSQKDPL------FDPAQSSSYAAVPCGGPVC 203
Query: 150 DLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
G Y Y + ++++G+ D L L + ++VQ GC
Sbjct: 204 -AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL------SASSAVQG-FFFGC 255
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMCFDKDDS--GRIFFGD 263
G QS G +GV DGL+GLG + PSL+ + AG FS C S G + G
Sbjct: 256 GHAQS-GLFNGV--DGLLGLGREQ---PSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGL 309
Query: 264 QGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTF 315
GP+ +T L S Y++ + +G L ++F +VD+G+ T
Sbjct: 310 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITR 369
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQN 368
LP Y + + F + S+ GYP CY + LP+V L F
Sbjct: 370 LPPTAYAALRSAF----RSGMASY-GYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSG 424
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
+ ++ + +G CLA P DG + +G + V D +G+
Sbjct: 425 ATVMLGADGILSFG-------CLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKP 475
Query: 427 SNC 429
S+C
Sbjct: 476 SSC 478
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 158/398 (39%), Gaps = 65/398 (16%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F + K SL D G DL W+ C C C + ++Y+ P
Sbjct: 157 LGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYD----------P 206
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
S++ K+++C+ C L +S C++ Q CPY Y + ++ VE +
Sbjct: 207 KTSASFKNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNL 266
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 247
+ + +++ GCG G + L+GLG G +S S L L +SF
Sbjct: 267 TTTEGRSSEYKVENMMFGCGHWNRGLFSGASG---LLGLGRGPLSFSSQLQ--SLYGHSF 321
Query: 248 SMCF-----DKDDSGRIFFGDQGPATQQS----TSFL--ASNGKYITYIIGVETCCIGSS 296
S C D + S ++ FG+ + TSF+ N Y I +++ +G
Sbjct: 322 SYCLVDRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGE 381
Query: 297 CLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WK 345
L + I+DSG++ ++ + YE I +F ++ + F +P
Sbjct: 382 ALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLD 441
Query: 346 CCY-----KSSSQRLPKL-----PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQP 395
C+ + ++ LP+L FP NSF+ + V CLAI
Sbjct: 442 PCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSEDLV----------CLAILG 491
Query: 396 V-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
IG + +++D + +LG++ + C D+
Sbjct: 492 TPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKCADI 529
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 104/424 (24%), Positives = 158/424 (37%), Gaps = 55/424 (12%)
Query: 8 IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
+ + + L E + A FS LIHR S SK + +A + +
Sbjct: 13 VVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFR 72
Query: 63 QVLLSSD-VQKQKMKTGPQFQM-LFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSAS 119
++SD +Q + + + ++ M L+ + D G DL W C C C
Sbjct: 73 PTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVP 132
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT--SCQNPKQPCPYTMDYYTENTSSS 176
++ P SST + SC C LG SC K+ C + Y + + +
Sbjct: 133 LFD----------PKNSSTYRDSSCGTSFCLALGKDRSCSKEKK-CTFRYSY-ADGSFTG 180
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
G L + L + S A K GCG SGG D + G++GLG GE+S+ S
Sbjct: 181 GNLASETLTVDS---TAGKPVSFPGFAFGCG-HSSGGIFDK-SSSGIVGLGGGELSLISQ 235
Query: 237 LAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 291
L I FS C D S RI FG G + T Y Y
Sbjct: 236 LKST--INGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGY------- 286
Query: 292 CIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSS 351
S + IVDSG+++TFLP+E Y + + + CY ++
Sbjct: 287 ---SKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTT 343
Query: 352 SQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQ----NF 407
++ P + F N + F+ +V C + P DIG +G NF
Sbjct: 344 AE--INAPIITAHFKDANVELQPLNTFMRMQEDLV---CFTVAPTS-DIGVLGNLAQVNF 397
Query: 408 MTGY 411
+ G+
Sbjct: 398 LVGF 401
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 135/341 (39%), Gaps = 48/341 (14%)
Query: 130 EYSPSASSTSKHLSCSHRLCDLGTSC----------QNPKQPCPYTMDYY-TENTSSSGL 178
E S S T L C C+ SC + C Y + Y N S++G+
Sbjct: 84 EKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAGV 143
Query: 179 LVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
L ED L +++ A+ S V IGC + + D + G+ GLG S+P L
Sbjct: 144 LYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDP-SIKGVFGLGRSATSLPRQL 202
Query: 238 AKAGLIRNSFSMC---FDKDDSGRIFFGDQGP---------ATQQSTSFLASNGKYIT-Y 284
+ FS C + K D P A +T+ L N Y T Y
Sbjct: 203 NFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRY 257
Query: 285 IIGVETCCIGSSCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+ ++ IG + L S K+ VD+G+SFT L V+ + E DR + + E
Sbjct: 258 FVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKE 317
Query: 341 GYPWK----CCY---KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
P + CY +++ KLP + L F + + V+ + Y + + CLAI
Sbjct: 318 -QPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLP---WDSYLWKTTSKLCLAI 373
Query: 394 QP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ G I +G M ++ D N KL + ++C +
Sbjct: 374 DKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKV 414
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 144/355 (40%), Gaps = 57/355 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P A Y + ++P+ S+T ++SC+ C DL T C
Sbjct: 183 DTGSDTTW-----VQCQPCVAYCYQQKE---PLFTPTKSATYANISCTSSYCSDLDTRGC 234
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + + G +D L L G + +K+ GCG K G L
Sbjct: 235 SGGH--CLYAVQY-GDGSYTVGFYAQDTLTL---GYDTVKD-----FRFGCGEKNRG--L 281
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF----GDQGPATQQS 271
G A GL+GLG G+ SVP + F+ C SG F G A +
Sbjct: 282 FGKAA-GLMGLGRGKTSVP--VQAYDKYSGVFAYCIPATSSGTGFLDFGPGAPAAANARL 338
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK---AIVDSGSSFTFLPKEVYETIAA 326
T L NG Y +G+ +G L T F A+VDSG+ T LP YE + +
Sbjct: 339 TPMLVDNGPTF-YYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPPSAYEPLRS 397
Query: 327 EFDRQVNDTITSFEGYPWK---------CCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNP 376
F + EG +K CY + Q LP+V L+F Q + + +
Sbjct: 398 AFAK-------GMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVF-QGGACLDVDA 449
Query: 377 VFVIYGTQVVTGFCLAIQPVDG--DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++Y V CLA D D+ +G Y V++D +G++ C
Sbjct: 450 SGILYVADVSQA-CLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 145/353 (41%), Gaps = 49/353 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASY--YNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLG 152
D G DL W V+C P Y ++ L + PS S+T + C + C D G
Sbjct: 156 DTGSDLSW-----VQCKPCDGCYQQHDPL------FDPSQSTTYSAVPCGAQECRRLDSG 204
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
SC + K C Y + Y + + + G L D L L ++ + +Q + GCG +G
Sbjct: 205 -SCSSGK--CRYEV-VYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQ-EFVFGCGDDDTG 259
Query: 213 GYLDGVAPDGLIGLGLGEISVPS-LLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQ 269
L G A DGL GLG +S+ S AK G FS C + G + G P
Sbjct: 260 --LFGKA-DGLFGLGRDRVSLASQAAAKYGA---GFSYCLPSSSTAEGYLSLGSAAPPNA 313
Query: 270 QSTSFLASNGK---YITYIIGVE----TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYE 322
+ T+ + + Y ++G++ T + + + ++DSG+ T LP Y
Sbjct: 314 RFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPG--TVIDSGTVITRLPSRAYA 371
Query: 323 TIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NNP 376
+ + F + S++ P CY + + ++PSV L+F + +
Sbjct: 372 ALRSSFAGLMRR--YSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEV 429
Query: 377 VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++V +Q F A D I +G + VV+D N K+G+ C
Sbjct: 430 LYVANKSQACLAF--ASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/356 (23%), Positives = 139/356 (39%), Gaps = 51/356 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G D++W+ C C+ C Y D + P++S+T +SC +C L TS
Sbjct: 143 DSGSDVIWVQCKPCLEC-------YAQAD---PLFDPASSATFSAVSCGSAICRTLRTSG 192
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + + G L + L L G A++ V IGCG + G +
Sbjct: 193 CGDSGGCEYEVSY-GDGSYTKGTLALETLTL---GGTAVEG-----VAIGCGHRNRGLF- 242
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---------KDDSGRIFFGDQGP 266
V GL+GLG G +S+ L A +FS C D +G + G
Sbjct: 243 --VGAAGLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGGSGSGAADAAGSLVLGRSEA 298
Query: 267 ATQQSTSF-LASNGKYIT-YIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFT 314
+ + L N + + Y +GV +G L + ++D+G++ T
Sbjct: 299 VPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVT 358
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
LP+E Y + F V + CY S ++P+V F + +
Sbjct: 359 RLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLP 418
Query: 375 NPVFVIYGTQVVTG-FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ +V G +CLA P + +G G ++ D N +G+ + C
Sbjct: 419 ARNLLL---EVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGFGPATC 471
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 133/333 (39%), Gaps = 51/333 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G++SV L ++ + FS C S R FF
Sbjct: 110 TFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSERGFFS 165
Query: 262 --------GDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAI 306
G + AT+ + T +A + + + + L + S K +
Sbjct: 166 KTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGV 225
Query: 307 V-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
V DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 VFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHF 284
Query: 366 PQNNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + + VFV Q +CLA P +
Sbjct: 285 DDGARFDLGRHGVFVERSVQEQDVWCLAFAPTE 317
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 103/412 (25%), Positives = 153/412 (37%), Gaps = 114/412 (27%)
Query: 98 DFGCDLLWIPC-----DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
D G DL W+PC DC+ C L + N+L + + +SP SS+S SC+ C
Sbjct: 29 DTGSDLTWVPCGNLSFDCIDCNDLKS---NNL-KSSSIFSPLHSSSSFRASCASSFCAEI 84
Query: 153 TSCQNP-------------------KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
S NP +PCP Y E SG+L DIL
Sbjct: 85 HSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGILTRDILK-------- 136
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 251
+ GC + Y + P G+ G G G +S+PS L G + FS CF
Sbjct: 137 ARTRDVPRFSFGC---VTSTYHE---PIGIAGFGRGLLSLPSQL---GFLEKGFSHCFLP 187
Query: 252 -----DKDDSGRIFFGDQGPATQ-----QSTSFLASNGKYITYIIGVETCCIGSSC---- 297
+ + S + G + Q T L + +Y IG+E+ IG++
Sbjct: 188 FKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIGTNITPTQ 247
Query: 298 ----LKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP------ 343
L+Q + +VDSG+++T LP Y + + TIT YP
Sbjct: 248 VPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLT----ILQSTIT----YPRATETE 299
Query: 344 ----WKCCYKS----------SSQRLPKLPSV--------KLMFPQNNSFVVNNPVFVIY 381
+ CYK + + PS+ L+ PQ NSF +
Sbjct: 300 SRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYA---MSAPS 356
Query: 382 GTQVVTGFCLAIQPVD----GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
VV CL Q ++ G G G +VV+D E ++G+ +C
Sbjct: 357 DGSVVQ--CLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/347 (23%), Positives = 138/347 (39%), Gaps = 42/347 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G D++W+ C C+ C Y D + P+ S+T + C +C L TS
Sbjct: 145 DSGSDVIWVQCKPCLEC-------YAQAD---PLFDPATSATFSAVPCGSAVCRTLRTSG 194
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + + G L + L L G A++ V IGCG + G +
Sbjct: 195 CGDSGGCDYEVSY-GDGSYTKGALALETLTL---GGTAVEG-----VAIGCGHRNRGLF- 244
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF- 274
V GL+GLG G +S+ L A +FS C +G + G + +
Sbjct: 245 --VGAAGLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAGSLVLGRSEAVPEGAVWVP 300
Query: 275 LASNGKYIT-YIIGVETCCIGSSCL--KQTSFK--------AIVDSGSSFTFLPKEVYET 323
L N + + Y +G+ +G L ++ F+ ++D+G++ T LP+E Y
Sbjct: 301 LVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAA 360
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGT 383
+ F V + CY S ++P+V F + + ++
Sbjct: 361 LRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLL--- 417
Query: 384 QVVTG-FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+V G +CLA P +G G ++ D N +G+ + C
Sbjct: 418 EVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 148/372 (39%), Gaps = 68/372 (18%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ- 156
D G +L W+ C + +P S +N L + YSP S+ C R DL
Sbjct: 58 DTGSELSWLHC---KKSPNLTSVFNPLSS--SSYSPIPCSSP---VCRTRTRDLPNPVTC 109
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+PK+ C + + Y + +S G L D + G +AL + + GC G+
Sbjct: 110 DPKKLC-HAIVSYADASSLEGNLASDNFRI---GSSALPGT-----LFGC---MDSGFSS 157
Query: 217 GVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSGRIFFGDQG------ 265
D GL+G+ G +S + + GL + FS C +D SG + FGD
Sbjct: 158 NSEEDAKTTGLMGMNRGSLS---FVTQLGLPK--FSYCISGRDSSGVLLFGDSHLSWLGN 212
Query: 266 ----PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGS 311
P Q ST + + Y + ++ +G+ L + + +VDSG+
Sbjct: 213 LTYTPLVQISTPLPYFD--RVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGT 270
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITS-------FEGYPWKCCYKSSSQRLPKLPSVKLM 364
FTFL VY + EF Q + F+G C + +LP+LP+V LM
Sbjct: 271 QFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLM 330
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTG----FCLAIQPVD---GDIGTIGQNFMTGYRVVFDR 417
F + VV V + ++ G +CL D + IG + + FD
Sbjct: 331 F-RGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDL 389
Query: 418 ENLKLGWSHSNC 429
++G+ + C
Sbjct: 390 VKSRVGFVETRC 401
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 135/341 (39%), Gaps = 48/341 (14%)
Query: 130 EYSPSASSTSKHLSCSHRLCDLGTSC----------QNPKQPCPYTMDYY-TENTSSSGL 178
E S S T L C C+ SC + C Y + Y N S++G+
Sbjct: 61 EKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAGV 120
Query: 179 LVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
L ED L +++ A+ S V IGC + + D + G+ GLG S+P L
Sbjct: 121 LYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDP-SIKGVFGLGRSATSLPRQL 179
Query: 238 AKAGLIRNSFSMC---FDKDDSGRIFFGDQGP---------ATQQSTSFLASNGKYIT-Y 284
+ FS C + K D P A +T+ L N Y T Y
Sbjct: 180 NFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRY 234
Query: 285 IIGVETCCIGSSCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+ ++ IG + L S K+ VD+G+SFT L V+ + E DR + + E
Sbjct: 235 FVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKE 294
Query: 341 GYPWK----CCY---KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
P + CY +++ KLP + L F + + V+ + Y + + CLAI
Sbjct: 295 -QPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLP---WDSYLWKTTSKLCLAI 350
Query: 394 QP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ G I +G M ++ D N KL + ++C +
Sbjct: 351 DKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKV 391
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 94/396 (23%), Positives = 150/396 (37%), Gaps = 73/396 (18%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSP 133
++G F ++ S L D G DL+W+ C C RC ++ P
Sbjct: 81 FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFD----------P 130
Query: 134 SASSTSKHLSCSH------RL--CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 185
SST + + CS R CD G + C Y M Y + +SS+G L D L
Sbjct: 131 RRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRY-MVAYGDGSSSTGELATDKLA 186
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 245
+ D + N V +GCG + + G D A GL+G+ G+IS+ + +A A +
Sbjct: 187 FAN--DTYVNN-----VTLGCG-RDNEGLFDSAA--GLLGVARGKISISTQVAPA--YGS 234
Query: 246 SFSMCFDKDDSGR-------IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
F C D + R +F P + T+ L++ + Y + + +G
Sbjct: 235 VFEYCL-GDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGE-- 291
Query: 299 KQTSFK--------------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF---EG 341
+ T F +VDSG++ + ++ Y + FD + E
Sbjct: 292 RVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEH 351
Query: 342 YPWKCCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
+ CY + P + L F P N F+ PV CL
Sbjct: 352 SVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFL---PVDGGRRRAASYRRCLGF 408
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ D + IG G+RVVFD E ++G++ C
Sbjct: 409 EAADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 99/451 (21%), Positives = 171/451 (37%), Gaps = 57/451 (12%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTG---PQFQM 83
K + R + + +G +N ++ AK+S + +V+ ++ + + M++ M
Sbjct: 65 MQAKDLFRHEQMITMMGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSALNIAHVGM 124
Query: 84 LFPSQGSKTMSLGN----DFGCDLLWIPCDCVR-----------CAPLSASYYNSLDRDL 128
S T +L D DL WI C R +S + +
Sbjct: 125 YLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEASK 184
Query: 129 NEYSPSASSTSKHLSCSHRLCDL--GTSCQNPK--QPCPYTMDYYTENTSSSGLL-VEDI 183
N Y P+ SS+ + + CS + C + +CQ+P + C Y + T + G+ E
Sbjct: 185 NWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIGIYGKEKA 243
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 243
+S G + + +I+GC + ++GG +D A DG++ LG G++S AK
Sbjct: 244 TVTVSDG----RMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR--F 295
Query: 244 RNSFSMCF-----DKDDSGRIFFGDQ----GPATQQSTSFL------ASNGKYITYIIGV 288
FS C +D S + FG GP T ++ A + ++G
Sbjct: 296 GQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGG 355
Query: 289 ETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK 345
E I F I+D+ +S T L E Y + A DR ++ +E ++
Sbjct: 356 ERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFE 415
Query: 346 CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG------FCLAIQP-VDG 398
CYK + P+ + P + VV CLA + + G
Sbjct: 416 YCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRG 475
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G +G FM Y D + K+ + C
Sbjct: 476 GPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 506
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 99/419 (23%), Positives = 161/419 (38%), Gaps = 75/419 (17%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC---DCVRCA----PLSASYYNSLDRD 127
+ G + + F S S+T+S+ D G D++W PC +C+ C P + + N
Sbjct: 88 LSPGTDYTLTF-SINSQTLSVYMDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSS 146
Query: 128 LNEYSPSASSTSKHLSCSHRLCDLG---------TSCQNPKQPCPYTMDYYTENTSSSGL 178
L A ST+ + + LC + + C N CP Y + + + L
Sbjct: 147 LISCKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSN--YHCPSFYYAYGDGSLIAKL 204
Query: 179 LVED-ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
+ I+ S +LK+ GC G P G+ G G G +S+P+ L
Sbjct: 205 HKHNLIMPSTSNKPFSLKD-----FTFGCAHSALG------EPIGVAGFGFGSLSLPAQL 253
Query: 238 AKAGL-IRNSFSMC-----FDKDDS--------GRIFFGDQGPATQQSTSFLASNGKY-I 282
A + N FS C FD G++ D TQ + + N K+
Sbjct: 254 ANLSPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPY 313
Query: 283 TYIIGVETCCIGSSCLKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDRQV 332
Y + +E +GSS ++ + +VDSG+++T LP Y ++A E DR+V
Sbjct: 314 FYSVSMEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRV 373
Query: 333 NDTITSFEGYPWKC----CYKSSSQRLPKL----PSVKLMFPQNNSFVV---NNPVFVIY 381
K CY + +L P + F N S V+ N +
Sbjct: 374 GRVFKRASETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLD 433
Query: 382 GTQVVTGF---CLAI-----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
G G CL + + G T+G G++VV+D E ++G++ C L
Sbjct: 434 GEDEKKGRKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKCASL 492
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 144/384 (37%), Gaps = 77/384 (20%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
S+ + L D D W C+P +SL ++P+ SS+ L CS C
Sbjct: 91 SQQLLLALDTSADATW-----AHCSPCGTCPSSSL------FAPANSSSYASLPCSSSWC 139
Query: 150 DL--GTSCQNPK-----QPCPYTMDYYT-----ENTSSSGLLVEDILHLISGGDNALKNS 197
L G +C P+ P P T+ + S L D L L G +A+ N
Sbjct: 140 PLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRL---GKDAIPN- 195
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDK--- 253
GC + G + GL+GLG G ++ LL++AG + N FS C
Sbjct: 196 ----YTFGC-VSSVTGPTTNMPRQGLLGLGRGPMA---LLSQAGSLYNGVFSYCLPSYRS 247
Query: 254 ---DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------Q 300
S R+ G P + + T L + + Y + V +G + +K
Sbjct: 248 YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAA 307
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL 358
T +VDSG+ T VY + EF RQV TS + C+ +
Sbjct: 308 TGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAF--DTCFNTDEVAAGGA 365
Query: 359 PS--------VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI----QPVDGDIGTIGQN 406
P+ V L P N+ + ++ + CLA+ Q V+ + I
Sbjct: 366 PAVTVHMDGGVDLALPMENTLIHSSATPLA---------CLAMAEAPQNVNSVVNVIANL 416
Query: 407 FMTGYRVVFDRENLKLGWSHSNCQ 430
RVVFD N ++G++ +C
Sbjct: 417 QQQNIRVVFDVANSRIGFAKESCN 440
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 142/350 (40%), Gaps = 62/350 (17%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS- 154
D G DL+W CD C RC P A Y+P+ S+T ++SC +C S
Sbjct: 110 DTGSDLIWTQCDAPCRRCFPQPAPL----------YAPARSATYANVSCRSPMCQALQSP 159
Query: 155 ---CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
C P C Y Y + TS+ G+L + L G D A++ V GCG +
Sbjct: 160 WSRCSPPDTGCAYYFS-YGDGTSTDGVLATETFTL--GSDTAVRG-----VAFGCGTENL 211
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
G + GL+G+G G + SL+++ G+ R S C + + +G +
Sbjct: 212 GSTDNS---SGLVGMGRGPL---SLVSQLGVTRPRRS-CRARAAA-------RGGGAPTT 257
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEF 328
TS L + IT +G I + + T I+DSG++FT L + + +A
Sbjct: 258 TSPL----EGIT--VGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARAL 311
Query: 329 DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP------QNNSFVVNNPVFVIYG 382
+V + S C+ ++S ++P + L F + S+VV +
Sbjct: 312 ASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVED------- 364
Query: 383 TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ CL + G + +G +++D E L + + C +L
Sbjct: 365 -RSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGILSFEPAKCGEL 412
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 140/358 (39%), Gaps = 50/358 (13%)
Query: 93 MSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD- 150
+SL D G DL W C CVR D+ ++PS S++ ++SCS C
Sbjct: 117 LSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSYYNVSCSSAACGS 167
Query: 151 ----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
G + C Y + Y + + S G L ++ L + + V V GC
Sbjct: 168 LSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTN-------SDVFDGVYFGC 219
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQ 264
G + + G GVA GL+GLG ++S PS A A FS C S G + FG
Sbjct: 220 G-ENNQGLFTGVA--GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSASYTGHLTFGSA 274
Query: 265 G----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
G TSF N IT +G + I S+ A++DSG+ T
Sbjct: 275 GISRSVKFTPISTITDGTSFYGLNIVAIT--VGGQKLPIPSTVFSTPG--ALIDSGTVIT 330
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
LP + Y + + F +++ T+ C+ S + +P V F +
Sbjct: 331 RLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELG 390
Query: 375 NP-VFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +F ++ V CLA D + G VV+D ++G++ + C
Sbjct: 391 SKGIFYVFKISQV---CLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 445
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 136/350 (38%), Gaps = 50/350 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLCD-LGTS 154
D G L W+ C CV S R + Y P ASST + CS CD L +
Sbjct: 152 DTGSSLTWLQCSPCVV----------SCHRQVGPLYDPRASSTYATVPCSASQCDELQAA 201
Query: 155 CQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
NP + C Y Y +++ S G L D +S G + N GCG
Sbjct: 202 TLNPSACSVRNVCIYQASY-GDSSFSVGYLSRDT---VSFGSGSYPN-----FYYGCGQD 252
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPAT 268
G + GLIGL ++S+ LA + + SFS C S G + G
Sbjct: 253 NEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCLPTPASTGYLSIGPYTSGH 307
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKAIVDSGSSFTFLPKEVYET 323
T +S+ Y + + +G S L + +S I+DSG+ T LP VY
Sbjct: 308 YSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLPTAVYTA 367
Query: 324 IAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
++ + V + + P C++ + +L ++P+V + F + + +
Sbjct: 368 LS----KAVAAAMVGVQSAPAFSILDTCFQGQASQL-RVPAVAMAFAGGATLKLATQNVL 422
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I T CLA P D IG + VV+D ++G++ C
Sbjct: 423 IDVDDSTT--CLAFAPTDSTT-IIGNTQQQTFSVVYDVAQSRIGFAAGGC 469
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 109/250 (43%), Gaps = 24/250 (9%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 145
Q ++ L D G DL W+ CD C C+ Y R N++ P L +
Sbjct: 77 QPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHPLY----RPSNDFVPCRDPLCASLQPT 132
Query: 146 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV--I 203
+C++P Q C Y ++ Y + S+ G+L+ D+ L N VQ V
Sbjct: 133 EDY-----NCEHPDQ-CDYEIN-YADQYSTFGVLLNDVYLL------NFTNGVQLKVRMA 179
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
+GCG Q DGL+GLG G+ S+ S L GL+RN C G IFFG+
Sbjct: 180 LGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCLSAQGGGYIFFGN 239
Query: 264 QGPATQQSTSFLAS-NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYE 322
+ + + + ++S + K+ Y G G S A+ D+GSS+T+ Y+
Sbjct: 240 AYDSARVTWTPISSVDSKH--YSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQ 297
Query: 323 TIAAEFDRQV 332
+ + +++
Sbjct: 298 ALLSWLKKEL 307
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 139/373 (37%), Gaps = 73/373 (19%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D G DL+W +CAP + + L + ++P+ASS+ + CS +LC+ L SC
Sbjct: 121 DTGSDLIW-----TQCAPCA----SCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHHSC 171
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
Q P C Y +Y T+ E S G+ + + GCG G
Sbjct: 172 QRPDT-CTYRYNYGDGTTTLGVYATERFTFASSSGEK-----LSVPLGFGCGTMNVGSLN 225
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR------------IFFGD 263
+G G++G G +S+ S L+ IR FS C S R +F GD
Sbjct: 226 NG---SGIVGFGRDPLSLVSQLS----IRR-FSYCLTPYTSTRKSTLMFGSLSDGVFEGD 277
Query: 264 QGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSS 312
Q Q+T L S Y + +G+ L+ S IVDSG++
Sbjct: 278 DAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTA 337
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKS------------SSQRLPKLP- 359
T P V + F Q+ TS C+ + + +P++
Sbjct: 338 LTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAF 397
Query: 360 ---SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
L P+ N +V+++P C+ + TIG RV++D
Sbjct: 398 HFQGADLELPRRN-YVLDDP--------RRGSLCILLADSGDSGATIGNFVQQDMRVLYD 448
Query: 417 RENLKLGWSHSNC 429
E L ++ + C
Sbjct: 449 LEAETLSFAPAQC 461
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/384 (22%), Positives = 150/384 (39%), Gaps = 63/384 (16%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSAS 136
TG F + ++ +L D G +L W+ C P + P AS
Sbjct: 88 TGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLV------------FRPEAS 135
Query: 137 STSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGG 190
+ + CS C L +C + PC Y Y + + G++ D + + GG
Sbjct: 136 KSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGG 195
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
K + V++GC G V DG++ LG +IS S A SFS C
Sbjct: 196 ----KVAQLQDVVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASR--AAARFGGSFSYC 247
Query: 251 F-----DKDDSGRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL------ 298
++ +G + FG Q P T + + L + Y + V+ + L
Sbjct: 248 LVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEV 307
Query: 299 -KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR--L 355
S I+DSG++ T L Y+ + A + + + + P++ CY ++ R
Sbjct: 308 WDPKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAG-VPKVDFPPFEHCYNWTAPRPGA 366
Query: 356 PKLPSVKLMF-------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD---IGTIGQ 405
P++P + + F P S+V++ V G + C+ +Q +G+ + IG
Sbjct: 367 PEIPKLAVQFTGCARLEPPAKSYVID----VKPGVK-----CIGLQ--EGEWPGVSVIGN 415
Query: 406 NFMTGYRVVFDRENLKLGWSHSNC 429
+ FD +N+++ + S C
Sbjct: 416 IMQQEHLWEFDLKNMEVRFMPSTC 439
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 124/301 (41%), Gaps = 51/301 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLC---DLGT 153
D G +L W+ C R S + E + P AS+T + C C DL
Sbjct: 81 DTGSELSWLLCATGR----QGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRDLPA 136
Query: 154 --SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
SC + C ++ Y + ++S G L D+ + G L+++ GC
Sbjct: 137 PPSCDGASRQCHVSLSY-ADGSASDGALATDVFAV--GEAPPLRSA------FGCMSTAY 187
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFG-------- 262
DGVA GL+G+ G + S + +A R FS C D+DD+G + G
Sbjct: 188 DSSPDGVATAGLLGMNRGTL---SFVTQASTRR--FSYCISDRDDAGVLLLGHSDLPFLP 242
Query: 263 -DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSF 313
+ P Q + +A + + + +G + I +S L A +VDSG+ F
Sbjct: 243 LNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQF 302
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWK------CCYKSSSQRLP---KLPSVKLM 364
TFL + Y + AEF +Q + + + + C++ + R P +LP V L+
Sbjct: 303 TFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPPVTLL 362
Query: 365 F 365
F
Sbjct: 363 F 363
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 147/376 (39%), Gaps = 70/376 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G +L+W C C RC P P+ SST L C+ C L TS
Sbjct: 109 DTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRLPCNGSFCQYLPTSS 160
Query: 156 Q----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+ N C Y Y + T+ G L + L + GD V GC +
Sbjct: 161 RPRTCNATAACAYNYTYGSGYTA--GYLATETLTV---GDGTFPK-----VAFGCSTE-- 208
Query: 212 GGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPAT 268
+GV G++GLG G +S+ S LA R S+ + D D G I FG T
Sbjct: 209 ----NGVDNSSGIVGLGRGPLSLVSQLAVG---RFSYCLRSDMADGGASPILFGSLAKLT 261
Query: 269 Q----QSTS-----FLASNGKYITYIIGV-----ETCCIGSSC-LKQTSFKA--IVDSGS 311
+ QST +L + Y + G+ E GS+ QT IVDSG+
Sbjct: 262 EGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGT 321
Query: 312 SFTFLPKEVYETIAAEFDRQVND----TITSFEGYPWKCCYKSSS---QRLPKLPSVKLM 364
+ T+L K+ Y + F Q+ + T S Y CYK S+ + ++P + L
Sbjct: 322 TLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALR 381
Query: 365 FPQNNSFVVNNPVFVIY-GTQV-----VTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFD 416
F + N PV + G + VT CL + P D I IG +++D
Sbjct: 382 FAGGAKY--NVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYD 439
Query: 417 RENLKLGWSHSNCQDL 432
+ ++ ++C L
Sbjct: 440 IDGGMFSFAPADCAKL 455
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 145/381 (38%), Gaps = 66/381 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLD-RDLNEYSPSASSTSKHLSCSHRL 148
S+ L D G DL W+ C C + S + R + + SS+ K + C +
Sbjct: 93 SQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDM 151
Query: 149 CDLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDIL--HLISGGDNALKNSV 198
C + T+C P PC Y DY Y++ +++ G + + L G L N
Sbjct: 152 CKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVELKEGRKMKLHN-- 207
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 253
V+IGC G A DG++GLG + S + A FS C K
Sbjct: 208 ---VLIGCSESFQGQSFQ--AADGVMGLGYSKYSFA--IKAAEKFGGKFSYCLVDHLSHK 260
Query: 254 DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVET---------CCIGSSCLK----- 299
+ S + FG + +S L +N Y ++G+ IG + LK
Sbjct: 261 NVSNYLTFG-----SSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 300 ---QTSFKAIVDSGSSFTFLPKEVYETIAAEFD------RQVNDTITSFEGYPWKCCYKS 350
+ + I+DSGSS TFL + Y+ + A R+V I P + C+ S
Sbjct: 316 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIG-----PLEYCFNS 370
Query: 351 SSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVT--GFCLAIQPVDGDIGTIGQNFM 408
+ +P + F F +VI V GF P +G I Q
Sbjct: 371 TGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQ-- 428
Query: 409 TGYRVVFDRENLKLGWSHSNC 429
+ FD KLG++ S+C
Sbjct: 429 -NHLWEFDLGLKKLGFAPSSC 448
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 145/381 (38%), Gaps = 66/381 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLD-RDLNEYSPSASSTSKHLSCSHRL 148
S+ L D G DL W+ C C + S + R + + SS+ K + C +
Sbjct: 93 SQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDM 151
Query: 149 CDLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDIL--HLISGGDNALKNSV 198
C + T+C P PC Y DY Y++ +++ G + + L G L N
Sbjct: 152 CKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVELKEGRKMKLHN-- 207
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 253
V+IGC G A DG++GLG + S + A FS C K
Sbjct: 208 ---VLIGCSESFQGQSFQ--AADGVMGLGYSKYSFA--IKAAEKFGGKFSYCLVDHLSHK 260
Query: 254 DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVET---------CCIGSSCLK----- 299
+ S + FG + +S L +N Y ++G+ IG + LK
Sbjct: 261 NVSNYLTFG-----SSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 300 ---QTSFKAIVDSGSSFTFLPKEVYETIAAEFD------RQVNDTITSFEGYPWKCCYKS 350
+ + I+DSGSS TFL + Y+ + A R+V I P + C+ S
Sbjct: 316 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIG-----PLEYCFNS 370
Query: 351 SSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVT--GFCLAIQPVDGDIGTIGQNFM 408
+ +P + F F +VI V GF P +G I Q
Sbjct: 371 TGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQ-- 428
Query: 409 TGYRVVFDRENLKLGWSHSNC 429
+ FD KLG++ S+C
Sbjct: 429 -NHLWEFDLGLKKLGFAPSSC 448
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 141/350 (40%), Gaps = 45/350 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS- 154
D G D+ W+ C C C Y D + PS S++ +SC + C DL T+
Sbjct: 184 DTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSASYAAVSCDSQRCRDLDTAA 233
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + G + L L G + N V IGCG G +
Sbjct: 234 CRNATGACLYEV-AYGDGSYTVGDFATETLTL--GDSTPVGN-----VAIGCGHDNEGLF 285
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD--SGRIFFGDQGPATQQS 271
V GL+ LG G +S PS ++ ++FS C D+D + + FGD
Sbjct: 286 ---VGAAGLLALGGGPLSFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTV 337
Query: 272 TSFLASNGKYIT-YIIGVETCCIGSSCLK-----------QTSFKAIVDSGSSFTFLPKE 319
T+ L + + T Y + + +G L S IVDSG++ T L
Sbjct: 338 TAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSA 397
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + F + + + CY S + ++P+V L F + + ++
Sbjct: 398 AYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYL 457
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I T +CLA P + + IG G RV FD +G++ + C
Sbjct: 458 IPVDGAGT-YCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 104/427 (24%), Positives = 161/427 (37%), Gaps = 83/427 (19%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTG-PQFQMLF 85
F+ LIHR S + N + S P Y + + V K++ G P F++
Sbjct: 30 FTMDLIHRRSNASSRV---SNTQSGSSP------YANTVFDNSVYLMKLQVGTPPFEI-- 78
Query: 86 PSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 144
D G ++ W C CV C +A ++ PS SST K
Sbjct: 79 --------QAIIDTGSEITWTQCLPCVHCYEQNAPIFD----------PSKSSTFKE--- 117
Query: 145 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVI 203
+ CD CPY +DY+ + L E I LH SG + V I
Sbjct: 118 --KRCD--------GHSCPYEVDYFDHTYTMGTLATETITLHSTSG-----EPFVMPETI 162
Query: 204 IGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
IGCG S P G++GL G S+ + G S CF + +I F
Sbjct: 163 IGCGHNNS-----WFKPSFSGMVGLNWGPSSL--ITQMGGEYPGLMSYCFSGQGTSKINF 215
Query: 262 GDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGS 311
G ST+ + K Y + ++ +G++ ++ T+F A ++DSG+
Sbjct: 216 GANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGT 275
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
+ T+ P + + V + CY S + + P + + F
Sbjct: 276 TLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDI--FPVITMHFSGGVDL 333
Query: 372 VVNNPVFVIYGTQVVTG-FCLAI---QPVDGDI-GTIGQ-NFMTGYRVVFDRENLKLGWS 425
V++ + +Y G FCLAI P I G Q NF+ GY D +L + +S
Sbjct: 334 VLDK--YNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGY----DSSSLLVSFS 387
Query: 426 HSNCQDL 432
+NC L
Sbjct: 388 PTNCSAL 394
>gi|145351657|ref|XP_001420185.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580418|gb|ABO98478.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 498
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 95/411 (23%), Positives = 158/411 (38%), Gaps = 68/411 (16%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD---CVRCAPLSASYYN----SLDRDLN 129
+G +F + G + L D G L + PC C YY+ R LN
Sbjct: 63 SGHEFFLTVELAGKQKFDLEVDTGSPLTYFPCKGCPLEVCGIHEHPYYDYDMSKTFRKLN 122
Query: 130 EYSPSASSTSKHLSCSHR----LCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 185
+ST C+ + LCD S N C + + Y + + G + ED
Sbjct: 123 ----CTTSTEDAAYCNAQPNVLLCDTNISYTNT---CLFGIGY-VDGSVGRGYMAEDTFT 174
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDG--VAPDGLIGLGLGEISVPSLLAKAGLI 243
L GD A + GCG Y DG + DG+ G G + + LAKAG+I
Sbjct: 175 L---GDEL----APAKITFGCGGMY---YPDGSNLRQDGMAGFSRGNTAFHTQLAKAGVI 224
Query: 244 -RNSFSMCFDKDDS-------GRIFFGDQGPATQQSTSFLASNG---KYITYIIGVETCC 292
+ F C + ++ GR FG + P T L + + +++ +G +T
Sbjct: 225 DAHVFGFCSEGMETSTAMLTLGRYNFGRRVPELAW-TRMLGEDDLAVRTMSWKLGDKT-- 281
Query: 293 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 352
I SS ++ ++DSG++ T LP ++ + S C Y++
Sbjct: 282 IASS----SNVYTVLDSGTTLTVLPSAMHHDFMTHLNETARSAGLSVVVRGTHCFYENQR 337
Query: 353 Q------RLPK-LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQP------VDGD 399
Q L + PS+ + + + + V+ ++ T + FC I +G+
Sbjct: 338 QSSLTQYTLTRWFPSLTITYDPDVTLVLRPENYLFADTVNLHAFCAGIMSASDAALANGE 397
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNP 450
+GQ + V +D EN ++G + C+ L + P TP NP
Sbjct: 398 QIILGQQTLRNTFVEYDLENSRVGMATVQCEKLREKF------APDTPHNP 442
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/366 (21%), Positives = 144/366 (39%), Gaps = 63/366 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
+KT ++ D L W+ C+ C+ + ++P+ASST K + C L
Sbjct: 136 AKTHNVLVDTASSLSWVGCEPCINACLIPT------------FNPNASSTYKVVGCGSAL 183
Query: 149 CDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
C+ SC P + C Y Y+ + + S G++ D L G
Sbjct: 184 CNAVPSATMARKSCMAPTEGCSYRQSYH-DYSLSVGVVSSDTLTYGLGSQK--------- 233
Query: 202 VIIGCG--MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR 258
I GC + GG G+ +G+ + + S+ S + R + S CF + G
Sbjct: 234 FIFGCCNLFRGVGGRYSGI-----LGMSVNKFSLFSQMTVGHRYR-AMSYCFPHPRNQGF 287
Query: 259 IFFG--DQGPATQQSTSFLASNGKYITYI--IGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
+ FG D+ + + T Y ++ + VET + + + D+G+ +T
Sbjct: 288 LQFGRYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSLDVQSSGNQTMRCFFDTGTPYT 347
Query: 315 FLPKEVYETIAAEFDRQVNDTITSF-EGY------PWKCCYKSSSQRLP---KLPSVKLM 364
LP+ ++ +++ DT+ + EGY + C+++ + +P+VK+
Sbjct: 348 MLPQSLFVSLS--------DTVGNLVEGYYRVGASTGQTCFQADGNWIEGDLYMPTVKIE 399
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
F +N+ + V FCLA + DG +G + G V D E + +G
Sbjct: 400 FQNGARITLNSEDLMFMEEPNV--FCLAFKMNDGGDIVLGSRHLMGVHTVVDLEMMTMGL 457
Query: 425 SHSNCQ 430
C
Sbjct: 458 RGQGCN 463
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 141/349 (40%), Gaps = 46/349 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS 154
D G D+ W+ C C C Y D + P+ASST ++C + C +S
Sbjct: 38 DTGSDINWLQCQPCTDC-------YQQTDP---IFDPTASSTYAPVTCQSQQCSSLEMSS 87
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C++ + C Y ++Y + + E + G ++KN V +GCG G +
Sbjct: 88 CRSGQ--CLYQVNYGDGSYTFGDFATESVSF---GNSGSVKN-----VALGCGHDNEGLF 137
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR--IFFGDQGPATQQS 271
+ GL G L SL + L SFS C ++D +G + F
Sbjct: 138 VGAAGLLGLGGGPL------SLTNQ--LKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSV 189
Query: 272 TSFLASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLPKEV 320
T+ L N K T Y +G+ +G + +++F+ IVD G++ T L +
Sbjct: 190 TAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQA 249
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y + F R + + + CY S Q ++P+V F S+ + ++I
Sbjct: 250 YNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLI 309
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T +C A P + IG G RV FD N ++G+S + C
Sbjct: 310 PVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 106/275 (38%), Gaps = 52/275 (18%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ ++L D G DL+W C C C +++L AS T+ + CS +C
Sbjct: 112 QRVALTLDTGSDLVWTQCACHVCFAQPFPTFDAL----------ASQTTLAVPCSDPICT 161
Query: 151 LG----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS----GGDNALKNSVQASV 202
G + C C Y D Y + + +SG +VED S G A +V
Sbjct: 162 SGKYPLSGCTFNDNTCFYLYD-YADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVPNV 220
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---I 259
GCG G + + G+ G G +S+PS L A FS CF R +
Sbjct: 221 RFGCGQYNKGIFKSNES--GIAGFSRGPMSLPSQLKVA-----RFSHCFTAIADARTSPV 273
Query: 260 FFGDQ-GP--------ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------ 304
F G GP QST F SNG Y + ++ +G + L +
Sbjct: 274 FLGGAPGPDNLGAHATGPVQSTPFANSNGSL--YYLTLKGITVGKTRLPLNALAFAGKGT 331
Query: 305 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVN 333
I+DSG+ LP +Y ++ A F +V
Sbjct: 332 GSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVK 366
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 129/317 (40%), Gaps = 49/317 (15%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLCDL-GTS 154
D G D+LW+ C C C D DL + PS SST L + CD G
Sbjct: 119 DTGSDILWVMCTPCTNC-----------DNDLGLLFDPSKSSTFSPLCKTP--CDFEGCR 165
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C P P+T+ Y +N+++SG D + + + + S V+ GCG + G+
Sbjct: 166 CD----PIPFTVTY-ADNSTASGTFGRDTVVFETTDEGTSRIS---DVLFGCG--HNIGH 215
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----DDSGRIFFGDQGPATQ 269
+G++GL G SL+ K G FS C + ++ G+
Sbjct: 216 DTDPGHNGILGLNNGP---DSLVTKLG---QKFSYCIGNLADPYYNYHQLILGEGADLEG 269
Query: 270 QSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYE 322
ST F NG Y + +G + I + +A I+D+GS+ TFL V++
Sbjct: 270 YSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHK 329
Query: 323 TIAAEFDRQVNDTI--TSFEGYPW-KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
++ E + + + E PW +C Y S S+ L P V F +++ F
Sbjct: 330 LLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSGSFF 389
Query: 380 IYGTQVVTGFCLAIQPV 396
V FC+ + PV
Sbjct: 390 NQLNDNV--FCMTVGPV 404
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/348 (22%), Positives = 140/348 (40%), Gaps = 40/348 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSC 155
D G L W+ +C P ++ +D + PSAS+T + L CS C L +
Sbjct: 138 DTGSSLSWL-----QCKPCVVYCHSQVD---PLFEPSASNTYRPLYCSSSECSLLKAATL 189
Query: 156 QNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+P C YT Y + + S G L D+L L + S GCG
Sbjct: 190 NDPLCTASGVCVYTAS-YGDASYSMGYLSRDLLTLT-------PSQTLPSFTYGCGQDNE 241
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCFDKDDS---GRIFFGDQGPA 267
G L G A G++GL ++S+ + L+ K G +FS C S G + G P+
Sbjct: 242 G--LFGKAA-GIVGLARDKLSMLAQLSPKYGY---AFSYCLPTSTSSGGGFLSIGKISPS 295
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEVYET 323
+ + T + ++ Y + + + + + I+DSG+ T LP +Y
Sbjct: 296 SYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLPISIYAA 355
Query: 324 IAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
+ F + ++ Y C+K S + + P ++++F + P +I
Sbjct: 356 LREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEA 415
Query: 383 TQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + CLA + I IG + Y + +D K+G++ C+
Sbjct: 416 DKGIA--CLAFASSN-QIAIIGNHQQQTYNIAYDVSASKIGFAPGGCR 460
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 144/384 (37%), Gaps = 77/384 (20%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
S+ + L D D W C+P +SL ++P+ SS+ L CS C
Sbjct: 89 SQQLLLALDTSADATW-----AHCSPCGTCPSSSL------FAPANSSSYASLPCSSSWC 137
Query: 150 DL--GTSCQNPK-----QPCPYTMDYYT-----ENTSSSGLLVEDILHLISGGDNALKNS 197
L G +C P+ P P T+ + S L D L L G +A+ N
Sbjct: 138 PLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRL---GKDAIPN- 193
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDK--- 253
GC + G + GL+GLG G ++ LL++AG + N FS C
Sbjct: 194 ----YTFGC-VSSVTGPTTNMPRQGLLGLGRGPMA---LLSQAGSLYNGVFSYCLPSYRS 245
Query: 254 ---DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------Q 300
S R+ G P + + T L + + Y + V +G + +K
Sbjct: 246 YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAA 305
Query: 301 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL 358
T +VDSG+ T VY + EF RQV TS + C+ +
Sbjct: 306 TGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAF--DTCFNTDEVAAGGA 363
Query: 359 PS--------VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI----QPVDGDIGTIGQN 406
P+ V L P N+ + ++ + CLA+ Q V+ + I
Sbjct: 364 PAVTVHMDGGVDLALPMENTLIHSSATPLA---------CLAMAEAPQNVNSVVNVIANL 414
Query: 407 FMTGYRVVFDRENLKLGWSHSNCQ 430
RVVFD N ++G++ +C
Sbjct: 415 QQQNIRVVFDVANSRVGFAKESCN 438
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/353 (23%), Positives = 145/353 (41%), Gaps = 50/353 (14%)
Query: 95 LGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG- 152
L D D WIPC C C SA+ ++ P+AS++ + + C LC
Sbjct: 127 LAVDTSNDASWIPCAGCAGCPTSSAAPFD----------PAASASYRTVPCGSPLCAQAP 176
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+C + C +++ Y ++S L +D L + NA+K + GC + +
Sbjct: 177 NAACPPGGKACGFSLTY--ADSSLQAALSQDSLAV---AGNAVK-----AYTFGCLQRAT 226
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQG-P 266
G P GL+GLG G +S L + +FS C + SG + G G P
Sbjct: 227 G---TAAPPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQP 281
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------AIVDSGSSFTFLPKEV 320
++T LA+ + Y + + +G + +F ++DSG+ FT L
Sbjct: 282 QRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPA 341
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP----SVKLMFPQNNSFVVNNP 376
Y + E R+V ++S G+ C+ +++ P + +++ P+ N + +
Sbjct: 342 YVAVRDEVRRRVGAPVSSLGGF--DTCFNTTAVAWPPMTLLFDGMQVTLPEENVVIHST- 398
Query: 377 VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
YGT A V+ + I +RV+FD N ++G++ C
Sbjct: 399 ----YGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
Length = 445
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 117/289 (40%), Gaps = 54/289 (18%)
Query: 135 ASSTSKHLSCSHRLCDL------GTSCQNPKQPCPY-TMDYYTEN----TSSSGLLVEDI 183
SST K C C+L G PK C T + N TS+SG L +DI
Sbjct: 80 VSSTYKPARCRSAQCNLAGSKSCGECFDGPKPGCNNNTCGLFPYNPFIRTSTSGELAQDI 139
Query: 184 LHLIS-GGDNALKNSVQASVIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAKA 240
+ + S G N K +VI CG S L+G+A G+ GLG +I++PS A A
Sbjct: 140 ISIQSTNGSNPSKVVSFPNVIFTCG---STFLLEGLASGVTGIAGLGRKKIALPSQFAAA 196
Query: 241 GLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSFLASNGKYI---------------- 282
+ F++C +G +FFGD GP ++ N Y
Sbjct: 197 FSFKRKFALCLSSSTRATGVVFFGD-GPYIMLPNKDVSQNLIYTPLILNPVSTAGASFEG 255
Query: 283 ----TYIIGVETCCIGSSCLK-QTSFKAIVDSGSS---------FTFLPKEVYETIAAEF 328
Y IGV+ + +K TS +I G+ +T L +Y+ + F
Sbjct: 256 EPSADYFIGVKGIKVNGEDVKLNTSLLSIAKDGTGGTKISTTQPYTSLETSIYKAVIGAF 315
Query: 329 DRQVNDTITSFEGYPWKCCYKS---SSQRL-PKLPSVKLMFPQNNSFVV 373
+ V P++ C+ S SS R+ P +P + L+ P N ++ +
Sbjct: 316 GKAVAKVPRVTAVAPFELCFNSTSFSSTRVGPGVPQIDLVLPNNKAWTI 364
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 90/370 (24%), Positives = 137/370 (37%), Gaps = 45/370 (12%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F + +K M L D G D+ WI C+ C C S +N P++
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFN----------PTS 208
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
SST K L+CS C L + C Y + Y + + + G L D + G +
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVS-YGDGSFTVGELATDTVTF--GNSGKIN 265
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKD 254
N V +GCG G + +L+ ++ SFS C
Sbjct: 266 N-----VALGCGHDNEGLFTGAAGL---------LGLGGGVLSITNQMKATSFSYCLVDR 311
Query: 255 DSGR---IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSS--CLKQTSFK---- 304
DSG+ + F +T+ L N K T Y +G+ +G L F
Sbjct: 312 DSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDAS 371
Query: 305 ----AIVDSGSSFTFLPKEVYETIAAEFDR-QVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
I+D G++ T L + Y ++ F + VN S + CY SS K+P
Sbjct: 372 GSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVP 431
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
+V F S + ++I T FC A P + IG G R+ +D
Sbjct: 432 TVAFHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLSK 490
Query: 420 LKLGWSHSNC 429
+G S + C
Sbjct: 491 NVIGLSGNKC 500
>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
Length = 500
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 99/398 (24%), Positives = 153/398 (38%), Gaps = 96/398 (24%)
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL- 151
++L D G LW+ C+ N Y+ SST + + C C L
Sbjct: 62 LNLVVDLGGKFLWVDCE-------------------NHYT---SSTYRPVRCPSAQCSLA 99
Query: 152 -----GTSCQNPKQPCPYTMDYYTENT----SSSGLLVEDILHLIS-GGDNALKNSVQAS 201
G +PK C T +NT ++ G L ED+L + S G N +N V +
Sbjct: 100 KSDSCGDCFSSPKPGCNNTCGLIPDNTITHSATRGDLAEDVLSIQSTSGFNTGQNVVVSR 159
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 261
+ C L G A G+ GLG +I++PS LA A + + F+ CF D G I F
Sbjct: 160 FLFSCAPTSLLRGLAGGA-SGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSD-GVIIF 217
Query: 262 GDQGPATQQSTSFLASNGKY--------------------------------ITYIIGVE 289
GD GP SFLA N + Y IGV+
Sbjct: 218 GD-GPY-----SFLADNPSLPNVVFDSKSLTYTPLLINHVSTASAFLQGESSVEYFIGVK 271
Query: 290 TCCI-GSSCLKQTSFKAIVDSG---------SSFTFLPKEVYETIAAEFDR-QVNDTITS 338
T I G +S +I + G +T L +Y+ + F + V IT+
Sbjct: 272 TIKIDGKVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNITT 331
Query: 339 FEGY-PWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF---CLAIQ 394
+ P++ CY S LP P + P + NN ++ ++G + L +
Sbjct: 332 EDSSPPFEFCY--SFDNLPGTP-LGASVPTIELLLQNNVIWSMFGANSMVNINDEVLCLG 388
Query: 395 PVDGDIGTIGQNFMTGYRVV-----FDRENLKLGWSHS 427
V+G + + GY++ FD +LG+S++
Sbjct: 389 FVNGGVNLRTSIVIGGYQLENNLLQFDLAASRLGFSNT 426
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 143/359 (39%), Gaps = 51/359 (14%)
Query: 90 SKTMSLGNDFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
++TM L D D+ W+ PC C P +D+ Y P+ SS+S SC+
Sbjct: 143 TQTMVL--DTASDVTWVQCSPCPTPPCYP---------QKDV-LYDPTKSSSSGVFSCNS 190
Query: 147 RLC-DLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C LG C N Q C Y + Y + TS++G + D+L + A++ S
Sbjct: 191 PTCTQLGPYANGCTNNNQ-CQYRVRY-PDGTSTAGTYISDLLTITPA--TAVR-----SF 241
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GC G + G + G++ LG G S+ S A FS CF + R FF
Sbjct: 242 QFGCSHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRVFSHCFPPP-TRRGFFT 298
Query: 263 DQGPATQQSTSFLASNGKYIT-----YIIGVETCCIGSS--CLKQTSFKA--IVDSGSSF 313
P L K Y++ +E + + T F A +DS ++
Sbjct: 299 LGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAI 358
Query: 314 TFLPKEVYETIAAEF-DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
T LP Y+ + F DR +G P CY + R LP + L+F N+ V
Sbjct: 359 TRLPPTAYQALRQAFRDRMAMYQPAPPKG-PLDTCYDMAGVRSFALPRITLVF-DKNAAV 416
Query: 373 VNNPVFVIYGTQVVTGFCLAIQ--PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+P V++ CLA P D G IG + V+++ +G+ H+ C
Sbjct: 417 ELDPSGVLFQG------CLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/352 (23%), Positives = 139/352 (39%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C +C S +N P SS+ L CS +LC S
Sbjct: 113 DTGSDLIWTQCQPCTQCFNQSTPIFN----------PQGSSSFSTLPCSSQLCQALQSPT 162
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GYL 215
C YT Y + + + G + + L G ++ N + GCG G G
Sbjct: 163 CSNNSCQYTYGY-GDGSETQGSMGTETLTF---GSVSIPN-----ITFGCGENNQGFGQG 213
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFG---DQGPATQ 269
+G GL+G+G G +S+PS L FS C +S + G + A
Sbjct: 214 NGA---GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSNSSTLLLGSLANSVTAGS 265
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK---------AIVDSGSSFTFLPK 318
+T+ + S+ Y I + +GS+ L + FK I+DSG++ T+
Sbjct: 266 PNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVD 325
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
Y+ + F Q+N ++ + + C++ S Q ++P+ + F + + +
Sbjct: 326 NAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENY 385
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F+ ++ CLA+ + G VV+D N + + + C
Sbjct: 386 FISPSNGLI---CLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 153/381 (40%), Gaps = 93/381 (24%)
Query: 89 GSKTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 144
G+ T + D G L+ IP + CV P+ Y PS STS ++C
Sbjct: 129 GNTTFLVQVDTGSLLMAIPLEGCNTCVESRPV--------------YHPS--STSTKVAC 172
Query: 145 SHRLCDLGTSCQNPK-------QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 197
S C G+ P + C + + Y + + SG + ED+++L
Sbjct: 173 SSDQCK-GSGSTPPSCSRTSSGESCDFQIRY-GDGSHVSGYIYEDVVNLAG--------- 221
Query: 198 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VP----SLLAKAGLIRNSFSMCFD 252
+Q G +++G + + DG+IG G S VP SL++ GL +N F M +
Sbjct: 222 LQGKANFGANDEETGDF-EYPRADGIIGFGRTCSSCVPTVWDSLVSDLGL-KNQFGMLLN 279
Query: 253 KDDSGRIFFGDQG-----------PATQQSTSF--LASNGKYITYIIGVETCCIGSSCLK 299
+ G + G+ P Q++T F + S G I + I S L
Sbjct: 280 YEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKSTG------IRINDYTIPGSKLG 333
Query: 300 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQ------VNDTITSFEGYPWKCCYKSSSQ 353
Q + IVDSGS+ L Y+ + F V + F+G CY SS
Sbjct: 334 Q---EVIVDSGSTALSLASGAYDQLRNYFQTHYCSIQGVCENPNIFQG---SICY-SSDD 386
Query: 354 RLPKLPSVKLMF---------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIG 404
L K P++ F P+N ++V P+ T G+C I+ D + +G
Sbjct: 387 VLSKFPTLYFTFDGGVQVAIPPKN--YLVKAPL-----TNGKYGYCFMIERADSTMTILG 439
Query: 405 QNFMTGYRVVFDRENLKLGWS 425
FM GY VFD N ++G++
Sbjct: 440 DVFMRGYYTVFDNVNDRVGFA 460
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 144/352 (40%), Gaps = 51/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-C 155
D G D W V+C P A Y + + P+ S+T ++SCS C DL S C
Sbjct: 179 DTGSDTTW-----VQCQPCVAYCYRQKE---PLFDPTKSATYANISCSSSYCSDLYVSGC 230
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + + G +D L L + +KN GCG K G L
Sbjct: 231 SGGH--CLYGIQY-GDGSYTIGFYAQDTLTLAY---DTIKN-----FRFGCGEKNRG--L 277
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGP----ATQQ 270
G A GL+GLG G+ S+P K G + F+ C +G F D GP A +
Sbjct: 278 FGRA-AGLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGFL-DLGPGAPAANAR 332
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEVYETIA 325
T L G Y +G+ +G L ++ +VDSG+ T LP Y +
Sbjct: 333 LTPMLVDRGPTF-YYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYAPLR 391
Query: 326 AEFDRQVNDTITSFEGYP----WKCCYKSSSQRLP--KLPSVKLMFPQNNSFVVNNPVFV 379
+ F + + + P CY + + LP+V L+F Q + + + +
Sbjct: 392 SAFSKAMQG--LGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF-QGGACLDVDASGI 448
Query: 380 IYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+Y V+ CLA P D D+ +G + V++D +G++ C
Sbjct: 449 LY-VADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 146/362 (40%), Gaps = 76/362 (20%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC------DL 151
D G D+ W+ C R S+ +++ P SST SCS C D
Sbjct: 143 DTGSDVSWVHCH-ARAGAGSSLFFD----------PGKSSTYTPFSCSSAACTRLEGRDN 191
Query: 152 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG-MKQ 210
G S + C YT+ Y + ++++G D L L S ++N GC
Sbjct: 192 GCSLNST---CQYTV-RYGDGSNTTGTYGSDTLALNS--TEKVEN-----FQFGCSETSD 240
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
G LD DGL+GLG G PSL+++ A ++FS C PAT
Sbjct: 241 PGEGLDEDQTDGLMGLGGG---APSLVSQTAATYGSAFSYCL--------------PATT 283
Query: 270 QSTSFL---ASNGK--YIT------------YIIGVETCCIGSS--CLKQTSFKA--IVD 308
+S+ FL AS G ++T Y + ++ +G + T F A I+D
Sbjct: 284 RSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGSIMD 343
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 368
SG+ T LP Y ++A F + + C+ + Q +P+V+L+F
Sbjct: 344 SGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVF-SG 402
Query: 369 NSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLKLGWSHS 427
+ V + ++YG+ CLA P G IG+I N + V+ D LG+
Sbjct: 403 GAVVDLDADGIMYGS------CLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRPG 456
Query: 428 NC 429
C
Sbjct: 457 AC 458
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 133/333 (39%), Gaps = 51/333 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT L D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 SFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSERGFFS 165
Query: 262 --------GDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAI 306
G + AT+ + T +A + + + + L + S K +
Sbjct: 166 KTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGV 225
Query: 307 V-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
V DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 VFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHF 284
Query: 366 PQNNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTE 317
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 90/370 (24%), Positives = 137/370 (37%), Gaps = 45/370 (12%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F + +K M L D G D+ WI C+ C C S +N P++
Sbjct: 159 SGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFN----------PTS 208
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
SST K L+CS C L + C Y + Y + + + G L D + G +
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVS-YGDGSFTVGELATDTVTF--GNSGKIN 265
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKD 254
N V +GCG G + +L+ ++ SFS C
Sbjct: 266 N-----VALGCGHDNEGLFTGAAGL---------LGLGGGVLSITNQMKATSFSYCLVDR 311
Query: 255 DSGR---IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSS--CLKQTSFK---- 304
DSG+ + F +T+ L N K T Y +G+ +G L F
Sbjct: 312 DSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDAS 371
Query: 305 ----AIVDSGSSFTFLPKEVYETIAAEFDR-QVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
I+D G++ T L + Y ++ F + VN S + CY SS K+P
Sbjct: 372 GSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVP 431
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
+V F S + ++I T FC A P + IG G R+ +D
Sbjct: 432 TVAFHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLSK 490
Query: 420 LKLGWSHSNC 429
+G S + C
Sbjct: 491 NVIGLSGNKC 500
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 142/358 (39%), Gaps = 51/358 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++ MS+ D G D+ W V+CAP +A +S L + P+ S+T SCS C
Sbjct: 142 TQVMSI--DTGSDVSW-----VQCAPCAAQSCSSQKDKL--FDPAKSATYSAFSCSSAQC 192
Query: 150 ----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
G C N C Y + Y ++++++G D L L + +A+KN G
Sbjct: 193 AQLGGEGNGCLNSH--CQYIVKY-VDHSNTTGTYGSDTLGLTT--SDAVKN-----FQFG 242
Query: 206 CGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
C + +G G LDG+ +GLG + + A +FS C S F
Sbjct: 243 CSHRANGFVGQLDGL-------MGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLT 295
Query: 264 QGPATQQSTSFLASNGKYITYII----GVETCCIGSSCLKQT------SFKAIVDSGSSF 313
G A ++S S + + + GV I + K S ++VDSG+
Sbjct: 296 LGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVDSGTVI 355
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T LP Y+ + F +++ ++ C+ S + ++P V L F + +
Sbjct: 356 TQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRGAVMDL 415
Query: 374 NNPVFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G CLA DGD G +G + ++FD LG+ C
Sbjct: 416 DVSGIFYAG-------CLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 141/365 (38%), Gaps = 59/365 (16%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+TM + D D W PC C+ C+ +S SST L CS C
Sbjct: 106 QTMYMVLDTSNDAAWAPCSGCIGCS------------STTTFSAQNSSTFATLDCSKPEC 153
Query: 150 D--LGTSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
G SC C + Y ++T S+ LV+D LHL G N + N GC
Sbjct: 154 TQARGLSCPTTGNVDCLFNQTYGGDSTFSA-TLVQDSLHL---GPNVIPN-----FSFGC 204
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG-LIRNSFSMCFDKDDS----GRIFF 261
SG + P GL+GLG G +S L++++G L FS C S G +
Sbjct: 205 ISSASG---SSIPPQGLMGLGRGPLS---LISQSGSLYSGLFSYCLPSFKSYYFSGSLKL 258
Query: 262 GDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSG 310
G G P ++T L + + Y + + +G + T I+DSG
Sbjct: 259 GPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSG 318
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ-RLP----KLPSVKLMF 365
+ T +Y + EF +QV + + + C+ ++++ P L + L
Sbjct: 319 TVITRFVPAIYTAVRDEFRKQVGGSFSPLGAF--DTCFATNNEVSAPAITLHLSGLDLKL 376
Query: 366 PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
P NS + ++ G+ A V+ + I +R++FD N KLG +
Sbjct: 377 PMENSLIHSSA-----GSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIA 431
Query: 426 HSNCQ 430
C
Sbjct: 432 RELCN 436
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 142/359 (39%), Gaps = 72/359 (20%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC- 149
+ +SL D G DL W +C P + S Y D + PS S++ +++C+ LC
Sbjct: 157 RDLSLIFDTGSDLTW-----TQCEPCARSCYKQQDV---IFDPSKSTSYSNITCTSALCT 208
Query: 150 DLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
L T+ C + C Y + Y +++ S G + L + + V + +
Sbjct: 209 QLSTATGNDPGCSASTKACIYGIQY-GDSSFSVGYFSRERLTVTA-------TDVVDNFL 260
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
GCG + + G G A GLIGLG IS + A R FS C
Sbjct: 261 FGCG-QNNQGLFGGSA--GLIGLGRHPISF--VQQTAAKYRKIFSYCL------------ 303
Query: 264 QGPATQQSTSFL----ASNGKYITY-----------IIGVETCCIGSSCLK----QTSFK 304
P+T ST L A+ G+Y+ Y G++ I +K ++F
Sbjct: 304 --PSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFS 361
Query: 305 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
AI+DSG+ T LP Y + + F + ++ ++ E CY S ++ +P++
Sbjct: 362 TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTI 421
Query: 362 KLMFPQNNSFVVNNP----VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
+ F V P +FV QV F A D D+ G VV+D
Sbjct: 422 EFSFA--GGVTVKLPPQGILFVASTKQVCLAF--AANGDDSDVTIYGNVQQRTIEVVYD 476
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 140/358 (39%), Gaps = 50/358 (13%)
Query: 93 MSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD- 150
+SL D G DL W C CVR D+ ++PS S++ ++SCS C
Sbjct: 145 LSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSYYNVSCSSAACGS 195
Query: 151 ----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
G + C Y + Y + + S G L ++ L + + V V GC
Sbjct: 196 LSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTN-------SDVFDGVYFGC 247
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQ 264
G + + G GVA GL+GLG ++S PS A A FS C S G + FG
Sbjct: 248 G-ENNQGLFTGVA--GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSASYTGHLTFGSA 302
Query: 265 G----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 314
G TSF N IT +G + I S+ A++DSG+ T
Sbjct: 303 GISRSVKFTPISTITDGTSFYGLNIVAIT--VGGQKLPIPSTVFSTPG--ALIDSGTVIT 358
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
LP + Y + + F +++ T+ C+ S + +P V F +
Sbjct: 359 RLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELG 418
Query: 375 NP-VFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +F ++ V CLA D + G VV+D ++G++ + C
Sbjct: 419 SKGIFYVFKISQV---CLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 142/373 (38%), Gaps = 59/373 (15%)
Query: 95 LGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LG 152
L D DL W+ C C RC P S ++ P S++ ++ C LG
Sbjct: 156 LALDTASDLTWLQCQPCRRCYPQSGPVFD----------PRHSTSYGEMNYDAPDCQALG 205
Query: 153 TSC--QNPKQPCPYTM-----DYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
S + C YT+ D + ++S G LVE+ L G QA + IG
Sbjct: 206 RSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGG-------VRQAYLSIG 258
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG------RI 259
CG G L G G++GL G+IS+P +A G SFS C SG +
Sbjct: 259 CGHDNKG--LFGAPAAGILGLSRGQISIPHQIAFLGY-NASFSYCLVDFISGPGSPSSTL 315
Query: 260 FFG----DQGPATQQSTSFLASNGKYITYI--IGVETCCIGSSCLKQTSFK--------- 304
FG D P + + L N Y+ IGV + + + +
Sbjct: 316 TFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGG 375
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCY----KSSSQRLPK 357
I+DSG++ T L + Y F G P + CY ++ + K
Sbjct: 376 VILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVK 435
Query: 358 LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFD 416
+P+V + F + ++I T C A D + IG G+RVV+D
Sbjct: 436 VPAVSMHFAGGVELSLQPKNYLITVDSRGT-VCFAFAGTGDRSVSVIGNILQQGFRVVYD 494
Query: 417 RENLKLGWSHSNC 429
++G++ ++C
Sbjct: 495 IGGQRVGFAPNSC 507
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 134/333 (40%), Gaps = 51/333 (15%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQS-----------RSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G++SV L ++ + FS C S R FF
Sbjct: 110 TFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSERGFFS 165
Query: 262 --------GDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAI 306
G + AT+ + T +A + + + + L + S K +
Sbjct: 166 KTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGV 225
Query: 307 V-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
V DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 VFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHF 284
Query: 366 PQNNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTE 317
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 101/459 (22%), Positives = 173/459 (37%), Gaps = 67/459 (14%)
Query: 27 FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTG---PQFQM 83
K + R + + +G +N ++ AK+S + +V+ ++ + + M++ M
Sbjct: 64 MQAKDLFRHEQMITMMGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSALNIAHVGM 123
Query: 84 LFPSQGSKTMSLGN----DFGCDLLWIPCDCVRCAPLSASYY--NSLDRDL--------- 128
S T +L D DL WI C R +Y S+ + +
Sbjct: 124 YLVSVRIGTPALPYNLVLDTATDLTWINC---RLRRRKGKHYGRQSMGQTMSVGGEGATA 180
Query: 129 -------NEYSPSASSTSKHLSCSHRLCDL--GTSCQNP--KQPCPYTMDYYTENTSSSG 177
N Y P+ SS+ + + CS + C + +CQ+P + C Y + T + G
Sbjct: 181 AKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIG 239
Query: 178 LL-VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
+ E +S G + + +I+GC + ++GG +D A DG++ LG G++S
Sbjct: 240 IYGKEKATVTVSDG----RMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVH 293
Query: 237 LAKAGLIRNSFSMCF-----DKDDSGRIFFGDQ----GPATQQSTSFL------ASNGKY 281
AK FS C +D S + FG GP T ++ A K
Sbjct: 294 AAKR--FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKV 351
Query: 282 ITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 338
++G E I F I+D+ +S T L E Y + A DR ++
Sbjct: 352 TGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRV 411
Query: 339 FEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG------FCLA 392
+E ++ CYK + P+ + P + VV CLA
Sbjct: 412 YELEGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLA 471
Query: 393 IQP-VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + G G +G FM Y D + K+ + C
Sbjct: 472 FRKLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKCN 510
>gi|294461400|gb|ADE76261.1| unknown [Picea sitchensis]
Length = 165
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 52/117 (44%), Gaps = 22/117 (18%)
Query: 27 FSTKLIHRFSEEVKALGVSKN-RNATSWPAKKSFEYYQVLLSSDVQKQKMKTG------- 78
+S ++ H+FS EVK ++ + WP + S EYY+ L D + K
Sbjct: 28 YSLQMYHKFSNEVKEWMTWRHGLDTDGWPVEGSNEYYKALYHHDSARHGRKLADHPSLTF 87
Query: 79 ---------PQFQMLFPSQ-----GSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYY 121
PQ LF S + T+ + D G D+ W+PCDC CAP SA+ Y
Sbjct: 88 LEGNETVEIPQLGFLFYSMVQVGTPNVTLFVALDTGSDVFWVPCDCQACAPTSAASY 144
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 142/359 (39%), Gaps = 51/359 (14%)
Query: 90 SKTMSLGNDFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
++TM L D D+ W+ PC C P +D+ Y P+ SS+S SC+
Sbjct: 168 TQTMVL--DTASDVTWVQCSPCPTPPCYP---------QKDV-LYDPTKSSSSGVFSCNS 215
Query: 147 RLC-DLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C LG C N Q C Y + Y + TS++G + D+L + A++ S
Sbjct: 216 PTCTQLGPYANGCTNNNQ-CQYRVRY-PDGTSTAGTYISDLLTITPA--TAVR-----SF 266
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GC G + G + G++ LG G S+ S A FS CF + R FF
Sbjct: 267 QFGCSHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRVFSHCFPPP-TRRGFFT 323
Query: 263 DQGPATQQSTSFLASNGKYIT-----YIIGVETCCIGSS--CLKQTSFKA--IVDSGSSF 313
P L K Y++ +E + + T F A +DS ++
Sbjct: 324 LGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAI 383
Query: 314 TFLPKEVYETIAAEF-DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
T LP Y+ + F DR +G P CY + R LP + L+F +N +
Sbjct: 384 TRLPPTAYQALRQAFRDRMAMYQPAPPKG-PLDTCYDMAGVRSFALPRITLVFDKNAAVE 442
Query: 373 VNNPVFVIYGTQVVTGFCLAIQ--PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ + G CLA P D G IG + V+++ +G+ H+ C
Sbjct: 443 LDPSGVLFQG-------CLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 92/401 (22%), Positives = 161/401 (40%), Gaps = 71/401 (17%)
Query: 75 MKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 133
+ +G F + K SL D G DL W+ C C C + +Y+ P
Sbjct: 155 LGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYD----------P 204
Query: 134 SASSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDI---L 184
S++ K+++C+ C L +S C++ Q CPY Y + ++ VE L
Sbjct: 205 KTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNL 264
Query: 185 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
GG + K +++ GCG G + L+GLG G +S S L L
Sbjct: 265 TTTEGGSSEYK---VGNMMFGCGHWNRGLFSGASG---LLGLGRGPLSFSSQLQ--SLYG 316
Query: 245 NSFSMCF-----DKDDSGRIFFGDQGPATQQS----TSFL--ASNGKYITYIIGVETCCI 293
+SFS C + + S ++ FG+ + TSF+ N Y I +++ +
Sbjct: 317 HSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILV 376
Query: 294 GSSCLK--QTSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 343
G L + ++ I+DSG++ ++ + YE I +F ++ + F +P
Sbjct: 377 GGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFP 436
Query: 344 -WKCCY-----KSSSQRLPKL-----PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLA 392
C+ + ++ LP+L FP NSF+ + V CLA
Sbjct: 437 VLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLV----------CLA 486
Query: 393 IQPV-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
I IG + +++D + +LG++ + C D+
Sbjct: 487 ILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCADI 527
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 135/332 (40%), Gaps = 77/332 (23%)
Query: 59 FEYYQVLLSSDVQKQKMKTGPQFQM--------LFP-SQGSKTMSLGN-----------D 98
F+ +LLS+ + + + PQ + LFP S G+ ++SL D
Sbjct: 91 FKTINLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFD 150
Query: 99 FGCDLLWIPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC------DL 151
G L+W PC RC+ S Y + ++++ P SS+ K + C + C +L
Sbjct: 151 TGSSLVWFPCTAGYRCSRCSFPYVDP--ATISKFVPKLSSSVKVVGCRNPKCAWIFGPNL 208
Query: 152 GTSCQNPKQP-------CP-YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
+ C+N CP Y + Y + T+ G+L+ + L L +N +
Sbjct: 209 KSRCRNCNSKSRKCSDSCPGYGLQYGSGATA--GILLSETLDL--------ENKRVPDFL 258
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
+GC + + P G+ G G G S+PS + S FD D
Sbjct: 259 VGCSV------MSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLD 312
Query: 264 QGPATQQST--SFL---------ASNGKYITYI-IGVETCCIGSSCLKQTSFK------- 304
G + +S SF+ SN + Y + + IG +K +K
Sbjct: 313 SGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVK-FPYKYLVPDST 371
Query: 305 ----AIVDSGSSFTFLPKEVYETIAAEFDRQV 332
AI+DSGS+FTFL K ++E IA E ++Q+
Sbjct: 372 GNGGAIIDSGSTFTFLDKPIFEAIADELEKQL 403
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/352 (23%), Positives = 138/352 (39%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C +C S +N P SS+ L CS +LC S
Sbjct: 113 DTGSDLIWTQCQPCTQCFNQSTPIFN----------PQGSSSFSTLPCSSQLCQALQSPT 162
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GYL 215
C YT Y + + + G + + L G ++ N + GCG G G
Sbjct: 163 CSNNSCQYTYGY-GDGSETQGSMGTETLTF---GSVSIPN-----ITFGCGENNQGFGQG 213
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFG---DQGPATQ 269
+G GL+G+G G +S+PS L FS C S + G + A
Sbjct: 214 NGA---GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTSSTLLLGSLANSVTAGS 265
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK---------AIVDSGSSFTFLPK 318
+T+ + S+ Y I + +GS+ L + FK I+DSG++ T+
Sbjct: 266 PNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFAD 325
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
Y+ + F Q+N ++ + + C++ S Q ++P+ + F + + +
Sbjct: 326 NAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENY 385
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F+ ++ CLA+ + G VV+D N + + + C
Sbjct: 386 FISPSNGLI---CLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 145/381 (38%), Gaps = 66/381 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLD-RDLNEYSPSASSTSKHLSCSHRL 148
S+ L D G DL W+ C C + S + R + + SS+ K + C +
Sbjct: 22 SQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDM 80
Query: 149 CDLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDIL--HLISGGDNALKNSV 198
C + T+C P PC Y DY Y++ +++ G + + L G L N
Sbjct: 81 CKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVELKEGRKMKLHN-- 136
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 253
V+IGC G A DG++GLG + S + A FS C K
Sbjct: 137 ---VLIGCSESFQGQSFQ--AADGVMGLGYSKYSFA--IKAAEKFGGKFSYCLVDHLSHK 189
Query: 254 DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVET---------CCIGSSCLK----- 299
+ S + FG + +S L +N Y ++G+ IG + LK
Sbjct: 190 NVSNYLTFG-----SSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 244
Query: 300 ---QTSFKAIVDSGSSFTFLPKEVYETIAAEFD------RQVNDTITSFEGYPWKCCYKS 350
+ + I+DSGSS TFL + Y+ + A R+V I P + C+ S
Sbjct: 245 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIG-----PLEYCFNS 299
Query: 351 SSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVT--GFCLAIQPVDGDIGTIGQNFM 408
+ +P + F F +VI V GF P +G I Q
Sbjct: 300 TGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQ-- 357
Query: 409 TGYRVVFDRENLKLGWSHSNC 429
+ FD KLG++ S+C
Sbjct: 358 -NHLWEFDLGLKKLGFAPSSC 377
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 155/401 (38%), Gaps = 62/401 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ ++L D G DL+W PC C + ++ + + S S S S +H
Sbjct: 87 QLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSCQSPACSAAHASMS 146
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD---NALKNSVQASVIIGCG 207
C + CP +DY E + S + G N + ++ S +
Sbjct: 147 SSNLCAISR--CP--LDY-IETSDCSSFSCPPFYYAYGDGSFVANLYQQTLSLSSLHLQN 201
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC-----FDKDDSGR--- 258
+ P G+ G G G +S+P+ L+ + + N FS C FD D R
Sbjct: 202 FTFGCAHTALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDGDRLRRPSP 261
Query: 259 IFFGDQ-----GPATQQSTSF----LASNGKY-ITYIIGVETCCIGS------SCLKQTS 302
+ G G +S F + SN K+ Y +G+ +G LK+
Sbjct: 262 LILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKRTVPAPEILKRVD 321
Query: 303 FKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC----CYKSSSQR 354
K +VDSG++FT LP+ Y + EFD++VN K CY +
Sbjct: 322 EKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTGLGPCYYLNG-- 379
Query: 355 LPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG----------FCLAIQ------PVDG 398
L ++P +KL F NNS VV Y + + G C+ + +DG
Sbjct: 380 LSQIPVLKLHFVGNNSDVVLPRKNYFY--EFMDGGDGIRRKGKVGCMMLMNGEDETELDG 437
Query: 399 DIG-TIGQNFMTGYRVVFDRENLKLGWSHSNCQDLNDGTKS 438
G T+G G+ VV+D E ++G++ C L D S
Sbjct: 438 GPGATLGNYQQQGFEVVYDLEKERVGFAKKECALLWDSLNS 478
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT L D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 SFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 154/401 (38%), Gaps = 70/401 (17%)
Query: 62 YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASY 120
+Q LL + V M +L T + D G DL+W C C +C
Sbjct: 75 FQALLENGVGGYNMNISVGTPLL-------TFPVVADTGSDLIWTQCAPCTKC------- 120
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYT---MDYYTENTSSSG 177
+ + P++SST L C+ C N + C T +Y + ++G
Sbjct: 121 ---FQQPAPPFQPASSSTFSKLPCTSSFCQF---LPNSIRTCNATGCVYNYKYGSGYTAG 174
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
L + L + GD + SV GC + G + G+ GLG G +S L+
Sbjct: 175 YLATETLKV---GDASFP-----SVAFGCSTENG----VGNSTSGIAGLGRGALS---LI 219
Query: 238 AKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQ---QSTSFLASNGKYITYI-IGVET 290
+ G+ R FS C + I FG T QST F+ + + +Y + +
Sbjct: 220 PQLGVGR--FSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTG 277
Query: 291 CCIGSSCLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+G + L T+ IVDSG++ T+L K+ YE + F Q + T
Sbjct: 278 ITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVN 337
Query: 340 EGYPWKCCYKSSSQRLP-KLPSVKLMFPQNNSFVVNNPVFVIYGTQV-----VTGFCLAI 393
C+KS+ +PS+ L F + V P + G + VT CL +
Sbjct: 338 GTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAV--PTY-FAGVETDSQGSVTVACLMM 394
Query: 394 QPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
P GD + IG +++D + +S ++C +
Sbjct: 395 LPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADCAKV 435
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 93/388 (23%), Positives = 144/388 (37%), Gaps = 56/388 (14%)
Query: 78 GPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA 135
G F + S +K L D G L W+ CD C+ C + Y E +
Sbjct: 36 GHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKP------ELKYAV 89
Query: 136 SSTSKHLSCSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNA 193
T + C+ DL + PK C Y + Y SS G+L+ D L S G N
Sbjct: 90 KCTEQR--CADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTNP 145
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RNSFSMCF 251
S+ GCG Q + P +G++GLG G++++ S L G+I ++ C
Sbjct: 146 ------TSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCI 199
Query: 252 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS---SCLKQTSFKAIVD 308
G +FFGD T T + N ++ Y T S S + + I D
Sbjct: 200 SSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFD 258
Query: 309 SGSSFTFLPKEVYE-----------------TIAAEFDRQV------NDTITSFEGYPWK 345
SG+++T+ + Y T E DR + D I + + K
Sbjct: 259 SGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEV--K 316
Query: 346 CCYKSSSQRLPK-LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIG 404
C++S S + L P + +++ V G ++ G P IG
Sbjct: 317 KCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLG--ILDGS--KEHPSLAGTNLIG 372
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNCQDL 432
M V++D E LGW + C +
Sbjct: 373 GITMLDQMVIYDSERSLLGWVNYQCDRI 400
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 85/356 (23%), Positives = 139/356 (39%), Gaps = 64/356 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPS-----ASSTSKHLSCSHRLCD- 150
D G DL+W+ CD C C DL+ + + ASS+ K L C+ C
Sbjct: 23 DTGSDLVWLKCDNCDHC-------------DLDHHGETIFFSDASSSYKKLPCNSTHCSG 69
Query: 151 -----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
+G C+ + C Y +Y + + +SG + D + S G S + G
Sbjct: 70 MSSAGIGPRCE---ETCKYKYEY-GDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFG 125
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFG 262
C K G D GLIGLG S+ L + FS C +D S + F
Sbjct: 126 CARKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSPPSAKSFLF 180
Query: 263 DQGPATQQSTSFLAS---NGKYIT---YIIGVETCCIG--------------SSCLKQTS 302
A + +++ +G ++ Y + +++ IG +S +
Sbjct: 181 LGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFLA 240
Query: 303 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSV 361
K ++DSG+++T L VYE + + QV T+ + G C+ SS PSV
Sbjct: 241 NKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAG--LDLCFNSSGDTSYGFPSV 298
Query: 362 KLMFPQNNSFVVN-NPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
F V+ +F + VV CL++ GD+ IG + +++D
Sbjct: 299 TFYFANQVQLVLPFENIFQVTSRDVV---CLSMDSSGGDLSIIGNMQQQNFHILYD 351
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 130/343 (37%), Gaps = 37/343 (10%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL W+ C C C Y D + PS S+T + C + C +C
Sbjct: 206 DTGSDLSWVQCKPCNNC-------YKQHD---PLFDPSQSTTYSAVPCGAQECLDSGTCS 255
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+ K C Y + Y + + + G L D L L D + GCG +G L
Sbjct: 256 SGK--CRYEV-VYGDMSQTDGNLARDTLTLGPSSDQL------QGFVFGCGDDDTG--LF 304
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--DSGRIFFGDQG--PATQQST 272
G A DGL GLG +S+ S A FS C G + G P Q +
Sbjct: 305 GRA-DGLFGLGRDRVSLAS--QAAARYGAGFSYCLPSSWRAEGYLSLGSAAAPPHAQFTA 361
Query: 273 SFLASNGKYITYIIGVETCCIGSSC-LKQTSFKA---IVDSGSSFTFLPKEVYETIAAEF 328
S+ Y+ V G + + FKA ++DSG+ T LP Y + + F
Sbjct: 362 MVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRLPSRAYSALRSSF 421
Query: 329 DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN--NPVFVIYGTQVV 386
+ + CY + + ++PSV L+F + + ++V +Q
Sbjct: 422 AGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYVANRSQAC 481
Query: 387 TGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F A D +G +G + VV+D N K+G+ C
Sbjct: 482 LAF--ASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGC 522
>gi|406861825|gb|EKD14878.1| aspartic-type endopeptidase [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 480
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 124/291 (42%), Gaps = 46/291 (15%)
Query: 221 DGLIGLG--LGEISV-----------PSLLAKAGLIRNS-FSMCFDKDD--SGRIFFG-- 262
+G++G+G + E+ V PS + + GLI++S +S+ + D +G I FG
Sbjct: 174 EGILGIGYEINEVQVGRAGQKAYRNLPSQMVEDGLIKSSAYSLWLNDLDANTGSILFGGV 233
Query: 263 DQGPATQ--QSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV-DSGSSFTFLPKE 319
D G T QS A G Y+ ++I + G + + +A++ DSGSS T+LP
Sbjct: 234 DTGKYTGSLQSLPVQAERGSYVEFLITLTEVSFGDTVIASNQAQAVLLDSGSSLTYLPDP 293
Query: 320 VYETIAAEFDRQVNDTIT------SFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
+ E I + D Q + S G +K S + +P +L+ P ++
Sbjct: 294 IAEAIYEQIDAQYESSEDVAYVPCSLAGATTTINFKFSGPVI-AVPMNELVIPAESA--S 350
Query: 374 NNPVFVIYGTQVVTGFCL-AIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSH------ 426
P+ GT CL I P D +G F+ +V+D N ++ +
Sbjct: 351 GRPLTFSDGTPS----CLFGIAPAGSDTSVLGDTFIRSAYIVYDLANNEISLAQTNFNST 406
Query: 427 -SNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVGPAVAGRAPSK 476
SN ++ GT S P SNP+ A+ ++ G + G A SK
Sbjct: 407 ISNVVEITTGTAS--VPDATAVSNPVAADSGDAA--GKTGTNGLGGTATSK 453
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 140/363 (38%), Gaps = 65/363 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLCDLGTSC 155
D G D+LWI C+ C C D L + PS SST L C G C
Sbjct: 119 DTGSDILWIMCNPCTNC-----------DNHLGLLFDPSMSSTFSPL-CKTPCGFKGCKC 166
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
P P+T+ Y +N+S+SG DIL + + S + VIIGCG + G+
Sbjct: 167 D----PIPFTISY-VDNSSASGTFGRDILVFETTDEGT---SQISDVIIGCG--HNIGFN 216
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----DDSGRIFFGDQGPATQQ 270
+G++GL G P+ LA I FS C + ++ G+
Sbjct: 217 SDPGYNGILGLNNG----PNSLATQ--IGRKFSYCIGNLADPYYNYNQLRLGEGADLEGY 270
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEV 320
ST F +G Y + G+ +G L + + I+DSG++ T+L
Sbjct: 271 STPFEVYHGFYYVTMEGIS---VGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSA 327
Query: 321 YETIAAEFDRQVNDTITS--FEGYPWKCCYKS-SSQRLPKLPSVKLMFPQNNSFVVNNPV 377
++ + E + + FE PWK CY S+ L P V F ++
Sbjct: 328 HKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDTGS 387
Query: 378 FVIYGTQVVTGFCLAIQPVD--------GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F +Q FC+ + P IG + Q Y V +D N + + +C
Sbjct: 388 FF---SQRDDIFCMTVSPASILNTTISPSVIGLLAQQ---SYNVGYDLVNQFVYFQRIDC 441
Query: 430 QDL 432
+ L
Sbjct: 442 ELL 444
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 84/378 (22%), Positives = 152/378 (40%), Gaps = 72/378 (19%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL----- 151
D G DL W+ C C+ C D+ + P+ASS+ ++++C + C L
Sbjct: 169 DTGSDLNWLQCAPCLDC----------FDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPE 218
Query: 152 -GTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGM 208
+C+ P + CPY Y ++ ++ L +E ++L + G + + V+ GCG
Sbjct: 219 PPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD----DVVFGCGH 274
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG 265
G + GL L S L A G ++FS C D + ++ FG+
Sbjct: 275 WNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVDHGSDVASKVVFGEDD 329
Query: 266 --------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------------FKA 305
P + AS+ Y + ++ +G L +S
Sbjct: 330 ALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGGT 389
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLM 364
I+DSG++ ++ + Y+ I F ++ + +P CY S P++P + L+
Sbjct: 390 IIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDRPEVPELSLL 449
Query: 365 --------FPQNNSFVVNNPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFMTGYRVV 414
FP N F+ +P ++ CLA+ P G + IG + VV
Sbjct: 450 FADGAVWDFPAENYFIRLDPDGIM---------CLAVLGTPRTG-MSIIGNFQQQNFHVV 499
Query: 415 FDRENLKLGWSHSNCQDL 432
+D +N +LG++ C ++
Sbjct: 500 YDLKNNRLGFAPRRCAEV 517
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 144/352 (40%), Gaps = 51/352 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-C 155
D G D W V+C P A Y + + P+ S+T ++SCS C DL S C
Sbjct: 114 DTGSDTTW-----VQCQPCVAYCYRQKE---PLFDPTKSATYANISCSSSYCSDLYVSGC 165
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + + G +D L L + +KN GCG K G L
Sbjct: 166 SGGH--CLYGIQY-GDGSYTIGFYAQDTLTLAY---DTIKN-----FRFGCGEKNRG--L 212
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGP----ATQQ 270
G A GL+GLG G+ S+P K G + F+ C +G F D GP A +
Sbjct: 213 FGRAA-GLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGFL-DLGPGAPAANAR 267
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEVYETIA 325
T L G Y +G+ +G L ++ +VDSG+ T LP Y +
Sbjct: 268 LTPMLVDRGPTF-YYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYAPLR 326
Query: 326 AEFDRQVNDTITSFEGYP----WKCCYKSSSQRLP--KLPSVKLMFPQNNSFVVNNPVFV 379
+ F + + + P CY + + LP+V L+F Q + + + +
Sbjct: 327 SAFSKAMQG--LGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF-QGGACLDVDASGI 383
Query: 380 IYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+Y V+ CLA P D D+ +G + V++D +G++ C
Sbjct: 384 LY-VADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 143/365 (39%), Gaps = 52/365 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL W+ C C+ C D+ + P AS++ ++++C C L +
Sbjct: 168 DTGSDLNWLQCAPCLDC----------FDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPA 217
Query: 157 NPK-------QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
P+ PCPY +Y + ++++G L L + A + V++GCG +
Sbjct: 218 APRTCRSSRSDPCPYYY-WYGDQSNTTGDLA---LEAFTVNLTASSSRRVDGVVLGCGHR 273
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGP 266
G + GL L S L A G ++FS C S +I FGD
Sbjct: 274 NRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HAFSYCLVDHGSAVGSKIVFGDDNV 328
Query: 267 ATQQS----TSFLASNGKYITYIIGVETCCIGSSCL-----------KQTSFKAIVDSGS 311
T+F S + Y + ++ +G L + S I+DSG+
Sbjct: 329 LLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGT 388
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNN- 369
+ ++ P+ Y+ I F +++ +P CY S ++P L+F
Sbjct: 389 TLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAV 448
Query: 370 -SFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
F N F+ T+ + CLA+ + IG + V++D + +LG++
Sbjct: 449 WDFPAEN-YFIRLDTEGI--MCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPR 505
Query: 428 NCQDL 432
C ++
Sbjct: 506 RCAEV 510
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 145/353 (41%), Gaps = 37/353 (10%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K +L D G D+ W +C P + Y + LN PS S++ K++SCS LC
Sbjct: 82 KEFTLIFDTGSDITW-----TQCEPCVKTCYKQKEPRLN---PSTSTSYKNISCSSALCK 133
Query: 151 LGTSCQNPKQPCP-----YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
L S + Q C Y + Y + + S G + L L S N KN + G
Sbjct: 134 LVASGKKFSQSCSSSTCLYQVQ-YGDGSYSIGFFATETLTLSS--SNVFKN-----FLFG 185
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGD 263
CG + + GL+GLG ++++PS AK + FS C S G + G
Sbjct: 186 CGQQNN---GLFGGAAGLLGLGRTKLALPSQTAKT--YKKLFSYCLPASSSSKGYLSLGG 240
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFLPKE 319
Q + + T A Y + + +G L +++F A ++DSG+ T L
Sbjct: 241 QVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLSPT 300
Query: 320 VYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
Y +++ F + D S GY + CY S ++P V + F ++
Sbjct: 301 AYSELSSAFQNLMTD-YPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSG- 358
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSNC 429
++Y + CLA D D T G Y+VV+D ++G++ C
Sbjct: 359 ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411
>gi|156065227|ref|XP_001598535.1| hypothetical protein SS1G_00624 [Sclerotinia sclerotiorum 1980]
gi|154691483|gb|EDN91221.1| hypothetical protein SS1G_00624 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 482
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 79/341 (23%), Positives = 136/341 (39%), Gaps = 44/341 (12%)
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC----GM 208
T C + PC Y ++S+ L D G A + V + IG +
Sbjct: 105 TLCSERRSPCQTAGTYSANSSSTYAYLASDFNISYVDGSGASGDYVTDTFTIGSTTLDKL 164
Query: 209 KQSGGYLDGVAPDGLIGLG--LGEISV-----------PSLLAKAGLIR-NSFSMCFDKD 254
+ GY +P+G++G+G + E+ V P+ + GLI N+FS+ +
Sbjct: 165 QFGIGYTSS-SPEGILGIGYEINEVQVGRARKSAYKNLPAQMVADGLINSNAFSLWLNDL 223
Query: 255 DS--GRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLKQ-TSFKAIV 307
DS G + FG A ++ +G Y ++I + +G+ + Q S ++
Sbjct: 224 DSSTGSVLFGGVDTARYHGQLETLPIQKESGYYAEFLITLTEVTLGNLVIAQDQSLAVLL 283
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL----PSVKL 363
DSGSS T+LP + E I + D Q + + EG + C +S+ P++++
Sbjct: 284 DSGSSLTYLPDAMAEAIYEQVDAQYDYS----EGAAYVPCSLASNSSALNFTFTSPTIQV 339
Query: 364 MFPQNNSFVVNNPVFVIYGTQVV----TGFCL-AIQPVDGDIGTIGQNFMTGYRVVFDRE 418
+ V+ PV G Q+ T CL I P +G F+ VV+D
Sbjct: 340 TMDE---LVI--PVTSSNGQQLRFTDGTAACLFGIAPAGESTAVLGDTFIRSAYVVYDLA 394
Query: 419 NLKLGWSHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSS 459
N ++ + +N T P+ L +N +S
Sbjct: 395 NNEISLAQTNFNATATNVVEITTGTSAVPNAALVSNAATAS 435
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 45/348 (12%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-C 155
D G D W V+C P Y + + P+ SST ++SC+ C DL T+ C
Sbjct: 181 DTGSDTTW-----VQCRPCVVKCYKQKE---PLFDPAKSSTYANVSCTDSACADLDTNGC 232
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + + G +D L + +A+K GCG K +G +
Sbjct: 233 TGGH--CLYAVQY-GDGSYTVGFFAQDTLTI---AHDAIKG-----FRFGCGEKNNGLFG 281
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS---- 271
GL+GLG G+ S+ + +F+ C +G + D GP + +
Sbjct: 282 KTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYCLPALTTGTGYL-DFGPGSAGNNARL 335
Query: 272 TSFLASNGKYITYI------IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 325
T L G+ Y+ +G + + S ++ +VDSG+ T LP Y ++
Sbjct: 336 TPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF--STAGTLVDSGTVITRLPATAYTALS 393
Query: 326 AEFDR-QVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN--NPVFVIY 381
+ FD+ + GY CY + +LP+V L+F V+ V+ I
Sbjct: 394 SAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAIS 453
Query: 382 GTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
QV F A D + +G Y V++D +G++ +C
Sbjct: 454 EAQVCLAF--ASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|256271970|gb|EEU06988.1| Yps1p [Saccharomyces cerevisiae JAY291]
Length = 569
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 129/285 (45%), Gaps = 44/285 (15%)
Query: 169 YTENTSSSGLLVEDILHL----ISGGDNALKNSVQASV-IIGCGMKQ-SGGYLDGVAPDG 222
Y + T +SG D+L L ++G A+ N +++ ++G G+ + Y A G
Sbjct: 211 YGDGTFASGTFGTDVLDLSDLNVTGLSFAVANETNSTMGVLGIGLPELEVTYSGSTASHG 270
Query: 223 LIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKDDS--GRIFFGDQGPATQQSTSF----- 274
G + P +L +G I+ N++S+ + D+ G I FG + T +
Sbjct: 271 --GKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTILFGAVDHSKYTGTLYTIPIV 328
Query: 275 --LASNG-----KYITYIIGVETCCIGSS--CLKQTSFKAIVDSGSSFTFLPKEVYETIA 325
L+++G ++ I G+ GSS L T A++DSG++ T+LP+ V IA
Sbjct: 329 NTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPALLDSGTTLTYLPQTVVSMIA 388
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV--FVIYGT 383
E Q + I GY C P S++++F F +N P+ F++
Sbjct: 389 TELGAQYSSRI----GYYVLDC--------PSDDSMEIVF-DFGGFHINAPLSSFIL--- 432
Query: 384 QVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKLGWSHS 427
T L I P D GTI G +F+T VV+D ENL++ + +
Sbjct: 433 STGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEISMAQA 477
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 139/355 (39%), Gaps = 45/355 (12%)
Query: 90 SKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
++T ++ D G D+ W+ C VRC ++ PS SST +++SC+
Sbjct: 26 TRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFD----------PSLSSTYRNVSCTEP 75
Query: 148 LCDLGTSCQN-PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C +G S + C Y + +Y + +S+ G L D L KN I GC
Sbjct: 76 AC-VGLSTRGCSSSTCLYGV-FYGDGSSTIGFLAMDTFMLTPA--QKFKN-----FIFGC 126
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEI-SVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG 265
G + G G A GL+GLG S+ S +A + + N FS C S +
Sbjct: 127 GQNNT-GLFQGTA--GLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATGYLNIGN 181
Query: 266 PA-TQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA---IVDSGSSFTFLPKE 319
P T T+ L Y I + +G + L T F++ I+DSG+ T LP
Sbjct: 182 PQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRLPPT 241
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + + + CY S P + L F + + VF
Sbjct: 242 AYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATGVFF 301
Query: 380 IYGTQVVTGFCLAI-----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ + V CLA + G IG + Q M V +D E ++G+S C
Sbjct: 302 VFNSSQV---CLAFAGNTDSTMIGIIGNVQQLTM---EVTYDNELKRIGFSAGAC 350
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 103/485 (21%), Positives = 173/485 (35%), Gaps = 100/485 (20%)
Query: 6 LTIYLAVFWLLTESSGAETVMFSTK--LIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQ 63
L Y +F LL ++ T + + L H V K R T W
Sbjct: 9 LLAYALIFTLLFTAAATPTAGLTMRADLTH----------VDKGRGFTRWERLSRMAVRS 58
Query: 64 VLLSSDVQKQKMKTG-PQFQMLFPSQGS------------KTMSLGNDFGCDLLWIPCD- 109
++ + ++ G P PS G + ++L D G DL+W C
Sbjct: 59 RARAASLYQRGGHYGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTP 118
Query: 110 CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT-----SCQNPKQPCPY 164
C C D+ + PS SST + ++C +C + +C C Y
Sbjct: 119 CPVC----------FDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFY 168
Query: 165 TMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLI 224
Y + + ++G + +D +S + + GCG +G + + G+
Sbjct: 169 LCSY-GDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNES--GIA 225
Query: 225 GLGLGEISVPSLLAKAGLIRNSFSMCFDKDD------SGRIFFG---------DQGPATQ 269
G G G +S+PS L + G FS C D + +F G GP
Sbjct: 226 GFGRGPLSLPSQL-RVG----RFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGPF-- 278
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKE 319
+ST + S Y + +E +G + L K S ++DSG+ T P
Sbjct: 279 RSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAA 338
Query: 320 VYETIAAEFDRQV----NDTITSFEGYPWKCCYK--SSSQRLP------KLPSVKLMFPQ 367
V+E + EF Q+ D + C++ +++P L S + P+
Sbjct: 339 VFEQLKNEFVAQLPLPRYDNTSEVGNL---LCFQRPKGGKQVPVPKLIFHLASADMDLPR 395
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHS 427
N + V+ CL I + D+ IG +V+D EN KL ++ +
Sbjct: 396 ENYIPEDTDSGVM---------CLMINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASA 446
Query: 428 NCQDL 432
C +
Sbjct: 447 QCDKM 451
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 145/353 (41%), Gaps = 37/353 (10%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K +L D G D+ W +C P + Y + LN PS S++ K++SCS LC
Sbjct: 130 KEFTLIFDTGSDITW-----TQCEPCVKTCYKQKEPRLN---PSTSTSYKNISCSSALCK 181
Query: 151 LGTSCQNPKQPCP-----YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
L S + Q C Y + Y + + S G + L L S N KN + G
Sbjct: 182 LVASGKKFSQSCSSSTCLYQVQ-YGDGSYSIGFFATETLTLSS--SNVFKN-----FLFG 233
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGD 263
CG + + GL+GLG ++++PS AK + FS C S G + G
Sbjct: 234 CGQQNN---GLFGGAAGLLGLGRTKLALPSQTAKT--YKKLFSYCLPASSSSKGYLSLGG 288
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFLPKE 319
Q + + T A Y + + +G L +++F A ++DSG+ T L
Sbjct: 289 QVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPT 348
Query: 320 VYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
Y +++ F + D S GY + CY S ++P V + F ++
Sbjct: 349 AYSELSSAFQNLMTD-YPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSG- 406
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSNC 429
++Y + CLA D D T G Y+VV+D ++G++ C
Sbjct: 407 ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 459
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 96/404 (23%), Positives = 149/404 (36%), Gaps = 73/404 (18%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC- 149
+ +SL D G DL+W PC C N+ + P SST++ + C C
Sbjct: 94 QHVSLYLDTGSDLVWFPCKPFECILCEGKAENT---TASTPPPRLSSTARSVHCKSSACS 150
Query: 150 ----DLGTSCQNPKQPCPY----TMDYYTENTSS------SGLLVEDILHLISGGDNALK 195
+L TS CP T D ++ + S G LV + H A
Sbjct: 151 AAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATP 210
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC---- 250
+ + GC + P G+ G G G +S+P+ LA A + N FS C
Sbjct: 211 SLSLHNFTFGCA------HTALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSH 264
Query: 251 -FDKDD---SGRIFFGDQGPATQQS---------TSFLASNGKYITYIIGVETCCIGSSC 297
F+ D + G ++ TS L + Y +G+E IG
Sbjct: 265 SFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKK 324
Query: 298 L----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC- 346
+ ++ S +VDSG++FT LP +Y ++ AEFD +V + K
Sbjct: 325 IPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTG 384
Query: 347 ---CYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY---------------GTQVVTG 388
CY + + +PS+ L F N S VV Y G ++
Sbjct: 385 LGPCYYYDT--VVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMN 442
Query: 389 FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ G T+G G+ VV+D E ++G++ C L
Sbjct: 443 GGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCASL 486
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 121/306 (39%), Gaps = 58/306 (18%)
Query: 92 TMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-- 149
TM L D G +L W+ C R + + + P AS+T + C C
Sbjct: 75 TMVL--DTGSELSWLLCATGR----------AAAAAADSFRPRASATFAAVPCGSARCSS 122
Query: 150 -DLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
DL SC + C ++ Y + ++S G L D+ A+ ++ GC
Sbjct: 123 RDLPAPPSCDAASRRCRVSLSY-ADGSASDGALATDVF--------AVGDAPPLRSAFGC 173
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFG--- 262
D VA GL+G+ G + S + +A R FS C D+DD+G + G
Sbjct: 174 MSAAYDSSPDAVATAGLLGMNRGAL---SFVTQASTRR--FSYCISDRDDAGVLLLGHSD 228
Query: 263 ------DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVD 308
+ P Q + +A + + + +G + I S L A +VD
Sbjct: 229 LPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVD 288
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCYKSSSQRLP---KLP 359
SG+ FTFL + Y + AEF +Q + + E + C++ R P +LP
Sbjct: 289 SGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLP 348
Query: 360 SVKLMF 365
V L+F
Sbjct: 349 PVTLLF 354
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 101/240 (42%), Gaps = 38/240 (15%)
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFG 262
GCG G + G DG++GLG G++S S A + FS C ++DS G + FG
Sbjct: 171 FGCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFG 226
Query: 263 DQGPATQQS------------TSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKA 305
++ AT QS TS L +G Y ++ + +G+ L S
Sbjct: 227 EK--ATSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDIS---VGNKRLNVPSSVFASPGT 281
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS----FEGYPWKCCYKSSSQRLPKLPSV 361
I+DSG+ T LP+ Y + A F + + S +G CY S ++ LP +
Sbjct: 282 IIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEI 341
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI-----QPVDGDIGTIGQNFMTGYRVVFD 416
L F + +N VI+G + CLA ++ ++ IG V++D
Sbjct: 342 VLHFGEGADVRLNGKR-VIWGND-ASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYD 399
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 79/352 (22%), Positives = 139/352 (39%), Gaps = 50/352 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C +C S +N P SS+ L CS +LC +S
Sbjct: 113 DTGSDLIWTQCQPCTQCFNQSTPIFN----------PQGSSSFSTLPCSSQLCQALSSPT 162
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GYL 215
C YT Y + + + G + + L G ++ N + GCG G G
Sbjct: 163 CSNNFCQYTYGY-GDGSETQGSMGTETLTF---GSVSIPN-----ITFGCGENNQGFGQG 213
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFG---DQGPATQ 269
+G GL+G+G G +S+PS L FS C S + G + A
Sbjct: 214 NGA---GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTPSNLLLGSLANSVTAGS 265
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCL-----------KQTSFKAIVDSGSSFTFLPK 318
+T+ + S+ Y I + +GS+ L + I+DSG++ T+
Sbjct: 266 PNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVN 325
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ-RLPKLPSVKLMFPQNNSFVVNNPV 377
Y+++ EF Q+N + + + C+++ S ++P+ + F + + +
Sbjct: 326 NAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENY 385
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
F+ ++ CLA+ + G VV+D N + ++ + C
Sbjct: 386 FISPSNGLI---CLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 78/349 (22%), Positives = 135/349 (38%), Gaps = 40/349 (11%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--S 154
D DL+W+ C C C P +D + P SST +LSC + C
Sbjct: 108 DTASDLIWVQCSPCETCFP----------QDTPLFEPHKSSTFANLSCDSQPCTSSNIYY 157
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C C YT + Y + +S+ G+L + +H S + I GCG +
Sbjct: 158 CPLVGNLCLYT-NTYGDGSSTKGVLCTESIHFGS------QTVTFPKTIFGCGSNNDFMH 210
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQQ- 270
G++GLG G +S+ S L I + FS C F + ++ FG+ T
Sbjct: 211 QISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKLKFGNDTTITGNG 268
Query: 271 --STSFLASNGKYITYIIGVETCCIGSSCLK-----QTSFKAIVDSGSSFTFLPKEVYET 323
ST + Y + + IG L+ T+ I+D G+ T+L Y
Sbjct: 269 VVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTYLEVNFYHN 328
Query: 324 IAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 382
+ + T + YP+ C+ + + + K++F + V +P + +
Sbjct: 329 FVTLLREALGISETKDDIPYPFDFCFPNQAN----ITFPKIVFQFTGAKVFLSPKNLFFR 384
Query: 383 TQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ CLA+ P G ++V +DR+ K+ ++ ++C
Sbjct: 385 FDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADC 433
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 144/369 (39%), Gaps = 69/369 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHLS---CSHRLCDLG 152
D G D+LW+ C C C D L + PS SST L C + C
Sbjct: 119 DTGSDILWVMCTPCTNC-----------DNHLGLLFDPSMSSTFSPLCKTPCDFKGC--- 164
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
+ C P P+T+ Y +N+++SG+ D + + + S V+ GCG +
Sbjct: 165 SRCD----PIPFTVTY-ADNSTASGMFGRDTVVFETTDEGT---SRIPDVLFGCG--HNI 214
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDS----GRIFFGDQGPA 267
G +G++GL G P LA I FS C D D ++ G+
Sbjct: 215 GQDTDPGHNGILGLNNG----PDSLATK--IGQKFSYCIGDLADPYYNYHQLILGEGADL 268
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLP 317
ST F NG Y + G+ +G L K + I+D+GS+ TFL
Sbjct: 269 EGYSTPFEVHNGFYYVTMEGIS---VGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLV 325
Query: 318 KEVYETIAAEFDRQVNDTI--TSFEGYPW-KCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
V+ ++ E + + T+ E PW +C Y S S+ L P V F ++
Sbjct: 326 DSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALD 385
Query: 375 NPVFVIYGTQVVTGFCLAIQPVD--------GDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
+ F V FC+ + PV IG + Q Y V +D N + +
Sbjct: 386 SGSFFNQLNDNV--FCMTVGPVSSLNLKSKPSLIGLLAQQ---SYSVGYDLVNQFVYFQR 440
Query: 427 SNCQDLNDG 435
+C+ L+ G
Sbjct: 441 IDCELLSGG 449
>gi|298707682|emb|CBJ25999.1| aspartyl protease [Ectocarpus siliculosus]
Length = 547
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 155/379 (40%), Gaps = 53/379 (13%)
Query: 78 GPQFQMLF----PSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYS 132
G F ++ P + S ++ G+ F PC +C C + Y++
Sbjct: 106 GTHFAYIYAGTPPQRASVIINTGSHFSA----FPCSECRSCGNHTDPYWD---------- 151
Query: 133 PSASSTSKHLSCSH-RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 191
PS SST+ ++C C CQ+ K+ C ++YTE +S V+D+L + G+
Sbjct: 152 PSQSSTAHIVTCDETERCHGAYKCQSDKK-C-VLREHYTEGSSWRAKQVDDLLWV---GE 206
Query: 192 NALKNSVQ-------ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI- 243
L +S + GC +G + +A DG++GL ++ + LA AG I
Sbjct: 207 RTLSDSQKHDDSAFSVDFTFGCIESLTGLFKTQLA-DGIMGLNADSRTLITQLATAGKIS 265
Query: 244 RNSFSMCFDKDDSGRIFFGDQGPATQQ---------STSFLASNGKYITYII--GVETCC 292
FS+CF + G + G P + ST +++ +T + GV
Sbjct: 266 ERKFSLCF-SETGGTMVIGGYDPLLNKPGSEMQYTPSTGEISAPTVKVTDVTLNGVSITT 324
Query: 293 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 352
S K T K + SG++ T+LP+ V E +A ++ + + + C ++
Sbjct: 325 DASVFQKGTGIKIV--SGTTNTYLPRAVAEGFSAAWEAATGSPYATCKMNEF--CMTRTT 380
Query: 353 QRLPKLPSVKLMFPQNNSFVVN-NPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGY 411
L LP LM + VN P + + ++ P G +G N + +
Sbjct: 381 VELEALPV--LMIHMDGGVEVNVRPEAYMDASSDEENVYPSLPPPCSMGGVLGANLLRDH 438
Query: 412 RVVFDRENLKLGWSHSNCQ 430
VVFD +N +G++ C
Sbjct: 439 NVVFDYDNHVVGFADGACD 457
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 80/365 (21%), Positives = 135/365 (36%), Gaps = 61/365 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G L WI C+ C+ C YN T + +H G+ C
Sbjct: 128 DTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVSCSDFDRTDTTFTATH-----GSDCN 182
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS----- 211
+ Y + T++ G + L L D+ + ++ VI GCG +
Sbjct: 183 YSQT--------YADKTTTRGTYAREQL-LFETPDDGI--TIMHDVIFGCGHNNTQLPGP 231
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-----GRIFFGDQGP 266
GY GV GLG+ S S+++K G FS C R+ G++
Sbjct: 232 TGYASGV-------FGLGD-SGSSIISKLGF---GFSYCIGNIGDPLYGFHRLTLGNKLK 280
Query: 267 ATQQSTSFLASNGKYITYI---IGVETCCIGSSCLKQTSF-----KAIVDSGSSFTFLPK 318
ST + YIT + IG E I ++ + ++DSG++ +++P+
Sbjct: 281 IEGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATLSYIPR 340
Query: 319 EVYETIAAEFDRQVNDTITSFE--GYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVVN- 374
+ Y + + ++ ++ + CY +Q L P V
Sbjct: 341 QAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGKLNQDLQGFPDATFHLADGADLVFQV 400
Query: 375 NPVFVIYGTQVVTGFCLAIQPVDGD-----IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+F Y V+ CLA+ P + D IG + Q + Y V +D + KL + C
Sbjct: 401 EGLFFQYTDNVL---CLALVPTESDEETCLIGLLAQQY---YNVAYDLKQQKLYFQRIEC 454
Query: 430 QDLND 434
+ L+D
Sbjct: 455 ELLDD 459
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 136/359 (37%), Gaps = 61/359 (16%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C C ++ ++ P SST +SC+ C
Sbjct: 98 DTGSDLIWTQCLPCETCNAAASVIFD----------PVKSSTYDTVSCASNFCS-----S 142
Query: 157 NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
P Q C + Y Y + +S+SG L S + +V GCG G
Sbjct: 143 LPFQSCTTSCKYDYMYGDGSSTSGAL--------STETVTVGTGTIPNVAFGCGHTNLGS 194
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQQ 270
+ G++GLG G +S+ S + + FS C S + + GD A
Sbjct: 195 F---AGAAGIVGLGQGPLSLIS--QASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGV 249
Query: 271 STSFLASN-----------------GKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSF 313
+ + L +N GK +TY +G T I +S Q F I+DSG++
Sbjct: 250 AYTALLTNTANPTFYYADLTGISVSGKAVTYPVG--TFSIDAS--GQGGF--ILDSGTTL 303
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T+L + + A +V Y C+ ++ P P++ F + +
Sbjct: 304 TYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELP 363
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
VFV T CLA+ G +G + +V D N ++G+ +NC+ +
Sbjct: 364 PENVFVALDTG--GSICLAMAASTG-FSIMGNIQQQNHLIVHDLVNQRVGFKEANCETI 419
>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
Length = 406
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/380 (21%), Positives = 144/380 (37%), Gaps = 53/380 (13%)
Query: 80 QFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCV--RCAPLS----ASYYNSLDRDL----- 128
+FQ G L ND G DL W PL+ A Y+ +
Sbjct: 49 EFQTPLMGAGGAGRRLKNDAGEDLFWTQEQVKGGHGVPLTNFMNAQYFTEITLGTPPQNF 108
Query: 129 ---------NEYSPSASSTSKHLSCS-HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGL 178
N + PS+ TS ++C H D S + +++ Y + S G
Sbjct: 109 KVILDTGSSNLWVPSSKCTS--IACFLHAKYDSSASSTYKQNGTEFSIQY--GSGSMEGF 164
Query: 179 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL-- 236
+ +D+L + GD + A + G+ + G DG+ +GLG ISV +
Sbjct: 165 VSQDVLTI---GDLTIPGQDFAEAVKEPGLTFAFGKFDGI-----LGLGYDTISVNHIVP 216
Query: 237 ----LAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVE 289
+ GL+ SF + ++D G FG + + + + + +E
Sbjct: 217 PHYNMINKGLLDEPVFSFRLGKSEEDGGEAIFGGVDKSAYKGDLTYVPVRRKAYWEVELE 276
Query: 290 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 349
GS L+ S A +D+G+S LP ++ E I AE + + W Y+
Sbjct: 277 KISFGSEELELESTGAAIDTGTSLIALPTDMAEMINAEIGAKKS----------WNGQYQ 326
Query: 350 SSSQRLPKLPSVKLMF-PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFM 408
++P LP + L F + + + + + GT + + L I G + IG F+
Sbjct: 327 VECSKVPDLPELSLYFGGKPYTLKGTDYILEVQGTCISSFTGLDINVPGGSLWIIGDVFL 386
Query: 409 TGYRVVFDRENLKLGWSHSN 428
Y V+D +G++ +
Sbjct: 387 RKYYTVYDLGRDAVGFAEAK 406
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQIVEIDTGSSASWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 141/349 (40%), Gaps = 36/349 (10%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
K+ ++ D G D+ W+ C C +C + ++ PS+SST SCS C
Sbjct: 144 KSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD----------PSSSSTYSPFSCSSAAC 193
Query: 150 ----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
G C + + C YT+ Y + +S++G D L L G NA++ G
Sbjct: 194 AQLGQEGNGCSSSQ--CQYTVT-YGDGSSTTGTYSSDTLAL---GSNAVRK-----FQFG 242
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG-DQ 264
C +S G+ D DGL+GLG G S+ S AG +FS C S F
Sbjct: 243 CSNVES-GFND--QTDGLMGLGGGAQSLVS--QTAGTFGAAFSYCLPATSSSSGFLTLGA 297
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPKEV 320
G + T L S+ Y + ++ +G L + F A I+DSG+ T LP
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTIMDSGTVLTRLPPTA 357
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y +++ F + ++ C+ S Q +P+V L+F + + ++
Sbjct: 358 YSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVFSGGAVVDIASDGIML 417
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + A D +G IG + V++D +G+ C
Sbjct: 418 QTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 145/353 (41%), Gaps = 37/353 (10%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K +L D G D+ W +C P + Y + LN PS S++ K++SCS LC
Sbjct: 142 KEFTLIFDTGSDITW-----TQCEPCVKTCYKQKEPRLN---PSTSTSYKNISCSSALCK 193
Query: 151 LGTSCQNPKQPCP-----YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
L S + Q C Y + Y + + S G + L L S N KN + G
Sbjct: 194 LVASGKKFSQSCSSSTCLYQVQ-YGDGSYSIGFFATETLTLSS--SNVFKN-----FLFG 245
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGD 263
CG + + GL+GLG ++++PS AK + FS C S G + G
Sbjct: 246 CGQQNN---GLFGGAAGLLGLGRTKLALPSQTAKT--YKKLFSYCLPASSSSKGYLSLGG 300
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFLPKE 319
Q + + T A Y + + +G L +++F A ++DSG+ T L
Sbjct: 301 QVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPT 360
Query: 320 VYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
Y +++ F + D S GY + CY S ++P V + F ++
Sbjct: 361 AYSELSSAFQNLMTD-YPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSG- 418
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENLKLGWSHSNC 429
++Y + CLA D D T G Y+VV+D ++G++ C
Sbjct: 419 ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 141/373 (37%), Gaps = 80/373 (21%)
Query: 95 LGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LG 152
L D G D+ W+ C C RC YN L SS++ + C C LG
Sbjct: 145 LSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLK----------SSSASDVGCYAPACRALG 194
Query: 153 TS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA-SVIIGCGMK 209
+S C C Y ++Y ++S+ VE + V+ V IGCG
Sbjct: 195 SSGGCVQFLNECQYKVEYGDGSSSAGDFGVETL---------TFPPGVRVPGVAIGCGSD 245
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG----RIFFGDQG 265
G + A G++GLG G +S PS +A G SFS C +G + FG
Sbjct: 246 NQGLFPAPAA--GILGLGRGSLSFPSQIA--GRYGRSFSYCLAGQGTGGRSSTLTFGSGA 301
Query: 266 PATQQSTS------FLASNGKYITYIIGVETCCIGSSCLKQTSFK------------AIV 307
AT +T+ L ++ Y Y +G+ +G ++ + IV
Sbjct: 302 SATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIV 361
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP--------WKCCYKSSSQR-LPKL 358
DSG++ T L Y F + G+P + CY S R + K+
Sbjct: 362 DSGTAVTRLSGPAYAAFRDAFRVAAVKEL----GWPSPGGPFAFFDTCYSSVRGRVMKKV 417
Query: 359 PSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMT 409
P+V + F P N + PV GT C A D + IG +
Sbjct: 418 PAVSMHFAGGVEVKLPPQNYLI---PVDSNKGT-----MCFAFAGSGDRGVSIIGNIQLQ 469
Query: 410 GYRVVFDRENLKL 422
G+RVV+D + ++
Sbjct: 470 GFRVVYDVDGQRV 482
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 138/354 (38%), Gaps = 54/354 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL C
Sbjct: 198 DTGSDTTW-----VQCQPCVVVCYEQREK---LFDPARSSTYANVSCAAPACSDLNIHGC 249
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 250 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNEGLFG 299
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF--FGDQGPATQQS- 271
+ GL+GLG G+ S+P K G + F+ C +G + FG A ++
Sbjct: 300 EAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLDFGAGSLAAARAR 353
Query: 272 --TSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA---IVDSGSSFTFLPKEVYETI 324
T L NG Y +G+ +G L Q+ F IVDSG+ T LP Y ++
Sbjct: 354 LTTPMLTENGPTF-YYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAAYSSL 412
Query: 325 AAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP- 376
R + GY CY + +P+V L+F V+
Sbjct: 413 -----RYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 467
Query: 377 -VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ +QV F A GD+G +G + + V +D +G+ C
Sbjct: 468 IMYAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 150/393 (38%), Gaps = 80/393 (20%)
Query: 98 DFGCDLLWIPC-----DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
D G DL W PC DC+ C +Y N +R + +SPS SS+S SC+ C
Sbjct: 98 DTGSDLTWAPCGNISFDCIECD----NYRN--NRMMASFSPSHSSSSHRDSCTSPFCIDV 151
Query: 153 TSCQNPKQPC-------------------PYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
S NP PC P Y +G L D L + G N
Sbjct: 152 HSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTRDTLRV--HGRNL 209
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 251
GC + Y + P G+ G G G +S+PS L G +R FS CF
Sbjct: 210 GVTQEIPRFCFGC---VASSYRE---PIGIAGFGRGALSLPSQL---GFLRKGFSHCFLA 260
Query: 252 -----DKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQ--T 301
+ + S + GD ++ Q T L S Y +G+E +G+ + +
Sbjct: 261 FKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVSATEVPS 320
Query: 302 SFKA---------IVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYP-WKCCYKS 350
S + +VDSG+++T LP+ Y + + +N T E + CYK
Sbjct: 321 SLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMRTGFDLCYKV 380
Query: 351 SSQRLP-----KLPSVKLMFPQNNSFVVNN-----PVFVIYGTQVVTGFCLAIQPVD--- 397
Q LPS+ F N S V++ + + VV CL Q +D
Sbjct: 381 PCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVK--CLLFQSMDDGD 438
Query: 398 -GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G G +G VV+D E ++G+ +C
Sbjct: 439 YGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 148/374 (39%), Gaps = 74/374 (19%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G DL+W C + R+ Y P+ SS+ C RLC+ G+
Sbjct: 107 DTGSDLIWTQCKL---------FDTRQHREKPLYDPAKSSSFAAAPCDGRLCETGSFNTK 157
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
+C K C YT +Y + T G L + G++ V S+ GCG K + G
Sbjct: 158 NCSRNK--CIYTYNYGSATT--KGELASETFTF---GEH---RRVSVSLDFGCG-KLTSG 206
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSGRIFFGDQGPATQ 269
L G + G++G+ +S+ S L FS C D++ + IFFG ++
Sbjct: 207 SLPGAS--GILGISPDRLSLVSQLQIP-----RFSYCLTPFLDRNTTSHIFFGAMADLSK 259
Query: 270 -------QSTSFL----ASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVD 308
Q+TS + SN Y +IG+ +G+ L + S VD
Sbjct: 260 YRTTGPIQTTSLVTNPDGSNYYYYVPLIGIS---VGTKRLNVPVSSFAIGRDGSGGTFVD 316
Query: 309 SGSSFTFLPKEVYETIAAEFDRQVNDTITSF--EGYPWKCCYK------SSSQRLPKLPS 360
SG + LP V E + V + + GY ++ C++ + + ++P
Sbjct: 317 SGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPP 376
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTG-FCLAIQPVDGDIGTIGQNF-MTGYRVVFDRE 418
+ F + ++ +++ +V G CL I G G I N+ V+FD E
Sbjct: 377 LVYHFDGGAAMLLRRDSYMV---EVSAGRMCLVIS--SGARGAIIGNYQQQNMHVLFDVE 431
Query: 419 NLKLGWSHSNCQDL 432
N + ++ + C +
Sbjct: 432 NHEFSFAPTQCNQI 445
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 130/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVVNNP-VFVIYGTQVVTGFCLAIQPVD 397
F + + VFV Q +CLA P +
Sbjct: 285 GARFDLGSRGVFVERSVQEQDVWCLAFAPTE 315
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 96/390 (24%), Positives = 148/390 (37%), Gaps = 61/390 (15%)
Query: 67 SSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLD 125
SS V +G F L + + + D G D++W+ C C +C S +N
Sbjct: 97 SSVVSGLSQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFN--- 153
Query: 126 RDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 183
P S + + CS LC + C + C Y + Y + ++ E +
Sbjct: 154 -------PYKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETL 206
Query: 184 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL- 242
+ + A V +GCG G + V GL+GLG G +S PS + G+
Sbjct: 207 ---------TFRGNKIAKVALGCGHHNEGLF---VGAAGLLGLGRGRLSFPS---QTGIR 251
Query: 243 IRNSFSMCF-DKDDSGR---IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSC 297
+ FS C D+ S + + FGD + + L N K T Y +G+ +G
Sbjct: 252 FNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVR 311
Query: 298 LKQTS---FK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 346
++ S FK I+DSG+S T L + Y + F E +
Sbjct: 312 VRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDT 371
Query: 347 CYKSSSQRLPKLPSVKLMF-------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD 399
CY S Q K+P+V L F P N + PV FC A
Sbjct: 372 CYDLSGQSSVKVPTVVLHFRGADMALPATNYLI---PV------DENGSFCFAFAGTISG 422
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ IG G+RVV+D ++G++ C
Sbjct: 423 LSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 91/418 (21%), Positives = 154/418 (36%), Gaps = 48/418 (11%)
Query: 29 TKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV-QKQKMKTGPQFQMLFPS 87
T+ R + K + + A P Y + SDV + +G F +
Sbjct: 87 TRFNARMQRDTKRVAALRRHLAAGKPT-----YAEEAFGSDVVSGMEQGSGEYFVRIGVG 141
Query: 88 QGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+ + D G D++W+ C+ C +C S +N P+ SS+ +SC+
Sbjct: 142 SPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFN----------PADSSSYAGVSCAS 191
Query: 147 RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
+C + + C Y + Y + + + G L L ++ G ++N V IGC
Sbjct: 192 TVCSHVDNAGCHEGRCRYEVSY-GDGSYTKGTLA---LETLTFGRTLIRN-----VAIGC 242
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEIS-VPSLLAKAGLIRNSFSMCFDK---DDSGRIFFG 262
G G + V GL+GLG G +S V L +AG +FS C SG + FG
Sbjct: 243 GHHNQGMF---VGAAGLLGLGSGPMSFVGQLGGQAG---GTFSYCLVSRGIQSSGLLQFG 296
Query: 263 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC---LKQTSFK--------AIVDSGS 311
+ + L N + ++ + + + FK ++D+G+
Sbjct: 297 REAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGT 356
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
+ T LP YE F Q + + + CY ++P+V F
Sbjct: 357 AVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPIL 416
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ F+I V FC A P + IG G + D N +G+ + C
Sbjct: 417 TLPARNFLI-PVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G + W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSISWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVVNNP-VFVIYGTQVVTGFCLAIQPVD 397
F + + VFV Q +CLA P +
Sbjct: 285 GARFDLGSSGVFVERSVQEQDVWCLAFAPTE 315
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 83/350 (23%), Positives = 140/350 (40%), Gaps = 49/350 (14%)
Query: 98 DFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS 154
D G D W+ C V+C ++ P+ SST ++SC+ C DL T+
Sbjct: 181 DTGSDTTWVQCRPCVVKCYKQKGPLFD----------PAKSSTYANVSCTDSACADLDTN 230
Query: 155 -CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C C Y + Y + + + G +D L + +A+K GCG K +G
Sbjct: 231 GCTGGH--CLYAVQY-GDGSYTVGFFAQDTLTIA---HDAIKG-----FRFGCGEKNNGL 279
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS-- 271
+ GL+GLG G+ S+ + +F+ C +G + D GP + +
Sbjct: 280 FGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYCLPALTTGTGYL-DFGPGSAGNNA 333
Query: 272 --TSFLASNGKYITYI------IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYET 323
T L G+ Y+ +G + + S ++ +VDSG+ T LP Y
Sbjct: 334 RLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF--STAGTLVDSGTVITRLPATAYTA 391
Query: 324 IAAEFDR-QVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN--NPVFV 379
+++ FD+ + GY CY + +LP+V L+F V+ V+
Sbjct: 392 LSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYA 451
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I QV F A D + +G Y V++D +G++ +C
Sbjct: 452 ISEAQVCLAF--ASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 11/156 (7%)
Query: 284 YIIGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVYETIAAEFDRQVN 333
Y +G+ +G L +TSF+ IVDSG++ T L +VY + F +
Sbjct: 11 YYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFVKGTK 70
Query: 334 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
D + + E + CY SS+ ++P+V F + V+ +++ V T FC A
Sbjct: 71 DLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSVGT-FCFAF 129
Query: 394 QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
P + IG G RV FD N +G+S + C
Sbjct: 130 APTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|323303886|gb|EGA57667.1| Yps1p [Saccharomyces cerevisiae FostersB]
Length = 569
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 107/245 (43%), Gaps = 55/245 (22%)
Query: 222 GLIGLGLGEISV-------------------PSLLAKAGLIR-NSFSMCFDKDDS--GRI 259
G++G+GL E+ V P +L +G I+ N++S+ + D+ G I
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308
Query: 260 FFG--DQGPATQQ----------STSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA 305
FG D T S S +S ++ I G+ GSS L T A
Sbjct: 309 LFGAVDHSKYTGTLYTIPIVNTLSASXFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
++DSG++ T+LP+ V IA E Q + I GY C P S++++F
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQYSSRI----GYYVLDC--------PSDDSMEIVF 416
Query: 366 PQNNSFVVNNPV--FVIYGTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKL 422
F +N P+ F++ T L I P D GTI G +F+T VV+D ENL++
Sbjct: 417 -DFGGFHINAPLSSFIL---STGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEI 472
Query: 423 GWSHS 427
+ +
Sbjct: 473 SMAQA 477
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 95/349 (27%), Positives = 148/349 (42%), Gaps = 48/349 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTS 154
D G D+ WI +CAP S Y S + P +S++ + C C DL +
Sbjct: 167 DTGSDVSWI-----QCAPCSECYQQSDPI----FDPVSSNSYSPIRCDAPQCKSLDL-SE 216
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + G + + L G A++N V IGCG G +
Sbjct: 217 CRNGT--CLYEVSY-GDGSYTVGEFATETVTL---GTAAVEN-----VAIGCGHNNEGLF 265
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQS 271
V GL+GLG G++S P A + SFS C D D + F P
Sbjct: 266 ---VGAAGLLGLGGGKLSFP-----AQVNATSFSYCLVNRDSDAVSTLEFNSPLP-RNVV 316
Query: 272 TSFLASNGKYIT-YIIGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEV 320
T+ L N + T Y +G++ +G L ++ F+ I+DSG++ T L EV
Sbjct: 317 TAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEV 376
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 380
Y+ + F + + + CY SS+ ++P+V FP+ + ++I
Sbjct: 377 YDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLI 436
Query: 381 YGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V T FC A P + +G G RV FD N +G+S +C
Sbjct: 437 PVDSVGT-FCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 146/352 (41%), Gaps = 48/352 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G D++W+ C C C Y+ D N P S + + C LC L +
Sbjct: 147 DTGSDIVWLQCAPCKNC-------YSQTDPVFN---PVKSGSFAKVLCRTPLCRRLESPG 196
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
N +Q C Y + Y + + ++G V + L + + V +GCG G +
Sbjct: 197 CNQRQTCLYQVSY-GDGSYTTGEFVTETL--------TFRRTKVEQVALGCGHDNEGLF- 246
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRN-SFSMCF-DKDDSGR---IFFGDQGPATQQ 270
V GL+GLG G +S PS +AG N FS C D+ S + + FG+ +
Sbjct: 247 --VGAAGLLGLGRGGLSFPS---QAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTA 301
Query: 271 STSFLASNGK----YITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPK 318
+ L +N + Y ++G+ S + + FK I+D G+S T L K
Sbjct: 302 RFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNK 361
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN-SFVVNNPV 377
Y + F + ++ E + CY S + K+P+V L F + S +N +
Sbjct: 362 PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYL 421
Query: 378 FVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G+ FC A + IG G+RVV+D + ++G+S C
Sbjct: 422 IPVDGSGR---FCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 141/357 (39%), Gaps = 50/357 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++ MS+ D G D+ W V+CAP +A +S L + P+ S+T SC C
Sbjct: 141 TQVMSI--DTGSDVSW-----VQCAPCAAQSCSSQKDKL--FDPAMSATYSAFSCGSAQC 191
Query: 150 ----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
D G C K C Y + Y + ++++G D L L S +A+K S G
Sbjct: 192 AQLGDEGNGCL--KSQCQYIVKY-GDGSNTAGTYGSDTLSLTS--SDAVK-----SFQFG 241
Query: 206 CGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS---GRIF 260
C + +G G LDG+ +GLG + + A +FS C S G +
Sbjct: 242 CSHRAAGFVGELDGL-------MGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLT 294
Query: 261 FGDQGPATQQSTSFLASNGKYITYIIGV--ETCCIGSSCLKQT----SFKAIVDSGSSFT 314
G G A+ S + GV + + + L S ++VDSG+ T
Sbjct: 295 LGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASVVDSGTVIT 354
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
LP Y+ + F +++ ++ C+ S +P+V L F + + ++
Sbjct: 355 QLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAMDLD 414
Query: 375 NPVFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G CLA DGD G +G + ++FD +G+ C
Sbjct: 415 ISGILYAG-------CLAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 151/383 (39%), Gaps = 70/383 (18%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLD-----------RDLNEYSPSASSTSKHLSCSH 146
D DL WI C R S+ R N Y P+ SS+ + + CS
Sbjct: 145 DTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQ 204
Query: 147 RLCDL--GTSCQNP--KQPCPYTMDYYTENTSSSGLL-VEDILHLISGGDNALKNSVQAS 201
+ C L +CQ+P + C Y + T + G+ E +S G + +
Sbjct: 205 KECALLPYNTCQSPSKAESCSYYQQM-QDGTLTMGIYGKEKATVTVSDG----RMAKLPG 259
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 256
+I+GC + ++GG +D A DG++ LG GE+S AK FS C +D S
Sbjct: 260 LILGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQRFSFCLLSANSSRDAS 315
Query: 257 GRIFFGDQ----GPATQQS-----TSFLASNGKYITYI-IGVETCCIGSSCL---KQTSF 303
+ FG GP T ++ + G +T I +G E I K
Sbjct: 316 SYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGG 375
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----------SSSQ 353
I+D+ +S T L E Y + + DR ++ +E ++ CY+ + +
Sbjct: 376 GVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRWTFAGDGVDLTHNV 435
Query: 354 RLPKLPSVKL-----MFPQNNSFVVNNPVFVIYGTQVVTGF-CLAIQPV-DGDIGTIGQN 406
+P+L +V++ + P+ S V+ +VV G CLA + + G G +G
Sbjct: 436 TVPRL-TVEMAGGARLEPEAKSVVM---------PEVVPGVACLAFRKLPRGGPGILGNV 485
Query: 407 FMTGYRVVFDRENLKLGWSHSNC 429
M Y D K+ + C
Sbjct: 486 LMQEYIWEIDHGKGKMRFRKDKC 508
>gi|308810200|ref|XP_003082409.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116060877|emb|CAL57355.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 455
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 132/293 (45%), Gaps = 45/293 (15%)
Query: 169 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG-MKQSGGYLDGVAPDGLIGLG 227
Y +N+++ G++VED++ + GD A +I GCG + ++ G D DG+ G G
Sbjct: 112 YMDNSTAIGVMVEDVMTV---GDEL----AGAKMIFGCGCLVEANGEADRY--DGMAGFG 162
Query: 228 LGEISVPSLLAKAGLIR-NSFSMCFDKDDS-------GRIFFG-DQGPATQQSTSFLASN 278
GE + + LA+ G+I + F C + + GR FG D P + T L +
Sbjct: 163 RGETTFHTQLARTGVIDADVFGFCSEGAGTNTAMLSLGRYDFGRDLSPLSW--TRMLGDD 220
Query: 279 G---KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAE-FDRQVN- 333
+ +++ +G + T+ ++DSG++ LP +Y E DR V+
Sbjct: 221 DLAVRTMSWKLGAKIIA------GSTNVYTVLDSGTTLVVLPPVMYGDFMKELLDRIVDL 274
Query: 334 ----DTITSFEGYPWKC-CYKSSSQRLPK------LPSVKLMFPQNNSFVVNNPVFVIYG 382
+ FE Y + C+ S S L LP + + + + + V+ ++
Sbjct: 275 NATYSDVHVFEDYSFSTFCFYSKSGALTNDIIRDALPKLTITYDPDIALVLPPENYLFSS 334
Query: 383 TQVVTGFCLAI-QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
V C+ I + +G I +GQ + V +D EN ++G + ++C++L +
Sbjct: 335 WIVPREHCIGIMKGAEGQI-ILGQQTLRNTFVEYDLENERIGLAVTHCENLRE 386
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 108/257 (42%), Gaps = 42/257 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ C +D+ + P+ S+T + L C+ C+
Sbjct: 108 DTGSDLIWTQCAPCLLC----------VDQPTPYFDPARSATYRSLGCASPACNALYYPL 157
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
++ C Y +Y ++ S++G+L + G N + S+ + GCG +G +
Sbjct: 158 CYQKVCVYQY-FYGDSASTAGVLANETFTF---GTNETRVSLPG-ISFGCGNLNAGSLAN 212
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG--------DQGPA 267
G G++G G G +S L+++ G R S+ + F R++FG +
Sbjct: 213 G---SGMVGFGRGSLS---LVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSE 266
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCL-----------KQTSFKAIVDSGSSFTFL 316
QST F+ + Y + + +G L + I+DSG++ T+L
Sbjct: 267 PVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYL 326
Query: 317 PKEVYETIAAEFDRQVN 333
+ Y+ + A F Q+
Sbjct: 327 AEPAYDAVRAAFASQIT 343
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 100/414 (24%), Positives = 159/414 (38%), Gaps = 85/414 (20%)
Query: 90 SKTMSLGNDFGCDLLWIPC---DCVRCA--PLSASYYNSLDRDLNEYSPSASSTSKHLSC 144
S+ +SL D G DL+W PC +C+ C +AS ++ L++ + S S S
Sbjct: 90 SQPISLYLDTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTATPVSCKSSACSA 149
Query: 145 SHR------LCDLG---------TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 189
H LC + + C+ K CP Y + + + L + I +S
Sbjct: 150 VHSNLPSSDLCAISNCPLESIEISDCR--KHSCPQFYYAYGDGSLIARLYRDSIRLPLSN 207
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFS 248
N + N+ GC + P G+ G G G +S+P+ LA + + N FS
Sbjct: 208 QTNLIFNNF----TFGCA------HTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFS 257
Query: 249 MC---------------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIG 287
C +D D+ R G + P+ TS L + Y +G
Sbjct: 258 YCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVY-TSMLDNPRHPYFYCVG 316
Query: 288 VETCCIGSSCLKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDR---QVND 334
+E IG + F +VDSG++FT LP +Y+ + AEF+ +VN+
Sbjct: 317 LEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNE 376
Query: 335 TITSF-EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY--------GTQV 385
+ E CY + + +P V L F N S VV Y +
Sbjct: 377 RASVIEENTGLSPCYYFDNNVV-NVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKK 435
Query: 386 VTGFCLAI-------QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
CL + + G T+G G+ VV+D EN ++G++ C L
Sbjct: 436 RKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQCASL 489
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 151/383 (39%), Gaps = 70/383 (18%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLD-----------RDLNEYSPSASSTSKHLSCSH 146
D DL WI C R S+ R N Y P+ SS+ + + CS
Sbjct: 145 DTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQ 204
Query: 147 RLCDL--GTSCQNP--KQPCPYTMDYYTENTSSSGLL-VEDILHLISGGDNALKNSVQAS 201
+ C L +CQ+P + C Y + T + G+ E +S G + +
Sbjct: 205 KECALLPYNTCQSPSKAESCSYYQQM-QDGTLTMGIYGKEKATVTVSDG----RMAKLPG 259
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 256
+I+GC + ++GG +D A DG++ LG GE+S AK FS C +D S
Sbjct: 260 LILGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQRFSFCLLSANSSRDAS 315
Query: 257 GRIFFGDQ----GPATQQS-----TSFLASNGKYITYI-IGVETCCIGSSCL---KQTSF 303
+ FG GP T ++ + G +T I +G E I K
Sbjct: 316 SYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGG 375
Query: 304 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----------SSSQ 353
I+D+ +S T L E Y + + DR ++ +E ++ CY+ + +
Sbjct: 376 GVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRWTFAGDGVDLAHNV 435
Query: 354 RLPKLPSVKL-----MFPQNNSFVVNNPVFVIYGTQVVTGF-CLAIQPV-DGDIGTIGQN 406
+P+L +V++ + P+ S V+ +VV G CLA + + G G +G
Sbjct: 436 TVPRL-TVEMAGGARLEPEAKSVVM---------PEVVPGVACLAFRKLPRGGPGILGNV 485
Query: 407 FMTGYRVVFDRENLKLGWSHSNC 429
M Y D K+ + C
Sbjct: 486 LMQEYIWEIDHGKGKMRFRKDKC 508
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 142/372 (38%), Gaps = 74/372 (19%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G DL+W C CV CA D+ + P+ S+T + + C LC L
Sbjct: 110 DTGSDLIWTQCAPCVLCA----------DQPTPYFRPARSATYRLVPCRSPLCAALPYPA 159
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y YY + S++G+L + G N+ K V + V GCG SG
Sbjct: 160 CFQRSVCVYQY-YYGDEASTAGVLASETFTF--GAANSSKVMV-SDVAFGCGNINSGQLA 215
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG----------DQ 264
+ G++GLG G +S L+++ G R S+ + F + R+ FG
Sbjct: 216 NS---SGMVGLGRGPLS---LVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASS 269
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----------KAIVDSGSSFT 314
+ QST + + Y + ++ +G L +DSG+S T
Sbjct: 270 SGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLT 329
Query: 315 FLPKEVYETIAAEFDRQV------NDTITSFEG-YPWKCCYKSSSQRLPKLPSVKLMFPQ 367
+L ++ Y+ + E + NDT E +PW P PSV + P
Sbjct: 330 WLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPW-----------PPPPSVAVTVPD 378
Query: 368 --------NNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
N V +I G TGF CLA+ GD IG +++D
Sbjct: 379 MELHFDGGANMTVPPENYMLIDGA---TGFLCLAMI-RSGDATIIGNYQQQNMHILYDIA 434
Query: 419 NLKLGWSHSNCQ 430
N L + + C
Sbjct: 435 NSLLSFVPAPCN 446
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 134/355 (37%), Gaps = 63/355 (17%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C C A ++ PS SST K C+ G SC
Sbjct: 79 DTGSDLIWTQCMPCTNCYSQYAPIFD----------PSNSSTFKEKRCN------GNSCH 122
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
Y + Y S L E + +H SG + V IGCG S
Sbjct: 123 -------YKIIYADTTYSKGTLATETVTIHSTSG-----EPFVMPETTIGCGHNSSW--- 167
Query: 216 DGVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ--- 270
P G++GL G S+ + G S CF + +I FG
Sbjct: 168 --FKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVV 223
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGSSFTFLPKEVYET 323
ST+ + K Y + ++ +G + ++ T+F A I+DSG++ T+ P
Sbjct: 224 STTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVSYCNL 283
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGT 383
+ D V T+ CY + + + P + + F V++ + +Y
Sbjct: 284 VREAVDHYVTAVRTADPTGNDMLCYYTDTIDI--FPVITMHFSGGADLVLDK--YNMYIE 339
Query: 384 QVVTG-FCLAI----QPVDGDIGTIGQ-NFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ G FCLAI P D G Q NF+ GY D +L + +S +NC L
Sbjct: 340 TITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGY----DSSSLLVSFSPTNCSAL 390
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 145/353 (41%), Gaps = 50/353 (14%)
Query: 95 LGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG- 152
L D D WIPC C C SA+ ++ P++S++ + + C LC
Sbjct: 127 LAVDTSNDASWIPCAGCAGCPTSSAAPFD----------PASSASYRTVPCGSPLCAQAP 176
Query: 153 -TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+C + C +++ Y ++S L +D L + NA+K + GC + +
Sbjct: 177 NAACPPGGKACGFSLTY--ADSSLQAALSQDSLAV---AGNAVK-----AYTFGCLQRAT 226
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQG-P 266
G P GL+GLG G +S L + +FS C + SG + G G P
Sbjct: 227 G---TAAPPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQP 281
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------AIVDSGSSFTFLPKEV 320
++T LA+ + Y + + +G + +F ++DSG+ FT L
Sbjct: 282 QRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPA 341
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP----SVKLMFPQNNSFVVNNP 376
Y + E R+V ++S G+ C+ +++ P + +++ P+ N + +
Sbjct: 342 YVAVRDEVRRRVGAPVSSLGGF--DTCFNTTAVAWPPVTLLFDGMQVTLPEENVVIHST- 398
Query: 377 VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
YGT A V+ + I +RV+FD N ++G++ C
Sbjct: 399 ----YGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 155/397 (39%), Gaps = 44/397 (11%)
Query: 60 EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN--DFGCDLLWIPCD-CVRCAPL 116
Y+ + SSD ++++G ++ + G+ + D G DL W C C C P
Sbjct: 71 RYFTMSTSSDAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQ 130
Query: 117 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 176
Y++ P AS+T + S +C PC Y Y + S+
Sbjct: 131 DTPIYDTAVSSSFSPVPCASATCLPIWSSR-------NCTASSSPCRYRYAY-GDGAYSA 182
Query: 177 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 236
G+L + L A SV + GCG+ G + G +GLG G +S L
Sbjct: 183 GVLGTETLTF----PGAPGVSV-GGIAFGCGVDNGGLSYNST---GTVGLGRGSLS---L 231
Query: 237 LAKAGLIRNSFSMC--FDKDDSGRIFFGD----QGPATQ---QSTSFLASNGKYITYIIG 287
+A+ G+ + S+ + F+ + FG P+T QST + S Y +
Sbjct: 232 VAQLGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVS 291
Query: 288 VETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 337
+E +G + L S IVDSG++FTFL + + + + +
Sbjct: 292 LEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLRQPVV 351
Query: 338 SFEGYPWKCC-YKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQ-P 395
+ C + Q+LP +P + L F ++ ++ + Q + FCL I
Sbjct: 352 NASSLDSPCFPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSF-NQEESSFCLNIAGS 410
Query: 396 VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
D+ +G +++FD +L + ++C L
Sbjct: 411 PSADVSILGNFQQQNIQMLFDITVGQLSFMPTDCGKL 447
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 149/366 (40%), Gaps = 60/366 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG--TSC 155
D G +L W+ C + P S +N L + Y+P+ ++S C+ R DL SC
Sbjct: 78 DTGSELSWLHC---KKLPNLNSTFNPLLS--SSYTPTPCNSSI---CTTRTRDLTIPASC 129
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+P + + Y + +S+ G L + +L + Q + GC S GY
Sbjct: 130 -DPNNKLCHVIVSYADASSAEGTLAAETF--------SLAGAAQPGTLFGC--MDSAGYT 178
Query: 216 DGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGD--QGPAT 268
+ D GL+G+ G +S L+ + L + FS C +D+ G + GD P+
Sbjct: 179 SDINEDSKTTGLMGMNRGSLS---LVTQMSLPK--FSYCISGEDALGVLLLGDGTDAPSP 233
Query: 269 QQSTSFLASNG-----KYITYIIGVETCCIGSSCLK--QTSF--------KAIVDSGSSF 313
Q T + + + Y + +E + L+ ++ F + +VDSG+ F
Sbjct: 234 LQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQF 293
Query: 314 TFLPKEVYETIAAEFDRQVNDTITS-------FEGYPWKCCYKSSSQRLPKLPSVKLMFP 366
TFL VY ++ EF Q +T FEG CY + + +P+V L+F
Sbjct: 294 TFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEG-AMDLCYHAPAS-FAAVPAVTLVFS 351
Query: 367 QNNSFVVNNPVF--VIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLG 423
V + V G+ V F + G + IG + + FD ++G
Sbjct: 352 GAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRVG 411
Query: 424 WSHSNC 429
++ + C
Sbjct: 412 FTQTTC 417
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 149/370 (40%), Gaps = 47/370 (12%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F + + SKT + D G D+ W+ C C C Y +D + P++
Sbjct: 157 SGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDC-------YQQVD---PIFDPAS 206
Query: 136 SSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
SS+ L C C +L +C+N C Y + Y + + E + SG +
Sbjct: 207 SSSFSRLGCQTPQCRNLDVFACRN--DSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDK 264
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 251
V IGCG G + V GLIGLG G +S+ S + + SFS C
Sbjct: 265 --------VAIGCGHDNEGLF---VGAAGLIGLGGGPLSLTSQIKAS-----SFSYCLVN 308
Query: 252 -DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--- 305
D DS + F P+ + ++ Y +G+ +G L + F+
Sbjct: 309 RDSVDSSTLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGS 368
Query: 306 -----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLP 359
IVD G++ T L + Y + F + D + S G+ + CY SS+ ++P
Sbjct: 369 GKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKD-LPSTSGFALFDTCYNLSSRTSVRVP 427
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
+V +F S + ++I T FCLA P + IG G RV +D N
Sbjct: 428 TVAFLFDGGKSLPLPPSNYLIPVDSAGT-FCLAFAPTTASLSIIGNVQQQGTRVTYDLAN 486
Query: 420 LKLGWSHSNC 429
++ +S C
Sbjct: 487 SQVSFSSRKC 496
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 95/392 (24%), Positives = 151/392 (38%), Gaps = 77/392 (19%)
Query: 98 DFGCDLLWIPC-----DCVRC-----APLSASYYNSLDRDLNEYSPSA-------SSTSK 140
D G DL W+PC DC+ C L A++ S S ++ SS +
Sbjct: 100 DTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSYRASCASPFCIDIHSSDNP 159
Query: 141 HLSCSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
+C+ C L T + +PCP Y +G+L D L ++G + +
Sbjct: 160 LDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTRDTLR-VNGSSPGVAKEI- 217
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-------D 252
GC Y + P G+ G G G + S++++ G ++ FS CF +
Sbjct: 218 PKFCFGC---VGSAYRE---PIGIAGFGRGTL---SMVSQLGFLQKGFSHCFLAFKYANN 268
Query: 253 KDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQT-----SFK 304
+ S + GD ++ Q T L S Y +G+E +G+ + F
Sbjct: 269 PNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNVSATEVPSSLREFD 328
Query: 305 AI------VDSGSSFTFLPKEVYETIAAEFDRQVN---DTITSFEGYPWKCCYK------ 349
++ +DSG+++T LP+ Y + + +N DT + + CYK
Sbjct: 329 SLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQT-GFDLCYKVPRPNN 387
Query: 350 ---SSSQRLPK-----LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV----D 397
+S LP L +V L+ PQ N F PV VV CL Q D
Sbjct: 388 NTLTSDDLLPSITFHFLNNVSLVLPQGNHFY---PVSAPGNPAVVK--CLMFQSTDDGDD 442
Query: 398 GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G G G VV+D E ++G+ +C
Sbjct: 443 GPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474
>gi|190406152|gb|EDV09419.1| aspartic proteinase 3 precursor [Saccharomyces cerevisiae RM11-1a]
gi|207343057|gb|EDZ70636.1| YLR120Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 569
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 109/245 (44%), Gaps = 55/245 (22%)
Query: 222 GLIGLGLGEISV-------------------PSLLAKAGLIR-NSFSMCFDKDDS--GRI 259
G++G+GL E+ V P +L +G I+ N++S+ + D+ G I
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308
Query: 260 FFGDQGPATQQSTSF-------LASNG-----KYITYIIGVETCCIGSS--CLKQTSFKA 305
FG + T + L+++G ++ I G+ GSS L T A
Sbjct: 309 LFGAVDHSKYTGTLYTIPIVNTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
++DSG++ T+LP+ V IA E Q + I GY C P S++++F
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQYSSRI----GYYVLDC--------PSDDSMEIVF 416
Query: 366 PQNNSFVVNNPV--FVIYGTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKL 422
F +N P+ F++ T L I P D GTI G +F+T VV+D ENL++
Sbjct: 417 -DFGGFHINAPLSSFIL---STGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEI 472
Query: 423 GWSHS 427
+ +
Sbjct: 473 SMAQA 477
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 142/372 (38%), Gaps = 74/372 (19%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G DL+W C CV CA D+ + P+ S+T + + C LC L
Sbjct: 110 DTGSDLIWTQCAPCVLCA----------DQPTPYFRPARSATYRLVPCRSPLCAALPYPA 159
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ C Y YY + S++G+L + G N+ K V + V GCG SG
Sbjct: 160 CFQRSVCVYQY-YYGDEASTAGVLASETFTF--GAANSSKVMV-SDVAFGCGNINSGQLA 215
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG----------DQ 264
+ G++GLG G +S L+++ G R S+ + F + R+ FG
Sbjct: 216 NS---SGMVGLGRGPLS---LVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASS 269
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----------KAIVDSGSSFT 314
+ QST + + Y + ++ +G L +DSG+S T
Sbjct: 270 SGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLT 329
Query: 315 FLPKEVYETIAAEFDRQV------NDTITSFEG-YPWKCCYKSSSQRLPKLPSVKLMFPQ 367
+L ++ Y+ + E + NDT E +PW P PSV + P
Sbjct: 330 WLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPW-----------PPPPSVAVTVPD 378
Query: 368 --------NNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDIGTIGQNFMTGYRVVFDRE 418
N V +I G TGF CLA+ GD IG +++D
Sbjct: 379 MELHFDGGANMTVPPENYMLIDGA---TGFLCLAMI-RSGDATIIGNYQQQNMHILYDIA 434
Query: 419 NLKLGWSHSNCQ 430
N L + + C
Sbjct: 435 NSLLSFVPAPCN 446
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 109/478 (22%), Positives = 188/478 (39%), Gaps = 92/478 (19%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEE--------------VKALGVSKNRN 49
+ LTI + + +++ G FS +++HR+S E + + +SK R
Sbjct: 10 VYLTILSLIHFAISKPDG-----FSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRA 64
Query: 50 ---ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWI 106
A + + S E +++ +S D T +++ S G + L D G L W
Sbjct: 65 HNLAITTSSGFSPEAFRLRISQD------DTCYLVKVIIGSPGVP-LYLVPDTGSGLFWT 117
Query: 107 PCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ----P 161
C+ C R +NS +AS T + L C H+ C T+ QN Q
Sbjct: 118 QCEPCTRRFRQLPPIFNS----------TASRTYRDLPCQHQFC---TNNQNVFQCRDDK 164
Query: 162 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD 221
C Y + Y ++++G+ +DIL S ++ + GC +
Sbjct: 165 CVYRIAY-AGGSATAGVAAQDILQ--SAENDRIP------FYFGCSRDNQNFSTFESSGK 215
Query: 222 GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD-------SGRIFFGDQGPATQQ---S 271
G +GL V L + +N FS C + D + + FG+ +++ S
Sbjct: 216 GGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLS 275
Query: 272 TSFLASNG--KYITYIIGVETCC------IGSSCLK-QTSFKAIVDSGSSFTFLPKEVYE 322
T F++ G Y +I V G+ LK + I+DSG++ T++ + Y
Sbjct: 276 TPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYF 335
Query: 323 TIAAEFD--------RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 374
+ F ++VN ++ + CYK PS+ F + FV
Sbjct: 336 PVITAFKNYFDQHGFQRVNIQLSGY------ICYKQQGHTFHNYPSMAFHFQGADFFV-- 387
Query: 375 NPVFVIYGTQVVTGFCLAIQPVDGDIGT-IGQNFMTGYRVVFDRENLKLGWSHSNCQD 431
P +V Q FC+A+QP+ T IG + ++D N +L ++ NCQD
Sbjct: 388 EPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFTPENCQD 445
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTTWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVVNNP-VFVIYGTQVVTGFCLAIQPVD 397
F + + VFV Q +CLA P +
Sbjct: 285 GARFDLGSRGVFVERSVQEQDVWCLAFAPTE 315
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + + VFV Q +CLA P +
Sbjct: 285 GARFDLGRHGVFVERSVQEQDVWCLAFAPTE 315
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 147/355 (41%), Gaps = 45/355 (12%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++TMS+ D G D+ W+ C C +C S +SL + PS+SST SCS
Sbjct: 134 TQTMSM--DTGSDVSWVQCKPCSQCH----SEVDSL------FDPSSSSTYSPFSCSSAP 181
Query: 149 C------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 202
C G C + + C Y ++Y ++++ + + L +S
Sbjct: 182 CAQLSQSQEGNGCMSSQ--CQYIVNYGDSSSTTGTYSSDTL---------TLGSSAMTDF 230
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDSGRIF 260
GC +SGG+ D DGL+GLG G S+ S AG +FS C SG +
Sbjct: 231 QFGCSQSESGGFNDQT--DGLMGLGGGAQSLAS--QTAGTFGTAFSYCLPPTSGSSGFLT 286
Query: 261 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFL 316
G G + T L S Y++ +E+ +GS L + F A ++DSG+ T L
Sbjct: 287 LG-TGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTIITRL 345
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
P Y +++ F + + C+ S Q +P+V L+F + +
Sbjct: 346 PPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAFD 405
Query: 377 VFVIYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ + + CLA P D +G IG + V++D +G+ C
Sbjct: 406 GIMLEISSSIR--CLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|323308128|gb|EGA61381.1| Yps1p [Saccharomyces cerevisiae FostersO]
Length = 569
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 109/245 (44%), Gaps = 55/245 (22%)
Query: 222 GLIGLGLGEISV-------------------PSLLAKAGLIR-NSFSMCFDKDDS--GRI 259
G++G+GL E+ V P +L +G I+ N++S+ + D+ G I
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308
Query: 260 FFGDQGPATQQSTSF-------LASNG-----KYITYIIGVETCCIGSS--CLKQTSFKA 305
FG + T + L+++G ++ I G+ GSS L T A
Sbjct: 309 LFGAVDHSKYTGTLYTISIVNTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
++DSG++ T+LP+ V IA E Q + I GY C P S++++F
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQYSSRI----GYYVLDC--------PSDDSMEIVF 416
Query: 366 PQNNSFVVNNPV--FVIYGTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKL 422
F +N P+ F++ T L I P D GTI G +F+T VV+D ENL++
Sbjct: 417 -DFGGFHINAPLSSFIL---STGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEI 472
Query: 423 GWSHS 427
+ +
Sbjct: 473 SMAQA 477
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 80/180 (44%), Gaps = 19/180 (10%)
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C N K C Y+ Y E +SS G +VED + ++ GC ++G
Sbjct: 2 CNNEK--CYYSRTY-AERSSSEGWMVEDAFGFP-------DDQPPVRMVFGCENGETGEI 51
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF 274
+A DG++G+G + S L G+I + FS+CF G + GD +T +
Sbjct: 52 YRQLA-DGIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVY 110
Query: 275 --LASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAA 326
L +N Y + ++ + L + + ++DSG++FT+LP E + +AA
Sbjct: 111 TPLLNNLHLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAA 170
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/382 (23%), Positives = 150/382 (39%), Gaps = 66/382 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ +S+ D G +L W+ C+ +S +N + P+ SS+ + CS C
Sbjct: 84 QNISMVIDTGSELSWLRCN-----------RSSNPNPVNNFDPTRSSSYSPIPCSSPTCR 132
Query: 151 LGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
T SC + K C T+ Y + +SS G L +I H + +++ ++I
Sbjct: 133 TRTRDFLIPASCDSDKL-CHATLSY-ADASSSEGNLAAEIFHFGNSTNDS-------NLI 183
Query: 204 IGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GC SG + GL+G+ G +S +++ G + S+ + D G + G
Sbjct: 184 FGCMGSVSGSDPEEDTKTTGLLGMNRGSLS---FISQMGFPKFSYCISGTDDFPGFLLLG 240
Query: 263 DQG----------PATQQSTSF-LASNGKYITYIIGVET----CCIGSSCL---KQTSFK 304
D P + ST Y + G++ I S L + +
Sbjct: 241 DSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQ 300
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSSQR---- 354
+VDSG+ FTFL VY + ++F Q N +T +E + CY+ S R
Sbjct: 301 TMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTG 360
Query: 355 -LPKLPSVKLMFPQNNSFVVNNPVF-----VIYGTQVVTGFCLAIQPVDG-DIGTIGQNF 407
L +LP+V L+F V P+ + G V F + G + IG +
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHH 420
Query: 408 MTGYRVVFDRENLKLGWSHSNC 429
+ FD + ++G + C
Sbjct: 421 QQNMWIEFDLQRSRIGLAPVQC 442
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 141/350 (40%), Gaps = 45/350 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS- 154
D G D+ W+ C C C Y D + PS S++ +SC + C DL T+
Sbjct: 4 DTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSASYAAVSCDSQRCRDLDTAA 53
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C+N C Y + Y + + + G + L L G + N V IGCG G +
Sbjct: 54 CRNATGACLYEV-AYGDGSYTVGDFATETLTL--GDSTPVGN-----VAIGCGHDNEGLF 105
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD--SGRIFFGDQGPATQQS 271
V GL+ LG G +S PS ++ ++FS C D+D + + FGD
Sbjct: 106 ---VGAAGLLALGGGPLSFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTV 157
Query: 272 TSFLASNGKYIT-YIIGVETCCIGSSCLK-----------QTSFKAIVDSGSSFTFLPKE 319
T+ L + + T Y + + +G L S IVDSG++ T L
Sbjct: 158 TAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSA 217
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + F + + + CY S + ++P+V L F + + ++
Sbjct: 218 AYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYL 277
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I T +CLA P + + IG G RV FD +G++ + C
Sbjct: 278 IPVDGAGT-YCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSASWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 155/382 (40%), Gaps = 65/382 (17%)
Query: 70 VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDL 128
V + ++ T PQ Q+L L D D WIPC C C SA ++
Sbjct: 111 VVRARLGTPPQ-QLL----------LAVDTSNDAAWIPCAGCAGCPTSSAPPFD------ 153
Query: 129 NEYSPSASSTSKHLSCSHRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 186
P+AS++ + + C LC +C + C +++ Y ++S L +D L +
Sbjct: 154 ----PAASTSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTY--ADSSLQAALSQDSLAV 207
Query: 187 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 246
GD A+K + GC K +G P GL+GLG G +S L + + +
Sbjct: 208 --AGD-AVK-----TYTFGCLQKATG---TAAPPQGLLGLGRGPLSF--LSQTRDMYQGT 254
Query: 247 FSMCFDK----DDSGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 299
FS C + SG + G G P ++T LA+ + Y + + +G +
Sbjct: 255 FSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP 314
Query: 300 --------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSS 351
T ++DSG+ FT L Y + E R+V ++S G+ C+ ++
Sbjct: 315 PPALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGGF--DTCFNTT 372
Query: 352 SQRLPKLP----SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF 407
+ P + +++ P+ N + + YGT A V+ + I
Sbjct: 373 AVAWPPVTLLFDGMQVTLPEENVVIHST-----YGTISCLAMAAAPDGVNTVLNVIASMQ 427
Query: 408 MTGYRVVFDRENLKLGWSHSNC 429
+RV+FD N ++G++ C
Sbjct: 428 QQNHRVLFDVPNGRVGFARERC 449
>gi|6323149|ref|NP_013221.1| Yps1p [Saccharomyces cerevisiae S288c]
gi|2507240|sp|P32329.2|YPS1_YEAST RecName: Full=Aspartic proteinase 3; AltName: Full=Proprotein
convertase; AltName: Full=Yapsin-1; Contains: RecName:
Full=Aspartic proteinase 3 subunit alpha; Contains:
RecName: Full=Aspartic proteinase 3 subunit beta; Flags:
Precursor
gi|1256861|gb|AAB82367.1| Yap3p: aspartic proteinase [Saccharomyces cerevisiae]
gi|1297035|emb|CAA61699.1| Aspartyl protease [Saccharomyces cerevisiae]
gi|1360522|emb|CAA97688.1| YAP3 [Saccharomyces cerevisiae]
gi|151941285|gb|EDN59663.1| aspartic protease [Saccharomyces cerevisiae YJM789]
gi|259148106|emb|CAY81355.1| Yps1p [Saccharomyces cerevisiae EC1118]
gi|285813538|tpg|DAA09434.1| TPA: Yps1p [Saccharomyces cerevisiae S288c]
gi|323332551|gb|EGA73959.1| Yps1p [Saccharomyces cerevisiae AWRI796]
gi|323347468|gb|EGA81738.1| Yps1p [Saccharomyces cerevisiae Lalvin QA23]
gi|349579844|dbj|GAA25005.1| K7_Yps1p [Saccharomyces cerevisiae Kyokai no. 7]
gi|365764393|gb|EHN05917.1| Yps1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392297639|gb|EIW08738.1| Yps1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 569
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 109/245 (44%), Gaps = 55/245 (22%)
Query: 222 GLIGLGLGEISV-------------------PSLLAKAGLIR-NSFSMCFDKDDS--GRI 259
G++G+GL E+ V P +L +G I+ N++S+ + D+ G I
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308
Query: 260 FFGDQGPATQQSTSF-------LASNG-----KYITYIIGVETCCIGSS--CLKQTSFKA 305
FG + T + L+++G ++ I G+ GSS L T A
Sbjct: 309 LFGAVDHSKYTGTLYTIPIVNTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
++DSG++ T+LP+ V IA E Q + I GY C P S++++F
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQYSSRI----GYYVLDC--------PSDDSMEIVF 416
Query: 366 PQNNSFVVNNPV--FVIYGTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKL 422
F +N P+ F++ T L I P D GTI G +F+T VV+D ENL++
Sbjct: 417 -DFGGFHINAPLSSFIL---STGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEI 472
Query: 423 GWSHS 427
+ +
Sbjct: 473 SMAQA 477
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/384 (22%), Positives = 155/384 (40%), Gaps = 76/384 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC------- 149
D G DL W+ C C+ C ++ + P+ASS+ ++++C C
Sbjct: 169 DTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTCGDHRCGHVAPPP 218
Query: 150 ----DLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVI 203
+C+ P + PCPY Y ++ ++ L +E ++L + G + + V +
Sbjct: 219 EPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGV----V 274
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIF 260
GCG + G + GL L S L A G ++FS C D ++
Sbjct: 275 FGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVDHGSDVGSKVV 329
Query: 261 FGDQGPATQ-------QSTSFLASNGKYIT----YIIGVETCCIGSSCL----------K 299
FG+ A + T+F ++ Y + ++ +G L K
Sbjct: 330 FGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGK 389
Query: 300 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKL 358
S I+DSG++ ++ + Y+ I F +++ + +P CY S P++
Sbjct: 390 DGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSPCYNVSGVERPEV 449
Query: 359 PSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAI--QPVDGDIGTIGQNFM 408
P + L+F P N F+ +P G ++ CLA+ P G + IG
Sbjct: 450 PELSLLFADGAVWDFPAENYFIRLDPD----GGSIM---CLAVLGTPRTG-MSIIGNFQQ 501
Query: 409 TGYRVVFDRENLKLGWSHSNCQDL 432
+ VV+D +N +LG++ C ++
Sbjct: 502 QNFHVVYDLQNNRLGFAPRRCAEV 525
>gi|323336649|gb|EGA77915.1| Yps1p [Saccharomyces cerevisiae Vin13]
Length = 516
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 109/245 (44%), Gaps = 55/245 (22%)
Query: 222 GLIGLGLGEISV-------------------PSLLAKAGLIR-NSFSMCFDKDDS--GRI 259
G++G+GL E+ V P +L +G I+ N++S+ + D+ G I
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308
Query: 260 FFGDQGPATQQSTSF-------LASNG-----KYITYIIGVETCCIGSS--CLKQTSFKA 305
FG + T + L+++G ++ I G+ GSS L T A
Sbjct: 309 LFGAVDHSKYTGTLYTIPIVNTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
++DSG++ T+LP+ V IA E Q + I GY C P S++++F
Sbjct: 369 LLDSGTTLTYLPQTVVSMIATELGAQYSSRI----GYYVLDC--------PSDDSMEIVF 416
Query: 366 PQNNSFVVNNPV--FVIYGTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKL 422
F +N P+ F++ T L I P D GTI G +F+T VV+D ENL++
Sbjct: 417 -DFGGFHINAPLSSFIL---STGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEI 472
Query: 423 GWSHS 427
+ +
Sbjct: 473 SMAQA 477
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 103/446 (23%), Positives = 170/446 (38%), Gaps = 60/446 (13%)
Query: 15 LLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKK----SFEYYQVLL---- 66
LL + + T + +L + E + + R KK S+E +
Sbjct: 81 LLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYENVAGVTAEFG 140
Query: 67 SSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLD 125
S V + +G F + ++ + D G D++WI C+ C C Y+ D
Sbjct: 141 SEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCREC-------YSQAD 193
Query: 126 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 185
N PS+S + + C +C + C Y + Y + + + G + L
Sbjct: 194 PIFN---PSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSY-GDGSYTVGSYATETLT 249
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 245
G +++N V IGCG G + V GL+GLG G +S P+ L
Sbjct: 250 F---GTTSIQN-----VAIGCGHDNVGLF---VGAAGLLGLGAGSLSFPAQLGTQ--TGR 296
Query: 246 SFSMCF---DKDDSGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 301
+FS C D + SG + FG + P T +A+ Y + + +G L
Sbjct: 297 AFSYCLVDRDSESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSV 356
Query: 302 SFKA------------IVDSGSSFTFLPKEVYETIAAEFDRQVN-----DTITSFEGYPW 344
+A I+DSG++ T L Y+ + F D I+ F+
Sbjct: 357 PSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFD---- 412
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIG 404
CY S+ + +P+V F F++ +I + T FC A P D ++ +G
Sbjct: 413 -TCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGT-FCFAFAPADSNLSIMG 470
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNCQ 430
G RV FD N +G++ CQ
Sbjct: 471 NIQQQGIRVSFDSANSLVGFAIDQCQ 496
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 136/353 (38%), Gaps = 73/353 (20%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSC 155
D G D+ W+ C+ C +P A + +L + P+ASST +CS C LG S
Sbjct: 126 DTGSDVSWVQCEPCPAPSPCHA-HAGAL------FDPAASSTYAAFNCSAAACAQLGDSG 178
Query: 156 Q----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
+ + K C Y + Y + ++++G D+L L SG D V GC +
Sbjct: 179 EANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTL-SGSD------VVRGFQFGCSHAEL 230
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS 271
G +D DGLIGLG G+ P + A SF C PAT S
Sbjct: 231 GAGMDDKT-DGLIGLG-GDAQSP-VSQTAARYGKSFFYCL--------------PATPAS 273
Query: 272 TSFL----------ASNGKYIT------------YIIGVETCCIGSS--CLKQTSFKA-- 305
+ FL ++ T Y +E +G L + F A
Sbjct: 274 SGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGS 333
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
+VDSG+ T LP Y +++ F + + C+ + +P+V L+F
Sbjct: 334 LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVF 393
Query: 366 PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFD 416
V + +V+G CLA P D GTIG + V++D
Sbjct: 394 -------AGGAVVDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 144/363 (39%), Gaps = 52/363 (14%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL--CD--LGT 153
D G DL+W +CAP S + L Y+P++S+T L C+ L C L
Sbjct: 110 DTGSDLIW-----TQCAPCSGDQCFAQPAPL--YNPASSTTFGVLPCNSSLSMCAGVLAG 162
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
P C Y Y T T+ G+ + G A + + GC S
Sbjct: 163 KAPPPGCACMYNQTYGTGWTA--GVQGSETFTF---GSAAADQARVPGIAFGCSNASSSD 217
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQ 269
+ +G A GL+GLG G +S+ S L FS C D + + + G
Sbjct: 218 W-NGSA--GLVGLGRGSLSLVSQLGA-----GRFSYCLTPFQDTNSTSTLLLGPSAALNG 269
Query: 270 ---QSTSFLASNGKY---ITYIIGVETCCIGSSCLKQT----SFKA------IVDSGSSF 313
+ST F+AS K Y + + +G+ L + S KA I+DSG++
Sbjct: 270 TGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTI 329
Query: 314 TFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYK--SSSQRLPKLPSVKLMFPQNNS 370
T L Y+ + A V I + CY + + P +PS+ L F
Sbjct: 330 TSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHF-DGAD 388
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQ-PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V+ ++I G+ V +CLA++ DG + T G +++D N L ++ + C
Sbjct: 389 MVLPADSYMISGSGV---WCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKC 445
Query: 430 QDL 432
L
Sbjct: 446 STL 448
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 142/376 (37%), Gaps = 75/376 (19%)
Query: 95 LGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--G 152
L D D W C P S S + +P+ S++ L CS +C + G
Sbjct: 92 LALDTSADATWAHCSPCGTCPSSGSLF----------APANSTSYAPLPCSSTMCTVLQG 141
Query: 153 TSC--QNP-KQPCPYTMDYYTE---NTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C Q+P P M +T+ + S L D LHL G +A+ N GC
Sbjct: 142 QPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWLHL---GKDAIPN-----YAFGC 193
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDKDDS----GRIFF 261
SG + + GL+GLG G ++ LL++ G + N FS C S G +
Sbjct: 194 VSAVSGPTAN-LPKQGLLGLGRGPMA---LLSQVGNMYNGVFSYCLPSYKSYYFSGSLRL 249
Query: 262 GDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSG 310
G G P + T L + + Y + V +G + +K T +VDSG
Sbjct: 250 GAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSG 309
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGY----PWKCCYKSSSQRLPKLPSV----- 361
+ T VY + EF R V + GY + C+ + P+V
Sbjct: 310 TVITRWTPPVYAALREEFRRHV----AAPSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMD 365
Query: 362 ---KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI----QPVDGDIGTIGQNFMTGYRVV 414
L P N+ + ++ + CLA+ Q V+ + + RVV
Sbjct: 366 GGLDLALPMENTLIHSSATPLA---------CLAMAEAPQNVNAVVNVLANLQQQNLRVV 416
Query: 415 FDRENLKLGWSHSNCQ 430
FD N ++G++ +C
Sbjct: 417 FDVANSRVGFARESCN 432
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/387 (22%), Positives = 147/387 (37%), Gaps = 70/387 (18%)
Query: 91 KTMSLGNDFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 146
+ +S D G ++W PC C C S+ ++ + + ++P SS+SK L C +
Sbjct: 98 QKLSFLVDTGSHVVWAPCTTHYTCTNC-----SFSDAEPKKVPIFNPKLSSSSKILGCRN 152
Query: 147 RLC------DLGTSC-------QNPKQPCP-YTMDYYTENTSSSGLLVEDILHLISGGDN 192
C D+ C +N CP Y++ Y T SS L+E++
Sbjct: 153 PKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGT-GASSGDFLLENL--------- 202
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA--KAGLIRNSFSMC 250
++GC G V L G G S+P + K NS
Sbjct: 203 NFPGKTIHEFLVGCTTSAVGE----VTSAALAGFGRSMFSLPMQMGVKKFAYCLNSHDYD 258
Query: 251 FDKDDSGRIF-FGDQGPATQQSTSFLASNGKY-ITYIIGVETCCIGSSCLKQTS-FKA-- 305
++ S I + D FL + + I Y +GV+ IG+ L+ S + A
Sbjct: 259 DTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPG 318
Query: 306 -------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW---KCCYKSSSQRL 355
++DSG ++ ++ V++ + E ++++ S E CY + Q+
Sbjct: 319 SDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCYNFTGQKS 378
Query: 356 PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG----- 410
K+P + F + VV + + ++ LA P+ D GT F G
Sbjct: 379 IKIPDLIYQFRGGATMVVPGKNYFV----LIPEISLACFPLTTDAGTNTLEFTPGPSIIL 434
Query: 411 -------YRVVFDRENLKLGWSHSNCQ 430
Y V FD +N +LG+ CQ
Sbjct: 435 GNSQHVDYYVEFDLKNERLGFRQQTCQ 461
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/337 (26%), Positives = 138/337 (40%), Gaps = 67/337 (19%)
Query: 92 TMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL 151
TM L D G +L W+ C + +P S +N L + YSP S+ C R DL
Sbjct: 1014 TMVL--DTGSELSWLHC---KKSPNLTSVFNPLSS--SSYSPIPCSSP---ICRTRTRDL 1063
Query: 152 GTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
+PK+ C + + Y + +S G L D + G +AL + + GC
Sbjct: 1064 PNPVTCDPKKLC-HAIVSYADASSLEGNLASDNFRI---GSSALPGT-----LFGC---M 1111
Query: 211 SGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSGRIFFGD-- 263
G+ D GL+G+ G +S + + GL + FS C +D SG + FGD
Sbjct: 1112 DSGFSSNSEEDAKTTGLMGMNRGSLS---FVTQLGLPK--FSYCISGRDSSGVLLFGDLH 1166
Query: 264 --------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKA 305
P Q ST + + Y + ++ +G+ L + +
Sbjct: 1167 LSWLGNLTYTPLVQISTPLPYFD--RVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQT 1224
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS-------FEGYPWKCCYKSSSQRLPKL 358
+VDSG+ FTFL VY + EF Q + F+G C ++ +LP L
Sbjct: 1225 MVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTL 1284
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTG----FCL 391
PSV LMF + VV V + +++ G +CL
Sbjct: 1285 PSVSLMF-RGAEMVVGGEVLLYRVPEMMKGNEWVYCL 1320
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 139/354 (39%), Gaps = 56/354 (15%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C C A ++ PS SST K C G SC
Sbjct: 79 DTGSDLIWTQCMPCPNCYTQFAPIFD----------PSKSSTFKEKRCH------GNSC- 121
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
PY + Y E+ S+ L E + + G+ V A IGCG+ S
Sbjct: 122 ------PYEIIYADESYSTGILATETVTIQSTSGEPF----VMAETSIGCGLNNSNLMTP 171
Query: 217 GVAPD--GLIGLGLGEISVPSLLAKAGL-IRNSFSMCFDKDDSGRIFFGDQ----GPATQ 269
G A G++GL +G SL+++ L I S CF + +I FG G T
Sbjct: 172 GYAASSSGIVGLNMGP---SSLISQMDLPIPGLISYCFSSQGTSKINFGTNAVVAGDGTV 228
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGSSFTFLPKEVYE 322
+ F+ + + Y + ++ +G ++ T F A +DSG+++T+LP
Sbjct: 229 AADMFIKKDQPF--YYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTTYTYLPTSYCN 286
Query: 323 TIAAEFDRQVNDTITSFEGYPWK-CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ V + CY + + P + L F V++ + +Y
Sbjct: 287 LVREAVAASVVAANQVPDPSSENLLCYNWDTMEI--FPVITLHFAGGADLVLDK--YNMY 342
Query: 382 GTQVVTG--FCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ +TG FCLAI VD + I G V +D L + +S +NC L
Sbjct: 343 -VETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCSAL 395
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 130/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 SFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVVN-NPVFVIYGTQVVTGFCLAIQPVD 397
F + + VFV Q +CLA P +
Sbjct: 285 GARFDLGIHGVFVERSVQEQDVWCLAFAPTE 315
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 139/361 (38%), Gaps = 45/361 (12%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
++TM + D D+ W V+CAP A + ++ L Y PS SS+S CS C
Sbjct: 155 AQTMVI--DTASDVPW-----VQCAPCPAPHCHAQTDVL--YDPSKSSSSAAFPCSSPAC 205
Query: 150 -DLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
+LG C C Y + Y + ++S+G + D+L L + A S + G
Sbjct: 206 RNLGPYANGCTPAGDQCQYRVQ-YPDGSASAGTYISDVLTL----NPAKPASAISEFRFG 260
Query: 206 C--GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 263
C + Q G + + + G++ LG G S+P+ + FS C FF
Sbjct: 261 CSHALLQPGSFSNKTS--GIMALGRGAQSLPT--QTKATYGDVFSYCLPPTPVHSGFFIL 316
Query: 264 QGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIVDSGSSFTF 315
P S T L S + Y++ + + L + A++DS + T
Sbjct: 317 GVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTR 376
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP-----KLPSVKLMFPQNNS 370
LP Y + A F ++ + CY S KLP + L+F N
Sbjct: 377 LPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNG 436
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSN 428
V +P + V+ CLA P D G IG V+++ + +G+
Sbjct: 437 AVELDP------SGVLLDGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGA 490
Query: 429 C 429
C
Sbjct: 491 C 491
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/415 (22%), Positives = 153/415 (36%), Gaps = 84/415 (20%)
Query: 86 PSQGSKTMSLGNDFGCDLLWIPC---DCVRCAPLSASYYNSLDRDLNEYSPSASST-SKH 141
P + +SL D G DL+W PC C+ C N+ N +P T S+
Sbjct: 91 PLSTANPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNN-----NSSNPLPPPTDSRR 145
Query: 142 LSCSHRLCDLGTSCQNPKQPCPYT---MDYYTENTSSSGLLVEDILHLISGGDNAL---- 194
+ C+ C S P C +D + ++ + + GD +L
Sbjct: 146 IPCASPFCSAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAY--GDGSLVARL 203
Query: 195 ---KNSVQASVII-----GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 246
+ + ASV + C G P G+ G G G +S+P+ LA A L
Sbjct: 204 RRGRVGIAASVAVENFTFACAHTALG------EPVGVAGFGRGPLSLPAQLAPAAL-SGR 256
Query: 247 FSMC-----FDKDDSGR---IFFGD---QGPATQQSTSF--LASNGKY-ITYIIGVETCC 292
FS C F D R + G + PA++ + L N K+ Y + +E
Sbjct: 257 FSYCLVAHSFRADRPIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVS 316
Query: 293 IGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR------------ 330
+G + + + +VDSG++FT LP E Y +A EF R
Sbjct: 317 VGGTRIPARPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEA 376
Query: 331 ---QVNDTITSFEGYPWKCCYKSSSQRLPKLP-----SVKLMFPQNNSFVVNNPVFVIYG 382
Q + + + S++ +P L ++ P+ N F+ F
Sbjct: 377 AEDQTGLAPCYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFM----GFRSEE 432
Query: 383 TQVVTGFCLAIQPVD---GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
+ V L D G GT+G G+ VV+D + ++G++ C DL D
Sbjct: 433 RRRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDLWD 487
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 100/445 (22%), Positives = 165/445 (37%), Gaps = 60/445 (13%)
Query: 15 LLTESSGAETVMFSTKLIHRFSEE-VKALGVSKNRNATSWPAKKSFEYYQVLLSSD---- 69
LL +++ T + +L + E V+ G+ + T K Y+ + D
Sbjct: 84 LLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYENVAEVDADFG 143
Query: 70 ---VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLD 125
V + +G F + ++ + D G D+ WI C+ C C Y+ D
Sbjct: 144 GEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCREC-------YSQAD 196
Query: 126 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 185
N PS S++ + C +C + C Y Y + S+ E +
Sbjct: 197 PIFN---PSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL-- 251
Query: 186 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 245
+ A+V IGCG K G + + GL+GLG G +S P+ + +
Sbjct: 252 -------TFGTTSVANVAIGCGHKNVGLF---IGAAGLLGLGAGALSFPNQIGTQ--TGH 299
Query: 246 SFSMCF---DKDDSGRIFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK-- 299
+FS C + D SG + FG + + L N T Y + V +G + L
Sbjct: 300 TFSYCLVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSI 359
Query: 300 --------QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQV-----NDTITSFEGYPW 344
+TS I+DSG+ T L Y+ + F D ++ F+
Sbjct: 360 PPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFD---- 415
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIG 404
CY S + +P+V F S ++ ++I V T FC A P + +G
Sbjct: 416 -TCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGT-FCFAFAPAASSVSIMG 473
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNC 429
RV FD N +G++ C
Sbjct: 474 NTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 147/366 (40%), Gaps = 60/366 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG--TSC 155
D G +L W+ C + P S +N L + Y+P+ ++S C R DL SC
Sbjct: 77 DTGSELSWLHC---KKLPNLNSTFNPLLS--SSYTPTPCNSS---VCMTRTRDLTIPASC 128
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+P + + Y + +S+ G L + +L + Q + GC S GY
Sbjct: 129 -DPNNKLCHVIVSYADASSAEGTLAAETF--------SLAGAAQPGTLFGC--MDSAGYT 177
Query: 216 DGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGD--QGPAT 268
+ D GL+G+ G +S+ + ++ FS C +D+ G + GD P+
Sbjct: 178 SDINEDAKTTGLMGMNRGSLSLVT-----QMVLPKFSYCISGEDAFGVLLLGDGPSAPSP 232
Query: 269 QQSTSFLASNG-----KYITYIIGVETCCIGSSCLK--QTSF--------KAIVDSGSSF 313
Q T + + + Y + +E + L+ ++ F + +VDSG+ F
Sbjct: 233 LQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQF 292
Query: 314 TFLPKEVYETIAAEFDRQVNDTITS-------FEGYPWKCCYKSSSQRLPKLPSVKLMFP 366
TFL VY ++ EF Q +T FEG CY + + L +P+V L+F
Sbjct: 293 TFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEG-AMDLCYHAPAS-LAAVPAVTLVFS 350
Query: 367 QNNSFVVNNPVF--VIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLG 423
V + V G V F + G + IG + + FD ++G
Sbjct: 351 GAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLVKSRVG 410
Query: 424 WSHSNC 429
++ + C
Sbjct: 411 FTETTC 416
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 146/369 (39%), Gaps = 43/369 (11%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F + +K M L D G D+ WI C+ C C S +N P++
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFN----------PTS 208
Query: 136 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
SST K L+CS C L + C Y + Y + + + G L D ++ G++
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVS-YGDGSFTVGELATDT---VTFGNSGKI 264
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 255
N V +GCG G + GL+GLG G +S+ + + SFS C D
Sbjct: 265 NDVA----LGCGHDNEGLF---TGAAGLLGLGGGALSITNQMKAT-----SFSYCLVDRD 312
Query: 256 SGR---IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL----------KQT 301
SG+ + F + +T+ L N K T Y +G+ +G +
Sbjct: 313 SGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASG 372
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPS 360
S I+D G++ T L + Y ++ F + + + CY SS K+P+
Sbjct: 373 SGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPT 432
Query: 361 VKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENL 420
V F S + ++I T FC A P + IG G R+ +D N
Sbjct: 433 VAFHFTGGKSLDLPAKNYLIPVDDNGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLANK 491
Query: 421 KLGWSHSNC 429
+G S + C
Sbjct: 492 IIGLSGNKC 500
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 143/377 (37%), Gaps = 42/377 (11%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F L ++++ + D G DL W+ C C C Y D + P
Sbjct: 126 SGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSC-------YKQAD---PIFDPRN 175
Query: 136 SSTSKHLSCSHRLCDLGT--SCQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
SS+ + + C LC SC + C Y + Y + + S G D+ L +G
Sbjct: 176 SSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTG- 233
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
S SV GCG G + GL L S + NSFS C
Sbjct: 234 ------SKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYC 287
Query: 251 F-DKDD-----SGRIFFGDQGPATQQSTSFLASNGK----YITYIIGVET------CCIG 294
D+ + S + FG + + S L N K Y +IGV +
Sbjct: 288 LVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLK 347
Query: 295 SSCLKQT-SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 353
S L Q+ S I+DSG+S T P VY TI F + ++ + CY S +
Sbjct: 348 SLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNFSGK 407
Query: 354 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRV 413
+P++ L F +N + + P + FCLA P ++G IG +R+
Sbjct: 408 ASVDVPALVLHF-ENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRI 466
Query: 414 VFDRENLKLGWSHSNCQ 430
FD + L ++ C+
Sbjct: 467 GFDLQKSHLAFAPQQCK 483
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 134/355 (37%), Gaps = 63/355 (17%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C C A ++ PS SST K C+ G SC
Sbjct: 79 DTGSDLIWTQCMPCTNCYSQYAPIFD----------PSNSSTFKEKRCN------GNSCH 122
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
Y + Y S L E + +H SG + V IGCG S
Sbjct: 123 -------YKIIYADTTYSKGTLATETVTIHSTSG-----EPFVMPETTIGCGHNSSW--- 167
Query: 216 DGVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ--- 270
P G++GL G S+ + G S CF + +I FG
Sbjct: 168 --FKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVV 223
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGSSFTFLPKEVYET 323
ST+ + K Y + ++ +G + ++ T+F A I+DSG++ T+ P
Sbjct: 224 STTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVSYCNL 283
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGT 383
+ D V T+ CY + + + P + + F V++ + +Y
Sbjct: 284 VREAVDHYVTAVRTADPTGNDMLCYYTDTIDI--FPVITMHFSGGADLVLDK--YNMYIE 339
Query: 384 QVVTG-FCLAI----QPVDGDIGTIGQ-NFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ G FCLAI P D G Q NF+ GY D +L + +S +NC L
Sbjct: 340 TITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGY----DSSSLLVFFSPTNCSAL 390
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/367 (23%), Positives = 138/367 (37%), Gaps = 54/367 (14%)
Query: 95 LGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LG 152
L D G D+ W+ C C RC P S ++ P S++ + + C LG
Sbjct: 149 LAMDTGSDITWLQCQPCRRCYPQSGPVFD----------PRHSTSYREMGYDAPDCQALG 198
Query: 153 TSC--QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI-IGCGMK 209
S + C Y + Y + +++ G +E+ L G VQ + IGCG
Sbjct: 199 RSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGG--------VQVPHMSIGCGHD 250
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--------DKDDSGRIFF 261
G + A G++GLG G+IS PS +A G SFS C + S +
Sbjct: 251 NKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSVSSTLTI 308
Query: 262 GDQGPATQQSTSFLAS--NGKYITYIIGVETCCIGSSC---------LKQTSFKA----I 306
GD A SF + N T+ LK + I
Sbjct: 309 GDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLDPYTGRGGVI 368
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYKSSSQRLPKLPSVKL 363
+DSG++ T L + Y F D G P + CY + + K+P+V +
Sbjct: 369 LDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGRAM-KVPTVSM 427
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKL 422
F + ++I + T C A D + IG G+RVV++ ++
Sbjct: 428 HFAGGVELTLPPKNYLIPVDSMGT-VCFAFAGTGDRSVSIIGNIQQQGFRVVYNIGGGRV 486
Query: 423 GWSHSNC 429
G++ ++C
Sbjct: 487 GFAPNSC 493
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/188 (26%), Positives = 84/188 (44%), Gaps = 15/188 (7%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D+LW+ C CV C PL +++ + P ASS++ L+CS + C +
Sbjct: 100 DTGSDVLWVSCISCVGC-PL---------QNVTFFDPGASSSAVKLACSDKRCFSDLHKK 149
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY-L 215
+ P Y ++Y ++ + +SG + D++ + + L A + GC +G L
Sbjct: 150 SGCSPLEYKVEY-SDGSFTSGYYISDLISFETVMSSNLTVKSSAPFVFGCSNLHAGLISL 208
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFFGDQGPATQQSTS 273
+ G++GLG G + V S L+ L FS+C ++ G I G+ T
Sbjct: 209 PETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLSGGQEGGGVIILGENRLPNTVYTP 268
Query: 274 FLASNGKY 281
+ S Y
Sbjct: 269 LVRSQTHY 276
>gi|222640101|gb|EEE68233.1| hypothetical protein OsJ_26421 [Oryza sativa Japonica Group]
Length = 439
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 143/389 (36%), Gaps = 72/389 (18%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAP----LSASYYNSLDRDLNEYSPSASSTSKHLSC 144
KT + D +++W+ C C C P S +YYN+ S S + LSC
Sbjct: 73 KTRLVSFDTAVNMVWLQCSDYCRDCNPSQVGTSTTYYNA----------SMSISYNPLSC 122
Query: 145 SHRLCDLGTSCQNPKQPCPYTMD----YYTENTSSSGLLVEDIL--HLISGGDNALKNSV 198
H LC G + + +Q MD + ++ ++G V+ IL IS D+
Sbjct: 123 DHPLCGAGDN--HDQQVLAECMDGTCTFKVDSLDNNGGWVQGILGSDRISISDHFFF-LF 179
Query: 199 QASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 257
++I GC Y LD G++GLGLG+ S+P ++ FS C
Sbjct: 180 DTNIIFGCATVDHSKYTLDQYGSSGVVGLGLGKYSLPQQISVT-----RFSYCLPSWVKN 234
Query: 258 RIF------FGDQGPATQQSTSFLASNGKYITYIIGVETCCI-----GSSC--------- 297
+F FG T FL KY + G+ + GS+
Sbjct: 235 ELFSPPYVLFGSNAVLQGDMTPFLPGFPKYYLKLEGISYGIVRLDIFGSNAAAADQYHQQ 294
Query: 298 --------LKQTSFKAI-VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 348
L F A+ V+S + LP YE + EF+ Q N + P CY
Sbjct: 295 AQFCRGPYLPDAQFYAMSVESATFPLMLPSRAYELLEKEFE-QDNPLLIKSRLQPMNTCY 353
Query: 349 KSSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDI 400
K S + ++ L F +N +F+ + + G Q CL +
Sbjct: 354 KGSVDDIADNATITLHFHGGIDLQLSRNATFM---EITSMNGDQEERYVCLIVDKTVDGT 410
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+G + + + FD EN ++ C
Sbjct: 411 AVLGLSPQLDHNIGFDLENKQISIYRKIC 439
>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like, partial [Brachypodium distachyon]
Length = 364
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/227 (28%), Positives = 101/227 (44%), Gaps = 43/227 (18%)
Query: 169 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 228
Y + +SS G L D+ + S S++A+ GC DGVA GL+G+
Sbjct: 65 YADGSSSDGALATDVFAVGSA-----TPSLRAA--FGCMASAFDSSPDGVASAGLLGMNR 117
Query: 229 GEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFG----------DQGPATQQSTSF--- 274
G +S +++AG R FS C D+DD+G + G + P Q S
Sbjct: 118 GALS---FVSQAGTRR--FSYCISDRDDAGVLLLGHSDLPNFLPLNYTPLYQPSLPLPYF 172
Query: 275 --LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYETIAAEFD 329
+A + + + ++G + I +S L A +VDSG+ FTFL + Y + AEF
Sbjct: 173 DRVAYSVQLLGILVGSKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYAALKAEFY 232
Query: 330 RQ-------VNDTITSFEGYPWKCCYKSSSQRLPK----LPSVKLMF 365
RQ +++ +F+G + C++ P LPSV L F
Sbjct: 233 RQSTPFLRALDEPSFAFQGA-FDTCFRVPRGMSPPPGRLLPSVTLRF 278
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 143/351 (40%), Gaps = 46/351 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSC 155
D G D++W+ C C C Y+ D N P S + + C LC L +
Sbjct: 60 DTGSDIVWLQCAPCKNC-------YSQTDPVFN---PVKSGSFAKVLCRTPLCRRLESPG 109
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
N +Q C Y + Y + + ++G V + L + + V +GCG G +
Sbjct: 110 CNQRQTCLYQVSY-GDGSYTTGEFVTETL--------TFRRTKVEQVALGCGHDNEGLF- 159
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR---IFFGDQGPATQQS 271
V GL+GLG G +S PS + FS C D+ S + + FG+ +
Sbjct: 160 --VGAAGLLGLGRGGLSFPSQAGRT--FNQKFSYCLVDRSASSKPSSVVFGNSAVSRTAR 215
Query: 272 TSFLASNGK----YITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPKE 319
+ L +N + Y ++G+ S + + FK I+D G+S T L K
Sbjct: 216 FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKP 275
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN-SFVVNNPVF 378
Y + F + ++ E + CY S + K+P+V L F + S +N +
Sbjct: 276 AYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLI 335
Query: 379 VIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ G+ FC A + IG G+RVV+D + ++G+S C
Sbjct: 336 PVDGSGR---FCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVVNNP-VFVIYGTQVVTGFCLAIQPVD 397
F + + VFV Q +CLA P +
Sbjct: 285 GARFDLGSKGVFVERSVQEQDVWCLAFAPTE 315
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 107/458 (23%), Positives = 176/458 (38%), Gaps = 87/458 (18%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQ 63
+ L +Y+ + +L G FS ++IHR S +R+ P + F+
Sbjct: 13 VLLCLYINISFLNALDGGG----FSVEIIHRDS----------SRSPYYRPTETQFQRVA 58
Query: 64 VLLSSDVQKQKMKTGPQF--------QMLFPSQGSKTMS----------LG-NDFGCDLL 104
L + + P + SQG MS LG D G D++
Sbjct: 59 NALRRSINRANHFNKPNLVASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDII 118
Query: 105 WIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTSCQNPKQ 160
W+ C C C YN + + PS S T K L CS +C SC +
Sbjct: 119 WLQCQPCEDC-------YN---QTTPIFDPSQSKTYKTLPCSSNICQSVQSAASCSSNND 168
Query: 161 PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP 220
C YT+ Y +N+ S G L + L L S ++++ +IGCG G +
Sbjct: 169 ECEYTIT-YGDNSHSQGDLSVETLTLGSTDGSSVQ---FPKTVIGCGHNNKGTF----QR 220
Query: 221 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ---ST 272
+G +GLG V + + I FS C + S ++ FGD+ + + ST
Sbjct: 221 EGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAVVSGRGTVST 280
Query: 273 SFLASNGKYITYIIGVETCCIGSSCL---------KQTSFKAIVDSGSSFTFLPKEVYET 323
+ NG Y + +E +G + + I+DSG++ T LP++ Y
Sbjct: 281 PIVPKNGLGF-YFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTTLTILPEDDYLN 339
Query: 324 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV--FVIY 381
+ + + + CY+++S +P + F + V NP+ F+
Sbjct: 340 LESAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITAHFKGAD--VELNPISTFIEV 397
Query: 382 GTQVVTGFCLA-----IQPVDGDIGTIGQNFMTGYRVV 414
VV C A I P+ G++ QN + GY +V
Sbjct: 398 DEGVV---CFAFRSSKIGPIFGNLAQ--QNLLVGYDLV 430
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 108/257 (42%), Gaps = 42/257 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ C +D+ + P+ S+T + L C+ C+
Sbjct: 108 DTGSDLIWTQCAPCLLC----------VDQPTPYFDPARSATYRSLGCASPACNALYYPL 157
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
++ C Y +Y ++ S++G+L + G N + S+ + GCG +G +
Sbjct: 158 CYQKVCVYQY-FYGDSASTAGVLANETFTF---GTNETRVSLPG-ISFGCGNLNAGLLAN 212
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG--------DQGPA 267
G G++G G G +S L+++ G R S+ + F R++FG +
Sbjct: 213 G---SGMVGFGRGSLS---LVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSE 266
Query: 268 TQQSTSFLASNGKYITYIIGVETCCIGSSCL-----------KQTSFKAIVDSGSSFTFL 316
QST F+ + Y + + +G L + I+DSG++ T+L
Sbjct: 267 PVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYL 326
Query: 317 PKEVYETIAAEFDRQVN 333
+ Y+ + A F Q+
Sbjct: 327 AEPAYDAVRAAFASQIT 343
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 111/481 (23%), Positives = 192/481 (39%), Gaps = 94/481 (19%)
Query: 1 MNRISLTIYLAVFWLLTES--SGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWP---- 54
M+ +SL + LA+F + S + V+ K+ + F ++K V +N T +
Sbjct: 4 MSSLSLVVALAIFAFVFSHAFSTSRRVLEHPKVQNGFRAKLKH--VDSGKNLTKFERIQH 61
Query: 55 ----AKKSFEYYQVLL-----SSDVQKQKMKTGPQFQM-LFPSQGSKTMSLGNDFGCDLL 104
+ + ++ + +S++ + +F M L +T S D G DL+
Sbjct: 62 GVKRGRHRLQRFKAMALVASSNSEIDAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLI 121
Query: 105 WIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 163
W C C +C D+ + P SS+ LSCS +LC+ P+ C
Sbjct: 122 WTQCKPCTQC----------FDQPTPIFDPKKSSSFSKLSCSSKLCE-----ALPQSTCS 166
Query: 164 YTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GYLDGVA 219
+Y Y + +S+ G+L + L K SV V GCG G G+ G
Sbjct: 167 DGCEYLYGYGDYSSTQGMLASETLTFG-------KVSV-PEVAFGCGEDNEGSGFSQG-- 216
Query: 220 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQGPATQ-----QS 271
GL+GLG G +S+ S L + FS C D + + G ++
Sbjct: 217 -SGLVGLGRGPLSLVSQLKEP-----KFSYCLTSVDDTKASTLLMGSLASVKASDSEIKT 270
Query: 272 TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--------IVDSGSSFTFLPKEVY 321
T + ++ + Y + +E +G + L K+++F I+DSG++ T+L + +
Sbjct: 271 TPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAF 330
Query: 322 ETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKL----PSVKLMFPQNNSFVV 373
+ +A EF Q+N + + + C+ S+ +PKL L P N +
Sbjct: 331 DLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGADLELPAENYMIA 390
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDG--DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQD 431
+ + V CLA+ G G I Q M V+ D E L + + C +
Sbjct: 391 DASMGVA---------CLAMGSSSGMSIFGNIQQQNML---VLHDLEKETLSFLPTQCDE 438
Query: 432 L 432
L
Sbjct: 439 L 439
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 100/397 (25%), Positives = 150/397 (37%), Gaps = 73/397 (18%)
Query: 91 KTMSLGNDFGCDLLWIPC-----DCVRCAPLSASYYNS-------------LDRDLNEY- 131
K + + D G DL W+PC DC+ C Y N+ RDL
Sbjct: 40 KVIQVYMDTGSDLTWVPCGNLSFDCMDCN----DYRNNKLMSTYSPSYSSSSLRDLCVSP 95
Query: 132 --SPSASSTSKHLSCSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLIS 188
S SS + + C+ C L T + +PCP Y G L D L
Sbjct: 96 LCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDTL-TTH 154
Query: 189 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
G + V + GC Y + P G+ G G G +S+PS L G ++ FS
Sbjct: 155 GSSPSFTREV-PNFCFGC---VGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 204
Query: 249 MCF-------DKDDSGRIFFGDQGPATQQSTSF--LASNGKYITYI-IGVETCCIGSSCL 298
CF + + S + GD ++ F L N Y Y IG+E +G++
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 264
Query: 299 KQ--TSFKA---------IVDSGSSFTFLPKEVY-------ETIAAEFDRQVNDTITSFE 340
Q +S + I+DSG+++T LP Y ++I Q + T F+
Sbjct: 265 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 324
Query: 341 -GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NNPVFVIYGTQVVTGF-CLAIQPV 396
Y C + LPS+ F N S V+ N + + T CL +Q +
Sbjct: 325 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNM 384
Query: 397 D----GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
D G G G +VV+D E ++G+ +C
Sbjct: 385 DDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|393215979|gb|EJD01470.1| aspartic peptidase A1 [Fomitiporia mediterranea MF3/22]
Length = 412
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 125/311 (40%), Gaps = 33/311 (10%)
Query: 129 NEYSPSASSTSKHLSCS-HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 187
N + PS TS ++C H D S + K + ++Y + S G + D+L +
Sbjct: 124 NLWVPSTKCTS--IACFLHAKYDSSASSTHKKNGTSFKIEY--GSGSMEGFVSNDVLSI- 178
Query: 188 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL------LAKAG 241
GD + + A G+ + G DG+ +GLG ISV + + G
Sbjct: 179 --GDLKIHDQDFAEATKEPGLAFAFGKFDGI-----LGLGYDTISVNHITPPFYSMVNKG 231
Query: 242 LIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 298
L+ SF + ++D G FG + A + + + + G L
Sbjct: 232 LLDAPVFSFRLGSSEEDGGEAVFGGIDESAYSGKINYAPVRRKAYWEVELPKVAFGDDVL 291
Query: 299 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
+ + A +D+G+S LP +V E + A Q+ T + W Y +++P L
Sbjct: 292 ELENTGAAIDTGTSLIALPSDVAEMLNA----QIGATKS------WNGQYTVDCKKVPDL 341
Query: 359 PSVKLMFP-QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDR 417
P L F Q ++ + + GT + + L I G + IG F+ Y V+D
Sbjct: 342 PDFTLWFNGQAYPLKGSDYILEVQGTCISSFTGLDINVPGGSLWIIGDVFLRRYFTVYDH 401
Query: 418 ENLKLGWSHSN 428
+G+++SN
Sbjct: 402 GRDAVGFANSN 412
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 100/397 (25%), Positives = 150/397 (37%), Gaps = 73/397 (18%)
Query: 91 KTMSLGNDFGCDLLWIPC-----DCVRCAPLSASYYNS-------------LDRDLNEY- 131
K + + D G DL W+PC DC+ C Y N+ RDL
Sbjct: 23 KVIQVYMDTGSDLTWVPCGNLSFDCMDCN----DYRNNKLMSTYSPSYSSSSLRDLCVSP 78
Query: 132 --SPSASSTSKHLSCSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLIS 188
S SS + + C+ C L T + +PCP Y G L D L
Sbjct: 79 LCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDTL-TTH 137
Query: 189 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
G + V + GC Y + P G+ G G G +S+PS L G ++ FS
Sbjct: 138 GSSPSFTREV-PNFCFGC---VGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 187
Query: 249 MCF-------DKDDSGRIFFGDQGPATQQSTSF--LASNGKYITYI-IGVETCCIGSSCL 298
CF + + S + GD ++ F L N Y Y IG+E +G++
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 247
Query: 299 KQ--TSFKA---------IVDSGSSFTFLPKEVY-------ETIAAEFDRQVNDTITSFE 340
Q +S + I+DSG+++T LP Y ++I Q + T F+
Sbjct: 248 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 307
Query: 341 -GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NNPVFVIYGTQVVTGF-CLAIQPV 396
Y C + LPS+ F N S V+ N + + T CL +Q +
Sbjct: 308 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNM 367
Query: 397 D----GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
D G G G +VV+D E ++G+ +C
Sbjct: 368 DDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/234 (23%), Positives = 104/234 (44%), Gaps = 19/234 (8%)
Query: 173 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 232
+SSSG+L EDI+ G ++ LK + GC ++G A DG++GLG G++S
Sbjct: 2 SSSSGVLGEDIVSF--GRESELK---AQRAVFGCENSETGDLFSQHA-DGIMGLGRGQLS 55
Query: 233 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG-KYITYIIGVETC 291
+ L + G+I +SFS+C+ D G G T F S+ + Y I ++
Sbjct: 56 IMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPYYNIELKEI 115
Query: 292 CIGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN--DTITSFEGYP 343
+ L+ + ++DSG+++ +LP++ + +V+ I +
Sbjct: 116 HVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSY 175
Query: 344 WKCCYKSSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
C+ + + + KL P V ++F + ++ ++V +CL +
Sbjct: 176 KDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV 229
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/359 (21%), Positives = 143/359 (39%), Gaps = 63/359 (17%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LG- 152
D G D+ W+ PC+ C P ++ PS SST ++C C+ LG
Sbjct: 143 DTGSDVSWVQCAPCNSTECYPQKDPLFD----------PSKSSTYAPIACGADACNKLGD 192
Query: 153 ---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
C + C Y ++Y + +S+ G+ + + G GCG
Sbjct: 193 HYRNGCTSGGTQCGYRVEY-GDGSSTRGVYSNETITFAPG-------ITVKDFHFGCGHD 244
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFFGDQGPA 267
Q G DGL+GLG S+ ++ A + +FS C ++G + G + A
Sbjct: 245 QRG---PSDKFDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALNSEAGFLALGVRPSA 299
Query: 268 TQQSTSFLASNGKYI-----TYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPK 318
+++F+ + ++ +Y++ + +G L +++F+ ++DSG+ T LP+
Sbjct: 300 ATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSGTIVTELPE 359
Query: 319 EVYETIAAEFDRQVNDTITSFEGYP------WKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
Y + A + +F YP + CY + +P V L F +
Sbjct: 360 TAYNALNAALRK-------AFAAYPMVASEDFDTCYNFTGYSNVTVPRVALTFSGGATID 412
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPVDGDI--GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ P ++ CLA + D+ G IG V++D + K+G+ C
Sbjct: 413 LDVP------NGILVKDCLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 129/331 (38%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ S
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPSF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVVNNP-VFVIYGTQVVTGFCLAIQPVD 397
F + VFV Q +CLA P +
Sbjct: 285 GARFDLGRRGVFVERSVQEQDVWCLAFAPTE 315
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 128/331 (38%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
SKT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 SKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 SFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + VFV Q +CLA P +
Sbjct: 285 GARFDLGRGGVFVERSVQEQDVWCLAFAPTE 315
>gi|389639248|ref|XP_003717257.1| candidapepsin-3 [Magnaporthe oryzae 70-15]
gi|351643076|gb|EHA50938.1| candidapepsin-3 [Magnaporthe oryzae 70-15]
gi|440468840|gb|ELQ37974.1| candidapepsin-3 precursor [Magnaporthe oryzae Y34]
gi|440484743|gb|ELQ64772.1| candidapepsin-3 precursor [Magnaporthe oryzae P131]
Length = 474
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 147/367 (40%), Gaps = 65/367 (17%)
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG------M 208
C QPC + ++ N+SS+ + + + IS D + N S ++ G +
Sbjct: 106 CSVSSQPCRFA-GTFSANSSSTYQYINSVFN-ISYVDGSGANGDYVSDMVTVGNTKIDRL 163
Query: 209 KQSGGYLDGVAPDGLIGLGL--GEISV-----------PSLLAKAGLI-RNSFSMCFD-- 252
+ GY A G++G+G E+ V PS + + GLI N++S+ +
Sbjct: 164 QFGIGYTSSSA-QGILGVGYEANEVQVGRAQLKPYRNLPSRMVEEGLIASNAYSLYLNDL 222
Query: 253 KDDSGRIFFG----DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKAI 306
+ + G I FG +Q T Q+ + G+ ++I + + + S+ + + + +
Sbjct: 223 QSNKGSILFGGIDTEQYTGTLQTVPIQPNGGRMAEFLITLTSVSLTSASIGGDKLALAVL 282
Query: 307 VDSGSSFTFLP----KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP---KLP 359
+DSGSS T+LP K +Y + A++D S EG + C + Q
Sbjct: 283 LDSGSSLTYLPDDIVKNMYSAVGAQYD--------SNEGAAYVPCSLARDQANSLTFSFS 334
Query: 360 SVKLMFPQNN---SFVVNN---PVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRV 413
+ ++ P N V +N P F V + P +G F+ V
Sbjct: 335 GIPIVVPMNELVLDLVTSNGRRPSF----RNGVPACLFGVAPAGKGTNVLGDTFLRSAYV 390
Query: 414 VFDRENLKLGWSH-------SNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSSPGGHAVG 466
V+D EN + + SN +++ G+ PG S P+ A S GG+ G
Sbjct: 391 VYDLENNAISLAQTSFNATKSNVKEIGKGSNP--VPGAVAVSQPVAATSGLSQNGGNRSG 448
Query: 467 PAVAGRA 473
RA
Sbjct: 449 SGAIARA 455
>gi|260790155|ref|XP_002590109.1| hypothetical protein BRAFLDRAFT_83387 [Branchiostoma floridae]
gi|229275297|gb|EEN46120.1| hypothetical protein BRAFLDRAFT_83387 [Branchiostoma floridae]
Length = 493
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 109/275 (39%), Gaps = 47/275 (17%)
Query: 214 YLDGVAPDGLIGLGLGEISVPSL--------LAKAGLIRNSFSM----CFDKDDSGRIFF 261
+++G +G++GL EI+ P + K G + N FSM D+ ++ I
Sbjct: 168 FINGSHWEGILGLAYSEIARPDSTVEPFFDSMVKEGRVSNIFSMQLCGTIDQGNTTDISV 227
Query: 262 GD------------QGPATQQSTSFLASNGKYITYIIGVETCC--IGSSCLKQTSFKAIV 307
G +GP S L Y I VE +G C + K IV
Sbjct: 228 GGTMVVGGIDADLYEGPILYSS---LRREWYYEVVITKVEVDGEDLGMDCKEYNFDKTIV 284
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP 366
DSG++ +PK+V+ + D + + D F C+K S P + + +
Sbjct: 285 DSGTTNLRVPKKVFRKVKQMLDAKTDIDIPAEFWTGEDLMCWKIGSTPWEHFPPMGI-YL 343
Query: 367 QNNSFVVNNPVFVI------YGTQVVTGF-----CLAIQPVDGDIGT-IGQNFMTGYRVV 414
Q S N+ F + Y V G C D GT IG M G+ VV
Sbjct: 344 QGTS---NSEAFRLSISPQQYMRAVSDGLGRTEDCYKFAITSSDTGTVIGAVVMEGFYVV 400
Query: 415 FDRENLKLGWSHSNCQDLNDGTKSPLTPGPGTPSN 449
FDREN +G++ S C + D T+S GP SN
Sbjct: 401 FDRENKTVGFAKSTC-GVRDTTQSSGVAGPFPHSN 434
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 88/386 (22%), Positives = 143/386 (37%), Gaps = 76/386 (19%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ MS+ D G +L W+ C+ A + ++N P+ SS+ +SCS C
Sbjct: 77 QNMSMVIDTGSELSWLHCNTNTTATIPYPFFN----------PNISSSYTPISCSSPTCT 126
Query: 151 LGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
T SC + C T+ Y + +SS G L D +S ++
Sbjct: 127 TRTRDFPIPASCDS-NNLCHATLSY-ADASSSEGNLASDTF--------GFGSSFNPGIV 176
Query: 204 IGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD-SGR 258
GC + Y D GL+G+ LG +S+ S L FS C D SG
Sbjct: 177 FGC---MNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP-----KFSYCISGSDFSGI 228
Query: 259 IFFGDQG----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT------- 301
+ G+ P Q ST + Y + +E I L +
Sbjct: 229 LLLGESNFSWGGSLNYTPLVQISTPLPYFDRS--AYTVRLEGIKISDKLLNISGNLFVPD 286
Query: 302 ---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCYK--S 350
+ + + D G+ F++L VY + EF Q N T+ + + CY+
Sbjct: 287 HTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPV 346
Query: 351 SSQRLPKLPSVKLMFPQNNSFVVNNPVF-----VIYGTQVVTGFCLAIQPVDG-DIGTIG 404
+ LP+LPSV L+F V + + ++G V F + G + IG
Sbjct: 347 NQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIG 406
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ + FD ++G +H+ C
Sbjct: 407 HHHQQSMWMEFDLVEHRVGLAHARCD 432
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 141/390 (36%), Gaps = 85/390 (21%)
Query: 98 DFGCDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---- 149
D G L+W PC C C ++ N + + P SST+K L C + C
Sbjct: 106 DTGSSLVWFPCTSHYLCSHC-----NFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGYLF 160
Query: 150 --DLGTSCQNPKQP--------CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
D+ + C K+P CP + Y ++ LL++++
Sbjct: 161 GPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNL---------NFPGKTV 211
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF------DK 253
++GC + L P G+ G G G+ S+PS + L R FS C D
Sbjct: 212 PQFLVGCSI------LSIRQPSGIAGFGRGQESLPS---QMNLKR--FSYCLVSHRFDDT 260
Query: 254 DDSGRIFF-----GDQGPATQQSTSFLA--SNGKYIT--YIIGVETCCIGSSCLKQTSFK 304
S + GD T F + SN Y + + +G +K +K
Sbjct: 261 PQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVK-IPYK 319
Query: 305 -----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC----CYK 349
IVDSGS+FTF+ + VY +A EF RQ+ + E + C+
Sbjct: 320 FLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFN 379
Query: 350 SSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG 401
S + P F P N F V+ T V G A QP
Sbjct: 380 ISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDG--GAGQPKTAGPA 437
Query: 402 TIGQNF-MTGYRVVFDRENLKLGWSHSNCQ 430
I N+ + V +D EN + G+ NC+
Sbjct: 438 IILGNYQQQNFYVEYDLENERFGFGPRNCK 467
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 152/368 (41%), Gaps = 64/368 (17%)
Query: 91 KTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T+++ D G D+LW+ C C C Y D N PS SST + ++C LC
Sbjct: 92 RTVNMVADTGSDVLWLQCLPCQSC-------YGQTDPLFN---PSFSSTFQSITCGSSLC 141
Query: 150 D--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
L C+ + C Y + Y S + E +S G NA+ SV IGCG
Sbjct: 142 QQLLIRGCR--RNQCLYQVSY----GDGSFTVGEFSTETLSFGSNAVN-----SVAIGCG 190
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRI--FFGDQ 264
G + GL+GLG G +S PS + + L + FS C ++ +G + FG+Q
Sbjct: 191 HNNQGLF---TGAAGLLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGSVPLIFGNQ 245
Query: 265 GPATQQSTSFLASNGK----YITYIIGVE------TCCIGSSCLKQTSFKA--IVDSGSS 312
A+ + L +N K Y ++G++ + GS L ++ I+DSG++
Sbjct: 246 AVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTA 305
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMF------ 365
T L Y + F + G+ + CY S + LP+V +F
Sbjct: 306 VTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATM 365
Query: 366 --PQNNSFV-VNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
P N V V+N GT +CLA P + IG +R+ FD ++
Sbjct: 366 ALPAQNIMVPVDNS-----GT-----YCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRV 415
Query: 423 GWSHSNCQ 430
G + C
Sbjct: 416 GIGANQCN 423
>gi|500621|gb|AAA19107.1| aspartyl protease 3 [Saccharomyces cerevisiae]
Length = 569
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 106/245 (43%), Gaps = 55/245 (22%)
Query: 222 GLIGLGLGEISV-------------------PSLLAKAGLIR-NSFSMCFDKDDS--GRI 259
G++G+GL E+ V P +L +G I+ N++S+ + D+ G I
Sbjct: 249 GVLGIGLPELEVTYSGSTASHSGKAYKYDNFPIVLKNSGAIKSNTYSLYLNDSDAMHGTI 308
Query: 260 FFG--DQGPATQQ----------STSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA 305
FG D T S S +S ++ I G+ GSS L T A
Sbjct: 309 LFGAVDHSKYTGTLYTIPIVNTLSASGFSSPIQFDVTINGIGISDSGSSNKTLTTTKIPA 368
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
+ DSG++ T+LP+ V IA E Q + I GY C P S++++F
Sbjct: 369 LSDSGTTLTYLPQTVVSMIATELGAQYSSRI----GYYVLDC--------PSDDSMEIVF 416
Query: 366 PQNNSFVVNNPV--FVIYGTQVVTGFCLAIQPVDGDIGTI-GQNFMTGYRVVFDRENLKL 422
F +N P+ F++ T L I P D GTI G +F+T VV+D ENL++
Sbjct: 417 -DFGGFHINAPLSSFIL---STGTTCLLGIIPTSDDTGTILGDSFLTNAYVVYDLENLEI 472
Query: 423 GWSHS 427
+ +
Sbjct: 473 SMAQA 477
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 154/392 (39%), Gaps = 93/392 (23%)
Query: 92 TMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
T S+ D G L+W C C CA R + P++SST L C+ LC
Sbjct: 102 TFSVLADTGSSLIWTQCAPCTECAA----------RPAPPFQPASSSTFSKLPCASSLCQ 151
Query: 151 LGTSCQNPKQPC---------PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
TS P C PY M + ++G L + LH+ GG +
Sbjct: 152 FLTS---PYLTCNATGCVYYYPYGMGF------TAGYLATETLHV--GGAS------FPG 194
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD-DSGR-- 258
V GC + G + G++GLG +S L+++ G+ R FS C D D+G
Sbjct: 195 VAFGCSTENG----VGNSSSGIVGLGRSPLS---LVSQVGVGR--FSYCLRSDADAGDSP 245
Query: 259 IFFGDQGPATQ---QSTSFLA-----SNGKYITYIIGVETCCIGSSCLKQTSFK------ 304
I FG T QST L S+ Y + G+ +G++ L TS
Sbjct: 246 ILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGIT---VGATDLPVTSTTFGFTRG 302
Query: 305 --------AIVDSGSSFTFLPKEVYETIAAEFDRQV--NDTITSFEG--YPWKCCYKSSS 352
IVDSG++ T+L KE Y + F Q+ + T+ G + + C+ +++
Sbjct: 303 AGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATA 362
Query: 353 ----QRLPKLPSVKLMFPQNNSFVVNNPVFVIY------GTQVVTGFCLAIQPVDG--DI 400
+P +P++ L F + V +V G V CL + P I
Sbjct: 363 AGGGSGVP-VPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVE--CLLVLPASEKLSI 419
Query: 401 GTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
IG V++D + ++ ++C ++
Sbjct: 420 SIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 451
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 144/374 (38%), Gaps = 64/374 (17%)
Query: 89 GSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G + +L D G DL W+ C C C YN + N PS SS+ L C+
Sbjct: 152 GGQNSTLIVDTGSDLTWVQCLPCRLC-------YNQQEPLFN---PSNSSSFLSLPCNSP 201
Query: 148 LC-----DLGTS--CQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
C G+S C N C Y +DY + + S G L + L L G + N
Sbjct: 202 TCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEKLTL---GKTEIDN--- 254
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 259
I GCG + + G G + GL+GL E+S+ S + L + FS C
Sbjct: 255 --FIFGCG-RNNKGLFGGAS--GLMGLARSELSLVS--QTSSLFGSVFSYCLPTTG---- 303
Query: 260 FFGDQGPATQQSTSFLASNGKYIT----------------YIIGVETCCIGSSCLK---- 299
G G T F SN K I+ Y + + IG L
Sbjct: 304 -VGSSGSLTLGGADF--SNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 360
Query: 300 --QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 357
+++DSG+ T L +Y+ AEF++Q + T+ C+ +
Sbjct: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420
Query: 358 LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVF 415
+P+VK +F N +V+ + + CLA + + T IG RV++
Sbjct: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 480
Query: 416 DRENLKLGWSHSNC 429
+ + K+G++ C
Sbjct: 481 NSKESKVGFAGEPC 494
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 149/356 (41%), Gaps = 38/356 (10%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
Q K L D G D+ W+ C CA + Y D + P +SS+ LSC+ +
Sbjct: 156 QPVKLFYLVPDTGSDVTWLQCQ--PCAS-ENTCYKQFDP---IFDPKSSSSYSPLSCNSQ 209
Query: 148 LCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
C L C Y + +Y + + ++G L + L G N++ N + IGCG
Sbjct: 210 QCKLLDKANCNSDTCIYQV-HYGDGSFTTGELATETLSF--GNSNSIPN-----LPIGCG 261
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQ 264
G + G LIGLG G IS+ S L + SFS C D D S + F
Sbjct: 262 HDNEGLFAGGAG---LIGLGGGAISLSSQLKAS-----SFSYCLVNLDSDSSSTLEFNSN 313
Query: 265 GPATQQSTSFLASNGKYITY-IIGVETCCIGSSCL--KQTSFKA--------IVDSGSSF 313
P+ TS L N ++ +Y + V +G L T F+ IVDSG+
Sbjct: 314 MPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTII 372
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
+ LP +VYE++ F + + + + CY S Q ++P++ + + S +
Sbjct: 373 SRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRL 432
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++I T +CLA + IG G RV +D N +G+S + C
Sbjct: 433 PARNYLIMLDTAGT-YCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 139/356 (39%), Gaps = 59/356 (16%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT 153
D D+ W+ PC +C L +D Y P+ SST + C C +LG+
Sbjct: 174 DTSSDIPWVQCLPCPIPQC---------HLQKD-PLYDPAKSSTFAPIPCGSPACKELGS 223
Query: 154 S----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
S C C Y ++Y + +++G V D L + V GC
Sbjct: 224 SYGNGCSPTTDECKYIVNY-GDGKATTGTYVTDTLTM-------SPTIVVKDFRFGCSHA 275
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 269
G + + A G++ LG G S+ L A N+FS C K S F GP +
Sbjct: 276 VRGSFSNQNA--GILALGGGRGSL--LEQTADAYGNAFSYCIPKPSSAG-FLSLGGP-VE 329
Query: 270 QSTSF----LASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--AIVDSGSSFTFLPKEV 320
S F L N T YI+ +E + L T+F A++DSG+ T LP +V
Sbjct: 330 ASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQV 389
Query: 321 YETIAAEFDRQVNDTITSFEGYPWK---CCYKSSSQRLP--KLPSVKLMFPQNNSFVVNN 375
Y + A F R P + CY + R P K+P V L+F + +
Sbjct: 390 YAALRAAF-RSAMAAYGPLAA-PVRNLDTCYDFT--RFPDVKVPKVSLVFAGGATLDLEP 445
Query: 376 PVFVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ G CLA G+ +G IG Y V++D K+G+ C
Sbjct: 446 ASIILDG-------CLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/383 (22%), Positives = 149/383 (38%), Gaps = 66/383 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ +S+ D G +L W+ C+ +S +N + P+ SS+ + CS C
Sbjct: 84 QNISMVIDTGSELSWLRCN-----------RSSNPNPVNNFDPTRSSSYSPIPCSSPTCR 132
Query: 151 LGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
T SC + K C T+ Y + +SS G L +I H + +++ ++I
Sbjct: 133 TRTRDFLIPASCDSDKL-CHATLSY-ADASSSEGNLAAEIFHFGNSTNDS-------NLI 183
Query: 204 IGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
GC SG + GL+G+ G +S +++ G + S+ + D G + G
Sbjct: 184 FGCMGSVSGSDPEEDTKTTGLLGMNRGSLS---FISQMGFPKFSYCISGTDDFPGFLLLG 240
Query: 263 DQG----------PATQQSTSF-LASNGKYITYIIGVET----CCIGSSCL---KQTSFK 304
D P + ST Y + G++ I S L + +
Sbjct: 241 DSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQ 300
Query: 305 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSSQR---- 354
+VDSG+ FTFL VY + + F + N +T +E + CY+ S R
Sbjct: 301 TMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSG 360
Query: 355 -LPKLPSVKLMFPQNNSFVVNNPVF-----VIYGTQVVTGFCLAIQPVDG-DIGTIGQNF 407
L +LP+V L+F V P+ + G V F + G + IG +
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHH 420
Query: 408 MTGYRVVFDRENLKLGWSHSNCQ 430
+ FD + ++G + C
Sbjct: 421 QQNMWIEFDLQRSRIGLAPVECD 443
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 144/374 (38%), Gaps = 64/374 (17%)
Query: 89 GSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
G + +L D G DL W+ C C C YN + N PS SS+ L C+
Sbjct: 73 GGQNSTLIVDTGSDLTWVQCLPCRLC-------YNQQEPLFN---PSNSSSFLSLPCNSP 122
Query: 148 LC-----DLGTS--CQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 199
C G+S C N C Y +DY + + S G L + L L G + N
Sbjct: 123 TCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEKLTL---GKTEIDN--- 175
Query: 200 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 259
I GCG + + G G + GL+GL E+S+ S + L + FS C
Sbjct: 176 --FIFGCG-RNNKGLFGGAS--GLMGLARSELSLVS--QTSSLFGSVFSYCLPTTG---- 224
Query: 260 FFGDQGPATQQSTSFLASNGKYIT----------------YIIGVETCCIGSSCLK---- 299
G G T F SN K I+ Y + + IG L
Sbjct: 225 -VGSSGSLTLGGADF--SNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 281
Query: 300 --QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 357
+++DSG+ T L +Y+ AEF++Q + T+ C+ +
Sbjct: 282 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 341
Query: 358 LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVF 415
+P+VK +F N +V+ + + CLA + + T IG RV++
Sbjct: 342 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 401
Query: 416 DRENLKLGWSHSNC 429
+ + K+G++ C
Sbjct: 402 NSKESKVGFAGEPC 415
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/369 (23%), Positives = 136/369 (36%), Gaps = 65/369 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G DL W V+C P S Y RD + PS S++ + C+ C+
Sbjct: 181 DTGSDLTW-----VQCKPCSVCYAQ---RD-PLFDPSGSASYAAVPCNASACEASLKAAT 231
Query: 154 ----SCQN--------PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
SC + C Y++ Y + + S G+L D + AL +
Sbjct: 232 GVPGSCATVGGGGGGGKSERCYYSLAY-GDGSFSRGVLATDTV--------ALGGASVDG 282
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSG 257
+ GCG+ G G A GL+GLG E+S+ S A FS C D +G
Sbjct: 283 FVFGCGLSNRG-LFGGTA--GLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAG 337
Query: 258 RIFFGDQGPATQQST-----SFLASNGK---YITYIIGVETCCIGSSCLKQTSFKAIVDS 309
+ G + + +T +A + Y + G + + ++DS
Sbjct: 338 SLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDS 397
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVK 362
G+ T L VY + AEF RQ E YP CY + K+P +
Sbjct: 398 GTVITRLAPSVYRAVRAEFARQFGA-----ERYPAAPPFSLLDACYNLTGHDEVKVPLLT 452
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENL 420
L V+ + + + CLA+ + + T IG RVV+D
Sbjct: 453 LRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGS 512
Query: 421 KLGWSHSNC 429
+LG++ +C
Sbjct: 513 RLGFADEDC 521
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/369 (23%), Positives = 136/369 (36%), Gaps = 65/369 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G DL W V+C P S Y RD + PS S++ + C+ C+
Sbjct: 182 DTGSDLTW-----VQCKPCSVCYAQ---RD-PLFDPSGSASYAAVPCNASACEASLKAAT 232
Query: 154 ----SCQN--------PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
SC + C Y++ Y + + S G+L D + AL +
Sbjct: 233 GVPGSCATVGGGGGGGKSERCYYSLAY-GDGSFSRGVLATDTV--------ALGGASVDG 283
Query: 202 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSG 257
+ GCG+ G G A GL+GLG E+S+ S A FS C D +G
Sbjct: 284 FVFGCGLSNRG-LFGGTA--GLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAG 338
Query: 258 RIFFGDQGPATQQST-----SFLASNGK---YITYIIGVETCCIGSSCLKQTSFKAIVDS 309
+ G + + +T +A + Y + G + + ++DS
Sbjct: 339 SLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDS 398
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVK 362
G+ T L VY + AEF RQ E YP CY + K+P +
Sbjct: 399 GTVITRLAPSVYRAVRAEFARQFGA-----ERYPAAPPFSLLDACYNLTGHDEVKVPLLT 453
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGT--IGQNFMTGYRVVFDRENL 420
L V+ + + + CLA+ + + T IG RVV+D
Sbjct: 454 LRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGS 513
Query: 421 KLGWSHSNC 429
+LG++ +C
Sbjct: 514 RLGFADEDC 522
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/354 (23%), Positives = 136/354 (38%), Gaps = 38/354 (10%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ +SL D G L W +C P + S Y D + PS SS+ ++ C+ LC
Sbjct: 151 RDLSLIFDTGSYLTW-----TQCEPCAGSCYKQQDPI---FDPSKSSSYTNIKCTSSLCT 202
Query: 151 LGTSC---QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
S + C Y + Y +N+ S G L ++ L + + + + GCG
Sbjct: 203 QFRSAGCSSSTDASCIYDVKY-GDNSISRGFLSQERLTITA-------TDIVHDFLFGCG 254
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQG 265
+ + G G A GL+GL IS + + + FS C S G + FG
Sbjct: 255 -QDNEGLFRGTA--GLMGLSRHPISF--VQQTSSIYNKIFSYCLPSTPSSLGHLTFGASA 309
Query: 266 P--ATQQSTSFLASNGKYITY---IIGVETCCIGSSCLKQTSFKA---IVDSGSSFTFLP 317
A + T F +G+ Y I+G+ + ++F A I+DSG+ T LP
Sbjct: 310 ATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLP 369
Query: 318 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
Y + + F + + ++ CY S + +P + F V P+
Sbjct: 370 PTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFA--GGVKVELPL 427
Query: 378 FVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I + CLA DI G VV+D E ++G+ + C
Sbjct: 428 VGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 481
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 97/420 (23%), Positives = 164/420 (39%), Gaps = 66/420 (15%)
Query: 33 HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKT 92
HRF+ + +L K++ TS P Q+ + + V + ++ T PQ
Sbjct: 74 HRFTY-LSSLVAGKSK-PTSVPVASG---NQLHIGNYVVRARLGTPPQL----------- 117
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
M + D D +W+PC C+ S + + + YS + ST++ C
Sbjct: 118 MFMVLDTSNDAVWLPCS--GCSGCSNASTSFNTNSSSTYSTVSCSTTQCTQARGLTCPSS 175
Query: 153 TSCQNPKQP--CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
T QP C + Y +++ S+ L V+D L L V + GC
Sbjct: 176 T-----PQPSICSFNQSYGGDSSFSANL-VQDTL--------TLSPDVIPNFSFGCINSA 221
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----GRIFFGDQG- 265
SG L P GL+GLG G +S+ S L FS C S G + G G
Sbjct: 222 SGNSL---PPQGLMGLGRGPMSLVS--QTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQ 276
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTF 315
P + + T L + + Y + + +GS + + I+DSG+ T
Sbjct: 277 PKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTVITR 336
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL-PK----LPSVKLMFPQNNS 370
+ VYE I EF +QVN + ++ + C+ + ++ + PK + S+ L P N+
Sbjct: 337 FAQPVYEAIRDEFRKQVNGSFSTLGAF--DTCFSADNENVTPKITLHMTSLDLKLPMENT 394
Query: 371 FVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ ++ GT Q + + I R++FD N ++G + C
Sbjct: 395 LIHSS-----AGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 153/389 (39%), Gaps = 52/389 (13%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F +F K SL D G DL WI C C C + YY+ P
Sbjct: 193 SGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYD----------PKD 242
Query: 136 SSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLIS 188
S + ++++C+ C L +S C+ Q CPY Y + NT+ L ++L S
Sbjct: 243 SISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS 302
Query: 189 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
+ +V+ GCG G + L+GLG G +S S L L +SFS
Sbjct: 303 STTGKSEFRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFS 357
Query: 249 MCF-DKDD----SGRIFFG-DQGPATQQSTSFLA-----SNGKYITYIIGVETCCIGSSC 297
C D+D S ++ FG D+ T +F + N Y + +++ +G
Sbjct: 358 YCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEK 417
Query: 298 LK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKC 346
L+ + I+DSG++ ++ Y I F R+V E +P
Sbjct: 418 LQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-YKLVEDFPILHP 476
Query: 347 CYKSSSQRLPKLPSVKLMFPQNN--SFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTI 403
CY S P + F +F V N I +V CLA+ + I
Sbjct: 477 CYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIV---CLAMLGTPKSALSII 533
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
G + +++D +N +LG++ C ++
Sbjct: 534 GNYQQQNFHILYDTKNSRLGYAPMRCAEI 562
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 115/486 (23%), Positives = 186/486 (38%), Gaps = 99/486 (20%)
Query: 4 ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFS------EEVKALGVSKNRNATSWPAKK 57
++L YL+ + + + +TKLIHR S ++ + + R TS +
Sbjct: 15 LTLAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERF 74
Query: 58 SFEYYQVLLSSDVQKQKMKTGPQFQMLFP-SQGSK---TMSLGN---------DFGCDLL 104
F L S +++ K L P ++GS +S+G+ D G LL
Sbjct: 75 DF------LESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLL 128
Query: 105 WIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNPKQP 161
W+ C C+ C S S+++ P S + K L C + G C Q
Sbjct: 129 WVQCLPCINCFQQSTSWFD----------PLKSVSFKTLGCGFPGYNYINGYKCNRFNQ- 177
Query: 162 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD 221
Y + Y + SS G+L ++ L + + +K S ++ GCG D A +
Sbjct: 178 AEYKLRYLGGD-SSQGILAKESLLFETLDEGKIKKS---NITFGCGHMNIKTNNDD-AYN 232
Query: 222 GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKY 281
G+ GLG + P + A + N FS C GD + G Y
Sbjct: 233 GVFGLG----AYPHI-TMATQLGNKFSYCI----------GDINNPLYTHNHLVLGQGSY 277
Query: 282 IT------------YIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTFLPKE 319
I Y + +++ +GS LK +FK ++DSG ++T L
Sbjct: 278 IEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANG 337
Query: 320 VYETIAAEFDRQVND------TITSFEGYPWKCCYKSSSQR-LPKLPSVKLMFPQNNSFV 372
+E + E + T FEG C+K R L P+V F V
Sbjct: 338 GFELLYDEIVDLMKGLLERIPTQRKFEG----LCFKGVVSRDLVGFPAVTFHFAGGADLV 393
Query: 373 VNN-PVFVIYGTQVVTGFCLAIQPVDGDI---GTIGQNFMTGYRVVFDRENLKLGWSHSN 428
+ + +F +G FCLAI P + ++ IG Y V FD E +K+ + +
Sbjct: 394 LESGSLFRQHGGDR---FCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRID 450
Query: 429 CQDLND 434
CQ L++
Sbjct: 451 CQLLDE 456
>gi|449017891|dbj|BAM81293.1| pepsin A precursor [Cyanidioschyzon merolae strain 10D]
Length = 564
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/211 (22%), Positives = 90/211 (42%), Gaps = 19/211 (9%)
Query: 237 LAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSF---LASNGKYITYIIGVETCC 292
+ + G++ R+ F++C +F G GP ++ + + Y +GVE+
Sbjct: 263 MVRTGVVPRDMFALCLTDTSGALVFGGAAGPEMRKGEYRWVPMVNRAVRTYYEVGVESVR 322
Query: 293 IG---SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-W---K 345
G S+ L + AIVDSG++ + + T+ + D + G W
Sbjct: 323 FGTDESAGLPEIR-SAIVDSGTTLIVISTSAFGTLREHLQSRYCDQVPGLCGEKTWLETG 381
Query: 346 CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDIGT-- 402
C + + + +LP + + V ++++ + F C IQ V G++
Sbjct: 382 RCATLTDRHVSRLPPINIRLAGGVELSVPPELYMLRAQKNGRTFRCFGIQHVTGELVNGR 441
Query: 403 --IGQNFMTGYRVVFDRENLKLGWSHS--NC 429
+G FM Y VFDREN ++G++ + NC
Sbjct: 442 VILGDTFMRAYVTVFDRENSRIGFAPAAENC 472
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 82/341 (24%), Positives = 134/341 (39%), Gaps = 48/341 (14%)
Query: 130 EYSPSASSTSKHLSCSHRLCDLGTSC----------QNPKQPCPYTMDYY-TENTSSSGL 178
E S S T L C C+ SC + C Y + Y N S++G+
Sbjct: 140 EKECSRSKTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAGV 199
Query: 179 LVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
+ ED L +++ A+ +S V IGC + + D + G+ GLG S+P L
Sbjct: 200 MYEDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDP-SIKGVFGLGRSATSLPRQL 258
Query: 238 AKAGLIRNSFSMC---FDKDDSGRIFFGDQGP---------ATQQSTSFLASNGKYIT-Y 284
+ FS C + + D P +T+ L N Y T Y
Sbjct: 259 NFS-----KFSYCLSSYQEPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTLY 313
Query: 285 IIGVETCCIGSSCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 340
+ ++ IG + S K+ VD+G+SFT L V+ + E DR + + E
Sbjct: 314 FVHLQNISIGGTRFPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVKE 373
Query: 341 GYPWK----CCY---KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAI 393
P + CY +++ KLP + L F + + V+ + Y + + CLAI
Sbjct: 374 -QPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLP---WDSYLWKTTSKLCLAI 429
Query: 394 QP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ G I +G M ++ D N KL + ++C +
Sbjct: 430 YKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKV 470
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 81/364 (22%), Positives = 142/364 (39%), Gaps = 51/364 (14%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ S+ D G DL W V+C+P Y ++ + P+ S++ L+C LC+
Sbjct: 24 RVFSVIVDTGSDLTW-----VQCSPCGKCY----SQNDALFLPNTSTSFTKLACGSALCN 74
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
+ C Y Y + + ++G V D + + G N K V + GCG
Sbjct: 75 GLPFPMCNQTTCVYWYSY-GDGSLTTGDFVYDTITM--DGINGQKQQV-PNFAFGCGHDN 130
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQG 265
G + DG++GLG G +S S L + FS C + + FGD
Sbjct: 131 EGSF---AGADGILGLGQGPLSFHSQLKS--VYNGKFSYCLVDWLAPPTQTSPLLFGDAA 185
Query: 266 PATQQSTSFLA--SNGKYIT-YIIGVETCCIGSSCLKQTS----------FKAIVDSGSS 312
+L +N K T Y + + +G + L +S I DSG++
Sbjct: 186 VPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTT 245
Query: 313 FTFLPKEVYETIAA-------EFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 365
T L + Y+ + A + R+++D I+ + C +LP +P++ F
Sbjct: 246 VTQLAEAAYKEVLAAMNASTMAYSRKIDD-ISRLD----LCLSGFPKDQLPTVPAMTFHF 300
Query: 366 PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
+ + + F+ + F + P D+ IG ++V +D KLG+
Sbjct: 301 EGGDMVLPPSNYFIYLESSQSYCFAMTSSP---DVNIIGSVQQQNFQVYYDTAGRKLGFV 357
Query: 426 HSNC 429
+C
Sbjct: 358 PKDC 361
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 42/368 (11%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F + + ++ + + D G D+ W+ C C C S Y+ PS
Sbjct: 160 SGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYD----------PSV 209
Query: 136 SSTSKHLSCSHRLC-DL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
S++ + C C DL +C+N C Y + Y + + + G + L L GD+A
Sbjct: 210 STSYATVGCDSPRCRDLDAAACRNSTGSCLYEV-AYGDGSYTVGDFATETLTL---GDSA 265
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-D 252
++V IGCG G + V GL+ LG G +S PS ++ +FS C D
Sbjct: 266 PVSNV----AIGCGHDNEGLF---VGAAGLLALGGGPLSFPSQISA-----TTFSYCLVD 313
Query: 253 KDD--SGRIFFGDQGPATQQSTSFLASNGKYITYI------IGVETCCIGSSCLKQT--- 301
+D S + FGD + + Y+ +G E I SS
Sbjct: 314 RDSPSSSTLQFGDSEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAG 373
Query: 302 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
S IVDSG++ T L Y + F + + + CY + + ++P+V
Sbjct: 374 SGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAV 433
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
L F + ++I +CLA G + IG G RV FD
Sbjct: 434 ALWFEGGGELKLPAKNYLI-PVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNT 492
Query: 422 LGWSHSNC 429
+G++ C
Sbjct: 493 VGFTADKC 500
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 143/358 (39%), Gaps = 71/358 (19%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + W C CV C S Y++S SASST SC + ++ +
Sbjct: 146 DTGSSITWTQCKACVNCLQDSNRYFDS----------SASSTYSFGSC------IPSTVE 189
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N Y M Y ++++S G D + L + V GCG G +
Sbjct: 190 NN-----YNMTY-GDDSTSVGNYGCDTMTL-------EPSDVFQKFQFGCGRNNKGDFGS 236
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQSTSF- 274
GV DG++GLG G++S S A FS C ++DS G + FG++ AT QS+S
Sbjct: 237 GV--DGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLFGEK--ATSQSSSLK 290
Query: 275 ----------LASNGKYITYI----IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEV 320
L +G Y + +G E I SS S I+DS + T LP+
Sbjct: 291 FTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF--ASPGTIIDSRTVITRLPQRA 348
Query: 321 YETIAAEFDRQVNDTITSF----EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 376
Y + A F + + S +G CY S ++ LP + L F +N
Sbjct: 349 YSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLN-- 406
Query: 377 VFVIYGTQVVTG-----FCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
GT +V G CLA ++ IG V++D + ++G+ + C
Sbjct: 407 -----GTNIVWGSDASRLCLAFAGTS-ELTIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458
>gi|453087366|gb|EMF15407.1| candidapepsin-4 precursor [Mycosphaerella populorum SO2202]
Length = 471
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 70/306 (22%), Positives = 128/306 (41%), Gaps = 46/306 (15%)
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG----CGMKQ 210
CQ PC + Y ++S+ L D G + + V +V IG G +
Sbjct: 107 CQARGDPCSISGTYNANDSSTYTYLNSDFNISYVDGSGSAGDYVSDTVKIGDTTLTGQQF 166
Query: 211 SGGYLDGVAPDGLIGLG--LGEISV-----------PSLLAKAGLIR-NSFSMCFDKDD- 255
GY + + +G++G+G + E++V P L KAG I N++S+ + D
Sbjct: 167 GIGY-ESSSQEGILGIGYPINEVAVQYNGGKTYSNVPQSLVKAGAINTNAYSLWLNDLDA 225
Query: 256 -SGRIFFG----DQGPATQQSTSFLASNGKYITYIIG---VETCCIGSSCLKQTSFKAIV 307
+G I FG ++ + ++ + + G Y +II V S + + + A++
Sbjct: 226 STGSILFGGVNTEKYTGSLETIPIVETQGVYAEFIIALTAVGANGTAGSIVNKQAIPALL 285
Query: 308 DSGSSFTFLPKE----VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 363
DSGSS +LP + +Y+++ A +D + +G + C ++S S+ L
Sbjct: 286 DSGSSLMYLPNDITQSIYDSVGASYDSE--------QGAAFVDCDLANSD-----GSLDL 332
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFC-LAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
F V N + ++ G C L I P +G F+ VV+D ++
Sbjct: 333 TFSSPTIKVPMNELVIVAGIDRGKEVCILGIGPAGSSTPVLGDTFLRSAYVVYDLAKNEI 392
Query: 423 GWSHSN 428
+ +N
Sbjct: 393 SLAQTN 398
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 138/375 (36%), Gaps = 68/375 (18%)
Query: 95 LGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG-- 152
L D D W+PC P +A +N P++S+T + + C C
Sbjct: 109 LAVDTSNDAAWVPCAGCHGCPTTAPSFN----------PASSATFRPVPCGAPPCSQAPN 158
Query: 153 ---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
TS K C +++ Y ++S L +D L + + G V GC K
Sbjct: 159 PSCTSLAKSKNSCGFSLSY--GDSSLDATLSQDNLAVTANGG------VIKGYTFGCLTK 210
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD------KDDSGRIFFGD 263
+G A LGLG + + G+ +FS C + SG + G
Sbjct: 211 S-----NGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGR 265
Query: 264 QG---PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSG 310
+G P ++T LAS + Y + + IG + T ++DSG
Sbjct: 266 KGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSG 325
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDT------------ITSFEGYPWKCCYKSSSQRLPKL 358
+ F L + Y + E R+V + ++S G+ CY S+
Sbjct: 326 TMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGF--DTCYNVSTV---AW 380
Query: 359 PSVKLMFPQNNSFVVNNPVFVIYGTQVVTG-FCLAIQPVDG---DIGTIGQNFMTGYRVV 414
P+V L+F + VI T T +A P DG + IG +RV+
Sbjct: 381 PAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVL 440
Query: 415 FDRENLKLGWSHSNC 429
FD N ++G++ C
Sbjct: 441 FDVPNARVGFARERC 455
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 159/391 (40%), Gaps = 94/391 (24%)
Query: 84 LFPSQGSKTMSLG-----------NDFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLN 129
L PS G M+L D G DL W+ PCD +C P ++
Sbjct: 73 LLPSGGEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCD--QCYPQKGPIFD------- 123
Query: 130 EYSPSASSTSKHLSCSHRLCDL----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 185
PS S+T L C+ C+ SC +P C YT Y +++ ++G L D +
Sbjct: 124 ---PSNSTTFHKLPCTTAPCNALDESARSCTDPTT-CGYTYSY-GDHSYTTGYLASDTVT 178
Query: 186 LISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 244
+ NA SVQ +V GCG + G + + + G++GLG G +S S L I
Sbjct: 179 V----GNA---SVQIRNVAFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IG 227
Query: 245 NSFSMCF------------DKDDSGRIFFGDQGPATQQST-------SFLASNGKYITYI 285
FS C D + RI FGD + ST + L + Y
Sbjct: 228 KKFSYCLLPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYY 287
Query: 286 IGVETCCIGSSCL-------KQTSFKA-----------IVDSGSSFTFLPKEVYETIAAE 327
+ +E +G L K S+ + I+DSG++ TFL +E Y + A
Sbjct: 288 LTIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAA 347
Query: 328 FDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV--FVIYGTQ 384
++ + + + + C+KS + + +LP +K+ F + + V PV FV
Sbjct: 348 LVEEIKMERVNDVKNSMFSLCFKSGKEEV-ELPLMKVHF-RGGADVELKPVNTFVRAEEG 405
Query: 385 VVTGFCLAIQPVDGDIGTIGQ----NFMTGY 411
+V C + P + D+G G NF+ GY
Sbjct: 406 LV---CFTMLPTN-DVGIYGNLAQMNFVVGY 432
>gi|219120652|ref|XP_002181060.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407776|gb|EEC47712.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 453
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 147/373 (39%), Gaps = 57/373 (15%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ +L D G L C+ C +C A + LD P SST ++ C L
Sbjct: 93 QAQTLIVDTGSRLTATACEPCSQCGTTHAHPFPHLD-------PQRSSTLRYTQCGSCLL 145
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII--GCG 207
C +Q C Y TE +S + + V D L ++L+ V ++I GC
Sbjct: 146 SGIQECA-AEQKCGINQRY-TEGSSWTAVEVSDTFVLGGPEISSLEQYVSFTIIFAFGCQ 203
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFG---- 262
K G + A +G++GL ++S+ L K +I R SFS+C + G I G
Sbjct: 204 QKVRGLFRTQYA-NGILGLERSDLSLIKRLWKENVIPRESFSLCMTPFE-GYIGLGGPLR 261
Query: 263 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-----------------A 305
D+ + + T F ++ Y +++ V +G CL
Sbjct: 262 DKHTESMKYTPFTSTQSWYAVHVVRV---FVGDECLTSNDQHDTVVEHALVEAFAEGKGT 318
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL-- 363
I+DSG++ T+LPK V + + R N T F+ Y + LP V
Sbjct: 319 ILDSGTTDTYLPKAVAGRMREIWARLSN---TPFQP---SSTYAYTYDEFRSLPIVTFEL 372
Query: 364 -------MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
P+N + P+ G + + A + V G + +G N M GY ++FD
Sbjct: 373 ANNVTLQALPKNFMEDLPEPLRPWTGRRKLMNRLYADE-VQGAV--VGLNTMVGYDLLFD 429
Query: 417 RENLKLGWSHSNC 429
+ + G + + C
Sbjct: 430 VQGNRFGVAPALC 442
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 153/389 (39%), Gaps = 52/389 (13%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F +F K SL D G DL WI C C C + YY+ P
Sbjct: 193 SGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYD----------PKD 242
Query: 136 SSTSKHLSCSHRLCDLGTS------CQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLIS 188
S + ++++C+ C L +S C+ Q CPY Y + NT+ L ++L S
Sbjct: 243 SISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS 302
Query: 189 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
+ +V+ GCG G + L+GLG G +S S L L +SFS
Sbjct: 303 STTGKSEFRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFS 357
Query: 249 MCF-DKDD----SGRIFFG-DQGPATQQSTSFLA-----SNGKYITYIIGVETCCIGSSC 297
C D+D S ++ FG D+ T +F + N Y + +++ +G
Sbjct: 358 YCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEK 417
Query: 298 LK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKC 346
L+ + I+DSG++ ++ Y I F R+V E +P
Sbjct: 418 LQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-YKLVEDFPILHP 476
Query: 347 CYKSSSQRLPKLPSVKLMFPQNN--SFVVNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTI 403
CY S P + F +F V N I +V CLA+ + I
Sbjct: 477 CYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIV---CLAMLGTPKSALSII 533
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
G + +++D +N +LG++ C ++
Sbjct: 534 GNYQQQNFHILYDTKNSRLGYAPMRCAEI 562
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 132/352 (37%), Gaps = 57/352 (16%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---- 150
D G D+ W+ PC +C P Y+ PS SST + C+ +C
Sbjct: 97 DTGSDVSWLQCKPCSSGQCFPQKDPLYD----------PSHSSTYSAVPCASDVCKKLAA 146
Query: 151 --LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
G+ C + KQ C + + Y + TS+ G +D L L G ++ + GCG
Sbjct: 147 DAYGSGCTSGKQ-CGFAISY-ADGTSTVGAYSQDKLTLAPG-------AIVQNFYFGCGH 197
Query: 209 KQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG---D 263
+ G DGV LGLG + SL A+ G + FS C S F
Sbjct: 198 GKHAVRGLFDGV-------LGLGRLR-ESLGARYGGV---FSYCLPSVSSKPGFLALGAG 246
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFLPKE 319
+ P+ T G+ + + +G L + ++F IVDSG+ T L
Sbjct: 247 KNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQST 306
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + + F R+ + CY + + +P + L F + ++ P
Sbjct: 307 AYRALRSAF-RKAMEAYRLLPNGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVP--- 362
Query: 380 IYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ CLA DG G +G + V+FD K G+ C
Sbjct: 363 ---NGILVNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 411
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 SFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 149/356 (41%), Gaps = 38/356 (10%)
Query: 88 QGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 147
Q K L D G D+ W+ C CA + Y D + P +SS+ LSC+ +
Sbjct: 156 QPVKLFYLVPDTGSDVTWLQCQ--PCAS-ENTCYKQFDP---IFDPKSSSSYSPLSCNSQ 209
Query: 148 LCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
C L C Y + +Y + + ++G L + L G N++ N + IGCG
Sbjct: 210 QCKLLDKANCNSDTCIYQV-HYGDGSFTTGELATETLSF--GNSNSIPN-----LPIGCG 261
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQ 264
G + G LIGLG G IS+ S L + SFS C D D S + F
Sbjct: 262 HDNEGLFAGGAG---LIGLGGGAISLSSQLKAS-----SFSYCLVNLDSDSSSTLEFNSY 313
Query: 265 GPATQQSTSFLASNGKYITY-IIGVETCCIGSSCL--KQTSFKA--------IVDSGSSF 313
P+ TS L N ++ +Y + V +G L T F+ IVDSG+
Sbjct: 314 MPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTII 372
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
+ LP +VYE++ F + + + + CY S Q ++P++ + + S +
Sbjct: 373 SRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRL 432
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++I T +CLA + IG G RV +D N +G+S + C
Sbjct: 433 PARNYLIMLDTAGT-YCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 151/368 (41%), Gaps = 64/368 (17%)
Query: 91 KTMSLGNDFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T+++ D G D+LW+ C C C Y D N PS SST + ++C LC
Sbjct: 92 RTVNMVADTGSDVLWLQCLPCQSC-------YGQTDPLFN---PSFSSTFQSITCGSSLC 141
Query: 150 D--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 207
L C+ + C Y + Y S + E +S G NA+ SV IGCG
Sbjct: 142 QQLLIRGCR--RNQCLYQVSY----GDGSFTVGEFSTETLSFGSNAVN-----SVAIGCG 190
Query: 208 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRI--FFGDQ 264
G + GL+GLG G +S PS + + L + FS C ++ +G + FG+Q
Sbjct: 191 HNNQGLF---TGAAGLLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGSVPLIFGNQ 245
Query: 265 GPATQQSTSFLASNGK----YITYIIGVET------CCIGSSCLKQTSFKA--IVDSGSS 312
A+ + L +N K Y ++G++ GS L ++ I+DSG++
Sbjct: 246 AVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTA 305
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMF------ 365
T L Y + F + G+ + CY S + LP+V +F
Sbjct: 306 VTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATM 365
Query: 366 --PQNNSFV-VNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
P N V V+N GT +CLA P + IG +R+ FD ++
Sbjct: 366 ALPAQNIMVPVDNS-----GT-----YCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRV 415
Query: 423 GWSHSNCQ 430
G + C
Sbjct: 416 GIGANQCN 423
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 95/392 (24%), Positives = 156/392 (39%), Gaps = 64/392 (16%)
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYN 122
Q+ + + V + K+ T PQ M + D D +W+PC C+ S + +
Sbjct: 98 QLHIGNYVVRAKLGTPPQL-----------MFMVLDTSNDAVWLPCS--GCSGCSNASTS 144
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLL 179
+ YS + ST++ C+ G +C + P P + Y ++S S L
Sbjct: 145 FNTNSSSTYSTVSCSTAQ---CTQAR---GLTCPS-SSPQPSVCSFNQSYGGDSSFSASL 197
Query: 180 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 239
V+D L L V + GC SG L P GL+GLG G +S+ S
Sbjct: 198 VQDTL--------TLAPDVIPNFSFGCINSASGNSL---PPQGLMGLGRGPMSLVS--QT 244
Query: 240 AGLIRNSFSMCFDKDDS----GRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIG 294
L FS C S G + G G P + + T L + + Y + + +G
Sbjct: 245 TSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVG 304
Query: 295 SSCLK----QTSFKA------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-P 343
S + +F A I+DSG+ T + VYE I EF +QVN ++SF
Sbjct: 305 SVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVN--VSSFSTLGA 362
Query: 344 WKCCYKSSSQRL-PK----LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG 398
+ C+ + ++ + PK + S+ L P N+ + ++ GT Q +
Sbjct: 363 FDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSS-----AGTLTCLSMAGIRQNANA 417
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ I R++FD N ++G + C
Sbjct: 418 VLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 120/302 (39%), Gaps = 56/302 (18%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
S+ + L D G D++W C+ C C + L + +AS+T + ++CS L
Sbjct: 103 SQPVVLTLDTGSDVVWTQCEPCAEC----------FTQPLPRFDTAASNTVRSVACSDPL 152
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C+ + C Y + Y + + S G + D G K +V + GCGM
Sbjct: 153 CNAHSEHGCFLHGCTY-VSGYGDGSLSFGHFLRDSF-TFDDGKGGGKVTV-PDIGFGCGM 209
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGDQG 265
+G +L G+ G G G +S+PS L +R FS CF + S +F G G
Sbjct: 210 YNAGRFLQ--TETGIAGFGRGPLSLPSQLK----VRQ-FSYCFTTRFEAKSSPVFLGGAG 262
Query: 266 PATQQ------STSFLAS------NGKYITYIIGVETCCIGSSCLKQTSFKA------IV 307
ST F+ S N Y+ GV +G + L KA +
Sbjct: 263 DLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVT---VGKTRLPVPEIKADGSGATFI 319
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQ----VNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 363
DSG+ T P V+ + + F Q VN T + C+ ++ +P KL
Sbjct: 320 DSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADEDD-----ICFSWDGKKTAAMP--KL 372
Query: 364 MF 365
+F
Sbjct: 373 VF 374
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 101/412 (24%), Positives = 155/412 (37%), Gaps = 82/412 (19%)
Query: 84 LFPSQGSKTMSLGNDFGCDLLWIPC---DCVRCA--PLSASYYNSLDRDLNEYSPSASST 138
L P ++ ++L D G DL+W PC C+ C P ++ N+ A S
Sbjct: 54 LGPRAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNASPPVNTTRSVAVSCKSPACSA 113
Query: 139 SKHLSCSHRLCDLG---------TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 189
+ +L+ LC + C N K P Y Y + S L D L L S
Sbjct: 114 AHNLASPSDLCAAARCPLESIETSDCANFKCPPFY---YAYGDGSLIARLYRDTLSLSS- 169
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFS 248
L+N GC Y P G+ G G G +S+P+ LA + + N FS
Sbjct: 170 --LFLRN-----FTFGCA------YTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFS 216
Query: 249 MC-----FDKDDSGR---IFFGDQ-----------GPATQQSTSFLASNGKYITYIIGVE 289
C FD + + + G G A T L + Y +G+
Sbjct: 217 YCLVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLI 276
Query: 290 TCCIGS------SCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDR---QVNDTI 336
+G L++ + + +VDSG++FT LP Y ++ EFDR +VN+
Sbjct: 277 GISVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERA 336
Query: 337 TSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVV----NNPVFVIYGTQVVTGF-- 389
E CY +S + ++P + L F NS VV N + G G
Sbjct: 337 RKIEEKTGLAPCYYLNS--VAEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRR 394
Query: 390 --CLAI-------QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
CL + + G T+G G+ V +D E ++G++ C L
Sbjct: 395 VGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCASL 446
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 132/352 (37%), Gaps = 57/352 (16%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---- 150
D G D+ W+ PC +C P Y+ PS SST + C+ +C
Sbjct: 131 DTGSDVSWLQCKPCSSGQCFPQKDPLYD----------PSHSSTYSAVPCASDVCKKLAA 180
Query: 151 --LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
G+ C + KQ C + + Y + TS+ G +D L L G ++ + GCG
Sbjct: 181 DAYGSGCTSGKQ-CGFAISY-ADGTSTVGAYSQDKLTLAPG-------AIVQNFYFGCGH 231
Query: 209 KQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG---D 263
+ G DGV LGLG + SL A+ G + FS C S F
Sbjct: 232 GKHAVRGLFDGV-------LGLGRLR-ESLGARYGGV---FSYCLPSVSSKPGFLALGAG 280
Query: 264 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFLPKE 319
+ P+ T G+ + + +G L + ++F IVDSG+ T L
Sbjct: 281 KNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQST 340
Query: 320 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 379
Y + + F R+ + CY + + +P + L F + ++ P
Sbjct: 341 AYRALRSAF-RKAMEAYRLLPNGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVP--- 396
Query: 380 IYGTQVVTGFCLAIQPV--DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ CLA DG G +G + V+FD K G+ C
Sbjct: 397 ---NGILVNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 107/446 (23%), Positives = 174/446 (39%), Gaps = 67/446 (15%)
Query: 7 TIYLAVFWLLTESS--GAETVMFSTKLIHRFSEEV-----KALGVSKNRNATSWPAKKSF 59
++ L + W L S A FS ++IHR S + NA +
Sbjct: 9 SLALVLLWCLYNISFLKANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAVRRSINRGN 68
Query: 60 EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN--DFGCDLLWIPCD-CVRCAPL 116
+ + +S+D + + ++ S GS + D G D+LW+ C+ C C
Sbjct: 69 HFKKAFVSTDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQ 128
Query: 117 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTS 174
+ ++ PS S T K L CS C+ T+C + C Y++DY + S
Sbjct: 129 TTPIFD----------PSKSKTYKTLPCSSNTCESLRNTACSS-DNVCEYSIDYGDGSHS 177
Query: 175 SSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 233
L VE + + G +SV +IGCG G + + +G +GLG V
Sbjct: 178 DGDLSVETLTLGSTDG-----SSVHFPKTVIGCGHNNGGTFQE----EGSGIVGLGGGPV 228
Query: 234 PSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYI 285
+ + I FS C + + S ++ FGD + + ST NG+ + Y
Sbjct: 229 SLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQ-VFYF 287
Query: 286 IGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
+ +E +G + ++ I+DSG++ T LP+E Y + + +
Sbjct: 288 LTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLE 347
Query: 336 ITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV--FVIYGTQVVTGFCLAI 393
CYK++S L LP + F + V NP+ FV VV C A
Sbjct: 348 RARDPSKLLSLCYKTTSDEL-DLPVITAHFKGAD--VELNPISTFVPVEKGVV---CFAF 401
Query: 394 QPVDGDIGTI-----GQNFMTGYRVV 414
+ IG I QN + GY +V
Sbjct: 402 --ISSKIGAIFGNLAQQNLLVGYDLV 425
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 71/161 (44%), Gaps = 19/161 (11%)
Query: 281 YITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 337
Y Y I + IG L+ S + +VDSG+ T LP +Y+ + AEF +Q
Sbjct: 203 YNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQ------ 256
Query: 338 SFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFC 390
F G+P C+ S+ + +P++K+ F N V+ + + C
Sbjct: 257 -FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVC 315
Query: 391 LAIQPVD--GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
LA+ ++ ++ +G RV++D + K+G++ C
Sbjct: 316 LALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETC 356
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 126/327 (38%), Gaps = 50/327 (15%)
Query: 130 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP--------YTMDYYTENTSSSGLLVE 181
+ P+ SST + + C C Q P CP + + Y ++ LL +
Sbjct: 146 SFDPTRSSTYRPVRCGAPQCS-----QAPAPSCPGGLGSSCAFNLSY--AASTFQALLGQ 198
Query: 182 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 241
D L L D A+ GC +GG V P GL+G G G +S PS
Sbjct: 199 DALALHDDVDAV------AAYTFGCLHVVTGG---SVPPQGLVGFGRGPLSFPSQTKD-- 247
Query: 242 LIRNSFSMCF----DKDDSGRIFFGDQGPATQQSTSFLASNGK-----YITYI---IGVE 289
+ + FS C + SG + G G + T+ L SN Y+ + +G
Sbjct: 248 VYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGR 307
Query: 290 TCCIGSSCLK--QTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 346
+ +S L TS + IVD+G+ FT L VY + F +V + G +
Sbjct: 308 PVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGPLGG-FDT 366
Query: 347 CYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ-VVTGFCLAIQP---VDGDIGT 402
CY + +P+V F S + VI + + +A P VD +
Sbjct: 367 CYNVTI----SVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNV 422
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +RV+FD N ++G+S C
Sbjct: 423 LASMQQQNHRVLFDVANGRVGFSRELC 449
>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 435
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--C 155
D G LW+ CD N Y SST + C C L S C
Sbjct: 63 DIGGQFLWVDCD-------------------NNY---VSSTYRPARCGSAQCSLARSDSC 100
Query: 156 QN----PK-----QPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNSVQASVIIG 205
N PK C T D T++SG L +D++ L S G N ++N+ + +
Sbjct: 101 GNCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQDVVSLQSTNGFNPIQNATVSRFLFS 160
Query: 206 CG---MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFG 262
C + Q G GV+ G+ GLG I++PS LA A R F++C + G FFG
Sbjct: 161 CAPTFLLQ--GLATGVS--GMAGLGRTRIALPSQLASAFSFRRKFAVCLSSSN-GVAFFG 215
Query: 263 DQGP 266
D GP
Sbjct: 216 D-GP 218
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 136/345 (39%), Gaps = 40/345 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G D+ WI +CAP + Y+ + + P++S++ LSC + C +
Sbjct: 162 DTGSDVNWI-----QCAPCADCYHQADPI----FEPASSTSYSPLSCDTKQCQSLDVSEC 212
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 217
C Y + Y + + + E I + DN V IGCG G +
Sbjct: 213 RNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDN---------VAIGCGHNNEGLF--- 260
Query: 218 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD-DSGRIFFGDQGPATQQSTSFL 275
+ GL+GLG G++S PS + + SFS C D+D DS + T+ L
Sbjct: 261 IGAAGLLGLGGGKLSFPSQINAS-----SFSYCLVDRDSDSASTLEFNSALLPHAITAPL 315
Query: 276 ASNGKYIT-YIIGVETCCIGSSCLK--QTSFKA--------IVDSGSSFTFLPKEVYETI 324
N + T Y +G+ +G L ++ F+ I+DSG++ T L Y +
Sbjct: 316 LRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNAL 375
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
F + D + E + CY S + ++P+V + ++I
Sbjct: 376 RDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDS 435
Query: 385 VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T FC A P + IG G RV FD N +G+ C
Sbjct: 436 DGT-FCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 140/356 (39%), Gaps = 62/356 (17%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---- 150
D D+ W+ PC C P S+Y+ PS S TS SCS C
Sbjct: 34 DSASDVPWVQCVPCPIPPCHPQVDSFYD----------PSRSPTSAAFSCSSPTCTALGP 83
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C N + C Y + Y + +S+SG + D+L L +G NA+ GC +
Sbjct: 84 YANGCANNQ--CQYLVR-YPDGSSTSGAYIADLLTLDAG--NAVSG-----FKFGCSHAE 133
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
G + A G++ LG G S+ L A N+FS C S FF P
Sbjct: 134 QGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFFTLGVPRRAS 189
Query: 271 STSFLASNGKY----ITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFLPKEVYE 322
S + ++ Y + + T +G L F A ++DS ++ T LP Y+
Sbjct: 190 SRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQ 249
Query: 323 TIAAEFDRQVNDTITSFEGYPWK----CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
+ A F ++T + P K CY + +LP + L+F N+ + +P
Sbjct: 250 ALRAAF----RSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVF-DRNAVLPLDPSG 304
Query: 379 VIYGTQVVTGFCLAI-----QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+++ CLA + G +G++ Q + V++D +G+ C
Sbjct: 305 ILFND------CLAFTSNADDRMPGVLGSVQQQTI---EVLYDVGGGAVGFRQGAC 351
>gi|302757745|ref|XP_002962296.1| hypothetical protein SELMODRAFT_27319 [Selaginella moellendorffii]
gi|300170955|gb|EFJ37556.1| hypothetical protein SELMODRAFT_27319 [Selaginella moellendorffii]
Length = 163
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 63/140 (45%), Gaps = 10/140 (7%)
Query: 300 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
+S I DSG++ TFLP VY + + F R++N + + CY S QR P
Sbjct: 26 DSSVGTIFDSGTTLTFLPLGVYIQVISVFSRRINLPLVNGTSVGLDLCYNISLQRDYTFP 85
Query: 360 SVKLMFP-------QNNSFVVNNPVFVIYGTQVVTGFCLAIQ-PVDGDIGTIGQNFMTGY 411
S+ L FP Q+N VV + + V CLAI I IG GY
Sbjct: 86 SLALHFPDAWMNLHQDNYIVVPSRADAEAWNESVA--CLAIMSSASIGINIIGNVMQQGY 143
Query: 412 RVVFDRENLKLGWSHSNCQD 431
++FD E + ++ ++C +
Sbjct: 144 HIMFDNEKSTVTFAPASCSE 163
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 144/384 (37%), Gaps = 72/384 (18%)
Query: 95 LGNDFGCDLLWIPCD-----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
L D G DL W+ C+ C + P + + D SS+ + + CS C
Sbjct: 135 LVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRAND----------SSSFRTIPCSSDDC 184
Query: 150 DLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQAS 201
+ T C NP PC + DY Y + G+ + ++ G N K
Sbjct: 185 KIELQDYFSLTECPNPNAPCLF--DYRYLNGPRAIGVFANET---VTVGLNDHKKIRLFD 239
Query: 202 VIIGC--GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKD 254
V+IGC ++ G+ PDG++GLG + S+ LA+ + N FS C +
Sbjct: 240 VLIGCTESFNETNGF-----PDGVMGLGYRKHSLALRLAE--IFGNKFSYCLVDHLSSSN 292
Query: 255 DSGRIFFGD----QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF------- 303
+ FGD + P Q + L + Y + V +G S L +S
Sbjct: 293 HKNFLSFGDIPEMKLPKMQHTELLLGYINAF--YPVNVSGISVGGSMLSISSDIWNVTGV 350
Query: 304 -KAIVDSGSSFTFLPKEVYETIAAE----FDRQ---VNDTITSFEGYPWKCCYKSSSQRL 355
IVDSG+S T L E Y+ + FD+ V + + C++
Sbjct: 351 GGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNF----CFEDKGFDR 406
Query: 356 PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDIGTIGQNFM-TGYRV 413
+P + + F F P Y V G CL I D +I N M +
Sbjct: 407 AAVPRLLIHFADGAIF---KPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLW 463
Query: 414 VFDRENLKLGWSHSNCQDLNDGTK 437
+D KLG+ S+C N +K
Sbjct: 464 EYDLGRGKLGFGPSSCIMSNSNSK 487
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 101/410 (24%), Positives = 159/410 (38%), Gaps = 86/410 (20%)
Query: 90 SKTMSLGNDFGCDLLWIPC---DCVRCA--PLSASYYNSLDRDLNEYSPSASSTSKH--- 141
S +SL D G DL+W PC +C+ C P S + + + +A+ ++ H
Sbjct: 86 SHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSCSAAACSAAHGGS 145
Query: 142 LSCSHRLCDLG---------TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
LS SH LC + + C + P Y Y + S L D L L + +
Sbjct: 146 LSASH-LCAISRCPLESIEISECSSFSCPPFY---YAYGDGSLVARLYRDSLSLPTPAPS 201
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC- 250
N + GC G P G+ G G G +S+PS LA + + N FS C
Sbjct: 202 PPINV--RNFTFGCAHTTLG------EPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCL 253
Query: 251 ----FDKDDS--------GRIFFGDQGPATQQSTSFLASNGKY-ITYIIGVETCCIGSSC 297
F D GR + G+ T+ + L N K+ Y +G+ +G+
Sbjct: 254 VSHSFAADRVRRPSPLILGRYYTGE----TEFIYTSLLENPKHPYFYSVGLAGISVGNIR 309
Query: 298 LKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK-- 345
+ F +VDSG++FT LP +YE++ AEF+ +
Sbjct: 310 IPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTG 369
Query: 346 ---CCYKSSSQRLPKL------PSVKLMFPQNNSFVVNNPVFVIYGTQVV-----TGFCL 391
C Y +S +P++ ++ P+ N F F+ G VV G CL
Sbjct: 370 LSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFY----EFLDGGDGVVGRKRKVG-CL 424
Query: 392 AI-------QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDLND 434
+ + G T+G G+ VV+D E ++G++ C L D
Sbjct: 425 MLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD 474
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 75/386 (19%)
Query: 70 VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDL 128
V + ++ T PQ Q+L L D D WIPC C C
Sbjct: 109 VVRARLGTPPQ-QLL----------LAVDTSNDAAWIPCSGCAGCP------------TT 145
Query: 129 NEYSPSASSTSKHLSCSHRLCDLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 186
++P+AS + + + C C SC + C +++ Y ++S L +D L
Sbjct: 146 TPFNPAASKSYRAVPCGSPACSRAPNPSCSLNTKSCGFSLTY--ADSSLEAALSQDSL-- 201
Query: 187 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 246
A+ N V S GC K +G P GL+GLG G +S L + +
Sbjct: 202 ------AVANDVVKSYTFGCLQKATG---TATPPQGLLGLGRGPLSF--LSQTKDMYEGT 250
Query: 247 FSMCFDK----DDSGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 299
FS C + SG + G +G P ++T L + + Y + + +G +
Sbjct: 251 FSYCLPSFKSLNFSGTLRLGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIP 310
Query: 300 --------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYKS 350
T ++DSG+ FT L Y + E R++ ++S G+ CY +
Sbjct: 311 PAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLGGF--DTCYNT 368
Query: 351 SSQRLPKLPSVKLMF-------PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTI 403
+ K P V MF P +N V+++ YGT A V+ + I
Sbjct: 369 TV----KWPPVTFMFTGMQVTLPADN-LVIHS----TYGTTSCLAMAAAPDGVNTVLNVI 419
Query: 404 GQNFMTGYRVVFDRENLKLGWSHSNC 429
+R++FD N ++G++ C
Sbjct: 420 ASMQQQNHRILFDVPNGRVGFAREQC 445
>gi|68071623|ref|XP_677725.1| aspartyl (acid) protease [Plasmodium berghei strain ANKA]
gi|56497949|emb|CAH98861.1| aspartyl (acid) protease, putative [Plasmodium berghei]
Length = 518
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 101/455 (22%), Positives = 174/455 (38%), Gaps = 92/455 (20%)
Query: 45 SKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLL 104
++N + + ++K+ E Y+ L D+ + F + + +SL D G L
Sbjct: 24 TENEDILNKNSEKNEEIYKYKLYGDIDEY----AYYFMDINIGTPGQKLSLIVDTGSSSL 79
Query: 105 WIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 163
PC +C C N ++ + SSTS L C+ +C C K C
Sbjct: 80 SFPCSECKDCGVHME----------NPFNLNNSSTSSILYCNDNICPYNLKC--VKGRCE 127
Query: 164 YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 223
Y + Y E + +G DI+ L S +N ++ +GC M + G +L A G+
Sbjct: 128 Y-LQSYCEGSRINGFYFSDIVRLES-NNNTKNGNITFKKHMGCHMHEEGLFLHQHAT-GV 184
Query: 224 IGLGL----GEISVPSLLAKAGLIRNS-FSMCFDKDDSGRIFFGDQGPATQQSTS----- 273
+GL L G + LL K+ N FS+C + I G + S
Sbjct: 185 LGLSLTKPKGVPTFIDLLFKSSPKLNKIFSLCISEYGGELILGGYSKDYIVKEVSIDEKK 244
Query: 274 -----------------------FLASNGKYITYIIGVETCCIGSS-CLKQTSFKAIVDS 309
+ A KY YI G++ S + +VDS
Sbjct: 245 DNIEHNKNENINSINKSIVDGILWEAITRKYYYYIRVKGFQLFGTTFSHNNKSMEMLVDS 304
Query: 310 GSSFTFLPKEVYETIAAEFD-----------------RQVNDTITS----FEGYP----- 343
GS+FT LP ++Y + FD + N+T+++ F+ +
Sbjct: 305 GSTFTHLPDDLYNNLNFFFDILCIHNMNNPIDIEKKLKITNETLSNHLLYFDDFKSTLKN 364
Query: 344 ----WKCCYKSSS-----QRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQ 394
C K + + L LP++ + NN+ +V P +Y + + +C ++
Sbjct: 365 IISSENVCVKIADNVQCWRYLENLPNIYIKL-SNNTKLVWQPSSYLYKKE--SFWCKGLE 421
Query: 395 PVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
D +G +F +++FD +N K+G+ SNC
Sbjct: 422 KQVNDKPILGLSFFKNKQIIFDLKNNKIGFIESNC 456
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 129/347 (37%), Gaps = 57/347 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D DL+W C AP ++P S+T + C+ C +C
Sbjct: 118 DISSDLVWTACGAT--AP---------------FNPVRSTTVADVPCTDDACQQFAPQTC 160
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C YT Y +++GLL + GD + V+ GCG+K G +
Sbjct: 161 GAGASECAYTYMYGGGAANTTGLLGTEAFTF---GDTRIDG-----VVFGCGLKNVGDF- 211
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----GRIFFGDQG-PATQQ 270
GV+ G+IGLG G +S+ S L + FS F DDS I FGD P T
Sbjct: 212 SGVS--GVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDTQSFILFGDDATPQTSH 264
Query: 271 --STSFLASNGKYITYIIGVETCCIGSSCLKQTS--FKAIVDSGSSFTFLPKEVYETIAA 326
ST LAS+ Y + + + L S F GS FL T+
Sbjct: 265 TLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLE 324
Query: 327 EFD-RQVNDTITSFEGYP--------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
E + + + S G P CY S K+PS+ L+F V+ +
Sbjct: 325 EAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFA--GGAVMELEL 382
Query: 378 FVIYGTQVVTGF-CLAIQPVD-GDIGTIGQNFMTGYRVVFDRENLKL 422
+ TG CL I P GD +G G +++D KL
Sbjct: 383 GNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKL 429
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 141/355 (39%), Gaps = 52/355 (14%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G D++WI C+ C C Y+ D N PS+S + + C +C +
Sbjct: 26 DTGSDVVWIQCEPCREC-------YSQADPIFN---PSSSVSFSTVGCDSAVCSQLDAND 75
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y + Y + + + G + L G +++N V IGCG G +
Sbjct: 76 CHGGGCLYEVSY-GDGSYTVGSYATETLTF---GTTSIQN-----VAIGCGHDNVGLF-- 124
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFGDQG-PATQQST 272
V GL+GLG G +S P+ L +FS C D + SG + FG + P T
Sbjct: 125 -VGAAGLLGLGAGSLSFPAQLGTQ--TGRAFSYCLVDRDSESSGTLEFGPESVPIGSIFT 181
Query: 273 SFLASNGKYITYIIGVETCCIGSSCLKQTSFKA------------IVDSGSSFTFLPKEV 320
+A+ Y + + +G L +A I+DSG++ T L
Sbjct: 182 PLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSA 241
Query: 321 YETIAAEFDRQVN-----DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 375
Y+ + F D I+ F+ CY S+ + +P+V F F++
Sbjct: 242 YDALRDAFIAGTQHLPRADGISIFD-----TCYDLSALQSVSIPAVGFHFSNGAGFILPA 296
Query: 376 PVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+I + T FC A P D ++ +G G RV FD N +G++ CQ
Sbjct: 297 KNCLIPMDSMGT-FCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQCQ 350
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 157/392 (40%), Gaps = 64/392 (16%)
Query: 63 QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYN 122
Q+ + + V + K+ T PQ M + D D +W+PC C+ S + +
Sbjct: 24 QLHIGNYVVRAKLGTPPQL-----------MFMVLDTSNDAVWLPCS--GCSGCSNASTS 70
Query: 123 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLL 179
+ YS + ST++ C+ G +C + P P + Y ++S S L
Sbjct: 71 FNTNSSSTYSTVSCSTAQ---CTQAR---GLTCPS-SSPQPSVCSFNQSYGGDSSFSASL 123
Query: 180 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 239
V+D L L V + GC SG + + P GL+GLG G +S+ S
Sbjct: 124 VQDTL--------TLAPDVIPNFSFGCINSASG---NSLPPQGLMGLGRGPMSLVS--QT 170
Query: 240 AGLIRNSFSMCFDKDDS----GRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIG 294
L FS C S G + G G P + + T L + + Y + + +G
Sbjct: 171 TSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVG 230
Query: 295 SSCLK----QTSFKA------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-P 343
S + +F A I+DSG+ T + VYE I EF +QVN ++SF
Sbjct: 231 SVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVN--VSSFSTLGA 288
Query: 344 WKCCYKSSSQRL-PK----LPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG 398
+ C+ + ++ + PK + S+ L P N+ + ++ GT Q +
Sbjct: 289 FDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSS-----AGTLTCLSMAGIRQNANA 343
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ I R++FD N ++G + C
Sbjct: 344 VLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 375
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 136/354 (38%), Gaps = 54/354 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P+ SST ++SC+ C DL C
Sbjct: 198 DTGSDTTW-----VQCQPCVVVCYEQREK---LFDPARSSTYANVSCAAPACSDLNIHGC 249
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 250 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNEGLFG 299
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIFF-----GDQGPATQ 269
+ GL+GLG G+ S+P K G + F+ C +G + + +
Sbjct: 300 EAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLDFGAGSLAAASAR 353
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA---IVDSGSSFTFLPKEVYETI 324
+T L NG Y +G+ +G L Q+ F IVDSG+ T LP Y ++
Sbjct: 354 LTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAAYSSL 412
Query: 325 AAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP- 376
R + GY CY + +P+V L+F V+
Sbjct: 413 -----RYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 467
Query: 377 -VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ +QV F A GD+G +G + + V +D +G+ C
Sbjct: 468 IMYAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|32482806|gb|AAP84703.1| putative xyloglucanase inhibitor [Solanum tuberosum]
Length = 437
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 94/395 (23%), Positives = 153/395 (38%), Gaps = 94/395 (23%)
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 152
+SL D G LW+ CD SS+ K C C LG
Sbjct: 60 ISLTLDLGGQFLWVDCD----------------------QGYVSSSYKPARCRSAQCSLG 97
Query: 153 TS-----CQNPKQP------CPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNSVQA 200
+ C +P +P C D T++SG L DI+ + S G N ++
Sbjct: 98 GASGCGECFSPPRPGCNNNTCGLLPDNTVTRTATSGELASDIVSVQSTNGKNPGRSVSDK 157
Query: 201 SVIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-G 257
+ + CG L G+A G+ GLG IS+PS + F++C +S G
Sbjct: 158 NFLFVCGATF---LLQGLASGVKGMAGLGRTRISLPSQFSAEFSFPRKFALCLTSSNSKG 214
Query: 258 RIFFGDQGP--------------------ATQQSTSFLASNGKYIT-YIIGVETCCIGSS 296
+ FGD GP ST+ S+G+ + Y IGV++ I
Sbjct: 215 VVLFGD-GPYFFLPNREFSNNDFQYTPLFINPVSTASAFSSGQPSSEYFIGVKSIKINQK 273
Query: 297 CLK-QTSFKAIVDSG---------SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 346
+ T+ +I + G + +T L +Y I F +++ + P+K
Sbjct: 274 VVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNFFVKELANVTRVAAVAPFKV 333
Query: 347 CYKS----SSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTG-----FCLAIQPVD 397
C+ S S++ P +PS+ L+ QN N V+ I+G + CL + +D
Sbjct: 334 CFDSRNIGSTRVGPAVPSIDLVL-QN-----ENVVWTIFGANSMVQVSENVLCLGV--LD 385
Query: 398 GDIGT-----IGQNFMTGYRVVFDRENLKLGWSHS 427
G + + IG + + + FD +LG++ S
Sbjct: 386 GGVNSRTSIVIGGHTIEDNLLQFDHAASRLGFTSS 420
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 83/368 (22%), Positives = 149/368 (40%), Gaps = 63/368 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T S D G DL+W C C C D+ + P SS+ L CS L
Sbjct: 107 AETYSAIMDTGSDLIWTQCKPCKVC----------FDQPTPIFDPEKSSSFSKLPCSSDL 156
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C + + C Y Y +++S+ G+L + GD ++ + + GCG
Sbjct: 157 C-VALPISSCSDGCEYRYSY-GDHSSTQGVLATETFTF---GDASV-----SKIGFGCGE 206
Query: 209 KQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--RIFFGDQG 265
G Y G GL+GLG G +S L+++ G+ + S+ + D G + G +
Sbjct: 207 DNRGRAYSQGA---GLVGLGRGPLS---LISQLGVPKFSYCLTSIDDSKGISTLLVGSE- 259
Query: 266 PATQQS---TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--------IVDSGSS 312
AT +S T + + + Y + +E +G + L ++++F I+DSG++
Sbjct: 260 -ATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTT 318
Query: 313 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY----KSSSQRLPKL----PSVKLM 364
T+L + + EF Q+ + + + C+ S +P+L V L
Sbjct: 319 ITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLK 378
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGW 424
P+ N + ++ + VI CL + G + G V+ D E + +
Sbjct: 379 LPKENYIIEDSALRVI---------CLTMGSSSG-MSIFGNFQQQNIVVLHDLEKETISF 428
Query: 425 SHSNCQDL 432
+ + C L
Sbjct: 429 APAQCNQL 436
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 131/331 (39%), Gaps = 49/331 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC M G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 SFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSERGFFS 165
Query: 262 --------GDQGPATQQSTSFLASNGK-----YITYI-IGVETCCIGSSCLKQTSFKAIV 307
G T + + + K ++ I I V+ +G S + +
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P ++ R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLSQRI-RELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQPVD 397
F + ++ VFV Q +CLA P +
Sbjct: 285 AARFDLGSHGVFVERSVQEQDVWCLAFAPTE 315
>gi|115463625|ref|NP_001055412.1| Os05g0384300 [Oryza sativa Japonica Group]
gi|50511407|gb|AAT77330.1| unknown protein [Oryza sativa Japonica Group]
gi|113578963|dbj|BAF17326.1| Os05g0384300 [Oryza sativa Japonica Group]
gi|222631434|gb|EEE63566.1| hypothetical protein OsJ_18383 [Oryza sativa Japonica Group]
Length = 477
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 131/346 (37%), Gaps = 58/346 (16%)
Query: 98 DFGCDLLWIPCD-CVR---CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT 153
D +W+PC+ CV C Y +L R+L SC + C
Sbjct: 105 DISSQFVWVPCEECVSPYSCPSDKTGVYKTLPREL-------------YSCGEQRCRTIV 151
Query: 154 ---SCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
C P PC YT Y + + + L + GDN + ++I GCG++
Sbjct: 152 GQPDCGAPYNGPCKYTCRYGGAGGTETEGHLG--LQPFTLGDNTMP----VNMIFGCGLE 205
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR-----IFFGDQ 264
+ G+IGL G +S L+++ L R S+ + DD+ I FG+
Sbjct: 206 PETNF-------GVIGLNRGRLS---LISQLQLGRFSYYFAPEYDDTAAGNASFILFGEY 255
Query: 265 G-PATQ--QSTSFLA-SNGKY-ITYIIGVETCCIGSSCLK--------QTSFKAIVDSGS 311
P T + T F + NG Y Y++G+ +GS+ L + A + +
Sbjct: 256 AVPQTSNPRYTQFWSYENGAYSYLYLVGLSGMRVGSNNLNMLGAGSGGRDPLVAYLSTSV 315
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
TFL K Y+ + E V CY S K P++ L+F + +
Sbjct: 316 PITFLEKNAYDLLRRELVSTVGSDTVDGSALGLDLCYTSQYLAKAKFPAMALVF-WDGAV 374
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVF 415
+ P +Y CL I P V G + +G TG +++
Sbjct: 375 MELQPRNYLYQDTATGLECLTILPTAVAGGLSLLGSLIQTGTHMMY 420
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 140/359 (38%), Gaps = 61/359 (16%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-- 154
D G D++WI C C +C Y+ D N P+ S + ++ C LC S
Sbjct: 165 DTGSDVVWIQCAPCKKC-------YSQTDPVFN---PTKSRSFANIPCGSPLCRRLDSPG 214
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C K C Y + Y + + + G + L + + V +GCG G +
Sbjct: 215 CSTKKHICLYQVSY-GDGSFTYGEFSTETL--------TFRGTRVGRVALGCGHDNEGLF 265
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR---IFFGDQGPATQQ 270
+ L+GLG G +S PS + + FS C D+ S + + FGD +
Sbjct: 266 IGAAG---LLGLGRGRLSFPSQIGRR--FSRKFSYCLVDRSASSKPSYMVFGDSAISRTA 320
Query: 271 STSFLASNGK----YITYIIGVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPK 318
+ L SN K Y ++GV + + FK I+DSG+S T L +
Sbjct: 321 RFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTR 380
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF-------PQNNSF 371
Y + F ++ + E + C+ S + K+P+V L F P +N
Sbjct: 381 PAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYL 440
Query: 372 V-VNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ V+N FC A + +G G+RVV+D ++G++ C
Sbjct: 441 IPVDNS----------GSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 145/373 (38%), Gaps = 83/373 (22%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + W C CV C S +++SL ASST SC + ++
Sbjct: 145 DTGSSITWTQCKACVHCLKDSHRHFDSL----------ASSTYSFGSC------IPSTVG 188
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
N Y M Y + ++S G D + L + V GCG G +
Sbjct: 189 NT-----YNMTY-GDKSTSVGNYGCDTMTL-------EPSDVFQKFQFGCGRNNEGDF-- 233
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQSTSF- 274
G DG++GLG G++S S A + FS C +++S G + FG++ AT QS+S
Sbjct: 234 GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLFGEK--ATSQSSSLK 289
Query: 275 ------------LASNGKYITYI----IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
L +G Y + +G + I SS S I+DSG+ T LP+
Sbjct: 290 FTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF--ASPGTIIDSGTVITRLPQ 347
Query: 319 EVYETIA------------AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP 366
Y + + R+ ND + + CY S ++ LP L F
Sbjct: 348 RAYSALKAAFKKAMAKYPLSNGRRKENDMLDT--------CYNLSGRKDVLLPEXVLHFG 399
Query: 367 QNNSFVVNNPVFVIYGTQVVTGFCLAI-----QPVDGDIGTIGQNFMTGYRVVFDRENLK 421
+N V++G + CLA ++ ++ IG V++D +
Sbjct: 400 DGADVRLNGKR-VVWGND-ASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRR 457
Query: 422 LGWSHSNCQDLND 434
+G+ + C +L +
Sbjct: 458 IGFGGNGCSNLKN 470
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 88/205 (42%), Gaps = 22/205 (10%)
Query: 242 LIRNSFSMCF---DKDDSGRIFFGDQGPATQQSTSFLASNGKYITY----IIGVETCCIG 294
L SFS C D + S + F P+ TS L N ++ T+ +IG+ +G
Sbjct: 324 LEATSFSYCLVDLDSESSSTLDFNADQPS-DSLTSPLVKNDRFPTFRYVKVIGMS---VG 379
Query: 295 SSCL--KQTSFKA--------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 344
L +SF+ IVDSG++ T +P +VY+ + F + + P+
Sbjct: 380 GKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPF 439
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIG 404
CY SSQ ++P++ + P NS + +I T FCLA P + IG
Sbjct: 440 DTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGT-FCLAFLPSTFPLSIIG 498
Query: 405 QNFMTGYRVVFDRENLKLGWSHSNC 429
G RV +D N +G+S C
Sbjct: 499 NVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|302763589|ref|XP_002965216.1| hypothetical protein SELMODRAFT_27315 [Selaginella moellendorffii]
gi|300167449|gb|EFJ34054.1| hypothetical protein SELMODRAFT_27315 [Selaginella moellendorffii]
Length = 163
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 63/140 (45%), Gaps = 10/140 (7%)
Query: 300 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 359
+S I DSG++ TFLP VY + + F R++N + + CY S QR P
Sbjct: 26 DSSVGTIFDSGTTLTFLPLGVYIQVISVFSRRINLPLVNGTSVGLDLCYNISLQRDYTFP 85
Query: 360 SVKLMFP-------QNNSFVVNNPVFVIYGTQVVTGFCLAIQ-PVDGDIGTIGQNFMTGY 411
S+ L FP Q+N +V + + V CLAI I IG GY
Sbjct: 86 SLALHFPDAWMNLHQDNYIIVPSRADAEAWNESVA--CLAIMSSASIGINIIGNVMQEGY 143
Query: 412 RVVFDRENLKLGWSHSNCQD 431
++FD E + ++ ++C +
Sbjct: 144 HIMFDNEKSTVTFAPASCSE 163
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 103/470 (21%), Positives = 169/470 (35%), Gaps = 73/470 (15%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHR-------FSEEVKALGVSKNRNATSW 53
M + L+ + L+T + + KL HR S +G + R++
Sbjct: 1 MQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLIS 60
Query: 54 PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRC 113
+ S ++ L S + T F + +K + D G +L W+ C
Sbjct: 61 RKRNSTVGVKMDLGSGID---YGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCR---- 113
Query: 114 APLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYT 165
Y + +D + S + K + C + C + T+C P PC Y
Sbjct: 114 -------YRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY- 165
Query: 166 MDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQASV---IIGCGMKQSGGYLDGVAPD 221
DY Y + +++ G+ ++ + + L N A + +IGC +G G D
Sbjct: 166 -DYRYADGSAAQGVFAKETITV------GLTNGRMARLPGHLIGCSSSFTGQSFQGA--D 216
Query: 222 GLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLA 276
G++GL + S S L FS C +K+ S + FG + T+F
Sbjct: 217 GVLGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGS---SRSTKTAFRR 271
Query: 277 SNGKYITYI------------IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYET 323
+ +T I +G + I S TS I+DSG+S T L Y+
Sbjct: 272 TTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQ 331
Query: 324 IAAEFDRQ-VNDTITSFEGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ R V EG P + C+ +S + KLP + F + +++
Sbjct: 332 VVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVD 391
Query: 382 GTQVVT--GFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V GF A P IG I Q Y FD L ++ S C
Sbjct: 392 AAPGVKCLGFVSAGTPATNVIGNIMQQ---NYLWEFDLMASTLSFAPSAC 438
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 111/271 (40%), Gaps = 56/271 (20%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 157
D G D +W C C P L++ + PS SST K + C+ +C
Sbjct: 108 DTGNDNIWFQCK--PCKP-------CLNQTSPMFHPSKSSTYKTIPCTSPIC-------- 150
Query: 158 PKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
+N L V+ + L+ +G + KN ++IGCG + G L+
Sbjct: 151 -------------KNADGHYLGVDTLTLNSNNGTPISFKN-----IVIGCGHRNQGP-LE 191
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDD-SGRIFFGDQGPAT--- 268
G G IGL G +S S L + I FS C F K++ S ++ FGD+ +
Sbjct: 192 GYV-SGNIGLARGPLSFISQLNSS--IGGKFSYCLVPLFSKENVSSKLHFGDKSTVSGLG 248
Query: 269 QQSTSFLASNGKYITYIIGVETCCIGSSCLK----QTSFKAIVDSGSSFTFLPKEVYETI 324
ST NG Y + +E +G +K +I+DSG++ T LPK+VY +
Sbjct: 249 TVSTPIKEENG----YFVSLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTILPKDVYSRL 304
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 355
+ V + CY+++S L
Sbjct: 305 ESVVLDMVKLKRVKDPSQQFNLCYQTTSTTL 335
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 89/207 (42%), Gaps = 26/207 (12%)
Query: 242 LIRNSFSMCF---DKDDSGRIFFGDQGPATQQSTSFLASNGKYITY----IIGVETCCIG 294
L SFS C D + S + F P+ TS L N ++ T+ +IG+ +G
Sbjct: 324 LEATSFSYCLVDLDSESSSTLDFNADQPS-DSLTSPLVKNDRFPTFRYVKVIGMS---VG 379
Query: 295 SSCL--KQTSFKA--------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 344
L +SF+ IVDSG++ T +P +VY+ + F + + P+
Sbjct: 380 GKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPF 439
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NNPVFVIYGTQVVTGFCLAIQPVDGDIGT 402
CY SSQ ++P++ + P NS + N +F + FCLA P +
Sbjct: 440 DTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQV---DSAGTFCLAFLPSTFPLSI 496
Query: 403 IGQNFMTGYRVVFDRENLKLGWSHSNC 429
IG G RV +D N +G+S C
Sbjct: 497 IGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 101/427 (23%), Positives = 164/427 (38%), Gaps = 56/427 (13%)
Query: 26 MFSTKLIHRFSEEVKALGVSK----NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
+FS++L R S VK++ RN T P F SS V +G F
Sbjct: 91 LFSSRL-QRDSRRVKSIATLAAQIPGRNVTHAPRPGGFS------SSVVSGLSQGSGEYF 143
Query: 82 QMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
L ++ + + D G D++W+ C C RC S ++ P S T
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFD----------PRKSKTYA 193
Query: 141 HLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 198
+ CS C C ++ C Y + Y + + E + +N V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR--------RNRV 245
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 257
+ V +GCG G + V GL+GLG G++S P FS C D+ S
Sbjct: 246 KG-VALGCGHDNEGLF---VGAAGLLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSASS 299
Query: 258 R---IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTS---FK------ 304
+ + FG+ + + L SN K T Y +G+ +G + + + FK
Sbjct: 300 KPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGN 359
Query: 305 --AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 362
I+DSG+S T L + Y + F + + + C+ S+ K+P+V
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVV 419
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
L F + + + T FC A G + IG G+RVV+D + ++
Sbjct: 420 LHFRGADVSLPATNYLIPVDTN--GKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 477
Query: 423 GWSHSNC 429
G++ C
Sbjct: 478 GFAPGGC 484
>gi|125552158|gb|EAY97867.1| hypothetical protein OsI_19787 [Oryza sativa Indica Group]
Length = 477
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 131/346 (37%), Gaps = 58/346 (16%)
Query: 98 DFGCDLLWIPCD-CVR---CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT 153
D +W+PC+ CV C Y +L R+L SC + C
Sbjct: 105 DISSQFVWVPCEECVSPYSCPSDKTGVYKTLPREL-------------YSCGEQRCRTIV 151
Query: 154 ---SCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
C P PC YT Y + + + L + GDN + ++I GCG++
Sbjct: 152 GQPDCGAPYNGPCKYTCRYGGAGGTETEGHLG--LQPFTLGDNTMP----VNMIFGCGLE 205
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR-----IFFGDQ 264
+ G+IGL G +S L+++ L R S+ + DD+ I FG+
Sbjct: 206 PETNF-------GVIGLNRGRLS---LISQLQLGRFSYYFAPEYDDTAAGNASFILFGEY 255
Query: 265 G-PATQ--QSTSFLA-SNGKY-ITYIIGVETCCIGSSCLK--------QTSFKAIVDSGS 311
P T + T F + NG Y Y++G+ +GS+ L + A + +
Sbjct: 256 AVPQTSNPRYTQFWSYENGAYSYLYLVGLSGMRVGSNNLNMLGAGSGGRDPLVAYLSTSV 315
Query: 312 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 371
TFL K Y+ + E V CY S K P++ L+F + +
Sbjct: 316 PVTFLEKNAYDLLRRELVSTVGSDTVDGSALGLDLCYTSQYLAKAKFPAMALVF-WDGAV 374
Query: 372 VVNNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVF 415
+ P +Y CL I P V G + +G TG +++
Sbjct: 375 MELQPRNYLYQDTATGLECLTILPTAVAGGLSLLGSLIQTGTHMMY 420
>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 350
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 57/128 (44%), Gaps = 6/128 (4%)
Query: 306 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK--LPSVKL 363
+VDSG++ FL + Y ++ A R+V I + C S P+ LP +K
Sbjct: 222 VVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKF 281
Query: 364 MFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG--TIGQNFMTGYRVVFDRENLK 421
F FV + I + + CLAIQ VD +G IG G+ FDR+ +
Sbjct: 282 EFSGGAVFVPPPRNYFIETEEQIQ--CLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSR 339
Query: 422 LGWSHSNC 429
LG+S C
Sbjct: 340 LGFSRRGC 347
>gi|414888272|tpg|DAA64286.1| TPA: hypothetical protein ZEAMMB73_677781 [Zea mays]
Length = 118
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 388 GFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL-NDGTKSPLTPGP-G 445
+CLA+ +G + IG+NFM+G +VVFDRE LGW + +C + N + P+ P P G
Sbjct: 2 AYCLAVMKSEG-VNLIGENFMSGLKVVFDRERKVLGWKNFDCYSVGNSRSNLPVNPNPSG 60
Query: 446 TPSNPL-------PANQEQSSPGGHAV 465
P P P + +SP G V
Sbjct: 61 VPPKPALGPNSYTPEATKGASPNGTQV 87
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 102/428 (23%), Positives = 167/428 (39%), Gaps = 58/428 (13%)
Query: 26 MFSTKLIHRFSEEVKALGVSK----NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
+FS++L R S V+++ RN T P F SS V +G F
Sbjct: 91 LFSSRL-QRDSRRVRSIATLAAQIPGRNVTHAPRPGGFS------SSVVSGLSQGSGEYF 143
Query: 82 QMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
L ++ + + D G D++W+ C C RC S ++ P S T
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFD----------PRKSKTYA 193
Query: 141 HLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 198
+ CS C C ++ C Y + Y + + E + +N V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR--------RNRV 245
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 257
+ V +GCG G + V GL+GLG G++S P FS C D+ S
Sbjct: 246 KG-VALGCGHDNEGLF---VGAAGLLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSASS 299
Query: 258 R---IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTS---FK------ 304
+ + FG+ + + L SN K T Y +G+ +G + + + FK
Sbjct: 300 KPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGN 359
Query: 305 --AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSV 361
I+DSG+S T L + Y + F R T+ + + C+ S+ K+P+V
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAF-RVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTV 418
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
L F + + + + T FC A G + IG G+RVV+D + +
Sbjct: 419 VLHFRRADVSLPATNYLIPVDTN--GKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSR 476
Query: 422 LGWSHSNC 429
+G++ C
Sbjct: 477 VGFAPGGC 484
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 101/440 (22%), Positives = 170/440 (38%), Gaps = 97/440 (22%)
Query: 33 HRFSEEVKALGVSKNRNATSWPA--KKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS 90
HRF+ +L KN + +S P + F+Y L+ S + T PQ Q + GS
Sbjct: 41 HRFTT---SLLSRKNPSPSSPPYNFRSRFKYSMALIIS----LPIGTPPQAQQMVLDTGS 93
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ L WI C + P + + PS SS+ L CSH LC
Sbjct: 94 Q-----------LSWIQCHRKKLPP----------KPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 151 -------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
L TSC + + C Y+ +Y + T + G LV++ + + + +I
Sbjct: 133 PRIPDFTLPTSCDSNRL-CHYSY-FYADGTFAEGNLVKEKITFSN-------TEITPPLI 183
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD-------S 256
+GC + S G++G+ G +S +++A + + FS C +
Sbjct: 184 LGCATESSDD-------RGILGMNRGRLS---FVSQAKI--SKFSYCIPPKSNRPGFTPT 231
Query: 257 GRIFFGDQG-------------PATQQSTSF--LASNGKYITYIIGVETCCIGSSCLKQT 301
G + GD P +Q+ + LA I G++ I S +
Sbjct: 232 GSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPD 291
Query: 302 ---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW----KCCYKSSSQR 354
S + +VDSGS FT L Y+ + AE +V + +GY + C+ +
Sbjct: 292 AGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLK--KGYVYGGTADMCFDGNVAM 349
Query: 355 LPKL-PSVKLMFPQN-NSFVVNNPVFVIYGTQVVTGFCLAI---QPVDGDIGTIGQNFMT 409
+P+L + +F + FV V V G + C+ I + IG
Sbjct: 350 IPRLIGDLVFVFTRGVEIFVPKERVLVNVGGGI---HCVGIGRSSMLGAASNIIGNVHQQ 406
Query: 410 GYRVVFDRENLKLGWSHSNC 429
V FD N ++G++ ++C
Sbjct: 407 NLWVEFDVTNRRVGFAKADC 426
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 137/345 (39%), Gaps = 39/345 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQ 156
D G L W+ +C+P S + + + P ASST + CS CD L +
Sbjct: 152 DTGSSLTWL-----QCSPCVVSCHRQVG---PLFDPRASSTYASVRCSASQCDELQAATL 203
Query: 157 NP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
NP C Y Y +++ S G L D + + ++ S GCG
Sbjct: 204 NPSACSASNVCIYQAS-YGDSSFSVGSLSTDTV--------SFGSTRYPSFYYGCGQDNE 254
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQ 270
G + GLIGL ++S+ LA + + SFS C S G + G
Sbjct: 255 GLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGPYNTGHYY 309
Query: 271 STSFLASNGKYIT-YIIGVETCCIGSSCL-----KQTSFKAIVDSGSSFTFLPKEVYETI 324
S + +AS+ + Y I + +G S L + +S I+DSG+ T LP V+ +
Sbjct: 310 SYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTAL 369
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
+ + + + C++ + +L ++P+V + F S + +I
Sbjct: 370 SKAVAQAMAGAQRAPAFSILDTCFEGQASQL-RVPTVAMAFAGGASMKLTTRNVLIDVDD 428
Query: 385 VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T CLA P D IG + V++D ++G+S C
Sbjct: 429 STT--CLAFAPTD-STAIIGNTQQQTFSVIYDVAQSRIGFSAGGC 470
>gi|330794218|ref|XP_003285177.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
gi|325084898|gb|EGC38316.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
Length = 817
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 92/388 (23%), Positives = 156/388 (40%), Gaps = 58/388 (14%)
Query: 131 YSPSASSTSKHLSCSHRL-CDLGTSCQNPK--QPCPYTMDYYTENTSSSGLLVEDILHLI 187
YS S +S L+CS C+ +C+N K +PCP+ + Y + + +G LV D H+
Sbjct: 259 YSLEESISSNQLNCSDTSNCN---TCKNNKSNKPCPFVLKY-GDGSFIAGSLVID--HVT 312
Query: 188 SGG-------DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS------VP 234
G N K S+ S + ++S DG++GL ++ +
Sbjct: 313 IGDFTVPAKFGNIQKESLSFSQLTCPSTQRSQA-----VRDGILGLSFQQLDPDNGDDIF 367
Query: 235 SLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 294
S + I N FSMC KD G TQ++ + + Y I V +G
Sbjct: 368 SKIVAHYNIPNVFSMCLGKDGGLLTIGGTNDHITQETPKYTPIFDSHY-YSITVTNIYVG 426
Query: 295 SSCLKQTS---FKAIVDSGSSFTFLPKEVYETIAAEFDRQ-------VNDTITSFEGYPW 344
+ L +IVDSG++ + E++ +I + + ND +EG
Sbjct: 427 NDSLNLAPPDLSTSIVDSGTTLLYFSDEIFYSIVRNLEEKHCELPGICNDPF--WEG--- 481
Query: 345 KCCYKSSSQRLPKLPSVKLMFPQNN---SFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIG 401
C+ + + + P++ L N SF + P +Y + +C I +
Sbjct: 482 -NCHHLEEKLISEYPTIYLEMKGMNGEPSFKLEVPP-DLYFLNINGLYCFGISHMKEISV 539
Query: 402 TIGQNFMTGYRVVFDRENLKLGW--SHSNCQDLNDGTKSPLTPGPGTPSNPLPANQEQSS 459
IG + GY V+++REN +G+ +H N+ T L+ G N ++S+
Sbjct: 540 LIGDVVLQGYNVIYNRENSSIGFARTHGCSTKGNNNTSLMLSIESG--------NLQKST 591
Query: 460 PGGHAVGPAVAGRAPSKPSTASTQLISS 487
P V + SK TA + +I S
Sbjct: 592 EEERFASPLVLKLSDSKNKTAVSGIIVS 619
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 92/402 (22%), Positives = 146/402 (36%), Gaps = 87/402 (21%)
Query: 62 YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASY 120
+Q LL + V M +L T S+ D G DL+W C C +C
Sbjct: 75 FQALLENGVGGYNMNISVGTPLL-------TFSVVADTGSDLIWTQCAPCTKC------- 120
Query: 121 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYT---MDYYTENTSSSG 177
+ + P++SST L C+ C N + C T +Y + ++G
Sbjct: 121 ---FQQPAPPFQPASSSTFSKLPCTSSFCQF---LPNSIRTCNATGCVYNYKYGSGYTAG 174
Query: 178 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 237
L + L + GD + SV GC + G LD LG+G
Sbjct: 175 YLATETLKV---GDASFP-----SVAFGCSTENGLGQLD---------LGVGR------- 210
Query: 238 AKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQ---QSTSFLASNGKYITYI-IGVET 290
FS C + I FG T QST F+ + + +Y + +
Sbjct: 211 ---------FSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTG 261
Query: 291 CCIGSSCLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 339
+G + L T+ IVDSG++ T+L K+ YE + F Q D T
Sbjct: 262 ITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVN 321
Query: 340 EGYPWKCCYKSSSQRLPKL--PSVKLMFPQNNSFVVNNPVFVIYGTQV-----VTGFCLA 392
C+KS+ + PS+ L F + V P + G + VT CL
Sbjct: 322 GTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAV--PTY-FAGVETDSQGSVTVACLM 378
Query: 393 IQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
+ P GD + IG +++D + ++ ++C +
Sbjct: 379 MLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADCAKV 420
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 83/369 (22%), Positives = 146/369 (39%), Gaps = 72/369 (19%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ CA Y++ + R S+T + L C C +S
Sbjct: 107 DTGSDLIWTQCAPCLLCAAQPTPYFD-VKR---------SATYRALPCRSSRCAALSSPS 156
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
K+ C Y YY + S++G+L + + ++ A++ GCG +G +
Sbjct: 157 CFKKMCVYQY-YYGDTASTAGVLANETFTFGAASSTKVR---AANISFGCGSLNAGELAN 212
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFFG---------DQGP 266
G++G G G +S L+++ G R S+ + + R++FG
Sbjct: 213 S---SGMVGFGRGPLS---LVSQLGPSRFSYCLTSYLSPTPSRLYFGVFANLNSTNTSSG 266
Query: 267 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----------KAIVDSGSSFTFL 316
+ QST F+ + Y + V+ +G+ L I+DSG+S T+L
Sbjct: 267 SPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWL 326
Query: 317 PKEVYETIAAEFDRQV-----NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ---- 367
++ YE + + NDT + C++ P P+V + P
Sbjct: 327 QQDAYEAVRRGLASTIPLPAMNDTDIGLD-----TCFQ-----WPPPPNVTVTVPDFVFH 376
Query: 368 ----NNSFVVNNPVFVIYGTQVVTGF-CLAIQPVDGDIGTIGQNF-MTGYRVVFDRENLK 421
N + N + + TG+ CLA+ P +GTI N+ +++D N
Sbjct: 377 FDGANMTLPPENYMLI----ASTTGYLCLAMAPT--SVGTIIGNYQQQNLHLLYDIANSF 430
Query: 422 LGWSHSNCQ 430
L + + C
Sbjct: 431 LSFVPAPCD 439
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 137/345 (39%), Gaps = 39/345 (11%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQ 156
D G L W+ +C+P S + + + P ASST + CS CD L +
Sbjct: 152 DTGSSLTWL-----QCSPCVVSCHRQVG---PLFDPRASSTYTSVRCSASQCDELQAATL 203
Query: 157 NP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 211
NP C Y Y +++ S G L D + + ++ S GCG
Sbjct: 204 NPSACSASNVCIYQAS-YGDSSFSVGYLSTDTV--------SFGSTSYPSFYYGCGQDNE 254
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQ 270
G + GLIGL ++S+ LA + + SFS C S G + G
Sbjct: 255 GLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGPYNTGHYY 309
Query: 271 STSFLASNGKYIT-YIIGVETCCIGSSCL-----KQTSFKAIVDSGSSFTFLPKEVYETI 324
S + +AS+ + Y I + +G S L + +S I+DSG+ T LP V+ +
Sbjct: 310 SYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTAL 369
Query: 325 AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 384
+ + + + C++ + +L ++P+V + F S + +I
Sbjct: 370 SKAVAQAMAGAQRAPAFSILDTCFEGQASQL-RVPTVVMAFAGGASMKLTTRNVLIDVDD 428
Query: 385 VVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
T CLA P D IG + V++D ++G+S C
Sbjct: 429 STT--CLAFAPTD-STAIIGNTQQQTFSVIYDVAQSRIGFSAGGC 470
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 106/251 (42%), Gaps = 47/251 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ CA Y+ D+ + S+T + L C C +S
Sbjct: 107 DTGSDLIWTQCAPCLLCADQPTPYF-----DVKK-----SATYRALPCRSSRCASLSSPS 156
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-VIIGCGMKQSGGYL 215
K+ C Y YY + S++G+L + G N+ K V+A+ + GCG +G
Sbjct: 157 CFKKMCVYQY-YYGDTASTAGVLANETFTF--GAANSTK--VRATNIAFGCGSLNAG--- 208
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQ--- 269
D G++G G G +S+ S L + FS C S R++FG +
Sbjct: 209 DLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRLYFGVYANLSSTNT 263
Query: 270 ------QSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSF 313
QST F+ + Y + ++ +G+ L + I+DSG+S
Sbjct: 264 SSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSI 323
Query: 314 TFLPKEVYETI 324
T+L ++ YE +
Sbjct: 324 TWLQQDAYEAV 334
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 147/368 (39%), Gaps = 66/368 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ + L D D WIPC C C P S + ++P+AS++ + + C
Sbjct: 117 AQQLLLAVDTSNDAAWIPCSGCAGC-PTS-----------SPFNPAASASYRPVPCGSPQ 164
Query: 149 CDLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C L SC + C +++ Y ++S L +D L A+ V + GC
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSY--ADSSLQAALSQDTL--------AVAGDVVKAYTFGC 214
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFG 262
+ +G P GL+GLG G +S L + +FS C + SG + G
Sbjct: 215 LQRATG---TAAPPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNFSGTLRLG 269
Query: 263 DQG-PATQQSTSFLASNGKYITYII-------GVETCCIGSSCLK---QTSFKAIVDSGS 311
G P ++T LA+ + Y + G + I +S L T ++DSG+
Sbjct: 270 RNGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGT 329
Query: 312 SFTFLPKEVYETIAAEFDRQVND---TITSFEGYPWKCCYKSSSQRLPKLPSVKLMF--- 365
FT L VY + E R+V ++S G+ CY ++ P V L+F
Sbjct: 330 MFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGF--DTCYNTTV----AWPPVTLLFDGM 383
Query: 366 ----PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
P+ N + YGT A V+ + I +RV+FD N +
Sbjct: 384 QVTLPEENVVI-----HTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGR 438
Query: 422 LGWSHSNC 429
+G++ +C
Sbjct: 439 VGFARESC 446
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 82/366 (22%), Positives = 146/366 (39%), Gaps = 59/366 (16%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T S D G DL+W C C C D+ + P SS+ L CS L
Sbjct: 107 AETYSAIMDTGSDLIWTQCKPCKVC----------FDQPTPIFDPEKSSSFSKLPCSSDL 156
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C + + C Y Y +++S+ G+L + GD ++ + + GCG
Sbjct: 157 C-VALPISSCSDGCEYRYSY-GDHSSTQGVLATETFTF---GDASV-----SKIGFGCGE 206
Query: 209 KQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPA 267
G Y G GL+GLG G +S L+++ G+ + S+ + D G A
Sbjct: 207 DNRGRAYSQGA---GLVGLGRGPLS---LISQLGVPKFSYCLTSIDDSKGISTLLVGSEA 260
Query: 268 TQQS---TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--------IVDSGSSFT 314
T +S T + + + Y + +E +G + L ++++F I+DSG++ T
Sbjct: 261 TVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTIT 320
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY----KSSSQRLPKL----PSVKLMFP 366
+L + + EF Q+ + + + C+ S +P+L V L P
Sbjct: 321 YLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKLP 380
Query: 367 QNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
+ N + ++ + VI CL + G + G V+ D E + ++
Sbjct: 381 KENYIIEDSALRVI---------CLTMGSSSG-MSIFGNFQQQNIVVLHDLEKETISFAP 430
Query: 427 SNCQDL 432
+ C L
Sbjct: 431 AQCNQL 436
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 137/333 (41%), Gaps = 45/333 (13%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGT 153
D G +L+W C C C Y +D + P ASST K +SCS C +
Sbjct: 112 DTGSNLIWTQCKPCDDC-------YTQVDP---LFDPKASSTYKDVSCSSSQCTALENQA 161
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN--ALKNSVQASVIIGCGMKQS 211
SC + C Y + Y + + + G D L L S + LKN +IIGCG +
Sbjct: 162 SCSTEDKTCSYLVS-YADGSYTMGKFAVDTLTLGSTDNRPVQLKN-----IIIGCGQNNA 215
Query: 212 GGYLDGVAPDGLIGLGLGEISVPSLLAKAG-LIRNSFSMCF--DKDDSGRIFFGDQ---- 264
+ + + G+ SL+ + G I FS C + D + +I FG
Sbjct: 216 VTFRNKSS-----GVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVS 270
Query: 265 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPKEV 320
GP T + + S + Y + +++ +GS ++ ++ K ++DSG++ T LP +
Sbjct: 271 GPGTVSTPLVVKSRDTF--YYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLTLLPVKY 328
Query: 321 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP-QNNSFVVNNPVFV 379
Y I +N + E CY +++ +P + + F + N F
Sbjct: 329 YIEIENAVASLINADKSKDERIGSSLCYNATADL--NIPVITMHFEGADVKLYPYNSFFK 386
Query: 380 IYGTQVVTGFCLAIQPVDGDIGTIGQ-NFMTGY 411
+ V F ++ +G G + Q NF+ GY
Sbjct: 387 VTEDLVCLAFGMSFYR-NGIYGNVAQKNFLVGY 418
>gi|194706442|gb|ACF87305.1| unknown [Zea mays]
Length = 83
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 4/77 (5%)
Query: 420 LKLGWSHSNCQDLNDGTKSPLTPGP-GTPSNPLPANQEQSSPGGHAVGPAVAGRAPSKPS 478
+KLGW S C+ + D T PL P +P +PLP+N++Q+SP AV PA AG AP +
Sbjct: 1 MKLGWYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTSP---AVTPATAGTAPLSCA 57
Query: 479 TASTQLISSRSSSLKVL 495
T + Q++ + S L +L
Sbjct: 58 TTNLQMLLASSYPLLLL 74
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 145/370 (39%), Gaps = 47/370 (12%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F + Q +K + D G D+ W+ C C C Y D + P +
Sbjct: 152 SGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDC-------YQQTDP---IFDPRS 201
Query: 136 SSTSKHLSCSHRLCD-LGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
SS+ L C + C L TS C+ K C Y + Y S + E ++ ++ G++
Sbjct: 202 SSSFASLPCESQQCQALETSGCRASK--CLYQVSY----GDGSFTVGEFVIETLTFGNSG 255
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-D 252
+ N+V +GCG G ++ L + SL + + +SFS C D
Sbjct: 256 MINNV----AVGCGHDNEGLFVGSAG--------LLGLGGGSLSLTSQMKASSFSYCLVD 303
Query: 253 KDDSGR--IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--- 305
+D S + F P+ + L S Y +G+ +G L F+
Sbjct: 304 RDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDS 363
Query: 306 -----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLP 359
IVDSG++ T L + Y T+ F + + G+ + CY SSQ +P
Sbjct: 364 GYGGIIVDSGTAITRLQTQAYNTLRDAFVSRT-PYLKKTNGFALFDTCYDLSSQSRVTIP 422
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
+V F S + ++I V T FC A P + IG G RV +D N
Sbjct: 423 TVSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTRVHYDLAN 481
Query: 420 LKLGWSHSNC 429
+G+S C
Sbjct: 482 SVVGFSPHKC 491
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 152/369 (41%), Gaps = 61/369 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL----CDLGT 153
D G DL+W +CAP S+ + + Y+PS+S+T L C+ L L
Sbjct: 104 DTGSDLIW-----TQCAPCSSQCFQ---QPTPLYNPSSSTTFAVLPCNSSLSMCAAALAG 155
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
+ P C Y M Y + TS + + G + + GC SGG
Sbjct: 156 TTPPPGCTCMYNMTYGSGWTS----VYQGSETFTFGSSTPANQTGVPGIAFGCS-NASGG 210
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFG------D 263
+ A GL+GLG G + SL+++ G+ + FS C D + + + G D
Sbjct: 211 FNTSSA-SGLVGLGRGSL---SLVSQLGVPK--FSYCLTPYQDTNSTSTLLLGPSASLND 264
Query: 264 QGPATQQSTSFLASNGKY---ITYIIGVETCCIGSSCLK----QTSFKA------IVDSG 310
G + ST F+AS Y + + +G++ L S KA I+DSG
Sbjct: 265 TGGVS--STPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSG 322
Query: 311 SSFTFLPKEVYETIAAEFDRQVNDTITSFEG----YPWKCCYK--SSSQRLPKLPSVKLM 364
++ T L Y+ + A V T+ + +G C++ SS+ P +PS+ L
Sbjct: 323 TTITLLGNTAYQQVRAAVVSLV--TLPTTDGGSAATGLDLCFELPSSTSAPPTMPSMTLH 380
Query: 365 FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQ-PVDGDIGTIGQNFMTGYRVVFDRENLKLG 423
F V+ +++ + + +CLA+Q DG + +G +++D L
Sbjct: 381 F-DGADMVLPADSYMMLDSNL---WCLAMQNQTDGGVSILGNYQQQNMHILYDVGQETLT 436
Query: 424 WSHSNCQDL 432
++ + C L
Sbjct: 437 FAPAKCSTL 445
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 103/470 (21%), Positives = 169/470 (35%), Gaps = 73/470 (15%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHR-------FSEEVKALGVSKNRNATSW 53
M + L+ + L+T + + KL HR S +G + R++
Sbjct: 23 MQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLIS 82
Query: 54 PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCDCVRC 113
+ S ++ L S + T F + +K + D G +L W+ C
Sbjct: 83 RKRNSTVGVKMDLGSGID---YGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCR---- 135
Query: 114 APLSASYYNSLDRDLNE-YSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYT 165
Y + +D + S + K + C + C + T+C P PC Y
Sbjct: 136 -------YRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY- 187
Query: 166 MDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQASV---IIGCGMKQSGGYLDGVAPD 221
DY Y + +++ G+ ++ + + L N A + +IGC +G G D
Sbjct: 188 -DYRYADGSAAQGVFAKETITV------GLTNGRMARLPGHLIGCSSSFTGQSFQGA--D 238
Query: 222 GLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLA 276
G++GL + S S L FS C +K+ S + FG + T+F
Sbjct: 239 GVLGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGS---SRSTKTAFRR 293
Query: 277 SNGKYITYI------------IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYET 323
+ +T I +G + I S TS I+DSG+S T L Y+
Sbjct: 294 TTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQ 353
Query: 324 IAAEFDRQ-VNDTITSFEGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 381
+ R V EG P + C+ +S + KLP + F + +++
Sbjct: 354 VVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVD 413
Query: 382 GTQVVT--GFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V GF A P IG I Q Y FD L ++ S C
Sbjct: 414 AAPGVKCLGFVSAGTPATNVIGNIMQQ---NYLWEFDLMASTLSFAPSAC 460
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 139/354 (39%), Gaps = 60/354 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD----LGT 153
D G D+ W+ C P Y+ D + P+ SS+ + C+ C
Sbjct: 160 DTGSDVSWVQCKPCPSPPC----YSQRD---PLFDPTRSSSYSAVPCAAASCSQLALYSN 212
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + C Y + Y + ++++G+ D L L G NALK + GCG Q G
Sbjct: 213 GCSGGQ--CGYVVSY-GDGSTTTGVYSSDTLTLT--GSNALKG-----FLFGCGHAQQG- 261
Query: 214 YLDGVAPDGLIGLGL-GEISVPSLLAKAGLIRNS-FSMCFDKDDSGRIFFGDQGPATQ-- 269
GV DGL+GLG G+ SL+++A FS C + + GP++
Sbjct: 262 LFAGV--DGLLGLGRQGQ----SLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSSTAG 315
Query: 270 -QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--AIVDSGSSFTFLPKEVYETI 324
+T L ++ YI+ + +G L + F A+VD+G+ T LP Y +
Sbjct: 316 FSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSAL 375
Query: 325 AAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 377
+ F + GYP CY + LP++ + F + +
Sbjct: 376 RSAFRAAMAP-----YGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGT-- 428
Query: 378 FVIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++T CLA P GD +G + V FD +G+ ++C
Sbjct: 429 -----SGILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFDGST--VGFMPASC 475
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 150/367 (40%), Gaps = 74/367 (20%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTS 154
D G DL+W C C +C S ++ P SS+ LSCS +LC+ +S
Sbjct: 115 DTGSDLIWTQCKPCTQCFHQSTPIFD----------PKKSSSFSKLSCSSQLCEALPQSS 164
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-G 213
C N C Y + Y + +S+ G+L + L G ++ N V GCG G G
Sbjct: 165 CNNG---CEY-LYSYGDYSSTQGILASETLTF---GKASVPN-----VAFGCGADNEGSG 212
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFG-----DQG 265
+ G GL+GLG G +S+ S L + FS C D + + G +
Sbjct: 213 FSQGA---GLVGLGRGPLSLVSQLKEP-----KFSYCLTTVDDTKTSTLLMGSLASVNAS 264
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--------IVDSGSSFTF 315
+ ++T + S Y + +E +G + L K+++F I+DSG++ T+
Sbjct: 265 SSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITY 324
Query: 316 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKL----PSVKLMFPQ 367
L + + +A EF ++N + S C+ S++ +PKL L P
Sbjct: 325 LEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGADLELPA 384
Query: 368 NNSFVVNNPVFVIYGTQVVTGFCLAIQPVDG--DIGTIGQNFMTGYRVVFDRENLKLGWS 425
N + ++ + V CLA+ G G + Q M V+ D E L +
Sbjct: 385 ENYMIGDSSMGVA---------CLAMGSSSGMSIFGNVQQQNML---VLHDLEKETLSFL 432
Query: 426 HSNCQDL 432
+ C L
Sbjct: 433 PTQCDLL 439
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/251 (25%), Positives = 104/251 (41%), Gaps = 47/251 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ CA Y++ S+T + L C C +S
Sbjct: 107 DTGSDLIWTQCAPCLLCADQPTPYFDV----------KKSATYRALPCRSSRCASLSSPS 156
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-VIIGCGMKQSGGYL 215
K+ C Y YY + S++G+L + G N+ K V+A+ + GCG +G
Sbjct: 157 CFKKMCVYQY-YYGDTASTAGVLANETFTF--GAANSTK--VRATNIAFGCGSLNAG--- 208
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQ--- 269
D G++G G G +S+ S L + FS C S R++FG +
Sbjct: 209 DLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRLYFGVYANLSSTNT 263
Query: 270 ------QSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSF 313
QST F+ + Y + ++ +G+ L + I+DSG+S
Sbjct: 264 SSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSI 323
Query: 314 TFLPKEVYETI 324
T+L ++ YE +
Sbjct: 324 TWLQQDAYEAV 334
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 138/359 (38%), Gaps = 61/359 (16%)
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
K + L D G DL W C + P+ S++ ++SCS LC
Sbjct: 145 KDLMLIFDTGSDLTWARCSAAE-----------------TFDPTKSTSYANVSCSTPLCS 187
Query: 151 -LGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
+ ++ NP + T Y Y + + S G L ++ L + G + N GC
Sbjct: 188 SVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTI--GSTDIFNN-----FYFGC 240
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQG 265
G G L G A GL+GLG ++SV S A FS C S G + FG
Sbjct: 241 GQDVDG--LFGKAA-GLLGLGRDKLSVVSQTAPK--YNQLFSYCLPSSSSTGFLSFGSSQ 295
Query: 266 PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEV 320
+ + T S+G Y + + +G L ++ I+DSG+ T LP
Sbjct: 296 SKSAKFTPL--SSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRLPPAA 353
Query: 321 YETIAAEFDRQVNDTITSFEGYPW-------KCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
Y + + F + + YP CY S + K+P + + F V
Sbjct: 354 YSALRSAFRK-------AMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVDVDV 406
Query: 374 NNP-VFVIYGTQVVTGFCLAIQPVDG--DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ +FV G + V CLA G D G + VV+D K+G++ ++C
Sbjct: 407 DQAGIFVANGLKQV---CLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASC 462
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 105/256 (41%), Gaps = 51/256 (19%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
++ IS+ +++ +F + +G F+ KLI R S + NRN P S
Sbjct: 7 IHLISILLFVFIFPHIEAHNGG----FTGKLIPRNSSKDFF-----NRNTIQSPV--SAN 55
Query: 61 YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPC-DCVRCAPLSAS 119
+Y L+ + +K Q D G DL+W+ C C C
Sbjct: 56 HYDYLMELSIGTPPVKIYAQ----------------ADTGSDLIWLQCIPCTNC------ 93
Query: 120 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSG 177
Y L+ + S SST +++C C TSC + C Y Y + + + G
Sbjct: 94 -YKQLNPMFDSQS---SSTFSNIACGSESCSKLYSTSCSPDQINCKYNYS-YVDGSETQG 148
Query: 178 LLVEDILHLIS--GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 235
+L ++ L L S G A K VI GCG +G + D G+IGLG G +S+ S
Sbjct: 149 VLAQETLTLTSTTGEPVAFK-----GVIFGCGHNNNGAFND--KEMGIIGLGRGPLSLVS 201
Query: 236 LLAKAGLIRNSFSMCF 251
+ + L N FS C
Sbjct: 202 QIGSS-LGGNMFSQCL 216
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 85/373 (22%), Positives = 146/373 (39%), Gaps = 66/373 (17%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT--- 153
D G DL W+ C C+ C ++ + P+AS + ++++C C L
Sbjct: 170 DTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASLSYRNVTCGDPRCGLVAPPT 219
Query: 154 ---SCQNPK-QPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGM 208
+C+ P PCPY Y ++ ++ L +E ++L + G + + V + GCG
Sbjct: 220 APRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDV----VFGCGH 275
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQG 265
G + GL L S L A G ++FS C S +I FGD
Sbjct: 276 SNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG---HAFSYCLVDHGSSVGSKIVFGDDD 330
Query: 266 -----PATQQSTSFLASNGKYITY--------IIGVETCCIGSSCL---KQTSFKAIVDS 309
P + ++ T+ ++G E I S K S I+DS
Sbjct: 331 ALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDS 390
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLM---- 364
G++ ++ + YE I F +++ +P CY S ++P L+
Sbjct: 391 GTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADG 450
Query: 365 ----FPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNF-MTGYRVVFDREN 419
FP N FV +P ++ CLA+ +I NF + V++D +N
Sbjct: 451 AVWDFPAENYFVRLDPDGIM---------CLAVLGTPRSAMSIIGNFQQQNFHVLYDLQN 501
Query: 420 LKLGWSHSNCQDL 432
+LG++ C ++
Sbjct: 502 NRLGFAPRRCAEV 514
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 91/367 (24%), Positives = 150/367 (40%), Gaps = 65/367 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T+ L D D W+PC CV C+ ++P+ S+T K + C
Sbjct: 108 AQTLLLAMDTSNDASWVPCTACVGCS------------TTTPFAPAKSTTFKKVGCGASQ 155
Query: 149 CDLGTSCQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
C +NP C + Y T + ++S LV+D + L + A G
Sbjct: 156 CK---QVRNPTCDGSACAFNFTYGTSSVAAS--LVQDTVTLATDPVPAYA--------FG 202
Query: 206 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFF 261
C K +G V P GL+GLG G +S+ + K L +++FS C + SG +
Sbjct: 203 CIQKVTG---SSVPPQGLLGLGRGPLSLLAQTQK--LYQSTFSYCLPSFKTLNFSGSLRL 257
Query: 262 GDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----KQTSFKA------IVDSG 310
G P + T L + + Y + + +G + + +F A + DSG
Sbjct: 258 GPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSG 317
Query: 311 SSFTFLPKEVYETIAAEFDRQV----NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP 366
+ FT L + Y + EF R++ T+TS G+ CY + P++ MF
Sbjct: 318 TVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGF--DTCYTAPI----VAPTITFMFS 371
Query: 367 QNNSFVVNNPVFVIYGTQVVTGFCLAIQP----VDGDIGTIGQNFMTGYRVVFDRENLKL 422
N + + + + VT CLA+ P V+ + I +RV+FD N +L
Sbjct: 372 GMNVTLPPDNILIHSTAGSVT--CLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRL 429
Query: 423 GWSHSNC 429
G + C
Sbjct: 430 GVARELC 436
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 84/353 (23%), Positives = 137/353 (38%), Gaps = 58/353 (16%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD----LGT 153
D G D+ W+ C P Y+ D + P+ SS+ + C+ C
Sbjct: 149 DTGSDVSWVQCKPCPSPPC----YSQRD---PLFDPTRSSSYSAVPCAAASCSQLALYSN 201
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
C + C Y + Y + ++++G+ D L L G NALK + GCG Q G
Sbjct: 202 GCSGGQ--CGYVVSY-GDGSTTTGVYSSDTLTLT--GSNALKG-----FLFGCGHAQQG- 250
Query: 214 YLDGVAPDGLIGLGL-GEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ--- 269
GV DGL+GLG G+ V + G + FS C + + GP++
Sbjct: 251 LFAGV--DGLLGLGRQGQSLVSQASSTYGGV---FSYCLPPTQNSVGYISLGGPSSTAGF 305
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--AIVDSGSSFTFLPKEVYETIA 325
+T L ++ YI+ + +G L + F A+VD+G+ T LP Y +
Sbjct: 306 STTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALR 365
Query: 326 AEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
+ F + GYP CY + LP++ + F + +
Sbjct: 366 SAFRAAMAP-----YGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGT--- 417
Query: 379 VIYGTQVVTGFCLAIQPVDGD--IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ ++T CLA P GD +G + V FD +G+ ++C
Sbjct: 418 ----SGILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFDGST--VGFMPASC 464
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 147/368 (39%), Gaps = 66/368 (17%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++ + L D D WIPC C C P S + ++P+AS++ + + C
Sbjct: 64 AQQLLLAVDTSNDAAWIPCSGCAGC-PTS-----------SPFNPAASASYRPVPCGSPQ 111
Query: 149 CDLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
C L SC + C +++ Y ++S L +D L A+ V + GC
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSY--ADSSLQAALSQDTL--------AVAGDVVKAYTFGC 161
Query: 207 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFG 262
+ +G P GL+GLG G +S L + +FS C + SG + G
Sbjct: 162 LQRATG---TAAPPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNFSGTLRLG 216
Query: 263 DQG-PATQQSTSFLASNGKYITYII-------GVETCCIGSSCLK---QTSFKAIVDSGS 311
G P ++T LA+ + Y + G + I +S L T ++DSG+
Sbjct: 217 RNGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGT 276
Query: 312 SFTFLPKEVYETIAAEFDRQVND---TITSFEGYPWKCCYKSSSQRLPKLPSVKLMF--- 365
FT L VY + E R+V ++S G+ CY ++ P V L+F
Sbjct: 277 MFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGF--DTCYNTTV----AWPPVTLLFDGM 330
Query: 366 ----PQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
P+ N + YGT A V+ + I +RV+FD N +
Sbjct: 331 QVTLPEENVVI-----HTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGR 385
Query: 422 LGWSHSNC 429
+G++ +C
Sbjct: 386 VGFARESC 393
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 142/368 (38%), Gaps = 65/368 (17%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSC 155
D G DL+W +CAP ++ L + ++P S++ + + C+ LC L SC
Sbjct: 114 DTGSDLIW-----TQCAPCASC----LSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSC 164
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
+ P C Y +Y + T + G+ + S + + GCG G
Sbjct: 165 ERPDT-CTYRYNY-GDGTMTVGVYATERFTFASS-GGGGLTTTTVPLGFGCGSVNVGSLN 221
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---IFFGD-----QGPA 267
+G G++G G +S+ S L+ IR FS C S R + FG G A
Sbjct: 222 NG---SGIVGFGRNPLSLVSQLS----IRR-FSYCLTSYASRRQSTLLFGSLSDGVYGDA 273
Query: 268 TQ--QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK--------AIVDSGSSFTF 315
T Q+T L S Y + +G+ L+ +++F IVDSG++ T
Sbjct: 274 TGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTL 333
Query: 316 LPKEVYETIAAEFDRQVN----------DTITSFEGYPWKCCYKSSSQRLPKL----PSV 361
LP V + F +Q+ D + W+ +S +P++
Sbjct: 334 LPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGA 393
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
L P+ N +V+++ CL + D TIG RV++D E
Sbjct: 394 DLDLPRRN-YVLDD--------HRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAET 444
Query: 422 LGWSHSNC 429
L + + C
Sbjct: 445 LSIAPARC 452
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 142/347 (40%), Gaps = 37/347 (10%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQ 156
D G DL W V+C P +S + +D + PS SST + C C G C
Sbjct: 167 DTGSDLSW-----VQCQPCGSSGHCHPQQD-PLFDPSKSSTYAAVHCGEPQCAAAGGLCS 220
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 216
C Y + +Y + +S++G+L D L L S + A GCG + G D
Sbjct: 221 EDNTTCLYLV-HYGDGSSTTGVLSRDTLALTS-------SRALAGFPFGCGTRNLG---D 269
Query: 217 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQ----Q 270
DGL+GLG GE+S+PS A + FS C +S G + G PAT Q
Sbjct: 270 FGRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLTIGAT-PATDTGAAQ 326
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEVYETIA 325
T+ L Y + + + IG L T ++DSG+ T+LP + YE +
Sbjct: 327 YTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGGTLLDSGTVLTYLPAQAYELLR 386
Query: 326 AEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV 385
F + + CY + + +P+V F F ++ +I+ +
Sbjct: 387 DRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFFGVMIFLDEN 446
Query: 386 VTGFCLAIQPVDGD---IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
V CLA +D + IG V++D K+G+ ++C
Sbjct: 447 VG--CLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 120/294 (40%), Gaps = 35/294 (11%)
Query: 155 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 214
C N C Y Y + + S G L +D+L L + + + GCG G
Sbjct: 181 CSNATGACVYKASY-GDTSFSIGYLSQDVLTLTPSA------APSSGFVYGCGQDNQG-- 231
Query: 215 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQ 269
L G + G+IGL ++S+ L+ N+FS C + +S F G ++
Sbjct: 232 LFGRSA-GIIGLANDKLSMLGQLSNK--YGNAFSYCLPSSFSAQPNSSVSGFLSIGASSL 288
Query: 270 QSTSF----LASNGKYIT-YIIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEV 320
S+ + L N K + Y +G+ T + L ++ I+DSG+ T LP +
Sbjct: 289 SSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVAI 348
Query: 321 YETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSF---VVNNP 376
Y + F ++ G+ C+K S + + +P ++++F V N+
Sbjct: 349 YNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNSL 408
Query: 377 VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
V + GT CLAI I IG + V +D N K+G++ CQ
Sbjct: 409 VEIEKGTT-----CLAIAASSNPISIIGNYQQQTFTVAYDVANSKIGFAPGGCQ 457
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/343 (26%), Positives = 138/343 (40%), Gaps = 65/343 (18%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G + W C CVRC S +++ PSAS T SC + ++
Sbjct: 180 DTGSSITWTQCKPCVRCLKASRRHFD----------PSASLTYSLGSC------IPSTVG 223
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS-VQASVIIGCGMKQSGGYL 215
N Y M Y ++TS + + L++S V GCG G +
Sbjct: 224 NT-----YNMTYGDKSTSVGNYGCDTM---------TLEHSDVFPKFQFGCGRNNEGDF- 268
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQSTSF 274
G DG++GLG G++S S A + FS C ++DS G + FG++ AT QS+S
Sbjct: 269 -GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFGEK--ATSQSSSL 323
Query: 275 -------------LASNGKYITYI----IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 317
L +G Y + +G + I SS S I+DSG+ T LP
Sbjct: 324 KFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF--ASPGTIIDSGTVITRLP 381
Query: 318 KEVYETIAAEFDRQVNDTITS----FEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
+ Y + A F + + S +G CY S ++ LP + L F + +
Sbjct: 382 QRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRL 441
Query: 374 NNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFD 416
N VI+G + CLA + ++ IG V++D
Sbjct: 442 NGKR-VIWGND-ASRLCLAFAG-NSELTIIGNRQQVSLTVLYD 481
>gi|115398434|ref|XP_001214806.1| hypothetical protein ATEG_05628 [Aspergillus terreus NIH2624]
gi|114191689|gb|EAU33389.1| hypothetical protein ATEG_05628 [Aspergillus terreus NIH2624]
Length = 486
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 147/377 (38%), Gaps = 67/377 (17%)
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS- 211
T C++ PC + Y + +S+ + D + G A + V ++ IG +
Sbjct: 102 TLCESSSDPCSASGSYNPDKSSTYNFVSSDFNISYADGTGAAGDYVTDTLHIGGATIKDF 161
Query: 212 ---GGYLDGVAPDGLIGLG----------LGEISVPSL---LAKAGLIR-NSFSMCFDK- 253
GY G + +G++G+G LG+ S P+L + K GLIR N++S+ +
Sbjct: 162 QFGVGYYSG-SSEGVLGIGYPSNEVQVGRLGKSSYPNLPQAMVKNGLIRSNAYSLWLNDL 220
Query: 254 -DDSGRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLKQT------S 302
+G I FG A Q+ NG Y +I + I S Q
Sbjct: 221 SASTGSILFGGVNKAKYHGELQTLPVQPVNGGYSELLIALTAVSIKSDSDSQNYTSDALP 280
Query: 303 FKAIVDSGSSFTFLP----KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 358
++DSGSS T+LP +E+Y + ++ +S G+ KC SS +L
Sbjct: 281 AAVLLDSGSSLTYLPNSIVEEIYNNLGVVYES------SSGVGFV-KCSLAESSVKLSYT 333
Query: 359 ---PSV-----KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTG 410
P++ +L+ + N I+G I P +G F+
Sbjct: 334 FSSPTINVGIDELVIDAGDIRFRNGDRACIFG----------IAPAGSSTAVLGDTFLRS 383
Query: 411 YRVVFDRENLKLGWSHSNCQDLND-----GTKSPLTPGPGTPSNPLPANQEQSSPGGHAV 465
VV+D N ++ +++N +D GT PG +NP+ + S G +
Sbjct: 384 AYVVYDLANNEISLANTNFNSTDDDIVEIGTGDDAVPGATNVANPVTSVVADGS--GARI 441
Query: 466 GPAVAGRAPSKPSTAST 482
G G PS S+
Sbjct: 442 GGPTGGVFTDLPSATSS 458
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 143/377 (37%), Gaps = 44/377 (11%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F L ++++ + D G DL W+ C C C Y D + P
Sbjct: 51 SGEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSC-------YKQAD---PIFDPRN 100
Query: 136 SSTSKHLSCSHRLCDLGT--SCQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISGG 190
SS+ + + C LC SC + C Y + Y + + S G D+ L +G
Sbjct: 101 SSSFQRIPCLSPLCKALEVHSCSGSRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTG- 158
Query: 191 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 250
S SV GCG G + GL L S + NSFS C
Sbjct: 159 ------SKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYC 212
Query: 251 F-DKDD-----SGRIFFGDQGPATQQSTSFLASNGK----YITYIIGVET------CCIG 294
D+ + S + FG + + S L N K Y +IGV +
Sbjct: 213 LVDRSNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLK 272
Query: 295 SSCLKQT-SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSS 352
S L Q+ S I+DSG+S T P VY TI F R + S Y + CY S
Sbjct: 273 SLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAF-RNATINLPSAPRYSLFDTCYNFSG 331
Query: 353 QRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYR 412
+ +P++ L F +N + + P + FCLA P ++G IG +R
Sbjct: 332 KASVDVPALVLHF-ENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFR 390
Query: 413 VVFDRENLKLGWSHSNC 429
+ FD + L ++ C
Sbjct: 391 IGFDLQKSHLAFAPQQC 407
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 85/353 (24%), Positives = 137/353 (38%), Gaps = 56/353 (15%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---- 150
D D+ W+ PC C P S+Y+ PS S +S SCS C
Sbjct: 164 DSASDVPWVQCVPCPIPPCHPQVDSFYD----------PSRSPSSAPFSCSSPTCTALGP 213
Query: 151 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
C N + C Y + Y + +S+SG + D+L L +G NA+ + GC +
Sbjct: 214 YANGCANNQ--CQYLVR-YPDGSSTSGAYIADLLTLDAG--NAV-----SGFKFGCSHAE 263
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ 270
G + A G++ LG G S+ L A N+FS C S FF P
Sbjct: 264 QGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFFTLGVPRRAS 319
Query: 271 STSFLASNGKY----ITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFLPKEVYE 322
S + ++ Y + + T +G L F A ++DS ++ T LP Y+
Sbjct: 320 SRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQ 379
Query: 323 TIAAEFDRQVNDTITSFEGYPWK----CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVF 378
+ + F ++T + P K CY + +LP + L+F N+ + +P
Sbjct: 380 ALRSAF----RSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVF-DRNAVLPLDPSG 434
Query: 379 VIYGTQVVTGFCLAIQPVDGDI--GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+++ CLA D G +G V++D +G+ C
Sbjct: 435 ILFND------CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 145/370 (39%), Gaps = 62/370 (16%)
Query: 91 KTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
K L D G DL W+ CD C C +L D Y P + + C L
Sbjct: 66 KVFELDIDTGSDLTWVQCDAPCTGC---------TLPHD-RLYKPH----NNVVRCGEPL 111
Query: 149 CDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQAS 201
C + C+NP C Y ++Y ++ SS G+LV+D L L +G + +
Sbjct: 112 CSALFSASKSPCKNPNDQCDYEVEY-ADHGSSIGVLVKDPVPLRLTNG------TILAPN 164
Query: 202 VIIGCGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRI 259
+ GCG Q +GG G++GLG + ++ + L+ +RN C +
Sbjct: 165 LGFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGGGFLF 224
Query: 260 FFGDQGPATQQS-TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 318
F GD P++ S L + G Y G G + + DSGSS+T+
Sbjct: 225 FGGDLVPSSGMSWMPILRTPGG--KYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYFNS 282
Query: 319 EVYETIAAEFDRQVNDTITSFEGYPWK---------CCYKSSSQRLPKLPSVKLMF-PQN 368
+VY + +N +G P + C+K S+ + V+ F P
Sbjct: 283 QVYGAV-------LNLLRNGLKGQPLRDAPEDKTLPICWK-GSKAFKSVADVRNFFKPLA 334
Query: 369 NSFVVNNPVFVIYGTQVVT-----GFCLAI----QPVDGDIGTIGQNFMTGYRVVFDREN 419
SF + F I + CL I Q G++ IG M +V+D E
Sbjct: 335 LSFGNSKVQFQIPPEAYLIISNLGNVCLGILNGSQVGLGNVNLIGDISMLDKMMVYDNER 394
Query: 420 LKLGWSHSNC 429
++GW+ +NC
Sbjct: 395 QQIGWAPANC 404
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/391 (22%), Positives = 144/391 (36%), Gaps = 53/391 (13%)
Query: 80 QFQMLFPSQGSKTMSLGNDFGCDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 137
+F++ P+Q L D G DL W+ C A ++S S + P S
Sbjct: 98 RFRVGTPAQ---PFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 138 TSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
T + C+ C ++C P PC Y Y + + + E +S +
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSS 214
Query: 193 ALKNSVQAS----VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
+ KN V+ + +++GC +G + A DG++ LG +S S A FS
Sbjct: 215 SSKNKVKKAKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFAS--HAASRFGGRFS 270
Query: 249 MCF-----DKDDSGRIFFGDQ-----------GPATQQSTSFLASNGKYITYIIGVETCC 292
C ++ + + FG GP +Q+ L S + Y + ++
Sbjct: 271 YCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPF-YDVSIKAIS 329
Query: 293 IGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 344
+ LK IVDSG+S T L K Y + A +++ P+
Sbjct: 330 VDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLA-RFPRVAMDPF 388
Query: 345 KCCYK----SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQ--PVDG 398
+ CY S LP + + F + + +VI V C+ +Q P G
Sbjct: 389 EYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVK--CIGVQEGPWPG 446
Query: 399 DIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
I IG + FD +N +L + S C
Sbjct: 447 -ISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 103/260 (39%), Gaps = 46/260 (17%)
Query: 222 GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSGRIFFGDQG----------PATQQ 270
GLIG+ G +S + + GL FS C +D SG + FG+ P Q
Sbjct: 441 GLIGMNRGSLS---FVTQMGL--QKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQI 495
Query: 271 STSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLPKEV 320
ST + + Y + +E + +S L+ + + +VDSG+ FTFL V
Sbjct: 496 STPLPYFD--RVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPV 553
Query: 321 YETIAAEFDRQVNDTITSFEGYPW------KCCYKSSSQR--LPKLPSVKLMFPQNNSFV 372
Y + EF RQ ++ E + CY+ R LP LP+V LMF V
Sbjct: 554 YTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSV 613
Query: 373 VNNPVF-----VIYGTQVVTGFCLAIQPVDG-DIGTIGQNFMTGYRVVFDRENLKLGWSH 426
+ VI G+ V F + G + IG + + FD ++G++
Sbjct: 614 SAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAE 673
Query: 427 SNC----QDLNDGTKSPLTP 442
C Q L G + L P
Sbjct: 674 VRCDLAGQRLGVGIRVKLPP 693
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/251 (25%), Positives = 104/251 (41%), Gaps = 47/251 (18%)
Query: 98 DFGCDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 156
D G DL+W C C+ CA Y++ S+T + L C C +S
Sbjct: 2 DTGSDLIWTQCAPCLLCADQPTPYFDV----------KKSATYRALPCRSSRCASLSSPS 51
Query: 157 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-VIIGCGMKQSGGYL 215
K+ C Y YY + S++G+L + G N+ K V+A+ + GCG +G
Sbjct: 52 CFKKMCVYQY-YYGDTASTAGVLANETFTF--GAANSTK--VRATNIAFGCGSLNAG--- 103
Query: 216 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQ--- 269
D G++G G G +S+ S L + FS C S R++FG +
Sbjct: 104 DLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRLYFGVYANLSSTNT 158
Query: 270 ------QSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSF 313
QST F+ + Y + ++ +G+ L + I+DSG+S
Sbjct: 159 SSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSI 218
Query: 314 TFLPKEVYETI 324
T+L ++ YE +
Sbjct: 219 TWLQQDAYEAV 229
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 130/358 (36%), Gaps = 54/358 (15%)
Query: 98 DFGCDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS--HRLCDLG 152
D G DL W+ PC+ C P ++ P AS K L C
Sbjct: 143 DTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCASDACKQLPVDGYDNGCTNN 202
Query: 153 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 212
TS P+ C Y ++Y + G+ + L L S ++V S GCG Q G
Sbjct: 203 TSGMPPQ--CGYAIEY-GNGAITEGVYSTETLALGS-------SAVVKSFRFGCGSDQHG 252
Query: 213 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS- 271
Y D DGL+GLG S+ S A + +FS C +SG F P + +
Sbjct: 253 PY-DKF--DGLLGLGGAPESLVSQTAS--VYGGAFSYCLPPLNSGAGFLTLGAPNSTNNS 307
Query: 272 ------TSFLASNGKYIT-YIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPKEV 320
T A + K T Y++ + +G L F IVDSG+ T +P
Sbjct: 308 NSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAKGNIVDSGTVITGIPTTA 367
Query: 321 YETIAAEFDRQVNDTITSFEGYP--------WKCCYKSSSQRLPKLPSVKLMFPQNNSFV 372
Y+ + F + + YP CY + +P V L F +
Sbjct: 368 YKALRTAFRSAMAE-------YPLLPPADSALDTCYNFTGHGTVTVPKVALTFVGGATVD 420
Query: 373 VNNPVFVIYGTQVVTGFCLAIQPV-DGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ P + V+ CLA DG G IG V++D LG+ C
Sbjct: 421 LDVP------SGVLVEDCLAFADAGDGSFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 100/419 (23%), Positives = 166/419 (39%), Gaps = 70/419 (16%)
Query: 56 KKSFEYYQVLLS--SDVQKQKMKTGPQFQMLFPSQGSKTMSLGN---------------- 97
++ Y+ L+ SD K GP+ + P + +M GN
Sbjct: 60 EERIRYFHSRLAKNSDANASSKKVGPKLAGI-PLKSGLSMGSGNYYVKMGLGSPTKYYTM 118
Query: 98 --DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT-- 153
D G W+ +C P + Y + D ++PSAS T K + CS C
Sbjct: 119 IVDTGSSFSWL-----QCQP--CTIYCHIQED-PVFNPSASKTYKTVPCSSSQCSSLKSA 170
Query: 154 -----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
+C C Y Y +++ S G L +D+L L + +S + GCG
Sbjct: 171 TLNEPTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLT-------PSQTLSSFVYGCGQ 222
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSGRIFFGDQ 264
G L G DG+IGL E+S+ S L+ G N+FS C F +S + F
Sbjct: 223 DNQG--LFGRT-DGIIGLANNELSMLSQLS--GKYGNAFSYCLPTSFSTPNSPKEGFLSI 277
Query: 265 GPAT------QQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFT 314
G ++ + T L + Y I +E+ + L +S+K I+DSG+ T
Sbjct: 278 GTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVIT 337
Query: 315 FLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKL-PSVKLMFPQNNSFV 372
LP VY T+ + ++ G C+K S + ++ P ++++F
Sbjct: 338 RLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQ 397
Query: 373 VNNPVFVIYGTQVVTGF-CLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
+ ++ ++ TG CLA+ I IG +V +D N ++G++ CQ
Sbjct: 398 LKGHNSLV---ELETGITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 148/390 (37%), Gaps = 73/390 (18%)
Query: 98 DFGCDLLWIPC-----DCVRCAPLSASYYNS-------LDRDLNEYSPSASS---TSKHL 142
D G DL W+PC DC+ C Y NS + Y S +S T H
Sbjct: 30 DTGSDLTWVPCGNLSFDCMDC----DDYRNSKLMSAFSPSHSSSSYRDSCASPYCTDIHS 85
Query: 143 S------CSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 195
S C+ C L T + +PCP Y +G L D L + G K
Sbjct: 86 SDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTRDTLRVHEGPARVTK 145
Query: 196 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---- 251
+ + GC Y + P G+ G G +S PS L GL++ FS CF
Sbjct: 146 DIPK--FCFGC---VGSTYHE---PIGIAGFVRGTLSFPSQL---GLLKKGFSHCFLAFK 194
Query: 252 ---DKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSC-----LKQ 300
+ + S + GD +++ Q T L S Y IG+E +G+ L
Sbjct: 195 YANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVGNVSATTVPLNL 254
Query: 301 TSFKA------IVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYP-WKCCYK--S 350
F + ++DSG+++T LP+ Y + + F + T E + CYK
Sbjct: 255 REFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVEMRAGFDLCYKVPC 314
Query: 351 SSQRLPK----LPSVKLMFPQNNSFVV--NNPVFVIYGTQVVTGF-CLAIQPVD----GD 399
+ RL PS+ F N SFV+ N + + T CL Q + G
Sbjct: 315 PNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTVVKCLLFQSMADSDYGP 374
Query: 400 IGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
G G ++V+D E ++G+ +C
Sbjct: 375 AGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 128/329 (38%), Gaps = 49/329 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+KT + D G W+ C+C C ++ S S+T +SC +C
Sbjct: 11 AKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVSCGTSMC 59
Query: 150 DLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA--SV 202
LG S CQ+ + CP+ + Y + ++S G+L +D L + VQ
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDVQKIPGF 109
Query: 203 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF- 261
GC + G G DGL+G+G G +SV L ++ + FS C S R FF
Sbjct: 110 TFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSERGFFS 165
Query: 262 --------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----SFKAIV- 307
G T + T +A + + + + L + S K +V
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF 225
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 367
DSGS +++P + R++ + E + CY S +P++ L F
Sbjct: 226 DSGSELSYIPDRALSVLRQRI-RELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 284
Query: 368 NNSFVV-NNPVFVIYGTQVVTGFCLAIQP 395
F + ++ VFV Q +CLA P
Sbjct: 285 GARFDLGSHGVFVERSVQEQDVWCLAFAP 313
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 135/354 (38%), Gaps = 54/354 (15%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SC 155
D G D W V+C P Y ++ + P SST ++SC+ C DL C
Sbjct: 196 DTGSDTTW-----VQCQPCVVVCYEQQEK---LFDPVRSSTYANVSCAAPACSDLNIHGC 247
Query: 156 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 215
C Y + Y + + S G D L L S +A+K GCG + G +
Sbjct: 248 SGGH--CLYGVQY-GDGSYSIGFFAMDTLTLSS--YDAVKG-----FRFGCGERNEGLFG 297
Query: 216 DGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIFF-----GDQGPATQ 269
+ GL+GLG G+ S+P K G + F+ C +G + + +
Sbjct: 298 EAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLDFGAGSPAAASAR 351
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA---IVDSGSSFTFLPKEVYETI 324
+T L NG Y IG+ +G L Q+ F IVDSG+ T LP Y ++
Sbjct: 352 LTTPMLTDNGPTF-YYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAYSSL 410
Query: 325 AAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP- 376
R + GY CY + +P+V L+F V+
Sbjct: 411 -----RYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 465
Query: 377 -VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
++ +QV F A GD+G +G + + V +D +G+ C
Sbjct: 466 IMYAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 147/357 (41%), Gaps = 49/357 (13%)
Query: 98 DFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 153
D G W+ +C P + Y + D ++PSAS T K + CS C
Sbjct: 121 DTGSSFSWL-----QCQP--CTIYCHIQED-PVFNPSASKTYKTVPCSSSQCSSLKSATL 172
Query: 154 ---SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 210
+C C Y Y +++ S G L +D+L L + +S + GCG
Sbjct: 173 NEPTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLT-------PSQTLSSFVYGCGQDN 224
Query: 211 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSGRIFFGDQGP 266
G L G DG+IGL E+S+ S L+ G N+FS C F +S + F G
Sbjct: 225 QG--LFGRT-DGIIGLANNELSMLSQLS--GKYGNAFSYCLPTSFSTPNSPKEGFLSIGT 279
Query: 267 AT------QQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSFTFL 316
++ + T L + Y I +E+ + L +S+K I+DSG+ T L
Sbjct: 280 SSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRL 339
Query: 317 PKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVN 374
P VY T+ + ++ G C+K S + ++ P ++++F +
Sbjct: 340 PTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLK 399
Query: 375 NPVFVIYGTQVVTGF-CLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQ 430
++ ++ TG CLA+ I IG +V +D N ++G++ CQ
Sbjct: 400 GHNSLV---ELETGITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 144/358 (40%), Gaps = 51/358 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T + D D WIPC+ CV C S++ +NS+ S+T K L C
Sbjct: 100 AQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSV----------TSTTFKTLGCDAPQ 146
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C Q P C + T NT+ G IL ++ AL + GC
Sbjct: 147 CK-----QVPNPTCGGST--CTWNTTYGG---STILSNLTRDTIALSTDIVPGYTFGCIQ 196
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQ 264
K +G V P GL+GLG G +S L L +++FS C + SG + G
Sbjct: 197 KTTG---SSVPPQGLLGLGRGPLSF--LSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPA 251
Query: 265 GPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVDSGSSF 313
G + T+ L N + Y+ I +G + I +S L T I DSG+ F
Sbjct: 252 GQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVF 311
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T L VY + EF ++V + I S G + CY P++ MF N +
Sbjct: 312 TRLVAPVYTAVRDEFRKRVGNAIVSSLG-GFDTCYTGPI----VAPTMTFMFSGMNVTLP 366
Query: 374 NNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + + + +A P V+ + I +R++FD N ++G + C
Sbjct: 367 TDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/371 (22%), Positives = 140/371 (37%), Gaps = 69/371 (18%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T S D G DL+W C C C D+ + P SS+ L CS L
Sbjct: 107 AETYSAIMDTGSDLIWTQCKPCKDC----------FDQPTPIFDPKKSSSFSKLPCSSDL 156
Query: 149 CDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 205
C P C +Y Y + +S+ G+L + A ++ + + G
Sbjct: 157 C-----AALPISSCSDGCEYLYSYGDYSSTQGVLATETF--------AFGDASVSKIGFG 203
Query: 206 CGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR----IF 260
CG G G+ G GL+GLG G +S+ S L + FS C D + +
Sbjct: 204 CGEDNDGSGFSQGA---GLVGLGRGPLSLISQLGEP-----KFSYCLTSMDDSKGISSLL 255
Query: 261 FGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL--KQTSFKA--------IVDS 309
G + T+ L N + Y + +E +G + L ++++F I+DS
Sbjct: 256 VGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDS 315
Query: 310 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY----KSSSQRLPKL----PSV 361
G++ T+L + + EF Q+ + C+ +S+ +P+L
Sbjct: 316 GTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGA 375
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLK 421
L P N + ++ + VI CL + G + G V+ D E
Sbjct: 376 DLKLPAENYIIADSGLGVI---------CLTMGSSSG-MSIFGNFQQQNIVVLHDLEKET 425
Query: 422 LGWSHSNCQDL 432
+ ++ + C L
Sbjct: 426 ISFAPAQCNQL 436
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 100/440 (22%), Positives = 170/440 (38%), Gaps = 97/440 (22%)
Query: 33 HRFSEEVKALGVSKNRNATSWPA--KKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS 90
HRF+ +L KN + +S P + F+Y L+ S + T PQ Q + GS
Sbjct: 41 HRFTT---SLLSRKNPSPSSPPYNFRSRFKYSMALIIS----LPIGTPPQAQQMVLDTGS 93
Query: 91 KTMSLGNDFGCDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 150
+ L WI C + P + + PS SS+ L CSH LC
Sbjct: 94 Q-----------LSWIQCHRKKLPP----------KPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 151 -------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 203
L TSC + + C Y+ +Y + T + G LV++ + + + +I
Sbjct: 133 PRIPDFTLPTSCDSNRL-CHYSY-FYADGTFAEGNLVKEKITFSN-------TEITPPLI 183
Query: 204 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD-------S 256
+GC + S G++G+ G +S +++A + + FS C +
Sbjct: 184 LGCATESSDD-------RGILGMNRGRLS---FVSQAKI--SKFSYCIPPKSNRPGFTPT 231
Query: 257 GRIFFGDQG-------------PATQQSTSF--LASNGKYITYIIGVETCCIGSSCLKQT 301
G + GD P +Q+ + LA I G++ I S +
Sbjct: 232 GSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPD 291
Query: 302 ---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW----KCCYKSSSQR 354
S + +VDSGS FT L Y+ + AE +V + +GY + C+ +
Sbjct: 292 AGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLK--KGYVYGGTADMCFDGNVAM 349
Query: 355 LPKL-PSVKLMFPQNNSFVV-NNPVFVIYGTQVVTGFCLAI---QPVDGDIGTIGQNFMT 409
+P+L + +F + +V V V G + C+ I + IG
Sbjct: 350 IPRLIGDLVFVFTRGVEILVPKERVLVNVGGGI---HCVGIGRSSMLGAASNIIGNVHQQ 406
Query: 410 GYRVVFDRENLKLGWSHSNC 429
V FD N ++G++ ++C
Sbjct: 407 NLWVEFDVTNRRVGFAKADC 426
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 99/427 (23%), Positives = 161/427 (37%), Gaps = 56/427 (13%)
Query: 26 MFSTKLIHRFSEEVKALGVSK----NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
+FS++L R S VK++ RN T P F SS V +G F
Sbjct: 91 LFSSRL-QRDSRRVKSIATLAAQIPGRNVTHAPRTGGFS------SSVVSGLSQGSGEYF 143
Query: 82 QMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 140
L ++ + + D G D++W+ C C RC S ++ P S T
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFD----------PRKSKTYA 193
Query: 141 HLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 198
+ CS C C ++ C Y + Y + + E + +N V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR--------RNRV 245
Query: 199 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 257
+ V +GCG G + V GL+GLG G++S P FS C D+ S
Sbjct: 246 KG-VALGCGHDNEGLF---VGAAGLLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSASS 299
Query: 258 R---IFFGDQGPATQQSTSFLASNGK----YITYIIGVETCCIGSSCLKQTSFK------ 304
+ + FG+ + + L SN K Y ++G+ + + FK
Sbjct: 300 KPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGN 359
Query: 305 --AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 362
I+DSG+S T L + Y + F + + + C+ S+ K+P+V
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVV 419
Query: 363 LMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKL 422
L F + + + T FC A G + IG G+RVV+D + ++
Sbjct: 420 LHFRGADVSLPATNYLIPVDTN--GKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 477
Query: 423 GWSHSNC 429
G++ C
Sbjct: 478 GFAPGGC 484
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 111/449 (24%), Positives = 182/449 (40%), Gaps = 61/449 (13%)
Query: 1 MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFS-----EEVKALGVSKNRNATSWPA 55
+N + L I + S+ +++ FST LIH S + VKA ++K+ S +
Sbjct: 4 VNNLLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAKDTALESTLS 63
Query: 56 KKSFEYYQVLLSSDVQK--QKMKTGPQFQMLFPSQGSKTMSLGN---------DFGCDLL 104
+ ++ L + QK Q P + S +S+GN D G DL
Sbjct: 64 RHAY------LRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLF 117
Query: 105 WIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNPKQPC 162
WI C+ C C YN + S + + C+ C LG Q
Sbjct: 118 WIQCEPCDVCYKQKDPIYNR----------TKSDSYTEMLCNEPPCVSLGREGQCSDSGS 167
Query: 163 PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDG 222
Y + +SGLL + + S + K A V GCG+ Q+ ++ G
Sbjct: 168 CLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKT---AQVGFGCGL-QNLNFITSNRDGG 223
Query: 223 LIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQQSTSFLASN 278
++GLG G +S+ S L+ G + SF+ CF + + G + FGD T + +
Sbjct: 224 VLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDATYLNGDMTPMVIAE 283
Query: 279 GKYITYI-----IGVETCCIGSSCLKQT---SFKAIVDSGSSFTFLPKEVYETIA-AEFD 329
Y+ + +G I SS ++ S I+DSGS+ + P EVYE + A D
Sbjct: 284 FYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVD 343
Query: 330 R-QVNDTITSFEGYPWKCCYKSSSQR-LPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVVT 387
+ + I+ P C++ +R LP P++ L + N + I+ +
Sbjct: 344 KLKKGYNISPLTSSPD--CFEGKIERDLPLFPTLVLYLESTG---ILNDRWSIFLQRYDE 398
Query: 388 GFCLAIQPVDG--DIGTIG-QNFMTGYRV 413
FCL +G IGT+ Q++ GY +
Sbjct: 399 LFCLGFTSGEGLSIIGTLAQQSYKFGYNL 427
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 143/370 (38%), Gaps = 47/370 (12%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F + Q +K + D G D+ W+ C C C Y D + P +
Sbjct: 152 SGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDC-------YQQTDP---IFDPRS 201
Query: 136 SSTSKHLSCSHRLCD-LGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
SS+ L C + C L TS C+ K C Y + Y + + + G V + L G++
Sbjct: 202 SSSFASLPCESQQCQALETSGCRASK--CLYQVSY-GDGSFTVGEFVTETLTF---GNSG 255
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-D 252
+ N V +GCG G ++ L + L + + +SFS C D
Sbjct: 256 MINDV----AVGCGHDNEGLFVGSAG--------LLGLGGGPLSLTSQMKASSFSYCLVD 303
Query: 253 KDDSGR--IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--- 305
+D S + F P+ + L S Y +G+ +G L F+
Sbjct: 304 RDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDS 363
Query: 306 -----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLP 359
IVDSG++ T L + Y T+ F + + G+ + CY SSQ +P
Sbjct: 364 GYGGIIVDSGTAITRLQTQAYNTLRDAFVSRT-PYLKKTNGFALFDTCYDLSSQSRVTIP 422
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
+V F S + ++I V T FC A P + IG G RV +D N
Sbjct: 423 TVSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTRVHYDLAN 481
Query: 420 LKLGWSHSNC 429
+G+S C
Sbjct: 482 SVVGFSPHKC 491
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 101/413 (24%), Positives = 151/413 (36%), Gaps = 101/413 (24%)
Query: 98 DFGCDLLWIPC-----DCVRCAPLSASYYNSLD-RDLNEYSPSASSTSKHLSCSHRLCDL 151
D G DL W+PC DC C Y N++ L + P+ SSTS +C C
Sbjct: 39 DTGSDLTWVPCGNLSFDCQDCE----EYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMD 94
Query: 152 GTSCQNP-------------------KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 192
S NP +PCP Y + +G L D+L + G+
Sbjct: 95 IHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVL--FTHGNY 152
Query: 193 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 251
N+ + C Y + P G+ G G G +S+P L G FS CF
Sbjct: 153 NNNNNNNKQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPFQL---GFSHKGFSHCFL 206
Query: 252 ------DKDDSGRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIG------- 294
+ + S + G+ +++ Q T L S Y IG+E+ IG
Sbjct: 207 PFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFR 266
Query: 295 ---SSCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---- 343
S L++ K ++DSG+++T LP+ +Y + + + + GYP
Sbjct: 267 FGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVI--------GYPRAKQ 318
Query: 344 ------WKCCYK-------SSSQRLPKLPSVKLMFPQNNSFVV---NNPVFVIYGTQVVT 387
+ CYK SS +LPS+ F N S V+ NN +
Sbjct: 319 VELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTV 378
Query: 388 GFCLAIQPVDGDI-----------GTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
CL Q +DG G G VV+D E +LG+ +C
Sbjct: 379 VKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 431
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 117/288 (40%), Gaps = 51/288 (17%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+ ++L D G D++W C C C + L + SAS T + C+ +C
Sbjct: 104 QQVALEVDTGSDVVWTQCRPCFDC----------FTQPLPRFDTSASDTVHGVLCTDPIC 153
Query: 150 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 209
C Y ++Y +N+ + G L +D G + ++ GCG
Sbjct: 154 RALRPHACFLGGCTYQVNY-GDNSVTIGQLAKDSFTFDGKGGGKV---TVPDLVFGCGQY 209
Query: 210 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGDQ-- 264
+G + G+ G G G +S+P L + SFS CF + S +F G
Sbjct: 210 NTGNFHSNET--GIAGFGRGPLSLPRQLGVS-----SFSYCFTTIFESKSTPVFLGGAPA 262
Query: 265 --------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSF--------KAI 306
GP ST FL ++ +Y Y + ++ +G + L +++F I
Sbjct: 263 DGLRAHATGPIL--STPFLPNHPEY--YYLSLKGITVGKTRLAVPESAFVVKADGSGGTI 318
Query: 307 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE--GYPWKCCYKSSS 352
+DSG++ T P+ V+ ++ F QV TS+ G P C+ + S
Sbjct: 319 IDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTES 366
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 117/283 (41%), Gaps = 38/283 (13%)
Query: 164 YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 223
YTM Y +N+ S G+ V D + LK V GCG SGG G A G+
Sbjct: 192 YTMKY-EDNSYSKGVFVCD--------EVTLKPDVFPKFQFGCG--DSGGGEFGTA-SGV 239
Query: 224 IGLGLGEISVPSLLAK-AGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLA---- 276
+GL GE SL+++ A + FS CF + G + FG++ + S F
Sbjct: 240 LGLAKGEQY--SLISQTASKFKKKFSYCFPPKEHTLGSLLFGEKAISASPSLKFTQLLNP 297
Query: 277 -SNGKYITYIIGVETC----CIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQ 331
S Y +IG+ + SS S I+DSG+ T LP YE + F ++
Sbjct: 298 PSGLGYFVELIGISVAKKRLNVSSSLF--ASPGTIIDSGTVITRLPTAAYEALRTAFQQE 355
Query: 332 VNDTITSFEGYPWK----CCY--KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV 385
+ S P + CY K R KLP + L F V +P +++
Sbjct: 356 MLH-CPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVD-VSLHPSGILWANGD 413
Query: 386 VTGFCLAI--QPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSH 426
+T CLA + + IG +VV+D E +LG+ +
Sbjct: 414 LTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGFGN 456
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 94/413 (22%), Positives = 153/413 (37%), Gaps = 83/413 (20%)
Query: 90 SKTMSLGNDFGCDLLWIPC---DCVRCA--PLSASYYNSLDRDLNEYSPSASSTSKHLSC 144
S+ + L D G DL+W PC +C+ C + S ++ L++ + S S S
Sbjct: 90 SQPIFLYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSA 149
Query: 145 SHR------LCDLG---------TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 189
+H LC + + CQ K CP Y + + + L + I +S
Sbjct: 150 AHSNLPSSDLCAISNCPLESIETSDCQ--KHSCPQFYYAYGDGSLIARLYRDSISLPLSN 207
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFS 248
N + N+ GC + P G+ G G G +S+P+ LA + + N FS
Sbjct: 208 PTNLIVNNF----TFGCA------HTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFS 257
Query: 249 MC---------------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIG 287
C +D D+ R G P TS L + Y +G
Sbjct: 258 YCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVG 316
Query: 288 VETCCIGSSCLKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDR---QVND 334
+E IG + F +VDSG++FT LP +Y ++ AEF+ +VN+
Sbjct: 317 LEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNE 376
Query: 335 TITSFEGYPW--KCCYKSSSQRLPKLPSVK-------LMFPQNNSFVVNNPVFVIYGTQV 385
E C Y ++ + ++ P+ N F G +
Sbjct: 377 RARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKR 436
Query: 386 VTGFCLAIQPVD------GDIGTIGQNFMTGYRVVFDRENLKLGWSHSNCQDL 432
G + + D G T+G G+ VV+D EN ++G++ C L
Sbjct: 437 KVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASL 489
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 144/358 (40%), Gaps = 51/358 (14%)
Query: 90 SKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 148
++T + D D WIPC+ CV C S++ +NS+ S+T K L C
Sbjct: 100 AQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSV----------TSTTFKTLGCDAPQ 146
Query: 149 CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 208
C Q P C + T NT+ G IL ++ AL + GC
Sbjct: 147 CK-----QVPNPTCGGST--CTWNTTYGG---STILSNLTRDTIALSTDIVPGYTFGCIQ 196
Query: 209 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIFFGDQ 264
K +G V P GL+GLG G +S L L +++FS C + SG + G
Sbjct: 197 KTTG---SSVPPQGLLGLGRGPLSF--LSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPA 251
Query: 265 GPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVDSGSSF 313
G + T+ L N + Y+ I +G + I +S L T I DSG+ F
Sbjct: 252 GQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVF 311
Query: 314 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 373
T L VY + EF ++V + I S G + CY P++ MF N +
Sbjct: 312 TRLVAPVYTAVRDEFRKRVGNAIVSSLG-GFDTCYTGPI----VAPTMTFMFSGMNVTLP 366
Query: 374 NNPVFVIYGTQVVTGFCLAIQP--VDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ + + + +A P V+ + I +R++FD N ++G + C
Sbjct: 367 PDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 144/380 (37%), Gaps = 61/380 (16%)
Query: 77 TGPQFQMLFPSQGSKTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 135
+G F L K + + D G D++W+ C C +C Y+ D+ + PS
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKC-------YSQTDQ---IFDPSK 176
Query: 136 SSTSKHLSCSHRLCDLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 193
S + + C LC S C C Y + Y + + E +
Sbjct: 177 SKSFAGIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETL---------T 227
Query: 194 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-D 252
+ + V IGCG G + V GL+GLG G +S P+ N FS C D
Sbjct: 228 FRRAAVPRVAIGCGHDNEGLF---VGAAGLLGLGRGGLSFPT--QTGTRFNNKFSYCLTD 282
Query: 253 KDDSGR---IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTS---FK- 304
+ S + I FGD + + L N K T Y + + +G + ++ S F+
Sbjct: 283 RTASAKPSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRL 342
Query: 305 -------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 357
I+DSG+S T L + Y ++ F + + E + CY S K
Sbjct: 343 DSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVK 402
Query: 358 LPSVKLMF-------PQNNSFV-VNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMT 409
+P+V L F P N V V+N FC A + IG
Sbjct: 403 VPTVVLHFRGADVSLPAANYLVPVDN----------SGSFCFAFAGTMSGLSIIGNIQQQ 452
Query: 410 GYRVVFDRENLKLGWSHSNC 429
G+RVVFD ++G++ C
Sbjct: 453 GFRVVFDLAGSRVGFAPRGC 472
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 138/353 (39%), Gaps = 45/353 (12%)
Query: 98 DFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GT 153
D G D++WI C C C Y D + P+AS++ + C +C G+
Sbjct: 151 DSGSDVIWIQCRPCAEC-------YQQAD---PLFDPAASASFTAVPCDSGVCRTLPGGS 200
Query: 154 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 213
S C Y + Y + + + G+L + L GD+ VQ V IGCG + G
Sbjct: 201 SGCADSGACRYQVSY-GDGSYTQGVLAMETLTF---GDS---TPVQG-VAIGCGHRNRGL 252
Query: 214 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--RIFFG--DQGPATQ 269
+ V GL+GLG G +S+ L A S+ + D+G + FG D P
Sbjct: 253 F---VGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLVFGRDDAMPVGA 309
Query: 270 QSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKE 319
L + + Y +G+ +G L + ++D+G++ T LP +
Sbjct: 310 VWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPD 369
Query: 320 VYETIAAEFDRQVNDTITSFEGYPW-KCCYKSSSQRLPKLPSVKLMFPQNNSFVV--NNP 376
Y + F + + G CY S ++P+V L F ++ + +
Sbjct: 370 AYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRDGAALTLPARN 429
Query: 377 VFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDRENLKLGWSHSNC 429
+ V G V +CLA + +G G ++ D N +G+ S C
Sbjct: 430 LLVEMGGGV---YCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPSTC 479
>gi|452821303|gb|EME28335.1| aspartyl protease isoform 2 [Galdieria sulphuraria]
Length = 532
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 126/296 (42%), Gaps = 56/296 (18%)
Query: 169 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 228
Y + T+++G L +DI+ + + SVQA+ ++ +L G A G++GL
Sbjct: 247 YGDGTTATGALYQDIVTV-------GEYSVQAT--FAGADTETANFLVGKAA-GVLGLAY 296
Query: 229 GEIS--------VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD-----QGPATQQSTSFL 275
+S V L ++ + N FS+ ++D + G +GP S L
Sbjct: 297 SSLSCNPTCISPVFHQLVESFSLPNIFSVLINQDIGAFVVGGVNSSLYEGPIEYSS---L 353
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
A+ Y + +E+ + S+ L SF AIVD+G++ +++ + F +
Sbjct: 354 ANEQNPQFYDVTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCNV 413
Query: 336 -----ITSFEGYPW---KCCYKSSSQRLPKLPSVKL---------MFPQNNSF-VVNNPV 377
+S G W C + + L +LP ++ + P++ F V +N +
Sbjct: 414 PGLCPSSSNPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNNI 473
Query: 378 FVIYGTQVVTGFCLAIQP--------VDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
F + +CL IQP DG+ +G Y +VFDREN ++G++
Sbjct: 474 F----SAASGSYCLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFA 525
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 104/251 (41%), Gaps = 13/251 (5%)
Query: 190 GDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 248
G+ NS AS++ GC QSG A DG+ G G ++SV S L G+ FS
Sbjct: 8 GNEQTANS-SASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 66
Query: 249 MCFDKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTS 302
C D+G + G+ T + S Y + + + I SS ++
Sbjct: 67 HCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSN 126
Query: 303 FKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 361
+ IVDSG++ +L Y+ + V+ ++ S +C SSS P+V
Sbjct: 127 TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD-SSFPTV 185
Query: 362 KLMFPQNNSFVVNNPVFVIYGTQVVTG--FCLAIQPVDG-DIGTIGQNFMTGYRVVFDRE 418
L F + V +++ V +C+ Q G +I +G + V+D
Sbjct: 186 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 245
Query: 419 NLKLGWSHSNC 429
N+++GW+ +C
Sbjct: 246 NMRMGWADYDC 256
>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 435
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 88/378 (23%), Positives = 158/378 (41%), Gaps = 63/378 (16%)
Query: 93 MSLGNDFGCDLLWIPCDCVRCAPLSASYYN-SLDRDLNEYSPSASSTSKHLSCSHRLCDL 151
+ L D G +W+ CD +S+SY D L + + S S T++ S C
Sbjct: 62 VKLTVDLGGTFMWVDCDNY----VSSSYTPVRCDSALCKLADSHSCTTECYSSPKPGCYN 117
Query: 152 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNSVQASVIIGCGMKQ 210
T P P + S+SG + D++ L S G +N +V CG
Sbjct: 118 NTCSHIPYNP--------VVHVSTSGDIGLDVVSLQSMDGKYPGRNVSVPNVPFVCG--- 166
Query: 211 SGGYLDGVAPD--GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQ-G 265
+G L+ +A G+ GLG G IS+P+ + A +++ F++C + SG I+FGD G
Sbjct: 167 TGFMLENLADGVLGVAGLGRGNISLPAYFSSALGLQSKFAICLSSLTNSSGVIYFGDSIG 226
Query: 266 PATQQSTSF-------LASNGKYIT------YIIGVETCCIGSSCLK-QTSFKAIVDSGS 311
P + + +++ G Y Y I V+T +G +K + +I + G
Sbjct: 227 PLSSDFLIYTPLVRNPVSTAGAYFEGQSSTDYFIAVKTLRVGGKEIKFNKTLLSIDNEGK 286
Query: 312 S---------FTFLPKEVYETIAAEFDRQVNDTI-TSFEGYPWKCCYKSSSQRL----PK 357
+T L +Y+ + F +Q+ I + P+ CY+S++ + P
Sbjct: 287 GGTRISTVHPYTLLHTSIYKAVIKAFAKQMKFLIEVNPPIAPFGLCYQSAAMDINEYGPV 346
Query: 358 LPSVKLMFPQNNSFVVNNPVFVIYGTQV---VTGFCLAIQPVDGDIG-----TIGQNFMT 409
+P + L+ S + I+G ++ + + + VDG + IG +
Sbjct: 347 VPFIDLVLESQGSV-----YWRIWGANSMVKISSYVMCLGFVDGGLKPDSSIIIGGRQLE 401
Query: 410 GYRVVFDRENLKLGWSHS 427
+ FD + +LG++ S
Sbjct: 402 DNLLQFDLASARLGFTSS 419
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 150/373 (40%), Gaps = 72/373 (19%)
Query: 91 KTMSLGNDFGCDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 149
+T S D G DL+W C C +C D+ + P SS+ LSCS +LC
Sbjct: 111 ETYSAIMDTGSDLIWTQCKPCTQC----------FDQPSPIFDPKKSSSFSKLSCSSQLC 160
Query: 150 DLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 206
P+ C + +Y Y + +S+ G + + G ++ N V GC
Sbjct: 161 K-----ALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTF---GKVSIPN-----VGFGC 207
Query: 207 GMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFG 262
G G G+ G GL+GLG G +S+ S L +A FS C D + + G
Sbjct: 208 GEDNEGDGFTQG---SGLVGLGRGPLSLVSQLKEA-----KFSYCLTSIDDTKTSTLLMG 259
Query: 263 -----DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--------IV 307
+ A ++T + + + Y + +E +G + L K+++F+ I+
Sbjct: 260 SLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLII 319
Query: 308 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKL----P 359
DSG++ T+L + ++ + EF Q+ + + + CY +S +PKL
Sbjct: 320 DSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFT 379
Query: 360 SVKLMFPQNNSFVVNNPVFVIYGTQVVTGFCLAIQPVDGDIGTIGQNFMTGYRVVFDREN 419
L P N + ++ + VI CLA+ G + G V D E
Sbjct: 380 GADLELPGENYMIADSSMGVI---------CLAMGS-SGGMSIFGNVQQQNMFVSHDLEK 429
Query: 420 LKLGWSHSNCQDL 432
L + +NC L
Sbjct: 430 ETLSFLPTNCGQL 442
>gi|452821304|gb|EME28336.1| aspartyl protease isoform 1 [Galdieria sulphuraria]
Length = 456
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 126/296 (42%), Gaps = 56/296 (18%)
Query: 169 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 228
Y + T+++G L +DI+ + + SVQA+ ++ +L G A G++GL
Sbjct: 171 YGDGTTATGALYQDIVTV-------GEYSVQAT--FAGADTETANFLVGKAA-GVLGLAY 220
Query: 229 GEIS--------VPSLLAKAGLIRNSFSMCFDKDDSGRIFFG-----DQGPATQQSTSFL 275
+S V L ++ + N FS+ ++D + G +GP S L
Sbjct: 221 SSLSCNPTCISPVFHQLVESFSLPNIFSVLINQDIGAFVVGGVNSSLYEGPIEYSS---L 277
Query: 276 ASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 335
A+ Y + +E+ + S+ L SF AIVD+G++ +++ + F +
Sbjct: 278 ANEQNPQFYDVTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCNV 337
Query: 336 -----ITSFEGYPW---KCCYKSSSQRLPKLPSVKL---------MFPQNNSF-VVNNPV 377
+S G W C + + L +LP ++ + P++ F V +N +
Sbjct: 338 PGLCPSSSNPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNNI 397
Query: 378 FVIYGTQVVTGFCLAIQP--------VDGDIGTIGQNFMTGYRVVFDRENLKLGWS 425
F + +CL IQP DG+ +G Y +VFDREN ++G++
Sbjct: 398 F----SAASGSYCLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFA 449
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,185,176,977
Number of Sequences: 23463169
Number of extensions: 365816940
Number of successful extensions: 1126488
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 232
Number of HSP's successfully gapped in prelim test: 1858
Number of HSP's that attempted gapping in prelim test: 1122452
Number of HSP's gapped (non-prelim): 2630
length of query: 508
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 361
effective length of database: 8,910,109,524
effective search space: 3216549538164
effective search space used: 3216549538164
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)
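
Note on the statistics above: a minimal sketch, not BLAST's own code, of how the footer values relate to the per-hit "Score" and "Expect" lines. It assumes the standard Karlin-Altschul relations, bit score S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'), and uses the gapped Lambda and K from the second "Lambda K H" row. The function names bit_score and expect_value are hypothetical, chosen only for illustration. All numbers are copied as printed in this report.

```python
import math

# Values copied from the report footer (second "Lambda K H" row = gapped).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
QUERY_LEN = 508
EFFECTIVE_HSP_LEN = 147
EFFECTIVE_DB_LEN = 8_910_109_524   # taken as printed in the report
SEARCH_SPACE = 3_216_549_538_164   # "effective search space" as printed


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score (the number in parentheses) to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Approximate E-value for a raw score over the effective search space."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Footer arithmetic: effective query length and effective search space.
effective_query_len = QUERY_LEN - EFFECTIVE_HSP_LEN
assert effective_query_len == 361
assert effective_query_len * EFFECTIVE_DB_LEN == SEARCH_SPACE

# Raw scores 126 and 127 appear in the hits above as 53.1 and 53.5 bits.
for raw in (126, 127):
    print(f"raw={raw}  bits={bit_score(raw):.1f}  E~{expect_value(raw):.1e}")
```

The bit scores reproduce the reported 53.1 and 53.5 for raw scores 126 and 127. The E-values computed this way should be read as approximations only: the footer parameters are rounded, and the hits were scored with compositional matrix adjustment, so the last printed digit of "Expect" may differ.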