BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 046254
(321 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 111/352 (31%), Positives = 169/352 (48%), Gaps = 54/352 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++K+ IG P L+ + DT +GL WTQC+PC + Q PI+NS + ++Y+ LPC
Sbjct: 91 YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQHQF 150
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR----FGC 120
C ++ F C + C Y I Y T V + D S +N R FGC
Sbjct: 151 CTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQ----------SAENDRIPFYFGC 200
Query: 121 SLESKDFVSIQ-KKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR----L 175
S ++++F + + GI+GLN S + Q+ + +RFS CL D S S L
Sbjct: 201 SRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLL 260
Query: 176 EFGDQI-----------------------------IAGKSLNLPPNSFTIKLNGQRGCIN 206
FG+ I +AG + +PP +F +K +G G I
Sbjct: 261 RFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTII 320
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR-FNSFPSMTYH 265
D G+ +T I Y + F +YF QH +++ + G C+ F+++PSM +H
Sbjct: 321 DSGTAVTYISQTAYFPVITAFKNYFDQHGFQRV-NIQLSGYICYKQQGHTFHNYPSMAFH 379
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
FQGAD VEPE V++ F +P++ +TI+GA +Q NTQF+YD
Sbjct: 380 FQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQ-RTIIGALNQANTQFIYD 430
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 156/337 (46%), Gaps = 41/337 (12%)
Query: 19 KSLWFLLDTVAGLTWTQCQPCKS----CYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHC 74
K+ +F +DT L+W QC+ C++ C+ DP Y S KSYK + C S P C
Sbjct: 99 KTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCNQHSFCEPNQC 158
Query: 75 FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVS---IQ 131
EG C Y +TYG T + +T T + ++++I FGCS +S++ + +
Sbjct: 159 KEGLCAYNVTYGPGSYTSGNLANETFTFY-SNHGKHTALKSISFGCSTDSRNMIYAFLLD 217
Query: 132 KKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKS----- 186
K ++G++G+ W SF+ QLG + +FS C + + + ++ L FG ++ K+
Sbjct: 218 KNPVSGVLGMGWGPRSFLAQLGSISHGKFSYC-ITANNTHNTYLRFGKHVVKSKNLQTTK 276
Query: 187 -----------------------LNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVL 223
LN+ ++ +G RGCI D G++ T++ ++ L
Sbjct: 277 IMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCIIDAGTLATLLVKPIFDTL 336
Query: 224 TAEFIDYFSQHDIEKLFTCRKCGV-TCFNL--PARFNSFPSMTYHFQGADLVVEPENVFI 280
++ S + K + K C+ A + P +T+H + ADL V+PE +F+
Sbjct: 337 HTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLENADLEVKPEAIFL 396
Query: 281 FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F + F + KTI+GA Q +FVYD
Sbjct: 397 FREFEGKNVFCLSMLS-DDSKTIIGAYQQMKQKFVYD 432
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 157/352 (44%), Gaps = 51/352 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++KL IG P +S ++DT + L WTQC+PC+ C++Q+ PI++ + S+ K+ C
Sbjct: 108 NGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCS 167
Query: 65 DASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C + C C Y TYGD T+ V + +T T E +S+ + FGC
Sbjct: 168 SELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTE-DQISIPGLGFGCGN 226
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG---- 178
++ Q AG++GL S + QL +F+ CL D S S L G
Sbjct: 227 DNNGDGFSQG---AGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDSKPSSLLLGSLAN 280
Query: 179 -------DQI-----------------------IAGKSLNLPPNSFTIKLNGQRGCINDC 208
D++ + G L++P ++F + +G G I D
Sbjct: 281 ITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDS 340
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYHF 266
G+ +T +E + L EFI +Q ++ + CFNLPA N P +T+HF
Sbjct: 341 GTTITYVENSAFTSLKNEFI---AQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHF 397
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+GADL + EN I + + A +G +I G Q N V+DL
Sbjct: 398 KGADLELPGENYMIGDSKAGLLCL---AIGSSRGMSIFGNLQQQNFMVVHDL 446
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 157/352 (44%), Gaps = 51/352 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++KL IG P +S ++DT + L WTQC+PC+ C++Q+ PI++ + S+ K+ C
Sbjct: 363 NGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCS 422
Query: 65 DASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C + C C Y TYGD T+ V + +T T E +S+ + FGC
Sbjct: 423 SELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTE-DQISIPGLGFGCGN 481
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG---- 178
++ Q AG++GL S + QL +F+ CL D S S L G
Sbjct: 482 DNNGDGFSQG---AGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDSKPSSLLLGSLAN 535
Query: 179 -------DQI-----------------------IAGKSLNLPPNSFTIKLNGQRGCINDC 208
D++ + G L++P ++F + +G G I D
Sbjct: 536 ITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDS 595
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYHF 266
G+ +T +E + L EFI +Q ++ + CFNLPA N P +T+HF
Sbjct: 596 GTTITYVENSAFTSLKNEFI---AQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHF 652
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+GADL + EN I + + A +G +I G Q N V+DL
Sbjct: 653 KGADLELPGENYMIGDSKAGLLCL---AIGSSRGMSIFGNLQQQNFMVVHDL 701
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/347 (27%), Positives = 150/347 (43%), Gaps = 50/347 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P SL ++DT + L WTQC+PC C+ Q PI+N + S+ LPC
Sbjct: 96 YLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQY 155
Query: 68 CKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
C+ P DC Y YGD T+ + +T T E S SV NI FGC +++
Sbjct: 156 CQDLPSESCYNDCQYTYGYGDGSSTQGYMATETFTF----ETS--SVPNIAFGCGEDNQG 209
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ------ 180
F + AG++G+ W S QLG +FS C+ S S L G
Sbjct: 210 F---GQGNGAGLIGMGWGPLSLPSQLGV---GQFSYCMTSSGSSSPSTLALGSAASGVPE 263
Query: 181 ------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ G +L +P ++F ++ +G G I D G+ LT +
Sbjct: 264 GSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLP 323
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHFQGADLVVE 274
+ Y + F D + +++ + TCF LP+ ++ P ++ F G L +
Sbjct: 324 QDAYNAVAQAFTDQINLSPVDESSSGLS---TCFQLPSDGSTVQVPEISMQFDGGVLNLG 380
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
ENV I + G + ++G +I G Q TQ +YDL
Sbjct: 381 EENVLISPAEGVICLAMGS--SSQQGISIFGNIQQQETQVLYDLQNL 425
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 152/348 (43%), Gaps = 52/348 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++KL IG P ++ ++DT + L WTQC+PCK C++Q PI++ + S+ KLPC
Sbjct: 94 NGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCS 153
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + P C Y +YGD T+ V L T T D SV I FGC E
Sbjct: 154 SDLCAALPISSCSDGCEYLYSYGDYSSTQGV--LATETFAFGD----ASVSKIGFGCG-E 206
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQII 182
D + AG++GL S + QLG +FS CL D S S L G +
Sbjct: 207 DNDGSGFSQG--AGLVGLGRGPLSLISQLGE---PKFSYCLTSMDDSKGISSLLVGSEAT 261
Query: 183 AGKSLNLP----------------------------PNSFTIKLNGQRGCINDCGSVLTV 214
++ P ++F+I+ +G G I D G+ +T
Sbjct: 262 MKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITY 321
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLP--ARFNSFPSMTYHFQGADL 271
+E +A L EFI + D+++ G+ CF LP A P + +HF+GADL
Sbjct: 322 LEDSAFAALKKEFISQL-KLDVDE---SGSTGLDLCFTLPPDASTVDVPQLVFHFEGADL 377
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ EN I DS G +I G Q N ++DL+
Sbjct: 378 KLPAENYII---ADSGLGVICLTMGSSSGMSIFGNFQQQNIVVLHDLE 422
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 143/357 (40%), Gaps = 64/357 (17%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ + IG P + ++DT + L WTQC+PC C+ Q+ P+++ S +Y LPC
Sbjct: 99 NGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCS 158
Query: 65 DASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C C Y TYGD T+ V + +T TL P ++ FGC
Sbjct: 159 STLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTKLP------DVAFGCGD 212
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD--- 179
++ Q AG++GL S + QLG ++FS CL D + S L G
Sbjct: 213 TNEGDGFTQG---AGLVGLGRGPLSLVSQLGL---NKFSYCLTSLDDTSKSPLLLGSLAT 266
Query: 180 --------------------------------QIIAGKSLNLPPNSFTIKLNGQRGCIND 207
+ + LP ++F ++ +G G I D
Sbjct: 267 ISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVD 326
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV---TCFNLPARF---NSFPS 261
G+ +T +E + Y L F KL G+ TCF PA P
Sbjct: 327 SGTSITYLELQGYRALKKAFAAQM------KLPAADGSGIGLDTCFEAPASGVDQVEVPK 380
Query: 262 MTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +H GADL + EN + + +G +I+G Q N QFVYD+
Sbjct: 381 LVFHLDGADLDLPAENYMVLDSGSGALCL---TVMGSRGLSIIGNFQQQNIQFVYDV 434
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 160/355 (45%), Gaps = 56/355 (15%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
++ N +++K+ IG P S +LDT + LTWTQC+PC CY Q PIY+ +Y K
Sbjct: 108 VYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSK 167
Query: 61 LPCYDASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
+PC + C++ + C +C Y +YGD T+ + S ++ TL + S+ +I F
Sbjct: 168 VPCSSSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTL------TSQSLPHIAF 221
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV----QPDKS---- 170
GC E++ Q + G S + QLG+ + ++FS CLV P K+
Sbjct: 222 GCGQENEGGGFSQGGGLVGFGRGPL---SLISQLGQSLGNKFSYCLVSITDSPSKTSPLF 278
Query: 171 -----------------FHSR-------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
SR L + G+ L++ +F ++L+G G I
Sbjct: 279 IGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVII 338
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNS---FPSM 262
D G+ +T +E Y V+ I + ++ G+ CF P +S FP++
Sbjct: 339 DSGTTVTYLEQSGYDVVKKAVISSINLPQVDG----SNIGLDLCFE-PQSGSSTSHFPTI 393
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
T+HF+GAD + EN +I+ A P G +I G Q N Q +YD
Sbjct: 394 TFHFEGADFNLPKEN-YIYTDSSGIACL---AMLPSNGMSIFGNIQQQNYQILYD 444
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 154/356 (43%), Gaps = 54/356 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ + +G P K + DT + L W QC+PC++C+ Q DPI++ SY + C D
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 68 CKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS-LESK 125
C S P DC Y YGD T+ S +T TL + ++ +NI FGC L
Sbjct: 100 CDSLPRKSCSPDCDYSYGYGDGSGTRGTLSSETVTLT-STQGEKLAAKNIAFGCGHLNRG 158
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV----QPDK----------SF 171
F +G++GL + SF+ QLG L +FS CLV P K S
Sbjct: 159 SF-----NDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSS 213
Query: 172 HSR----------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
HS ++ D IAG++L +P SF IK +G G I D G
Sbjct: 214 HSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSG 273
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPARFNSF----PSMTY 264
+ LT++ Y ++ S I+ G+ C+++ S+ P+M +
Sbjct: 274 TTLTLLPDAPYQIVLRALRSKISFPKIDG----SSAGLDLCYDVSGSKASYKMKIPAMVF 329
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
HF+GAD + EN FI + D+ + I G Q N + +YD+ +
Sbjct: 330 HFEGADYQLPVENYFIAAN-DAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGS 384
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 151/347 (43%), Gaps = 50/347 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ L IG P ++ ++DT + L WTQC+PCK C++Q PI++ S+ KLPC
Sbjct: 94 NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCS 153
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + P C Y +YGD T+ V + +T T SV I FGC +
Sbjct: 154 SDLCVALPISSCSDGCEYRYSYGDHSSTQGVLATETFTF------GDASVSKIGFGCGED 207
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQII 182
++ Q AG++GL S + QLG VP +FS CL D S S L G +
Sbjct: 208 NRGRAYSQG---AGLVGLGRGPLSLISQLG--VP-KFSYCLTSIDDSKGISTLLVGSEAT 261
Query: 183 AGKSLNLP----------------------------PNSFTIKLNGQRGCINDCGSVLTV 214
++ P ++F+I+ +G G I D G+ +T
Sbjct: 262 VKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYHFQGADLV 272
++ +A L EFI SQ ++ + CF LP + P + +HF+G DL
Sbjct: 322 LKDNAFAALKKEFI---SQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLK 378
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ EN I +DS G +I G Q N ++DL+
Sbjct: 379 LPKENYII---EDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLE 422
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 151/347 (43%), Gaps = 50/347 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ L IG P ++ ++DT + L WTQC+PCK C++Q PI++ S+ KLPC
Sbjct: 94 NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCS 153
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + P C Y +YGD T+ V + +T T SV I FGC +
Sbjct: 154 SDLCVALPISSCSDGCEYRYSYGDHSSTQGVLATETFTF------GDASVSKIGFGCGED 207
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQII 182
++ Q AG++GL S + QLG VP +FS CL D S S L G +
Sbjct: 208 NRGRAYSQG---AGLVGLGRGPLSLISQLG--VP-KFSYCLTSIDDSKGISTLLVGSEAT 261
Query: 183 AGKSLNLP----------------------------PNSFTIKLNGQRGCINDCGSVLTV 214
++ P ++F+I+ +G G I D G+ +T
Sbjct: 262 VKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYHFQGADLV 272
++ +A L EFI SQ ++ + CF LP + P + +HF+G DL
Sbjct: 322 LKDSAFAALKKEFI---SQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLK 378
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ EN I +DS G +I G Q N ++DL+
Sbjct: 379 LPKENYII---EDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLE 422
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 155/356 (43%), Gaps = 54/356 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ + +G P K + DT + L W QC+PC++C+ Q DPI++ SY + C D
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 68 CKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS-LESK 125
C S P +C Y YGD T+ S +T TL + ++ +NI FGC L
Sbjct: 100 CDSLPRKSCSPNCDYSYGYGDGSGTRGTLSSETVTLT-STQGEKLAAKNIAFGCGHLNRG 158
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV----QPDK----------SF 171
F +G++GL + SF+ QLG L +FS CLV P K S
Sbjct: 159 SF-----NDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSS 213
Query: 172 HSR----------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
HS ++ D IAG++L +P SF IK +G G I D G
Sbjct: 214 HSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSG 273
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPARFNSF----PSMTY 264
+ LT++ Y ++ S +I+ G+ C+++ S+ P+M +
Sbjct: 274 TTLTLLPDAPYQIVLRALRSKVSFPEIDG----SSAGLDLCYDVSGSKASYKKKIPAMVF 329
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
HF+GAD + EN FI + D+ + I G Q N + +YD+ +
Sbjct: 330 HFEGADHQLPVENYFIAAN-DAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGS 384
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 102/344 (29%), Positives = 151/344 (43%), Gaps = 45/344 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P ++ +LDT + + W QCQPCK CYEQ PI++S ++YK LPC +
Sbjct: 89 YLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPSNT 148
Query: 68 CKSPFHCF---EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+S F C Y I Y D ++ S++T TL + SPV GC
Sbjct: 149 CQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNG-SPVQFPGTVIGCG--R 205
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD-QIIA 183
+ + I++K +GI+GL S + QL +FS CLV + S+L FG+ +++
Sbjct: 206 YNAIGIEEK-NSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNAAVVS 264
Query: 184 GKSLNLPP--------------NSFTIKLN----------GQRGCINDCGSVLTVIECEV 219
G+ P +F++ N G+ I D G+ LT + V
Sbjct: 265 GRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGKGNIIIDSGTTLTALPNGV 324
Query: 220 YAVLTAEFIDYFSQHDI----EKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
Y+ L A + + L C K VT L A S P +T HF GAD+ +
Sbjct: 325 YSKLEAAVAKTVILQRVRDPNQVLGLCYK--VTPDKLDA---SVPVITAHFSGADVTLNA 379
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
N F+ D F AF P + + G Q N YDL
Sbjct: 380 INTFVQVADDVVCF----AFQPTETGAVFGNLAQQNLLVGYDLQ 419
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 151/355 (42%), Gaps = 51/355 (14%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
++ + Y++ + IG P S ++DT + L WTQC+PC C+ Q PI+N + S+
Sbjct: 89 VYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFST 148
Query: 61 LPCYDASCKS-PFH-CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
LPC C+ P C +C Y YGD T+ + +T T E S SV NI F
Sbjct: 149 LPCESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTF----ETS--SVPNIAF 202
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC +++ F + AG++G+ W S QLG +FS C+ S S L G
Sbjct: 203 GCGEDNQGF---GQGNGAGLIGMGWGPLSLPSQLGV---GQFSYCMTSYGSSSPSTLALG 256
Query: 179 DQ------------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDC 208
+ G +L +P ++F ++ +G G I D
Sbjct: 257 SAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDS 316
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHF 266
G+ LT + + Y + F D + +++ + TCF P+ ++ P ++ F
Sbjct: 317 GTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLS---TCFQQPSDGSTVQVPEISMQF 373
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
G L + +N+ I + G + + G +I G Q TQ +YDL
Sbjct: 374 DGGVLNLGEQNILISPAEGVICLAMGSS--SQLGISIFGNIQQQETQVLYDLQNL 426
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 99/345 (28%), Positives = 148/345 (42%), Gaps = 54/345 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ IG P SL ++DT + L WT+C PC C IY+ S +Y K+ C +
Sbjct: 42 YLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLCQSSL 99
Query: 68 CKSP--FHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ P F C +GDC Y YGD T + S +T ++ S S+ NI FGC ++
Sbjct: 100 CQPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSI------SSQSLPNITFGCGHDN 153
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC--------------------- 163
+ F + G++G S S + QLG + ++FS C
Sbjct: 154 QGF-----DKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASL 208
Query: 164 ---------LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
LVQ + H L + G+SL +P +F I+ +G G I D G+ LT
Sbjct: 209 EATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTF 268
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVV 273
++ Y + + + + CFN N FPSMT+HF+GAD V
Sbjct: 269 LQQTAYDAVKEAMVSSINLPQADGQLDL------CFNQQGSSNPGFPSMTFHFKGADYDV 322
Query: 274 EPEN-VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
EN +F + D P + I G Q N Q +YD
Sbjct: 323 PKENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYD 367
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 150/353 (42%), Gaps = 51/353 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++ L +G P +S ++DT + L W QC PC+ CY+Q P ++ +S++K C
Sbjct: 36 NGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACT 95
Query: 65 DASCKS---PFH-CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
D C P C C Y TYGD T + +T +L + SV N FGC
Sbjct: 96 DNLCNVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETISL--NNGAGTQSVPNFAFGC 153
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
++ + AG++GL S QL ++FS CLV + S L FG
Sbjct: 154 GTQNLGTFA----GAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLSASPLTFGSI 209
Query: 181 IIA----------------------------GKSLNLPPNSFTI-KLNGQRGCINDCGSV 211
A G+ LNL P+ F I + G+ G I D G+
Sbjct: 210 AAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTT 269
Query: 212 LTVIECEVY-AVLTA--EFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ 267
+T++ Y AVL A F++Y +L CFN+ N S P M + FQ
Sbjct: 270 ITMLTLPAYSAVLRAYESFVNY------PRLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQ 323
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
GAD + EN+F+ + A +G +I+G Q N VYDL+
Sbjct: 324 GADFQMRGENLFVLVDTSATTLCL--AMGGSQGFSIIGNIQQQNHLVVYDLEA 374
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/346 (28%), Positives = 151/346 (43%), Gaps = 57/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P ++ +LDT + ++W QC PC CYEQ DPI+ S S+ L C
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETEQ 210
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
CKS C G C Y ++YGD T +T TL S+ NI GC ++
Sbjct: 211 CKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL------GSTSLGNIAIGCGHNNE 264
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
F+ + G L++ S +L FS CLV D S L+F I
Sbjct: 265 GLFIGAAGLLGLGGGSLSFPS--------QLNASSFSYCLVDRDSDSTSTLDFNSPITPD 316
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G L +P SF + +G G I D G+ +T ++
Sbjct: 317 AVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQT 376
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF-QGADLVV 273
VY VL F+ S HD++ T R + TC++L ++ P++++HF G +L +
Sbjct: 377 TVYNVLRDAFVK--STHDLQ---TARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 274 EPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+N I + + +F F F P +ILG Q T+ +DL
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPT---DSTLSILGNAQQQGTRVGFDL 474
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 147/356 (41%), Gaps = 66/356 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ + +G P + ++DT + LTW QC PC +CY QND ++ + S+ KL C
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62
Query: 68 CKS-PF-HCFEGDCFYGITYGD--------VYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C P+ C + C Y +YGD VY+T +D ++ V N
Sbjct: 63 CNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQ---------QVPNFA 113
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRL 175
FGC +++ + GI+GL SF QL + +FS CLV + S L
Sbjct: 114 FGCGHDNEGSFAGAD----GILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPL 169
Query: 176 EFGDQI------------------------------IAGKSLNLPPNSFTIKLNGQRGCI 205
FGD + GK LN+ +F I G+ G I
Sbjct: 170 LFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTI 229
Query: 206 NDCGSVLTVIECEVY----AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPS 261
D G+ +T + EV+ A + A +DY + D G LP + PS
Sbjct: 230 FDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLP----TVPS 285
Query: 262 MTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
MT+HF+G D+ + P N FIF + F + TI+G+ Q N Q YD
Sbjct: 286 MTFHFEGGDMELPPSNYFIFLESSQSYCF---SMVSSPDVTIIGSIQQQNFQVYYD 338
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 104/346 (30%), Positives = 145/346 (41%), Gaps = 55/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + ++DT + L WTQCQPC C+ Q+ PI+N + S+ LPC
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C +SP C C Y YGD ET+ S+ T TL VS+ NI FGC +
Sbjct: 155 CQALQSP-TCSNNSCQYTYGYGDGSETQ--GSMGTETL----TFGSVSIPNITFGCGENN 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG---DQI 181
+ F + AG++G+ S QL +FS C+ S S L G + +
Sbjct: 208 QGF---GQGNGAGLVGMGRGPLSLPSQLDV---TKFSYCMTPIGSSTSSTLLLGSLANSV 261
Query: 182 IAGK-----------------SLN--------LPPNSFTIKL---NGQRGCINDCGSVLT 213
AG +LN LP + KL NG G I D G+ LT
Sbjct: 262 TAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLT 321
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF--PSMTYHFQGADL 271
Y + FI SQ ++ + CF +P+ ++ P+ HF G DL
Sbjct: 322 YFADNAYQAVRQAFI---SQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDL 378
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
V+ EN FI G + +G +I G Q N VYD
Sbjct: 379 VLPSENYFISPSNGLICLAMGSS---SQGMSIFGNIQQQNLLVVYD 421
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 152/343 (44%), Gaps = 42/343 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P ++ + DT + + W QC+PC+ CY Q PI+N SYK +PC
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKL 146
Query: 68 CKSPFHCF---EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C S + C Y I+YGD ++ S+DT + L SPVS I GC
Sbjct: 147 CHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLS-LESTSGSPVSFPKIVIGC---G 202
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLEFGD-Q 180
D +GI+GL S + QLG + +FS CLV + + S L FGD
Sbjct: 203 TDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAA 262
Query: 181 IIAGKSLNLPP-------------NSFT-----IKLNG-------QRGCINDCGSVLTVI 215
+++G + P +F+ ++ G + I D G+ LT+I
Sbjct: 263 VVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLI 322
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
+VY L + +D ++++ + C++L + FP +T HF+GAD+ +
Sbjct: 323 PSDVYTNLESAVVDLVK---LDRVDDPNQQFSLCYSLKSNEYDFPIITVHFKGADVELHS 379
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F+ F F P+ P+ G +I G Q N YDL
Sbjct: 380 ISTFVPITDGIVCFAFQPS--PQLG-SIFGNLAQQNLLVGYDL 419
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 104/346 (30%), Positives = 145/346 (41%), Gaps = 55/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + ++DT + L WTQCQPC C+ Q+ PI+N + S+ LPC
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C +SP C C Y YGD ET+ S+ T TL VS+ NI FGC +
Sbjct: 155 CQALQSP-TCSNNSCQYTYGYGDGSETQ--GSMGTETL----TFGSVSIPNITFGCGENN 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG---DQI 181
+ F + AG++G+ S QL +FS C+ S S L G + +
Sbjct: 208 QGF---GQGNGAGLVGMGRGPLSLPSQLDV---TKFSYCMTPIGSSNSSTLLLGSLANSV 261
Query: 182 IAGK-----------------SLN--------LPPNSFTIKL---NGQRGCINDCGSVLT 213
AG +LN LP + KL NG G I D G+ LT
Sbjct: 262 TAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLT 321
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF--PSMTYHFQGADL 271
Y + FI SQ ++ + CF +P+ ++ P+ HF G DL
Sbjct: 322 YFVDNAYQAVRQAFI---SQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDL 378
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
V+ EN FI G + +G +I G Q N VYD
Sbjct: 379 VLPSENYFISPSNGLICLAMGSS---SQGMSIFGNIQQQNLLVVYD 421
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 150/346 (43%), Gaps = 57/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P ++ +LDT + ++W QC PC CYEQ DP + S S+ L C
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETEQ 210
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
CKS C G C Y ++YGD T +T TL S+ NI GC ++
Sbjct: 211 CKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL------GSTSLGNIAIGCGHNNE 264
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
F+ + G L++ S +L FS CLV D S L+F I
Sbjct: 265 GLFIGAAGLLGLGGGSLSFPS--------QLNASSFSYCLVDRDSDSTSTLDFNSPITPD 316
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G L +P SF + +G G I D G+ +T ++
Sbjct: 317 AVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQT 376
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF-QGADLVV 273
VY VL F+ S HD++ T R + TC++L ++ P++++HF G +L +
Sbjct: 377 TVYNVLRDAFVK--STHDLQ---TARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 274 EPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+N I + + +F F F P +ILG Q T+ +DL
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPT---DSTLSILGNAQQQGTRVGFDL 474
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 151/343 (44%), Gaps = 42/343 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P ++ + DT + + W QC+PC+ CY Q PI+N SYK +PC
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKL 146
Query: 68 CKSPFHCF---EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C S + C Y I+YGD ++ S+DT + L SPVS GC
Sbjct: 147 CHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLS-LESTSGSPVSFPKTVIGC---G 202
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLEFGD-Q 180
D +GI+GL S + QLG + +FS CLV + + S L FGD
Sbjct: 203 TDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAA 262
Query: 181 IIAGKSLNLPP-------------NSFT-----IKLNG-------QRGCINDCGSVLTVI 215
+++G + P +F+ ++ G + I D G+ LT+I
Sbjct: 263 VVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLI 322
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
+VY L + +D ++++ + C++L + FP +T HF+GAD+ +
Sbjct: 323 PSDVYTNLESAVVDLVK---LDRVDDPNQQFSLCYSLKSNEYDFPIITAHFKGADIELHS 379
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F+ F F P+ P+ G +I G Q N YDL
Sbjct: 380 ISTFVPITDGIVCFAFQPS--PQLG-SIFGNLAQQNLLVGYDL 419
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/326 (26%), Positives = 139/326 (42%), Gaps = 33/326 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + ++D+ + + W QC+PC+ CY Q DP+++ + S+ + C A
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 68 CKS------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C++ G C Y +TYGD TK +L+T TL +VQ + GC
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCG 243
Query: 122 -LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLE---- 176
S FV AG++GL W + S + QLG FS CL L
Sbjct: 244 HRNSGLFVG-----AAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLASSFY 298
Query: 177 ---FGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQ 233
+ G+ L L + F + +G G + D G+ +T + E YA L F
Sbjct: 299 YVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGA 358
Query: 234 HDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFF 291
+ TC++L + P+++++F QGA L + N+ + F F
Sbjct: 359 LPRSPAVSLLD---TCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAF 415
Query: 292 GPAFTPRKGKTILGARHQHNTQFVYD 317
P+ G +ILG Q Q D
Sbjct: 416 APS---SSGISILGNIQQEGIQITVD 438
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 150/359 (41%), Gaps = 66/359 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ + IG P + ++DT + L WTQC+PC C++Q+ P+++ S +Y +PC
Sbjct: 102 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 161
Query: 65 DASCKS--PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
ASC C C Y TYGD T+ V + +T TL P + FGC
Sbjct: 162 SASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLP------GVVFGCG 215
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD-- 179
++ Q AG++GL S + QLG D+FS CL D + +S L G
Sbjct: 216 DTNEGDGFSQG---AGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDTNNSPLLLGSLA 269
Query: 180 --------------------------------QIIAGKS-LNLPPNSFTIKLNGQRGCIN 206
I G + ++LP ++F ++ +G G I
Sbjct: 270 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 329
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT---CFNLPARF---NSFP 260
D G+ +T +E + Y L F + L GV CF PA+ P
Sbjct: 330 DSGTSITYLEVQGYRALKKAFAAQMA------LPAADGSGVGLDLCFRAPAKGVDQVEVP 383
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +HF GADL + EN + + +G +I+G Q N QFVYD+
Sbjct: 384 RLVFHFDGGADLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDV 439
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 150/359 (41%), Gaps = 66/359 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ + IG P + ++DT + L WTQC+PC C++Q+ P+++ S +Y +PC
Sbjct: 92 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 151
Query: 65 DASCKS--PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
ASC C C Y TYGD T+ V + +T TL P + FGC
Sbjct: 152 SASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLP------GVVFGCG 205
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD-- 179
++ Q AG++GL S + QLG D+FS CL D + +S L G
Sbjct: 206 DTNEGDGFSQG---AGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDTNNSPLLLGSLA 259
Query: 180 --------------------------------QIIAGKS-LNLPPNSFTIKLNGQRGCIN 206
I G + ++LP ++F ++ +G G I
Sbjct: 260 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 319
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT---CFNLPARF---NSFP 260
D G+ +T +E + Y L F + L GV CF PA+ P
Sbjct: 320 DSGTSITYLEVQGYRALKKAFAAQMA------LPAADGSGVGLDLCFRAPAKGVDQVEVP 373
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +HF GADL + EN + + +G +I+G Q N QFVYD+
Sbjct: 374 RLVFHFDGGADLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDV 429
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 150/359 (41%), Gaps = 66/359 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ + IG P + ++DT + L WTQC+PC C++Q+ P+++ S +Y +PC
Sbjct: 71 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 130
Query: 65 DASCKS--PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
ASC C C Y TYGD T+ V + +T TL P + FGC
Sbjct: 131 SASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLP------GVVFGCG 184
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD-- 179
++ Q AG++GL S + QLG D+FS CL D + +S L G
Sbjct: 185 DTNEGDGFSQG---AGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDTNNSPLLLGSLA 238
Query: 180 --------------------------------QIIAGKS-LNLPPNSFTIKLNGQRGCIN 206
I G + ++LP ++F ++ +G G I
Sbjct: 239 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 298
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT---CFNLPARF---NSFP 260
D G+ +T +E + Y L F + L GV CF PA+ P
Sbjct: 299 DSGTSITYLEVQGYRALKKAFAAQMA------LPAADGSGVGLDLCFRAPAKGVDQVEVP 352
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +HF GADL + EN + + +G +I+G Q N QFVYD+
Sbjct: 353 RLVFHFDGGADLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDV 408
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 148/353 (41%), Gaps = 57/353 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +G+G P + ++ ++DT + +TW QC PC +CY+Q D ++N S S+K L C +
Sbjct: 16 YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSSFKVLDCSSSL 75
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C C Y YGD T D L P V + NI GC +++
Sbjct: 76 CLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCGHDNE 135
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLEFGDQII- 182
AGI+GL SF L + FS CL + D + S L FGD I
Sbjct: 136 GTFGTA----AGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKSTLVFGDAAIP 191
Query: 183 ------------------------------AGKSL--NLPPNSFTIKLNGQRGCINDCGS 210
G +L N+P + F + +G G I D G+
Sbjct: 192 HTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDSGT 251
Query: 211 VLTVIECEVYAVLTAEF----IDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYH 265
+T +E Y + F + S D K+F TC++ + S P++T+H
Sbjct: 252 TITRLEARAYTAVRDAFRAATMHLTSAADF-KIFD------TCYDFTGMNSISVPTVTFH 304
Query: 266 FQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
FQG D+ + P N + ++ F F AF G +++G Q + + +YD
Sbjct: 305 FQGDVDMRLPPSNYIVPVSNNNIFCF---AFAASMGPSVIGNVQQQSFRVIYD 354
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 148/358 (41%), Gaps = 61/358 (17%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ + IG P S ++DT + L WTQC+PC C++Q+ P+++ S +Y +PC
Sbjct: 97 NGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 156
Query: 65 DASCK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
A C + C Y TYGD T+ V + +T TL + P + FGC
Sbjct: 157 SALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLP----GVAFGCG 212
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFGDQ 180
++ Q AG++GL S + QLG D+FS CL D S L G
Sbjct: 213 DTNEGDGFTQG---AGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDGDGKSPLLLGGS 266
Query: 181 -----------------------------------IIAGKSLNLPPNSFTIKLNGQRGCI 205
+ + LP ++F I+ +G G I
Sbjct: 267 AAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVI 326
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARF---NSFPS 261
D G+ +T +E + Y L F+ + ++ + G+ CF PA+ P
Sbjct: 327 VDSGTSITYLELQGYRALKKAFVAQMALPTVDG----SEIGLDLCFQGPAKGVDEVQVPK 382
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ HF GADL + EN + + P +G +I+G Q N QFVYD+
Sbjct: 383 LVLHFDGGADLDLPAENYMVLDSASGALCL---TVAPSRGLSIIGNFQQQNFQFVYDV 437
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 148/349 (42%), Gaps = 59/349 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P + + +LDT + + W QC+PC+ CY Q DPI+N S S+ + C A
Sbjct: 8 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAV 67
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C G C Y ++YGD T V S T TL S+QN+ GC ++
Sbjct: 68 CSQLDANDCHGGGCLYEVSYGDGSYT--VGSYATETL----TFGTTSIQNVAIGCGHDNV 121
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
FV + G L SF QLG FS CLV D LEFG + +
Sbjct: 122 GLFVGAAGLLGLGAGSL-----SFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPI 176
Query: 185 KSL-----------------------------NLPPNSFTI-KLNGQRGCINDCGSVLTV 214
S+ ++P +F I + G+ G I D G+ +T
Sbjct: 177 GSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTR 236
Query: 215 IECEVYAVLTAEFI---DYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHF-QGA 269
++ Y L FI + + D +F TC++L A + S P++ +HF GA
Sbjct: 237 LQTSAYDALRDAFIAGTQHLPRADGISIFD------TCYDLSALQSVSIPAVGFHFSNGA 290
Query: 270 DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
++ +N I + +F F F PA +I+G Q + +D
Sbjct: 291 GFILPAKNCLIPMDSMGTFCFAFAPA---DSNLSIMGNIQQQGIRVSFD 336
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 148/349 (42%), Gaps = 59/349 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P + + +LDT + + W QC+PC+ CY Q DPI+N S S+ + C A
Sbjct: 154 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAV 213
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C G C Y ++YGD T V S T TL S+QN+ GC ++
Sbjct: 214 CSQLDANDCHGGGCLYEVSYGDGSYT--VGSYATETL----TFGTTSIQNVAIGCGHDNV 267
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
FV + G L SF QLG FS CLV D LEFG + +
Sbjct: 268 GLFVGAAGLLGLGAGSL-----SFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPI 322
Query: 185 KSL-----------------------------NLPPNSFTI-KLNGQRGCINDCGSVLTV 214
S+ ++P +F I + G+ G I D G+ +T
Sbjct: 323 GSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTR 382
Query: 215 IECEVYAVLTAEFI---DYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHF-QGA 269
++ Y L FI + + D +F TC++L A + S P++ +HF GA
Sbjct: 383 LQTSAYDALRDAFIAGTQHLPRADGISIFD------TCYDLSALQSVSIPAVGFHFSNGA 436
Query: 270 DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
++ +N I + +F F F PA +I+G Q + +D
Sbjct: 437 GFILPAKNCLIPMDSMGTFCFAFAPA---DSNLSIMGNIQQQGIRVSFD 482
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 148/344 (43%), Gaps = 52/344 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y L++GIG P K+ + ++DT + + W QC+PC CY+Q DPI++ S S+ +L C
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQ 219
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C++ F C C Y ++YGD T V T T+ + SV + GC +++
Sbjct: 220 CRNLDVFACRNDSCLYQVSYGDGSYT--VGDFATETVSFGNSG---SVDKVAIGCGHDNE 274
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-----GD 179
FV I G L+ S ++ FS CLV D S LEF D
Sbjct: 275 GLFVGAAGLIGLGGGPLSLTS--------QIKASSFSYCLVNRDSVDSSTLEFNSAKPSD 326
Query: 180 QIIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ A G+ L +PP+ F + +G+ G I DCG+ +T ++
Sbjct: 327 SVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQT 386
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA-DLVVEP 275
+ Y L F+ F TC+NL +R + P++ + F G L + P
Sbjct: 387 QAYNALRDTFVKLTKDLPSTSGFALFD---TCYNLSSRTSVRVPTVAFLFDGGKSLPLPP 443
Query: 276 ENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
N I + +F F P +I+G Q T+ YDL
Sbjct: 444 SNYLIPVDSAGTFCLAFAPT---TASLSIIGNVQQQGTRVTYDL 484
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 146/357 (40%), Gaps = 68/357 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ + +G P + ++DT + LTW QC PC CY QND ++ + S+ KL C A
Sbjct: 13 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSAL 72
Query: 68 CKS-PF-HCFEGDCFYGITYGD--------VYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C PF C + C Y +YGD VY+T +D ++ V N
Sbjct: 73 CNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQ---------QVPNFA 123
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRL 175
FGC +++ + GI+GL SF QL + +FS CLV + S L
Sbjct: 124 FGCGHDNEGSFAGAD----GILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPL 179
Query: 176 EFGDQI------------------------------IAGKSLNLPPNSFTIKLNGQRGCI 205
FGD + LN+ F I G G I
Sbjct: 180 LFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTI 239
Query: 206 NDCGSVLTVIE----CEVYAVLTAEFIDYFSQ-HDIEKLFTCRKCGVTCFNLPARFNSFP 260
D G+ +T + EV A + A + Y + DI +L C G LP + P
Sbjct: 240 FDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLCLS-GFPKDQLP----TVP 294
Query: 261 SMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+MT+HF+G D+V+ P N FI+ + F A T I+G+ Q N Q YD
Sbjct: 295 AMTFHFEGGDMVLPPSNYFIYLESSQSYCF---AMTSSPDVNIIGSVQQQNFQVYYD 348
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 140/339 (41%), Gaps = 46/339 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + ++D+ + + W QC+PC+ CY Q DP+++ + S+ + C A
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 68 CKS------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C++ G C Y +TYGD TK +L+T TL +VQ + GC
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCG 243
Query: 122 -LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
S FV AG++GL W + S + QLG FS CL L G
Sbjct: 244 HRNSGLFVG-----AAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRT 298
Query: 181 --------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVY 220
+ G+ L L + F + +G G + D G+ +T + E Y
Sbjct: 299 EAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAY 358
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADLVVEPENV 278
A L F + TC++L + P+++++F QGA L + N+
Sbjct: 359 AALRGAFDGAMGALPRSPAVSLLD---TCYDLSGYASVRVPTVSFYFDQGAVLTLPARNL 415
Query: 279 FIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ F F P+ G +ILG Q Q D
Sbjct: 416 LVEVGGAVFCLAFAPS---SSGISILGNIQQEGIQITVD 451
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 100/363 (27%), Positives = 150/363 (41%), Gaps = 70/363 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYD- 65
Y++ L IG P S + DT + L WTQC PC S C++Q P+YN S ++ LPC
Sbjct: 86 YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSS 145
Query: 66 --------ASCKSPFHCFEGDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPVSV 113
A P C C Y +TYG VY+ E + +ST P ++ V
Sbjct: 146 LSMCAAALAGTTPPPGC---TCMYNMTYGSGWTSVYQGSETFTFGSST--PANQ---TGV 197
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFH 172
I FGCS S F +G++GL S S + QLG VP +FS CL D +
Sbjct: 198 PGIAFGCSNASGGF---NTSSASGLVGLGRGSLSLVSQLG--VP-KFSYCLTPYQDTNST 251
Query: 173 SRLEFGDQI----------------------------------IAGKSLNLPPNSFTIKL 198
S L G + +L++P + ++K
Sbjct: 252 STLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKA 311
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS 258
+G G I D G+ +T++ Y + A + + + + CF LP+ ++
Sbjct: 312 DGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGGSAATGLDL-CFELPSSTSA 370
Query: 259 ---FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
PSMT HF GAD+V+ ++ + DS + G +ILG Q N +
Sbjct: 371 PPTMPSMTLHFDGADMVLPADSYMML---DSNLWCLAMQNQTDGGVSILGNYQQQNMHIL 427
Query: 316 YDL 318
YD+
Sbjct: 428 YDV 430
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 142/361 (39%), Gaps = 67/361 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ + IG P + ++DT + L WTQC+PC C+ Q+ P+++ S +Y LPC
Sbjct: 115 NGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCS 174
Query: 65 DASCK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
+ C S DC Y TYGD T+ V + +T TL P + FGC
Sbjct: 175 SSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLP------GVAFGC 228
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD- 179
++ Q AG++GL S + QLG +FS CL D + S L G
Sbjct: 229 GDTNEGDGFTQG---AGLVGLGRGPLSLVSQLGL---GKFSYCLTSLDDTSKSPLLLGSL 282
Query: 180 ----------------------------------QIIAGKSLNLPPNSFTIKLNGQRGCI 205
+ + LP ++F ++ +G G I
Sbjct: 283 AAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVI 342
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT---CFNLPARF---NSF 259
D G+ +T +E + Y L F KL V CF PA
Sbjct: 343 VDSGTSITYLELQGYRPLKKAFAAQM------KLPVADGSAVGLDLCFKAPASGVDDVEV 396
Query: 260 PSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P + HF GADL + EN + + +G +I+G Q N QFVYD+
Sbjct: 397 PKLVLHFDGGADLDLPAENYMVLDSASGALCL---TVMGSRGLSIIGNFQQQNIQFVYDV 453
Query: 319 D 319
D
Sbjct: 454 D 454
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 147/343 (42%), Gaps = 48/343 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K ++ +LDT + + W QC PC CY+Q+DPI++ S ++K L C D
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPK 223
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C S C C Y ++YGD T + DT T + V ++ GC +++
Sbjct: 224 CASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTFGESGK-----VNDVALGCGHDNE 278
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-------G 178
+ ++ + ++ FS CLV D + S L+F G
Sbjct: 279 GLFTGAAGLLGLGG-------GALSMTNQIKAKSFSYCLVDRDSAKSSSLDFNSVQIGAG 331
Query: 179 DQI---------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
D + G+ +++P + F + +G G I DCG+ +T ++
Sbjct: 332 DATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQT 391
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPE 276
+ Y L F+ + D +K + TC++ + P++T+HF G + P
Sbjct: 392 QAYNSLRDAFVKLTT--DFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPA 449
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
++ D+ F F AF P +I+G Q T+ YDL
Sbjct: 450 KNYLIPIDDAGTFCF--AFAPTSSSLSIIGNVQQQGTRITYDL 490
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 143/345 (41%), Gaps = 53/345 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + ++DT + L WTQCQPC C+ Q+ PI+N + S+ LPC
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ S C C Y YGD ET+ S+ T TL VS+ NI FGC ++
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQ--GSMGTETL----TFGSVSIPNITFGCGENNQ 208
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG---DQII 182
F + AG++G+ S QL +FS C+ S S L G + +
Sbjct: 209 GF---GQGNGAGLVGMGRGPLSLPSQLDV---TKFSYCMTPIGSSTPSNLLLGSLANSVT 262
Query: 183 AGK-------------------------SLNLP--PNSFTIKL-NGQRGCINDCGSVLTV 214
AG S LP P++F + NG G I D G+ LT
Sbjct: 263 AGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 322
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF--PSMTYHFQGADLV 272
Y + EFI SQ ++ + CF P+ ++ P+ HF G DL
Sbjct: 323 FVNNAYQSVRQEFI---SQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLE 379
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ EN FI G + +G +I G Q N VYD
Sbjct: 380 LPSENYFISPSNGLICLAMGSS---SQGMSIFGNIQQQNMLVVYD 421
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 148/365 (40%), Gaps = 69/365 (18%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKL 61
T+ +++ L IG P + DT + L WTQC PC + C++Q P+YN S ++ L
Sbjct: 80 TVPGEFLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSAL 139
Query: 62 PCYDASCKSPFHCFEGDCFYGITYGD----VYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
PC + C C Y +TYG V++ E + +ST P D+ V V I
Sbjct: 140 PCNSSLGLCAPAC---ACMYNMTYGSGWTYVFQGTETFTFGSST--PADQ---VRVPGIA 191
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------------- 164
FGCS S F +G++GL S S + QLG +FS CL
Sbjct: 192 FGCSNASSGF---NASSASGLVGLGRGSLSLVSQLGA---PKFSYCLTPYQDTNSTSTLL 245
Query: 165 ------------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
V S + L + +L +PPN+F++K +G G I
Sbjct: 246 LGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLII 305
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLPARFN---SF 259
D G+ +T++ Y + A + + L T T CF LP+ + S
Sbjct: 306 DSGTTITMLGNTAYQQVRAAVL------SLVTLPTTDGSAATGLDLCFELPSSTSAPPSM 359
Query: 260 PSMTYHFQGADLVVEPENVFIFNHQDSF------FFFFGPAFTPRKGKTILGARHQHNTQ 313
PSMT HF GAD+V+ +N + T +ILG Q N
Sbjct: 360 PSMTLHFDGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMH 419
Query: 314 FVYDL 318
+YD+
Sbjct: 420 ILYDV 424
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 141/357 (39%), Gaps = 62/357 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P + ++DT + L WTQC PC C EQ P + SY LPC A
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAM 144
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + + CF+ C Y YGD + V + +T T + V+V + FGC
Sbjct: 145 CNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTF--GTNSTRVAVPRVSFGCG---- 198
Query: 126 DFVSIQKKII---AGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI- 181
++ + +G++G + S + QLG RFS CL SRL FG
Sbjct: 199 ---NMNAGTLFNGSGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSPATSRLYFGAYAT 252
Query: 182 -----------------------------------IAGKSLNLPPNSFTI-KLNGQRGCI 205
+AG L + P+ F I + +G G I
Sbjct: 253 LNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVI 312
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA---RFNSFPSM 262
D G+ +T + YA++ F+ + T TCF P R + P M
Sbjct: 313 IDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANA--TPSDTFDTCFKWPPPPRRMVTLPEM 370
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF GAD+ + EN + + A P +I+G+ N +YDL+
Sbjct: 371 VLHFDGADMELPLENYMVMDGGTGNLCL---AMLPSDDGSIIGSFQHQNFHMLYDLE 424
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/353 (27%), Positives = 148/353 (41%), Gaps = 70/353 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G+P KS + +LDT + + W QCQPC CY+Q+DPI+ + SY L C
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQ 218
Query: 68 CKS--PFHCFEGDCFYGITYGD--------VYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S C G C Y + YGD V ET T V +I
Sbjct: 219 CNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGT-------------VNSIA 265
Query: 118 FGCSLESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLE 176
GC +++ FV + G L+ S +L FS CLV D + S L+
Sbjct: 266 LGCGHDNEGLFVGAAGLLGLGGGPLSLTS--------QLKATSFSYCLVNRDSAASSTLD 317
Query: 177 F-----GDQIIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCG 209
F GD +IA G+ L +P F + +G G I DCG
Sbjct: 318 FNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCG 377
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF 266
+ +T ++ E Y L F+ S+H L + + TC++L + + P++++HF
Sbjct: 378 TAITRLQSEAYNSLRDSFVS-MSRH----LRSTSGVALFDTCYDLSGQSSVKVPTVSFHF 432
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
G P ++ + + F AF P +I+G Q T+ +DL
Sbjct: 433 DGGKSWDLPAANYLIPVDSAGTYCF--AFAPTTSSLSIIGNVQQQGTRVSFDL 483
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 141/357 (39%), Gaps = 62/357 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P + ++DT + L WTQC PC C EQ P + SY LPC A
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAM 147
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + + CF+ C Y YGD + V + +T T + V+V + FGC
Sbjct: 148 CNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTF--GTNSTRVAVPRVSFGCG---- 201
Query: 126 DFVSIQKKII---AGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI- 181
++ + +G++G + S + QLG RFS CL SRL FG
Sbjct: 202 ---NMNAGTLFNGSGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSPATSRLYFGAYAT 255
Query: 182 -----------------------------------IAGKSLNLPPNSFTI-KLNGQRGCI 205
+AG L + P+ F I + +G G I
Sbjct: 256 LNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVI 315
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA---RFNSFPSM 262
D G+ +T + YA++ F+ + T TCF P R + P M
Sbjct: 316 IDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANA--TPSDTFDTCFKWPPPPRRMVTLPEM 373
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF GAD+ + EN + + A P +I+G+ N +YDL+
Sbjct: 374 VLHFDGADMELPLENYMVMDGGTGNLCL---AMLPSDDGSIIGSFQHQNFHMLYDLE 427
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 144/343 (41%), Gaps = 48/343 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K ++ +LDT + + W QC+PC CY+Q+DP++N S +YK L C
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C C Y ++YGD T + DT T + + N+ GC +++
Sbjct: 222 CSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK-----INNVALGCGHDNE 276
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-------G 178
+ ++ G+ + ++ FS CLV D S L+F G
Sbjct: 277 GLFTGAAGLLGLGGGV-------LSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGG 329
Query: 179 DQI---------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
D + G+ + LP F + +G G I DCG+ +T ++
Sbjct: 330 DATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQT 389
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPE 276
+ Y L F+ +++K + TC++ + P++ +HF G + P
Sbjct: 390 QAYNSLRDAFLKL--TVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
++ DS F F AF P +I+G Q T+ YDL
Sbjct: 448 KNYLIPVDDSGTFCF--AFAPTSSSLSIIGNVQQQGTRITYDL 488
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 144/343 (41%), Gaps = 48/343 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K ++ +LDT + + W QC+PC CY+Q+DP++N S +YK L C
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C C Y ++YGD T + DT T + + N+ GC +++
Sbjct: 222 CSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK-----INNVALGCGHDNE 276
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-------G 178
+ ++ G+ + ++ FS CLV D S L+F G
Sbjct: 277 GLFTGAAGLLGLGGGV-------LSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGG 329
Query: 179 DQI---------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
D + G+ + LP F + +G G I DCG+ +T ++
Sbjct: 330 DATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQT 389
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPE 276
+ Y L F+ +++K + TC++ + P++ +HF G + P
Sbjct: 390 QAYNSLRDAFLKL--TVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
++ DS F F AF P +I+G Q T+ YDL
Sbjct: 448 KNYLIPVDDSGTFCF--AFAPTSSSLSIIGNVQQQGTRITYDL 488
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 151/348 (43%), Gaps = 61/348 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + L+ +LDT + +TW QCQPC CY+Q+DP+++ SY + C +
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPR 226
Query: 68 C----KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + G C Y + YGD T V T TL D +PVS ++ GC +
Sbjct: 227 CHDLDAAACRNSTGACLYEVAYGDGSYT--VGDFATETLTLGDS-APVS--SVAIGCGHD 281
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI- 181
++ FV + G L++ S ++ FS CLV D S L+FGD
Sbjct: 282 NEGLFVGAAGLLALGGGPLSFPS--------QISATTFSYCLVDRDSPSSSTLQFGDAAD 333
Query: 182 -------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ G+ L++PP++F + G G I D G+ +T ++
Sbjct: 334 AEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQ 393
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQ-GAD 270
YA L F+ + L R GV TC++L R + P+++ F G +
Sbjct: 394 SSAYAALRDAFV-----RGTQSL--PRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGE 446
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
L + +N I + ++ F P +I+G Q T+ +D
Sbjct: 447 LRLPAKNYLIPVDGAGTYCLAFAPT---NAAVSIIGNVQQQGTRVSFD 491
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 140/356 (39%), Gaps = 63/356 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P ++DT + L WTQC PC C Q P ++ + +Y+ LPC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSR 148
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLES 124
C S CF+ C Y YGD T V + +T T + V NI FGC SL +
Sbjct: 149 CAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASS-TKVRAANISFGCGSLNA 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG------ 178
+ + +G++G S + QLG P RFS CL SRL FG
Sbjct: 208 GELAN-----SSGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSPTPSRLYFGVFANLN 259
Query: 179 ------------------------------DQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
+ K L + P F I +G G I D
Sbjct: 260 STNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDS 319
Query: 209 GSVLTVIECEVYAVLT---AEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN---SFPSM 262
G+ +T ++ + Y + A I + +D + TCF P N + P
Sbjct: 320 GTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLD------TCFQWPPPPNVTVTVPDF 373
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+HF GA++ + PEN + + A P TI+G Q N +YD+
Sbjct: 374 VFHFDGANMTLPPENYMLIASTTGYLCL---AMAPTSVGTIIGNYQQQNLHLLYDI 426
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 144/347 (41%), Gaps = 57/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P +S + ++D+ + + W QC+PC CY Q DP+++ S+ + C A
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAV 102
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS-LES 124
C C G C Y ++YGD TK +L+T TL VQN+ GC +
Sbjct: 103 CDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTL------GRTVVQNVAIGCGHMNQ 156
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
FV + S SF+ QL R + FS CLV + + LEFG + +
Sbjct: 157 GMFVGAAGLLGL-----GGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPV 211
Query: 185 KSLNLP-------PNSFTIKLN---------------------GQRGCINDCGSVLTVIE 216
+ +P P+ + I L+ G G + D G+ +T
Sbjct: 212 GAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFP 271
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADL 271
Y FID R GV TC+NL + P+++++F G +
Sbjct: 272 TVAYEAFRDAFIDQTGNLP-------RASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPI 324
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPR-KGKTILGARHQHNTQFVYD 317
+ P N F+ D+ F F AF P G +ILG Q Q D
Sbjct: 325 LTLPANNFLIPVDDAGTFCF--AFAPSPSGLSILGNIQQEGIQISVD 369
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 150/349 (42%), Gaps = 48/349 (13%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y+++L IG P S +LDT + L WTQC+PC CY+Q PI++ + S+ K+ C
Sbjct: 105 NGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCG 164
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
+ C + P C Y +YGD T+ V L T T + VSV NI FGC +
Sbjct: 165 SSLCSALPSSTCSDGCEYVYSYGDYSMTQGV--LATETFTFGKSKNKVSVHNIGFGCGED 222
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG----- 178
++ Q +G++GL S + QL RFS CL D + S L G
Sbjct: 223 NEGDGFEQA---SGLVGLGRGPLSLVSQLKE---QRFSYCLTPIDDTKESVLLLGSLGKV 276
Query: 179 -------------------------DQIIAGKS-LNLPPNSFTIKLNGQRGCINDCGSVL 212
+ I G + L++ ++F + +G G I D G+ +
Sbjct: 277 KDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTI 336
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYHFQGAD 270
T ++ + Y L EFI SQ + T CF+LP+ P + +HF+G D
Sbjct: 337 TYVQQKAYEALKKEFI---SQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGD 393
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
L + EN I DS A G +I G Q N +DL+
Sbjct: 394 LELPAENYMI---GDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLE 439
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 152/357 (42%), Gaps = 63/357 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P L DT + LTWTQCQPCK C+ Q+ PIY++ S+ +PC A+
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASAT 152
Query: 68 CK---SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C S +C C Y YGD + V L T TL P P VSV I FGC +
Sbjct: 153 CLPIWSSRNCTASSSPCRYRYAYGDGAYSAGV--LGTETLTFPGAPG-VSVGGIAFGCGV 209
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFG--- 178
++ G +GL S S + QLG +FS CL + S S + FG
Sbjct: 210 DNGGL----SYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGSPVLFGALA 262
Query: 179 ------------------------------DQIIAGKS-LNLPPNSFTIKLNGQRGCIND 207
+ I G + L +P +F ++ +G G I D
Sbjct: 263 ELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVD 322
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-----RFNSFPSM 262
G+ T + + V+ +D+ + + + CF PA + + P M
Sbjct: 323 SGTTFTFLVESAFRVV----VDHVAGVLRQPVVNASSLDSPCF--PAATGEQQLPAMPDM 376
Query: 263 TYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF GAD+ + +N FN ++S F A +P +ILG Q N Q ++D+
Sbjct: 377 VLHFAGGADMRLHRDNYMSFNQEES-SFCLNIAGSPSADVSILGNFQQQNIQMLFDI 432
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 151/348 (43%), Gaps = 61/348 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + L+ +LDT + +TW QCQPC CY+Q+DP+++ SY + C +
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPR 222
Query: 68 C----KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + G C Y + YGD T V T TL D +PVS ++ GC +
Sbjct: 223 CHDLDAAACRNSTGACLYEVAYGDGSYT--VGDFATETLTLGDS-APVS--SVAIGCGHD 277
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI- 181
++ FV + G L++ S ++ FS CLV D S L+FGD
Sbjct: 278 NEGLFVGAAGLLALGGGPLSFPS--------QISATTFSYCLVDRDSPSSSTLQFGDAAD 329
Query: 182 -------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ G+ L++PP++F + G G I D G+ +T ++
Sbjct: 330 AEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTAVTRLQ 389
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQ-GAD 270
YA L F+ + L R GV TC++L R + P+++ F G +
Sbjct: 390 SSAYAALRDAFV-----RGTQSL--PRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGE 442
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
L + +N I + ++ F P +I+G Q T+ +D
Sbjct: 443 LRLPAKNYLIPVDGAGTYCLAFAPT---NAAVSIIGNVQQQGTRVSFD 487
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 152/348 (43%), Gaps = 61/348 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P + +LDT + + W QC PC CY+Q DPI+ S S+ L C
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQ 208
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+S C C Y ++YGD T V T T+ P V N+ GC ++
Sbjct: 209 CRSLDVSECRNDTCLYEVSYGDGSYT--VGDFVTETITLGSAP----VDNVAIGCGHNNE 262
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
FV + G L++ S ++ FS CLV D S LEF +
Sbjct: 263 GLFVGAAGLLGLGGGSLSFPS--------QINATSFSYCLVDRDSESASTLEFNSTLPPN 314
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ +++P ++F I +G G I D G+ +T ++
Sbjct: 315 AVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQT 374
Query: 218 EVYAVLTAEFI----DYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADL 271
+VY L F+ D S + I LF TC++L ++ N P++++HF G +L
Sbjct: 375 DVYNSLRDAFVKRTRDLPSTNGI-ALFD------TCYDLSSKGNVEVPTVSFHFPDGKEL 427
Query: 272 VVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +N + + + +F F F P + +I+G Q T+ VYDL
Sbjct: 428 PLPAKNYLVPLDSEGTFCFAFAPTAS---SLSIIGNVQQQGTRVVYDL 472
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 148/362 (40%), Gaps = 71/362 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
++++L IG+P ++DT + L WTQC+PC C++Q PI++ SY K+ C
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGL 166
Query: 68 CKS--PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + +C E C Y TYGD T+ + L T T DE S+ I FGC +E
Sbjct: 167 CNALPRSNCNEDKDACEYLYTYGDYSSTRGL--LATETFTFEDEN---SISGIGFGCGVE 221
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV------------------ 165
++ Q +G++GL S + QL +FS CL
Sbjct: 222 NEGDGFSQG---SGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLAS 275
Query: 166 ----------------------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
PD+ LE + K L++ ++F + +G G
Sbjct: 276 GIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGG 335
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLP--ARFN 257
I D G+ +T +E + VL EF S G T CF LP A+
Sbjct: 336 MIIDSGTTITYLEETAFKVLKEEFTSRMS-------LPVDDSGSTGLDLCFKLPDAAKNI 388
Query: 258 SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ P M +HF+GADL + EN + DS A G +I G Q N ++D
Sbjct: 389 AVPKMIFHFKGADLELPGENYMV---ADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHD 445
Query: 318 LD 319
L+
Sbjct: 446 LE 447
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 140/354 (39%), Gaps = 59/354 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P ++DT + L WTQC PC C +Q P ++ + +Y+ LPC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSR 148
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLES 124
C S CF+ C Y YGD T V + +T T + + V NI FGC SL +
Sbjct: 149 CASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANS-TKVRATNIAFGCGSLNA 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
D + +G++G S + QLG P RFS CL + SRL FG
Sbjct: 208 GDLAN-----SSGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLS 259
Query: 182 ---------------------------------IAGKSLNLPPNSFTIKLNGQRGCINDC 208
+ K L + P F I +G G I D
Sbjct: 260 STNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDS 319
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFN---SFPSMTY 264
G+ +T ++ + Y + + + G+ TCF P N + P + +
Sbjct: 320 GTSITWLQQDAYEAVRRGLVSAIPLPAMND----TDIGLDTCFQWPPPPNVTVTVPDLVF 375
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF A++ + PEN + + P TI+G Q N +YD+
Sbjct: 376 HFDSANMTLLPENYMLIASTTGYLCLV---MAPTGVGTIIGNYQQQNLHLLYDI 426
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 141/348 (40%), Gaps = 55/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + ++D+ + + W QC+PC+ CY Q DP+++ + S+ + C A
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 68 CKS------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C++ G C Y +TYGD TK +L+T TL +VQ + GC
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCG 243
Query: 122 -LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-- 178
S FV AG++GL W + S + QLG FS CL L G
Sbjct: 244 HRNSGLFVG-----AAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGAGSLVLGRT 298
Query: 179 ---------------DQI------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+Q + G+ L L F + +G G + D G+
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTA 358
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGA 269
+T + E YA L F + TC++L + P+++++F QGA
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLD---TCYDLSGYASVRVPTVSFYFDQGA 415
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
L + N+ + F F P+ G +ILG Q Q D
Sbjct: 416 VLTLPARNLLVEVGGAVFCLAFAPS---SSGISILGNIQQEGIQITVD 460
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 142/348 (40%), Gaps = 55/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + ++D+ + + W QC+PC+ CY Q DP+++ + S+ + C A
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 68 CKS------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C++ G C Y +TYGD TK +L+T TL +VQ + GC
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCG 243
Query: 122 -LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-- 178
S FV AG++GL W + S + QLG FS CL L G
Sbjct: 244 HRNSGLFVG-----AAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRT 298
Query: 179 ---------------DQI------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+Q + G+ L L + F + +G G + D G+
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTA 358
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGA 269
+T + E YA L F + TC++L + P+++++F QGA
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLD---TCYDLSGYASVRVPTVSFYFDQGA 415
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
L + N+ + F F P+ G +ILG Q Q D
Sbjct: 416 VLTLPARNLLVEVGGAVFCLAFAPS---SSGISILGNIQQEGIQITVD 460
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 145/350 (41%), Gaps = 66/350 (18%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--P 71
IG P + ++DT + L WTQC+PC C++Q+ P+++ S +Y +PC ASC
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 72 FHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSI 130
C C Y TYGD T+ V + +T TL P + FGC ++
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLP------GVVFGCGDTNEGDGFS 286
Query: 131 QKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD----------- 179
Q AG++GL S + QLG D+FS CL D + +S L G
Sbjct: 287 QG---AGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDTNNSPLLLGSLAGISEASAAA 340
Query: 180 -----------------------QIIAGKS-LNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
I G + ++LP ++F ++ +G G I D G+ +T +
Sbjct: 341 SSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYL 400
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT---CFNLPARF---NSFPSMTYHFQ-G 268
E + Y L F + L GV CF PA+ P + +HF G
Sbjct: 401 EVQGYRALKKAFAAQMA------LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGG 454
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
ADL + EN + + +G +I+G Q N QFVYD+
Sbjct: 455 ADLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDV 501
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 95/348 (27%), Positives = 153/348 (43%), Gaps = 50/348 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P L+ ++DT + + W QC+PC+ CY Q P++N SYK +PC
Sbjct: 87 YLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKL 146
Query: 68 CKSPFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+S D C Y YGD + S+DT TL + + VS NI GC +
Sbjct: 147 CQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLT-VSFPNIVIGCG--T 203
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------VQPDKSFHSRLE 176
+ +S + +GI+G SF+ QLG +FS CL +Q + + S+L
Sbjct: 204 NNILSYEGA-SSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNAT--SKLN 260
Query: 177 FGDQ-------IIAGKSLNLPPNSF-------------TIKLNG------QRGCINDCGS 210
FGD ++ L P +F +++ G + I D G+
Sbjct: 261 FGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIIIDSGT 320
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGAD 270
LT + + Y+ L + +D +E++ + C+++ A FP +T HF+GAD
Sbjct: 321 TLTSLTKDDYSFLESAVVDLVK---LERVDDPTQTLNLCYSVKAEGYDFPIITMHFKGAD 377
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + P + F+ + D F AF + I G Q N YDL
Sbjct: 378 VDLHPISTFV-SVADGVFCL---AFESSQDHAIFGNLAQQNLMVGYDL 421
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 150/349 (42%), Gaps = 48/349 (13%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y+++L IG P S +LDT + L WTQC+PC CY+Q PI++ + S+ K+ C
Sbjct: 105 NGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCG 164
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
+ C + P C Y +YGD T+ V L T T + VSV NI FGC +
Sbjct: 165 SSLCSAVPSSTCSDGCEYVYSYGDYSMTQGV--LATETFTFGKSKNKVSVHNIGFGCGED 222
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG----- 178
++ Q +G++GL S + QL RFS CL D + S L G
Sbjct: 223 NEGDGFEQA---SGLVGLGRGPLSLVSQLKE---PRFSYCLTPMDDTKESILLLGSLGKV 276
Query: 179 -------------------------DQIIAGKS-LNLPPNSFTIKLNGQRGCINDCGSVL 212
+ I G + L++ ++F + +G G I D G+ +
Sbjct: 277 KDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTI 336
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYHFQGAD 270
T IE + + L EFI SQ + T CF+LP+ P + +HF+G D
Sbjct: 337 TYIEQKAFEALKKEFI---SQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGD 393
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
L + EN I DS A G +I G Q N +DL+
Sbjct: 394 LELPAENYMI---GDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLE 439
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 148/362 (40%), Gaps = 71/362 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
++++L IG+P ++DT + L WTQC+PC C++Q PI++ SY K+ C
Sbjct: 108 FLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGL 167
Query: 68 CKS--PFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + +C E C Y TYGD T+ + L T T DE S+ I FGC +E
Sbjct: 168 CNALPRSNCNEDKDSCEYLYTYGDYSSTRGL--LATETFTFEDEN---SISGIGFGCGVE 222
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV------------------ 165
++ Q +G++GL S + QL +FS CL
Sbjct: 223 NEGDGFSQG---SGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLAS 276
Query: 166 ----------------------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
PD+ LE + K L++ ++F + +G G
Sbjct: 277 GIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGG 336
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLP--ARFN 257
I D G+ +T +E + VL EF S G T CF LP A+
Sbjct: 337 MIIDSGTTITYLEETAFKVLKEEFTSRMS-------LPVDDSGSTGLDLCFKLPNAAKNI 389
Query: 258 SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ P + +HF+GADL + EN + DS A G +I G Q N ++D
Sbjct: 390 AVPKLIFHFKGADLELPGENYMV---ADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHD 446
Query: 318 LD 319
L+
Sbjct: 447 LE 448
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 94/346 (27%), Positives = 150/346 (43%), Gaps = 56/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P K ++ ++DT + + W QC PC CY+Q DPI+ SY L C
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQ 214
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
CKS C C Y ++YGD T + +T TL S+ N+ GC +++
Sbjct: 215 CKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITL-----DGSASLNNVAIGCGHDNE 269
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
FV + G L++ S ++ FS CLV D S LEF I
Sbjct: 270 GLFVGAAGLLGLGGGSLSFPS--------QINASSFSYCLVNRDTDSASTLEFNSPIPSH 321
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ L++P +SF + +G G I D G+ +T ++
Sbjct: 322 SVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQS 381
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF-QGADLVV 273
+VY L F+ + L + + TC++L +R + P++++HF G L +
Sbjct: 382 DVYNSLRDSFV-----RGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLAL 436
Query: 274 EPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+N I + +F F F P + +I+G Q T+ YDL
Sbjct: 437 PAKNYLIPVDSAGTFCFAFAPTTS---ALSIIGNVQQQGTRVSYDL 479
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 147/348 (42%), Gaps = 61/348 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG+P + ++ +LDT + + W QC PC CY Q +PI+ S SY+ L C
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 210
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C C Y ++YGD T V T TL VQN+ GC ++
Sbjct: 211 CNALEVSECRNATCLYEVSYGDGSYT--VGDFATETL----TIGSTLVQNVAVGCGHSNE 264
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-----D 179
FV + G L S +L FS CLV D S +EFG D
Sbjct: 265 GLFVGAAGLLGLGGGLLALPS--------QLNTTSFSYCLVDRDSDSASTVEFGTSLPPD 316
Query: 180 QIIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
++A G+ L +P +SF + +G G I D G+ +T ++
Sbjct: 317 AVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQT 376
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADLV 272
+Y L F+ S D+EK GV TC+NL A+ P++ +HF G ++
Sbjct: 377 GIYNSLRDSFLKGTS--DLEK-----AAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKML 429
Query: 273 VEPENVFIF--NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P ++ + +F F P + I+G Q T+ +DL
Sbjct: 430 ALPAKNYMIPVDSVGTFCLAFAPTAS---SLAIIGNVQQQGTRVTFDL 474
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 146/357 (40%), Gaps = 67/357 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P K L + DT + LTWTQCQPC KSCY Q PI++ + K+Y + C
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTST 213
Query: 67 SC--------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
+C SP C +C YGI YGD T + DT TL D F
Sbjct: 214 ACSGLKSATGNSP-GCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDV-----FDGFMF 267
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ AG++GL D S + Q + FS CL S + L FG
Sbjct: 268 GCGQNNRGLFG----KTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGS-NGHLTFG 322
Query: 179 DQ----------------------------------IIAGKSLNLPPNSFTIKLNGQRGC 204
+ + GK+L++ P F G
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAGT 377
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMT 263
I D G+V+T + VY L + F + S++ + TC++L + S P ++
Sbjct: 378 IIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLD---TCYDLSNYTSISIPKIS 434
Query: 264 YHFQG-ADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++F G A++ +EP + I N F G G I G Q + VYD+
Sbjct: 435 FNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIG--IFGNIQQQTLEVVYDV 489
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 146/348 (41%), Gaps = 61/348 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P + ++ +LDT + + W QC PC CY Q +PI+ S SY+ L C
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 207
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C C Y ++YGD T V T TL VQN+ GC ++
Sbjct: 208 CNALEVSECRNATCLYEVSYGDGSYT--VGDFATETL----TIGSTLVQNVAVGCGHSNE 261
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-----D 179
FV + G L S +L FS CLV D S ++FG D
Sbjct: 262 GLFVGAAGLLGLGGGLLALPS--------QLNTTSFSYCLVDRDSDSASTVDFGTSLSPD 313
Query: 180 QIIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
++A G+ L +P +SF + +G G I D G+ +T ++
Sbjct: 314 AVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQT 373
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADLV 272
E+Y L F+ D+EK GV TC+NL A+ P++ +HF G ++
Sbjct: 374 EIYNSLRDSFVK--GTLDLEK-----AAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKML 426
Query: 273 VEPENVFIF--NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P ++ + +F F P + I+G Q T+ +DL
Sbjct: 427 ALPAKNYMIPVDSVGTFCLAFAPTASSLA---IIGNVQQQGTRVTFDL 471
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 141/346 (40%), Gaps = 50/346 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++ + G P + ++DT + L WTQC PC++C I++ +Y + C
Sbjct: 77 NGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCA 136
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C S PF C Y YGD T S +T T+ P N+ FGC
Sbjct: 137 SNFCSSLPFQSCTTSCKYDYMYGDGSSTSGALSTETVTVGTGTIP------NVAFGCG-- 188
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI-- 181
+ S AGI+GL S + Q + +FS CLV + S + GD
Sbjct: 189 HTNLGSFAGA--AGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGDSAAA 246
Query: 182 --------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
++GK++ P +F+I +GQ G I D G+ LT +
Sbjct: 247 GGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYL 306
Query: 216 ECEVYAVLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV 272
E + L A F + D L+ CF+ N ++P+MT+HF+GAD
Sbjct: 307 ETGAFNALVAALKAEVPFPEAD-GSLYGLDY----CFSTAGVANPTYPTMTFHFKGADYE 361
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ PENVF+ A G +I+G Q N V+DL
Sbjct: 362 LPPENVFVALDTGGSICL---AMAASTGFSIMGNIQQQNHLIVHDL 404
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/348 (25%), Positives = 143/348 (41%), Gaps = 59/348 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P ++ + ++D+ + + W QC+PC CY Q+DP++N SY + C
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTV 193
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C EG C Y ++YGD TK +L+T T ++N+ GC ++
Sbjct: 194 CSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTF------GRTLIRNVAIGCGHHNQ 247
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
FV AG++GL SF+ QLG FS CLV L+FG + +
Sbjct: 248 GMFVG-----AAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAVPV 302
Query: 185 KSLNLP----------------------------PNSFTIKLNGQRGCINDCGSVLTVIE 216
+ +P + F + G G + D G+ +T +
Sbjct: 303 GAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLP 362
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADL 271
Y FI + R GV TC++L + P+++++F G +
Sbjct: 363 TAAYEAFRDAFI-------AQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPI 415
Query: 272 VVEPENVFIFNHQD--SFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ P F+ D SF F F P+ G +I+G Q + D
Sbjct: 416 LTLPARNFLIPVDDVGSFCFAFAPS---SSGLSIIGNIQQEGIEISVD 460
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 152/347 (43%), Gaps = 58/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K + +LDT + + W QCQPC CY+Q DPI++ RS S+ LPC
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C++ C C Y ++YGD T ++T T + + N+ GC +++
Sbjct: 215 CQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTF-----GNSGMINNVAVGCGHDNE 269
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
FV + G L+ S ++ FS CLV D S S LEF
Sbjct: 270 GLFVGSAGLLGLGGGSLSLTS--------QMKASSFSYCLVDRDSSSSSDLEFNSAAPSD 321
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ L++PPN F + +G G I D G+ +T ++
Sbjct: 322 SVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQT 381
Query: 218 EVYAVLTAEFID---YFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA-DLV 272
+ Y L F+ Y + + LF TC++L ++ + P++++ F G L
Sbjct: 382 QAYNTLRDAFVSRTPYLKKTNGFALFD------TCYDLSSQSRVTIPTVSFEFAGGKSLQ 435
Query: 273 VEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ P+N I + +F F F P + +I+G Q T+ YDL
Sbjct: 436 LPPKNYLIPVDSVGTFCFAFAPTTS---SLSIIGNVQQQGTRVHYDL 479
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 145/346 (41%), Gaps = 48/346 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P L DT + LTWTQCQPCK C+ Q+ P+Y+ + ++ LPC A+
Sbjct: 71 YLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSAT 130
Query: 68 CKSPF--HCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC---- 120
C + +C C Y YGD + + +T TL P +PVSV + FGC
Sbjct: 131 CLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGP--SSAPVSVGGVAFGCGTDN 188
Query: 121 ---SLESKDFVSIQKKIIAGIMGLN---------------WDSTSFMVQLGRLVPDRF-- 160
SL S V + + ++ + L DS + L L P
Sbjct: 189 GGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSALDSPFLLGTLAELAPGPSTV 248
Query: 161 -SCCLVQ----PDKSFHS--RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
S L+Q P + F S + GD L +P +F ++ +G G I D G+ T
Sbjct: 249 QSTPLLQSPQNPSRYFVSLQGISLGD-----VRLPIPNGTFDLRGDGTGGMIVDSGTTFT 303
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTYHFQG-ADL 271
++ + + Q + CF PA + P + HF G AD+
Sbjct: 304 ILAESGFREVVGRVARVLGQPPVN----ASSLDAPCFPAPAGEPPYMPDLVLHFAGGADM 359
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ +N +N +DS F TP ++LG Q N Q ++D
Sbjct: 360 RLYRDNYMSYNEEDSSFCLNIAGTTPES-TSVLGNFQQQNIQMLFD 404
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 145/360 (40%), Gaps = 71/360 (19%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
++L IG+P ++DT + L WTQC+PC C++Q PI++ SY K+ C C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 70 S--PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
+ +C E C Y TYGD T+ + L T T DE S+ I FGC +E++
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGL--LATETFTFEDEN---SISGIGFGCGVENE 115
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV-------------------- 165
Q +G++GL S + QL +FS CL
Sbjct: 116 GDGFSQG---SGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGI 169
Query: 166 --------------------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
PD+ LE + K L++ ++F + +G G I
Sbjct: 170 VNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMI 229
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLP--ARFNSF 259
D G+ +T +E + VL EF S G T CF LP A+ +
Sbjct: 230 IDSGTTITYLEETAFKVLKEEFTSRMS-------LPVDDSGSTGLDLCFKLPDAAKNIAV 282
Query: 260 PSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
P M +HF+GADL + EN + + A G +I G Q N ++DL+
Sbjct: 283 PKMIFHFKGADLELPGENYMVADSSTGVLCL---AMGSSNGMSIFGNVQQQNFNVLHDLE 339
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 148/362 (40%), Gaps = 63/362 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P L DT + LTWTQC+PCK C+ Q+ PIY++ + S+ +PC A+
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154
Query: 68 C----KSPFHCFE---GDCFYGITYGDVYETKEVDSLDTSTLL--PPDEPSP-VSVQNIR 117
C +S +C C Y Y D + V +T T P P P VSV +
Sbjct: 155 CLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVA 214
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLE 176
FGC +++ G +GL S S + QLG +FS CL + S S +
Sbjct: 215 FGCGVDNGGL----SYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGSPVL 267
Query: 177 FGD-------QIIAGKS-----------------------------LNLPPNSFTIKLNG 200
FG I G + L +P +F ++ +G
Sbjct: 268 FGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDG 327
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA---RFN 257
G I D G++ TV+ + V+ +Q + CF A +
Sbjct: 328 SGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQ----PVVNASSLDSPCFPATAGEQQLP 383
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
P M HF GAD+ + +N FN Q+S F A P +ILG Q N Q ++
Sbjct: 384 DMPDMLLHFAGGADMRLHRDNYMSFN-QESSSFCLNIAGAPSAYGSILGNFQQQNIQMLF 442
Query: 317 DL 318
D+
Sbjct: 443 DI 444
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 145/350 (41%), Gaps = 62/350 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K ++ +LDT + + W QC+PC CY+Q+DP++N S +YK L C
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C C Y ++YGD T + DT T + + ++ GC +++
Sbjct: 222 CSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK-----INDVALGCGHDNE 276
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-------G 178
+ ++ L + S Q+ FS CLV D S L+F G
Sbjct: 277 GLFTGAAGLLG----LGGGALSITNQMK---ATSFSYCLVDRDSGKSSSLDFNSVQLGSG 329
Query: 179 DQI---------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
D + G+ + +P F + +G G I DCG+ +T ++
Sbjct: 330 DATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQT 389
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCFNLPARFN-SFPSMTYHFQGA 269
+ Y L F+ KL T K G TC++ + + P++ +HF G
Sbjct: 390 QAYNSLRDAFL---------KLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGG 440
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
+ P ++ D+ F F AF P +I+G Q T+ YDL
Sbjct: 441 KSLDLPAKNYLIPVDDNGTFCF--AFAPTSSSLSIIGNVQQQGTRITYDL 488
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 139/348 (39%), Gaps = 54/348 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++ + G+P + ++DT + L W QC PCKSCYE ++ SYK L C
Sbjct: 87 NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLGCG 146
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ PF C Y YGD T S D T+ P N+ FGC
Sbjct: 147 SNFCQDLPFQSCAASCQYDYMYGDGSSTSGALSTDDVTIGTGKIP------NVAFGCGNS 200
Query: 124 S-KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII 182
+ F + G L S + QLG +FS CLV + S L GD +
Sbjct: 201 NLGTFAGAGGLVGLGKGPL-----SLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTL 255
Query: 183 A----------------------------GKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
A GK++N P N+F I G+ G I D G+ LT
Sbjct: 256 AGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTY 315
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVV 273
++ + + + A + + F + CF+ N ++P++ +HF GAD+ +
Sbjct: 316 LDVDAFNPMVAALKAALPYPEADGSFYGLE---YCFSTAGVANPTYPTVVFHFNGADVAL 372
Query: 274 EPENVFIFNHQDSFFFFFGP---AFTPRKGKTILGARHQHNTQFVYDL 318
P+N FI F G A G +I G Q N V+DL
Sbjct: 373 APDNTFI------ALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDL 414
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 146/340 (42%), Gaps = 35/340 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++ IG P ++DT + L W QC PC +C+ Q P++ +YK C
Sbjct: 89 YLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQP 148
Query: 68 CK----SPFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C S C + G C YGI YGD + + +T + VS N FGC +
Sbjct: 149 CTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGV 208
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII 182
++ + K++ GI GL S + QLG + +FS CL+ D + S+L+FG + I
Sbjct: 209 DNNFTIYTSNKVM-GIAGLGAGPLSLVSQLGAQIGHKFSYCLLPYDSTSTSKLKFGSEAI 267
Query: 183 AGKS------------------LNLPPNSFTIKL--NGQR--GCINDCGSVLTVIECEVY 220
+ LNL + K+ GQ + D G+ LT +E Y
Sbjct: 268 ITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTYLENTFY 327
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPENVF 279
A + ++ L + K TCF P R N + P + + F GA + + P+NV
Sbjct: 328 NNFVASLQETLGVKLLQDLPSPLK---TCF--PNRANLAIPDIAFQFTGASVALRPKNVL 382
Query: 280 IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
I DS + G ++ G+ Q++ Q YDL+
Sbjct: 383 I-PLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLE 421
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 134/346 (38%), Gaps = 55/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + + ++D+ + + W QCQPC CY Q DP+++ S+ +PC +
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSV 201
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ C G C Y + YGD TK +L+T T V+N+ GC ++
Sbjct: 202 CERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTF------GRTVVRNVAIGCGHRNR 255
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
FV + S S + QLG FS CLV LEFG
Sbjct: 256 GMFVGAAGLLGL-----GGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGAMPV 310
Query: 182 -------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ G + + + F + G G + D G+ +T I
Sbjct: 311 GAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIP 370
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-----PSMTYHFQGADL 271
Y FI R GV+ F+ N F P+++++F G +
Sbjct: 371 TVAYVAFRDAFIGQTGNLP-------RASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPI 423
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ P F+ D F F A +P G +I+G Q Q +D
Sbjct: 424 LTLPARNFLIPVDDVGTFCFAFAASP-SGLSIIGNIQQEGIQISFD 468
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 143/355 (40%), Gaps = 66/355 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++GIG P + ++D+ + + W QC+PC CY Q DP+++ S ++ + C A
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAI 184
Query: 68 CKS--PFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C++ C + G C Y ++YGD TK +L+T TL +V+ + GC +
Sbjct: 185 CRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL------GGTAVEGVAIGCGHRN 238
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIA 183
+ FV AG++GL W S + QLG FS CL S + ++
Sbjct: 239 RGLFVG-----AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVL 293
Query: 184 GKSLNLPPNS-----------------------------------FTIKLNGQRGCINDC 208
G+S +P + F + +G G + D
Sbjct: 294 GRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDT 353
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMT 263
G+ +T + E YA L F+ R GV TC++L + P+++
Sbjct: 354 GTAVTRLPQEAYAALRDAFVGAVGALP-------RAPGVSLLDTCYDLSGYTSVRVPTVS 406
Query: 264 YHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTP-RKGKTILGARHQHNTQFVYD 317
++F GA + P + + AF P G +ILG Q Q D
Sbjct: 407 FYFDGAATLTLPARNLLLEVDGGIYCL---AFAPSSSGLSILGNIQQEGIQITVD 458
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 152/347 (43%), Gaps = 58/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K + +LDT + + W QCQPC CY+Q DPI++ RS S+ LPC
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C++ C C Y ++YGD T V T TL + + ++ GC +++
Sbjct: 215 CQALETSGCRASKCLYQVSYGDGSFT--VGEFVTETLTFGNSG---MINDVAVGCGHDNE 269
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
FV + G L+ S ++ FS CLV D S S LEF
Sbjct: 270 GLFVGSAGLLGLGGGPLSLTS--------QMKASSFSYCLVDRDSSSSSDLEFNSAAPSD 321
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ L++PPN F + +G G I D G+ +T ++
Sbjct: 322 SVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQT 381
Query: 218 EVYAVLTAEFID---YFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA-DLV 272
+ Y L F+ Y + + LF TC++L ++ + P++++ F G L
Sbjct: 382 QAYNTLRDAFVSRTPYLKKTNGFALFD------TCYDLSSQSRVTIPTVSFEFAGGKSLQ 435
Query: 273 VEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ P+N I + +F F F P + +I+G Q T+ YDL
Sbjct: 436 LPPKNYLIPVDSVGTFCFAFAPTTS---SLSIIGNVQQQGTRVHYDL 479
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/356 (27%), Positives = 144/356 (40%), Gaps = 65/356 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P K L + DT + LTWTQCQPC KSCY Q PI++ + K+Y + C A
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSA 213
Query: 67 SCKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
+C S C +C YGI YGD T + D TL D FG
Sbjct: 214 ACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDV-----FDGFMFG 268
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
C +K AG++GL D S + Q + FS CL S + L FG+
Sbjct: 269 CGQNNKGLFG----KTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGS-NGHLTFGN 323
Query: 180 Q----------------------------------IIAGKSLNLPPNSFTIKLNGQRGCI 205
+ GK+L++ P F G I
Sbjct: 324 GNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ-----NAGTI 378
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+V+T + Y L + F + S++ + TC++L + S P +++
Sbjct: 379 IDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLD---TCYDLSNYTSISIPKISF 435
Query: 265 HFQG-ADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F G A++ ++P + I N F G G I G Q + VYD+
Sbjct: 436 NFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIG--IFGNIQQQTLEVVYDV 489
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 145/350 (41%), Gaps = 62/350 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P + L+ +LDT + +TW QCQPC CY+Q+DP+++ SY + C
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQR 225
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ + G C Y + YGD T V T TL D V N+ GC +
Sbjct: 226 CRDLDTAACRNATGACLYEVAYGDGSYT--VGDFATETLTLGDS---TPVGNVAIGCGHD 280
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI- 181
++ FV + G L++ S ++ FS CLV D S L+FGD
Sbjct: 281 NEGLFVGAAGLLALGGGPLSFPS--------QISASTFSYCLVDRDSPAASTLQFGDGAA 332
Query: 182 ---------------------------IAGKSLNLPPNSFTI-KLNGQRGCINDCGSVLT 213
+ G+ L++P ++F + +G G I D G+ +T
Sbjct: 333 EAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVT 392
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQG 268
++ YA L F+ R GV TC++L R + P+++ F+G
Sbjct: 393 RLQSAAYAALRDAFVQGAPSLP-------RTSGVSLFDTCYDLSDRTSVEVPAVSLRFEG 445
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
+ P ++ + + AF P +I+G Q T+ +D
Sbjct: 446 GGALRLPAKNYLIPVDGAGTYCL--AFAPTNAAVSIIGNVQQQGTRVSFD 493
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 152/353 (43%), Gaps = 61/353 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG+P +S + LDT + +TW QC PC SCY Q DPIY+ + SY+++ C A
Sbjct: 12 YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 71
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C++ + C C Y + YGD + +++ L P S +++NI FGC +
Sbjct: 72 CQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGP---NSSTAMRNIAFGCGHSNS 128
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGDQII 182
+ AG++G+ + SF Q+ + FS CLV SR L FG I
Sbjct: 129 GLF----RGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAI 184
Query: 183 ----------------------------AGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
G L +PP F + NG G I D G+ +T
Sbjct: 185 PFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTR 244
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFN---LPARFNSFPSMTYHF- 266
+ YAVL + + ++ GV TCFN LP PS+ HF
Sbjct: 245 VVPPAYAVLRDAY--RAASRNLPP-----APGVYLLDTCFNFQGLPTV--QIPSLVLHFD 295
Query: 267 QGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
G D+V+ N+ I + +F F P+ P +++G Q + +DL
Sbjct: 296 NGVDMVLPGGNILIPVDRSGTFCLAFAPSSMP---ISVIGNVQQQTFRIGFDL 345
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 144/346 (41%), Gaps = 57/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++GIG P + ++D+ + + W QC+PC CY Q DP+++ + ++ +PC A
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAV 186
Query: 68 CKS--PFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C++ C + G C Y ++YGD TK +L+T TL +V+ + GC +
Sbjct: 187 CRTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTL------GGTAVEGVAIGCGHRN 240
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--------------PDK 169
+ FV AG++GL W S + QLG FS CL P+
Sbjct: 241 RGLFVG-----AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGAGSLVLGRSEAVPEG 295
Query: 170 SFHSRLEFGDQI------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ L Q + + L L + F + +G G + D G+ +T +
Sbjct: 296 AVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQ 355
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADLV 272
E YA L F+ R GV TC++L + P+++++F GA +
Sbjct: 356 EAYAALRDAFVAAVGALP-------RAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATL 408
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTP-RKGKTILGARHQHNTQFVYD 317
P + + AF P G +ILG Q Q D
Sbjct: 409 TLPARNLLLEVDGGIYCL---AFAPSSSGPSILGNIQQEGIQITVD 451
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 139/343 (40%), Gaps = 47/343 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P L + DT + LTWTQCQPC ++CY+Q +PI+N SY + C A
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 192
Query: 67 SCKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
+C S C +C YGI YGD + + D TL D + FG
Sbjct: 193 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDV-----FDGVYFG 247
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
C ++ + +AG++GL D SF Q FS CL S+ L FG
Sbjct: 248 CGENNQGLFT----GVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGS 302
Query: 180 QIIAGKSLNLPP-------------NSFTIKLNGQR-----------GCINDCGSVLTVI 215
I+ +S+ P N I + GQ+ G + D G+V+T +
Sbjct: 303 AGIS-RSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 361
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLVVE 274
+ YA L + F S++ + TCF+L + + P + + F G +V
Sbjct: 362 PPKAYAALRSSFKAKMSKYPTTSGVSILD---TCFDLSGFKTVTIPKVAFSFSGGAVVEL 418
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ + S + I G Q + VYD
Sbjct: 419 GSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYD 461
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 146/351 (41%), Gaps = 56/351 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
+++ + +G P + ++DT + LTW Q +PC++C+EQ DPI++ +Y K+ C ++
Sbjct: 25 FLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSA 84
Query: 68 CKSPFHC----FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL- 122
C +C Y YGD T+ S +T T + + + ++FG S+
Sbjct: 85 CADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITA------TDTAGEEVKFGASVY 138
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFGDQ 180
+ F + GI+GL S QLG ++ ++FS CLV S S + FGD
Sbjct: 139 NTGTFGDTGGE---GILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDA 195
Query: 181 I-----------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ G L++ + + I G G I D G+
Sbjct: 196 AVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTT 255
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPARFNS-FPSMTYHFQGA 269
+T ++ EV+ L A Y SQ + T G+ CFN + FP+MT H G
Sbjct: 256 ITYLQQEVFNALVAA---YTSQ--VRYPTTTSATGLDLCFNTRGTGSPVFPAMTIHLDGV 310
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFT-PRKGKTILGARHQHNTQFVYDLD 319
L + N FI + F A P I G Q N VYDLD
Sbjct: 311 HLELPTANTFISLETNIICLAFASALDFPIA---IFGNIQQQNFDIVYDLD 358
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 148/348 (42%), Gaps = 61/348 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + L+ +LDT + +TW QCQPC CY Q+DP+Y+ SY + C
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSPR 222
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ + G C Y + YGD T V T TL D +PVS N+ GC +
Sbjct: 223 CRDLDAAACRNSTGSCLYEVAYGDGSYT--VGDFATETLTLGDS-APVS--NVAIGCGHD 277
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ-- 180
++ FV + G L++ S ++ FS CLV D S L+FGD
Sbjct: 278 NEGLFVGAAGLLALGGGPLSFPS--------QISATTFSYCLVDRDSPSSSTLQFGDSEQ 329
Query: 181 ------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ G++L++P ++F + G G I D G+ +T ++
Sbjct: 330 PAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTRLQ 389
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQ-GAD 270
Y L F+ + L R GV TC++L R + P++ F+ G +
Sbjct: 390 SGAYGALREAFV-----QGTQSL--PRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGE 442
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
L + +N I + ++ F P +I+G Q + +D
Sbjct: 443 LKLPAKNYLIPVDAAGTYCLAFAGTSGP---VSIIGNVQQQGVRVSFD 487
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 145/346 (41%), Gaps = 55/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P +S + ++D+ + + W QC+PC CY Q DP+++ S+ + C A
Sbjct: 43 YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAV 102
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C G C Y ++YGD TK +L+T T V+N+ GC ++
Sbjct: 103 CDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTF------GRTVVRNVAIGCGHSNR 156
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
FV + S SFM QL + FS CLV + + LEFG + +
Sbjct: 157 GMFVGAAGLLGL-----GGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPV 211
Query: 185 KSLNLP-------PNSFTIKL---------------------NGQRGCINDCGSVLTVIE 216
+ +P P+ + I+L G G + D G+ +T
Sbjct: 212 GAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFP 271
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADL 271
Y FI+ + R GV TC+NL + P+++++F G +
Sbjct: 272 TVAYEAFRNAFIE-------QTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPI 324
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ P N F+ D+ F F A +P G +ILG Q Q D
Sbjct: 325 LTIPANNFLIPVDDAGTFCFAFAPSP-SGLSILGNIQQEGIQISVD 369
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/347 (27%), Positives = 152/347 (43%), Gaps = 59/347 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y L++GIG P + +LDT + ++W QC PC CY+Q+DPI++ S SY + C +
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQ 208
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
CKS C G C Y ++YGD T + +T TL +V+N+ GC ++
Sbjct: 209 CKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL------GSAAVENVAIGCGHNNE 262
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
FV + G L SF Q+ FS CLV D S LEF +
Sbjct: 263 GLFVGAAGLLGLGGGKL-----SFPAQVNAT---SFSYCLVNRDSDAVSTLEFNSPLPRN 314
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G++L +P +SF + G G I D G+ +T +
Sbjct: 315 AATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRS 374
Query: 218 EVYAVLTAEFI---DYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADLV 272
EVY L F+ + + LF TC++L +R + P++++ F +G +L
Sbjct: 375 EVYDALRDAFVKGAKGIPKANGVSLFD------TCYDLSSRESVEIPTVSFRFPEGRELP 428
Query: 273 VEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I + +F F F P + +I+G Q T+ +D+
Sbjct: 429 LPARNYLIPVDSVGTFCFAFAPTTS---SLSIIGNVQQQGTRVGFDI 472
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 62/350 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P + L+ +LDT + +TW QCQPC CY+Q+DP+++ SY + C
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSPR 228
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ + G C Y + YGD T V T TL D +PV+ N+ GC +
Sbjct: 229 CRDLDTAACRNATGACLYEVAYGDGSYT--VGDFATETLTLGDS-TPVT--NVAIGCGHD 283
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG---- 178
++ FV + G L++ S ++ FS CLV D S L+FG
Sbjct: 284 NEGLFVGAAGLLALGGGPLSFPS--------QISASTFSYCLVDRDSPAASTLQFGADGA 335
Query: 179 --DQIIA----------------------GKSLNLPPNSFTI-KLNGQRGCINDCGSVLT 213
D + A G++L++P ++F + +G G I D G+ +T
Sbjct: 336 EADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVT 395
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQG 268
++ YA L F+ L R GV TC++L R + P+++ F+G
Sbjct: 396 RLQSSAYAALRDAFV-----RGTPSL--PRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEG 448
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
+ P ++ + + AF P +I+G Q T+ +D
Sbjct: 449 GGALRLPAKNYLIPVDGAGTYCL--AFAPTNAAVSIIGNVQQQGTRVSFD 496
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/347 (27%), Positives = 151/347 (43%), Gaps = 59/347 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y L++GIG P + +LDT + ++W QC PC CY+Q+DPI++ S SY + C
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAPQ 208
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
CKS C G C Y ++YGD T + +T TL +V+N+ GC ++
Sbjct: 209 CKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL------GTAAVENVAIGCGHNNE 262
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
FV + G L SF Q+ FS CLV D S LEF +
Sbjct: 263 GLFVGAAGLLGLGGGKL-----SFPAQVNAT---SFSYCLVNRDSDAVSTLEFNSPLPRN 314
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G++L +P + F + G G I D G+ +T +
Sbjct: 315 VVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRS 374
Query: 218 EVYAVLTAEFI---DYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADLV 272
EVY L F+ + + LF TC++L +R + P++++HF +G +L
Sbjct: 375 EVYDALRDAFVKGAKGIPKANGVSLFD------TCYDLSSRESVQVPTVSFHFPEGRELP 428
Query: 273 VEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I + +F F F P + +I+G Q T+ +D+
Sbjct: 429 LPARNYLIPVDSVGTFCFAFAPTTS---SLSIMGNVQQQGTRVGFDI 472
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 144/345 (41%), Gaps = 51/345 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P L + DT + LTWTQCQPC ++CY+Q +PI+N SY + C A
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 163
Query: 67 SCKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
+C S C +C YGI YGD + + + TL D + FG
Sbjct: 164 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDV-----FDGVYFG 218
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
C ++ + +AG++GL D SF Q FS CL S+ L FG
Sbjct: 219 CGENNQGLFT----GVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGS 273
Query: 180 QIIAGKSLNLPP-------------NSFTIKLNGQR-----------GCINDCGSVLTVI 215
I+ +S+ P N I + GQ+ G + D G+V+T +
Sbjct: 274 AGIS-RSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 332
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLV-V 273
+ YA L + F S++ + TCF+L + + P + + F G +V +
Sbjct: 333 PPKAYAALRSSFKAKMSKYPTTSGVSILD---TCFDLSGFKTVTIPKVAFSFSGGAVVEL 389
Query: 274 EPENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ +F +F F G + I G Q + VYD
Sbjct: 390 GSKGIFYVFKISQVCLAFAGN--SDDSNAAIFGNVQQQTLEVVYD 432
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 151/353 (42%), Gaps = 61/353 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P +S + LDT + +TW QC PC SCY Q DPIY+ + SY+++ C A
Sbjct: 45 YFARMGIGSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 104
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C++ + C C Y + YGD + +++ L P S +++NI FGC +
Sbjct: 105 CQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGP---NSSTAMRNIAFGCGHSNS 161
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGDQII 182
+ AG++G+ + SF Q+ + FS CLV SR L FG I
Sbjct: 162 GLF----RGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAI 217
Query: 183 ----------------------------AGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
G +L +PP F + NG G I D G+ +T
Sbjct: 218 PFAARFTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTR 277
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFN---LPARFNSFPSMTYHFQ 267
+ YAVL + + ++ GV TCFN LP PS+ HF
Sbjct: 278 VVPAAYAVLRDAY--RAASRNLPP-----APGVYLLDTCFNFQGLPT--VQIPSLVLHFD 328
Query: 268 G-ADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
D+V+ N+ I + +F F P+ P +++G Q + +DL
Sbjct: 329 NDVDMVLPGGNILIPVDRSGTFCLAFAPSSMP---ISVIGNVQQQTFRIGFDL 378
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 147/348 (42%), Gaps = 57/348 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + + +LDT + + W QC+PC+ CY Q DPI+N S+ + C A
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAV 216
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C G C Y +YGD + S T TL SV N+ GC ++
Sbjct: 217 CSQLDAYDCHSGGCLYEASYGD--GSYSTGSFATETL----TFGTTSVANVAIGCGHKNV 270
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ----- 180
++ L + SF Q+G FS CLV + L+FG +
Sbjct: 271 GLFIGAAGLLG----LGAGALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSVPVG 326
Query: 181 ----------------------IIAGKSL--NLPPNSFTI-KLNGQRGCINDCGSVLTVI 215
I G +L ++PP F I + +G G I D G+V+T +
Sbjct: 327 SIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRL 386
Query: 216 ECEVYAVLTAEFIDYFSQ---HDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHF-QGAD 270
Y + F+ Q D +F TC++L +F S P++ +HF GA
Sbjct: 387 VTSAYDAVRDAFVAGTGQLPRTDAVSIFD------TCYDLSGLQFVSVPTVGFHFSNGAS 440
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
L++ +N I + +F F F PA + +I+G Q + + +D
Sbjct: 441 LILPAKNYLIPMDTVGTFCFAFAPAAS---SVSIMGNTQQQHIRVSFD 485
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 144/345 (41%), Gaps = 51/345 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P L + DT + LTWTQCQPC ++CY+Q +PI+N SY + C A
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 191
Query: 67 SCKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
+C S C +C YGI YGD + + + TL D + FG
Sbjct: 192 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDV-----FDGVYFG 246
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
C ++ + +AG++GL D SF Q FS CL S+ L FG
Sbjct: 247 CGENNQGLFT----GVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGS 301
Query: 180 QIIAGKSLNLPP-------------NSFTIKLNGQR-----------GCINDCGSVLTVI 215
I+ +S+ P N I + GQ+ G + D G+V+T +
Sbjct: 302 AGIS-RSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 360
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLV-V 273
+ YA L + F S++ + TCF+L + + P + + F G +V +
Sbjct: 361 PPKAYAALRSSFKAKMSKYPTTSGVSILD---TCFDLSGFKTVTIPKVAFSFSGGAVVEL 417
Query: 274 EPENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ +F +F F G + I G Q + VYD
Sbjct: 418 GSKGIFYVFKISQVCLAFAGN--SDDSNAAIFGNVQQQTLEVVYD 460
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 143/359 (39%), Gaps = 59/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +G+GDP ++DT + L W QC PC+ CY Q P+Y+ R+ K+++++PC
Sbjct: 92 YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASPQ 151
Query: 68 CKSPFH-----CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+ G C Y + YGD + L T TL+ PD+ V N+ GC
Sbjct: 152 CRGVLRYPGCDARTGGCVYMVVYGD--GSASSGDLATDTLVLPDD---TRVHNVTLGCGH 206
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII 182
+++ ++ AG++G SF QL FS CL D+ +R ++
Sbjct: 207 DNEGLLASA----AGLLGAGRGQLSFPTQLAPAYGHVFSYCL--GDRMSRAR-NSSSYLV 259
Query: 183 AGKSLNLPPNSFT---------------------------------IKLN---GQRGCIN 206
G++ LP +FT + LN G+ G +
Sbjct: 260 FGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVV 319
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF----NLPARFNSFPSM 262
D G+ ++ + YA + F+ + + + +L TC+ N P PS+
Sbjct: 320 DSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPSI 379
Query: 263 TYHF-QGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF AD+ + N I D +F G +LG Q V+D++
Sbjct: 380 VLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVE 438
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 151/350 (43%), Gaps = 63/350 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G+P + + +LDT + + W QCQPC CY+Q DPI++ + +Y + C
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQ 220
Query: 68 CKS--PFHCFEGDCFYGITYGDVYET-----KEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C S C G C Y + YGD T E S S SV+N+ GC
Sbjct: 221 CSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG----------SVKNVALGC 270
Query: 121 SLESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG- 178
+++ FV + G L+ + +L FS CLV D + S L+F
Sbjct: 271 GHDNEGLFVGAAGLLGLGGGPLSLTN--------QLKATSFSYCLVNRDSAGSSTLDFNS 322
Query: 179 -----DQIIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCGSV 211
D + A G+ +++P ++F + +G G I DCG+
Sbjct: 323 AQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTA 382
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGA 269
+T ++ + Y L F+ +Q+ KL + TC++L + + P++++HF G
Sbjct: 383 ITRLQTQAYNPLRDAFV-RMTQN--LKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGK 439
Query: 270 DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I + ++ F F P + +I+G Q T+ +DL
Sbjct: 440 SWNLPAANYLIPVDSAGTYCFAFAPTTS---SLSIIGNVQQQGTRVTFDL 486
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 142/367 (38%), Gaps = 72/367 (19%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ L +G P ++DT + L WTQC+PC C+ Q P+++ + +Y LPC
Sbjct: 113 NGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCS 172
Query: 65 DASCK----------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
A C S C Y TYGD T+ V + +T TL + V
Sbjct: 173 SALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTL------ARQKVP 226
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR 174
+ FGC ++ Q AG++GL S + QLG DRFS CL D +
Sbjct: 227 GVAFGCGDTNEGDGFTQG---AGLVGLGRGPLSLVSQLGI---DRFSYCLTSLDDAAGRS 280
Query: 175 -----------------------------------LEFGDQIIAGKSLNLPPNSFTIKLN 199
+ + L LP ++F I+ +
Sbjct: 281 PLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDD 340
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARF-- 256
G G I D G+ +T +E Y L F+ + S ++ + G+ CF PA
Sbjct: 341 GTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVD----ASEIGLDLCFQGPAGAVD 396
Query: 257 ----NSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHN 311
P + HF GADL + EN + + +G +I+G Q N
Sbjct: 397 QDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCL---TVMASRGLSIIGNFQQQN 453
Query: 312 TQFVYDL 318
QFVYD+
Sbjct: 454 FQFVYDV 460
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 153/351 (43%), Gaps = 65/351 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G+P + + +LDT + + W QCQPC CY+Q DPI++ + +Y + C
Sbjct: 20 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQ 79
Query: 68 CKS--PFHCFEGDCFYGITYGDV------YETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C S C G C Y + YGD + T+ V ++ SV+N+ G
Sbjct: 80 CSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG-----------SVKNVALG 128
Query: 120 CSLESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
C +++ FV + G L+ + +L FS CLV D + S L+F
Sbjct: 129 CGHDNEGLFVGAAGLLGLGGGPLSLTN--------QLKATSFSYCLVNRDSAGSSTLDFN 180
Query: 179 ------DQIIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCGS 210
D + A G+ +++P ++F + +G G I DCG+
Sbjct: 181 SAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGT 240
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QG 268
+T ++ + Y L F+ +Q+ KL + TC++L + + P++++HF G
Sbjct: 241 AITRLQTQAYNPLRDAFV-RMTQN--LKLTSAVALFDTCYDLSGQASVRVPTVSFHFADG 297
Query: 269 ADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I + ++ F F P + +I+G Q T+ +DL
Sbjct: 298 KSWNLPAANYLIPVDSAGTYCFAFAPTTSSL---SIIGNVQQQGTRVTFDL 345
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 155/347 (44%), Gaps = 47/347 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P + ++DT + + W QC+PC+ CY Q P +N SYK + C
Sbjct: 87 YIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSKL 146
Query: 68 CKSPFHCF---EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+S + +C Y I YG+ ++ SL+T T L PVS GC +
Sbjct: 147 CQSVRDTSCNDKKNCEYSINYGNQSHSQGDLSLETLT-LESTTGRPVSFPKTVIGCGTNN 205
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH------SRLEFG 178
+ K++ +G++GL S + QLG + +FS CLV+ + S+L FG
Sbjct: 206 ---IGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKLNFG 262
Query: 179 D-QIIAGKSLNLPP--------------NSFTI-----------KLNGQRGCINDCGSVL 212
D I++G ++ P +F++ K + I D +++
Sbjct: 263 DVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEEGNIIIDSSTIV 322
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADL 271
T + +VY L + +D + +E++ + C+N+ + FP MT HF+GAD+
Sbjct: 323 TFVPSDVYTKLNSAIVDLVT---LERVDDPNQQFSLCYNVSSDEEYDFPYMTAHFKGADI 379
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ N F+ +D F AF P G I G+ Q + YDL
Sbjct: 380 LLYATNTFVEVARDVLCF----AFAPSNGGAIFGSFSQQDFMVGYDL 422
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 147/348 (42%), Gaps = 57/348 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P++ + +LDT + + W QC+PC CY Q DPI+N S+ L C A
Sbjct: 197 YFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAV 256
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C ++C G C Y ++YGD T + S T L SV+N+ GC ++
Sbjct: 257 CSYLDAYNCHGGGCLYKVSYGDGSYT--IGSFATEML----TFGTTSVRNVAIGCGHDNA 310
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ---- 180
FV + SF QLG FS CLV LEFG +
Sbjct: 311 GLFVGAAGLLGL-----GAGLLSFPSQLGTQTGRAFSYCLVDRFSESSGTLEFGPESVPL 365
Query: 181 -----------------------IIAGKSL--NLPPNSFTI-KLNGQRGCINDCGSVLTV 214
I G +L ++PP+ F I + +G+ G I D G+ +T
Sbjct: 366 GSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAVTR 425
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPAR-FNSFPSMTYHF-QGAD 270
++ VY + F+ +L + TC++L + P++ +HF GA
Sbjct: 426 LQTPVYDAVRDAFV-----AGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHFSNGAS 480
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
L++ +N I + +F F F PA + +I+G Q + +D
Sbjct: 481 LILPAKNYMIPMDFMGTFCFAFAPATS---DLSIMGNIQQQGIRVSFD 525
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 153/355 (43%), Gaps = 57/355 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN+ ++LG K++ ++DT + LTW QCQPC+SCY Q P+Y+ SYK +
Sbjct: 135 TLNYIVTVELG----GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVF 190
Query: 63 CYDASCK---------SPFHCFEG----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
C ++C+ P F G C Y ++YGD T+ L + +++ D
Sbjct: 191 CNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTR--GDLASESIVLGD--- 245
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
++N+ FGC +K +G+MGL S S + Q + FS CL +
Sbjct: 246 -TKLENLVFGCGRNNKGLFGGA----SGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED 300
Query: 170 SFHSRLEFGDQIIAGKS--------------------LNLPPNSFT----IKLNGQRGCI 205
L FG+ K+ LNL S L+ RG +
Sbjct: 301 GASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGIL 360
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+V+T + +Y + EF+ FS ++ TCFNL + + S P++
Sbjct: 361 IDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILD---TCFNLTSYEDISIPTIKM 417
Query: 265 HFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
F+G A+L V+ VF F D+ A + + I+G Q N + +YD
Sbjct: 418 IFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYD 472
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/346 (25%), Positives = 141/346 (40%), Gaps = 57/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P ++ +LDT + + W QC PC CY Q DPI+ S SY L C
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQ 203
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+S C C Y ++YGD T +T TL SV N+ GC ++
Sbjct: 204 CQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITL------GSASVDNVAIGCGHNNE 257
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
F+ + G L++ S ++ FS CLV D S LEF +
Sbjct: 258 GLFIGAAGLLGLGGGKLSFPS--------QINASSFSYCLVDRDSDSASTLEFNSALLPH 309
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ L++P + F + +G G I D G+ +T ++
Sbjct: 310 AITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQT 369
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGADLVVE 274
Y L F+ + L + + TC++L + + P++T+H G ++
Sbjct: 370 AAYNALRDAFV-----KGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPL 424
Query: 275 PENVFIF--NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P ++ + +F F F P +I+G Q T+ +DL
Sbjct: 425 PATNYLIPVDSDGTFCFAFAPT---SSALSIIGNVQQQGTRVGFDL 467
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 146/342 (42%), Gaps = 60/342 (17%)
Query: 25 LDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS-CKSPFHCF-EGDCFYG 82
LD GL+W QC PC+ C Q P+++ ++ +P ++ C+ P+ G C +
Sbjct: 115 LDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGACGFD 174
Query: 83 ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLN 142
I Y D + DT + P V + I FGC+ +++ F + ++ +AGI+GL
Sbjct: 175 IAYRDNTHASGYLARDTFS-FPAGNDDFVPLSAIVFGCAHQTEHFKN--QRAVAGILGLG 231
Query: 143 WD-----STSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI---------------- 181
T+F Q+ RFS C P S +S L FG I
Sbjct: 232 MGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSHPPPNVHRQSTPVL 291
Query: 182 ------------IAGKSL------NLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVL 223
+AG S+ + P F +G GC+ D G+ +T Y
Sbjct: 292 APAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSAYV-- 349
Query: 224 TAEFIDYFSQHDIEK----LFTCRKCGVTCFNLPA-RFNSFPSMTYHFQ-GADLVVEPEN 277
ID+ + +++ + R G TC PA + PSMT HF+ GA L V PE+
Sbjct: 350 ---HIDHAVRQHLQRRGAHIVVVR--GNTCVQQPAPHHDVLPSMTLHFENGAWLRVMPEH 404
Query: 278 VFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
VF+ F + FG F T++GAR Q N +F++DL
Sbjct: 405 VFMPFVVGGHHYQCFG--FVSSTDLTVIGARQQVNHRFIFDL 444
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 134/341 (39%), Gaps = 48/341 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P + L + DT + L+W QC+PC +CY+Q+DP+++ +Y +PC
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQE 247
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C C G C Y + YGD+ +T + DT TL PS +Q FGC +
Sbjct: 248 CLDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTL----GPSSDQLQGFVFGCGDDDTGL 303
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL----------------VQPDKSF 171
G+ GL D S Q FS CL P F
Sbjct: 304 FGRAD----GLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAEGYLSLGSAAAPPHAQF 359
Query: 172 HSRLEFGDQ-----------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVY 220
+ + D +AG+++ + P F G + D G+V+T + Y
Sbjct: 360 TAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRLPSRAY 414
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPENV 278
+ L + F + ++ + TC++ R PS+ F GA L + V
Sbjct: 415 SALRSSFAGFMRRYKRAPALSILD---TCYDFTGRTKVQIPSVALLFDGGATLNLGFGGV 471
Query: 279 -FIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ N + F G ILG Q VYDL
Sbjct: 472 LYVANRSQACLAFASNGDDTSVG--ILGNMQQKTFAVVYDL 510
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 156/348 (44%), Gaps = 55/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P + + DT + L WTQC+PC CY Q DP+++ ++ +YK + C +
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQ 153
Query: 68 C---KSPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C ++ C D C Y +YGD TK ++DT TL D PV ++NI GC
Sbjct: 154 CTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDT-RPVQLKNIIIGC-- 210
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFG-D 179
+ K +GI+GL + S + QLG + +FS CLV + S++ FG +
Sbjct: 211 -GHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTN 269
Query: 180 QIIAGKSLNLPP--------------NSFTI-----------KLNGQRGCINDCGSVLTV 214
+++G + P S ++ +G+ I D G+ LT+
Sbjct: 270 AVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTL 329
Query: 215 IECEVYAVL---TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADL 271
+ E Y+ L A ID + D + + C++ P++T HF GAD+
Sbjct: 330 LPTEFYSELEDAVASSIDAEKKQDPQTGLSL------CYSATGDLK-VPAITMHFDGADV 382
Query: 272 VVEPENVFIFNHQDSFFFFF--GPAFTPRKGKTILGARHQHNTQFVYD 317
++P N F+ +D F F P+F +I G Q N YD
Sbjct: 383 NLKPSNCFVQISEDLVCFAFRGSPSF------SIYGNVAQMNFLVGYD 424
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 152/369 (41%), Gaps = 69/369 (18%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
T + YM K+ +G P +DT + +TW QCQPC+ CY Q+ P+++ R SY+++
Sbjct: 129 TTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREM- 187
Query: 63 CYDA-SCKSPFHCFEGD-----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
YDA C++ GD C Y + YGD T D ++ + V V ++
Sbjct: 188 GYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTF----AGGVQVPHM 243
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD--RFSCCLV-----QPDK 169
GC ++K + AGI+GL S Q+ L + FS CL P +
Sbjct: 244 SIGCGHDNKGLFAAPA---AGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGR 300
Query: 170 SFHSRLEFGDQIIAGKSLNLPPNSFT---------------------------------I 196
S S L GD AG PP SFT +
Sbjct: 301 SVSSTLTIGDGAAAGS----PPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDL 356
Query: 197 KLN---GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFN 251
KL+ G+ G I D G+ +T + Y + + D+ ++ G TC+
Sbjct: 357 KLDPYTGRGGVILDSGTAVTRLARRAY--IAFRDAFRAAAVDLGQVSIGGPSGFFDTCYT 414
Query: 252 LPARFNSFPSMTYHFQGA-DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQ 309
+ R P+++ HF G +L + P+N I + + F F A T + +I+G Q
Sbjct: 415 MGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAF--AGTGDRSVSIIGNIQQ 472
Query: 310 HNTQFVYDL 318
+ VY++
Sbjct: 473 QGFRVVYNI 481
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 152/357 (42%), Gaps = 56/357 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++ + +G P S+ + DT + L W QC+PC SCYEQ +PI++ K+Y+ L C
Sbjct: 92 NGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCE 151
Query: 65 DASCKS---PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
SC + C + + C Y +YGD T ++DT T + PVSV + FGC
Sbjct: 152 GKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLT-IGSTTGRPVSVPKVVFGC 210
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFG 178
+ + +G++GL S + QL L+ RFS CLV D S S++ FG
Sbjct: 211 GHNNGGTF---ELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFG 267
Query: 179 DQ-IIAGKS----------------LNLPPNSFTIKLNGQRG---------------CIN 206
+ I++G L L S K +G I
Sbjct: 268 SRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIII 327
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQ---HDIEKLFTCRKCGVTCFNLPARFNSFPSMT 263
D G+ LT++ + Y L + + D +F+ ++ + P++T
Sbjct: 328 DSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYSNLSGLRI-------PTIT 380
Query: 264 YHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
HF GADL ++P N F+ +D F F A P I G Q N YDL +
Sbjct: 381 AHFVGADLELKPLNTFVQVQEDLFCF----AMIPVSDLAIFGNLAQMNFLVGYDLKS 433
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 151/344 (43%), Gaps = 43/344 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ IG P L+ ++DT W QC PCK C+ P+++ +YK +PC
Sbjct: 89 YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPK 148
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
CK+ HC D C Y TYG ++ S+DT TL ++ +P+S +NI GC
Sbjct: 149 CKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNND-TPISFKNIVIGCGH 207
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFGDQ 180
+K + + ++G +GL SF+ QL + +FS CLV ++ +L FGD+
Sbjct: 208 RNKGPL---EGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLHFGDK 264
Query: 181 ------------IIAGK-----SLN--------LPPNSFTIKLNGQRGCINDCGSVLTVI 215
I AG+ +LN + + T K + I D G+ LT++
Sbjct: 265 SVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGTTLTIL 324
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
VY+ L + S +E+ + + C+ + P +T HF GAD+ +
Sbjct: 325 PENVYSRLESIVT---SMVKLERAKSPNQQFKLCYKATLKNLDVPIITAHFNGADVHLNS 381
Query: 276 ENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
N F +H+ F F P TI+G Q N +DL
Sbjct: 382 LNTFYPIDHEVVCFAFVSVGNFP---GTIIGNIAQQNFLVGFDL 422
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/337 (26%), Positives = 132/337 (39%), Gaps = 59/337 (17%)
Query: 25 LDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFHCFEGDCFYG 82
+DT + L WTQC PC C +Q P ++ + +Y+ LPC + C S CF+ C Y
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60
Query: 83 ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLESKDFVSIQKKIIAGIMGL 141
YGD T V + +T T + + V NI FGC SL + D + +G++G
Sbjct: 61 YYYGDTASTAGVLANETFTFGAANS-TKVRATNIAFGCGSLNAGDLAN-----SSGMVGF 114
Query: 142 NWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI-------------------- 181
S + QLG P RFS CL + SRL FG
Sbjct: 115 GRGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVI 171
Query: 182 ----------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTA 225
+ K L + P F I +G G I D G+ +T ++ + Y +
Sbjct: 172 NPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRR 231
Query: 226 EFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFN---SFPSMTYHFQGADLVVEPENVFIF 281
+ + G+ TCF P N + P + +HF A++ + PEN +
Sbjct: 232 GLVSAIPLPAMND----TDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLI 287
Query: 282 NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ P TI+G Q N +YD+
Sbjct: 288 ASTTGYLCL---VMAPTGVGTIIGNYQQQNLHLLYDI 321
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 140/353 (39%), Gaps = 57/353 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P + +LDT + L WTQC PC C +Q P ++ SY KLPC
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPM 148
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLES 124
C + ++ C+ C Y YGD T V S +T T D + V+V I FGC +L +
Sbjct: 149 CNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTND--TRVTVPRIAFGCGNLNA 206
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
+ +G++G S + QLG RFS CL SRL FG
Sbjct: 207 GSLFN-----GSGMVGFGRGPLSLVSQLGS---PRFSYCLTSFMSPVPSRLYFGAYATLN 258
Query: 182 ---------------------------------IAGKSLNLPPNSFTIK-LNGQRGCIND 207
+ G+ L + P+ F I +G G I D
Sbjct: 259 STSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIID 318
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA---RFNSFPSMTY 264
GS +T + Y ++ F D + + TCF P + + P + +
Sbjct: 319 SGSTITYLARAAYDMVHQAFADQVGL-PLTNATSLADVLDTCFVWPPPPRKIVTMPELAF 377
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
HF+GA++ + EN + + A +I+G+ N +YD
Sbjct: 378 HFEGANMELPLENYMLIDGDTGNLCL---AIAASDDGSIIGSFQHQNFHVLYD 427
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 151/338 (44%), Gaps = 40/338 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L +G P + + DT + L WTQC+PC CY Q DP+++ ++ +YK + C +
Sbjct: 94 YLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQ 153
Query: 68 C---KSPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C ++ C D C Y ++Y D T ++DT TL D PV ++NI GC
Sbjct: 154 CTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDN-RPVQLKNIIIGCG- 211
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-DQI 181
+ V+ + K + + S + QLG + +FS CLV P+ S++ FG + +
Sbjct: 212 -QNNAVTFRNKSSGVVGLGGG-AVSLIKQLGDSIDGKFSYCLV-PENDQTSKINFGTNAV 268
Query: 182 IAG---------------------KSLNL-PPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
++G KS+++ N T N + + D G+ LT++ +
Sbjct: 269 VSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLTLLPVKY 328
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVF 279
Y + + +K R C+N A N P +T HF+GAD+ + P N F
Sbjct: 329 YIEIENAVASLI---NADKSKDERIGSSLCYNATADLN-IPVITMHFEGADVKLYPYNSF 384
Query: 280 IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+D FG +F R G I G Q N YD
Sbjct: 385 FKVTEDLVCLAFGMSFY-RNG--IYGNVAQKNFLVGYD 419
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/347 (25%), Positives = 144/347 (41%), Gaps = 78/347 (22%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++K+ IG+P L+ + DT + L WT + N F+ C +
Sbjct: 91 YLVKVRIGNPGIPLYLVPDTGSALIWT--------------VNNQNIFQ------CRNNK 130
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C +G G+ D+ +++ + + FGCS ++++F
Sbjct: 131 CSYTRRYDDGSITTGVAAQDILQSEGSERIP-----------------FYFGCSRDNQNF 173
Query: 128 VSIQKK-IIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------VQPDKSFHSRLEFGDQ 180
+ G+MGLN S + QL + RFS CL +P S S L FG+
Sbjct: 174 SVFEHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPPPS--SLLRFGND 231
Query: 181 I-----------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
I +AG+ L+LPP +F ++ +G G I D G+
Sbjct: 232 IRKGRRRFQSTPLMSSPDRPNYFLNLLDMTVAGQRLHLPPGTFALRQDGTGGTIIDSGTG 291
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADL 271
LT I Y L + F +YF +++ F F+ SMT+HF+ AD
Sbjct: 292 LTFITQTAYPRLISAFQNYFDHRGFQRVHIPEFDLCYSFRGNHTFHDHASMTFHFERADF 351
Query: 272 VVEPENVFIFNHQDSFF-FFFGPAFTPRKGKTILGARHQHNTQFVYD 317
V+ + V++ D+ F P TP + +T++GA +Q NT+F+YD
Sbjct: 352 TVQADYVYLPMEDDNAFCVALQP--TPPQQRTVIGAINQGNTRFIYD 396
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 153/353 (43%), Gaps = 58/353 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P ++ +DT + + W QCQPC +C+ Q PI+N SYK +PC ++
Sbjct: 89 YLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSST 148
Query: 68 CK----SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV--QNIRFG 119
CK + C G C Y ITYG +++ S D+ TL D S SV NI G
Sbjct: 149 CKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTL---DSTSGSSVLFPNIVIG 205
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG-RLVPDRFSCCLV--QPDKSFHSRLE 176
C + V +G++G+ S + Q+G V +FS CL+ D + S+L
Sbjct: 206 CGHIN---VLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLI 262
Query: 177 FG-DQIIAGKSLNLPPNSFTIKLNGQ----------------------------RGCIND 207
FG D +++G+ + P +K+NGQ + + D
Sbjct: 263 FGEDVVVSGEIVVSTP---MVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQNILID 319
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQH-DIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
G+ LT++ + ++ + Y +Q + ++ C+N + + P +T HF
Sbjct: 320 SGTPLTMLP----NLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHF 375
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
GAD+ + F F +D F F G I G Q+N YDL+
Sbjct: 376 NGADVKLNSNGTF-FPFEDGIMCF---GFISSNGLEIFGNIAQNNLLIDYDLE 424
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 152/345 (44%), Gaps = 50/345 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + + DT + L WTQC PC+ CY+Q P+++ + +Y+K+ C +
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145
Query: 68 CKS--PFHCF--EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C++ C E C Y ITYGD TK ++DT T+ PVS++N+ GC E
Sbjct: 146 CRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGR-RPVSLRNMIIGCGHE 204
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFG-DQ 180
+ +GI+GL STS + QL + + +FS CLV + S++ FG +
Sbjct: 205 NTGTF---DPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNG 261
Query: 181 IIAGKS----------------LNLPPNSF---------TIKLNGQRGCINDCGSVLTVI 215
I++G LNL S TI G+ + D G+ LT++
Sbjct: 262 IVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTLL 321
Query: 216 ECEVYAVL---TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLV 272
Y L A I D + + + C+ + F P +T HF+G D+
Sbjct: 322 PSNFYYELESVVASTIKAERVQDPDGILSL------CYRDSSSFK-VPDITVHFKGGDVK 374
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ N F+ +D F AF + TI G Q N YD
Sbjct: 375 LGNLNTFVAVSEDVSCF----AFAANEQLTIFGNLAQMNFLVGYD 415
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 146/353 (41%), Gaps = 53/353 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++LG+G P +SL+ ++DT + L W QCQPCKSCY+Q DPI++ R+ S++++PC
Sbjct: 129 YFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 188
Query: 68 CKS-PFHCFEGD------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
CK+ H G C Y + YGD + S D TL + ++ FGC
Sbjct: 189 CKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTL-----GTGSKAMSVAFGC 243
Query: 121 SLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLE 176
+++ F + G L++ S F + FS CLV P S L
Sbjct: 244 GFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLI 303
Query: 177 FGDQII----------------------------AGKSLNLPPNSFTIKLNGQRGCINDC 208
FG I G L + S + +G G I D
Sbjct: 304 FGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDS 363
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ 267
G+ +T VYA + F + + ++ TC+N + + P++ HF+
Sbjct: 364 GTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFD---TCYNFSGKASVDVPALVLHFE 420
Query: 268 -GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GADL + P N I N SF F P I+G Q + + +DL
Sbjct: 421 NGADLQLPPTNYLIPINTAGSFCLAFAPTSMELG---IIGNIQQQSFRIGFDL 470
>gi|255563737|ref|XP_002522870.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537954|gb|EEF39568.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 341
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/248 (29%), Positives = 113/248 (45%), Gaps = 41/248 (16%)
Query: 107 EPSPVSVQNIRFGCSLESKDFVSIQKK-IIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
EP+ S FGCS ++++F + + GIMGLN S + QL + RFS CL
Sbjct: 80 EPTNNSRILFNFGCSKDNRNFSAFSRTGKTDGIMGLNMSPVSILQQLRNVTNQRFSYCLT 139
Query: 166 ----QPDKSFHSRLEFGDQI-----------------------------IAGKSLNLPPN 192
+P + S L FG+ I +AG+ L LPP
Sbjct: 140 PYGSRPPAT--SLLRFGNDISTWGRGFYSTPFVDPPDMPNYFLNLLDLSVAGQRLRLPPE 197
Query: 193 SFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL 252
+F +K +G G I D G+ LT++ Y L ++F H ++ +N
Sbjct: 198 TFALKRDGTGGTIIDSGTGLTLVVQPAYRHLLGALQNHFDHHGFHRVHIPDTNLELRYNF 257
Query: 253 PAR--FNSFPSMTYHFQGADLVVEPENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQ 309
F + S+TYHFQGAD VEP + ++N +++F + +G+ I+GA HQ
Sbjct: 258 AQNRTFQNHASLTYHFQGADFTVEPRYAYVVYNDENAFCVALLASHI--EGRAIIGALHQ 315
Query: 310 HNTQFVYD 317
NT+FVY+
Sbjct: 316 ANTRFVYN 323
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 143/360 (39%), Gaps = 69/360 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYD- 65
Y++ L IG P S + DT + L WTQC PC S C++Q YN S ++ LPC
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSS 147
Query: 66 ----ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSVQNIRFG 119
A+ P C Y TYG + T + S++T T P D+ V I FG
Sbjct: 148 VSMCAALAGPSPPPGCSCMYNQTYGTGW-TAGIQSVETFTFGSTPADQ---TRVPGIAFG 203
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR------LVP----DRFSCCLVQPDK 169
CS S D AG++GL S S + QLG L P + S L+ P
Sbjct: 204 CSNASSD----DWNGSAGLVGLGRGSMSLVSQLGAGMFSYCLTPFQDANSTSTLLLGPSA 259
Query: 170 SFHSR---------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
+ + L I +L++PPN+F ++ +G G I D
Sbjct: 260 ALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDS 319
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-------CFNLPARFN---S 258
G+ +T + Y + A IE L T + CF L + + S
Sbjct: 320 GTTITSLVDAAYQQVRAA---------IESLVTLPVADGSDSTGLDLCFALTSETSTPPS 370
Query: 259 FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
PSMT+HF GAD+V+ +N I S + + G Q N +YD+
Sbjct: 371 MPSMTFHFDGADMVLPVDNYMILG---SGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDI 427
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 154/355 (43%), Gaps = 50/355 (14%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
M + Y++K +G P + + DT + L WTQC+PC CYEQ+ P+++ +S +Y+
Sbjct: 85 MISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRD 144
Query: 61 LPCYDASC---KSPFHCF-EGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
+ C C K C EG+ C Y +YGD T + DT T L PV +
Sbjct: 145 ISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTIT-LGSTSGRPVLLP 203
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFH 172
GC + S +K I+GL S + QLG + +FS CLV + +
Sbjct: 204 KAIIGCGHNNGG--SFTEKGSG-IVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNS 260
Query: 173 SRLEFG-DQIIAGKSLNLPP-------------------NSFTIKLNG------QRGCIN 206
S+L FG + I++G + P S IK G + I
Sbjct: 261 SKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIII 320
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTY 264
D G+ LT+ + ++ L++ D + +E G+ C+++ A FPS+T
Sbjct: 321 DSGTTLTLFPEDFFSELSSAVQDAVAGTPVED-----PSGILSLCYSIDADLK-FPSITA 374
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF GAD+ + P N F+ D+ F AF P I G Q N YDL+
Sbjct: 375 HFDGADVKLNPLNTFV-QVSDTVLCF---AFNPINSGAIFGNLAQMNFLVGYDLE 425
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 141/347 (40%), Gaps = 57/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P +S + ++D+ + + W QCQPC CY Q+DP+++ S+ + C +
Sbjct: 140 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSV 199
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C G C Y ++YGD TK +L+T T V+++ GC ++
Sbjct: 200 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF------GRTMVRSVAIGCGHRNR 253
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
FV + S SF+ QLG FS CLV L FG + +
Sbjct: 254 GMFVGAAGLLGL-----GGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREALPA 308
Query: 185 KSLNLP-------PNSFTIKLN---------------------GQRGCINDCGSVLTVIE 216
+ +P P+ + I L G G + D G+ +T +
Sbjct: 309 GAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLP 368
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADL 271
Y F+ + R GV TC++L + P+++++F G +
Sbjct: 369 TLAYQAFRDAFL-------AQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPI 421
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPR-KGKTILGARHQHNTQFVYD 317
+ P F+ D+ F F AF P G +ILG Q Q +D
Sbjct: 422 LTLPARNFLIPMDDAGTFCF--AFAPSTSGLSILGNIQQEGIQISFD 466
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 137/328 (41%), Gaps = 38/328 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P +S + ++D+ + + W QCQPC CY Q+DP+++ S+ + C +
Sbjct: 201 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSV 260
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C G C Y ++YGD TK +L+T T V+++ GC ++
Sbjct: 261 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF------GRTMVRSVAIGCGHRNR 314
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---------QPDKSFHSRL 175
FV + G S SF+ QLG FS CLV P +
Sbjct: 315 GMFVGAAGLLGLGGG-----SMSFVGQLGGQTGGAFSYCLVSAAWVPLVRNPRAPSFYYI 369
Query: 176 EFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHD 235
+ G + + F + G G + D G+ +T + Y F+
Sbjct: 370 GLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFL------- 422
Query: 236 IEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFF 290
+ R GV TC++L + P+++++F G ++ P F+ D+ F
Sbjct: 423 AQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFC 482
Query: 291 FGPAFTPR-KGKTILGARHQHNTQFVYD 317
F AF P G +ILG Q Q +D
Sbjct: 483 F--AFAPSTSGLSILGNIQQEGIQISFD 508
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 140/352 (39%), Gaps = 63/352 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ G G P K+ ++DT + LTW QC+PC CY Q D I+ + SYK LPC A+
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSAT 196
Query: 68 C-------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C +P C G C Y I YGD ++ S +T TL S QN FGC
Sbjct: 197 CTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTL------GSDSFQNFAFGC 250
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD------------ 168
+ K +G++GL +S SF Q +F+ CL PD
Sbjct: 251 GHTNTGLF----KGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCL--PDFGSSTSTGSFSV 304
Query: 169 --KSFHSRLEFGDQI-----------------IAGKSLNLPPNSFTIKLNGQRGCINDCG 209
S + F + + G L++PP + G+ I D G
Sbjct: 305 GKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPP-----AVLGRGSTIVDSG 359
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ- 267
+V+T + + Y L F K F+ TC++L P++T+HFQ
Sbjct: 360 TVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILD---TCYDLSRHSQVRIPTITFHFQN 416
Query: 268 GADLVVEPENVF--IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
AD+ V + + N F A + G I+G Q + +D
Sbjct: 417 NADVAVSDVGILVPVQNGGSQVCLAFASA-SQMDGFNIIGNFQQQRMRVAFD 467
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 139/359 (38%), Gaps = 65/359 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P ++DT + L WTQC PC C +Q P + +Y+ +PC
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 68 CKS-PF-HCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + P+ CF+ C Y YGD T V + +T T + S V V ++ FGC
Sbjct: 152 CAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANS-SKVMVSDVAFGCG--- 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG------ 178
+ S Q +G++GL S + QLG P RFS CL SRL FG
Sbjct: 208 -NINSGQLANSSGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLN 263
Query: 179 -------------------------------DQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
+ K L + P F I +G G D
Sbjct: 264 GTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFID 323
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFS----QHDIEKLFTCRKCGVTCFNLPARFN---SFP 260
G+ LT ++ + Y + E + +D E TCF P + + P
Sbjct: 324 SGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLE------TCFPWPPPPSVAVTVP 377
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
M HF GA++ V PEN + + F A TI+G Q N +YD+
Sbjct: 378 DMELHFDGGANMTVPPENYMLIDGATGFLCL---AMIRSGDATIIGNYQQQNMHILYDI 433
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 142/353 (40%), Gaps = 55/353 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P L DT + LTWTQCQPCK C+ Q+ P+Y+ + ++ +PC A+
Sbjct: 77 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSAT 136
Query: 68 CKSPFHCF-----EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C YG +Y D + + +T TL VSV ++ FGC
Sbjct: 137 CLPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGT 196
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------------------ 164
++ G +GL + S + QLG +FS CL
Sbjct: 197 DNGG----DSLNSTGTVGLGRGTLSLLAQLGV---GKFSYCLTDFFNSTLDSPFLLGTLA 249
Query: 165 --------VQPDKSFHSRLEFGDQIIAGKSLNL-------PPNSFTIKLNGQRGCINDCG 209
VQ S L +++ + + L P +F + N G + D G
Sbjct: 250 ELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSG 309
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA---RFNSFPSMTYHF 266
+ +++ + V+ +D+ +Q + CF PA + P + HF
Sbjct: 310 TTFSILPESGFRVV----VDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPFMPDLVLHF 365
Query: 267 QG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
G AD+ + +N +N +DS F T ++LG Q N Q ++D+
Sbjct: 366 AGGADMRLHRDNYMSYNQEDSSFCLNIVGTT--STWSMLGNFQQQNIQMLFDM 416
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 95/373 (25%), Positives = 149/373 (39%), Gaps = 84/373 (22%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F Y++++ +G P S+ + DT + + WTQC+PC +CY+QN P+++ +YK
Sbjct: 76 IFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKN 135
Query: 61 LPCYDASCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
+ +C SP + GD C Y I YGD ++ ++DT T+ PV
Sbjct: 136 V-----ACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQ-STSGRPV 189
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPD 168
+ GC D ++GI+GL S + QLG +FS CL+
Sbjct: 190 AFPRTVIGC---GHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGS 246
Query: 169 KSFHSRLEFGDQI-IAGK-----------------SLNLPPNSF----------TIKLNG 200
+ ++L FG ++G SL L S KL G
Sbjct: 247 TNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGG 306
Query: 201 QRGCINDCGSVLTVIECEV---------------YAVLTAEFIDYFSQHDIEKLFTCRKC 245
+ I D G+ LT + + +A +EF+DY
Sbjct: 307 ESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDY--------------- 351
Query: 246 GVTCFNLPARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILG 305
CF P +T HF+GAD+ ++ EN+F+ D+ FG +F P I G
Sbjct: 352 ---CFATTTDDYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAFG-SF-PDDNIFIYG 406
Query: 306 ARHQHNTQFVYDL 318
Q N YD+
Sbjct: 407 NIAQSNFLVGYDI 419
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 139/359 (38%), Gaps = 65/359 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P ++DT + L WTQC PC C +Q P + +Y+ +PC
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 68 CKS-PF-HCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + P+ CF+ C Y YGD T V + +T T + S V V ++ FGC
Sbjct: 152 CAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANS-SKVMVSDVAFGCG--- 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG------ 178
+ S Q +G++GL S + QLG P RFS CL SRL FG
Sbjct: 208 -NINSGQLANSSGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLN 263
Query: 179 -------------------------------DQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
+ K L + P F I +G G D
Sbjct: 264 GTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFID 323
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFS----QHDIEKLFTCRKCGVTCFNLPARFN---SFP 260
G+ LT ++ + Y + E + +D E TCF P + + P
Sbjct: 324 SGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLE------TCFPWPPPPSVAVTVP 377
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
M HF GA++ V PEN + + F A TI+G Q N +YD+
Sbjct: 378 DMELHFDGGANMTVPPENYMLIDGATGFLCL---AMIRSGDATIIGNYQQQNMHILYDI 433
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 94/349 (26%), Positives = 143/349 (40%), Gaps = 56/349 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P + ++ +LDT + + W QC PCK CY Q DP++N +S+ +PC
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPL 206
Query: 68 CK---SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ SP + C Y ++YGD T S +T T V + GC +
Sbjct: 207 CRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTF------RGTRVGRVALGCGHD 260
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGDQ 180
++ ++ L SF Q+GR +FS CLV D+S S+ + FGD
Sbjct: 261 NEGLFIGAAGLLG----LGRGRLSFPSQIGRRFSRKFSYCLV--DRSASSKPSYMVFGDS 314
Query: 181 IIAGKSLNLPPNS-----------------------------FTIKLNGQRGCINDCGSV 211
I+ + P S F + G G I D G+
Sbjct: 315 AISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTS 374
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGAD 270
+T + Y L F S F+ TCF+L + P++ HF+GAD
Sbjct: 375 VTRLTRPAYVALRDAFRVGASNLKRAPEFSLFD---TCFDLSGKTEVKVPTVVLHFRGAD 431
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + N I ++ SF F F + G +I+G Q + VYDL
Sbjct: 432 VSLPASNYLIPVDNSGSFCFAFAGTMS---GLSIVGNIQQQGFRVVYDL 477
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 96/328 (29%), Positives = 144/328 (43%), Gaps = 33/328 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + ++DT + LTWTQC+PC CY+Q P+++ ++ +Y+ C +
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 68 C----KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL-LPPDEPSPVSVQNIRFGCSL 122
C K E C + +Y D T +L + TL + PVS FGC
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTG--GNLASETLTVDSTAGKPVSFPGFAFGCGH 209
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLEFGDQ 180
S I K +GI+GL S + QL + FS CL V D S SR+ FG
Sbjct: 210 SSG---GIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGAS 266
Query: 181 -IIAG-----KSLNLPPNSFTIKLNGQRG-CINDCGSVLTVIECEVYAVL---TAEFIDY 230
++G L LP ++ K + G I D G+ T + E Y+ L A I
Sbjct: 267 GRVSGYGTVSTPLRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKG 326
Query: 231 FSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFF 290
D +F+ C+N A N+ P +T HF+ A++ ++P N F+ +D F
Sbjct: 327 KRVRDPNGIFSL------CYNTTAEINA-PIITAHFKDANVELQPLNTFMRMQEDLVCF- 378
Query: 291 FGPAFTPRKGKTILGARHQHNTQFVYDL 318
P +LG Q N +DL
Sbjct: 379 ---TVAPTSDIGVLGNLAQVNFLVGFDL 403
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 95/345 (27%), Positives = 148/345 (42%), Gaps = 64/345 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P L+ + DT + + W QC+PCK CY Q P + +YK +PC
Sbjct: 87 YLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDL 146
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
CKS +G+ S+DT T L P+S GC ++
Sbjct: 147 CKS---GQQGNL----------------SVDTLT-LESSTGHPISFPKTVIGCGTDNT-- 184
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV-QP-DKSFHSRLEFGD-QIIAG 184
VS + +GI+GL S + QLG + +FS CL+ P + + S+L FGD +++G
Sbjct: 185 VSFEGA-SSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSG 243
Query: 185 KSLNLPP--------------NSFTI---------KLNG--QRGCINDCGSVLTVIECEV 219
+ P +F++ NG + I D G+ LTVI +V
Sbjct: 244 DGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDV 303
Query: 220 YAVLTA---EFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPE 276
Y L + E + +D +LF C+++ + FP +T HF+GAD+ + P
Sbjct: 304 YNNLESAVLELVKLKRVNDPTRLFNL------CYSVTSDGYDFPIITTHFKGADVKLHPI 357
Query: 277 NVFIFNHQDSFF---FFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F+ + D F AF P +I G Q N YDL
Sbjct: 358 STFV-DVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDL 401
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 139/326 (42%), Gaps = 42/326 (12%)
Query: 21 LWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS----PFHCFE 76
++ L+DT + +TW QC PC CY+Q D ++ +YK LPC C+ C
Sbjct: 1 MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60
Query: 77 GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIA 136
C Y ++YGD T+ +L+T TL D+ VSV N FGC +K + A
Sbjct: 61 SSCNYMVSYGDKSTTRGDFALETLTLR-SDDTILVSVPNFAFGCGHANKGLFNGA----A 115
Query: 137 GIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQIIAGKSLNLP----- 190
G+MGL S F Q FS CL + S L FG+ + +
Sbjct: 116 GLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDS 175
Query: 191 ---PNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL---FTCRK 244
P+ + + + G IN G L I V V + I F Q E+L FT
Sbjct: 176 SSGPSQYFVSMTG----IN-VGDELLPISATVM-VDSGTVISRFEQSAYERLRDAFTQIL 229
Query: 245 CGV----------TCFNLPARFN-SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFG 292
G+ TCF + + + P +T HF+ A+L + P ++ F F
Sbjct: 230 PGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFA 289
Query: 293 PAFTPRKGKTILGARHQHNTQFVYDL 318
P+ G+++LG Q N +FVYD+
Sbjct: 290 PS---SSGRSVLGNFQQQNLRFVYDI 312
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 149/357 (41%), Gaps = 49/357 (13%)
Query: 7 TYMLKLGIGDP--VKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
Y + +G+G ++ +D AG +W QC PC C Q +P+++ +++ + +
Sbjct: 100 VYAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGH 159
Query: 65 DAS-CKSPFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
+A C+ P+H + G C +GI Y + + DT + P + + + I FGC+
Sbjct: 160 NAVLCRPPYHPLQDGRCGFGIAYRNGASAAGYLARDTFS-FPTGDNNFQHLPGIVFGCAN 218
Query: 123 ESKDFVSIQKKIIAGIMGLNWDS-----TSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
F + +AG++G+ + T FM QL RFS C + P + +S L F
Sbjct: 219 RIARFDT--HGALAGVLGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAYSFLRF 276
Query: 178 GDQI-------------------------------IAGKSLNLP---PNSFTIKLNGQRG 203
G+ I I+ +L +P P F +G+ G
Sbjct: 277 GNDIPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGG 336
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMT 263
C D G+ +T I YA + A + Q + + + PA PSMT
Sbjct: 337 CAIDIGTKMTAIVQTAYAHVEAAVRGHL-QRNRARFVQSPGHHLCVHRTPAIEERLPSMT 395
Query: 264 YHFQGAD-LVVEPENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF G L V+P+++F + + P T++GA Q +T+F++DL
Sbjct: 396 LHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPDAEMTVIGAMQQIDTRFIFDL 452
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 84/345 (24%), Positives = 139/345 (40%), Gaps = 53/345 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + + ++D+ + + W QCQPCK CY+Q+DP+++ SY + C +
Sbjct: 132 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSV 191
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C G C Y + YGD TK +L+T T + V+N+ GC ++
Sbjct: 192 CDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF------AKTVVRNVAMGCGHRNR 245
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII-AG 184
++ + S SF+ QL F CLV L FG + + G
Sbjct: 246 GMFIGAAGLLG----IGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVG 301
Query: 185 KS---------------------------LNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
S + LP F + G G + D G+ +T +
Sbjct: 302 ASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPT 361
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-----PSMTYHFQGADLV 272
YA F D F R GV+ F+ + F P+++++F ++
Sbjct: 362 GAYAA----FRDGFKSQTAN---LPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVL 414
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P F+ DS + F A +P G +I+G Q Q +D
Sbjct: 415 TLPARNFLMPVDDSGTYCFAFAASP-TGLSIIGNIQQEGIQVSFD 458
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 152/358 (42%), Gaps = 57/358 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G+P + ++DT + LTW QC+PCK+C++Q+ P+++ S+K +PC A+
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146
Query: 68 CKSPFH--CFEGD-------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C H C + C Y YGD T +L++ ++ D PS + ++++
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-GRLVPDRFSCCLVQP--DKSFHSRL 175
GC +K ++ + SF QL + FS CLV + S S +
Sbjct: 207 GCGHSNKGLFQGAGGLLGLGQ----GALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAI 262
Query: 176 EFG---------DQI------------------------IAGKSLNLPPNSFTIKLNGQR 202
FG DQ+ I + L +P F I NG
Sbjct: 263 SFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSG 322
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPS 261
G I D G+ LT + + Y + + F+ S + G+ C+N R FP+
Sbjct: 323 GTIIDSGTTLTYLNRDAYRAVESAFLARISYPRADPFDI---LGI-CYNATGRAAVPFPA 378
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ FQ GA+L + EN FI D A P G +I+G Q N F+YD+
Sbjct: 379 LSIVFQNGAELDLPQENYFI--QPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDV 434
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 147/366 (40%), Gaps = 75/366 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + DT + L WTQC PC S C+ Q P+YN S ++ LPC +
Sbjct: 92 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151
Query: 67 -----------SCKSPFHCFEGDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPV 111
P C C Y +TYG V++ E + ++ P
Sbjct: 152 LSVCAAALAGTGTAPPPGC---ACTYNVTYGSGWTSVFQGSETFTFGST---PAGH---A 202
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC-------- 163
V I FGCS S F +G++GL S + QLG VP +FS C
Sbjct: 203 RVPGIAFGCSTASSGF---NASSASGLVGLGRGRLSLVSQLG--VP-KFSYCLTPYQDTN 256
Query: 164 -----LVQPDKSFHSR----------------------LEFGDQIIAGKSLNLPPNSFTI 196
L+ P S + L + +L++PP++F++
Sbjct: 257 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 316
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPAR 255
+G G I D G+ +T++ Y + A + + + G+ CF LP+
Sbjct: 317 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDG---SADTGLDLCFMLPSS 373
Query: 256 FN---SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
+ + PSMT HF GAD+V+ P + ++ + DS + ILG Q N
Sbjct: 374 TSAPPAMPSMTLHFNGADMVL-PADSYMMS-DDSGLWCLAMQNQTDGEVNILGNYQQQNM 431
Query: 313 QFVYDL 318
+YD+
Sbjct: 432 HILYDI 437
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 142/354 (40%), Gaps = 57/354 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++GIG P + +LDT + L WTQC PC C +Q P ++ + +Y+ L C +
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPA 151
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLES 124
C + ++ C++ C Y YGD T V + +T T D + V++ I FGC +L +
Sbjct: 152 CNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTND--TRVTLPRISFGCGNLNA 209
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
+ +G++G S S + QLG RFS CL SRL FG
Sbjct: 210 GSLAN-----GSGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVRSRLYFGAYATLN 261
Query: 182 ------------------------------IAGKSLNLPPNSFTIK-LNGQRGCINDCGS 210
+ G L + P I +G G I D G+
Sbjct: 262 STNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGT 321
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPA---RFNSFPSMTYH 265
+T + Y + F+ Y + L + V TCF P + + P + H
Sbjct: 322 TITYLAEPAYYAVREAFVLYL--NSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLH 379
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F GAD + +N + + A +I+G+ N +YDL+
Sbjct: 380 FDGADWELPLQNYMLVDPSTGGLCL---AMATSSDGSIIGSYQHQNFNVLYDLE 430
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 145/354 (40%), Gaps = 58/354 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN+ ++LG + + ++DT + L+W QCQPCK CY Q DP++N + SY+ +
Sbjct: 132 TLNYIVTVELG----GRKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVL 187
Query: 63 CYDASCKSPFHCFEGD----------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C +C+S G+ C Y + YGD T+ L T L D + +
Sbjct: 188 CSSPTCQS-LQSATGNLGVCGSNPPSCNYVVNYGDGSYTR--GELGTEHL---DLGNSTA 241
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V N FGC ++ +G++GL S S + Q + FS CL +
Sbjct: 242 VNNFIFGCGRNNQGLFGGA----SGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETEAS 297
Query: 173 SRLEFGDQIIAGKSLNLPPNSFTIKLN--------------------------GQRGCIN 206
L G K N P S+T + G+ G +
Sbjct: 298 GSLVMGGNSSVYK--NTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMI 355
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYH 265
D G+V+T + +Y L EF+ FS F TCFNL + P++ H
Sbjct: 356 DSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILD---TCFNLSGYQEVEIPNIKMH 412
Query: 266 FQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
F+G A+L V+ VF F D+ A + + I+G Q N + +YD
Sbjct: 413 FEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYD 466
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 147/345 (42%), Gaps = 48/345 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L +G P + + DT + L WTQC+PC+ CY+Q DP+++ +S K+Y+ C
Sbjct: 95 YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQ 154
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C C Y +YGD T + DT T L SPVS GC E+
Sbjct: 155 CSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTIT-LDSTTGSPVSFPKTVIGCGHEND 213
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFG-DQII 182
S + +GI+GL S + Q+G V +FS CLV S+L FG + ++
Sbjct: 214 GTFSDKG---SGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVV 270
Query: 183 AGKSLNLPP-------NSF-------------TIKLN------GQRGCINDCGSVLTVIE 216
+G + P +SF IK G+ I D G+ LT++
Sbjct: 271 SGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSGTTLTIVP 330
Query: 217 CEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVV 273
+ ++ L+ ++ D + C++ + P++T HF GAD+ +
Sbjct: 331 DDFFSNLSTAVGNQVEGRRAEDPSGFLS------VCYSATSDLK-VPAITAHFTGADVKL 383
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+P N F+ D F + G +I G Q N Y++
Sbjct: 384 KPINTFVQVSDDVVCLAFA---STTSGISIYGNVAQMNFLVEYNI 425
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 146/354 (41%), Gaps = 55/354 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++LG+G P +SL+ ++DT + L W QCQPCKSCY+Q DPI++ R+ S++++PC
Sbjct: 54 YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 113
Query: 68 CKS-PFHCFEGD------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
CK+ H G C Y + YGD + S D TL + ++ FGC
Sbjct: 114 CKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTL-----GTGSKAMSVAFGC 168
Query: 121 SLESKDFVSIQKKIIAGIMG-LNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLE 176
+++ + ++ G L++ S F + FS CLV P S L
Sbjct: 169 GFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLI 228
Query: 177 FGDQII----------------------------AGKSLNLPPNSFTIKLNGQRGCINDC 208
FG I G L + S + +G G I D
Sbjct: 229 FGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDS 288
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFN-SFPSMTYHF 266
G+ +T VYA + D F I R TC+N + + P++ HF
Sbjct: 289 GTSVTRFPTSVYATIR----DAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHF 344
Query: 267 Q-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ GADL + P N I N SF F P I+G Q + + +DL
Sbjct: 345 ENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELG---IIGNIQQQSFRIGFDL 395
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 147/366 (40%), Gaps = 75/366 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + DT + L WTQC PC S C+ Q P+YN S ++ LPC +
Sbjct: 32 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91
Query: 67 -----------SCKSPFHCFEGDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPV 111
P C C Y +TYG V++ E + ++ P
Sbjct: 92 LSVCAAALAGTGTAPPPGC---ACTYNVTYGSGWTSVFQGSETFTFGST---PAGH---A 142
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC-------- 163
V I FGCS S F +G++GL S + QLG VP +FS C
Sbjct: 143 RVPGIAFGCSTASSGF---NASSASGLVGLGRGRLSLVSQLG--VP-KFSYCLTPYQDTN 196
Query: 164 -----LVQPDKSFHSR----------------------LEFGDQIIAGKSLNLPPNSFTI 196
L+ P S + L + +L++PP++F++
Sbjct: 197 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 256
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPAR 255
+G G I D G+ +T++ Y + A + + + G+ CF LP+
Sbjct: 257 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDG---SADTGLDLCFMLPSS 313
Query: 256 FN---SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
+ + PSMT HF GAD+V+ P + ++ + DS + ILG Q N
Sbjct: 314 TSAPPAMPSMTLHFNGADMVL-PADSYMMS-DDSGLWCLAMQNQTDGEVNILGNYQQQNM 371
Query: 313 QFVYDL 318
+YD+
Sbjct: 372 HILYDI 377
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 147/369 (39%), Gaps = 81/369 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + DT + L WTQC PC S C+ Q P+YN S ++ LPC +
Sbjct: 90 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149
Query: 67 -----------SCKSPFHCFEGDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPV 111
P C C Y +TYG V++ E + ++ P +
Sbjct: 150 LSVCAAALAGTGTAPPPGC---ACTYNVTYGSGWTSVFQGSETFTFGST---PAGQ---S 200
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC-------- 163
V I FGCS S F +G++GL S + QLG VP +FS C
Sbjct: 201 RVPGIAFGCSTASSGF---NASSASGLVGLGRGRLSLVSQLG--VP-KFSYCLTPYQDTN 254
Query: 164 -----LVQPDKSFHSR----------------------LEFGDQIIAGKSLNLPPNSFTI 196
L+ P S + L + +L++PP++F +
Sbjct: 255 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLL 314
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNL 252
+G G I D G+ +T++ Y + A + + L T T CF L
Sbjct: 315 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVV------SLVTLPTTDGSAATGLDLCFML 368
Query: 253 PARFN---SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQ 309
P+ + + PSMT HF GAD+V+ P + ++ + DS + ILG Q
Sbjct: 369 PSSTSAPPAMPSMTLHFNGADMVL-PADSYMMS-DDSGLWCLAMQNQTDGEVNILGNYQQ 426
Query: 310 HNTQFVYDL 318
N +YD+
Sbjct: 427 QNMHILYDI 435
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 152/347 (43%), Gaps = 44/347 (12%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
++ L IG+P +++ +LDT + L W QC+PC CY+Q DPIYN SY ++ C +
Sbjct: 105 AFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEP 164
Query: 67 SCKS---PFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C S C + G C Y +Y D T + S + + Q + FGC L
Sbjct: 165 PCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQ-VGFGCGL 223
Query: 123 ESKDFVSIQK------------KIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS 170
++ +FV+ + +++ + + S SF G L LV D +
Sbjct: 224 QNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFGDAT 283
Query: 171 FHSRLEFGDQIIAG---------------KSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ + + +IA L++ +SF K +G G I D GS L++
Sbjct: 284 YLNG-DMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIF 342
Query: 216 ECEVYAVLTAEFIDYFSQ-HDIEKLFTCRKC--GVTCFNLPARFNSFPSMTYHFQGADLV 272
EVY V+ +D + ++I L + C G +LP FP++ + + ++
Sbjct: 343 PPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGKIGRDLPL----FPTLVLYLESTGIL 398
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ ++F+ + + F FT +G +I+G Q + +F Y+L+
Sbjct: 399 NDRWSIFLQRYDELFCL----GFTSGEGLSIIGTLAQQSYKFGYNLE 441
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 144/346 (41%), Gaps = 57/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K + +LDT + + W QC+PC CY+Q+DPI++ + SY L C
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQQ 216
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ C G C Y ++YGD T +T + SV + GC +++
Sbjct: 217 CQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSF------GAGSVNRVAIGCGHDNE 270
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-----GD 179
FV + G L+ S ++ FS CLV D S LEF GD
Sbjct: 271 GLFVGSAGLLGLGGGPLSLTS--------QIKATSFSYCLVDRDSGKSSTLEFNSPRPGD 322
Query: 180 QIIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
++A G+ + +PP +F + +G G I D G+ +T +
Sbjct: 323 SVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRT 382
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPA-RFNSFPSMTYHFQGADLVVE 274
+ Y + F + L + TC++L + + P++++HF G
Sbjct: 383 QAYNSVRDAF-----KRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWAL 437
Query: 275 PENVFIF--NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P ++ + ++ F F P + +I+G Q T+ +DL
Sbjct: 438 PAKNYLIPVDGAGTYCFAFAPTTS---SMSIIGNVQQQGTRVSFDL 480
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 140/358 (39%), Gaps = 59/358 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L L +G P + + LLDT + L WTQC C +C Q DP+++ R SY+ + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQL 157
Query: 68 CKSPFH--CFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C H C D C Y +YGD T + + T S Q++ G +
Sbjct: 158 CGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTF----ASSSGETQSVPLGFGCGT 213
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD----- 179
+ S+ +GI+G D S + QL RFS CL S S L+FG
Sbjct: 214 MNVGSLNNA--SGIVGFGRDPLSLVSQLS---IRRFSYCLTPYASSRKSTLQFGSLADVG 268
Query: 180 ------------------------------QIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
+ + L +P ++F ++ +G G I D G
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSG 328
Query: 210 SVLTVIECEVYAVLTAEFIDYF-------SQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
+ LT+ V A + F S D F + AR + P M
Sbjct: 329 TALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRM-ARQVAVPRM 387
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFF-FFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+HFQGADL + EN + +H+ G + G TI G Q + + VYDL+
Sbjct: 388 VFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGD--DGATI-GNFVQQDMRVVYDLE 442
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 152/358 (42%), Gaps = 57/358 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G+P + ++DT + LTW QC+PCK+C++Q+ P+++ S+K +PC A+
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230
Query: 68 CKSPFH--CFEGD-------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C H C + C Y YGD T +L++ ++ D PS + ++++
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-GRLVPDRFSCCLVQP--DKSFHSRL 175
GC +K ++ + SF QL + FS CLV + S S +
Sbjct: 291 GCGHSNKGLFQGAGGLLGLGQ----GALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAI 346
Query: 176 EFG---------DQI------------------------IAGKSLNLPPNSFTIKLNGQR 202
FG DQ+ I + L +P F I NG
Sbjct: 347 SFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSG 406
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPS 261
G I D G+ LT + + Y + + F+ S + G+ C+N R FP+
Sbjct: 407 GTIIDSGTTLTYLNRDAYRAVESAFLARISYPRADPFDI---LGI-CYNATGRTAVPFPT 462
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ FQ GA+L + EN FI D A P G +I+G Q N F+YD+
Sbjct: 463 LSIVFQNGAELDLPQENYFI--QPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDV 518
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 128/309 (41%), Gaps = 46/309 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ G G P K+ ++DT + +TW QC+PC CY Q DPI+ + SYK L C ++
Sbjct: 138 YIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSCLSSA 197
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + HC G C Y I YGD ++ S +T TL PS FGC +
Sbjct: 198 CTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDSFPS------FAFGCGHTN 251
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD---KSFHSRLEFGDQI 181
K AG++GL + SF Q +FS CL PD + G
Sbjct: 252 TGLF----KGSAGLLGLGRTALSFPSQTKSKYGGQFSYCL--PDFVSSTSTGSFSVGQGS 305
Query: 182 IAGKSLNLP-------PNSFTIKLN----------------GQRGCINDCGSVLTVIECE 218
I + +P P+ + + LN G+ G I D G+V+T + +
Sbjct: 306 IPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRLVPQ 365
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPE 276
Y L F K F+ TC++L + P++T+HFQ AD+ V
Sbjct: 366 AYDALKTSFRSKTRNLPSAKPFSILD---TCYDLSSYSQVRIPTITFHFQNNADVAVSAV 422
Query: 277 NVFIFNHQD 285
+ D
Sbjct: 423 GILFTIQSD 431
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 150/358 (41%), Gaps = 65/358 (18%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN Y++ +G+G +++ ++DT + LTW QC+PC+SCY QN P++ + SY+ +
Sbjct: 119 TLN--YIVTMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPIL 174
Query: 63 CYDASCKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C +C+S C Y + YGD T ++ +SV N
Sbjct: 175 CNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGF------GGISVSN 228
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS-R 174
FGC +K +G+MGL S + Q FS CL D++ S
Sbjct: 229 FVFGCGRNNKGLFGGA----SGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASGS 284
Query: 175 LEFGDQI--------------------------------IAGKSLNLPPNSFTIKLNGQR 202
L G+Q + G SL++ +SF G
Sbjct: 285 LVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF-----GNG 339
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPS 261
G I D G+V++ + VY L A+F++ FS F+ TCFNL + P+
Sbjct: 340 GVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILD---TCFNLTGYDQVNIPT 396
Query: 262 MTYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFVYD 317
++ +F+G A+L V+ +F +D+ A + I+G Q N + +YD
Sbjct: 397 ISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYD 454
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 150/354 (42%), Gaps = 61/354 (17%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++KL IG P ++ +LDT + L WTQC+PC C+ Q+ PI++ + S+ KL C
Sbjct: 94 NGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCS 153
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C++ P C Y +YGD T+ + + +T T SV N+ FGC +
Sbjct: 154 SQLCEALPQSSCNNGCEYLYSYGDYSSTQGILASETLTF------GKASVPNVAFGCGAD 207
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG----- 178
++ Q AG++GL S + QL +FS CL D + S L G
Sbjct: 208 NEGSGFSQG---AGLVGLGRGPLSLVSQLKE---PKFSYCLTTVDDTKTSTLLMGSLASV 261
Query: 179 --------------------------DQIIAGKS-LNLPPNSFTIKLNGQRGCINDCGSV 211
+ I G + L + ++F+++ +G G I D G+
Sbjct: 262 NASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTT 321
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLPARFNSF--PSMTYH 265
+T +E + ++ EF + G T CF LP+ + P + +H
Sbjct: 322 ITYLEESAFNLVAKEFTAKIN-------LPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFH 374
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F GADL + EN I DS A G +I G Q N ++DL+
Sbjct: 375 FDGADLELPAENYMI---GDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLE 425
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 140/358 (39%), Gaps = 59/358 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L L +G P + + LLDT + L WTQC C +C Q DP+++ R SY+ + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQL 157
Query: 68 CKSPFH--CFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C H C D C Y +YGD T + + T S Q++ G +
Sbjct: 158 CGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTF----ASSSGETQSVPLGFGCGT 213
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD----- 179
+ S+ +GI+G D S + QL RFS CL S S L+FG
Sbjct: 214 MNVGSLNNA--SGIVGFGRDPLSLVSQLS---IRRFSYCLTPYASSRKSTLQFGSLADVG 268
Query: 180 ------------------------------QIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
+ + L +P ++F ++ +G G I D G
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSG 328
Query: 210 SVLTVIECEVYAVLTAEFIDYF-------SQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
+ LT+ V A + F S D F + AR + P M
Sbjct: 329 TALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRM-ARQVAVPRM 387
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFF-FFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+HFQGADL + EN + +H+ G + G TI G Q + + VYDL+
Sbjct: 388 VFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGD--DGATI-GNFVQQDMRVVYDLE 442
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/346 (25%), Positives = 153/346 (44%), Gaps = 42/346 (12%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
++ L IG+P +++ +LDT + L W QC+PC CY+Q DPIYN SY ++ C +
Sbjct: 92 AFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEP 151
Query: 67 SCKS---PFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C S C + G C Y Y D T + S + + Q + FGC L
Sbjct: 152 PCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQ-VGFGCGL 210
Query: 123 ESKDFVSIQK------------KIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS 170
++ +F++ + +++ + + S SF G + LV D +
Sbjct: 211 QNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDAT 270
Query: 171 FHSR-------LEF------GDQIIAGK-SLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ + EF G + G+ L++ +SF K +G G I D GS L+V
Sbjct: 271 YLNGDMTPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTLSVFP 330
Query: 217 CEVYAVLTAEFIDYFSQ-HDIEKLFTCRKC--GVTCFNLPARFNSFPSMTYHFQGADLVV 273
EVY V+ +D + ++I L + C G +LP FP++ + + ++
Sbjct: 331 PEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGKIERDLPL----FPTLVLYLESTGILN 386
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ ++F+ + + F FT +G +I+G Q + +F Y+L+
Sbjct: 387 DRWSIFLQRYDELFCL----GFTSGEGLSIIGTLAQQSYKFGYNLE 428
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 138/345 (40%), Gaps = 53/345 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + + ++D+ + + W QCQPCK CY+Q+DP+++ SY + C +
Sbjct: 131 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSV 190
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C G C Y + YGD TK +L+T T + V+N+ GC ++
Sbjct: 191 CDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF------AKTVVRNVAMGCGHRNR 244
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII-AG 184
++ + S SF+ QL F CLV L FG + + G
Sbjct: 245 GMFIGAAGLLG----IGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVG 300
Query: 185 KS---------------------------LNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
S + LP F + G G + D G+ +T +
Sbjct: 301 ASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPT 360
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-----PSMTYHFQGADLV 272
Y F D F R GV+ F+ + F P+++++F ++
Sbjct: 361 AAYVA----FRDGFKSQTAN---LPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVL 413
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P F+ DS + F A +P G +I+G Q Q +D
Sbjct: 414 TLPARNFLMPVDDSGTYCFAFAASP-TGLSIIGNIQQEGIQVSFD 457
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 144/343 (41%), Gaps = 40/343 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P ++ DT + L W QC PC CY+Q +P+++ RS SY + C S
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTES 119
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C S + C Y +Y D T+ V + +T TL PV+ Q I FGC
Sbjct: 120 CNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLT-STTGEPVAFQGIIFGCGHN 178
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLV---PDRFSCCLV--QPDKSFHSRLEF- 177
+ F + G++GL S + Q+G + + FS CLV D S S++ F
Sbjct: 179 NSGFNDRE----MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFG 234
Query: 178 -GDQIIAGKSLNLPPNS------FTIKLNGQRGCIN---DCGSVLTVIECEVYAVLTAEF 227
G +++ +++ P S F L IN GS L I + +
Sbjct: 235 KGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTITKGNILIDSGTT 294
Query: 228 IDYFSQHDIEKLFTCRKCGVT-----------CFNLPARFNSFPSMTYHFQGADLVVEPE 276
I Y + +L + V C+ P N P++T HF+G D+++ P
Sbjct: 295 ITYLPEEFYHRLIEQVRNKVALEPFRIDGYELCYQTPTNLNG-PTLTIHFEGGDVLLTPA 353
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+FI D+F F F + G Q N +DL+
Sbjct: 354 QMFIPVQDDNFCF---AVFDTNEEYVTYGNYAQSNYLIGFDLE 393
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 139/347 (40%), Gaps = 53/347 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K ++ +LDT + + W QC PCK+CY Q DP++N S+ K+ C
Sbjct: 42 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 101
Query: 68 CK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ SP C Y ++YGD T +T T V+ + GC ++
Sbjct: 102 CRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTF------RRTKVEQVALGCGHDN 155
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQII 182
+ FV + SF Q GR +FS CLV S S + FG+ +
Sbjct: 156 EGLFVGAAGLLGL-----GRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAV 210
Query: 183 A-----------------------GKSLNLPPNS------FTIKLNGQRGCINDCGSVLT 213
+ G S+ P S F + G G I DCG+ +T
Sbjct: 211 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVT 270
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV 272
+ Y L F S F+ TC++L + P++ HF+GAD+
Sbjct: 271 RLNKPAYIALRDAFRAGASSLKSAPEFSLFD---TCYDLSGKTTVKVPTVVLHFRGADVS 327
Query: 273 VEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I + F F F + G +I+G Q + VYDL
Sbjct: 328 LPASNYLIPVDGSGRFCFAFAGTTS---GLSIIGNIQQQGFRVVYDL 371
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/327 (28%), Positives = 140/327 (42%), Gaps = 50/327 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ IG P L+ ++DT + W QC+PCK C Q PI+N +YK + C
Sbjct: 90 YVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSSPI 149
Query: 68 CK--SPFHCFEG---DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
CK C C Y ITY D ++ S DT TL D SP+S I GC
Sbjct: 150 CKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDG-SPISFPKIVIGCGH 208
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFGDQ 180
++ + + +GI+G + S + QLG + +FS CL + S+L FGD
Sbjct: 209 KNS---LTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYFGDM 265
Query: 181 -IIAGKSLNLPP--------NSFT-----------IKL-------NGQRGCINDCGSVLT 213
+++G + P N FT IKL + + + D GS +T
Sbjct: 266 AVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVIDSGSTIT 325
Query: 214 VIECEVYAVLTAEFIDYFSQHDI----EKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGA 269
+ +VY+ L I + ++L C K + + +P +T HF+GA
Sbjct: 326 QLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPI-------ITAHFRGA 378
Query: 270 DLVVEPENVFI-FNHQDSFFFFFGPAF 295
D+ + N FI NH+ F F AF
Sbjct: 379 DVKLNAFNTFIQMNHEVMCFAFNSSAF 405
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 144/354 (40%), Gaps = 61/354 (17%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ L IG P ++ ++DT + L WTQC+PC C++Q PI++ + S+ KL C
Sbjct: 97 NGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCS 156
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
CK+ P C Y TYGD T+ + +T T VS+ N+ FGC +
Sbjct: 157 SQLCKALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTF------GKVSIPNVGFGCGED 210
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI-- 181
++ Q +G++GL S + QL +FS CL D + S L G
Sbjct: 211 NEGDGFTQG---SGLVGLGRGPLSLVSQLKEA---KFSYCLTSIDDTKTSTLLMGSLASV 264
Query: 182 ------------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ G L + ++F ++ +G G I D G+
Sbjct: 265 NGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTT 324
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLPARFNSF--PSMTYH 265
+T +E + ++ EF G T C+NLP+ + P + H
Sbjct: 325 ITYLEESAFDLVKKEFTSQMG-------LPVDNSGATGLELCYNLPSDTSELEVPKLVLH 377
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F GADL + EN I DS A G +I G Q N +DL+
Sbjct: 378 FTGADLELPGENYMI---ADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLE 428
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 147/352 (41%), Gaps = 62/352 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC---- 63
Y ++GIG P + L+ +LDT + +TW QC PC CY Q+DP+++ SY +PC
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSPH 255
Query: 64 ---YDAS-CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
DAS C + C Y + YGD T V T TL + S +V ++ G
Sbjct: 256 CRALDASACHNNAANGNSSCVYEVAYGDGSYT--VGDFATETLTLGGDGS-AAVHDVAIG 312
Query: 120 CSLESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
C +++ FV + G L++ S ++ FS CLV D S L+FG
Sbjct: 313 CGHDNEGLFVGAAGLLALGGGPLSFPS--------QISATEFSYCLVDRDSPSASTLQFG 364
Query: 179 DQ--------------------------IIAGKSL-NLPPNSFTIKLNGQRGCINDCGSV 211
+ G++L ++PP +F + G G I D G+
Sbjct: 365 ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDSGTA 424
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHF 266
+T ++ Y+ L F+ + L R GV TC++L R + P+++ F
Sbjct: 425 VTRLQSSAYSALRDAFV-----RGTQAL--PRASGVSLFDTCYDLAGRSSVQVPAVSLRF 477
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
+G + P ++ + + AF G +I+G Q + +D
Sbjct: 478 EGGGELKLPAKNYLIPVDGAGTYCL--AFAATGGAVSIVGNVQQQGIRVSFD 527
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 143/347 (41%), Gaps = 57/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P +S + ++D+ + + W QCQPC CY+Q+DP+++ +Y + C +
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSV 196
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS-LES 124
C C +G C Y ++YGD T+ +L+T T V ++NI GC +
Sbjct: 197 CDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF------GRVLIRNIAIGCGHMNR 250
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII-A 183
F+ + G + SF+ QLG FS CLV LEFG +
Sbjct: 251 GMFIGAAGLLGLGGGAM-----SFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPV 305
Query: 184 GKS---------------------------LNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
G + + +P F + G G + D G+ +T +
Sbjct: 306 GAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLP 365
Query: 217 CEVYAVLTAEFIDY---FSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV 272
Y FI + D +F TC+NL + P+++++F G ++
Sbjct: 366 APAYEAFRDTFIGQTANLPRSDRVSIFD------TCYNLNGFVSVRVPTVSFYFSGGPIL 419
Query: 273 VEPENVFIF--NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P F+ + + +F F F + + G +I+G Q Q D
Sbjct: 420 TLPARNFLIPVDGEGTFCFAFAASAS---GLSIIGNIQQEGIQISID 463
>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 150/363 (41%), Gaps = 61/363 (16%)
Query: 8 YMLKLGIGDPVKSLWF--LLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y +++GIG ++ LD V LTW QC+PC Q+ ++N+ + Y + D
Sbjct: 79 YSVRVGIGSGGTQHFYKLALDLVRPLTWMQCKPCVPEKRQDGSVFNTAASPHYHHIASTD 138
Query: 66 ASCKSPF-HCFEGDCFYGIT--YGDVYETKEVDSLDTSTLLPPDEPSPV-SVQNIRFGCS 121
C +P+ +G C + + YGD + V D SP+ SV + FGC+
Sbjct: 139 PRCMAPYTRAGQGRCTFDVKFQYGDS-RARGVLGSDDFVFDGSGPGSPISSVNGLVFGCA 197
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQ------------- 166
+ DF + + AG+M LN TSF+ QL L RFS CL
Sbjct: 198 HNTHDFYN--HDLWAGVMSLNRHPTSFIRQLSARGLAAPRFSYCLASRQHRDRRGFLRFG 255
Query: 167 ---PDKSFHSR---LEFGDQIIAG---------------KSLNLPPNSFTIKLNGQR-GC 204
PD+S H+R L GD G + + P F + R GC
Sbjct: 256 ADIPDQS-HARSTPLLHGDLAQGGGMYYVGVVGVSLGGRRLTAITPVMFELNRRSLRGGC 314
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE-KLFTCRKCGVTCFNLPARFNSFPSMT 263
I D G+ LT++ Y VL AE I + ++ +F+ + + PS+T
Sbjct: 315 IIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQHAIFSPGQKHCFRGKWESIHRHLPSVT 374
Query: 264 YHFQGADLVVEPENVFIFNHQDSFFF--------FFGPAFTPRKGKTILGARHQHNTQFV 315
HFQ PE+V +F + F + A P +TI+GA +T+F
Sbjct: 375 LHFQ-----FHPESVALFIRPELLFVAMTGERTDYVCLAIVPYAERTIIGAGQMLDTRFT 429
Query: 316 YDL 318
+DL
Sbjct: 430 FDL 432
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 141/347 (40%), Gaps = 56/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + +LDT + +TW QC+PC CY+Q+DPIYN SYK + C
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANL 204
Query: 68 CKS---PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ G C Y ++YGD T+ + T TL P +QN+ GC ++
Sbjct: 205 CQQLDVSGCSRNGSCLYQVSYGDGSYTQ--GNFATETLTLGGAP----LQNVAIGCGHDN 258
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
+ ++ G + + G++ FS CLV D S L+FG
Sbjct: 259 EGLFVGAAGLLGLGGGSLSFPSQLTDENGKI----FSYCLVDRDSESSSTLQFGRAAVPN 314
Query: 182 -------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ GK L++ + F I +G G I D G+ +T ++
Sbjct: 315 GAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRLQ 374
Query: 217 CEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV 272
Y L F D LF TC++L ++ + P++ +HF G +
Sbjct: 375 TAAYDSLRDAFRAGTKNLPSTDGVSLFD------TCYDLSSKESVDVPTVVFHFSGGGSM 428
Query: 273 VEPENVFI--FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P ++ + +F F F P + +I+G Q + +D
Sbjct: 429 SLPAKNYLVPVDSMGTFCFAFAPTSSSL---SIVGNIQQQGIRVSFD 472
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 145/356 (40%), Gaps = 60/356 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + +G P + ++D+ + L W QC PC+ CY Q+ P+Y + ++ +PC +
Sbjct: 64 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSSD 123
Query: 68 C-----KSPFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C F C + G C Y Y D +K V + +++T+ V + + FG
Sbjct: 124 CLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV------DGVRIDKVAFG 177
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEF 177
C +++ + G++GL SF Q+G ++F+ CLV S S L F
Sbjct: 178 CGSDNQGSFAAA----GGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSSLIF 233
Query: 178 GDQIIA------------------------------GKSLNLPPNSFTIKLNGQRGCIND 207
GD++I+ GKSL + +++ I L G G I D
Sbjct: 234 GDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFD 293
Query: 208 CGSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTY 264
G+ LT Y+ + A F + Y ++ L C + +T + P SFPS T
Sbjct: 294 SGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGLDLCVE--LTGVDQP----SFPSFTI 347
Query: 265 HF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F GA E EN F+ + +P G +G Q N YD +
Sbjct: 348 EFDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDRE 403
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 146/359 (40%), Gaps = 68/359 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ +G P + ++D+ + + W QC+PC CY Q DP+++ + ++ + C A
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAI 230
Query: 68 CK--SPFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+ C +G+ C Y ++Y D TK +L+T TL +V+ + GC
Sbjct: 231 CRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTL------GGTAVEGVVIGCGH 284
Query: 123 ESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--------------- 166
++ FV AG+MGL W S + QLG V FS CL
Sbjct: 285 RNRGLFVG-----AAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGW 339
Query: 167 ---------PDK-------------SFH----SRLEFGDQIIAGKSLNLPPNSFTIKLNG 200
P+ SF+ S +E GD+ L L F + +G
Sbjct: 340 LVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDE-----RLPLQAGLFQLTEDG 394
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SF 259
+ D G+ +T + E YA L F+ + TC++L +
Sbjct: 395 AGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASVRV 454
Query: 260 PSMTYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P++++ F G A L++ NV + + F P+ + G +I+G Q Q D
Sbjct: 455 PTVSFCFDGDARLILAARNVLLEVDMGIYCLAFAPSSS---GLSIMGNTQQAGIQITVD 510
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 142/349 (40%), Gaps = 57/349 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P K ++ +LDT + + W QC PCK+CY Q DP++N S+ K+ C
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 188
Query: 68 CK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ SP C Y ++YGD T +T T V+ + GC ++
Sbjct: 189 CRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTF------RRTKVEQVALGCGHDN 242
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGDQ 180
+ FV + SF Q GR +FS CLV D+S S+ + FG+
Sbjct: 243 EGLFVGAAGLLGL-----GRGGLSFPSQAGRTFNQKFSYCLV--DRSASSKPSSVVFGNS 295
Query: 181 IIA-----------------------GKSLNLPPNS------FTIKLNGQRGCINDCGSV 211
++ G S+ P S F + G G I DCG+
Sbjct: 296 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTS 355
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGAD 270
+T + Y L F S F+ TC++L + P++ HF+GAD
Sbjct: 356 VTRLNKPAYIALRDAFRAGASSLKSAPEFSLFD---TCYDLSGKTTVKVPTVVLHFRGAD 412
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + N I + F F F + G +I+G Q + VYDL
Sbjct: 413 VSLPASNYLIPVDGSGRFCFAFAGTTS---GLSIIGNIQQQGFRVVYDL 458
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/344 (26%), Positives = 146/344 (42%), Gaps = 47/344 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P +++ ++DT + + W QC+PC+ CY+Q PI+N SYK +PC
Sbjct: 87 YLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNL 146
Query: 68 CKSPFHCF---EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+S + + C Y I + D ++ S++T TL S VS GC +
Sbjct: 147 CQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHS-VSFPKTVIGCGHNN 205
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFGDQ-I 181
+ + +GI+GL S QL + +FS CL+ D + S+L FGD +
Sbjct: 206 RGMFQGET---SGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGDAAV 262
Query: 182 IAGKSLNLPP-----------------------NSFTIKLNGQRG-CINDCGSVLTVIEC 217
++G + P F + + + G I D G+ LT++
Sbjct: 263 VSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTTLTLLPS 322
Query: 218 EVYAVL---TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
VY L A+ + D +L C+++ + FP +T HF+GAD+ +
Sbjct: 323 HVYTNLESAVAQLVKLDRVDDPNQLLNL------CYSITSDQYDFPIITAHFKGADIKLN 376
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P + F H AFT + I G Q N YDL
Sbjct: 377 P--ISTFAHVADGVVCL--AFTSSQTGPIFGNLAQLNLLVGYDL 416
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 148/355 (41%), Gaps = 71/355 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +G+G P S L+DT + L+W QC PC S CY Q DP+++ +Y +PC
Sbjct: 120 YVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNT 179
Query: 66 ASCKS------PFHCFEGD-----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
+C+ C G C Y ITYGD +T V S +T T+ P V+V+
Sbjct: 180 DACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAP-----GVTVK 234
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL---------- 164
+ FGC + G++GL S +VQ + FS CL
Sbjct: 235 DFHFGCGHDQDG----PNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQAGFL 290
Query: 165 -----------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
V+ ++F+ + + G+ +++PP++F+ G I D
Sbjct: 291 ALGAPVNDASGFVFTPMVREQQTFYV-VNMTGITVGGEPIDVPPSAFS------GGMIID 343
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF 266
G+V+T ++ YA L A F + + + TC+N N + P + F
Sbjct: 344 SGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGELD----TCYNFTGHSNVTVPRVALTF 399
Query: 267 QGA---DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
G DL V P+ + + N F GP P ILG +Q + +YD+
Sbjct: 400 SGGATVDLDV-PDGILLDNCL--AFQEAGPDNQP----GILGNVNQRTLEVLYDV 447
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 139/353 (39%), Gaps = 55/353 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +G+G P ++DT + + W QC+PC CY Q P+Y+ R +Y + PC
Sbjct: 99 YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQ 158
Query: 68 CKSPFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C++P C G C Y I YGD T +L T L+ ++ SV N+ GC +++
Sbjct: 159 CRNPQTCDGTTGGCGYRIVYGDASSTS--GNLATDRLVFSND---TSVGNVTLGCGHDNE 213
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGK 185
AG++G+ + SF Q+ F+ CL +S S ++ G+
Sbjct: 214 GLFGSA----AGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSS----SYLVFGR 265
Query: 186 SLNLPPNSFTIKL-------------------------------------NGQRGCINDC 208
+ PP+S L G+ G + D
Sbjct: 266 TAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDS 325
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQ 267
G+ +T + Y L F ++ + K+ C++L P + HF
Sbjct: 326 GTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVVLHFA 385
Query: 268 -GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
GAD+ + PEN ++ + + F G +++G Q + V+D++
Sbjct: 386 GGADVALPPEN-YLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVE 437
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/311 (27%), Positives = 131/311 (42%), Gaps = 47/311 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLP 62
N Y++++ IG P + DT + LTW QC PC + C+ QN P+Y+ + ++ LP
Sbjct: 93 NGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLP 152
Query: 63 CYDASCK----SPFHCFE-GDCFYGITYGD---VYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C C S + C + GDC Y TYGD Y DS+ L
Sbjct: 153 CDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYN------S 206
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR 174
I FGC ++K F + + GI+GL S + QLG + +FS CL+ + +S+
Sbjct: 207 KICFGCGFQNK-FTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSK 265
Query: 175 LEFGD-QIIAGKSLNLPP-------------------NSFTIKLNGQRG-CINDCGSVLT 213
L+FG+ I+ G + P + T+K G I D GS LT
Sbjct: 266 LKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLT 325
Query: 214 VIECEVY---AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGAD 270
+E Y L E + I F CF ++ P + +HF G D
Sbjct: 326 YLEESFYNEFVSLVKETVAVEEDQYIPYPFDF------CFTYKEGMSTPPDVVFHFTGGD 379
Query: 271 LVVEPENVFIF 281
+V++P N +
Sbjct: 380 VVLKPMNTLVL 390
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 91/330 (27%), Positives = 141/330 (42%), Gaps = 39/330 (11%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
TY++ + IG P L +LDT + L WTQC PC+ C+ Q P+Y +Y + C
Sbjct: 91 TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150
Query: 66 ASC---KSPF-HCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C +SP+ C D C Y +YGD T V + +T TL S +V+ + FG
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL-----GSDTAVRGVAFG 205
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK-----SFHSR 174
C E + S +G++G+ S + QLG P R SC + + S
Sbjct: 206 CGTE--NLGSTDNS--SGLVGMGRGPLSLVSQLGVTRPRR-SCRARAAARGGGAPTTTSP 260
Query: 175 LE---FGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYF 231
LE GD + L + P F + G G I D G+ T +E + L
Sbjct: 261 LEGITVGDTL-----LPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRV 315
Query: 232 SQHDIEKLFTCRKCGVT-CFNLPA-RFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFF 289
L + G++ CF + P + HF GAD+ + E+ ++ + +
Sbjct: 316 RL----PLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRES-YVVEDRSAGVA 370
Query: 290 FFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G +G ++LG+ Q NT +YDL+
Sbjct: 371 CLG--MVSARGMSVLGSMQQQNTHILYDLE 398
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 143/365 (39%), Gaps = 65/365 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--------KSCYEQNDPIYNSRSFKSYK 59
Y++ L IG P S + DT + L WTQC PC C++Q+ +YN S ++
Sbjct: 87 YIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFG 146
Query: 60 KLPCYD-----ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
LPC A+ P C Y TYG + T V S++T T P V V
Sbjct: 147 VLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTGW-TAGVQSVETFTFGSSSTPPAVRVP 205
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHS 173
NI FGCS S + AG++GL S S + QLG FS CL D + S
Sbjct: 206 NIAFGCSNASSN----DWNGSAGLVGLGRGSMSLVSQLGA---GAFSYCLTPFQDANSTS 258
Query: 174 RLEFGDQI---------------IAGKS---------------------LNLPPNSFTIK 197
L G +AG S L +PP++F+++
Sbjct: 259 TLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFSLR 318
Query: 198 LNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARF 256
+G G I D G+ +T + Y + A G+ CF L A
Sbjct: 319 ADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKAST 378
Query: 257 --NSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
+ PSMT HF+ GAD+V+ EN I S + +++G Q N
Sbjct: 379 PPPAMPSMTLHFEGGADMVLPVENYMILG---SGVWCLAMRNQTVGAMSMVGNYQQQNIH 435
Query: 314 FVYDL 318
+YD+
Sbjct: 436 VLYDV 440
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 152/354 (42%), Gaps = 61/354 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++GIG P K + ++DT + + W QC PCKSCY+QND +++ R+ S+++L C
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 68 CK--SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
CK C D C Y ++YGD T D S L+ SPV FGC +
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVG-DLASDSFLVSRGRTSPVV-----FGCGHD 127
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH--SRLEFGDQ 180
++ FV + G L++ S +L +FS CLV D S L FGD
Sbjct: 128 NEGLFVGAAGLLGLGAGKLSFPS--------QLSSRKFSYCLVSRDNGVRASSALLFGDS 179
Query: 181 I------------------------------IAGKSLNLPPNSFTIKLN-GQRGCINDCG 209
I G L++P +F + + G+ G I D G
Sbjct: 180 ALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSG 239
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF 266
+ +T + Y V+ F + +KL + TC++ A + + P++++HF
Sbjct: 240 TSVTRLPTYAYTVMRDAF-----RSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHF 294
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+G V P + ++ S F F + T +I+G Q + DLD+
Sbjct: 295 EGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLD-LSIIGNIQQQTMRVAIDLDS 347
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 154/349 (44%), Gaps = 56/349 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + + DT + L WTQC PC CY Q DP+++ ++ +YK + C +
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149
Query: 68 C---KSPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C ++ C D C Y ++YGD TK ++DT TL D P+ ++NI GC
Sbjct: 150 CTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDT-RPMQLKNIIIGC-- 206
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFG-D 179
+ K +GI+GL S + QLG + +FS CLV K S++ FG +
Sbjct: 207 -GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTN 265
Query: 180 QIIAGKSLNLPP-------NSF------TIKLNGQR-------------GCINDCGSVLT 213
I++G + P +F +I + ++ I D G+ LT
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLT 325
Query: 214 VIECEVYAVL---TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGAD 270
++ E Y+ L A ID + D + + C++ P +T HF GAD
Sbjct: 326 LLPTEFYSELEDAVASSIDAEKKQDPQSGLSL------CYSATGDLK-VPVITMHFDGAD 378
Query: 271 LVVEPENVFIFNHQDSFFFFF--GPAFTPRKGKTILGARHQHNTQFVYD 317
+ ++ N F+ +D F F P+F +I G Q N YD
Sbjct: 379 VKLDSSNAFVQVSEDLVCFAFRGSPSF------SIYGNVAQMNFLVGYD 421
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 154/349 (44%), Gaps = 56/349 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + + DT + L WTQC PC CY Q DP+++ ++ +YK + C +
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149
Query: 68 C---KSPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C ++ C D C Y ++YGD TK ++DT TL D P+ ++NI GC
Sbjct: 150 CTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDT-RPMQLKNIIIGC-- 206
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFG-D 179
+ K +GI+GL S + QLG + +FS CLV K S++ FG +
Sbjct: 207 -GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTN 265
Query: 180 QIIAGKSLNLPP-------NSF------TIKLNGQR-------------GCINDCGSVLT 213
I++G + P +F +I + ++ I D G+ LT
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLT 325
Query: 214 VIECEVYAVL---TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGAD 270
++ E Y+ L A ID + D + + C++ P +T HF GAD
Sbjct: 326 LLPTEFYSELEDAVASSIDAEKKQDPQSGLSL------CYSATGDLK-VPVITMHFDGAD 378
Query: 271 LVVEPENVFIFNHQDSFFFFF--GPAFTPRKGKTILGARHQHNTQFVYD 317
+ ++ N F+ +D F F P+F +I G Q N YD
Sbjct: 379 VKLDSSNAFVQVSEDLVCFAFRGSPSF------SIYGNVAQMNFLVGYD 421
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 83/343 (24%), Positives = 134/343 (39%), Gaps = 44/343 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P + L + DT + L+W QC+PC CY+Q+DP+++ +Y +PC
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQE 197
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV-SVQNIRFGCSLES 124
C+ C G C Y + YGD+ +T + DT TL P S +Q FGC +
Sbjct: 198 CRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDD 257
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL---------------VQPDK 169
G+ GL D S Q FS CL P+
Sbjct: 258 TGLFGKAD----GLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYLSLGSAAPPNA 313
Query: 170 SFHSRLEFGDQ-----------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
F + + D +AG+++ + P F G + D G+V+T +
Sbjct: 314 RFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTP-----GTVIDSGTVITRLPSR 368
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVE--P 275
YA L + F ++ ++ TC++ R PS+ F G +
Sbjct: 369 AYAALRSSFAGLMRRYSYKRAPALSILD-TCYDFTGRNKVQIPSVALLFDGGATLNLGFG 427
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
E +++ N + F + ILG Q VYD+
Sbjct: 428 EVLYVANKSQACLAF--ASNGDDTSIAILGNMQQKTFAVVYDV 468
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 150/360 (41%), Gaps = 78/360 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +G+G P S L+DT + L+W QCQPC S CY Q DP+++ +Y +PC
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNT 183
Query: 66 ASCKS------PFHCFEGD----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
+C+ C GD C + ITYGD +T+ V S +T L P V+V++
Sbjct: 184 DACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAP-----GVAVKD 238
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL----------- 164
RFGC + G++GL S +VQ + FS CL
Sbjct: 239 FRFGCGHDQDG----ANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLA 294
Query: 165 ----------------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
++ +++F+ + + G+ +++PP++F+
Sbjct: 295 LGGGGAPSGGVVNTSGFVFTPMIREEETFYV-VNMTGITVGGEPIDVPPSAFS------G 347
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPS 261
G I D G+V+T ++ Y L A F + + + + TC++ N + P
Sbjct: 348 GMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGELD----TCYDFSGYSNVTLPK 403
Query: 262 MTYHFQGA---DLVVEPENVFIFNHQDSFFF-FFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ F G DL V P + + D F GP P ILG +Q + +YD
Sbjct: 404 VALTFSGGATIDLDV-PNGILL---DDCLAFQESGPDDQP----GILGNVNQRTLEVLYD 455
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 143/353 (40%), Gaps = 60/353 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P + ++ +LDT + + W QC PC CY Q DP+++ +S+ +PC
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPL 204
Query: 68 CKSPFH----CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ + + C Y ++YGD T S +T T V + GC +
Sbjct: 205 CRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTF------RGTRVGRVVLGCGHD 258
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGD 179
++ FV + SF Q+GR +FS CL D+S SR + FGD
Sbjct: 259 NEGLFVGAAGLLGL-----GRGRLSFPSQIGRRFNSKFSYCL--GDRSASSRPSSIVFGD 311
Query: 180 QIIAGKSLNLP-----------------------------PNSFTIKLNGQRGCINDCGS 210
I+ + P + F + G G I D G+
Sbjct: 312 SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGT 371
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA 269
+T + Y L F+ S F+ TCF+L + P++ HF+GA
Sbjct: 372 SVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFD---TCFDLSGKTEVKVPTVVLHFRGA 428
Query: 270 DLVVEPENVFI-FNHQDSF-FFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
D+ + N I ++ SF F F G A G +I+G Q + VYDL T
Sbjct: 429 DVPLPASNYLIPVDNSGSFCFAFAGTA----SGLSIIGNIQQQGFRVVYDLAT 477
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 79/347 (22%), Positives = 138/347 (39%), Gaps = 57/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P ++ + ++D+ + + W QC+PC CY+Q+DP+++ S+ + C
Sbjct: 143 YFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDV 202
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C G C Y ++YGD TK +L+T T+ V ++++ GC ++
Sbjct: 203 CDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTV------GQVMIRDVAIGCGHTNQ 256
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI---- 181
++ L S SF+ QLG FS CLV LEFG
Sbjct: 257 GMFIGAAGLLG----LGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGRGALPVG 312
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G +++P +F + G G + D G+ +T
Sbjct: 313 ATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPT 372
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-----PSMTYHFQGADLV 272
Y F S R GV+ F+ N F P+++++F ++
Sbjct: 373 AAYVAFRDSFTAQTSN-------LPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVL 425
Query: 273 VEPENVFIF--NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P F+ + +F F P+ G +I+G Q Q +D
Sbjct: 426 TLPARNFLIPVDGGGTFCLAFAPS---PSGLSIIGNIQQEGIQISFD 469
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 148/350 (42%), Gaps = 53/350 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++KL IG P ++ ++DT + L WTQC+PC C++Q PI++ + S+ KL C
Sbjct: 94 NGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCS 153
Query: 65 DASCKS-PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C++ P C Y YGD T+ + + +T T VSV + FGC +
Sbjct: 154 SKLCEALPQSTCSDGCEYLYGYGDYSSTQGMLASETLTF------GKVSVPEVAFGCGED 207
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI-- 181
++ Q +G++GL S + QL +FS CL D + S L G
Sbjct: 208 NEGSGFSQG---SGLVGLGRGPLSLVSQLKE---PKFSYCLTSVDDTKASTLLMGSLASV 261
Query: 182 ------------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ SL + ++F+++ +G G I D G+
Sbjct: 262 KASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTT 321
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF--PSMTYHFQGA 269
+T +E + ++ EF SQ ++ + CF LP+ P + +HF GA
Sbjct: 322 ITYLEQSAFDLVAKEFT---SQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGA 378
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
DL + EN I D+ A G +I G Q N ++DL+
Sbjct: 379 DLELPAENYMI---ADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLE 425
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 140/365 (38%), Gaps = 64/365 (17%)
Query: 3 TLNHTYMLKLG--IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
TLN+ + LG G P +L ++DT + LTW QC+PC +CY Q DP+++ +Y
Sbjct: 141 TLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAA 200
Query: 61 LPCYDASCKSPFHCFEG-------------DCFYGITYGDVYETKEVDSLDTSTLLPPDE 107
+ C ++C G C+Y + YGD ++ V + DT L
Sbjct: 201 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL----- 255
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--- 164
S+ FGC L ++ AG+MGL S + Q FS CL
Sbjct: 256 -GGASLGGFVFGCGLSNRGLFG----GTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAA 310
Query: 165 VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLN------------------------- 199
D S L GD A N P ++T +
Sbjct: 311 TSGDASGSLSLGGGDD-AASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ 369
Query: 200 --GQRGCINDCGSVLTVIECEVYAVLTAEFIDYF--SQHDIEKLFTCRKCGVTCFNLPAR 255
G + D G+V+T + VY + AEF+ F + + F+ TC++L
Sbjct: 370 GLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILD---TCYDLTGH 426
Query: 256 FN-SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNT 312
P +T + GAD+ V+ + +D A + +T I+G Q N
Sbjct: 427 DEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNK 486
Query: 313 QFVYD 317
+ VYD
Sbjct: 487 RVVYD 491
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/315 (26%), Positives = 133/315 (42%), Gaps = 57/315 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KL +G P + + DT + + WTQC+PC +CY+Q+ P++N +Y+K+ S
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKV-----S 139
Query: 68 CKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C SP F G DC Y I+YGD ++ ++DT T + V+
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT-MGSTSGRVVAFPRTAI 198
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLE 176
GC ++ ++GI+GL S + Q+G V +FS CL + D ++L
Sbjct: 199 GCGHDNAGSFDAN---VSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 177 FGDQI-IAGKSLNLPP-----------------------NSFTIKLN----GQRGCINDC 208
FG ++G P N+F N G+ I D
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDS 315
Query: 209 GSVLTVIECEVY---AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
G+ LT++ ++Y A + I+ D + CF P + H
Sbjct: 316 GTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLE------YCFETTTDDYKVPFIAMH 369
Query: 266 FQGADLVVEPENVFI 280
F+GA+L ++ ENV I
Sbjct: 370 FEGANLRLQRENVLI 384
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 59/348 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P ++ + ++D+ + + W QC+PC CY Q+DP++N S+ + C
Sbjct: 136 YFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTV 195
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C C EG C Y ++YGD TK +L+T T ++N+ GC ++
Sbjct: 196 CSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITF------GRTLIRNVAIGCGHHNQ 249
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
FV + G + SF+ QLG FS CLV LEFG + +
Sbjct: 250 GMFVGAAGLLGLGGGPM-----SFVGQLGGQTGGAFSYCLVSRGIESSGLLEFGREAMPV 304
Query: 185 KSLNLP----------------------------PNSFTIKLNGQRGCINDCGSVLTVIE 216
+ +P + F + G G + D G+ +T +
Sbjct: 305 GAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVTRLP 364
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADL 271
Y FI + R GV TC++L + P+++++F G +
Sbjct: 365 TVAYEAFRDGFI-------AQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPI 417
Query: 272 VVEPENVFIFNHQD--SFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ P F+ D +F F F P+ G +I+G Q Q D
Sbjct: 418 LTLPARNFLIPVDDVGTFCFAFAPS---SSGLSIIGNIQQEGIQISVD 462
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 148/353 (41%), Gaps = 62/353 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++G+G P + ++D+ + + W QC+PC CY+Q DP+++ + S+ +PC
Sbjct: 133 YFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSGV 192
Query: 68 CKS----PFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C++ C + G C Y ++YGD T+ V +++T T + +P VQ + GC
Sbjct: 193 CRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTF---GDSTP--VQGVAIGCGH 247
Query: 123 ESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV-QPDKSFHSRLEFG-- 178
++ FV AG++GL W S + QLG FS CL + + L FG
Sbjct: 248 RNRGLFVG-----AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLVFGRD 302
Query: 179 DQIIAG---------------------------KSLNLPPNSFTIKLNGQRGCINDCGSV 211
D + G + L L F + +G G + D G+
Sbjct: 303 DAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTA 362
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHF 266
+T + + YA L F D+ R GV TC++L + P++ +F
Sbjct: 363 VTRLPPDAYAALRDAFASTIG-GDLP-----RAPGVSLLDTCYDLSGYASVRVPTVALYF 416
Query: 267 --QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
GA L + N+ + + F + + G +ILG Q Q D
Sbjct: 417 GRDGAALTLPARNLLVEMGGGVYCLAFAASAS---GLSILGNIQQQGIQITVD 466
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 141/362 (38%), Gaps = 63/362 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + +GDP ++DT + L W QC PC+ CY Q P+Y+ RS +++++PC
Sbjct: 88 YFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPR 147
Query: 68 CKSPFH-----CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+ G C Y + YGD + L T L+ PD+ V N+ GC
Sbjct: 148 CRDVLRYPGCDARTGGCVYMVVYGD--GSASSGDLATDRLVFPDD---THVHNVTLGCGH 202
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ-I 181
++ + + AG++G+ SF QL FS CL SR + G +
Sbjct: 203 DNVGLL----ESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCL----GDRLSRAQNGSSYL 254
Query: 182 IAGKSLNLPPNSFT---------------------------------IKLN---GQRGCI 205
+ G++ P +FT + LN G+ G +
Sbjct: 255 VFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIV 314
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHD-IEKLFTCRKCGVTCFNL-----PARFNSF 259
D G+ ++ + YA + F + + + KL T C++L PA
Sbjct: 315 VDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRV 374
Query: 260 PSMTYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
PS+ HF GAD+ + N I D +F G +LG Q V+D
Sbjct: 375 PSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFD 434
Query: 318 LD 319
++
Sbjct: 435 VE 436
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 142/346 (41%), Gaps = 55/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P +S++ + DT + ++W QC PC+ CY Q DPI+N S+K L C +
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 140
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C K + +C Y ++YGD T V T TL + +V+++ GC +
Sbjct: 141 CGKLKIKGCSRKNECMYQVSYGDGSFT--VGDFSTETLSFGEH----AVRSVAMGCGRNN 194
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
+ ++ L SF Q G FS CL + + + + L FG
Sbjct: 195 QGLFHGAAGLLG----LGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSAVPE 250
Query: 182 -------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+AG +N+PP++F + G G I D G+ ++ +
Sbjct: 251 KARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLT 310
Query: 217 CEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLV 272
Y L F + + S I LF TC++L + + + P++ F G +
Sbjct: 311 TPAYTALRDAFRSLVTFPSAPGIS-LFD------TCYDLSSMKTATLPAVVLDFDGGASM 363
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
P + + N D + AF P + +I+G Q + D
Sbjct: 364 PLPADGILVNVDDEGTYCL--AFAPEEEAFSIIGNVQQQTFRISID 407
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/351 (26%), Positives = 140/351 (39%), Gaps = 60/351 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P + L+ +LDT + + W QC PC+ CY Q+DPI+N KS+ +PC
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPL 169
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ S C Y ++YGD T + +T T + + GC
Sbjct: 170 CRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTF------RGNKIAKVALGCGHH 223
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQI 181
++ FV + SF Q G +FS CLV S S + FGD
Sbjct: 224 NEGLFVGAAGLLGL-----GRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAA 278
Query: 182 IA-----------------------GKSL------NLPPNSFTIKLNGQRGCINDCGSVL 212
I+ G S+ + P+ F + G G I D G+ +
Sbjct: 279 ISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSV 338
Query: 213 TVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQG 268
T + Y L F + + LF TC++L + + P++ HF+G
Sbjct: 339 TRLTRPAYTALRDAFRVGARHLKRGPEFSLFD------TCYDLSGQSSVKVPTVVLHFRG 392
Query: 269 ADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
AD+ + N I + SF F F + G +I+G Q + VYDL
Sbjct: 393 ADMALPATNYLIPVDENGSFCFAFAGTIS---GLSIIGNIQQQGFRVVYDL 440
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 146/351 (41%), Gaps = 60/351 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + L F+ DT + LTWTQC+PC + CY Q +PI+N SY + C
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 67 SC--------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
+C SP C C YGI YGD + + D L D N F
Sbjct: 198 TCDELKSGTGNSP-SCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDV-----FNNFLF 251
Query: 119 GCSLESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
GC ++ FV +AG++GL ++ S + Q + FS CL S L F
Sbjct: 252 GCGQNNRGLFVG-----VAGLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSSS-TGYLTF 305
Query: 178 GDQIIAGKSLNLPP-------------NSFTIKLNGQR-----------GCINDCGSVLT 213
G K++ P N I + G++ G I D G+V++
Sbjct: 306 GSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSGTVIS 365
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNS--FPSMTYHFQ-G 268
+ Y+ L A F Q + K + TC++ +++++ P + +F G
Sbjct: 366 RLPPTAYSDLRASF-----QQQMSKYPKAAPASILDTCYDF-SQYDTVDVPKINLYFSDG 419
Query: 269 ADLVVEPENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
A++ ++P +F I N F G + ILG Q VYD+
Sbjct: 420 AEMDLDPSGIFYILNISQVCLAFAGN--SDATDIAILGNVQQKTFDVVYDV 468
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 144/357 (40%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L +G P + + LLDT + L WTQC PC SC Q DPI++ + SY+ + C
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGEL 163
Query: 68 CKSPFH--CFEGD-CFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNIRFGCS 121
C H C D C Y +YGD T+ V + + T E + +S + FGC
Sbjct: 164 CNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAP-LGFGCG 222
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL----------------- 164
+K ++ +GI+G S + Q L RFS CL
Sbjct: 223 TMNKGSLNNG----SGIVGFGRAPLSLVSQ---LAIRRFSYCLTPYASGRKSTLLFGSLR 275
Query: 165 ----------VQPDKSFHSR-------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
VQ + SR + F + + L +P ++F ++ +G G I D
Sbjct: 276 GGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVD 335
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA----RFNSFPSMT 263
G+ LT+ V A + F + GV CF A R P M
Sbjct: 336 SGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGV-CFAAAASRVPRPAVVPRMV 394
Query: 264 YHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+H QGADL + P ++ + Q A + G TI G Q + + +YDL+
Sbjct: 395 FHLQGADLDL-PRRNYVLDDQRKGNLCLLLADSGDSGTTI-GNFVQQDMRVLYDLEA 449
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 140/370 (37%), Gaps = 78/370 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC +C+EQN P Y+ + S+K + C+D
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPR 254
Query: 68 CK-----SPFHCFEGD---CFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNI 116
C+ P +G+ C Y YGD T +L+T T+ P +P V+N+
Sbjct: 255 CQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENV 314
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF QL L FS CLV + S S+
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGR----GPLSFATQLQSLYGHSFSYCLVDRNSNSSVSSK 370
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ G+ L +P ++ + G
Sbjct: 371 LIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQGG 430
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRK-----CGVTCFNLP--- 253
G I D G+ LT Y ++ F+ + + F K GV LP
Sbjct: 431 GGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKMELPEFA 490
Query: 254 -----ARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARH 308
FP Y Q +EPE+V G TPR +I+G
Sbjct: 491 ILFADGAMWDFPVENYFIQ-----IEPEDVVCLA-------ILG---TPRSALSIIGNYQ 535
Query: 309 QHNTQFVYDL 318
Q N +YDL
Sbjct: 536 QQNFHILYDL 545
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 151/354 (42%), Gaps = 61/354 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++GIG P K + ++DT + + W QC PCKSCY+QND +++ R+ S+++L C
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 68 CK--SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
CK C D C Y ++YGD T D S + SPV FGC +
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVG-DLASDSFSVSRGRTSPVV-----FGCGHD 127
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH--SRLEFGDQ 180
++ FV + G L++ S +L +FS CLV D S L FGD
Sbjct: 128 NEGLFVGAAGLLGLGAGKLSFPS--------QLSSRKFSYCLVSRDNGVRASSALLFGDS 179
Query: 181 I------------------------------IAGKSLNLPPNSFTIKLN-GQRGCINDCG 209
I G L++P +F + + G+ G I D G
Sbjct: 180 ALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSG 239
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF 266
+ +T + Y V+ F + +KL + TC++ A + + P++++HF
Sbjct: 240 TSVTRLPTYAYTVMRDAF-----RSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHF 294
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+G V P + ++ S F F + T +I+G Q + DLD+
Sbjct: 295 EGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLD-LSIIGNIQQQTMRVAIDLDS 347
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 146/363 (40%), Gaps = 71/363 (19%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN Y++ +GIG +L ++DT + LTW QC PC+ CY Q +P++N + S+ LP
Sbjct: 142 TLN--YIVTVGIGGQNSTL--IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLP 197
Query: 63 CYDASCKS--PFHCFEG--------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C +C + P G C Y I YGD ++ + TL
Sbjct: 198 CNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL------GKTE 251
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
+ N FGC +K +G+MGL S + Q L FS CL
Sbjct: 252 IDNFIFGCGRNNKGLFGGA----SGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS 307
Query: 173 SRLEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLN 199
L G I G +LN+P +L+
Sbjct: 308 GSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP------RLS 361
Query: 200 GQRGCIN--DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN 257
G ++ D G+V+T + +Y AEF FS + F+ TCFNL
Sbjct: 362 SNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILN---TCFNLTGYEE 418
Query: 258 -SFPSMTYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQF 314
+ P++ + F+G A+++V+ E VF F D+ A + +T I+G Q N +
Sbjct: 419 VNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRV 478
Query: 315 VYD 317
+Y+
Sbjct: 479 IYN 481
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 139/345 (40%), Gaps = 53/345 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P +S++ + DT + ++W QC PC+ CY Q DPI+N S+K L C +
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 73
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C K + C Y ++YGD T V T TL + +V+++ GC +
Sbjct: 74 CGKLKIKGCSRKNKCMYQVSYGDGSFT--VGDFSTETLSFGEH----AVRSVAMGCGRNN 127
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
+ ++ L SF Q G FS CL + + + + L FG
Sbjct: 128 QGLFHGAAGLLG----LGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSAVPE 183
Query: 182 -------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+AG +N+PP++F + G G I D G+ ++ +
Sbjct: 184 KARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLT 243
Query: 217 CEVYAVLTAEF--IDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLVV 273
Y L F + F LF TC++L + + + P++ F G +
Sbjct: 244 TPAYTALRDAFRSLVTFPSAPGISLFD------TCYDLSSMKTATLPAVVLDFDGGASMP 297
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
P + + N D + AF P + +I+G Q + D
Sbjct: 298 LPADGILVNVDDEGTYCL--AFAPEEEAFSIIGNVQQQTFRISID 340
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 146/363 (40%), Gaps = 71/363 (19%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN Y++ +GIG +L ++DT + LTW QC PC+ CY Q +P++N + S+ LP
Sbjct: 63 TLN--YIVTVGIGGQNSTL--IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLP 118
Query: 63 CYDASCKS--PFHCFEG--------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C +C + P G C Y I YGD ++ + TL
Sbjct: 119 CNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL------GKTE 172
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
+ N FGC +K +G+MGL S + Q L FS CL
Sbjct: 173 IDNFIFGCGRNNKGLFGGA----SGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS 228
Query: 173 SRLEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLN 199
L G I G +LN+P +L+
Sbjct: 229 GSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP------RLS 282
Query: 200 GQRGCIN--DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN 257
G ++ D G+V+T + +Y AEF FS + F+ TCFNL
Sbjct: 283 SNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILN---TCFNLTGYEE 339
Query: 258 -SFPSMTYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQF 314
+ P++ + F+G A+++V+ E VF F D+ A + +T I+G Q N +
Sbjct: 340 VNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRV 399
Query: 315 VYD 317
+Y+
Sbjct: 400 IYN 402
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 136/345 (39%), Gaps = 45/345 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++ +G P + ++ ++DT + + W QC PC SCY Q D +++ +Y L C
Sbjct: 37 YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSRQ 96
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C C Y + YGD + + D +L V + I GC +++
Sbjct: 97 CLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNE 156
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFGDQI-- 181
+ ++ L SF Q+ RFS CL D + S L FGD
Sbjct: 157 GYFVGAAGLLG----LGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSSLIFGDAAVP 212
Query: 182 ---------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+ G L +P ++F + G G I D G+ +T
Sbjct: 213 PAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTR 272
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLV 272
++ YA L F S + F+ TC+NL + P++T HFQ GADL
Sbjct: 273 LQNAAYASLREAFRAGTSDLVLTTEFSLFD---TCYNLSDLSSVDVPTVTLHFQGGADLK 329
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ N + S F AF G +I+G Q + +YD
Sbjct: 330 LPASNYLVPVDNSSTFCL---AFAGTTGPSIIGNIQQQGFRVIYD 371
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 146/354 (41%), Gaps = 58/354 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN Y++ +G+G+ +++ ++DT + LTW QC PC SCY Q P++N + SY L
Sbjct: 130 TLN--YIVTIGLGN--QNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLL 185
Query: 63 CYDASCK-------SPFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C ++C+ + C + C + ++YGD T D + +S
Sbjct: 186 CNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFT------DGELGVEHLSFGGIS 239
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V N FGC +K ++GIMGL + S + Q FS CL D
Sbjct: 240 VSNFVFGCGRNNKGLFG----GVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGAS 295
Query: 173 SRLEFGDQIIAGKSLNLPPNSFTIKLN---------------------------GQRGCI 205
L G++ K NL P ++T ++ G G +
Sbjct: 296 GSLVIGNESSLFK--NLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGIL 353
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTY 264
D G+V+T + +Y L AEF+ FS + I + TCFNL S P+++
Sbjct: 354 IDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILD---TCFNLTGIEEVSIPTLSM 410
Query: 265 HFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
HF+ DL V+ + S + + I+G Q N + +YD
Sbjct: 411 HFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYD 464
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 136/336 (40%), Gaps = 41/336 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G G P K+ + DT + + W QC+PC SCY Q +P+++ +Y+ + C A
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSA 75
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C S C C YG+TYGD T + +T TL + N FGC +
Sbjct: 76 ACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNV-----FNNFIFGCGQNN 130
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
+ + AG++GL S QL + + FS CL S L G+ +
Sbjct: 131 QGLFTGA----AGLIGLGRSPYSLNSQLATSLGNIFSYCL-PSTSSATGYLNIGNPLRTP 185
Query: 185 KSLNLPPNS----------FTIKLNGQR-----------GCINDCGSVLTVIECEVYAVL 223
+ NS I + G R G I D G+V+T + Y L
Sbjct: 186 GYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLPPTAYGAL 245
Query: 224 TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPENVF-IF 281
F +Q+ + TC++ +FP++ H+ G D+ + VF +
Sbjct: 246 RTAFRAAMTQYTRAAAASILD---TCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVFYVI 302
Query: 282 NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ F G + + + G I+G Q + YD
Sbjct: 303 SSSQVCLAFAGNSDSTQIG--IIGNVQQRTMEVTYD 336
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 131/315 (41%), Gaps = 57/315 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KL +G P + + DT + + WTQC PC +CY+Q+ P++N +Y+K+ S
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKV-----S 139
Query: 68 CKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C SP F G DC Y I+YGD ++ ++DT T + V+
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT-MGSTSGRVVAFPRTAI 198
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLE 176
GC ++ ++GI+GL S + Q+G V +FS CL + D ++L
Sbjct: 199 GCGHDNAGSFDAN---VSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 177 FGDQI-IAGKSLNLPPNSFTIK---------------------------LNGQRGCINDC 208
FG ++G P + K L G+ I D
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDS 315
Query: 209 GSVLTVIECEVY---AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
G+ LT++ ++Y A + I+ D + CF P + H
Sbjct: 316 GTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLE------YCFETTTDDYKVPFIAMH 369
Query: 266 FQGADLVVEPENVFI 280
F+GA+L ++ ENV I
Sbjct: 370 FEGANLRLQRENVLI 384
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 141/359 (39%), Gaps = 74/359 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P L DT + LTWTQC+PCK C+ Q+ PIY++ + S+ LPC A+
Sbjct: 83 YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142
Query: 68 C----KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C S C Y Y D + E + +SV I FGC ++
Sbjct: 143 CLPIWSSRCSTPSATCRYRYAYDDGAYSPEC--------------AGISVGGIAFGCGVD 188
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFGDQI- 181
+ G +GL S S + QLG +FS CL + S S + FG
Sbjct: 189 NGGL----SYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLSSPVFFGSLAE 241
Query: 182 -------------------------------IAGKSLN---LPPNSFTIKLN---GQRGC 204
+ G SL LP + T LN G G
Sbjct: 242 LAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGM 301
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA----RFNSFP 260
I D G++ T++ + V+ +D+ + + + CF PA P
Sbjct: 302 IVDSGTIFTILVETGFRVV----VDHVAGVLGQPVVNASSLDRPCFPAPAAGVQELPDMP 357
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
M HF GAD+ + +N FN ++S F T ++LG Q N Q ++D+
Sbjct: 358 DMVLHFAGGADMRLHRDNYMSFNEEES-SFCLNIVGTESASGSVLGNFQQQNIQMLFDI 415
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 137/346 (39%), Gaps = 44/346 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P L DT + LTWTQCQPCK C+ Q+ P+Y+ + ++ +PC A+
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSAT 125
Query: 68 CKSPFHCF-----EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-- 120
C + C Y +Y D + + +T T+ VSV ++ FGC
Sbjct: 126 CLPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGT 185
Query: 121 -----SLESKDFVSIQKKIIAGIMGLN---------------WDSTSFMVQLGRLVPD-- 158
SL S V + + ++ + L DS F+ L L P
Sbjct: 186 DNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSTMDSPFFLGTLAELAPGPG 245
Query: 159 --RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ + L P + + L +P +F ++ +G G + D G+ T++
Sbjct: 246 TVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFTILA 305
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEP 275
+ E +D +Q + CF P P + HF GAD+ +
Sbjct: 306 KSGF----REVVDRVAQLLGQPPVNASSLDSPCFPSPDGEPFMPDLVLHFAGGADMRLHR 361
Query: 276 ENVFIFNHQDSFF---FFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+N +N DS F P+ R LG Q N Q ++D+
Sbjct: 362 DNYMSYNEDDSSFCLNIVGSPSTWSR-----LGNFQQQNIQMLFDM 402
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 143/367 (38%), Gaps = 74/367 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + + LLDT + L WTQC PC SC Q DP++ SY+ + C
Sbjct: 96 YVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTL 155
Query: 68 CKSPFH--CFEGD-CFYGITYGD------VYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C H C D C Y YGD VY T+ + P+ F
Sbjct: 156 CSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLG-----F 210
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC S + S+ +GI+G + S + QL RFS CL S L FG
Sbjct: 211 GCG--SVNVGSLNNG--SGIVGFGRNPLSLVSQLS---IRRFSYCLTSYASRRQSTLLFG 263
Query: 179 D----------------------------------QIIAGKSLNLPPNSFTIKLNGQRGC 204
+ + L +P ++F ++ +G G
Sbjct: 264 SLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGV 323
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCR---KCGVTCFNLPARFNS--- 258
I D G+ LT++ A + AE + F Q + F + GV CF +PA +
Sbjct: 324 IVDSGTALTLLP----AAVLAEVVRAFRQQ-LRLPFANGGNPEDGV-CFLVPAAWRRSSS 377
Query: 259 -----FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
P M HFQGADL + N + +H+ A + G TI G Q + +
Sbjct: 378 TSQMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLL-LADSGDDGSTI-GNLVQQDMR 435
Query: 314 FVYDLDT 320
+YDL+
Sbjct: 436 VLYDLEA 442
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 144/350 (41%), Gaps = 56/350 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + ++DT + LTWTQC+PC CY+Q P+++ ++ +Y+ C +
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 68 C----KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL-LPPDEPSPVSVQNIRFGCSL 122
C K E C + +Y D T +L + TL + PVS FGC
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTG--GNLASETLTVDSTAGKPVSFPGFAFGCGH 209
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLEFG-- 178
S I K +GI+GL S + QL + FS CL V D S SR+ FG
Sbjct: 210 SSG---GIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGAS 266
Query: 179 --------------------------DQIIAGKSLNLPPNSFTIKLNGQRG-CINDCGSV 211
+ I GK LP ++ K + G I D G+
Sbjct: 267 GRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKK-RLPYKGYSKKTEVEEGNIIVDSGTT 325
Query: 212 LTVIECEVYAVL---TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQG 268
T + E Y+ L A I D +F+ C+N A N+ P +T HF+
Sbjct: 326 YTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSL------CYNTTAEINA-PIITAHFKD 378
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
A++ ++P N F+ +D F P +LG Q N +DL
Sbjct: 379 ANVELQPLNTFMRMQEDLVCF----TVAPTSDIGVLGNLAQVNFLVGFDL 424
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 147/345 (42%), Gaps = 52/345 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
+++++G+G P + + + D TW QCQPC CY+Q D I++ SY L C
Sbjct: 187 FLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLSCETKH 246
Query: 68 CK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + +G C Y ITY D T+ V +T + S V + GCS ++
Sbjct: 247 CNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSF-----ESSGWVDRVSLGCSNKN 301
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQ-- 180
+ FV G GL S SF R+ S CLV+ + S LEF
Sbjct: 302 QGPFVGSD-----GTFGLGRGSLSFP---SRINASSMSYCLVESKDGYSSSTLEFNSPPC 353
Query: 181 -------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ G+ +++P ++FTI G G I S++T++
Sbjct: 354 SGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITML 413
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVE 274
E + Y V+ F+ +QH +E+L + TC+NL + P + + +
Sbjct: 414 ENDTYNVVRDAFVAK-TQH-LERLKAFLQFD-TCYNLSSNNTVELPILEFEVNDGKSWLL 470
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
P+ +++ + F F AF P KG +ILG Q+ T+ +DL
Sbjct: 471 PKESYLYAVDKNGTFCF--AFAPSKGSFSILGTLQQYGTRVTFDL 513
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 142/355 (40%), Gaps = 56/355 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L +G P + + LDT + L WTQC PC+ C++Q+ P+ + + +Y LPC A
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143
Query: 68 CKS-PFHC-------FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS---PVSVQNI 116
C++ PF C Y YGD ++ V + T D + + +
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYAYHYGD--KSLTVGEIATDRFTFGDSGGSGESLHTRRL 201
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLE 176
FGC +K + IAG W S +L FS C +S S +
Sbjct: 202 TFGCGHLNKGVFQSNETGIAGFGRGRWSLPS------QLNVTSFSYCFTSMFESKSSLVT 255
Query: 177 FGDQIIA-------GKSLNLP-------PNSFTIKLNGQ--------------RGCINDC 208
G A G+ P P+ + + L G R I D
Sbjct: 256 LGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDS 315
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA----RFNSFPSMTY 264
G+ +T + EVY + AEF +Q + CF LP R + PS+T
Sbjct: 316 GASITTLPEEVYEAVKAEFA---AQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTL 372
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
H +GAD + P + ++F + P + +T++G Q NT VYDL+
Sbjct: 373 HLEGADWEL-PRSNYVFEDLGARVMCIVLDAAPGE-QTVIGNFQQQNTHVVYDLE 425
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 94/346 (27%), Positives = 138/346 (39%), Gaps = 48/346 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L +G P + + DT + L WTQC PC CY+Q P+++ +S K+Y+ L C
Sbjct: 93 YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTRQ 152
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ S E C Y YGD T ++DT T LP PV GC
Sbjct: 153 CQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVT-LPSTNGGPVYFPKTVIGCGRR 211
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLEFGDQ 180
+ + KK +GI+GL S + Q+G V +FS CLV S+L FG
Sbjct: 212 NNG--TFDKK-DSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRN 268
Query: 181 IIAGKS-------LNLPPNSF------TIKLNGQR-------------GCINDCGSVLTV 214
+ S ++ P++F + + ++ I D G+ LT+
Sbjct: 269 AVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGNIIIDSGTSLTL 328
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT--CFNLPARFNSFPSMTYHFQGADLV 272
+ EF I T G+ C+ P P +T HF GAD+V
Sbjct: 329 FPVNFF----TEFATAVENAVINGERTQDASGLLSHCYR-PTPDLKVPVITAHFNGADVV 383
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ N FI D AF + I G Q N YD+
Sbjct: 384 LQTLNTFILISDDVLCL----AFNSTQSGAIFGNVAQMNFLIGYDI 425
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 145/346 (41%), Gaps = 40/346 (11%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y+++ IG P DT + L W QC PC SC+ Q+ P++ ++ C
Sbjct: 87 NGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCR 146
Query: 65 DASCKSPF----HCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS--PVSVQNIR 117
C C + G+C Y YGD Y E L T TL + V+ N
Sbjct: 147 SQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSE-GLLSTETLRFDSQGGVQTVAFPNSF 205
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
FGC L + V K+ GIMGL S + Q+G + +FS CL+ + S+L+F
Sbjct: 206 FGCGLYNNITVFPSYKL-TGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTSTSKLKF 264
Query: 178 GDQ-IIAGKS-----------------LNLPPNSF---TIKLNGQRG-CINDCGSVLTVI 215
G++ II G+ LNL + T+ G I D G++LT +
Sbjct: 265 GNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTYL 324
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVE 274
Y A + + ++ + + F P R N FP + + F GA + ++
Sbjct: 325 GESFYYNFAASLQESLAVELVQDVLSPLP-----FCFPYRDNFVFPEIAFQFTGARVSLK 379
Query: 275 PENVFIFNH-QDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
P N+F+ +++ P+ G +I G+ Q + Q YDL+
Sbjct: 380 PANLFVMTEDRNTVCLMIAPSSV--SGISIFGSFSQIDFQVEYDLE 423
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 95/356 (26%), Positives = 145/356 (40%), Gaps = 55/356 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +GIG P ++ L DT + LTW QC+PC SCY+Q +P+++ +Y +PC
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTP 185
Query: 67 SCK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
CK C C Y + YGD T+ + + TL P P+ + FGCS
Sbjct: 186 QCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPA----AGVVFGCSH 241
Query: 123 ESKDFV--SIQKKIIAGIMGLNWDSTSFMVQLGRL-VPDRFSCCLVQPDKSFHSRLEFGD 179
E V + ++ +AG++GL +S + Q R D FS CL P S L G
Sbjct: 242 EYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCL-PPRGSSAGYLTIGA 300
Query: 180 QIIAGKSLNLPP--------------NSFTIKLNGQR----------GCINDCGSVLTVI 215
+L+ P N I ++G G + D G+V+T +
Sbjct: 301 AAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTVIDSGTVITHM 360
Query: 216 ECEVYAVLTAEFIDYFS------QHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF-QG 268
Y VL EF + + +E L TC VT ++ + P + F G
Sbjct: 361 PAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYD--VTGHDVV----TAPPVALEFGGG 414
Query: 269 ADLVVEPEN---VFIFNHQDSFFFFFGPAFTPRK--GKTILGARHQHNTQFVYDLD 319
A + V+ VF + AF P G I+G Q V+D++
Sbjct: 415 ARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVE 470
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 90/341 (26%), Positives = 138/341 (40%), Gaps = 43/341 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++ +G P + DT + L+W QC PCK+CY Q P+++ +Y +PC
Sbjct: 88 YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQP 147
Query: 68 C----KSPFHCFEG-DCFYGITYG-DVYETKEVD----SLDTSTLLPPDEPSPVSVQNIR 117
C ++ C C Y YG D + + S ++ + P SV
Sbjct: 148 CTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV---- 203
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
FGC+ S I K G +GL S QLG + +FS C+V + +L+F
Sbjct: 204 FGCAFYSNFTFKISTK-ANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKF 262
Query: 178 GDQIIAGKSLNLP-------PNSFTIK-----------LNGQRG--CINDCGSVLTVIEC 217
G + ++ P P+ + + L GQ G I D +LT +E
Sbjct: 263 GSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTHLEQ 322
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
+Y + + + E T + C P N FP +HF GAD+V+ P+N
Sbjct: 323 GIYTDFISSVKEAINVEVAEDAPTPFE---YCVRNPTNLN-FPEFVFHFTGADVVLGPKN 378
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+FI + P KG +I G Q N Q YDL
Sbjct: 379 MFIALDNNLVCM----TVVPSKGISIFGNWAQVNFQVEYDL 415
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 139/354 (39%), Gaps = 67/354 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ +LG+G P + ++D+ + LTW QC PC SC+ Q P+Y+ R+ +Y +PC
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAP 167
Query: 67 SCK-------SPFHCF-EGDCFYGITYGD-----VYETKEVDSLDTSTLLPPDEPSPVSV 113
C +P C G C Y +YGD Y +K+ SL +S P
Sbjct: 168 QCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFP--------- 218
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
+GC +D V + + AG++GL + S + QL V + F+ CL +
Sbjct: 219 -GFYYGC---GQDNVGLFGR-AAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAG 273
Query: 174 RLEFGDQ--------------------------IIAGKSLNLPPNSFTIKLNGQRGCIND 207
L FG +AG S+ P + G I D
Sbjct: 274 YLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLPTIID 333
Query: 208 CGSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTY 264
G+V+T + VY L+ + S L TC K V +PA +F
Sbjct: 334 SGTVITRLPTPVYTALSKAVGAALAAPSAPAYSILQTCFKGQVAKLPVPAVNMAFAG--- 390
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GA L + P NV + ++ + AF P I+G Q VYD+
Sbjct: 391 ---GATLRLTPGNVLVDVNETTTCL----AFAPTDSTAIIGNTQQQTFSVVYDV 437
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 151/369 (40%), Gaps = 72/369 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
YM+ L IG P + + DT + LTW Q +PC CY Q PI++ + ++ KLPC A
Sbjct: 80 YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAP 139
Query: 68 C----KSPFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C +S C + C Y +YGD T + DT T+ + V ++N+ FGC
Sbjct: 140 CNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTV----GNASVQIRNVAFGCGT 195
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--------QP-DKSFHS 173
+ Q I G+ G N SF+ QLG + +FS CL+ QP D S
Sbjct: 196 RNGGNFDEQGSGIVGLGGGNL---SFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATS 252
Query: 174 RLEFGDQIIAGKS------------LNLPPNSF---TIKL-------------------- 198
R+ FGD + S +N P+++ TI+
Sbjct: 253 RIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASY 312
Query: 199 -NGQRGCIN------DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CF 250
+G + + D G+ LT +E E Y L A ++ + +E++ + + CF
Sbjct: 313 DSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVE---EIKMERVNDVKNSMFSLCF 369
Query: 251 NLPARFNSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQ 309
P M HF+ GAD+ ++P N F+ + F P I G Q
Sbjct: 370 KSGKEEVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCF----TMLPTNDVGIYGNLAQ 425
Query: 310 HNTQFVYDL 318
N YDL
Sbjct: 426 MNFVVGYDL 434
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 144/361 (39%), Gaps = 64/361 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + + LLDT + L WTQC PC SC Q DP++ SY+ + C
Sbjct: 102 YVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQL 161
Query: 68 CKSPFH--CFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C H C D C Y YGD T V + + T ++V + FGC S
Sbjct: 162 CSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP-LGFGCG--S 218
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD----- 179
+ S+ +GI+G + S + QL RFS CL S L FG
Sbjct: 219 MNVGSLNNG--SGIVGFGRNPLSLVSQLS---IRRFSYCLTSYGSGRKSTLLFGSLSGGV 273
Query: 180 -----------------------------QIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ + L +P ++F ++ +G G I D G+
Sbjct: 274 YGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGT 333
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCR---KCGVTCFNLPARFNS--------F 259
LT++ V AE + F Q + F + GV CF +PA +
Sbjct: 334 ALTLLPGAVL----AEVVRAFRQQ-LRLPFANGGNPEDGV-CFLVPAAWRRSSSTSQVPV 387
Query: 260 PSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
P M +HFQ ADL + N + +H+ A + G TI G Q + + +YDL+
Sbjct: 388 PRMVFHFQDADLDLPRRNYVLDDHRKGRLCLL-LADSGDDGSTI-GNLVQQDMRVLYDLE 445
Query: 320 T 320
Sbjct: 446 A 446
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 144/362 (39%), Gaps = 71/362 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ K+ +G P LDT + LTW QCQPC+ CY Q+ P+++ R SY+++ A
Sbjct: 138 YIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAAD 197
Query: 68 CKSPFHCFEGD-----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C++ GD C Y + YGD T +T T V + I GC
Sbjct: 198 CQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFA-----GGVRLPRISIGCGH 252
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLEFGD 179
++K AGI+GL SF Q+ FS CLV S S L FG
Sbjct: 253 DNKGLFGAPA---AGILGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPGSLSSTLTFG- 306
Query: 180 QIIAGKSLNLPPNSFT---IKLN---------------------------------GQRG 203
AG PP SFT + LN G+ G
Sbjct: 307 ---AGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRGG 363
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFS--QHDIEKLFTCRKCGV--TCFNLPAR-FNS 258
I D G+ +T + Y F D F D+ ++ G TC+ + R
Sbjct: 364 VIVDSGTAVTRLARPAYTA----FRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKK 419
Query: 259 FPSMTYHFQGA-DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
P+++ HF G+ ++ ++P+N I + + F F A T +I+G Q + VY
Sbjct: 420 VPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAF--AATGDHSVSIIGNIQQQGFRIVY 477
Query: 317 DL 318
D+
Sbjct: 478 DI 479
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 132/343 (38%), Gaps = 44/343 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L++ +G P + ++DT + L W QC PC C+EQ DP++ + SY C D+
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSL 67
Query: 68 CKS---PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + P C Y +YGD T+ + +T TL + ++ I FGC
Sbjct: 68 CDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTL------NGSTLARIGFGCGHNQ 121
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV-QPDKSFHSRLEFGDQI-- 181
+ + G++GL S QL FS CLV Q S + FG+
Sbjct: 122 EGTFAGAD----GLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAEN 177
Query: 182 --------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ + + PP++F I NG G I D G+ +T
Sbjct: 178 SRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYW 237
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
+ + AE S + + ++ A + PSMT H D +
Sbjct: 238 RLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVDFEIPV 297
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
N+++ D+F A + +I+G Q N V D+
Sbjct: 298 SNLWVL--VDNFGETVCTAMSTSDQFSIIGNVQQQNNLIVTDV 338
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 141/345 (40%), Gaps = 44/345 (12%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYK 59
+F + Y++ +G G P ++ + DT + + W QC+PC CY Q +P+++ +Y+
Sbjct: 9 LFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYR 68
Query: 60 KLPCYDASCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
+ C + +C S C C YG+ YGD T ++DT L P + +N
Sbjct: 69 NVSCTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQK-----FKNFI 123
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDST-SFMVQLGRLVPDRFSCCLVQPDKSFHSRLE 176
FGC + + AG++GL ST S Q+ + + FS CL S L
Sbjct: 124 FGCGQNNTGLF----QGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCL-PSTSSATGYLN 178
Query: 177 FGD-QIIAGKSLNLPPNS---------FTIKLNGQR-----------GCINDCGSVLTVI 215
G+ Q G + L I + G R G I D G+V+T +
Sbjct: 179 IGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRL 238
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHFQGADLVV 273
Y+ L +Q+ + T TC++ +R S +P + HF G D+ +
Sbjct: 239 PPTAYSALKTAVRAAMTQYTLAPAVTILD---TCYDF-SRTTSVVYPVIVLHFAGLDVRI 294
Query: 274 EPENV-FIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
V F+FN F G + G I+G Q + YD
Sbjct: 295 PATGVFFVFNSSQVCLAFAGNTDSTMIG--IIGNVQQLTMEVTYD 337
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 147/363 (40%), Gaps = 60/363 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +GIG P ++ L DT + LTW QC PC SCY Q +P+++ +Y +PC
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSA 181
Query: 66 ASCK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C C C Y + YGD ET + +T TL PP +P + + FGCS
Sbjct: 182 PECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAAT-GVVFGCS 240
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR---FSCCL-------------- 164
E + +AG++GL +S + Q R + FS CL
Sbjct: 241 HEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSSTGYLTIGG 300
Query: 165 --VQPDKSFHSRLEFGDQI------------------IAGKSLNLPPNSFTIKLNGQRGC 204
P + + S L F I + G ++++P ++F++ G
Sbjct: 301 GAAAPQQQY-SNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL------GA 353
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR-FNSFPSMT 263
+ D G+V+T + Y L EF + + + + + TC+++ + + P +
Sbjct: 354 VIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLD-TCYDVTGQDVVTAPRVA 412
Query: 264 YHF-QGADLVVEPENVFIF----NHQDSFFFFFGPAFTP--RKGKTILGARHQHNTQFVY 316
F GA + V+ + + + AF P G I+G Q V+
Sbjct: 413 LEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRAYNVVF 472
Query: 317 DLD 319
D+D
Sbjct: 473 DVD 475
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 144/340 (42%), Gaps = 50/340 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++KL +G P + ++DT + +TWTQC PC CYEQN PI++ ++K+ C
Sbjct: 62 NSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD 121
Query: 65 DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL-LPPDEPSPVSVQNIRFGCSLE 123
SC Y + Y D T + +L T T+ L P + GC
Sbjct: 122 GHSCP-----------YEVDYFD--HTYTMGTLATETITLHSTSGEPFVMPETIIGCGHN 168
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-DQII 182
+ F K +G++GLNW +S + Q+G P S C S++ FG + I+
Sbjct: 169 NSWF----KPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGT---SKINFGANAIV 221
Query: 183 AGKSL--------NLPPNSFTIKLNG---QRGCINDCGSVLTVIECEVYAVLTAEFIDYF 231
AG + P + + L+ I G+ +E + + + + YF
Sbjct: 222 AGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNI-VIDSGTTLTYF 280
Query: 232 S-------QHDIEKLFTCRKCG------VTCFNLPARFNSFPSMTYHFQGA-DLVVEPEN 277
+ +E + T + + C+N + FP +T HF G DLV++ N
Sbjct: 281 PVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN-SDTIDIFPVITMHFSGGVDLVLDKYN 339
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+++ ++ F +P + + I G R Q+N YD
Sbjct: 340 MYMESNNGGVFCLAIICNSPTQ-EAIFGNRAQNNFLVGYD 378
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 140/361 (38%), Gaps = 61/361 (16%)
Query: 3 TLNHTYMLKLG---IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYK 59
TLN+ + LG G P +L ++DT + LTW QC+PC +CY Q DP+++ +Y
Sbjct: 182 TLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYA 241
Query: 60 KLPCYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
+ C ++C + P C G+ C+Y + YGD ++ V + DT L
Sbjct: 242 AVRCNASACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVAL------G 295
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPD 168
S+ FGC L ++ AG+MGL S + Q FS CL
Sbjct: 296 GASLDGFVFGCGLSNRGLFG----GTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTS 351
Query: 169 KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLN---------------------------GQ 201
L G A N P ++T + G
Sbjct: 352 GDASGSLSLGGD--ASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGA 409
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQ--HDIEKLFTCRKCGVTCFNLPARFN-S 258
+ D G+V+T + VY + AEF F+ + F+ TC++L
Sbjct: 410 SNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILD---TCYDLTGHDEVK 466
Query: 259 FPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVY 316
P +T + GA++ V+ + +D A + +T I+G Q N + VY
Sbjct: 467 VPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVY 526
Query: 317 D 317
D
Sbjct: 527 D 527
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 141/363 (38%), Gaps = 73/363 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ L IG P S + DT + L WTQC PC C+ Q P+YN S ++ LPC
Sbjct: 92 YLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNS 151
Query: 66 ---------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSVQ 114
A P C C Y TYG + T V +T T D+ V
Sbjct: 152 SLSMCAGVLAGKAPPPGCA---CMYNQTYGTGW-TAGVQGSETFTFGSAAADQ---ARVP 204
Query: 115 NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------- 164
I FGCS S D+ AG++GL S S + QLG RFS CL
Sbjct: 205 GIAFGCSNASSSDW-----NGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNST 256
Query: 165 ----------------------VQPDKSFHSR---LEFGDQIIAGKSLNLPPNSFTIKLN 199
P K+ S L + K+L++ P++F++K +
Sbjct: 257 STLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKAD 316
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFN- 257
G G I D G+ +T + Y + A + I+ G+ C+ LP +
Sbjct: 317 GTGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTLPAIDG---SDSTGLDLCYALPTPTSA 373
Query: 258 --SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
+ PSMT HF GAD+V+ ++ I S + + G Q N +
Sbjct: 374 PPAMPSMTLHFDGADMVLPADSYMI---SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHIL 430
Query: 316 YDL 318
YD+
Sbjct: 431 YDV 433
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 143/357 (40%), Gaps = 64/357 (17%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
TY++ + IG P L +LDT + L WTQC PC+ C+ Q P+Y +Y + C
Sbjct: 91 TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150
Query: 66 ASC---KSPF-HCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C +SP+ C D C Y +YGD T V + +T TL S +V+ + FG
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL-----GSDTAVRGVAFG 205
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG- 178
C E + S +G++G+ S + QLG RFS C + + S L G
Sbjct: 206 CGTE--NLGSTDNS--SGLVGMGRGPLSLVSQLGV---TRFSYCFTPFNATAASPLFLGS 258
Query: 179 --------------------------------DQIIAGKSLNLP--PNSFTIKLNGQRGC 204
+ I G +L LP P F + G G
Sbjct: 259 SARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTL-LPIDPAVFRLTPMGDGGV 317
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPA-RFNSFPSM 262
I D G+ T +E + L L + G++ CF + P +
Sbjct: 318 IIDSGTTFTALEERAFVALARALASRVRL----PLASGAHLGLSLCFAAASPEAVEVPRL 373
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF GAD+ + E+ ++ + + G +G ++LG+ Q NT +YDL+
Sbjct: 374 VLHFDGADMELRRES-YVVEDRSAGVACLG--MVSARGMSVLGSMQQQNTHILYDLE 427
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 135/335 (40%), Gaps = 62/335 (18%)
Query: 23 FLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK----SPFHCFEGD 78
+LDT + +TW QCQPC CY+Q+DP+++ SY + C C+ + G
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 79 CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD-FVSIQKKIIAG 137
C Y + YGD T V T TL D V N+ GC +++ FV + G
Sbjct: 61 CLYEVAYGDGSYT--VGDFATETLTLGDS---TPVGNVAIGCGHDNEGLFVGAAGLLALG 115
Query: 138 IMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI---------------- 181
L++ S ++ FS CLV D S L+FGD
Sbjct: 116 GGPLSFPS--------QISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRT 167
Query: 182 ------------IAGKSLNLPPNSFTI-KLNGQRGCINDCGSVLTVIECEVYAVLTAEFI 228
+ G+ L++P ++F + +G G I D G+ +T ++ YA L F+
Sbjct: 168 STFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFV 227
Query: 229 DYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGADLVVEPENVFIFNH 283
R GV TC++L R + P+++ F+G + P ++
Sbjct: 228 QGAPSLP-------RTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPV 280
Query: 284 QDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
+ + AF P +I+G Q T+ +D
Sbjct: 281 DGAGTYCL--AFAPTNAAVSIIGNVQQQGTRVSFD 313
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 152/341 (44%), Gaps = 43/341 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
Y+ ++G+G PVK + + DT + +TW QCQPC S CY+Q DPI++ +S SY L C
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 65 DASCK--SPFHCFEGDCFYGITYGD-VYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
CK +C C Y + YGD + T E L T TL + S+ N+ GC
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGE---LATETL---SFGNSNSIPNLPIGCG 261
Query: 122 LESKDF--------------VSIQKKIIAG-----IMGLNWDSTSFMVQLGRLVPDRFSC 162
+++ +S+ ++ A ++ L+ DS+S + + D +
Sbjct: 262 HDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLTS 321
Query: 163 CLVQPDKSFHS--RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVY 220
LV+ D+ FHS ++ + GK+L + P F I +G G I D G++++ + +VY
Sbjct: 322 PLVKNDR-FHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVY 380
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADLVVEPENV 278
L F+ S + TC+N + N P++ + +G L + N
Sbjct: 381 ESLREAFVKLTSSLSPAPGISVFD---TCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNY 437
Query: 279 FIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
I + AF K +I+G+ Q + YDL
Sbjct: 438 LIMLDTAGTYCL---AFIKTKSSLSIIGSFQQQGIRVSYDL 475
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 130/334 (38%), Gaps = 43/334 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++GIG P + ++D+ + + W QC+PC CY Q DPI+N + S+ + C
Sbjct: 129 YFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSSNV 188
Query: 68 CKS---PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C C +G C Y + YGD TK +L+T T+ +Q+ GC +
Sbjct: 189 CNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITI------GRTVIQDTAIGCGHWN 242
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR--------- 174
+ FV + G + SF+ QLG F CLV +
Sbjct: 243 EGMFVGAAGLLGLGGGPM-----SFVGQLGAQTGGAFGYCLVSRAMPVGAMWVPLIHNPF 297
Query: 175 ------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFI 228
+ + G + + F + G G + D G+ +T + Y FI
Sbjct: 298 YPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFI 357
Query: 229 DYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-----PSMTYHFQGADLVVEPENVFIFNH 283
+ R GV+ F+ N F P+++++F G ++ P F+
Sbjct: 358 -------AQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPA 410
Query: 284 QDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
D F F A +P G +I+G Q Q D
Sbjct: 411 DDVGTFCFAFAPSP-SGLSIIGNIQQEGIQVSID 443
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 146/358 (40%), Gaps = 66/358 (18%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN+ +++G +++ ++DT + LTW QCQPC+ CY Q DP++N SY+ +
Sbjct: 64 TLNYIVTVEIG----GRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTIL 119
Query: 63 CYDASCKSPFHCFEGD----------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C ++C+S G+ C Y + YGD T+ ++ L
Sbjct: 120 CNSSTCQS-LQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNL------GTTH 172
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V N FGC +K +G+MGL S + Q + FS CL P +
Sbjct: 173 VSNFIFGCGRNNKGLFGGA----SGLMGLGKSDLSLVSQTSAIFEGVFSYCL--PTTAAD 226
Query: 173 SRLEFGDQIIAGKS---LNLPPNSFT------------------IKLNG---------QR 202
+ G I+ G S N P S+T I + G Q
Sbjct: 227 AS---GSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQS 283
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPS 261
G + D G+V+T + VY L AEF+ FS F+ TCFNL P+
Sbjct: 284 GILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILD---TCFNLNGYDEVDIPT 340
Query: 262 MTYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
+ F+G A+L V+ +F F D+ A + I+G Q N + +Y+
Sbjct: 341 IRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYN 398
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 143/357 (40%), Gaps = 64/357 (17%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
TY++ + IG P L +LDT + L WTQC PC+ C+ Q P+Y +Y + C
Sbjct: 91 TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150
Query: 66 ASC---KSPF-HCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C +SP+ C D C Y +YGD T V + +T TL S +V+ + FG
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL-----GSDTAVRGVAFG 205
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG- 178
C E + S +G++G+ S + QLG RFS C + + S L G
Sbjct: 206 CGTE--NLGSTDNS--SGLVGMGRGPLSLVSQLGV---TRFSYCFTPFNATAASPLFLGS 258
Query: 179 --------------------------------DQIIAGKSLNLP--PNSFTIKLNGQRGC 204
+ I G +L LP P F + G G
Sbjct: 259 SARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTL-LPIDPAVFRLTPMGDGGV 317
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPA-RFNSFPSM 262
I D G+ T +E + L L + G++ CF + P +
Sbjct: 318 IIDSGTTFTALEESAFVALARALASRVRL----PLASGAHLGLSLCFAAASPEAVEVPRL 373
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF GAD+ + E+ ++ + + G +G ++LG+ Q NT +YDL+
Sbjct: 374 VLHFDGADMELRRES-YVVEDRSAGVACLG--MVSARGMSVLGSMQQQNTHILYDLE 427
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 137/355 (38%), Gaps = 58/355 (16%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
+ Y++KLG G P +S + +LDT + + W C PC C + P S+S +Y L C
Sbjct: 121 SSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQPFEPSKS-STYNYLTCA 179
Query: 65 DASCKSPFHCFEGD----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C+ C + D C YGD E E+ S +T ++ V+N FGC
Sbjct: 180 SQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSV------GSQQVENFVFGC 233
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP-DKSFHSRLEFGD 179
S ++ + ++ G + SF+ Q L FS CL +F L G
Sbjct: 234 SNAARGLIQRTPSLV----GFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLLLGK 289
Query: 180 QIIAGKSLNLPP---NS-----FTIKLNG---------------------QRGCINDCGS 210
+ ++ + L P NS + + LNG RG I D G+
Sbjct: 290 EALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGT 349
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEK---LFTCRKCGVTCFNLPARFNSFPSMTYHF- 266
V+T + Y + F S + LF TC+N P+ FP +T HF
Sbjct: 350 VITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFD------TCYNRPSGDVEFPLITLHFD 403
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
DL + +N+ + D P G +L G Q + V+D+
Sbjct: 404 DNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDV 458
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 137/346 (39%), Gaps = 57/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ +LG+G P S ++DT + LTW QC PC SC+ Q P+Y+ R+ +Y +PC +
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSAS 193
Query: 67 SCK-------SPFHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C C Y +YGD + S DT + P N +
Sbjct: 194 QCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGSYP------NFYY 247
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP----------- 167
GC +++ AG++GL + S + QL + FS CL P
Sbjct: 248 GCGQDNEGLFGRS----AGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIGPY 303
Query: 168 -----------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
S + L F ++G S+ P + + I D G+V+T +
Sbjct: 304 TSGHYSYTPMASSSLDASLYF--VTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLP 361
Query: 217 CEVYAVLT----AEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLV 272
VY L+ A + S L TC + + +PA +F GA L
Sbjct: 362 TAVYTALSKAVAAAMVGVQSAPAFSILDTCFQGQASQLRVPAVAMAFAG------GATLK 415
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +NV I + DS AF P TI+G Q VYD+
Sbjct: 416 LATQNVLI-DVDDSTTCL---AFAPTDSTTIIGNTQQQTFSVVYDV 457
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 137/333 (41%), Gaps = 71/333 (21%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK---- 60
+ Y +++ +G P K ++DT + L W QC+PC CY Q+DPIY+ + ++ K
Sbjct: 1 SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60
Query: 61 ------LPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
LP + C S C YG YGD T+ +L+T TL S +
Sbjct: 61 TSSCQSLPA--SGCSSSAKT----CIYGYQYGDSSSTQGDFALETLTLRSSGGSSK-AFP 113
Query: 115 NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSF 171
N +FGC L S F AGI+GL S QLG + ++FS CLV D S
Sbjct: 114 NFQFGCGRLNSGSFGG-----AAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSK 168
Query: 172 HSRLEFGDQIIAGK-SLNLP--PNSFT----------IKLNGQR---------------- 202
S L FG G +++ P PNS I + G++
Sbjct: 169 TSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSK 228
Query: 203 -------------GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTC 249
G I D G+ LT+++ VY+ + + F S ++ + C
Sbjct: 229 KKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDL---C 285
Query: 250 FNLPARFN-SFPSMTYHFQGADLVVEPENVFIF 281
+++ N FP++T F+G +N F+
Sbjct: 286 YDVSKSKNFKFPALTLAFKGTKFSPPQKNYFVI 318
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 139/345 (40%), Gaps = 47/345 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G+G P + L + DT + LTWTQC+PC SCY+Q DPI++ SY + C +
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSS 199
Query: 67 SCKSPFHCF------EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C + F + C Y + YGD ++ S + T+ D V + FGC
Sbjct: 200 LC-TQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD-----IVHDFLFGC 253
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+++ + AG+MGL+ SF+ Q + FS CL S L FG
Sbjct: 254 GQDNEGLF----RGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPSSL-GHLTFGAS 308
Query: 181 IIAGKSLNLPP-------NSF------TIKLNGQR------------GCINDCGSVLTVI 215
+L P NSF I + G + G I D G+V+T +
Sbjct: 309 AATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRL 368
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLVVE 274
YA L + F + ++ + + R TC++ + S P + + F G V
Sbjct: 369 PPTAYAALRSAFRQFMMKYPVA--YGTRLLD-TCYDFSGYKEISVPRIDFEFAGGVKVEL 425
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
P ++ A TI G Q + VYD++
Sbjct: 426 PLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVE 470
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 136/349 (38%), Gaps = 52/349 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G+G P + L + DT + LTWTQC+PC SCY+Q D I++ SY + C +
Sbjct: 136 YFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSS 195
Query: 67 SC--------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C KS C YGI YGD + S + T+ D V + F
Sbjct: 196 LCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD-----IVDDFLF 250
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC +++ S AG++GL SF+ Q + FS CL S L FG
Sbjct: 251 GCGQDNEGLFSGS----AGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTSSSL-GHLTFG 305
Query: 179 DQIIAGKSLNLPP-------NSF------TIKLNGQR------------GCINDCGSVLT 213
+L P N+F I + G + G I D G+V+T
Sbjct: 306 ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVIT 365
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPA-RFNSFPSMTYHFQGAD 270
+ YA L + F + +EK + G+ TC++ + S P + + F G
Sbjct: 366 RLAPTAYAALRSAF-----RQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFAGGV 420
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
V P + A TI G Q + VYD++
Sbjct: 421 TVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVE 469
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 139/364 (38%), Gaps = 79/364 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y +KLG+G P K +LDT + L+W QCQPC C+ Q DP+Y+ K+YKKL C
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASV 184
Query: 67 SCKSPFHCFEGD---------CFYGITYGDV-----YETKEVDSLDTSTLLPPDEPSPVS 112
C D C Y +YGD Y ++++ +L +S LP
Sbjct: 185 ECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLP-------- 236
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
+GC +++ AGI+GL D S + QL FS CL +
Sbjct: 237 --QFTYGCGQDNQGLFGRA----AGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSS 290
Query: 173 SR-----------------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
L ++G+ L+L + +
Sbjct: 291 GGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVP------ 344
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEK-----LFTCRKCGVTCFNLPARFNS 258
+ D G+V+T + +YA L F+ S + L TC K + ++
Sbjct: 345 TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLK------SISA 398
Query: 259 FPSMTYHFQ-GADLVVEPENVFIFNHQD-SFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
P + FQ GADL + ++ I + + F G + T + I+G R Q Y
Sbjct: 399 VPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIA--IIGNRQQQTYNIAY 456
Query: 317 DLDT 320
D+ T
Sbjct: 457 DVST 460
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 152/341 (44%), Gaps = 43/341 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
Y+ ++G+G PVK + + DT + +TW QCQPC S CY+Q DPI++ +S SY L C
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 65 DASCK--SPFHCFEGDCFYGITYGD-VYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
CK +C C Y + YGD + T E L T TL + S+ N+ GC
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGE---LATETL---SFGNSNSIPNLPIGCG 261
Query: 122 LESKDF--------------VSIQKKIIAG-----IMGLNWDSTSFMVQLGRLVPDRFSC 162
+++ +S+ ++ A ++ L+ DS+S + + D +
Sbjct: 262 HDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSYMPSDSLTS 321
Query: 163 CLVQPDKSFHS--RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVY 220
LV+ D+ FHS ++ + GK+L + P F I +G G I D G++++ + +VY
Sbjct: 322 PLVKNDR-FHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVY 380
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADLVVEPENV 278
L F+ S + TC+N + N P++ + +G L + N
Sbjct: 381 ESLREAFVKLTSSLSPAPGISVFD---TCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNY 437
Query: 279 FIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
I + AF K +I+G+ Q + YDL
Sbjct: 438 LIMLDTAGTYCL---AFIKTKSSLSIIGSFQQQGIRVSYDL 475
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 129/320 (40%), Gaps = 60/320 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++GIG P + +LDT + L WTQC PC C +Q P ++ +Y+ L C +
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPA 149
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + ++ C++ C Y YGD T V + +T T + VS+ I FGC
Sbjct: 150 CNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTF--GTNETRVSLPGISFGCG---- 203
Query: 126 DFVSIQKKIIA---GIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI- 181
++ ++A G++G S S + QLG RFS CL SRL FG
Sbjct: 204 ---NLNAGLLANGSGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYAT 257
Query: 182 ----------------------------------IAGKSLNLPPNSFTIK-LNGQRGCIN 206
+ G L + P F I +G G I
Sbjct: 258 LNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTII 317
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL-FTCRKCGVTCFNLPA---RFNSFPSM 262
D G+ +T + Y + A F SQ + L T TCF P + + P +
Sbjct: 318 DSGTTITYLAEPAYDAVRAAFA---SQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374
Query: 263 TYHFQGADLVVEPENVFIFN 282
HF GAD + +N + +
Sbjct: 375 VLHFDGADWELPLQNYMLVD 394
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 141/362 (38%), Gaps = 70/362 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++GIG P + DT + + W QC PC CY Q DP+++ + S+ +PC
Sbjct: 123 YLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVPCNSGV 182
Query: 68 CKSPFH-------CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C++ G+C Y ++YGD T V +L+T TL E VQ + GC
Sbjct: 183 CRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTE-----VQGVAMGC 237
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--------------- 165
E++ + AG++GL W S + QLG FS CL
Sbjct: 238 GHENRGLFAEA----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVL 293
Query: 166 -----------------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
PD + +AG+ L L F + +G G + D
Sbjct: 294 GREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVMDT 353
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMT 263
G+ +T + E YA L F F + R GV TC++L + P++
Sbjct: 354 GTAVTRLPAEAYAALRGAFAGAFEEG------APRAPGVSLFDTCYDLSGYASVRVPTVA 407
Query: 264 YHF-------QGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
+F + A L + N+ + + ++ F + G +ILG Q +
Sbjct: 408 LYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVAS---GPSILGNIQQQGIEIT 464
Query: 316 YD 317
D
Sbjct: 465 VD 466
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 144/362 (39%), Gaps = 63/362 (17%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+ TLN+ + LG + ++DT + LTW QCQPC+SC++Q DP+++ S SY
Sbjct: 115 LRTLNYVATVGLGAAEAT----VVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAA 170
Query: 61 LPCYDASCKSPFHCFEGD-------------CFYGITYGDVYETKEVDSLDTSTLLPPDE 107
+PC +SC + C Y ++Y D ++ V + D L D
Sbjct: 171 VPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQD- 229
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP 167
++ FGC ++ +G+MGL S + Q FS CL
Sbjct: 230 -----IEGFVFGCGTSNQ---GAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMR 281
Query: 168 DKSFHSRLEFGDQIIAGKSLNLPPNSFT--------------------IKLNGQR----- 202
+ L GD A + N P +T I + GQ
Sbjct: 282 ESGSSGSLVLGDDSSAYR--NSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPW 339
Query: 203 ----GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFN 257
I D G+++T + VY + AEF+ +++ F+ TCFNL +
Sbjct: 340 FSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILD---TCFNLTGLKEV 396
Query: 258 SFPSMTYHFQGA-DLVVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFV 315
PS+ + F+G+ ++ V+ + V F D+ A +I+G Q N + +
Sbjct: 397 QVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVI 456
Query: 316 YD 317
+D
Sbjct: 457 FD 458
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 137/362 (37%), Gaps = 76/362 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPC-- 63
Y++ LGIG P L+DT + L+W QC+PC + CY Q DP+++ S SY +PC
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 150
Query: 64 ----------YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
Y C C YGI YG+ T V S +T TL P V V
Sbjct: 151 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP-----GVVV 205
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---------- 163
+ FGC D + G++GL S + Q FS C
Sbjct: 206 ADFGFGCG----DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGF 261
Query: 164 --LVQPDKSFHSRLEFGDQI---------------------IAGKSLNLPPNSFTIKLNG 200
L P S S G + G L +PP++F+
Sbjct: 262 LTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFS----- 316
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN- 257
G + D G+V+T + YA L + F S++ +L GV TC++ N
Sbjct: 317 -SGMVIDSGTVITGLPATAYAALRSAFRSAMSEY---RLLPPSNGGVLDTCYDFTGHANV 372
Query: 258 SFPSMTYHFQGADLV--VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
+ P+++ F G + P V + D F G G I+G +Q + +
Sbjct: 373 TVPTISLTFSGGATIDLAAPAGVLV----DGCLAFAGAGTDNAIG--IIGNVNQRTFEVL 426
Query: 316 YD 317
YD
Sbjct: 427 YD 428
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 142/354 (40%), Gaps = 59/354 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
+LN+ ++LG + + ++DT + L+W QCQPC CY Q DP++N SY+ +
Sbjct: 63 SLNYIVTVELG----GRKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVL 118
Query: 63 CYDASCKSPFHCFEGD----------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C +C+S G+ C Y + YGD T ++ L +
Sbjct: 119 CNSLTCRS-LQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNL------GNTT 171
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V N FGC +++ +G++GL S + Q+ + FS CL +
Sbjct: 172 VNNFIFGCGRKNQGLFGGA----SGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEAS 227
Query: 173 SRLEFGDQIIAGKSLNLPPNSFTIKLN--------------------------GQRGCIN 206
L G K N P S+T ++ G+ I
Sbjct: 228 GSLVMGGNSSVYK--NTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMII 285
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYH 265
D G+V++ + +Y L AEF+ FS + F +CFNL + P + +
Sbjct: 286 DSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILD---SCFNLSGYQEVKIPDIKMY 342
Query: 266 FQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
F+G A+L V+ VF D+ A P + + I+G Q N + +YD
Sbjct: 343 FEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYD 396
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 139/352 (39%), Gaps = 64/352 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +G+G P + +DT + ++W QC PC + CY Q +++ +Y+ + C
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAA 186
Query: 66 ASCKSPFHCFEG------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
A C G +C YG+ YGD T S DT TL + +V+ +FG
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASD----AVKGFQFG 242
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--------- 170
CS F G+MGL + S + Q + FS CL S
Sbjct: 243 CSHVESGF----SDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGG 298
Query: 171 --------------------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ +RL+ D + GK L L P+ F G + D G+
Sbjct: 299 GGVSGFVTTRMLRSRQIPTFYGARLQ--DIAVGGKQLGLSPSVFAA------GSVVDSGT 350
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA 269
++T + Y+ L++ F Q+ + R TCF+ + S P++ F G
Sbjct: 351 IITRLPPTAYSALSSAFKAGMKQY---RSAPARSILDTCFDFAGQTQISIPTVALVFSGG 407
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYDLDT 320
+ N ++ + +F A T G T I+G Q + +YD+ +
Sbjct: 408 AAIDLDPNGIMYGNCLAF------AATGDDGTTGIIGNVQQRTFEVLYDVGS 453
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 145/362 (40%), Gaps = 66/362 (18%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKK 60
TLN+ + LG G K+L ++DT + LTW QC+PC SCY Q DP+++ + ++
Sbjct: 177 TLNYVTTIALG-GGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAA 235
Query: 61 LPCYDASCK--------SPFHCF------EGDCFYGITYGDVYETKEVDSLDTSTLLPPD 106
+PC +C +P C E C+Y ++YGD ++ V + DT L
Sbjct: 236 VPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGL---- 291
Query: 107 EPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ 166
+ + FGC L ++ AG+MGL S + Q FS CL
Sbjct: 292 -GTTTKLDGFVFGCGLSNRGLFG----GTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPA 346
Query: 167 PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLN--------------------------- 199
S S L G G S + P ++T +
Sbjct: 347 TTTSTGS-LSLGP----GPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPG 401
Query: 200 -GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN- 257
G + D G+V+T + VY + AEF F ++ F+ C++L R
Sbjct: 402 FGAGNVLVDSGTVITRLAPSVYKAVRAEFARRF-EYPAAPGFSILDA---CYDLTGRDEV 457
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFV 315
+ P +T + GA + V+ + +D A P + +T I+G Q N + V
Sbjct: 458 NVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVV 517
Query: 316 YD 317
YD
Sbjct: 518 YD 519
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 146/356 (41%), Gaps = 60/356 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN Y++ +G+G K++ ++DT + LTW QC+PC SCY Q PI+ + SY+ +
Sbjct: 62 TLN--YIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117
Query: 63 CYDASCKS-------PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C ++C+S C + C Y + YGD T ++ + VS
Sbjct: 118 CNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF------GGVS 171
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V + FGC +K ++G+MGL S + Q FS CL +
Sbjct: 172 VSDFVFGCGRNNKGLFGG----VSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSS 227
Query: 173 SRLEFGDQIIAGKSLNLPPNSFTIKLN----------------------------GQRGC 204
L G++ K+ N P ++T L+ G G
Sbjct: 228 GSLVMGNESSVFKNAN--PITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGI 285
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMT 263
+ D G+V+T + VY L AEF+ F+ F+ TCFNL S P+++
Sbjct: 286 LIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILD---TCFNLTGYDEVSIPTIS 342
Query: 264 YHFQG-ADLVVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFVYD 317
F+G A L V+ F +D+ A + I+G Q N + +YD
Sbjct: 343 LRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYD 398
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 141/348 (40%), Gaps = 54/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P K L+ +LDT + + W QC+PC CY Q D I++ KS+ +PCY
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPL 189
Query: 68 CK---SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ SP + + C Y ++YGD T S +T T +V + GC +
Sbjct: 190 CRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF------RRAAVPRVAIGCGHD 243
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQI 181
++ FV + SF Q G ++FS CL S S + FGD
Sbjct: 244 NEGLFVGAAGLLGL-----GRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFGDSA 298
Query: 182 IA-----------------------GKSLNLPP------NSFTIKLNGQRGCINDCGSVL 212
++ G S+ P + F + G G I D G+ +
Sbjct: 299 VSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSV 358
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADL 271
T + Y L F S F+ TC++L P++ HF+GAD+
Sbjct: 359 TRLTRPAYVSLRDAFRVGASHLKRAPEFSLFD---TCYDLSGLSEVKVPTVVLHFRGADV 415
Query: 272 VVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N + ++ SF F F + G +I+G Q + V+DL
Sbjct: 416 SLPAANYLVPVDNSGSFCFAFAGTMS---GLSIIGNIQQQGFRVVFDL 460
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 137/362 (37%), Gaps = 76/362 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPC-- 63
Y++ LGIG P L+DT + L+W QC+PC + CY Q DP+++ S SY +PC
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 230
Query: 64 ----------YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
Y C C YGI YG+ T V S +T TL P V V
Sbjct: 231 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP-----GVVV 285
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV-------- 165
+ FGC D + G++GL S + Q FS CL
Sbjct: 286 ADFGFGCG----DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGF 341
Query: 166 ----QPDKSFHSRLEFGDQI---------------------IAGKSLNLPPNSFTIKLNG 200
P S S G + G L +PP++F+
Sbjct: 342 LTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFS----- 396
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN- 257
G + D G+V+T + YA L + F S++ +L GV TC++ N
Sbjct: 397 -SGMVIDSGTVITGLPATAYAALRSAFRSAMSEY---RLLPPSNGGVLDTCYDFTGHANV 452
Query: 258 SFPSMTYHFQGADLV--VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
+ P+++ F G + P V + D F G G I+G +Q + +
Sbjct: 453 TVPTISLTFSGGATIDLAAPAGVLV----DGCLAFAGAGTDNAIG--IIGNVNQRTFEVL 506
Query: 316 YD 317
YD
Sbjct: 507 YD 508
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 137/357 (38%), Gaps = 60/357 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + DT + L WTQC PC + C+EQ P+YN S ++ LPC +
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 171
Query: 67 ------SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
+ C Y TYG + T V +T T V + FGC
Sbjct: 172 LSMCAGALAGAAPPPGCACMYNQTYGTGW-TAGVQGSETFT-FGSSAADQARVPGVAFGC 229
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFGD 179
S S S AG++GL S S + QLG RFS CL D + S L G
Sbjct: 230 SNAS----SSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLLGP 282
Query: 180 QI---------------------------------IAGKSLNLPPNSFTIKLNGQRGCIN 206
+ K+L + P +F++K +G G I
Sbjct: 283 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLII 342
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNS----FPS 261
D G+ +T + Y + A + + + G+ CF LPA ++ PS
Sbjct: 343 DSGTTITSLANAAYQQVRAAVKSLVTT--LPTVDGSDSTGLDLCFALPAPTSAPPAVLPS 400
Query: 262 MTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
MT HF GAD+V+ ++ I S + + G Q N +YD+
Sbjct: 401 MTLHFDGADMVLPADSYMI---SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDV 454
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 141/345 (40%), Gaps = 57/345 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P KS L+DT + ++W QC+PC C+ Q DP+++ S +Y C A+
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAA 192
Query: 68 C----KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + C C Y +TYGD T S DT L +V+ +FGCS
Sbjct: 193 CAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL------GSNAVRKFQFGCSNV 246
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFG---- 178
F G+MGL + S + Q FS CL P S S L G
Sbjct: 247 ESGF----NDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCL--PATSSSSGFLTLGAGTS 300
Query: 179 ----------DQI------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
Q+ + G+ L++P + F+ G I D G+VLT +
Sbjct: 301 GFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSA------GTIMDSGTVLTRLP 354
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGADLVV 273
Y+ L++ F + +++ + G+ TCF+ + + S P++ F G +V
Sbjct: 355 PTAYSALSSAF-----KAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVFSGGAVVD 409
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + +S A + I+G Q + +YD+
Sbjct: 410 IASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDV 454
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 88/343 (25%), Positives = 137/343 (39%), Gaps = 56/343 (16%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++KL +G P + ++DT + +TWTQC PC CY+QN PI++ ++K+ C+
Sbjct: 377 NSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEKRCH 436
Query: 65 DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
D SC Y + Y D TK + DT T+ V + I GC +
Sbjct: 437 DHSCP-----------YEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETI-IGCGRNN 484
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
F + G +GLNW S + Q+G P S C S++ FG I G
Sbjct: 485 SWF----RPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGT---SKINFGTNAIVG 537
Query: 185 KS---------LNLPPNSFTIKLNG------------------QRGCINDCGSVLTVIEC 217
P + + L+ + + D G+ LT
Sbjct: 538 GGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTYFP- 596
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCG--VTCFNLPARFNSFPSMTYHFQ-GADLVVE 274
E Y L + + +H + + G + C+ FP +T HF GADLV++
Sbjct: 597 ESYCNLVRQAV----EHVVPAVPAADPTGNDLLCY-YSNTTEIFPVITMHFSGGADLVLD 651
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
N+F+ ++ F P + + I G R Q+N YD
Sbjct: 652 KYNMFMESYSGGLFCLAIICNNPTQ-EAIFGNRAQNNFLVGYD 693
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 136/324 (41%), Gaps = 40/324 (12%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
+ Y++KL IG P + +LDT + L WTQC PC CY+Q PI++ ++K +
Sbjct: 63 YEYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFK-----E 117
Query: 66 ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C +P H C Y + Y D T+ + +T T + P + GCS +
Sbjct: 118 TRCNTPDH----SCPYKLVYDDKSYTQGTLATETVT-IHSTSGVPFVMPETIIGCSRNNS 172
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR---FSCCLVQPDKSFHSRLEFGDQII 182
S + +GI+GL+ S S + Q+G P + + K L D +
Sbjct: 173 G--SGFRPSSSGIVGLSRGSLSLISQMGGAYPGDGVVSTTMFAKTAKRGQYYLNL-DAVS 229
Query: 183 AGKSLNLPPNSFTIKLNGQRGCINDCGSVLT--------VIECEVYAVLTAEFIDYFSQH 234
G + + LNG + D G+ LT ++ V V+TA+ + S++
Sbjct: 230 VGDTRIETVGTPFHALNGN--IVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRN 287
Query: 235 DIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGP 293
D+ ++ FP +T HF GADLV++ N+++ ++ F
Sbjct: 288 DMLCYYS------------NTIEIFPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAII 335
Query: 294 AFTPRKGKTILGARHQHNTQFVYD 317
P + I G R Q+N YD
Sbjct: 336 CNNPTQ-VAIFGNRAQNNFLVGYD 358
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 141/348 (40%), Gaps = 54/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + ++ +LDT + + W QC PC+ CY Q DP+++ ++Y +PC
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPL 188
Query: 68 CK---SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+ SP C + C Y ++YGD T S +T T V + GC
Sbjct: 189 CRRLDSP-GCNNKNKVCQYQVSYGDGSFTFGDFSTETLTF------RRTRVTRVALGCGH 241
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQI 181
+++ ++ L SF VQ GR +FS CLV S S + FGD
Sbjct: 242 DNEGLFIGAAGLLG----LGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGDSA 297
Query: 182 IA-----------------------GKSLNLPP------NSFTIKLNGQRGCINDCGSVL 212
++ G S+ P + F + G G I D G+ +
Sbjct: 298 VSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSV 357
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADL 271
T + Y L F S F+ TCF+L P++ HF+GAD+
Sbjct: 358 TRLTRPAYIALRDAFRVGASHLKRAAEFSLFD---TCFDLSGLTEVKVPTVVLHFRGADV 414
Query: 272 VVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I ++ SF F F + G +I+G Q + +DL
Sbjct: 415 SLPATNYLIPVDNSGSFCFAFAGTMS---GLSIIGNIQQQGFRVSFDL 459
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 142/360 (39%), Gaps = 60/360 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + + +LDT + LTWTQC PC SC+ Q+ P +N ++ LPC
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRI 170
Query: 68 CK--SPFHCFE-----GDCFYGITYGD-VYETKEVDSLDTSTLLPPDEP-SPVSVQNIRF 118
C+ + C E G C Y Y D T +DS DT + D SV ++ F
Sbjct: 171 CRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDS-DTFSFASADHAIGGASVPDLTF 229
Query: 119 GCSL-ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS------- 170
GC L + FVS + GI G + + S QL D FS C S
Sbjct: 230 GCGLFNNGIFVSNET----GIAGFSRGALSMPAQLKV---DNFSYCFTAITGSEPSPVFL 282
Query: 171 -------------------------FHSR------LEFGDQIIAGKSLNLPPNSFTIKLN 199
+HS + + L +P + F +K +
Sbjct: 283 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKED 342
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-S 258
G G I D G+ +T++ VY ++ F+ +Q + + CF++P
Sbjct: 343 GTGGTIVDSGTGMTMLPEAVYNLVCDAFV---AQTKLTVHNSTSSLSQLCFSVPPGAKPD 399
Query: 259 FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P++ HF+GA L + EN + A + +++G Q N +YDL
Sbjct: 400 VPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDL 459
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 129/318 (40%), Gaps = 56/318 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++GIG P + +LDT + L WTQC PC C +Q P ++ +Y+ L C +
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPA 149
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLES 124
C + ++ C++ C Y YGD T V + +T T + VS+ I FGC +L +
Sbjct: 150 CNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTF--GTNETRVSLPGISFGCGNLNA 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
+ +G++G S S + QLG RFS CL SRL FG
Sbjct: 208 GSLAN-----GSGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATLN 259
Query: 182 --------------------------------IAGKSLNLPPNSFTIK-LNGQRGCINDC 208
+ G L + P F I +G G I D
Sbjct: 260 STNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDS 319
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKL-FTCRKCGVTCFNLPA---RFNSFPSMTY 264
G+ +T + Y + A F SQ + L T TCF P + + P +
Sbjct: 320 GTTITYLAEPAYDAVRAAFA---SQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVL 376
Query: 265 HFQGADLVVEPENVFIFN 282
HF GAD + +N + +
Sbjct: 377 HFDGADWELPLQNYMLVD 394
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 142/360 (39%), Gaps = 60/360 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + + +LDT + LTWTQC PC SC+ Q+ P +N ++ LPC
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRI 170
Query: 68 CK--SPFHCFE-----GDCFYGITYGD-VYETKEVDSLDTSTLLPPDEP-SPVSVQNIRF 118
C+ + C E G C Y Y D T +DS DT + D SV ++ F
Sbjct: 171 CRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDS-DTFSFASADHAIGGASVPDLTF 229
Query: 119 GCSL-ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS------- 170
GC L + FVS + GI G + + S QL D FS C S
Sbjct: 230 GCGLFNNGIFVSNET----GIAGFSRGALSMPAQLKV---DNFSYCFTAITGSEPSPVFL 282
Query: 171 -------------------------FHSR------LEFGDQIIAGKSLNLPPNSFTIKLN 199
+HS + + L +P + F +K +
Sbjct: 283 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKED 342
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-S 258
G G I D G+ +T++ VY ++ F+ +Q + + CF++P
Sbjct: 343 GTGGTIVDSGTGMTMLPEAVYNLVCDAFV---AQTKLTVHNSTSSLSQLCFSVPPGAKPD 399
Query: 259 FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P++ HF+GA L + EN + A + +++G Q N +YDL
Sbjct: 400 VPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDL 459
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 142/360 (39%), Gaps = 60/360 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + + +LDT + LTWTQC PC SC+ Q+ P +N ++ LPC
Sbjct: 85 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRI 144
Query: 68 CK--SPFHCFE-----GDCFYGITYGD-VYETKEVDSLDTSTLLPPDEP-SPVSVQNIRF 118
C+ + C E G C Y Y D T +DS DT + D SV ++ F
Sbjct: 145 CRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDS-DTFSFASADHAIGGASVPDLTF 203
Query: 119 GCSL-ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS------- 170
GC L + FVS + GI G + + S QL D FS C S
Sbjct: 204 GCGLFNNGIFVSNET----GIAGFSRGALSMPAQLKV---DNFSYCFTAITGSEPSPVFL 256
Query: 171 -------------------------FHSR------LEFGDQIIAGKSLNLPPNSFTIKLN 199
+HS + + L +P + F +K +
Sbjct: 257 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKED 316
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-S 258
G G I D G+ +T++ VY ++ F+ +Q + + CF++P
Sbjct: 317 GTGGTIVDSGTGMTMLPEAVYNLVCDAFV---AQTKLTVHNSTSSLSQLCFSVPPGAKPD 373
Query: 259 FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P++ HF+GA L + EN + A + +++G Q N +YDL
Sbjct: 374 VPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDL 433
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 142/363 (39%), Gaps = 64/363 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + + LLDT + L WTQC PC SC Q DP++ + SY + C
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQL 162
Query: 68 CKSPFH--CFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C H C D C Y YGD T V + + T +SV + FGC +
Sbjct: 163 CNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFA-SSSGEKLSVP-LGFGCG--T 218
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG---DQI 181
+ S+ +GI+G D S + QL RFS CL + S L FG D +
Sbjct: 219 MNVGSLNNG--SGIVGFGRDPLSLVSQLSI---RRFSYCLTPYTSTRKSTLMFGSLSDGV 273
Query: 182 IAG----------------------------------KSLNLPPNSFTIKLNGQRGCIND 207
G + L +P ++F ++ +G G I D
Sbjct: 274 FEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVD 333
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN---------- 257
G+ LT+ AVLT + +Q + + CF P
Sbjct: 334 SGTALTLFPA---AVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVV 390
Query: 258 SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
S P M +HFQGADL + P ++ + A + G TI G Q + + +YD
Sbjct: 391 SVPRMAFHFQGADLEL-PRRNYVLDDPRRGSLCILLADSGDSGATI-GNFVQQDMRVLYD 448
Query: 318 LDT 320
L+
Sbjct: 449 LEA 451
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/344 (26%), Positives = 141/344 (40%), Gaps = 48/344 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P ++ ++DT + + W QCQ C++CY P+++ K+YK LPC +
Sbjct: 88 YLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTT 147
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
CKS C + C + + Y D ++ ++T TL ++P V GC
Sbjct: 148 CKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPF-VHFPRTVIGCIR 206
Query: 123 ESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD-Q 180
+ F SI GI+GL S + QL + +FS CL P S+L+FGD
Sbjct: 207 NTNVSFDSI------GIVGLGGGPVSLVPQLSSSISKKFSYCLA-PISDRSSKLKFGDAA 259
Query: 181 IIAG--------------KSLNLPPNSFTIKLN------------GQRGCINDCGSVLTV 214
+++G K L +F++ N G+ I D G+ TV
Sbjct: 260 MVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSGTTFTV 319
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
+ +VY+ L + D +E+ K C+ P +T HF GAD+ +
Sbjct: 320 LPDDVYSKLESAVADVVK---LERAEDPLKQFSLCYKSTYDKVDVPVITAHFSGADVKLN 376
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
N FI AF + I G Q N YDL
Sbjct: 377 ALNTFIVASHRVVCL----AFLSSQSGAIFGNLAQQNFLVGYDL 416
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 137/349 (39%), Gaps = 54/349 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y +K+G+G P + ++DT + L+W QC+PC C+ Q DP+++ + K+YK L C +
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 72
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S + C Y +YGD + S D TL P ++
Sbjct: 73 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ-----TLPGFV 127
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
+GC +S+ AGI+GL + S + Q+ FS CL P + L
Sbjct: 128 YGCGQDSEGLFGRA----AGILGLGRNKLSMLGQVSSKFGYAFSYCL--PTRGGGGFLSI 181
Query: 178 GDQIIAGKSLNLPPNS------------FTIKLNGQRG-----------CINDCGSVLTV 214
G +AG + P + T G R I D G+V+T
Sbjct: 182 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 241
Query: 215 IECEVYAVLTAEFIDYF-SQHDIEKLFTCRKCGVTCFNLPAR-FNSFPSMTYHFQ-GADL 271
+ VY F+ S++ F+ TCF + S P + FQ GADL
Sbjct: 242 LPMSVYTPFQQAFVKIMSSKYARAPGFSILD---TCFKGNLKDMQSVPEVRLIFQGGADL 298
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ P NV + D AF G I+G Q + +D+ T
Sbjct: 299 NLRPVNVLL--QVDEGLTCL--AFAGNNGVAIIGNHQQQTFKVAHDIST 343
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 148/353 (41%), Gaps = 57/353 (16%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
++ + IGDP L+DT + LTW QC PCK CY Q P ++ +Y+ +A
Sbjct: 87 AFLANISIGDPPVPQLLLIDTGSDLTWIQCLPCK-CYPQTIPFFHPSRSSTYR-----NA 140
Query: 67 SCKSPFHCF--------EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
SC+S H G+C Y + Y D T+ + + + T DE +S NI F
Sbjct: 141 SCESAPHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDE-GLISKPNIVF 199
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPD------- 168
GC ++ F +G++GL + S + R +FS C L+ P
Sbjct: 200 GCGQDNSGFTQ-----YSGVLGLGPGTFSIVT---RNFGSKFSYCFGSLIDPTYPHNFLI 251
Query: 169 --------------KSFHSRLEFGDQIIA--GKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
+ F R Q I+ K L++ P F + + G + D G
Sbjct: 252 LGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQ-RYRSKGGTVIDTGCSP 310
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYHFQ-GA 269
T++ E Y L+ E ID+ + ++ + C+ NL FP +T+HF GA
Sbjct: 311 TILAREAYETLSEE-IDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGA 369
Query: 270 DLVVEPENVFIFNHQ-DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
+L ++ E++F+ + DSF +++GA Q N Y+L T
Sbjct: 370 ELALDVESLFVSSESGDSFCLAM--TMNTFDDMSVIGAMAQQNYNVGYNLRTM 420
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 138/354 (38%), Gaps = 50/354 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P + ++DT + L W QC PC C+EQ P+++ + SY+ + C D
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQR 208
Query: 68 C------KSPFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C ++P C E C Y YGD T +L++ T+ + V + F
Sbjct: 209 CGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVF 268
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ ++ L SF QL + FS CLV+ S++ FG
Sbjct: 269 GCGHRNRGLFHGAAGLLG----LGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGSKVVFG 324
Query: 179 DQ--------------------------------IIAGKSLNLPPNSFTIKLNGQRGCIN 206
+ ++ G LN+ +++ + +G G I
Sbjct: 325 EDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTII 384
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYH 265
D G+ L+ Y V+ F+D S+ + L C+N+ P ++
Sbjct: 385 DSGTTLSYFVEPAYQVIRQAFVDLMSR--LYPLIPDFPVLNPCYNVSGVERPEVPELSLL 442
Query: 266 F-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GA EN F+ D TPR G +I+G Q N VYDL
Sbjct: 443 FADGAVWDFPAENYFVRLDPDG-IMCLAVRGTPRTGMSIIGNFQQQNFHVVYDL 495
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 88/348 (25%), Positives = 136/348 (39%), Gaps = 54/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + ++ +LDT + + W QC PCK CY Q+DP+++ R +S+ + C
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPL 185
Query: 68 CK---SPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C SP + + C Y ++YGD T S +T T V + GC +
Sbjct: 186 CHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTF------RRTRVARVALGCGHD 239
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQI 181
++ FV + SF Q GR +FS CLV S S + FGD
Sbjct: 240 NEGLFVGAAGLLGL-----GRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSA 294
Query: 182 IAGKSLNLPPNS-----------------------------FTIKLNGQRGCINDCGSVL 212
++ + P S F + G G I D G+ +
Sbjct: 295 VSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSV 354
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADL 271
T + Y F S F+ TCF+L + P++ HF+GAD+
Sbjct: 355 TRLTRPAYIAFRDAFRAGASNLKRAPQFSLFD---TCFDLSGKTEVKVPTVVLHFRGADV 411
Query: 272 VVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I + +F F G +I+G Q + VYDL
Sbjct: 412 SLPASNYLIPVDTSGNFCLAFAGTM---GGLSIIGNIQQQGFRVVYDL 456
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 141/358 (39%), Gaps = 61/358 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPC--- 63
Y +K+G+G P K ++DT + L+W QCQPC C+ Q DPI+ + K+YK LPC
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSS 172
Query: 64 -----YDASCKSPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
++ +P G C Y +YGD + S D TL P + PS
Sbjct: 173 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPS----SGFV 228
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
+GC +++ +GI+GL D S + QL + + FS CL + +S
Sbjct: 229 YGCGQDNQGLFGRS----SGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLS 284
Query: 178 GDQIIAGKSLNLPPNSFTIKLNGQR----------------------------GCINDCG 209
G I SL P FT + Q+ I D G
Sbjct: 285 GFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSG 344
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEK-----LFTCRKCGVTCFNLPARFNSFPSMTY 264
+V+T + VY L F+ S+ + L TC K V ++ P +
Sbjct: 345 TVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSV------KEMSTVPEIQI 398
Query: 265 HFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
F+ GA L ++ N + + + + P +I+G Q + YD+ F
Sbjct: 399 IFRGGAGLELKAHNSLVEIEKGTTCLAIAASSNPI---SIIGNYQQQTFKVAYDVANF 453
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 144/348 (41%), Gaps = 49/348 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
++ + IG+P L+DT + LTW C PCK CY Q P ++ +Y+ C A
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCK-CYPQTIPFFHPSRSSTYRNASCVSAP 136
Query: 68 CKSPFHCFE----GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
P F G+C Y + Y D T+ + + + T D+ +S QNI FGC +
Sbjct: 137 HAMP-QIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDD-GLISKQNIVFGCGQD 194
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ----------------- 166
+ F +G++GL + S + R +FS C
Sbjct: 195 NSGFTK-----YSGVLGLGPGTFSIVT---RNFGSKFSYCFGSLTNPTYPHNILILGNGA 246
Query: 167 -------PDKSFHSRLEFGDQIIA--GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
P + F R Q I+ K L++ P +F + Q G + D G T++
Sbjct: 247 KIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQ-RYRSQGGTVIDTGCSPTILAR 305
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYHFQ-GADLVVE 274
E Y L+ E ID+ + ++ + C+ NL FP +T+HF GA+L ++
Sbjct: 306 EAYETLSEE-IDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALD 364
Query: 275 PENVFIFNHQ-DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
E++F+ + DSF +++GA Q N Y+L T
Sbjct: 365 VESLFVSSESGDSFCLAM--TMNTFDDMSVIGAMAQQNYNVGYNLRTM 410
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 136/365 (37%), Gaps = 67/365 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +W +LDT + L+W QC PC C+EQN P YN SY+ + CYD
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPR 229
Query: 68 CK---SP---FHCFEGD--CFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNI 116
C+ SP HC + C Y Y D T +L+T T+ P + V ++
Sbjct: 230 CQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDV 289
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSR 174
FGC +K F ++ G SF QL + FS CL + S S+
Sbjct: 290 MFGCGHWNKGFFHGAGGLLGLGRG----PLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSK 345
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ G+ L++P ++ G
Sbjct: 346 LIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGV 405
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT------CFNLPAR 255
G I D GS LT Y V+ F EK ++ C+N+
Sbjct: 406 GGTIIDSGSTLTFFPDSAYDVIKEAF---------EKKIKLQQIAADDFIMSPCYNVSGA 456
Query: 256 FN-SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQ 313
P HF + P + + ++ TP TI+G Q N
Sbjct: 457 MQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFH 516
Query: 314 FVYDL 318
+YD+
Sbjct: 517 ILYDV 521
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 136/331 (41%), Gaps = 66/331 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P + +DT + ++W QC+PC C+ + D +++ + +Y C A+
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAA 190
Query: 68 C------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C + C C Y ++Y D T S DT TL +++ +FGCS
Sbjct: 191 CVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTL------GSNAIKGFQFGCS 244
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKS------FHSR 174
S Q G+MGL D+ S + Q FS CL P S SR
Sbjct: 245 QSESGGFSDQTD---GLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPGSSGFLTLGAASR 301
Query: 175 LEF-------GDQI------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
F QI + G+ LN+P + F+ G + D G+V+T +
Sbjct: 302 SGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSA------GSVMDSGTVITRL 355
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGADLV 272
Y+ L++ F + ++K + G+ TCF+ + + S PS+ F G +V
Sbjct: 356 PPTAYSALSSAF-----KAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVV 410
Query: 273 --------VEPEN---VFIFNHQDSFFFFFG 292
+E +N F N DS F G
Sbjct: 411 NLDFNGIMLELDNWCLAFAANSDDSSLGFIG 441
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 136/360 (37%), Gaps = 65/360 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + DT + L WTQC PC + C+EQ P+YN S ++ LPC +
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 173
Query: 67 ------SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
+ C Y TYG + T V +T T V + FGC
Sbjct: 174 LSMCAGALAGAAPPPGCACMYYQTYGTGW-TAGVQGSETFT-FGSSAADQARVPGVAFGC 231
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFGD 179
S S S AG++GL S S + QLG RFS CL D + S L G
Sbjct: 232 SNAS----SSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLLGP 284
Query: 180 QI---------------------------------IAGKSLNLPPNSFTIKLNGQRGCIN 206
+ K+L + P +F++K +G G I
Sbjct: 285 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLII 344
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLPARFNS---- 258
D G+ +T + Y + A + L T T CF LPA ++
Sbjct: 345 DSGTTITSLANAAYQQVRAAVKSQL----VTTLPTVDGSDSTGLDLCFALPAPTSAPPAV 400
Query: 259 FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
PSMT HF GAD+V+ ++ I S + + G Q N +YD+
Sbjct: 401 LPSMTLHFDGADMVLPADSYMI---SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDV 457
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 142/353 (40%), Gaps = 60/353 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + ++DT + LTWTQC+PC CY+Q P ++ ++ +Y+ D+S
Sbjct: 92 YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYR-----DSS 146
Query: 68 CKSPF--------HCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C + F C G C + +Y D T +++T T + PVS F
Sbjct: 147 CGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLT-VASTAGKPVSFPGFAF 205
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLE 176
GC S I + +GI+GL S + QL + RFS CL V D S SR+
Sbjct: 206 GCVHRSG---GIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRIN 262
Query: 177 FGDQ-IIAGKSLNLPP---------------------------NSFTIKLNGQRG-CIND 207
FG I++G P F+ K + G I D
Sbjct: 263 FGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVD 322
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT--CFNLPARFNSFPSMTYH 265
G+ T + E Y L H I+ G++ C+N P +T H
Sbjct: 323 SGTTYTYLPLEFYVKLEESV-----AHSIKGKRVRDPNGISSLCYNTTVDQIDAPIITAH 377
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F+ A++ ++P N F+ +D F P ILG Q N +DL
Sbjct: 378 FKDANVELQPWNTFLRMQEDLVCF----TVLPTSDIGILGNLAQVNFLVGFDL 426
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 137/351 (39%), Gaps = 61/351 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P K ++DT + LTW QC PC+ SC+ Q+ P+++ ++ SY + C
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSSP 176
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C + C Y +YGD + S DT + SV N +
Sbjct: 177 QCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF------GANSVPNFYY 230
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC +++ AG+MGL + S + QL + FS CL P S L G
Sbjct: 231 GCGQDNEGLFGRS----AGLMGLARNKLSLLYQLAPTLGYSFSYCL--PSTSSSGYLSIG 284
Query: 179 D---------------------------QIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+AGK L + + +T I D G+V
Sbjct: 285 SYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYT-----SLPTIIDSGTV 339
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQ-GA 269
+T + VY L+ + K TCF A + + P+++ F GA
Sbjct: 340 ITRLPTSVYTALSKAVAA--AMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGA 397
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
L + N+ + + AF P + I+G Q VYD+ +
Sbjct: 398 TLKLSAGNLLVDVDGATTCL----AFAPARSAAIIGNTQQQTFSVVYDVKS 444
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/353 (23%), Positives = 140/353 (39%), Gaps = 67/353 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P K ++DT + LTW QC PC+ SC+ Q+ P+++ ++ SY + C
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDV-----YETKEVDSLDTSTLLPPDEPSPVSV 113
C +P C D C Y +YGD Y +K+ S ++ SV
Sbjct: 197 QCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFGSN-----------SV 245
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------- 164
N +GC +++ AG+MGL + S + QL + FS CL
Sbjct: 246 PNFYYGCGQDNEGLFGRS----AGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYL 301
Query: 165 ----VQPDKSFHSRL------------EFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
P + ++ + + +AGK L + + ++ I D
Sbjct: 302 SIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYS-----SLPTIIDS 356
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ- 267
G+V+T + VY L+ ++ TCF A P+++ F
Sbjct: 357 GTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILD---TCFVGQASSLRVPAVSMAFSG 413
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
GA L + +N+ + + AF P + I+G Q VYD+ +
Sbjct: 414 GAALKLSAQNLLVDVDSSTTCL----AFAPARSAAIIGNTQQQTFSVVYDVKS 462
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 145/355 (40%), Gaps = 60/355 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN Y++ +G+G ++ ++DT + LTW QC+PC SCY Q PI+ + SY+ +
Sbjct: 62 TLN--YIVTMGLGS--TNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117
Query: 63 CYDASCKSPFHCFEGD----------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C ++C+S G+ C Y + YGD T ++ + VS
Sbjct: 118 CNSSTCQS-LQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSF------GGVS 170
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V + FGC +K ++G+MGL S + Q FS CL +
Sbjct: 171 VSDFVFGCGRNNKGLFG----GVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGAS 226
Query: 173 SRLEFGDQIIAGKSLNLPPNSFTIKLN---------------------------GQRGCI 205
L G++ K N+ P ++T L G G +
Sbjct: 227 GSLVMGNESSVFK--NVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVL 284
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+V+T + VY L A F+ F+ F+ TCFNL S P+++
Sbjct: 285 IDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILD---TCFNLTGYDEVSIPTISM 341
Query: 265 HFQG-ADLVVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFVYD 317
HF+G A+L V+ F +D+ A + I+G Q N + +YD
Sbjct: 342 HFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYD 396
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 136/344 (39%), Gaps = 43/344 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++ +G P + ++ ++DT + + W QC PC +CY Q+D I++ +Y L C
Sbjct: 58 YFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQ 117
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C C Y + YGD T D +L V + I GC +++
Sbjct: 118 CLNLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNE 177
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFGDQI-- 181
+ ++ L SF Q+ RFS CL + D + S L FG+
Sbjct: 178 GYFVGAAGLLG----LGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVP 233
Query: 182 ---------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+ G L +P ++F + G G I D G+ +T
Sbjct: 234 PAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTR 293
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVV 273
++ YA L F S F+ TC++L + P++T HFQG +
Sbjct: 294 LQNAAYASLRDAFRAGTSDLAPTAGFSLFD---TCYDLSGLASVDVPTVTLHFQGGTDLK 350
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P + ++ +S F AF G +I+G Q + +YD
Sbjct: 351 LPASNYLIPVDNSNTFCL--AFAGTTGPSIIGNIQQQGFRVIYD 392
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 84/174 (48%), Gaps = 12/174 (6%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P ++DT + L WTQC PC C +Q P ++ + +Y+ LPC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSR 148
Query: 68 CK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLES 124
C S CF+ C Y YGD T V + +T T + + V NI FGC SL +
Sbjct: 149 CASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANS-TKVRATNIAFGCGSLNA 207
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
D + +G++G S + QLG P RFS CL + SRL FG
Sbjct: 208 GDLAN-----SSGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFG 253
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 147/343 (42%), Gaps = 36/343 (10%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++ L IG P + DT + L W QC PC++C+ Q+ P++ ++K C
Sbjct: 89 NGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCD 148
Query: 65 DASCK----SPFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C S C + G C Y +YGD T V +T + + VS + FG
Sbjct: 149 SQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFG 208
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
C + + +F + G++GL S + QLG + +FS CL+ + S+L+FG
Sbjct: 209 CGVYN-NFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPFSSNSTSKLKFGS 267
Query: 180 QIIAGKS--LNLP-------PNSFTIKLN----GQR---------GCINDCGSVLTVIEC 217
+ I + ++ P P+ + + L GQ+ I D G+VLT +E
Sbjct: 268 EAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVLTYLEQ 327
Query: 218 EVYAVLTAEFIDYFSQHDIEKL-FTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPE 276
Y A + S + L F + C P R + P + + F GA + ++P+
Sbjct: 328 TFYNNFVASLQEVLSVESAQDLPFPFKFC------FPYRDMTIPVIAFQFTGASVALQPK 381
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
N+ I QD + G +I G Q + Q VYDL+
Sbjct: 382 NLLI-KLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLE 423
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 147/349 (42%), Gaps = 50/349 (14%)
Query: 5 NH-TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
NH Y+++ IG P + DT + L W QC PC++C+ Q+ P++ ++ L C
Sbjct: 86 NHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSC 145
Query: 64 YDASCKSP--FHC-FEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C S ++C G+ C Y TYGD TK V ++ P ++ FG
Sbjct: 146 DSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTI----FG 201
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG- 178
C + DF+ + GI+GL S + QLG + +FS CL+ + +L+FG
Sbjct: 202 CG-SNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGN 260
Query: 179 DQIIAGKSL-NLP-------PNSFTIKLNG----------------QRGCINDCGSVLTV 214
D I G + + P P+ + + L G I D G+VLT
Sbjct: 261 DTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTY 320
Query: 215 IECEVY----AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA 269
+E Y +L ++ DI F F P + N +FP + + F GA
Sbjct: 321 LEVNFYHNFVTLLREALGISETKDDIPYPFD--------FCFPNQANITFPKIVFQFTGA 372
Query: 270 DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ + P+N+F F+ + P F KG ++ G Q + Q YD
Sbjct: 373 KVFLSPKNLFFRFDDLNMICLAVLPDFY-AKGFSVFGNLAQVDFQVEYD 420
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 129/348 (37%), Gaps = 53/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G+G P + L + DT + LTWTQC+PC +SCY+Q D I++ SY + C A
Sbjct: 146 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCTSA 205
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C D C YGI YGD + S + T+ D V N
Sbjct: 206 LCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATDV-----VDNFL 260
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
FGC ++ AG++GL SF+ Q FS CL S L F
Sbjct: 261 FGCGQNNQGLFGGS----AGLIGLGRHPISFVQQTAAKYRKIFSYCLPSTSSS-TGHLSF 315
Query: 178 GDQ--------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
G IA + LP +S T G I D G+V
Sbjct: 316 GPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTG---GAIIDSGTV 372
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGAD 270
+T + Y L + F S++ + TC++L + S P++ + F G
Sbjct: 373 ITRLPPTAYGALRSAFRQGMSKYPSAGELSILD---TCYDLSGYKVFSIPTIEFSFAGGV 429
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V P +F A TI G Q + VYD+
Sbjct: 430 TVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|326518194|dbj|BAK07349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 435
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 127/336 (37%), Gaps = 48/336 (14%)
Query: 25 LDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEGD--CFYG 82
LD V LTW QCQPC Q ++ S YK D C P+ G+ FY
Sbjct: 86 LDLVGNLTWIQCQPCVPEVRQEGAVFKSAVSPRYKDTKATDPKCTPPYTPSVGNRCSFYT 145
Query: 83 ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLN 142
++ + P V + FGC+ + F + ++AG + L+
Sbjct: 146 TSWNVAAHGYLGSDMFGFAGSPGTGGHGTDVDKLTFGCAHTTDGFERLNHGVLAGALSLS 205
Query: 143 WDSTSFMVQLG--RLVPDRFSCCLVQPDKSF----HSRLEFGDQIIAGKSLNLPPNSFT- 195
TSF+ QL RL RFS CL P +S H L FG I + FT
Sbjct: 206 RHPTSFLSQLTARRLADSRFSYCLF-PGQSHPNARHGFLRFGRDIPRHDHAHSTSLLFTG 264
Query: 196 -------------IKLNGQR------------------GCINDCGSVLTVIECEVYAVLT 224
I LNG+R G + D G+ LT + E Y ++
Sbjct: 265 RGSGSMYYIGVTSISLNGKRIIGLQPAFFRRNPQTRRGGSVVDPGTPLTRLVREAYNIVE 324
Query: 225 AEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ--GADLVVEPENVFIFN 282
AE + Y + + CF + PSMT + A L ++PE +F+
Sbjct: 325 AELVAYMQTQGSRRAPAPVQGHRLCF-VSWGHAHLPSMTINMNEDRAKLFIKPELLFLKV 383
Query: 283 HQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F P + T+LGA Q +T+F +DL
Sbjct: 384 THEHLCFL----VVPDEEMTVLGAAQQVDTRFTFDL 415
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 141/348 (40%), Gaps = 54/348 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + ++ +LDT + + W QC PC+ CY Q D +++ ++Y +PC
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPL 177
Query: 68 CK---SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+ SP C + C Y ++YGD T S +T T V + GC
Sbjct: 178 CRRLDSP-GCSNKNKVCQYQVSYGDGSFTFGDFSTETLTF------RRNRVTRVALGCGH 230
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQI 181
+++ + ++ L SF VQ GR +FS CLV S S + FGD
Sbjct: 231 DNEGLFTGAAGLLG----LGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFGDSA 286
Query: 182 IA-----------------------GKSLNLPP------NSFTIKLNGQRGCINDCGSVL 212
++ G S+ P + F + G G I D G+ +
Sbjct: 287 VSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSV 346
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADL 271
T + Y L F S F+ TCF+L P++ HF+GAD+
Sbjct: 347 TRLTRPAYIALRDAFRIGASHLKRAPEFSLFD---TCFDLSGLTEVKVPTVVLHFRGADV 403
Query: 272 VVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ N I ++ SF F F + G +I+G Q + YDL
Sbjct: 404 SLPATNYLIPVDNSGSFCFAFAGTMS---GLSIIGNIQQQGFRISYDL 448
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 148/364 (40%), Gaps = 68/364 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA- 66
YM K+ +G P LDT + LTW QCQPC+ CY Q+ P+++ R SY ++ YDA
Sbjct: 134 YMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEM-NYDAP 192
Query: 67 SCKSPFHCFEGD-----CFYGITYGDVY--ETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C++ GD C Y + YGD + + V L TL V + G
Sbjct: 193 DCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETL---TFAGGVRQAYLSIG 249
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-RFSCCLV---QPDKSFHSRL 175
C ++K AGI+GL S Q+ L + FS CLV S S L
Sbjct: 250 CGHDNKGLFGAPA---AGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTL 306
Query: 176 EFGDQIIAGKSLNLPPNSFT-------------IKL-----------------------N 199
FG AG PP SFT ++L
Sbjct: 307 TFG----AGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYT 362
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN 257
G+ G I D G+ +T + Y + + + ++ T G+ TC+ + R
Sbjct: 363 GRGGVILDSGTTVTRLARPAY--VAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAG 420
Query: 258 -SFPSMTYHFQGA-DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQF 314
P+++ HF G ++ ++P+N I + + + F F A T + +++G Q +
Sbjct: 421 VKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAF--AGTGDRSVSVIGNILQQGFRV 478
Query: 315 VYDL 318
VYDL
Sbjct: 479 VYDL 482
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 137/360 (38%), Gaps = 75/360 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ LGIG P L+DT + L+W QC+PC + CY Q DP+++ S SY +PC
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 177
Query: 66 ASCKS------PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
+C+ C G C YGI YG+ T V S +T TL P V V +
Sbjct: 178 DACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKP-----GVVVADF 232
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLE 176
FGC D + G++GL S + Q FS CL P L
Sbjct: 233 GFGCG----DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL-PPTSGGAGFLA 287
Query: 177 FGDQ----------------------------------IIAGKSLNLPPNSFTIKLNGQR 202
G + G L +PP++F+
Sbjct: 288 LGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFS------S 341
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SF 259
G + D G+V+T + YA L + F S++ +L V TC++ N +
Sbjct: 342 GMVIDSGTVITGLPATAYAALRSAFRSAMSEY---RLLPPSNGAVLDTCYDFTGHTNVTV 398
Query: 260 PSMTYHFQGADLV--VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P++ F G + P V + D F G G I+G +Q + +YD
Sbjct: 399 PTIALTFSGGATIDLATPAGVLV----DGCLAFAGAGTDDTIG--IIGNVNQRTFEVLYD 452
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/342 (22%), Positives = 136/342 (39%), Gaps = 39/342 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++ IG P + + DT + L W QC PC+ C QN P+++ R ++K +PC
Sbjct: 92 YLMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQP 151
Query: 68 CK----SPFHCF--EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C S C G C+Y YGD + ++ + + + + FGC+
Sbjct: 152 CTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINF--GSKNNAIKFPKLTFGCT 209
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI 181
+ D V K+ + G++GL S + QLG + +FS C + S++ FG+
Sbjct: 210 FSNNDTVDESKRNM-GLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDA 268
Query: 182 IAGK----------SLNLPPNSFTIKLNG---------------QRGCINDCGSVLTVIE 216
I + ++ P+ + + L G + D G+ T+++
Sbjct: 269 IVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILK 328
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPE 276
Y A + + +E + CF + FP + + F GA + V+
Sbjct: 329 QSFYNKFVALVKEVYG---VEAVKIPPLVYNFCFENKGKRKRFPDVVFLFTGAKVRVDAS 385
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
N +F +D+ T + +I G Q Q YDL
Sbjct: 386 N--LFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDL 425
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 133/352 (37%), Gaps = 61/352 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y LKLG+G P K +LDT + L+W QC+PC C+ Q DP++ + +Y+ L C +
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSS 179
Query: 67 SCK--------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C P G C Y +YGD + S D TL P ++ + +
Sbjct: 180 ECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQ-----TLPSFTY 234
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC +++ AGI+GL D S + QL FS CL S L G
Sbjct: 235 GCGQDNEGLFGKA----AGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSIG 290
Query: 179 D---------------------------QIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+AG+ + + + + I D G+V
Sbjct: 291 KISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVP------TIIDSGTV 344
Query: 212 LTVIECEVYAVLTAEFIDYFS-QHDIEKLFTCRKCGVTCFNLPAR-FNSFPSMTYHFQ-G 268
+T + +YA L F+ S +++ ++ TCF + + P + FQ G
Sbjct: 345 VTRLPISIYAALREAFVKIMSRRYEQAPAYSILD---TCFKGSLKSMSGAPEIRMIFQGG 401
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
ADL + N+ I + AF I+G Q YD+
Sbjct: 402 ADLSLRAPNILIEADKGIACL----AFASSNQIAIIGNHQQQTYNIAYDVSA 449
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 139/368 (37%), Gaps = 77/368 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L +G P + ++DT + L W QC PC C+EQ P+++ + SY+ + C D
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPR 211
Query: 68 C------KSPFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C C Y YGD T +L+ T+ + V ++ F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ ++ L + SF QL + FS CLV S S++ FG
Sbjct: 272 GCGHSNRGLFHGAAGLLG----LGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFG 327
Query: 179 DQ---------------------------------IIAGKSLNLPPNSFTIKLNGQRGCI 205
D ++ G+ LN+ P+++ + +G G I
Sbjct: 328 DDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTI 387
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
D G+ L+ Y V+ F++ + + L A F S Y+
Sbjct: 388 IDSGTTLSYFAEPAYEVIRRAFVERMDK---------------AYPLVADFPVL-SPCYN 431
Query: 266 FQGADLVVEPENVFIFNH-------QDSFFFFFGP--------AFTPRKGKTILGARHQH 310
G + V PE +F +++F P TPR +I+G Q
Sbjct: 432 VSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQ 491
Query: 311 NTQFVYDL 318
N +YDL
Sbjct: 492 NFHVLYDL 499
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 139/368 (37%), Gaps = 77/368 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L +G P + ++DT + L W QC PC C+EQ P+++ + SY+ + C D
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPR 211
Query: 68 C------KSPFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C C Y YGD T +L+ T+ + V ++ F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ ++ L + SF QL + FS CLV S S++ FG
Sbjct: 272 GCGHSNRGLFHGAAGLLG----LGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFG 327
Query: 179 DQ---------------------------------IIAGKSLNLPPNSFTIKLNGQRGCI 205
D ++ G+ LN+ P+++ + +G G I
Sbjct: 328 DDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTI 387
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
D G+ L+ Y V+ F++ + + L A F S Y+
Sbjct: 388 IDSGTTLSYFAEPAYEVIRRAFVERMDK---------------AYPLVADFPVL-SPCYN 431
Query: 266 FQGADLVVEPENVFIFNH-------QDSFFFFFGP--------AFTPRKGKTILGARHQH 310
G + V PE +F +++F P TPR +I+G Q
Sbjct: 432 VSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQ 491
Query: 311 NTQFVYDL 318
N +YDL
Sbjct: 492 NFHVLYDL 499
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 147/355 (41%), Gaps = 57/355 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
+LN+ ++LG K++ ++DT + LTW QCQPC+SCY Q P+Y+ SYK +
Sbjct: 84 SLNYIVTVELG----GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVF 139
Query: 63 CYDASCKSPFHCFEGD-------------CFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
C ++C+ C Y ++YGD T+ L + ++L D
Sbjct: 140 CNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTR--GDLASESILLGD--- 194
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
++N FGC +K ++ L S S + Q + FS CL +
Sbjct: 195 -TKLENFVFGCGRNNKGLFGGSSGLMG----LGRSSVSLVSQTLKTFNGVFSYCLPSLED 249
Query: 170 SFHSRLEFGD--------------------QIIAGKSLNLPPNSF-TIKLNGQ---RGCI 205
L FG+ Q+ + LNL S ++L RG +
Sbjct: 250 GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGIL 309
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+V+T + +Y + EF+ FS ++ TCFNL + + S P +
Sbjct: 310 IDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILD---TCFNLTSYEDISIPIIKM 366
Query: 265 HFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
FQG A+L V+ VF F D+ A + + I+G Q N + +YD
Sbjct: 367 IFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYD 421
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 126/302 (41%), Gaps = 42/302 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC CYEQ + +++ +Y + C
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAP 239
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G+C YG+ YGD + ++DT TL D +V+ RFGC +
Sbjct: 240 ACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCGERN 294
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEF--GDQI 181
+ AG++GL TS VQ F+ CL P +S + L+F G
Sbjct: 295 EGLFGEA----AGLLGLGRGKTSLPVQTYDKYGGVFAHCL--PARSSGTGYLDFGPGSPA 348
Query: 182 IAGKSLNLP------PNSFTIKLNGQR----------------GCINDCGSVLTVIECEV 219
AG L P P + + + G R G I D G+V+T +
Sbjct: 349 AAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAA 408
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPEN 277
Y+ L + F + +K TC++ + P+++ FQ GA L V+
Sbjct: 409 YSSLRSAFASAMAARGYKKAPAVSLLD-TCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 467
Query: 278 VF 279
+
Sbjct: 468 IM 469
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 147/355 (41%), Gaps = 57/355 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
+LN+ ++LG K++ ++DT + LTW QCQPC+SCY Q P+Y+ SYK +
Sbjct: 132 SLNYIVTVELG----GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVF 187
Query: 63 CYDASCKSPFHCFEGD-------------CFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
C ++C+ C Y ++YGD T+ L + ++L D
Sbjct: 188 CNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTR--GDLASESILLGD--- 242
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
++N FGC +K ++ L S S + Q + FS CL +
Sbjct: 243 -TKLENFVFGCGRNNKGLFGGSSGLMG----LGRSSVSLVSQTLKTFNGVFSYCLPSLED 297
Query: 170 SFHSRLEFGD--------------------QIIAGKSLNLPPNSF-TIKLNGQ---RGCI 205
L FG+ Q+ + LNL S ++L RG +
Sbjct: 298 GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGIL 357
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+V+T + +Y + EF+ FS ++ TCFNL + + S P +
Sbjct: 358 IDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILD---TCFNLTSYEDISIPIIKM 414
Query: 265 HFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
FQG A+L V+ VF F D+ A + + I+G Q N + +YD
Sbjct: 415 IFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYD 469
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 139/352 (39%), Gaps = 64/352 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +G+G P + +DT + ++W QC PC + C+ Q +++ +Y+ + C
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAA 186
Query: 66 ASCKSPFHCFEG------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
A C G +C YG+ YGD T S DT TL + +V+ +FG
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASD----AVKGFQFG 242
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--------- 170
CS F G+MGL + S + Q + FS CL S
Sbjct: 243 CSHLESGF----SDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGG 298
Query: 171 --------------------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ +RL+ D + GK L L P+ F G + D G+
Sbjct: 299 GGASGFVTTRMLRSKQIPTFYGARLQ--DIAVGGKQLGLSPSVFAA------GSVVDSGT 350
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA 269
++T + Y+ L++ F Q+ + R TCF+ + S P++ F G
Sbjct: 351 IITRLPPTAYSALSSAFKAGMKQY---RSAPARSILDTCFDFAGQTQISIPTVALVFSGG 407
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYDLDT 320
+ N ++ + +F A T G T I+G Q + +YD+ +
Sbjct: 408 AAIDLDPNGIMYGNCLAF------AATGDDGTTGIIGNVQQRTFEVLYDVGS 453
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 142/354 (40%), Gaps = 66/354 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P + ++ +LDT + + W QC PC+ CY Q+DPI++ R K+Y +PC
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ + + C Y ++YGD T V T TL V+ + GC +
Sbjct: 202 CRRLDSAGCNTRRKTCLYQVSYGDGSFT--VGDFSTETLTFRRN----RVKGVALGCGHD 255
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGD 179
++ FV + SF Q G +FS CLV D+S S+ + FG+
Sbjct: 256 NEGLFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLV--DRSASSKPSSVVFGN 308
Query: 180 QIIAGKSLNLPPNS------------FTIKLNGQR-----------------GCINDCGS 210
++ + P S I + G R G I D G+
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368
Query: 211 VLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYH 265
+T + Y + F + LF TCF+L + N P++ H
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFD------TCFDL-SNMNEVKVPTVVLH 421
Query: 266 FQGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F+GAD+ + N I + F F F G +I+G Q + VYDL
Sbjct: 422 FRGADVSLPATNYLIPVDTNGKFCFAFAGTM---GGLSIIGNIQQQGFRVVYDL 472
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 147/355 (41%), Gaps = 57/355 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
+LN+ ++LG K++ ++DT + LTW QCQPC+SCY Q P+Y+ SYK +
Sbjct: 132 SLNYIVTVELG----GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVF 187
Query: 63 CYDASCKSPFHCFEGD-------------CFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
C ++C+ C Y ++YGD T+ L + ++L D
Sbjct: 188 CNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTR--GDLASESILLGD--- 242
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
++N FGC +K ++ L S S + Q + FS CL +
Sbjct: 243 -TKLENFVFGCGRNNKGLFGGSSGLMG----LGRSSVSLVSQTLKTFNGVFSYCLPSLED 297
Query: 170 SFHSRLEFGD--------------------QIIAGKSLNLPPNSF-TIKLNGQ---RGCI 205
L FG+ Q+ + LNL S ++L RG +
Sbjct: 298 GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGIL 357
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+V+T + +Y + EF+ FS ++ TCFNL + + S P +
Sbjct: 358 IDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILD---TCFNLTSYEDISIPIIKM 414
Query: 265 HFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
FQG A+L V+ VF F D+ A + + I+G Q N + +YD
Sbjct: 415 IFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYD 469
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 143/354 (40%), Gaps = 66/354 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P + ++ +LDT + + W QC PC+ CY Q+DPI++ R K+Y +PC
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ + + C Y ++YGD T V T TL V+ + GC +
Sbjct: 202 CRRLDSAGCNTRRKTCLYQVSYGDGSFT--VGDFSTETLTFRRN----RVKGVALGCGHD 255
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGD 179
++ FV + SF Q G +FS CLV D+S S+ + FG+
Sbjct: 256 NEGLFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLV--DRSASSKPSSVVFGN 308
Query: 180 QIIA-----------------------GKSL------NLPPNSFTIKLNGQRGCINDCGS 210
++ G S+ + + F + G G I D G+
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGT 368
Query: 211 VLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYH 265
+T + Y + F + LF TCF+L + N P++ H
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFD------TCFDL-SNMNEVKVPTVVLH 421
Query: 266 FQGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F+GAD+ + N I + F F F G +I+G Q + VYDL
Sbjct: 422 FRGADVSLPATNYLIPVDTNGKFCFAFAGTM---GGLSIIGNIQQQGFRVVYDL 472
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 148/374 (39%), Gaps = 73/374 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + L +G P + ++DT + ++W QC PCK C P +N R S+ KLPC ++
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 198
Query: 68 CKS------PFHCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPD--EPSPVSVQNIRF 118
C + PF G C + I YGD + + +++T P+ + PV + NI
Sbjct: 199 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 258
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH----SR 174
GC+ ++ + +G++G++ SF QL +FS C PDK H
Sbjct: 259 GCADIDREGLPTGA---SGLLGMDRRPISFPSQLSSRYARKFSHCF--PDKIAHLNSSGL 313
Query: 175 LEFGDQIIAGKSLNLPP---------------------------------NSFTI-KLNG 200
+ FG+ I L P +F I K+ G
Sbjct: 314 VFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTG 373
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFI---DYFSQHDIEKLFTCRKCGVTCFNLPARFN 257
G I D G+ T ++ + + EF+ + ++ D FT C+N+ +
Sbjct: 374 SGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFT------PCYNITSGTA 427
Query: 258 S-----FPSMTYHFQGADLVVEPENVFIF-----NHQDSFFFFFGPAFTPRKGKTILGAR 307
+ PS+T HF+G VV P+N + Q + F + I+G
Sbjct: 428 ALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAF--LMSGDIPFNIIGNY 485
Query: 308 HQHNTQFVYDLDTF 321
Q N YDL+
Sbjct: 486 QQQNLWVEYDLEKL 499
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 147/350 (42%), Gaps = 55/350 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P L+ ++DT + + W QC+PC+ CY Q I++ +YK LP +
Sbjct: 86 YLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTT 145
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+S C + C Y I YGD ++ S++T TL + S V + GC
Sbjct: 146 CQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTN-GSSVKFRRTVIGCG- 203
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL---VPDRFSCCLVQPDKSFHSRLEFGD 179
+ VS + K +GI+GL S + QL R + +FS CL + S+L FGD
Sbjct: 204 -RNNTVSFEGK-SSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMS-NISSKLNFGD 260
Query: 180 -QIIAGKSLNLPP--------------NSFTIKLN-----------GQRG-CINDCGSVL 212
+++G P +F++ N G++G I D G+ L
Sbjct: 261 AAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSGTTL 320
Query: 213 TVIECEVYAVLTAEFIDYFS----QHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQG 268
T++ ++Y+ L + D + +++L C + N P + HF G
Sbjct: 321 TLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTFDELNAPV-------IMAHFSG 373
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
AD+ + N FI Q AF K I G Q N YDL
Sbjct: 374 ADVKLNAVNTFIEVEQGVTCL----AFISSKIGPIFGNMAQQNFLVGYDL 419
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 136/340 (40%), Gaps = 56/340 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KL +G P + +DT + L WTQC PC +CY Q PI++ + ++K+ C S
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNS 120
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C Y I Y D +K + +T T + P + GC S F
Sbjct: 121 CH-----------YKIIYADTTYSKGTLATETVT-IHSTSGEPFVMPETTIGCGHNSSWF 168
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-DQIIAGKS 186
K +G++GL+W +S + Q+G P S C S++ FG + I+AG
Sbjct: 169 ----KPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGT---SKINFGTNAIVAGDG 221
Query: 187 L--------NLPPNSFTIKLNG------------------QRGCINDCGSVLTVIECEVY 220
+ P + + L+ + I D G+ LT Y
Sbjct: 222 VVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-Y 280
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCG--VTCFNLPARFNSFPSMTYHFQ-GADLVVEPEN 277
L E +D H + + T G + C+ + FP +T HF GADLV++ N
Sbjct: 281 CNLVREAVD----HYVTAVRTADPTGNDMLCY-YTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
++I F P + I G R Q+N YD
Sbjct: 336 MYIETITRGTFCLAIICNNPPQ-DAIFGNRAQNNFLVGYD 374
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 136/340 (40%), Gaps = 56/340 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KL +G P + +DT + L WTQC PC +CY Q PI++ + ++K+ C S
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNS 120
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C Y I Y D +K + +T T + P + GC S F
Sbjct: 121 CH-----------YKIIYADTTYSKGTLATETVT-IHSTSGEPFVMPETTIGCGHNSSWF 168
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-DQIIAGKS 186
K +G++GL+W +S + Q+G P S C S++ FG + I+AG
Sbjct: 169 ----KPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGT---SKINFGTNAIVAGDG 221
Query: 187 L--------NLPPNSFTIKLNG------------------QRGCINDCGSVLTVIECEVY 220
+ P + + L+ + I D G+ LT Y
Sbjct: 222 VVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-Y 280
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCG--VTCFNLPARFNSFPSMTYHFQ-GADLVVEPEN 277
L E +D H + + T G + C+ + FP +T HF GADLV++ N
Sbjct: 281 CNLVREAVD----HYVTAVRTADPTGNDMLCY-YTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
++I F P + I G R Q+N YD
Sbjct: 336 MYIETITRGTFCLAIICNNPPQ-DAIFGNRAQNNFLVGYD 374
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 148/374 (39%), Gaps = 73/374 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + L +G P + ++DT + ++W QC PCK C P +N R S+ KLPC ++
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 197
Query: 68 CKS------PFHCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPD--EPSPVSVQNIRF 118
C + PF G C + I YGD + + +++T P+ + PV + NI
Sbjct: 198 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 257
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH----SR 174
GC+ ++ + +G++G++ SF QL +FS C PDK H
Sbjct: 258 GCADIDREGLPTGA---SGLLGMDRRPISFPSQLSSRYARKFSHCF--PDKIAHLNSSGL 312
Query: 175 LEFGDQIIAGKSLNLPP---------------------------------NSFTI-KLNG 200
+ FG+ I L P +F I K+ G
Sbjct: 313 VFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTG 372
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFI---DYFSQHDIEKLFTCRKCGVTCFNLPARFN 257
G I D G+ T ++ + + EF+ + ++ D FT C+N+ +
Sbjct: 373 SGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFT------PCYNITSGTA 426
Query: 258 S-----FPSMTYHFQGADLVVEPENVFIF-----NHQDSFFFFFGPAFTPRKGKTILGAR 307
+ PS+T HF+G VV P+N + Q + F + I+G
Sbjct: 427 ALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAF--QMSGDIPFNIIGNY 484
Query: 308 HQHNTQFVYDLDTF 321
Q N YDL+
Sbjct: 485 QQQNLWVEYDLEKL 498
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 80/300 (26%), Positives = 124/300 (41%), Gaps = 40/300 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC CYEQ + +++ +Y + C
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 238
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C YG+ YGD + ++DT TL D +V+ RFGC +
Sbjct: 239 ACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCGERN 293
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQIIA 183
+ AG++GL TS VQ F+ CL P +S + L+FG A
Sbjct: 294 EGLFGEA----AGLLGLGRGKTSLPVQTYDKYGGVFAHCL--PARSTGTGYLDFGAGSPA 347
Query: 184 GKSLNLP------PNSFTIKLNGQR----------------GCINDCGSVLTVIECEVYA 221
+ P P + + L G R G I D G+V+T + Y+
Sbjct: 348 ARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAAYS 407
Query: 222 VLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPENVF 279
L + F S +K TC++ + P+++ FQ GA L V+ +
Sbjct: 408 SLRSAFAAAMSARGYKKAPAVSLLD-TCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIM 466
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 152/357 (42%), Gaps = 59/357 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ IG+P + + DT + L W QCQPC+ CY+QN PI++ R SY+ + C +
Sbjct: 93 YLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEF 152
Query: 68 CKS--------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV---QNI 116
C F C Y +YGD + +++ + + + ++ Q +
Sbjct: 153 CNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQEV 212
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ +GI+GL S S + QLG + +FS CLV ++ S+
Sbjct: 213 AFGCGTKNGGTF---DELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSK 269
Query: 175 LEFGDQI-IAGKSLN------LP--PNSF------TIKLNGQR--------------GCI 205
+ FG+ I I+G + N LP P ++ I + +R I
Sbjct: 270 INFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPYTNLWNGEVEKGNII 329
Query: 206 NDCGSVLTVIECEVYAVLTA---EFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
D G+ LT ++ E + L + E + D LF CF + P +
Sbjct: 330 IDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNI------CFK-DEKAIELPII 382
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
T HF GAD+ ++P N F +D F P I G Q N YDL+
Sbjct: 383 TAHFTGADVELQPVNTFAKVEEDLLCF----TMIPSNDIAIFGNLAQMNFLVGYDLE 435
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 83/357 (23%), Positives = 130/357 (36%), Gaps = 54/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC C+EQN P Y+ SY+ + C+D+
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSR 240
Query: 68 CK--------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNI 116
C P C Y YGD T +L+T T+ + +P V+N+
Sbjct: 241 CHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENV 300
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF QL L FS CLV D + S+
Sbjct: 301 MFGCGHWNRGLFHGAAGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSK 356
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ G+ +N+P + I +G
Sbjct: 357 LIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGS 416
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFP 260
G I D G+ L+ Y V+ F+ + + K F + C+N+ P
Sbjct: 417 GGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLE---PCYNVTGVEQPDLP 473
Query: 261 SMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F + P + + TP +I+G Q N +YD
Sbjct: 474 DFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYD 530
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 123/294 (41%), Gaps = 46/294 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +GIG P L + DT + LTWTQC+PC SCY Q +P +N S +Y+ + C
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
C+ C +C Y I YGD T+ + + TL D ++++ FGC ++
Sbjct: 192 MCEDAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD-----VLEDVYFGCGENNQG 246
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL---------------------- 164
++ L S Q + FS CL
Sbjct: 247 LFDGVAGLLG----LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 302
Query: 165 -VQPDKSFHSRLEFGDQII----AGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
P SF S +G II K L + PNSF+ + G I D G+V T + +V
Sbjct: 303 KFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKV 357
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV 272
YA L + F + S + + TC++ ++P++ + F G+ +V
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGLFD---TCYDFTGLDTVTYPTIAFSFAGSTVV 408
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 123/294 (41%), Gaps = 41/294 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC CYEQ + +++ +Y + C
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAP 238
Query: 67 SC--KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C YG+ YGD + ++DT TL D +V+ RFGC +
Sbjct: 239 ACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCGERN 293
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEF--GDQI 181
+ AG++GL TS VQ F+ CL P +S + L+F G
Sbjct: 294 EGLFGEA----AGLLGLGRGKTSLPVQTYDKYGGVFAHCL--PARSSGTGYLDFGPGSPA 347
Query: 182 IAGKSLNLP------PNSFTIKLNGQR----------------GCINDCGSVLTVIECEV 219
AG L P P + + + G R G I D G+V+T +
Sbjct: 348 AAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPA 407
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV 272
Y+ L + F+ + +K TC++ + P+++ FQG ++
Sbjct: 408 YSSLRSAFVSAMAARGYKKAPAVSLLD-TCYDFTGMSQVAIPTVSLLFQGGAIL 460
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 138/339 (40%), Gaps = 64/339 (18%)
Query: 20 SLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEG 77
S ++DT + + W QC PC C+ Q DP+Y+ ++ +PC +CK +
Sbjct: 168 SQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGN 227
Query: 78 -------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSI 130
+C Y + YGD T DT T+ P + V++ RFGCS + S
Sbjct: 228 GCSPTTDECKYIVNYGDGKATTGTYVTDTLTMSP-----TIVVKDFRFGCSHAVRGSFSN 282
Query: 131 QKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--------FHSRLEFG---- 178
Q AGI+ L S + Q + FS C+ +P + + L+F
Sbjct: 283 QN---AGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPL 339
Query: 179 ---------------DQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVL 223
I+AGK L +PP +F G + D G+V+T + +VYA L
Sbjct: 340 IKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAAL 393
Query: 224 TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYHFQ-GADLVVEPENVFI 280
A F + + L + TC++ RF P ++ F GA L +EP ++ +
Sbjct: 394 RAAFRSAMAAYG--PLAAPVRNLDTCYDF-TRFPDVKVPKVSLVFAGGATLDLEPASIIL 450
Query: 281 FNHQDSFFFFFGPAFTP-RKGKTILGARHQHNTQFVYDL 318
D F A TP + +G Q + +YD+
Sbjct: 451 ----DGCLAF---AATPGEESVGFIGNVQQQTYEVLYDV 482
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/351 (25%), Positives = 143/351 (40%), Gaps = 50/351 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + + LDT + L WTQC+PC SC++Q P +++ + LPC
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCESTQ 94
Query: 68 CK---SPFHCFEGD-----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK + C + + C Y +YGD T + + D T + + S+ + FG
Sbjct: 95 CKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFV-----AGTSLPGVTFG 149
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR-------------------LVPDRF 160
C L + + + IAG G S +++G L D F
Sbjct: 150 CGLNNTGVFNSNETGIAG-FGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLF 208
Query: 161 S--------CCLVQPDKSFHSR----LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
S L+Q K+ + L + L +P ++F + NG G I D
Sbjct: 209 SNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDS 267
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ 267
G+ +T + +VY V+ EF +Q + + TCF+ P++ P + HF+
Sbjct: 268 GTSITSLPPQVYQVVRDEFA---AQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFE 324
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GA + + EN D+ A TI+G Q N +YDL
Sbjct: 325 GATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDL 375
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 135/305 (44%), Gaps = 41/305 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P + +DT + L W QC PC CY Q +P+++ +Y + C
Sbjct: 64 YLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPL 123
Query: 68 CKSPF--HCF-EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C P+ C E C Y Y D TK V + +T TL + P+S+Q I FGC +
Sbjct: 124 CYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLT-SNTGKPISLQGILFGCGHNN 182
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLV-PDRFSCCLVQ--PDKSFHSRLEFGD-Q 180
+ + G++GL TS + Q+G L +FS CLV D + S++ FG
Sbjct: 183 TGNFNDHE---MGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGS 239
Query: 181 IIAGKSLNLPP--------NSFTIKLNG--------------QRG-CINDCGSVLTVIEC 217
+ G+ + P S+ + L G ++G + D G+ ++
Sbjct: 240 EVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILPQ 299
Query: 218 EVYAVLTAEFIDYFSQHDI--EKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
++Y + E + I + + C T NL P++TYHF+GA+L++ P
Sbjct: 300 QLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKG-----PTLTYHFEGANLLLTP 354
Query: 276 ENVFI 280
FI
Sbjct: 355 IQTFI 359
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 139/359 (38%), Gaps = 57/359 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + IG P K +LDT + L W QC PC +C+EQ+ P Y+ + S++ + C+D
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPR 251
Query: 68 CK------SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNI 116
CK P C + + C Y YGD T +L+T T+ P + V+N+
Sbjct: 252 CKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENV 311
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF QL + FS CLV D S S+
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGR----GPLSFASQLQSIYGHSFSYCLVDRNSDTSVSSK 367
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ G+ L +P ++ + G
Sbjct: 368 LIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGG 427
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFP 260
G I D G+ LT Y ++ F+ +++ + F K C+N+ P
Sbjct: 428 GGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLK---PCYNVSGIEKMELP 484
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GA EN FI D TP+ +I+G Q N +YD+
Sbjct: 485 DFGILFSDGAMWDFPVENYFIQIEPD--LVCLAILGTPKSALSIIGNYQQQNFHILYDM 541
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/347 (23%), Positives = 143/347 (41%), Gaps = 46/347 (13%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
+++ +G P +DT + L W QC+PC C+ Q+ PI++ +Y L YD
Sbjct: 89 QAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL-SYD 147
Query: 66 ASC--KSPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+ SP + C Y +Y D T + + + V+V ++ FGC
Sbjct: 148 SPICPNSPQKKYNHLNQCIYNASYAD-GSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG 206
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPDKSFHSRLEFG 178
++ Q+ +GI+GL+ S + +LG RFS C L P + H++L G
Sbjct: 207 HSNRGRFDGQQ---SGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYT-HNQLVLG 258
Query: 179 DQI------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
D + + L++ P F +GQ G + D G+ T
Sbjct: 259 DGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATF 318
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYHF-QGADL 271
+ + + L+ E H +++ G C+ + FP + +HF +GADL
Sbjct: 319 LAKDGFDPLSNEIQRLVRGH-FQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADL 377
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V++ ++F+ +QD F + G +++G Q + YDL
Sbjct: 378 VLDANSLFVQKNQDVFCLAVLESNLKNIG-SVIGIMAQQHYNVAYDL 423
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 138/350 (39%), Gaps = 55/350 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + L + DT + LTWTQC+PC SCY+Q D I++ SY + C +
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSS 105
Query: 67 SC--------KSP-FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C KS + C Y YGD + S + T+ D V +
Sbjct: 106 LCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATD-----IVDDFL 160
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
FGC +++ + AG+MGL S + Q FS CL S L F
Sbjct: 161 FGCGQDNEGLFNGS----AGLMGLGRHPISIVQQTSSNYNKIFSYCLPATSSSL-GHLTF 215
Query: 178 GDQIIAGKSLNLPP-------NSF------TIKLNGQR------------GCINDCGSVL 212
G SL P NSF +I + G + G I D G+V+
Sbjct: 216 GASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVI 275
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPA-RFNSFPSMTYHFQGA 269
T + VYA L + F + +EK + G+ TC++L + S P + + F G
Sbjct: 276 TRLAPTVYAALRSAF-----RRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGG 330
Query: 270 DLV-VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V + + + F A T+ G Q + VYD+
Sbjct: 331 VTVELXHRGILXVESEQQVCLAF-AANGSDNDITVFGNVQQKTLEVVYDV 379
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/347 (23%), Positives = 143/347 (41%), Gaps = 46/347 (13%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
+++ +G P +DT + L W QC+PC C+ Q+ PI++ +Y L YD
Sbjct: 57 QAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL-SYD 115
Query: 66 ASC--KSPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+ SP + C Y +Y D T + + + V+V ++ FGC
Sbjct: 116 SPICPNSPQKKYNHLNQCIYNASYAD-GSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG 174
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPDKSFHSRLEFG 178
++ Q+ +GI+GL+ S + +LG RFS C L P + H++L G
Sbjct: 175 HSNRGRFDGQQ---SGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYT-HNQLVLG 226
Query: 179 DQI------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
D + + L++ P F +GQ G + D G+ T
Sbjct: 227 DGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATF 286
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYHF-QGADL 271
+ + + L+ E H +++ G C+ + FP + +HF +GADL
Sbjct: 287 LAKDGFDPLSNEIQRLVRGH-FQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADL 345
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V++ ++F+ +QD F + G +++G Q + YDL
Sbjct: 346 VLDANSLFVQKNQDVFCLAVLESNLKNIG-SVIGIMAQQHYNVAYDL 391
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 122/294 (41%), Gaps = 46/294 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +GIG P L + DT + LTWTQC+PC SCY Q +P +N S +Y+ + C
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
C+ C +C Y I YGD T+ + + TL D ++++ FGC ++
Sbjct: 192 MCEDAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSD-----VLEDVYFGCGENNQG 246
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL---------------------- 164
++ L S Q + FS CL
Sbjct: 247 LFDGVAGLLG----LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 302
Query: 165 -VQPDKSFHSRLEFGDQII----AGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
P SF S +G II K L + PNSF+ + G I D G+V T + +V
Sbjct: 303 KFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKV 357
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV 272
YA L + F + S + + TC++ ++P++ + F G +V
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGLFD---TCYDFTGLDTVTYPTIAFSFAGGTVV 408
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 141/351 (40%), Gaps = 52/351 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++KL +G P ++ L+DT + L W QC PC CY Q P++ K+Y +PC
Sbjct: 79 NGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCE 138
Query: 65 DASCK------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C SP + C Y +Y D TK V + + T D PV V +I F
Sbjct: 139 SEQCSFFGYSCSP----QKMCAYSYSYADSSVTKGVLAREAITFSSTDG-DPVVVGDIIF 193
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLV-PDRFSCCLV--QPDKSFHSRL 175
GC + + I G+ G S + Q+G L RFS CLV D +
Sbjct: 194 GCGHSNSGTFNENDMGIIGMGGGPL---SLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTI 250
Query: 176 EFGDQI-IAGKSLNLPP-------NSFTIKLNG----------------QRGCIN-DCGS 210
FG++ ++G+ + P S+ + L G +G I D G+
Sbjct: 251 NFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMIDSGT 310
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEK--LFTCRKCGVTCFNLPARFNSFPSMTYHFQG 268
T I E Y L E S IE + C + NL P +T HF+G
Sbjct: 311 PATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCYRSETNLEG-----PILTAHFEG 365
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
AD+ + P FI +D F F T G I G Q N +DLD
Sbjct: 366 ADVQLLPIQTFI-PPKDGVFCFAMAGST--DGDYIFGNFAQSNILMGFDLD 413
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 138/361 (38%), Gaps = 62/361 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC C++QN Y+ ++ SYK + C D
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQR 229
Query: 68 CK------SPFHCFEGD--CFYGITYGDVYETK---EVDSLDTSTLLPPDEPSPVSVQNI 116
C P C + C Y YGD T V++ + +V+N+
Sbjct: 230 CNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENM 289
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF QL L FS CLV D + S+
Sbjct: 290 MFGCGHWNRGLFHGAAGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 345
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++AG+ LN+P ++ I +G
Sbjct: 346 LIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGA 405
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFI-DYFSQHDIEKLFTCRKCGVT--CFNLPARFN- 257
G I D G+ L+ Y EFI + ++ K R + CFN+ N
Sbjct: 406 GGTIIDSGTTLSYFAEPAY-----EFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNV 460
Query: 258 SFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
P + F GA EN FI+ ++D TP+ +I+G Q N +Y
Sbjct: 461 QLPELGIAFADGAVWNFPTENSFIWLNED--LVCLAMLGTPKSAFSIIGNYQQQNFHILY 518
Query: 317 D 317
D
Sbjct: 519 D 519
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/347 (23%), Positives = 143/347 (41%), Gaps = 46/347 (13%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
+++ +G P +DT + L W QC+PC C+ Q+ PI++ +Y L YD
Sbjct: 57 QAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL-SYD 115
Query: 66 ASC--KSPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+ SP + C Y +Y D T + + + V+V ++ FGC
Sbjct: 116 SPICPNSPQKKYNHLNQCIYNASYAD-GSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG 174
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPDKSFHSRLEFG 178
++ Q+ +GI+GL+ S + +LG RFS C L P + H++L G
Sbjct: 175 HSNRGRFDGQQ---SGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYT-HNQLVLG 226
Query: 179 DQI------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
D + + L++ P F +GQ G + D G+ T
Sbjct: 227 DGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATF 286
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYHF-QGADL 271
+ + + L+ E H +++ G C+ + FP + +HF +GADL
Sbjct: 287 LAKDGFDPLSNEIQRLVRGH-FQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADL 345
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V++ ++F+ +QD F + G +++G Q + YDL
Sbjct: 346 VLDANSLFVQKNQDVFCLAVLESNLKNIG-SVIGIMAQQHYNVAYDL 391
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/345 (24%), Positives = 144/345 (41%), Gaps = 48/345 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
+++ +G G P ++ + DT + ++W QC PC CY+Q+DPI++ +Y +PC
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHP 179
Query: 67 SCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + G C Y + YGD T V S +T +L S ++ FGC +
Sbjct: 180 QCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLT-----SARALPGFAFGCGETN 234
Query: 125 -KDFVSIQKKIIAGIMGL-------------------NWDSTSFMVQLGRLVPD------ 158
DF + I G L +++++ + +G P
Sbjct: 235 LGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASGSDGV 294
Query: 159 RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
R++ + + D ++ ++ G L +PP FT + G + D G+VLT + E
Sbjct: 295 RYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFT-----RDGTLLDSGTVLTYLPPE 349
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTYHFQ-GADLVVEPE 276
Y L F +Q+ + TC++ + F P +++ F G+ + P
Sbjct: 350 AYTALRDRFKFTMTQYKPAPAYDPFD---TCYDFAGQNAIFMPLVSFKFSDGSSFDLSPF 406
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGK---TILGARHQHNTQFVYDL 318
V IF D+ AF PR TI+G Q NT+ +YD+
Sbjct: 407 GVLIFP-DDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDV 450
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 143/355 (40%), Gaps = 57/355 (16%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+ TLN+ + LG G+ ++DT + LTW QC PC SC++Q P+++ S SY
Sbjct: 121 LRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAV 176
Query: 61 LPCYDASCKS-----------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
LPC +SC + + C Y ++Y D ++ V + D +L +
Sbjct: 177 LPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSL------A 230
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
+ FGC ++ +G+MGL S + Q FS CL +
Sbjct: 231 GEVIDGFVFGCGTSNQGPFG----GTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKES 286
Query: 170 SFHSRLEFGDQIIAGKSLNLPPNSFT------------------IKLNGQR------GCI 205
L GD + N P +T I + GQ I
Sbjct: 287 ESSGSLVLGDDTSVYR--NSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVI 344
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTY 264
D G+++T + VY + AEF+ F+++ F+ TCFNL R PS+ +
Sbjct: 345 VDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILD---TCFNLTGFREVQIPSLKF 401
Query: 265 HFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
F+G ++ V+ V F DS A + +T I+G Q N + ++D
Sbjct: 402 VFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFD 456
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 143/355 (40%), Gaps = 57/355 (16%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+ TLN+ + LG G+ ++DT + LTW QC PC SC++Q P+++ S SY
Sbjct: 122 LRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAV 177
Query: 61 LPCYDASCKS-----------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
LPC +SC + + C Y ++Y D ++ V + D +L +
Sbjct: 178 LPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSL------A 231
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
+ FGC ++ +G+MGL S + Q FS CL +
Sbjct: 232 GEVIDGFVFGCGTSNQGPFG----GTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKES 287
Query: 170 SFHSRLEFGDQIIAGKSLNLPPNSFT------------------IKLNGQR------GCI 205
L GD + N P +T I + GQ I
Sbjct: 288 ESSGSLVLGDDTSVYR--NSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVI 345
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTY 264
D G+++T + VY + AEF+ F+++ F+ TCFNL R PS+ +
Sbjct: 346 VDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILD---TCFNLTGFREVQIPSLKF 402
Query: 265 HFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
F+G ++ V+ V F DS A + +T I+G Q N + ++D
Sbjct: 403 VFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFD 457
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 149/352 (42%), Gaps = 56/352 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P + + DT + L W QC PC +CYEQ +P+++ + ++YK L C +
Sbjct: 94 YLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEF 153
Query: 68 CKSPFHCFEGD----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ D C Y +YGD T+ S DT T + E P S I FGC +
Sbjct: 154 CQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLT-IGSTEGDPASFPGIAFGCGHD 212
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFGDQ- 180
+ + + + G+ S ++QL V +FS CLV D + S++ FG
Sbjct: 213 NGGTFNEKDGGLIGLG---GGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSG 269
Query: 181 IIAGK-SLNLP-----PNSF-------------TIKLNG------------QRGCINDCG 209
+++G +++ P P++F T+ G + I D G
Sbjct: 270 VVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSG 329
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQH---DIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
+ LT++ + Y + + + D +F+ C + NL P++T HF
Sbjct: 330 TTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSL--CYSSVNNL-----EIPTITAHF 382
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GAD+ + P N F+ +D F + P I G Q N YDL
Sbjct: 383 TGADVQLPPLNTFVQVQEDLVCF----SMIPSSNLAIFGNLAQINFLVGYDL 430
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/266 (25%), Positives = 110/266 (41%), Gaps = 57/266 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + +G P + ++DT + L + QC PC CYEQ+ P+Y + ++ +PC A
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSAE 93
Query: 68 C-------------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C P +G C Y YGD T V + +T+T+ + V
Sbjct: 94 CLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV------GGIRVN 147
Query: 115 NIRFGCSLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSF 171
++ FGC ++ FVS G++GL + SF Q G ++F+ CL S
Sbjct: 148 HVAFGCGNRNQGSFVS-----AGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSV 202
Query: 172 HSRLEFGDQIIA------------------------------GKSLNLPPNSFTIKLNGQ 201
S L FGD +++ G++L +P +++ I G
Sbjct: 203 FSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGN 262
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEF 227
G I D G+ +T + YA + A F
Sbjct: 263 GGTIFDSGTTVTYWSPQAYARIIAAF 288
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 149/350 (42%), Gaps = 49/350 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L IG P + +DT + L W QC PC +CY+Q +P+++ +S +Y + S
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSES 118
Query: 68 CKSPFHCF----EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + + +C Y +Y D T+ V + +T TL PV+++ + FGC
Sbjct: 119 CSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLT-STTGKPVALKGVIFGCGHN 177
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR-FSCCLV--QPDKSFHSRLEFGDQ 180
+ + ++ GI+GL S + Q+G + FS CLV + S S + FG
Sbjct: 178 NNGVFNDKE---MGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKG 234
Query: 181 I----------------------------IAGKSLNLPPN-SFTIKLNGQRGCINDCGSV 211
I+ + +NLP N +++ + + D G+
Sbjct: 235 SEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTP 294
Query: 212 LTVIECEVYAVLTAEFIDYFSQH--DIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGA 269
T++ + Y L E + + I+ + C T NL ++T HF+GA
Sbjct: 295 TTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTPTNLKGT-----TLTAHFEGA 349
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
D+++ P +FI F F F F+ G I G Q N +DL+
Sbjct: 350 DVLLTPTQIFIPVQDGIFCFAFTSTFSNEYG--IYGNHAQSNYLIGFDLE 397
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 133/354 (37%), Gaps = 51/354 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +G+G P + L + DT + L+W QC PC S CY Q DP++ S ++ + C +
Sbjct: 85 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGE 144
Query: 66 ASC-KSPFHCFE--GD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN----- 115
C ++ C GD C Y + YGD +++ V L TL PS + +N
Sbjct: 145 PECPRARQSCSSSPGDDRCPYEVVYGD--KSRTVGHLGNDTLTLGTTPSTNASENNSNKL 202
Query: 116 --IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
FGC + G+ GL S Q + FS CL + H
Sbjct: 203 PGFVFGCGENNTGLFGKAD----GLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHG 258
Query: 174 RLEFGDQIIAGKSLNL--------PPNSFTIKLNGQR-----------------GCINDC 208
L G A P+ + +KL G R G I D
Sbjct: 259 YLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDS 318
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN---SFPSMTYH 265
G+V+T + Y+ L F+ ++ ++ TC++ A N S P++
Sbjct: 319 GTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILD-TCYDFTAHANATVSIPAVALV 377
Query: 266 FQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GA + V+ V F P R ILG Q VYD+
Sbjct: 378 FAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAG-ILGNTQQRTVAVVYDV 430
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 141/354 (39%), Gaps = 65/354 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ LGIG P L+DT + L+W QC+PC SCY Q DP+Y+ + +Y +PC
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDS 186
Query: 66 ASCKS------PFHCFEGD----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
+CK C C YGI YG+ T V S +T TL P VSV++
Sbjct: 187 KACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSP-----QVSVKD 241
Query: 116 IRFGCSL-----------------ESKDFVSIQKKIIAGIMGL---NWDSTSFMVQLGRL 155
FGC L + VS + G +ST+ + LG
Sbjct: 242 FGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTTGFLALGAP 301
Query: 156 VPDR------FSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
+ F+ P+++ + + GK L++PP + G I D G
Sbjct: 302 TNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLS------GGMIIDSG 355
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF 266
+++T + Y+ L F S + L V TC+N N + P++ F
Sbjct: 356 TIITGLPDTAYSALRTAFRTAMSAY---PLLPPNNDDVLDTCYNFTGIANVTVPTVALTF 412
Query: 267 QGA---DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
G DL V P V I QD F G + I+G +Q + +YD
Sbjct: 413 DGGATIDLDV-PSGVLI---QDCLAFAGGAS---DGDVGIIGNVNQRTFEVLYD 459
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 147/354 (41%), Gaps = 55/354 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++K+ IG P ++ + DT + L WTQC PC SCY+Q +P+++ S+K++ C
Sbjct: 88 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 65 DASCK--SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C+ C + C + YGD + V + +T T L + P S+ NI FGC
Sbjct: 148 SQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLT-LNSNSGQPXSIXNIVFGC 206
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTS-FMVQLGRLVPDRFSCCLV--QPDKSFHSRLEF 177
+ + + + G G TS M LG +FS CLV + D S S++ F
Sbjct: 207 GHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGS--GRKFSQCLVPFRTDPSITSKIIF 264
Query: 178 GDQIIAGKS--LNLP------PNSFTIKLNG------------------QRGCINDCGSV 211
G + S ++ P P + + L+G + D G+
Sbjct: 265 GPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTP 324
Query: 212 LTVIECEVY-----AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
T++ + Y V A ++ D++ R A P +T HF
Sbjct: 325 PTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRS---------ATLIDGPILTAHF 375
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYDLD 319
GAD+ ++P N FI + + F A P G T I G Q N +DLD
Sbjct: 376 DGADVQLKPLNTFISPKEGVYCF----AMQPIDGDTGIFGNFVQMNFLIGFDLD 425
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 132/362 (36%), Gaps = 64/362 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC C++QN Y+ ++ SYK + C D
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPR 214
Query: 68 CK--------SPFHCFEGDCFYGITYGDVYETK---EVDSLDTSTLLPPDEPSPVSVQNI 116
C P C Y YGD T V++ + +V+N+
Sbjct: 215 CNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENM 274
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF QL L FS CLV D + S+
Sbjct: 275 MFGCGHWNRGLFHGAAGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 330
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ I+AG+ LN+P ++ I +G
Sbjct: 331 LIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGA 390
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFID-----YFSQHDIEKLFTC-RKCGVTCFNLPAR 255
G I D G+ L+ Y + + + Y D L C G+ LP
Sbjct: 391 GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPEL 450
Query: 256 FNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
+F GA EN FI+ ++D TP+ +I+G Q N +
Sbjct: 451 GIAFA------DGAVWNFPTENSFIWLNED--LVCLAILGTPKSAFSIIGNYQQQNFHIL 502
Query: 316 YD 317
YD
Sbjct: 503 YD 504
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 147/354 (41%), Gaps = 55/354 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++K+ IG P ++ + DT + L WTQC PC SCY+Q +P+++ S+K++ C
Sbjct: 88 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 65 DASCK--SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C+ C + C + YGD + V + +T T L + P S+ NI FGC
Sbjct: 148 SQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLT-LNSNSGQPTSILNIVFGC 206
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTS-FMVQLGRLVPDRFSCCLV--QPDKSFHSRLEF 177
+ + + + G G TS M LG +FS CLV + D S S++ F
Sbjct: 207 GHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGS--GRKFSQCLVPFRTDPSITSKIIF 264
Query: 178 GDQIIAGKS--LNLP------PNSFTIKLNG------------------QRGCINDCGSV 211
G + S ++ P P + + L+G + D G+
Sbjct: 265 GPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTP 324
Query: 212 LTVIECEVY-----AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
T++ + Y V A ++ D++ R A P +T HF
Sbjct: 325 PTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRS---------ATLIDGPILTAHF 375
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYDLD 319
GAD+ ++P N FI + + F A P G T I G Q N +DLD
Sbjct: 376 DGADVQLKPLNTFISPKEGVYCF----AMQPIDGDTGIFGNFVQMNFLIGFDLD 425
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 133/357 (37%), Gaps = 72/357 (20%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKK 60
TLN Y++ + +G P + +DT + L+W QC PC + CY Q DP+++ SY
Sbjct: 137 TLN--YVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAA 194
Query: 61 LPCYDASCKS----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
+PC C C C Y ++YGD +T V S DT TL P D +V+
Sbjct: 195 VPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPND-----AVRGF 249
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRL 175
FGC F G++GL + S + Q FS CL +P + + L
Sbjct: 250 FFGCGHAQSGFTGND-----GLLGLGREEASLVEQTAGTYGGVFSYCLPTRPSTTGY--L 302
Query: 176 EFGDQIIAG------KSLNLPPNSFT--------IKLNGQR----------GCINDCGSV 211
G A L PN+ T I + GQ+ G + D G+V
Sbjct: 303 TLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTV 362
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-ARFNSFPSMTYHFQGAD 270
+T + YA L + F + G+ + P A Y+F G
Sbjct: 363 ITRLPPTAYAALRSAF----------------RSGMASYGYPSAPATGILDTCYNFSGYG 406
Query: 271 LVVEPENVFIFN-------HQDSFFFFFGPAFTPR---KGKTILGARHQHNTQFVYD 317
V P F+ D F AF P G ILG Q + + D
Sbjct: 407 TVTLPNVALTFSGGATVTLGADGILSFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 463
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 144/366 (39%), Gaps = 74/366 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYD- 65
Y++ L IG P +S + DT + L WTQC PC + C++Q P+YN S +++ LPC
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 66 ----------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSV 113
A P C C Y TYG + T + +T T P D+ V V
Sbjct: 152 LNLCAAEARLAGATPPPGC---ACRYNQTYGTGW-TSGLQGSETFTFGSSPADQ---VRV 204
Query: 114 QNIRFGCSLESKD--------------FVSIQKKIIAGIMGL------NWDSTSFMVQLG 153
I FGCS S D +S+ ++ AG+ + S S ++ LG
Sbjct: 205 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLL-LG 263
Query: 154 -------------RLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNG 200
R P F +P S + L + +L +PP +F ++ +G
Sbjct: 264 PAAAAAALNGTGVRSTP--FVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADG 321
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLP--- 253
G I D G+ +T + Y + A + KL T CF LP
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAV------RSLVKLPVTDGSNATGLDLCFALPSSS 375
Query: 254 ARFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
A + PSMT HF GAD+V+ EN I D + + LG Q N
Sbjct: 376 APPATLPSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQTDGELSTLGNYQQQNL 432
Query: 313 QFVYDL 318
+YD+
Sbjct: 433 HILYDV 438
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/246 (30%), Positives = 109/246 (44%), Gaps = 41/246 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P K + + DT +GL WTQC+PCK+CY + P+++ S+K LPC
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSSKL 190
Query: 68 CKSPFH-CFEGDCFYGITYGDVYETKEVDSLDTSTL----LPPDEPSPVSVQNIRFGCSL 122
C+S C C Y Y D + +L T T+ L D +NI GCS
Sbjct: 191 CQSIRQGCSSPKCTYLTAYVD--NSSSTGTLATETISFSHLKYD------FKNILIGCS- 241
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLEFGDQI 181
D VS + +GIMGLN S Q + FS C+ P + H L FG ++
Sbjct: 242 ---DQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTGH--LTFGGKV 296
Query: 182 -----IAGKSLNLPPNSFTIKLNG---------------QRGCINDCGSVLTVIECEVYA 221
+ S P + + IK+ G + D G+VLT + + Y+
Sbjct: 297 PNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAVLTRLPPKAYS 356
Query: 222 VLTAEF 227
L + F
Sbjct: 357 ALRSVF 362
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 146/375 (38%), Gaps = 79/375 (21%)
Query: 8 YMLKLGIGDPVKSLWFL-LDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ LGIG P L LDT + L WTQC C C++Q P++ + ++ ++PC D
Sbjct: 94 YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDP 152
Query: 67 SCKSPFH-----CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEP-SPVSVQNIRF 118
C + C D CFY Y D T + DT T PD + +V NIRF
Sbjct: 153 LCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRF 212
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC + + + + IAG T + +L RFS C ++S S + G
Sbjct: 213 GCGMMNYGLFTPNQSGIAGF------GTGPLSLPSQLKVRRFSYCFTAMEESRVSPVILG 266
Query: 179 DQ--------------------------------------IIAGKSLNLPPNS--FTIKL 198
+ + G++ LP N+ F +K
Sbjct: 267 GEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGET-RLPFNASTFALKG 325
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN- 257
+G G D G+ +T V+ L F+ + K +T + CF++PA+
Sbjct: 326 DGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPL-PVAKGYT-DPDNLLCFSVPAKKKA 383
Query: 258 -SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRK-----------GKTILG 305
+ P + H +GAD + EN + N D + RK TI+G
Sbjct: 384 PAVPKLILHLEGADWELPRENYVLDNDDDG-------SGAGRKLCVVILSAGNSNGTIIG 436
Query: 306 ARHQHNTQFVYDLDT 320
Q N VYDL++
Sbjct: 437 NFQQQNMHIVYDLES 451
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 137/354 (38%), Gaps = 59/354 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD-- 65
Y++ L IG P + + LDT + L WTQCQPC +C++Q P ++ + + C
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 66 ------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
ASC SP C Y +YGD T +D T + + SV + FG
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFV----GAGASVPGVAFG 197
Query: 120 CSLESKDFVSIQKKIIAG-----------------------IMGLN-----WDSTSFMVQ 151
C L + + IAG + GL D + + +
Sbjct: 198 CGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257
Query: 152 LGRLVPDRFSCCLVQ-PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
GR S L+Q P L + L +P + FT+K NG G I D G+
Sbjct: 258 SGRGAVQ--STPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLK-NGTGGTIIDSGT 314
Query: 211 VLTVIECEVYAVLTAEF-----IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTY 264
+T + VY ++ F + S + + F C + P R + P +
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------CLSAPLRAKPYVPKLVL 366
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF+GA + + EN ++F +D+ A T +G Q N +YDL
Sbjct: 367 HFEGATMDLPREN-YVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDL 419
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 144/344 (41%), Gaps = 49/344 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + LG+G P +++ + DT + + W QC PC+SCY Q DP++N +++ + C +
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSL 140
Query: 68 CKSPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ C C Y ++YGD T V T TL +V ++ GC ++
Sbjct: 141 CQQLLIRGCRRNQCLYQVSYGDGSFT--VGEFSTETL----SFGSNAVNSVAIGCGHNNQ 194
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIA-- 183
+ ++ L SF Q+G+L FS CL + + L FG+Q +A
Sbjct: 195 GLFTGAAGLLG----LGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAVASN 250
Query: 184 --------------------------GKSLNLPPNSFTI-KLNGQRGCINDCGSVLTVIE 216
G S+N+P S ++ G G I D G+ +T +
Sbjct: 251 AQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLV 310
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVE 274
Y + F K+ + TC++L R + P++++ F GA + +
Sbjct: 311 TSAYNPMRDAFRAGMPSD--AKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALP 368
Query: 275 PENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+N+ + ++ ++ F P + +I+G Q + + +D
Sbjct: 369 AQNIMVPVDNSGTYCLAFAPN---SENFSIIGNIQQQSFRMSFD 409
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 133/353 (37%), Gaps = 64/353 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G+G P + L + DT + LTWTQC+PC +SCY+Q D I++ SY + C
Sbjct: 145 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTST 204
Query: 67 SCKSPFHCFEGD----------CFYGITYGDV-----YETKEVDSLDTSTLLPPDEPSPV 111
C + G+ C YGI YGD Y ++E S+ + +
Sbjct: 205 LC-TQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATDI--------- 254
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF 171
V N FGC ++ AG++GL SF+ Q + FS CL S
Sbjct: 255 -VDNFLFGCGQNNQGLFGGS----AGLIGLGRHPISFVQQTAAVYRKIFSYCLPATSSS- 308
Query: 172 HSRLEFGDQI----------------------IAGKSLN---LPPNSFTIKLNGQRGCIN 206
RL FG I G S+ LP +S T G I
Sbjct: 309 TGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTG---GAII 365
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYH 265
D G+V+T + Y L + F S++ + TC++L S P + +
Sbjct: 366 DSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILD---TCYDLSGYEVFSIPKIDFS 422
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F G V P ++ A TI G Q + VYD+
Sbjct: 423 FAGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 131/305 (42%), Gaps = 47/305 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
+++ +G P ++DT + + W +C PCK C +QN P+ + +Y LPC +
Sbjct: 99 FLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTM 158
Query: 68 CK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C S + C Y ++Y + V + + DE +V ++ FGCS E+
Sbjct: 159 CHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDE-GVNAVPSVVFGCSHEN 217
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLEFGDQI- 181
D+ + + G+ GL TSF+ ++G +FS CL + +++L FG++
Sbjct: 218 GDY---KDRRFTGVFGLGKGITSFVTRMG----SKFSYCLGNIADPHYGYNQLVFGEKAN 270
Query: 182 -----------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
+ K L++ +F++K N ++ + D G+ LT +
Sbjct: 271 FEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGN-EKSALIDSGTALTWLAES 329
Query: 219 VYAVLTAE---FIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVE 274
+ L E +D F C K V+ FP +T+HF GADL ++
Sbjct: 330 AFRALDNEVRQLLDGVLMPFWRGSFACYKGTVS-----QDLIGFPVVTFHFSGGADLDLD 384
Query: 275 PENVF 279
E++F
Sbjct: 385 TESMF 389
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 137/350 (39%), Gaps = 54/350 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L L IG P + + LDT + L WTQCQPC C+ Q+ P Y++ ++ C
Sbjct: 35 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 94
Query: 68 CK---SPFHCFE---GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
CK S C C Y +YGD T ++T + + + SV + FGC
Sbjct: 95 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFV-----AGASVPGVVFGCG 149
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-------------GR--------LVPDRF 160
L + I + GI G S QL GR L D +
Sbjct: 150 LNN---TGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLY 206
Query: 161 ---------SCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ + P L + L +P ++F +K NG G I D G+
Sbjct: 207 KNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTIIDSGTA 265
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG-VTCFNLP--ARFNSFPSMTYHFQG 268
T + VY ++ D F+ H + + G + CF+ P + P + HF+G
Sbjct: 266 FTSLPPRVYRLVH----DEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEG 321
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
A + + EN ++F +D A + TI+G Q N +YDL
Sbjct: 322 ATMHLPREN-YVFEAKDGGNCSICLAIIEGE-MTIIGNFQQQNMHVLYDL 369
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 141/354 (39%), Gaps = 66/354 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P + ++ +LDT + + W QC PC+ CY Q+DPI++ R K+Y +PC
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ + + C Y ++YGD T V T TL V+ + GC +
Sbjct: 202 CRRLDSAGCNTRRKTCLYQVSYGDGSFT--VGDFSTETLTFRRN----RVKGVALGCGHD 255
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR---LEFGD 179
++ FV + SF Q G +FS CLV D+S S+ + FG+
Sbjct: 256 NEGLFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLV--DRSASSKPSSVVFGN 308
Query: 180 QIIAGKSLNLPPNS------------FTIKLNGQR-----------------GCINDCGS 210
++ + P S I + G R G I D G+
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368
Query: 211 VLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFN--SFPSMTYH 265
+T + Y + F + LF TCF+L + N P++ H
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFD------TCFDL-SNMNEVKVPTVVLH 421
Query: 266 FQGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F+ AD+ + N I + F F F G +I+G Q + VYDL
Sbjct: 422 FRRADVSLPATNYLIPVDTNGKFCFAFAGTM---GGLSIIGNIQQQGFRVVYDL 472
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 138/351 (39%), Gaps = 49/351 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD-- 65
Y++ L IG P + + LDT + L WTQCQPC +C++Q P ++ + + C
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 94
Query: 66 ------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
ASC SP C Y +YGD T +D T + + SV + FG
Sbjct: 95 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFV----GAGASVPGVAFG 150
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR-------------------LVPDRF 160
C L + + IAG G S +++G L D F
Sbjct: 151 CGLFNNGVFKSNETGIAG-FGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLF 209
Query: 161 S--------CCLVQPDKSFHSR----LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
S L+Q K+ + L + L +P ++F + NG G I D
Sbjct: 210 SNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDS 268
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ 267
G+ +T + +VY V+ EF +Q + + TCF+ P++ P + HF+
Sbjct: 269 GTSITSLPPQVYQVVRDEFA---AQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFE 325
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GA + + EN D+ A TI+G Q N +YDL
Sbjct: 326 GATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDL 376
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 144/348 (41%), Gaps = 63/348 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P S L+DT + ++W QC+PC C+ Q DP+++ S +Y C A+
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAA 187
Query: 68 CKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C G C Y +TYGD T S DT L +V++ +FGCS
Sbjct: 188 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVKSFQFGCSN 241
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLE----- 176
F G+MGL + S + Q + FS CL P S L
Sbjct: 242 VESGF----NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 177 -------------------FGDQI----IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
+G ++ + G+ L++P + F+ G + D G+V+T
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVMDSGTVIT 351
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGAD 270
+ Y+ L++ F + +++ + G+ TCF+ + + S PS+ F G
Sbjct: 352 RLPPTAYSALSSAF-----KAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGA 406
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+V + I ++ +F A + I+G Q + +YD+
Sbjct: 407 VVSLDASGIILSNCLAFA-----ANSDDSSLGIIGNVQQRTFEVLYDV 449
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 137/345 (39%), Gaps = 50/345 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P + + +LDT + + W QC PC CY Q DP++N + +Y+K+PC
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPL 212
Query: 68 CKS--PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
CK C C Y ++YGD T V T TL + ++ + GC ++
Sbjct: 213 CKKLDISGCRNKRYCEYQVSYGDGSFT--VGDFSTETLTFRGQV----IRRVALGCGHDN 266
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-FHSRLEFGDQII- 182
+ ++ L S SF Q G RFS CLV S S L FG I
Sbjct: 267 EGLFIGAAGLLG----LGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTASSLIFGKAAIP 322
Query: 183 ---------------------------AGKSL-NLPPNSFTIKLNGQRGCINDCGSVLTV 214
G+ L ++P + F + G G I D G+ +T
Sbjct: 323 KSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTR 382
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLVV 273
+ Y+ + F F+ TC++L + P++ +HFQG +
Sbjct: 383 LVDSAYSTMRDAFRVGTGNLKSAGGFSLFD---TCYDLSGLKTVKVPTLVFHFQGGAHIS 439
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
P ++ S F F AF G +I+G Q + V+D
Sbjct: 440 LPATNYLIPVDSSATFCF--AFAGNTGGLSIIGNIQQQGYRVVFD 482
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 140/343 (40%), Gaps = 47/343 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P ++DT + LTW QC PC SC+ Q+ P++N +S +Y + C
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQ 181
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C + C Y +YGD + S DT + S+ N +
Sbjct: 182 QCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSLPNFYY 235
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------V 165
GC +++ AG++GL + S + QL + F+ CL
Sbjct: 236 GCGQDNEGLFGRS----AGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSY 291
Query: 166 QPDKSFHSRL---EFGDQI----IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
P + ++ + D + ++G ++ P S + I D G+V+T +
Sbjct: 292 NPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTS 351
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPEN 277
VY+ L+ ++ TCF A S P++T F GA L + +N
Sbjct: 352 VYSALSKAVAAAMKGTSRASAYSILD---TCFKGQASRVSAPAVTMSFAGGAALKLSAQN 408
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ + + DS AF P + I+G Q VYD+ +
Sbjct: 409 LLV-DVDDSTTCL---AFAPARSAAIIGNTQQQTFSVVYDVKS 447
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 147/355 (41%), Gaps = 57/355 (16%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
L H Y+++L IG P ++ + DT + LTWT C PC +CY+Q +P+++ + +Y+ + C
Sbjct: 69 LGH-YLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISC 127
Query: 64 YDASCK-------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C SP + C Y Y T+ V + +T TL S V ++ I
Sbjct: 128 DSKLCHKLDTGVCSP----QKRCNYTYAYASAAITRGVLAQETITLSSTKGKS-VPLKGI 182
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG-RLVPDRFSCCLV--QPDKSFHS 173
FGC + + + GI+GL S + Q+G RFS CLV D S S
Sbjct: 183 VFGCGHNNTGGFNDHE---MGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSS 239
Query: 174 RLEFGD-QIIAGKSLNLPP--------------------NSFTIKLNGQRGCIN------ 206
++ FG ++GK + P N++ + NG +
Sbjct: 240 KMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTY-LHFNGSSQNVEKGNMFL 298
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDI--EKLFTCRKCGVTCFNLPARFNSFPSMTY 264
D G+ T++ ++Y + A+ + + + + C T NL P +T
Sbjct: 299 DSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLRG-----PVLTA 353
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF+GAD+ + P FI F F T G + G Q N +DLD
Sbjct: 354 HFEGADVKLSPTQTFISPKDGVFCLGF--TNTSSDGG-VYGNFAQSNYLIGFDLD 405
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/362 (23%), Positives = 145/362 (40%), Gaps = 66/362 (18%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+ TLN+ + +G G+ ++DT + LTW QC+PC +C++Q +P+++ S SY
Sbjct: 108 LRTLNYVATVGIGGGEAT----VIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAA 163
Query: 61 LPCYDASCK--------SPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSP 110
+PC +SC S C + C Y ++Y D ++ V + D +L D
Sbjct: 164 VPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED---- 219
Query: 111 VSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS 170
+Q FGC ++ +G+MGL S + Q FS CL +
Sbjct: 220 --IQGFVFGCGTSNQGPFG----GTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESG 273
Query: 171 FHSRLEFGDQ--------------------------------IIAGKSLNLPPNSFTIKL 198
L GD + G+ + P F+
Sbjct: 274 SSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSP--GFSAGG 331
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFN 257
G+ I D G+++T + VYA + AEF+ +++ F+ TCF+L R
Sbjct: 332 GGK--AIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILD---TCFDLTGLREV 386
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFV 315
PS+ F GA++ V+ + V D+ A + T I+G Q N + +
Sbjct: 387 QVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVI 446
Query: 316 YD 317
+D
Sbjct: 447 FD 448
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 125/343 (36%), Gaps = 49/343 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P + + + DT + L+W QC PC CYEQ DP+++ +Y +PC
Sbjct: 146 YVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPE 205
Query: 68 CK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ S + C Y + YGD +T + DT TL D + FGC +
Sbjct: 206 CQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSDV-----LPGFVFGCGEQD 260
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
G++GL + S Q FS CL S L G
Sbjct: 261 TGLFGRAD----GLVGLGREKVSLSSQAASKYGAGFSYCLPS-SPSAAGYLSLGGPAPAN 315
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+AG+++ + P F+ G + D G+V+T +
Sbjct: 316 ARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAA-----GTVIDSGTVITRLPP 370
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLV-VEP 275
VYA L + F ++ ++ TC++ PS+ F G V ++
Sbjct: 371 RVYAALRSAFARSMGRYGYKRAPALSILD-TCYDFTGHTTVRIPSVALVFAGGAAVGLDF 429
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V F P I+G Q VYD+
Sbjct: 430 SGVLYVAKVSQACLAFAPNGDGADAG-IIGNTQQKTLAVVYDV 471
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 140/360 (38%), Gaps = 58/360 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC C+ QN+ Y+ ++ S+K + C D
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPR 221
Query: 68 C------KSPFHCFEGD--CFYGITYGDVYETK---EVDSLDTSTLLPPDEPSPVSVQNI 116
C + P C + C Y YGD T V++ + S V+N+
Sbjct: 222 CSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENM 281
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ S ++ SF QL L FS CLV D + S+
Sbjct: 282 MFGCGHWNRGLFSGASGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 337
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ G++L++P ++ I +G
Sbjct: 338 LIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGA 397
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--- 258
G I D G+ L+ Y ++ +F + ++ + +F CFN+ +
Sbjct: 398 GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYL--VFRDFPVLDPCFNVSGIEENNIH 455
Query: 259 FPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P + F GA EN FI+ +D TP+ +I+G Q N +YD
Sbjct: 456 LPELGIAFADGAVWNFPAENSFIWLSED--LVCLAILGTPKSTFSIIGNYQQQNFHILYD 513
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 136/346 (39%), Gaps = 50/346 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P K L + DT + LTWTQCQPC + CY Q DP++ +Y + C
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSP 190
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S G+ C YGI YGD + + +T TL D ++N
Sbjct: 191 DC-SQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTDV-----IENFL 244
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ----------- 166
FGC ++ AG++GL D S + Q + FS CL +
Sbjct: 245 FGCGQNNRGLFGSA----AGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTFG 300
Query: 167 -----------PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
P H F I G + + + G I D G+V+T +
Sbjct: 301 GGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITRL 360
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHFQGA-DLV 272
+ Y+ L + F +++ + TC++L +++++ P + + F+G +L
Sbjct: 361 PPDAYSALKSAFEKGMAKYPKAPELSILD---TCYDL-SKYSTIQIPKVGFVFKGGEELD 416
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ + F P I+G Q Q VYD+
Sbjct: 417 LDGIGIMYGASTSQVCLAFAGNQDPST-VAIIGNVQQKTLQVVYDV 461
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 137/350 (39%), Gaps = 54/350 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L L IG P + + LDT + L WTQCQPC C+ Q+ P Y++ ++ C
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 68 CK---SPFHCFE---GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
CK S C C Y +YGD T ++T + + + SV + FGC
Sbjct: 151 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFV-----AGASVPGVVFGCG 205
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-------------GR--------LVPDRF 160
L + I + GI G S QL GR L D +
Sbjct: 206 LNN---TGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLY 262
Query: 161 ---------SCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ + P L + L +P ++F +K NG G I D G+
Sbjct: 263 KNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTIIDSGTA 321
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG-VTCFNLP--ARFNSFPSMTYHFQG 268
T + VY ++ D F+ H + + G + CF+ P + P + HF+G
Sbjct: 322 FTSLPPRVYRLVH----DEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEG 377
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
A + + EN ++F +D A + TI+G Q N +YDL
Sbjct: 378 ATMHLPREN-YVFEAKDGGNCSICLAIIEGE-MTIIGNFQQQNMHVLYDL 425
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 144/366 (39%), Gaps = 74/366 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYD- 65
Y++ L IG P +S + DT + L WTQC PC + C++Q P+YN S +++ LPC
Sbjct: 97 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 156
Query: 66 ----------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSV 113
A P C C Y TYG + T + +T T P D+ V V
Sbjct: 157 LNLCAAEARLAGATPPPGC---ACRYNQTYGTGW-TSGLQGSETFTFGSSPADQ---VRV 209
Query: 114 QNIRFGCSLESKD--------------FVSIQKKIIAGIMGL------NWDSTSFMVQLG 153
I FGCS S D +S+ ++ AG+ + S S ++ LG
Sbjct: 210 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLL-LG 268
Query: 154 -------------RLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNG 200
R P F +P S + L + +L +PP +F ++ +G
Sbjct: 269 PAAAAAALNGTGVRSTP--FVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 326
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLP--- 253
G I D G+ +T + Y + A + KL T CF LP
Sbjct: 327 TGGLIIDSGTTITSLVDAAYKRVRAAV------RSLVKLPVTDGSNATGLDLCFALPSSS 380
Query: 254 ARFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
A + PSMT HF GAD+V+ EN I D + + LG Q N
Sbjct: 381 APPATLPSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQTDGELSTLGNYQQQNL 437
Query: 313 QFVYDL 318
+YD+
Sbjct: 438 HILYDV 443
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 144/366 (39%), Gaps = 74/366 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYD- 65
Y++ L IG P +S + DT + L WTQC PC + C++Q P+YN S +++ LPC
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 66 ----------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSV 113
A P C C Y TYG + T + +T T P D+ V V
Sbjct: 152 LNLCAAEARLAGATPPPGC---ACRYNQTYGTGW-TSGLQGSETFTFGSSPADQ---VRV 204
Query: 114 QNIRFGCSLESKD--------------FVSIQKKIIAGIMGL------NWDSTSFMVQLG 153
I FGCS S D +S+ ++ AG+ + S S ++ LG
Sbjct: 205 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLL-LG 263
Query: 154 -------------RLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNG 200
R P F +P S + L + +L +PP +F ++ +G
Sbjct: 264 PAAAAAALNGTGVRSTP--FVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 321
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT----CFNLP--- 253
G I D G+ +T + Y + A + KL T CF LP
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAV------RSLVKLPVTDGSNATGLDLCFALPSSS 375
Query: 254 ARFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
A + PSMT HF GAD+V+ EN I D + + LG Q N
Sbjct: 376 APPATLPSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQTDGELSTLGNYQQQNL 432
Query: 313 QFVYDL 318
+YD+
Sbjct: 433 HILYDV 438
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 135/352 (38%), Gaps = 63/352 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P K ++ +LDT + + W QC PC+ CY Q DP+++ + S+ + C
Sbjct: 147 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPL 206
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C SP C Y + YGD T S +T T P + GC ++
Sbjct: 207 CLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRVP------KVALGCGHDN 260
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-SRLEFGDQII 182
+ FV + SF Q G +FS CLV S S + FG +
Sbjct: 261 EGLFVGAAGLLGL-----GRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQSAV 315
Query: 183 AGKSLNLP-----------------------------PNSFTIKLNGQRGCINDCGSVLT 213
+ ++ P + F + G G I D G+ +T
Sbjct: 316 SRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTSVT 375
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEK-----LFTCRKCGVTCFNLPARFN-SFPSMTYHFQ 267
+ Y L F D+++ LF TCF+L + P++ HF+
Sbjct: 376 RLTRRAYVSLRDAF--RAGAADLKRAPDYSLFD------TCFDLSGKTEVKVPTVVMHFR 427
Query: 268 GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GAD+ + N I + F F F + G +I+G Q + V+D+
Sbjct: 428 GADVSLPATNYLIPVDTNGVFCFAFAGTMS---GLSIIGNIQQQGFRVVFDV 476
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 133/364 (36%), Gaps = 70/364 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+G+G P +LDT + + W QC PC+ CYEQ+ P+++ R SY + C A
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAAL 188
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ G C Y + YGD T +T T V + GC +
Sbjct: 189 CRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTF-----AGGARVARVALGCGHD 243
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV---------QPDKSFHS 173
++ FV+ + L SF Q+ R FS CLV P S
Sbjct: 244 NEGLFVAAAGLLG-----LGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSS 298
Query: 174 RLEFGDQIIAGKSLNLPP-------------NSFTIKLNGQR------------------ 202
+ FG + S + P I + G R
Sbjct: 299 TVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRG 358
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPA-RFN 257
G I D G+ +T + Y+ L F + G TC++L R
Sbjct: 359 GVIVDSGTSVTRLARASYSALRDAF-----RAAAAGGLRLSPGGFSLFDTCYDLGGRRVV 413
Query: 258 SFPSMTYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
P+++ HF GA+ + PEN I + + +F F F G +I+G Q + V
Sbjct: 414 KVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGT---DGGVSIIGNIQQQGFRVV 470
Query: 316 YDLD 319
+D D
Sbjct: 471 FDGD 474
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 139/353 (39%), Gaps = 50/353 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI----YNSRSFKSYKKLPC 63
Y K+G+G P + +DT + + W C C C ++D + Y+ + + K + C
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVSC 144
Query: 64 YDASC---KSPFHCFEGD-CFYGITYGD-----VYETKEVDSLDTSTLLPPDEPSPVSVQ 114
D C C G C Y I YGD Y K+V LD L+ + + +
Sbjct: 145 SDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLD---LVTGNRQTGSTNG 201
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL-------- 164
I FGC + + + + GIMG ++SF+ QL V F+ CL
Sbjct: 202 TIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGI 261
Query: 165 ------VQP--------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
V P KS H + + L L N+F +G I D G+
Sbjct: 262 FAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAF--DSGDDKGVIIDSGT 319
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF-QGA 269
L + VY L E + + H L T ++ TCF+ + + FP++T+ F +
Sbjct: 320 TLVYLPDAVYNPLLNEIL---ASHPELTLHTVQES-FTCFHYTDKLDRFPTVTFQFDKSV 375
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK---TILGARHQHNTQFVYDLD 319
L V P +D++ F + KG TILG N VYD++
Sbjct: 376 SLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIE 428
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 143/348 (41%), Gaps = 63/348 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P S L+DT + ++W QC+PC C+ Q DP+++ S +Y C A
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 111
Query: 68 CKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C G C Y +TYGD T S DT L +V++ +FGCS
Sbjct: 112 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQFGCSN 165
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLE----- 176
F G+MGL + S + Q + FS CL P S L
Sbjct: 166 VESGF----NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 221
Query: 177 -------------------FGDQI----IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
+G ++ + G+ L++P + F+ G + D G+V+T
Sbjct: 222 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVMDSGTVIT 275
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGAD 270
+ Y+ L++ F + +++ + G+ TCF+ + + S PS+ F G
Sbjct: 276 RLPPTAYSALSSAF-----KAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGA 330
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+V + I ++ F G + G I+G Q + +YD+
Sbjct: 331 VVSLDASGIILSN---CLAFAGNSDDSSLG--IIGNVQQRTFEVLYDV 373
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 150/355 (42%), Gaps = 61/355 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +++ IG P + + DT + L W QCQPC+ CY+Q PI+N + +Y+++ C
Sbjct: 94 YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRY 153
Query: 68 CKS--------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C + H F C Y +YGD + + L T + + S+Q + FG
Sbjct: 154 CNALNSDMRACSAHGFFKACGYSYSYGD--HSFTMGYLATERFIIGSTNN--SIQELAFG 209
Query: 120 C-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP-DKSFHS--RL 175
C + +F ++ +GI+GL S S + QLG + ++FS CLV +KS S ++
Sbjct: 210 CGNSNGGNF----DEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKI 265
Query: 176 EFGDQ-IIAGKS-------LNLPPNSF------TIKLNGQR---------------GCIN 206
FGD I+G ++ P +F I + +R I
Sbjct: 266 VFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNIII 325
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTY 264
D G+ LT ++ ++Y L + + +E G+ CF P +T
Sbjct: 326 DSGTTLTFLDSKLYNKL-----ELVLEKAVEGERVSDPNGIFSICFRDKIGI-ELPIITV 379
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF AD+ ++P N F +D F P G I G Q N YDLD
Sbjct: 380 HFTDADVELKPINTFAKAEEDLLCF----TMIPSNGIAIFGNLAQMNFLVGYDLD 430
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 132/311 (42%), Gaps = 54/311 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC---- 63
+++++ IG P + L+DT + L W QC PC CY+Q P+++ +Y + C
Sbjct: 68 HLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPL 127
Query: 64 ---YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
D SP E C Y YGD TK V + DT+T + PVS+ FGC
Sbjct: 128 CHKLDTGVCSP----EKRCNYTYGYGDNSLTKGVLAQDTATFT-SNTGKPVSLSRFLFGC 182
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLV-PDRFSCCLVQ--PDKSFHSRLEF 177
+ + + G++GL TS + Q+G L +FS CLV D SR+ F
Sbjct: 183 GHNNTGGFNDHE---MGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSF 239
Query: 178 --GDQIIAGKSLNLP------------------------PNSFTIKLNGQRGCINDCGSV 211
G Q++ + P P + TI G+ + D G+
Sbjct: 240 GKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTI---GKANMLVDSGTP 296
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDI--EKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGA 269
++ ++Y + AE + + I + + C T NL P++T+HF GA
Sbjct: 297 PILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTNLKG-----PTLTFHFVGA 351
Query: 270 DLVVEPENVFI 280
++++ P FI
Sbjct: 352 NVLLTPIQTFI 362
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 143/348 (41%), Gaps = 63/348 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P S L+DT + ++W QC+PC C+ Q DP+++ S +Y C A
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 187
Query: 68 CKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C G C Y +TYGD T S DT L +V++ +FGCS
Sbjct: 188 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQFGCSN 241
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLE----- 176
F G+MGL + S + Q + FS CL P S L
Sbjct: 242 VESGF----NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 177 -------------------FGDQI----IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
+G ++ + G+ L++P + F+ G + D G+V+T
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVMDSGTVIT 351
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGAD 270
+ Y+ L++ F + +++ + G+ TCF+ + + S PS+ F G
Sbjct: 352 RLPPTAYSALSSAF-----KAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGA 406
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+V + I ++ F G + G I+G Q + +YD+
Sbjct: 407 VVSLDASGIILSN---CLAFAGNSDDSSLG--IIGNVQQRTFEVLYDV 449
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 91/370 (24%), Positives = 140/370 (37%), Gaps = 80/370 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + IG P K +LDT + L W QC PC C+EQN P Y+ + S++ + C+D
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPR 149
Query: 68 CK------SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNI 116
C P C + C Y YGD T + +T T+ P + V+N+
Sbjct: 150 CHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENV 209
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF QL L FS CLV D + S+
Sbjct: 210 MFGCGHWNRGLFHGASGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 265
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ G+ LN+P +++ + +G
Sbjct: 266 LIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGV 325
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTC-----RKCGVTCFNLP--- 253
G I D G+ L+ Y ++ F+ + I + F GV +LP
Sbjct: 326 GGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKIDLPDFG 385
Query: 254 ------ARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGAR 307
A +N FP Y + ++PE V G TPR +I+G
Sbjct: 386 ILFADGAVWN-FPVENYFIR-----LDPEEVVCLA-------ILG---TPRSALSIIGNY 429
Query: 308 HQHNTQFVYD 317
Q N +YD
Sbjct: 430 QQQNFHVLYD 439
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 138/367 (37%), Gaps = 62/367 (16%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
+ + Y++ L +G P + + LDT + L WTQC PC+ C+ Q P+ + + +Y LPC
Sbjct: 88 VTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPC 147
Query: 64 YDASCKS-PFHCFEG-----------DCFYGITYGDVYETKEVDSLDTSTLLP--PDEPS 109
C++ PF G C Y YGD T + D T D S
Sbjct: 148 GAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDS 207
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
+ + + FGC +K + IAG W S +L FS C +
Sbjct: 208 RLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPS------QLNVTTFSYCFTSMFE 261
Query: 170 SFHSRLEFGDQI-----------IAGKSLNLP-------PNSFTIKLNG----------- 200
S S + G I+G+ P P+ + + L G
Sbjct: 262 SKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVP 321
Query: 201 ---QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPA-- 254
R I D G+ +T + VY + AEF +Q + + CF LP
Sbjct: 322 EAKLRSTIIDSGASITTLPEAVYEAVKAEFA---AQVGLPPTGVVEGSALDLCFALPVTA 378
Query: 255 --RFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
R PS+T H GAD + P ++F + P +T++G Q NT
Sbjct: 379 LWRRPPVPSLTLHLDGADWEL-PRGNYVFEDLAARVMCVVLDAAPGD-QTVIGNFQQQNT 436
Query: 313 QFVYDLD 319
VYDL+
Sbjct: 437 HVVYDLE 443
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 138/349 (39%), Gaps = 47/349 (13%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++KL +G P ++ L+DT + L W QC PC+ CY Q P++ +Y +PC
Sbjct: 47 NGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCD 106
Query: 65 DASCKSPFH---CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C S F + C Y Y D TK V + +T T D PV V +I FGC
Sbjct: 107 SEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDG-EPVVVGDIVFGCG 165
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLV-PDRFSCCLV--QPDKSFHSRLEFG 178
+ + I G+ S + Q G L RFS CLV D + FG
Sbjct: 166 HSNSGTFNENDMGIIGLG---GGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTISFG 222
Query: 179 DQI-IAGKSLNLPP-------NSFTIKLNG----------------QRGCIN-DCGSVLT 213
D ++G+ + P + + L G +G I D G+ T
Sbjct: 223 DASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDSGTPAT 282
Query: 214 VIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGAD 270
+ E Y L E + D L T + C + NL P + HF+GAD
Sbjct: 283 YLPQEFYDRLVKELKVQSNMLPIDDDPDLGT-QLCYRSETNLEG-----PILIAHFEGAD 336
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ + P FI +D F F T G+ I G Q N +DLD
Sbjct: 337 VQLMPIQTFI-PPKDGVFCFAMAGTT--DGEYIFGNFAQSNVLIGFDLD 382
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 87/352 (24%), Positives = 139/352 (39%), Gaps = 63/352 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
+++ +G G P ++ + DT + ++W QC PC CY+Q+DPI++ +Y +PC
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGHP 194
Query: 67 SCKSP--FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + C G C Y + YGD + V S +T +L S ++ FGC +
Sbjct: 195 QCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLT-----STRALPGFAFGCGQTN 249
Query: 125 -KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG----- 178
DF + G++GL S Q FS CL D + H L G
Sbjct: 250 LGDFGDVD-----GLIGLGRGQLSLSSQAAASFGGTFSYCLPS-DNTTHGYLTIGPTTPA 303
Query: 179 --DQI-----------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
D + I G L +PP FT G D G++LT
Sbjct: 304 SNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTILT 358
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTYHFQ-GADL 271
+ E Y L F +Q+ + TC++ + F P++++ F G+
Sbjct: 359 YLPPEAYTALRDRFKFTMTQYKPAPAYDPFD---TCYDFTGQSAIFIPAVSFKFSDGSVF 415
Query: 272 VVEPENVFIFNHQDS-----FFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + IF + F P+ P TI+G Q NT+ +YD+
Sbjct: 416 DLSFFGILIFPDDTAPAIGCLGFVARPSAMPF---TIVGNMQQRNTEVIYDV 464
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 139/361 (38%), Gaps = 61/361 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + IG P K +LDT + L W QC PC C+EQN P Y+ + S++ + C D
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 68 CK------SPFHC-FEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS----VQN 115
C+ P C FE C Y YGD T +L+T T+ + S V+N
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHS 173
+ FGC ++ ++ SF QL L FS CLV D S S
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSS 371
Query: 174 RLEFGD-------------QIIAGKS--------------------LNLPPNSFTIKLNG 200
+L FG+ +IAGK L +P ++ + +G
Sbjct: 372 KLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADG 431
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SF 259
G I D G+ L+ Y ++ F+ + + + F C+N+ +F
Sbjct: 432 AGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILH---PCYNVSGTDELNF 488
Query: 260 PSMTYHF-QGADLVVEPENVFIFNHQDSF--FFFFGPAFTPRKGKTILGARHQHNTQFVY 316
P F GA EN FI Q G TP+ +I+G Q N +Y
Sbjct: 489 PEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLG---TPKSALSIIGNYQQQNFHILY 545
Query: 317 D 317
D
Sbjct: 546 D 546
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 140/339 (41%), Gaps = 51/339 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ--PCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ L G P + + LDT + +TWTQC+ P +C+ Q P+++ + S+ LPC
Sbjct: 88 YLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSS 147
Query: 66 ASCKSPFHCFEGD------CFYGITYGDVYET-----KEVDSLDTSTLLPPDEPSPVSVQ 114
+C++ C G+ C Y I+YGD + +EV + + T E S +V
Sbjct: 148 PACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGT----GEGSSAAVP 203
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR 174
+ FGC ++ + + GI G S S QL FS C S S
Sbjct: 204 GLVFGCGHANRGVFTSNET---GIAGFGRGSLSLPSQLKV---GNFSHCFTTITGSKTSA 257
Query: 175 LEFGDQIIAGKSLNLPPNSFTI-KLNGQRGC-----INDCGSVLTVIECEVYAVLTAEFI 228
+ G +A PP++ + + G C ++ G+ +T + Y + EF
Sbjct: 258 VLLGLPGVA------PPSASPLGRRRGSYRCRSTPRSSNSGTSITSLPPRTYRAVREEFA 311
Query: 229 DYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHFQGADLVVEPENVFIFNHQD- 285
+Q + + TCF+ P R P+M HF+GA + + EN ++F D
Sbjct: 312 ---AQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEGATMRLPQEN-YVFEVVDD 367
Query: 286 ------SFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
S G+ ILG Q N +YDL
Sbjct: 368 DDAGNSSRIICLA---VIEGGEIILGNIQQQNMHVLYDL 403
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 136/354 (38%), Gaps = 59/354 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD-- 65
Y++ L IG P + + LDT + L WTQCQPC +C++Q P ++ + + C
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 66 ------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
ASC SP C Y +YGD T +D T + + SV + FG
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFV----GAGASVPGVAFG 197
Query: 120 CSLESKDFVSIQKKIIAG-----------------------IMGLN-----WDSTSFMVQ 151
C L + + IAG + GL D + + +
Sbjct: 198 CGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257
Query: 152 LGRLVPDRFSCCLVQ-PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
GR S L+Q P L + L +P + F +K NG G I D G+
Sbjct: 258 SGRGAVQ--STPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGTIIDSGT 314
Query: 211 VLTVIECEVYAVLTAEF-----IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTY 264
+T + VY ++ F + S + + F C + P R + P +
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------CLSAPLRAKPYVPKLVL 366
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF+GA + + EN ++F +D+ A T +G Q N +YDL
Sbjct: 367 HFEGATMDLPREN-YVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDL 419
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 137/350 (39%), Gaps = 54/350 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L L IG P + + LDT + L WTQCQPC C+ Q+ P Y++ ++ C
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 68 CK---SPFHCFE---GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
CK S C C + +YGD T ++T + + + SV + FGC
Sbjct: 151 CKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFV-----AGASVPGVVFGCG 205
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-------------GR--------LVPDRF 160
L + I + GI G S QL GR L D +
Sbjct: 206 LNN---TGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLY 262
Query: 161 ---------SCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ + P L + L +P ++F +K NG G I D G+
Sbjct: 263 KNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTIIDSGTA 321
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG-VTCFNLP--ARFNSFPSMTYHFQG 268
T + VY ++ D F+ H + + G + CF+ P + P + HF+G
Sbjct: 322 FTSLPPRVYRLVH----DEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEG 377
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
A + + EN ++F +D A + TI+G Q N +YDL
Sbjct: 378 ATMHLPREN-YVFEAKDGGNCSICLAIIEGE-MTIIGNFQQQNMHVLYDL 425
>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 416
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/348 (22%), Positives = 138/348 (39%), Gaps = 54/348 (15%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+ +G G K LDT ++W C+PC+ Q +++ + ++ + D C
Sbjct: 71 VSIGTGQGFKLQVLGLDTSTSMSWVMCEPCQPSLPQAGHLFSPAASPTFHGVHSNDPVCT 130
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV-SVQNIRFGCSLESKDFV 128
+P+ C + + Y +++ L L +P+ SV I FGC+ F
Sbjct: 131 APYRPTANGCSFRFPFASGYLSRDTFHLRNGGL---SGGAPIESVPGIMFGCAHSVAGFH 187
Query: 129 SIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF-HSRLEFGDQI------ 181
+ + G++ L+ S + QL RFS CL +P + H L G +
Sbjct: 188 N--DGTLGGVLSLSHLRLSLLTQLSARAGGRFSYCLPKPTQGNPHGFLRLGADVLPPLPH 245
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+A K L + P F G+ GC + + +T I
Sbjct: 246 SHMTALTVRSGSAPDYYLSLVGITLAEKRLRIDPRVFAA---GRGGCSINPAATITAIME 302
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF------NLPARFNSFPSMTYHFQ-GAD 270
Y V+ + Y + +++ G F ++ AR PSM +HF+ GA+
Sbjct: 303 PAYLVVERALVAYMKELGSDRVKKGPPGGGALFFDRMYKSVQAR---LPSMAFHFKDGAE 359
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
L PE +F + ++F G + +T++GA Q NT+F +D+
Sbjct: 360 LWFTPEQLFEVHGMVAWFMMVGKGYR----RTVIGAPQQVNTRFTFDV 403
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 129/349 (36%), Gaps = 66/349 (18%)
Query: 20 SLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEG-- 77
+L ++DT + LTW QC+PC CY Q DP+++ SY +PC ++C++ G
Sbjct: 176 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 235
Query: 78 ----------------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+Y + YGD ++ V + DT L SV FGC
Sbjct: 236 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL------GGASVDGFVFGCG 289
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLEFGDQ 180
L ++ AG+MGL S + Q FS CL L G
Sbjct: 290 LSNRGLFG----GTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGD 345
Query: 181 IIAGKSLNLPPNSFTIKL---------------------------NGQRGCINDCGSVLT 213
+ + N P S+T + G + D G+V+T
Sbjct: 346 TSSYR--NATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVIT 403
Query: 214 VIECEVYAVLTAEFIDYFS--QHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GA 269
+ VY + AEF F ++ F+ C+NL P +T + GA
Sbjct: 404 RLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDA---CYNLTGHDEVKVPLLTLRLEGGA 460
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
D+ V+ + +D A + +T I+G Q N + VYD
Sbjct: 461 DMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYD 509
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 129/349 (36%), Gaps = 66/349 (18%)
Query: 20 SLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEG-- 77
+L ++DT + LTW QC+PC CY Q DP+++ SY +PC ++C++ G
Sbjct: 175 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 234
Query: 78 ----------------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+Y + YGD ++ V + DT L SV FGC
Sbjct: 235 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL------GGASVDGFVFGCG 288
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLEFGDQ 180
L ++ AG+MGL S + Q FS CL L G
Sbjct: 289 LSNRGLFG----GTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGD 344
Query: 181 IIAGKSLNLPPNSFTIKL---------------------------NGQRGCINDCGSVLT 213
+ + N P S+T + G + D G+V+T
Sbjct: 345 TSSYR--NATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVIT 402
Query: 214 VIECEVYAVLTAEFIDYFS--QHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GA 269
+ VY + AEF F ++ F+ C+NL P +T + GA
Sbjct: 403 RLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDA---CYNLTGHDEVKVPLLTLRLEGGA 459
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
D+ V+ + +D A + +T I+G Q N + VYD
Sbjct: 460 DMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYD 508
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 107/255 (41%), Gaps = 38/255 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P K DT + LTWTQC+PC C+ QN P ++ + SYK + C
Sbjct: 140 YVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSE 199
Query: 67 SCK-------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK C C YGI YG Y + L T TL S +N FG
Sbjct: 200 FCKLIAEGNYPAQDCISNTCLYGIQYGSGY---TIGFLATETL---AIASSDVFKNFLFG 253
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLEFG 178
CS ES+ + G++GL + Q + FS CL P + H L FG
Sbjct: 254 CSEESRGTFNGT----TGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPSSTGH--LSFG 307
Query: 179 DQII-AGKSLNLPP--------NSFTIKLNGQRGCIN--------DCGSVLTVIECEVYA 221
++ A KS + P N+ I + G+ IN D G+ T + Y+
Sbjct: 308 VEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTFLPSPTYS 367
Query: 222 VLTAEFIDYFSQHDI 236
L + F + + + +
Sbjct: 368 ALGSAFREMMANYTL 382
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 139/361 (38%), Gaps = 61/361 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + IG P K +LDT + L W QC PC C+EQN P Y+ + S++ + C D
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 68 CK------SPFHC-FEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS----VQN 115
C+ P C FE C Y YGD T +L+T T+ + S V+N
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHS 173
+ FGC ++ ++ SF QL L FS CLV D S S
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSS 371
Query: 174 RLEFGD-------------QIIAGKS--------------------LNLPPNSFTIKLNG 200
+L FG+ +IAGK L +P ++ + +G
Sbjct: 372 KLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADG 431
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SF 259
G I D G+ L+ Y ++ F+ + + + F C+N+ +F
Sbjct: 432 AGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILH---PCYNVSGTDELNF 488
Query: 260 PSMTYHF-QGADLVVEPENVFIFNHQDSF--FFFFGPAFTPRKGKTILGARHQHNTQFVY 316
P F GA EN FI Q G TP+ +I+G Q N +Y
Sbjct: 489 PEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLG---TPKSALSIIGNYQQQNFHILY 545
Query: 317 D 317
D
Sbjct: 546 D 546
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 141/348 (40%), Gaps = 63/348 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P S L+DT + ++W QC+PC C+ Q DP+++ S +Y C A
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 257
Query: 68 CKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C G C Y +TYGD T S DT L +V++ +FGCS
Sbjct: 258 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQFGCSN 311
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLE----- 176
F G+MGL + S + Q + FS CL P S L
Sbjct: 312 VESGF----NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 367
Query: 177 -------------------FGDQI----IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
+G ++ + G+ L++P + F+ G + D G+V+T
Sbjct: 368 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVMDSGTVIT 421
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGAD 270
+ Y+ L++ F Q+ + G+ TCF+ + + S PS+ F G
Sbjct: 422 RLPPTAYSALSSAFKAGMKQYP-----PAQPSGILDTCFDFSGQSSVSIPSVALVFSGGA 476
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+V + I ++ F G + G I+G Q + +YD+
Sbjct: 477 VVSLDASGIILSN---CLAFAGNSDDSSLG--IIGNVQQRTFEVLYDV 519
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 136/356 (38%), Gaps = 51/356 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++GIG P K + +DT + + W C C SC ++ +Y+ + S K +
Sbjct: 89 YFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTVT 148
Query: 63 CYDASCKS-------PFHCFEGDCFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSV 113
C C + P C Y ITYGD T V + D + ++
Sbjct: 149 CGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLAN 208
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL------- 164
++ FGC + + + GI+G ++S + QL V FS CL
Sbjct: 209 ASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGGG 268
Query: 165 -------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
VQP H + + G +L LP N F I G RG I D G
Sbjct: 269 IFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIG-GGSRGTIIDSG 327
Query: 210 SVLTVIECEVY-AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ 267
+ L + VY AVL+A FS H L + CF N FP +T+HF
Sbjct: 328 TTLAYLPEVVYKAVLSA----VFSNHPDVTLKNVQD--FLCFQYSGSVDNGFPEVTFHFD 381
Query: 268 G-ADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
G LVV P + N +D + F G K +LG N VYDL+
Sbjct: 382 GDLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLE 437
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 140/353 (39%), Gaps = 50/353 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI----YNSRSFKSYKKLPC 63
Y K+G+G P + +DT + + W C C C ++D + Y++ + + K + C
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSVSC 144
Query: 64 YDASC---KSPFHCFEGD-CFYGITYGD-----VYETKEVDSLDTSTLLPPDEPSPVSVQ 114
D C C G C Y I YGD Y ++V LD L+ + + +
Sbjct: 145 SDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLD---LVTGNRQTGSTNG 201
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL-------- 164
I FGC + + + + GIMG ++SF+ QL V F+ CL
Sbjct: 202 TIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGI 261
Query: 165 ------VQP--------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
V P KS H + + L L ++F +G I D G+
Sbjct: 262 FAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAF--DSGDDKGVIIDSGT 319
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF-QGA 269
L + VY L + + + H L T + TCF+ R + FP++T+ F +
Sbjct: 320 TLVYLPDAVYNPLMNQIL---ASHQELNLHTVQDS-FTCFHYIDRLDRFPTVTFQFDKSV 375
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK---TILGARHQHNTQFVYDLD 319
L V P+ +D++ F + KG TILG N VYD++
Sbjct: 376 SLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIE 428
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 87/211 (41%), Gaps = 29/211 (13%)
Query: 3 TLNHTYMLKLG--IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
TLN+ + LG G P +L ++DT + LTW QC+PC +CY Q DP+++ +Y
Sbjct: 89 TLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAA 148
Query: 61 LPCYDASCKSPFHCFEG-------------DCFYGITYGDVYETKEVDSLDTSTLLPPDE 107
+ C ++C G C+Y + YGD ++ V + DT L
Sbjct: 149 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL----- 203
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--- 164
S+ FGC L ++ AG+MGL S + Q FS CL
Sbjct: 204 -GGASLGGFVFGCGLSNRGLFG----GTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAA 258
Query: 165 VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFT 195
D S L GD A N P ++T
Sbjct: 259 TSGDASGSLSLGGGDD-AASSYRNTTPVAYT 288
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 130/318 (40%), Gaps = 62/318 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + +G P + ++D+ + L W QC PC CY Q+ P+Y + ++ +PC
Sbjct: 65 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPE 124
Query: 68 C-----KSPFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C F C + G C Y Y D +K V + +++T+ V + + FG
Sbjct: 125 CLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV------DDVRIDKVAFG 178
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEF 177
C +++ + G++GL SF Q+G ++F+ CLV S S L F
Sbjct: 179 CGRDNQGSFAAA----GGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIF 234
Query: 178 GDQIIA------------------------------GKSLNLPPNSFTIKLNGQRGCIND 207
GD++I+ G+SL + +++++ G G I D
Sbjct: 235 GDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFD 294
Query: 208 CGSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTY 264
G+ +T Y + A F + Y ++ L C VT + P SFPS T
Sbjct: 295 SGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQGLDLCVD--VTGVDQP----SFPSFTI 348
Query: 265 HFQGADLVVEPE--NVFI 280
G V +P+ N F+
Sbjct: 349 VL-GGGAVFQPQQGNYFV 365
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/344 (22%), Positives = 144/344 (41%), Gaps = 49/344 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + LG+G P +++ + DT + + W QC PC+SCY Q DP++N +++ + C +
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSL 140
Query: 68 CKSPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ C C Y ++YGD T V T TL +V ++ GC ++
Sbjct: 141 CQQLLIRGCRRNQCLYQVSYGDGSFT--VGEFSTETL----SFGSNAVNSVAIGCGHNNQ 194
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIA-- 183
+ ++ L SF Q+G+L FS CL + + L FG+Q +A
Sbjct: 195 GLFTGAAGLLG----LGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAVASN 250
Query: 184 --------------------------GKSLNLPPNSFTI-KLNGQRGCINDCGSVLTVIE 216
G S+++P S ++ G G I D G+ +T +
Sbjct: 251 AQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLV 310
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVE 274
Y + F K+ + TC++L R + P++++ F GA + +
Sbjct: 311 TSAYNPMRDAFRAGMPSD--AKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALP 368
Query: 275 PENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+N+ + ++ ++ F P + +I+G Q + + +D
Sbjct: 369 AQNIMVPVDNSGTYCLAFAPN---SENFSIIGNIQQQSFRMSFD 409
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 123/304 (40%), Gaps = 49/304 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +G P + +DT + L+W QC+PC SCY Q DP+++ SY +PC
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGR 196
Query: 66 ASCKS----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
++C C C Y ++YGD T V S DT TL + +VQ FGC
Sbjct: 197 SACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLA-----ANATVQGFLFGCG 251
Query: 122 LESKD--FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFG 178
F I G++G + S + Q FS CL P KS + L G
Sbjct: 252 HAQSGGLFTGID-----GLLGFGREQPSLVQQTAGAYGGVFSYCL--PTKSSTTGYLTLG 304
Query: 179 --DQIIAGKSLN--LP-PNSFT--------IKLNGQ----------RGCINDCGSVLTVI 215
+ G S LP PN+ T I + GQ G + D G+V+T +
Sbjct: 305 GPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRL 364
Query: 216 ECEVYAVLTAEF----IDYFSQHDIEKLFTCRK-CGVTCFNLPARFNSFPSMTYHFQGAD 270
YA L + F Y S I L TC G NL + +F S GAD
Sbjct: 365 PPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGAD 424
Query: 271 LVVE 274
++
Sbjct: 425 GIMS 428
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 135/354 (38%), Gaps = 62/354 (17%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLP 62
+ Y+ +G+G P +LDT + LTW QC+PC S CY Q P+++ + SY +P
Sbjct: 126 SQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVP 185
Query: 63 CYDASCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
C C++ +GD C Y I YG S D TL P V
Sbjct: 186 CDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGP-----GAIV 240
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-GRLVPDRFSCCLVQPDKS-- 170
+ FGC + + + G++GL S Q R FS CL S
Sbjct: 241 KRFHFGCGHHQQRG---KFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGVSTG 297
Query: 171 --------------FHSRLEFGDQ-----------IIAGKSLNLPPNSFTIKLNGQRGCI 205
F L DQ +AG+ L++PP F + G I
Sbjct: 298 FLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVF------REGVI 351
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+VL+ ++ Y L F +++ + TCFN N + P+++
Sbjct: 352 TDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLD---TCFNFTGYDNVTVPTVSL 408
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F+G V + + F+ G +T ++G+ Q + +YD+
Sbjct: 409 TFRGGATVHLDASSGVLMDGCLAFWSSGDEYT-----GLIGSVSQRTIEVLYDM 457
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 90/203 (44%), Gaps = 27/203 (13%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN+ ++LG D + ++DT + LTW QC+PC SCY Q P++ + SY+ +P
Sbjct: 142 TLNYIVTMELGGQD----MTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIP 197
Query: 63 CYDASCKS---------PFHCFEGDCFYGITYGD-VYETKEVDSLDTSTLLPPDEPSPVS 112
C ++C+S +C Y + YGD Y E+ + S +S
Sbjct: 198 CNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF-------GGIS 250
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V N FGC +K ++G+MGL + S + Q FS CL D
Sbjct: 251 VSNFVFGCGKNNKGLFG----GVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGAS 306
Query: 173 SRLEFGDQIIAGKSLNLPPNSFT 195
L G++ K NL P ++T
Sbjct: 307 GSLAMGNESSVFK--NLTPIAYT 327
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 149/366 (40%), Gaps = 58/366 (15%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F +++ L IG P + ++DT + L W QC PC +C++Q+ ++ S+K L
Sbjct: 98 FNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTL 157
Query: 62 PC----YD----------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPD- 106
C Y+ + GD GI E+ ++LD + +
Sbjct: 158 GCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGIL---AKESLLFETLDEGRVFQYNA 214
Query: 107 ---EPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLN-WDSTSFMVQLGRLVPDRFSC 162
+ S + NI FGC + + G+ GL + + QLG ++FS
Sbjct: 215 ISTQISKIKKSNITFGCG--HMNIKTNNDDAYNGVFGLGAYPHITMATQLG----NKFSY 268
Query: 163 CLVQPDKSF--HSRLEFGD-----------QI-------------IAGKSLNLPPNSFTI 196
C+ + H+ L G QI + K+L + PN+F I
Sbjct: 269 CIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKI 328
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN--LPA 254
+G G + D G T + + +L E +D + +E++ T RK CF +
Sbjct: 329 SSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLM-KGLLERIPTQRKFEGLCFKGVVSR 387
Query: 255 RFNSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
FP++T+HF GADLV+E ++F + D F P+ + +++G Q N
Sbjct: 388 DLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYN 447
Query: 314 FVYDLD 319
+DL+
Sbjct: 448 VGFDLE 453
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 147/370 (39%), Gaps = 74/370 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA- 66
Y+ K+ +G P LDT + LTW QCQPC+ CY Q+ P+++ R SY ++ YDA
Sbjct: 141 YIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEM-NYDAP 199
Query: 67 SCKSPFHCFEGD-----CFYGITYGD----VYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C++ GD C Y + YGD + V L TL V +
Sbjct: 200 DCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLT---FAGGVRQAYLS 256
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-RFSCCLV---QPDKSFHS 173
GC ++K AGI+GL+ S Q+ L + FS CLV S S
Sbjct: 257 IGCGHDNKGLFGAPA---AGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSS 313
Query: 174 RLEFGDQIIAGKSLNLPPNSFT-------------IKL---------------------- 198
L FG AG PP SFT ++L
Sbjct: 314 TLTFG----AGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDP 369
Query: 199 -NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPAR 255
G G I D G+ +T + Y F + + ++ T G+ TC+ + R
Sbjct: 370 YTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAAT--GLGQVSTGGPSGLFDTCYTVGGR 427
Query: 256 FN-----SFPSMTYHFQGA-DLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARH 308
P+++ HF G +L ++P+N I + + + F F A T + +++G
Sbjct: 428 AGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAF--AGTGDRSVSVIGNIL 485
Query: 309 QHNTQFVYDL 318
Q + VYD+
Sbjct: 486 QQGFRVVYDI 495
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/246 (26%), Positives = 104/246 (42%), Gaps = 38/246 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QC+PC CY+Q +P+++ +Y + C D+
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDS 222
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C Y + YGD T + DT T+ + +++ RFGC ++
Sbjct: 223 ACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTI------AHDAIKGFRFGCGEKN 276
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
AG+MGL TS VQ F+ CL + L+FG AG
Sbjct: 277 NGLFG----KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCL-PALTTGTGYLDFGPG-SAG 330
Query: 185 KSLNLPP------------NSFTIKLNGQR-----------GCINDCGSVLTVIECEVYA 221
+ L P I++ GQ+ G + D G+V+T + Y
Sbjct: 331 NNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYT 390
Query: 222 VLTAEF 227
L++ F
Sbjct: 391 ALSSAF 396
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 141/354 (39%), Gaps = 70/354 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPC-Y 64
YM+ LG G P L+DT + ++W QC PC S CY Q DP+++ +Y + C
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGA 184
Query: 65 DASCKSPFH----CFEG--DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
DA K H C G C Y + YGD T+ V S +T T P ++V++ F
Sbjct: 185 DACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAP-----GITVKDFHF 239
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------V 165
GC + + G++GL S +VQ + FS CL V
Sbjct: 240 GCGHDQRG----PSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALGV 295
Query: 166 QPDKSFHSR------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
+P + ++ + + GK L++P ++F + G + D
Sbjct: 296 RPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAF------RGGMLID 349
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF 266
G+++T + Y L A F+ + + + TC+N N + P + F
Sbjct: 350 SGTIVTELPETAYNALNAALRKAFAAYPM----VASEDFDTCYNFTGYSNVTVPRVALTF 405
Query: 267 QGA---DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
G DL V P + + +D F P G I+G +Q + +YD
Sbjct: 406 SGGATIDLDV-PNGILV---KDCLAFR---ESGPDVGLGIIGNVNQRTLEVLYD 452
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 80/184 (43%), Gaps = 16/184 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KLGIG P +DT + L WTQCQPC CY Q DP++N R +Y LPC +
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C D C Y TY T+ ++D + + + + FGCS
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI------GEDAFRGVAFGCST 202
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII 182
S + +G++GL S + QL RF+ CL P +L G
Sbjct: 203 SSTGGAPPPQA--SGVVGLGRGPLSLVSQLS---VRRFAYCLPPPASRIPGKLVLGADAD 257
Query: 183 AGKS 186
A ++
Sbjct: 258 AARN 261
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 80/184 (43%), Gaps = 16/184 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KLGIG P +DT + L WTQCQPC CY Q DP++N R +Y LPC +
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C D C Y TY T+ ++D + + + + FGCS
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI------GEDAFRGVAFGCST 202
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII 182
S + +G++GL S + QL RF+ CL P +L G
Sbjct: 203 SSTGGAPPPQA--SGVVGLGRGPLSLVSQLS---VRRFAYCLPPPASRIPGKLVLGADAD 257
Query: 183 AGKS 186
A ++
Sbjct: 258 AARN 261
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 150/347 (43%), Gaps = 47/347 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P + ++DT + + W QCQPC+ CY Q PI++ K+YK LPC
Sbjct: 94 YLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNI 153
Query: 68 C---KSPFHCFEG--DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C +S C +C Y ITYGD ++ S++T TL D S V GC
Sbjct: 154 CQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDG-SSVQFPKTVIGCGH 212
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFGDQ 180
+K + Q++ I+GL S + QL + +FS CL + S+L FGD+
Sbjct: 213 NNKG--TFQREGSG-IVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDE 269
Query: 181 -IIAGKSLNLPP--------------NSFTIKLN-------------GQRGCINDCGSVL 212
+++G+ P +F++ N G+ I D G+ L
Sbjct: 270 AVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTTL 329
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADL 271
T++ + Y L + D ++E++ K C+ + + P +T HF+GAD+
Sbjct: 330 TILPEDDYLNLESAVADAI---ELERVEDPSKFLRLCYRTTSSDELNVPVITAHFKGADV 386
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ P + FI + F AF K I G Q N YDL
Sbjct: 387 ELNPISTFIEVDEGVVCF----AFRSSKIGPIFGNLAQQNLLVGYDL 429
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 146/355 (41%), Gaps = 56/355 (15%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
L H Y++++ IG P ++ + DT + LTWT C PC CY+Q +PI++ + SY+ + C
Sbjct: 22 LGH-YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISC 80
Query: 64 YDASCK-------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C SP + C Y Y T+ V + +T TL S V ++ I
Sbjct: 81 DSKLCHKLDTGVCSP----QKHCNYTYAYASAAITQGVLAQETITLSSTKGES-VPLKGI 135
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG-RLVPDRFSCCLV--QPDKSFHS 173
FGC + + ++ GI+GL SF+ Q+G RFS CLV D S S
Sbjct: 136 VFGCGHNNTGGFNDRE---MGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSS 192
Query: 174 RLEFGD-QIIAGKS-------------------LNLPPNSFTIKLNG------QRGCI-N 206
++ G ++GK L + + + NG ++G +
Sbjct: 193 KMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGNVFL 252
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTC--RKCGVTCFNLPARFNSFPSMTY 264
D G+ T++ ++Y L A+ + + + C T NL P +T
Sbjct: 253 DSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRG-----PVLTA 307
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF+G D+ + P F+ F F T G + G Q N +DLD
Sbjct: 308 HFEGGDVKLLPTQTFVSPKDGVFCLGF--TNTSSDGG-VYGNFAQSNYLIGFDLD 359
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 140/364 (38%), Gaps = 61/364 (16%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN-DPIYNSRSFKSYKKLP 62
+ + Y++ + +G P + + LDT + L WTQC PC C+EQ P+ + + ++ LP
Sbjct: 86 VTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALP 145
Query: 63 CYDASCKS-PFHCFEG------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C C++ PF G C Y YGD T + D+ T D ++ +
Sbjct: 146 CDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARR 205
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNW------DSTSF--------------MVQLGRL 155
+ FGC +K + IAG W + TSF +V LG
Sbjct: 206 VTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSSVVTLGAA 265
Query: 156 VPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNG--------------- 200
+ L+ + H+ ++I S P+ + + L G
Sbjct: 266 AAE-----LLHTHHAAHTGDVRTTRLIKNPS---QPSLYFVPLRGISVGGARVAVPESRL 317
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA----RF 256
+ I D G+ +T + +VY + AEF+ SQ + CF LP R
Sbjct: 318 RSSTIIDSGASITTLPEDVYEAVKAEFV---SQVGLPAAAAGSAALDLCFALPVAALWRR 374
Query: 257 NSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKG-KTILGARHQHNTQFV 315
+ P++T H G P ++F +D G + ++G Q NT V
Sbjct: 375 PAVPALTLHLDGGADWELPRGNYVF--EDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVV 432
Query: 316 YDLD 319
YDL+
Sbjct: 433 YDLE 436
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/358 (23%), Positives = 132/358 (36%), Gaps = 68/358 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++LG+G P +++ +LDT + + W QC PCK CY Q+DP++N K++ +PC
Sbjct: 136 YFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRL 195
Query: 68 CKSPFHCFE------GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+ E C Y ++YGD T S +T T V ++ GC
Sbjct: 196 CRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF------HGARVDHVALGCG 249
Query: 122 LESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK----------- 169
+++ FV + SF Q +FS CLV
Sbjct: 250 HDNEGLFVGAAGLLGL-----GRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTI 304
Query: 170 -----------------------SFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
+F+ G + + + + F + G G I
Sbjct: 305 VFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 364
Query: 207 DCGSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSM 262
D G+ +T + Y L F + LF TCF+L P++
Sbjct: 365 DSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFD------TCFDLSGMTTVKVPTV 418
Query: 263 TYHFQGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
+HF G ++ + N I N+Q F F AF G +I+G Q + YDL
Sbjct: 419 VFHFTGGEVSLPASNYLIPVNNQGRFCF----AFAGTMGSLSIIGNIQQQGFRVAYDL 472
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 137/362 (37%), Gaps = 63/362 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC +C+EQ+ P Y+ + S++ + C+D
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 254
Query: 68 CK--------SPFHCFEGDCFYGITYGDVYETKEVDSLDTST--LLPPDEPSPVS-VQNI 116
C+ +P C Y YGD T +L+T T L P+ S + V+N+
Sbjct: 255 CQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENV 314
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF Q+ L FS CLV + S S+
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGK----GPLSFASQMQSLYGQSFSYCLVDRNSNASVSSK 370
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ + L +P ++ + G
Sbjct: 371 LIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGA 430
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHD-IEKLFTCRKC----GVTCFNLPARF 256
G I D G+ LT Y ++ F+ ++ +E L + C G+ LP
Sbjct: 431 GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFG 490
Query: 257 NSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
F GA EN FI D PR +I+G Q N +Y
Sbjct: 491 ILFA------DGAVWNFPVENYFI--QIDPDVVCLAILGNPRSALSIIGNYQQQNFHILY 542
Query: 317 DL 318
D+
Sbjct: 543 DM 544
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 77/157 (49%), Gaps = 10/157 (6%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +GIG P + + DT + LTWTQC+PC SCY Q +P +N S SY + C
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSSP 193
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
C +P C +C YGI YGD T + + TL D + +I FGC E+
Sbjct: 194 MCGNPESCSASNCLYGIGYGDGSVTVGFLAKEKFTLTNSD-----VLDDIYFGCG-ENNK 247
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC 163
V I AGI+GL SF +Q + FS C
Sbjct: 248 GVFIGS---AGILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 120/312 (38%), Gaps = 64/312 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPC-- 63
Y++ LGIG P L+DT + L+W QC+PC + CY Q DP+++ ++ +PC
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCAS 184
Query: 64 ----------YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
YD C + C Y I YG+ T+ V S +T L S V
Sbjct: 185 DACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL-----GSSAVV 239
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV-------- 165
++ RFGC + G++GL S + Q + FS CL
Sbjct: 240 KSFRFGCGSDQHG----PYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNSGAGF 295
Query: 166 ----QPDKSFHSRL--------EFGDQI------------IAGKSLNLPPNSFTIKLNGQ 201
P+ + +S F +I + GK+L++PP F
Sbjct: 296 LTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVF------A 349
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFP 260
+G I D G+V+T I Y L F +++ + L TC+N + P
Sbjct: 350 KGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPL--LPPADSALDTCYNFTGHGTVTVP 407
Query: 261 SMTYHFQGADLV 272
+ F G V
Sbjct: 408 KVALTFVGGATV 419
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 138/346 (39%), Gaps = 54/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ + +G P + + DT + TW QCQPC + CY Q +P+++ +Y + C +
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 155
Query: 67 SCKSPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + C G C YGI YGD T + DT TL + +++N RFGC ++
Sbjct: 156 YCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTL------AYDTIKNFRFGCGEKN 209
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQIIA 183
+ AG++GL TS VQ F+ CL P S + L+ G A
Sbjct: 210 RGLFGRA----AGLLGLGRGKTSLPVQAYDKYGGVFAYCL--PATSAGTGFLDLGPGAPA 263
Query: 184 GKSLNLP------PNSFTIKLNGQR----------------GCINDCGSVLTVIECEVYA 221
+ P P + + + G + G + D G+V+T + YA
Sbjct: 264 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYA 323
Query: 222 VLTAEFIDYFS--QHDIEKLFTCRKCGVTCFNLPARFN---SFPSMTYHFQ-GADLVVEP 275
L + F + F+ TC++L + P+++ FQ GA L V+
Sbjct: 324 PLRSAFSKAMQGLGYSAAPAFSILD---TCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 380
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQFVYDL 318
+ AF P T I+G Q +YD+
Sbjct: 381 SGILYVADVSQACL----AFAPNADDTDVAIVGNTQQKTHGVLYDI 422
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 124/309 (40%), Gaps = 55/309 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G+G P K L DT + LTWTQC+PC C+ QND ++ SYK L C
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSE 191
Query: 67 SCK-----SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
CK S C + C YG+ YG Y T + +T T+ P D +N GC
Sbjct: 192 PCKSIGKESAQGCSSSNSCLYGVKYGTGY-TVGFLATETLTITPSDV-----FENFVIGC 245
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+ S AG++GL + Q + FS CL S L FG
Sbjct: 246 GERNGGRFS----GTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPASSSS-TGHLSFGGG 300
Query: 181 I-------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ + G+ L + P+ F G I D G+ LT +
Sbjct: 301 VSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTA-----GTIIDSGTTLTYL 355
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN---SFPSMTYHFQGA-DL 271
++ L++ F + + + + K + + C++ N + P ++ F+G ++
Sbjct: 356 PSTAHSALSSAFQEMMTNYTLTKGTSGLQ---PCYDFSKHANDNITIPQISIFFEGGVEV 412
Query: 272 VVEPENVFI 280
++ +FI
Sbjct: 413 DIDDSGIFI 421
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 146/369 (39%), Gaps = 72/369 (19%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+ TLN+ + LG G+ ++DT + LTW QC PC+SC++Q DP+++ S SY
Sbjct: 148 LRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAA 203
Query: 61 LPCYDASCKS-----------PFHCFEGD-----CFYGITYGDVYETKEVDSLDTSTLLP 104
+PC +SC + C D C Y ++Y D ++ V + D +L
Sbjct: 204 VPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL-- 261
Query: 105 PDEPSPVSVQNIRFGCSLESKD--FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSC 162
+ + FGC ++ F +G+MGL S + Q FS
Sbjct: 262 ----AGEVIDGFVFGCGTSNQGPPFGG-----TSGLMGLGRSQLSLVSQTMDQFGGVFSY 312
Query: 163 CLVQPDKSFHSRLEFGDQ---------IIAGKSLNLP-------PNSFTIKLNGQR---- 202
CL + L GD I+ ++ P N I + GQ
Sbjct: 313 CLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESS 372
Query: 203 ---------GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP 253
I D G+V+T + +Y + AEF+ F+++ F+ TCFN+
Sbjct: 373 GFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILD---TCFNMT 429
Query: 254 A-RFNSFPSMTYHFQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK---TILGARH 308
R PS+ F G ++ V+ V F DS A P K + I+G
Sbjct: 430 GLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCL--AMAPLKSEYETNIIGNYQ 487
Query: 309 QHNTQFVYD 317
Q N + ++D
Sbjct: 488 QKNLRVIFD 496
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 142/344 (41%), Gaps = 47/344 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + DT + LTW QC PC CY+Q PI+N S+ +PC +
Sbjct: 92 YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQT 151
Query: 68 CKS--PFHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + HC +G C Y TYGD +K + T+ SV+++ GC S
Sbjct: 152 CHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITI------GSSSVKSV-IGCGHAS 204
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSFHSRLEFGDQ-I 181
+G++GL S + Q+ + + RFS CL + ++ FG+ +
Sbjct: 205 SGGFGFA----SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAV 260
Query: 182 IAGKSLNLPP----NSFT--------IKLNGQR--------GCINDCGSVLTVIECEVYA 221
++G + P N+ T I + +R I D G+ LT++ E+Y
Sbjct: 261 VSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLTILPKELYD 320
Query: 222 VLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPE 276
+ + + ++ L C G+ N A P +T HF GA++ + P
Sbjct: 321 GVVSSLLKVVKAKRVKDPHGSLDLCFDDGI---NAAASLG-IPVITAHFSGGANVNLLPI 376
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
N F D+ A +P I+G Q N YDL+
Sbjct: 377 NTF-RKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEA 419
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/353 (24%), Positives = 140/353 (39%), Gaps = 56/353 (15%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y++ + +G P S+ + DT + L W QC PC CY+Q +P+++ + K+YK L C +
Sbjct: 93 SYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNND 152
Query: 67 SCK---SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+ C + + C +YGD T+ S +T T + E P S + FGC
Sbjct: 153 FCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFT-IGSTEGDPASFPGLAFGCGH 211
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFGDQ 180
+ + + + G+ S ++QL V +FS CLV D + S++ FG
Sbjct: 212 SNGGTFNEKDSGLIGLG---GGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKS 268
Query: 181 IIAGKSLNL--------PPNSFTIKLNG------------------------QRGCINDC 208
+ S + P + + L G + I D
Sbjct: 269 AVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIIDS 328
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQH---DIEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
G+ LT++ + Y + + D F+ GV + P++T H
Sbjct: 329 GTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEI-------PTITAH 381
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GAD+ + P N F+ +D F + P I G Q N YDL
Sbjct: 382 FIGADVQLPPLNTFVQAQEDLVCF----SMIPSSNLAIFGNLSQMNFLVGYDL 430
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 145/370 (39%), Gaps = 78/370 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
+ + + IG P ++ + DT + LTW QC+PC+ CY++N PI++ + +YK PC +
Sbjct: 85 FFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRN 144
Query: 68 CKS----PFHCFEGD--CFYGITYGDVYETK-----EVDSLDTSTLLPPDEPSPVSVQNI 116
C++ C E + C Y +YGD +K E S+D+++ SPVS
Sbjct: 145 CQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSAS------GSPVSFPGT 198
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV----------- 165
FGC + + +GI+GL S + QLG + +FS CL
Sbjct: 199 VFGCGYNNGGTF---DETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSV 255
Query: 166 --------------------------QPDKSFHSRLE---FGDQIIAGKSLNLPPNSFTI 196
+P ++ LE G + I + PN I
Sbjct: 256 INLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGI 315
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG------VTCF 250
I D G+ LT++E A F D FS E + ++ CF
Sbjct: 316 LSETSGNIIIDSGTTLTLLE--------AGFFDKFSSAVEESVTGAKRVSDPQGLLSHCF 367
Query: 251 NLPARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQH 310
+ P +T HF GAD+ + P N F+ +D + P I G Q
Sbjct: 368 KSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCL----SMVPTTEVAIYGNFAQM 423
Query: 311 NTQFVYDLDT 320
+ YDL+T
Sbjct: 424 DFLVGYDLET 433
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 124/303 (40%), Gaps = 47/303 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC +CYEQ + +++ S +Y + C
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 239
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C YG+ YGD + ++DT TL D +V+ RFGC E
Sbjct: 240 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCG-ER 293
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
D + + AG++GL TS VQ F+ CL P + L+FG
Sbjct: 294 NDGLFGEA---AGLLGLGRGKTSLPVQTYGKYGGVFAHCL-PPRSTGTGYLDFGAGSPPA 349
Query: 182 -----------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
+ G+ L + P+ F G I D G+V+T +
Sbjct: 350 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPA 404
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPE 276
Y+ L + F + K TC++ + P+++ FQ GA L V+
Sbjct: 405 AYSSLRSAFAAAMAARGYRKAAAVSLLD-TCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 463
Query: 277 NVF 279
+
Sbjct: 464 GIM 466
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 132/360 (36%), Gaps = 60/360 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + IG P + +LDT + L W QC PC C+ QN P Y+ + S+K + C+D
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPR 251
Query: 68 CK--------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNI 116
C P C Y YGD T +L+T T+ P + V+N+
Sbjct: 252 CHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENV 311
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF QL L FS CLV D + S+
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 367
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ G+ L +P ++ + G
Sbjct: 368 LIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGA 427
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFP 260
G I D G+ L+ Y ++ F+ + + K F C+N+ P
Sbjct: 428 GGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILD---PCYNVSGVEKMELP 484
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSF--FFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F+ GA EN FI + G TPR +I+G Q N +YD
Sbjct: 485 EFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILG---TPRSALSIIGNYQQQNFHILYD 541
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 80/184 (43%), Gaps = 16/184 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KLGIG P +DT + L WTQCQPC CY Q DP++N R +Y LPC +
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C D C Y TY T+ ++D + + + + FGCS
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI------GEDAFRGVAFGCST 202
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII 182
S + +G++GL S + QL RF+ CL P +L G
Sbjct: 203 SSTGGAPPPQA--SGVVGLGRGPLSLVSQLS---VRRFAYCLPPPASRIPGKLVLGADAD 257
Query: 183 AGKS 186
A ++
Sbjct: 258 AARN 261
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 138/346 (39%), Gaps = 54/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ + +G P + + DT + TW QCQPC + CY Q +P+++ +Y + C +
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 220
Query: 67 SCKSPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + C G C YGI YGD T + DT TL + +++N RFGC ++
Sbjct: 221 YCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTL------AYDTIKNFRFGCGEKN 274
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQIIA 183
+ AG++GL TS VQ F+ CL P S + L+ G A
Sbjct: 275 RGLFGRA----AGLLGLGRGKTSLPVQAYDKYGGVFAYCL--PATSAGTGFLDLGPGAPA 328
Query: 184 GKSLNLP------PNSFTIKLNGQR----------------GCINDCGSVLTVIECEVYA 221
+ P P + + + G + G + D G+V+T + YA
Sbjct: 329 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYA 388
Query: 222 VLTAEFIDYFS--QHDIEKLFTCRKCGVTCFNLPARFN---SFPSMTYHFQ-GADLVVEP 275
L + F + F+ TC++L + P+++ FQ GA L V+
Sbjct: 389 PLRSAFSKAMQGLGYSAAPAFSILD---TCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 445
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQFVYDL 318
+ AF P T I+G Q +YD+
Sbjct: 446 SGILYVADVSQACL----AFAPNADDTDVAIVGNTQQKTHGVLYDI 487
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 65/246 (26%), Positives = 103/246 (41%), Gaps = 38/246 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QC+PC CY+Q P+++ +Y + C D+
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDS 222
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C Y + YGD T + DT T+ + +++ RFGC ++
Sbjct: 223 ACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTI------AHDAIKGFRFGCGEKN 276
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
AG+MGL TS VQ F+ CL + L+FG AG
Sbjct: 277 NGLFG----KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCL-PALTTGTGYLDFGPG-SAG 330
Query: 185 KSLNLPP------------NSFTIKLNGQR-----------GCINDCGSVLTVIECEVYA 221
+ L P I++ GQ+ G + D G+V+T + Y
Sbjct: 331 NNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYT 390
Query: 222 VLTAEF 227
L++ F
Sbjct: 391 ALSSAF 396
>gi|326532334|dbj|BAK05096.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 437
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 133/345 (38%), Gaps = 67/345 (19%)
Query: 25 LDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEGD--CFYG 82
LD V LTW QCQPC Q +++S YK + D C P+ G+ FY
Sbjct: 87 LDLVGNLTWMQCQPCVPEVRQEGAVFDSAESPRYKHMKATDPMCTPPYTPSVGNRCSFYT 146
Query: 83 ITY---------GDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKK 133
T+ D++ + ST V + FGC+ + +
Sbjct: 147 TTWNVAAHGYLGSDMFAFAGTGAGGHST----------DVDQLIFGCAHTTDGLERLSHG 196
Query: 134 IIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSF----HSRLEFGDQI------ 181
++AG + L+ SF+ QL L RFS CL P++S H L FG I
Sbjct: 197 VLAGALSLSRHPMSFLSQLTARGLADSRFSYCLF-PEQSHPIAKHGFLRFGRDIPRHDHA 255
Query: 182 -------------------IAGKSLN------LPPNSFTIKLNGQR-GCINDCGSVLTVI 215
+ G SLN L P FT L +R G + D G+ LT +
Sbjct: 256 HSTSLLFTGPGSGGMYHIRVVGISLNGRRIMRLQPAMFTRNLQTRRGGSVVDPGTPLTRL 315
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF--QGADLVV 273
+ Y ++ AE + + + + CF + PS+T + A L +
Sbjct: 316 VRQAYDIVEAEVVANMQKQGARRAKAQVQGHRLCFVSWGHVH-LPSLTINMYEDTAKLFI 374
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+PE +F + F P + T+LGA Q +T+F +DL
Sbjct: 375 KPE--LLFRKVTARLLCF--TVMPDEEMTVLGAAQQMDTRFTFDL 415
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 103/244 (42%), Gaps = 39/244 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC CYEQ + +++ +Y + C
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C + C G C YG+ YGD + ++DT TL D +V+ RFGC +
Sbjct: 240 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCGERN 294
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQIIA 183
+ AG++GL TS VQ F+ CL P +S + L+FG +A
Sbjct: 295 EGLFGEA----AGLLGLGRGKTSLPVQTYDKYGGVFAHCL--PARSTGTGYLDFGAGSLA 348
Query: 184 GKSLNLPPNSFT-------------IKLNGQ-----------RGCINDCGSVLTVIECEV 219
L T I++ GQ G I D G+V+T +
Sbjct: 349 AARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408
Query: 220 YAVL 223
Y+ L
Sbjct: 409 YSSL 412
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 81/185 (43%), Gaps = 12/185 (6%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + + +LDT + + W QC+PC+ CY Q DPI+N S+ + C A
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAV 216
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C + C G C Y +YGD + S T TL SV N+ GC ++
Sbjct: 217 CSQLDAYDCHSGGCLYEASYGD--GSYSTGSFATETLT----FGTTSVANVAIGCGHKNV 270
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGK 185
++ + SF Q+G FS CLV + L+FG + +
Sbjct: 271 GLFIGAAGLLGLGA----GALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSVPVG 326
Query: 186 SLNLP 190
S+ P
Sbjct: 327 SIFTP 331
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 131/339 (38%), Gaps = 55/339 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ IG P ++ +DT + L W QC+PCK CY Q PI++ SY+ +PC +
Sbjct: 88 YLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLSDT 147
Query: 68 CKSPFHCFEGDCFYGITYGDV--YETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C S T DV Y + E +LD++T VS GC +
Sbjct: 148 CHS----------MRTTSCDVRGYLSVETLTLDSTTGY------SVSFPKTMIGCGYRNT 191
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII--- 182
+GI+GL S QLG + +FS CL + S+L FGD I
Sbjct: 192 GTFHGPS---SGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNSTSKLNFGDAAIVYG 248
Query: 183 --------------AGKSLNLPPNSFTIKL--------NGQRGCI-NDCGSVLTVIECEV 219
+G L L S KL G G I D G+ T + +V
Sbjct: 249 DGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFTFLPYDV 308
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVF 279
Y + +Y + +E K C+N+ P +T HF+GAD+ + + F
Sbjct: 309 YYRFESAVAEYINLEHVEDPNGTFKL---CYNVAYHGFEAPLITAHFKGADIKLYYISTF 365
Query: 280 IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
I F P+ T I G Q N Y+L
Sbjct: 366 IKVSDGIACLAFIPSQT-----AIFGNVAQQNLLVGYNL 399
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 144/362 (39%), Gaps = 63/362 (17%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F +++ L IG P + ++DT + L W QC PC +C++Q+ ++ S+K L
Sbjct: 98 FNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTL 157
Query: 62 PC----YD----------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDE 107
C Y+ + GD GI KE S L +
Sbjct: 158 GCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGIL------AKE------SLLFETLD 205
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLN-WDSTSFMVQLGRLVPDRFSCCLVQ 166
+ NI FGC + + G+ GL + + QLG ++FS C+
Sbjct: 206 EGKIKKSNITFGCG--HMNIKTNNDDAYNGVFGLGAYPHITMATQLG----NKFSYCIGD 259
Query: 167 PDKSF--HSRLEFGD-----------QI-------------IAGKSLNLPPNSFTIKLNG 200
+ H+ L G QI + K+L + PN+F I +G
Sbjct: 260 INNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDG 319
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN--LPARFNS 258
G + D G T + + +L E +D + +E++ T RK CF +
Sbjct: 320 SGGVLIDSGMTYTKLANGGFELLYDEIVDLM-KGLLERIPTQRKFEGLCFKGVVSRDLVG 378
Query: 259 FPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
FP++T+HF GADLV+E ++F + D F P+ + +++G Q N +D
Sbjct: 379 FPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFD 438
Query: 318 LD 319
L+
Sbjct: 439 LE 440
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 142/367 (38%), Gaps = 74/367 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + IG P + DT + LTW QC+PC+ CY+QN P+++ + +YK C +
Sbjct: 85 YFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDSIT 144
Query: 68 CKSPFHCFEG------DCFYGITYGDVYETK-----EVDSLDTSTLLPPDEPSPVSVQNI 116
C + EG C Y +YGD TK E S+D+S+ SPVS
Sbjct: 145 CNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSS------GSPVSFPGT 198
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH--SR 174
FGC + ++ +GI+GL S + QLG + +FS CL + + S
Sbjct: 199 AFGCGYNNGGTF---EETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSATTNGTSV 255
Query: 175 LEFG--------------------------------DQIIAGKSLNLPPN-----SFTIK 197
+ G + I GK+ LP S K
Sbjct: 256 INLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKT-KLPYTGGGGYSLNRK 314
Query: 198 LNGQRGCINDCGSVLTVIECEVY----AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP 253
I D G+ LT+++ Y AV+ D + + T CF
Sbjct: 315 SKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGILT------HCFKSG 368
Query: 254 ARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
+ P++T HF GAD+ + P N F+ +D + P I G Q +
Sbjct: 369 DKEIGLPTITMHFTGADVKLSPINSFVKLSEDIVCL----SMIPTTEVAIYGNMVQMDFL 424
Query: 314 FVYDLDT 320
YDL+T
Sbjct: 425 VGYDLET 431
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 151/360 (41%), Gaps = 65/360 (18%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
+++ + IG P + +DT + L W QC+PC +CY Q+ PI++ +++ C
Sbjct: 83 QAFLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRT 142
Query: 66 ASCKSP---FHCFEGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRFG 119
+ P F+ C Y + Y D +K + + L +T+ DE S ++ ++ FG
Sbjct: 143 SQYSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIY--DESSSAALHDVVFG 200
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSF-HSRLEF 177
C ++ + + GI+GL + S + + G +FS C D S+ H+ L
Sbjct: 201 CGHDNYG----EPLVGTGILGLGYGEFSLVHRFGT----KFSYCFGSLDDPSYPHNVLVL 252
Query: 178 GD-----------------------QIIAGKSLNLPPNSFTIKLNGQR---GCINDCGSV 211
GD + I+ + LP + + N Q G I D G+
Sbjct: 253 GDDGANILGDTTPLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNS 312
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG------VTCFNLPARFN----SFPS 261
LT + E Y L + DYF E FT V C+N + FP
Sbjct: 313 LTSLVEEAYKPLKNKIEDYF-----EGRFTAADVNQDDMFKVECYNGNLERDLVESGFPI 367
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+T+HF GA+L ++ ++VF+ + F A TP +I GA Q + YDL+
Sbjct: 368 VTFHFSDGAELSLDVKSVFMKLSPNVFCL----AVTPGNMNSI-GATAQQSYNIGYDLEA 422
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 140/340 (41%), Gaps = 59/340 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ IG P L+ L+DT W QC+PCK C Q P+++ +YK +PC
Sbjct: 90 YVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPI 149
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
CK+ + G VD+L L + +P+S +NI GC ++
Sbjct: 150 CKNADGHYLG----------------VDTLT----LNSNNGTPISFKNIVIGCGHRNQGP 189
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFGDQ-IIAG 184
+ + ++G +GL SF+ QL + +FS CLV ++ S+L FGD+ ++G
Sbjct: 190 L---EGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVSG 246
Query: 185 KSLNLPP----NSFTIKLNG--------------QRG-CINDCGSVLTVIECEVYAVLTA 225
P N + + L RG I D G+ +T++ +VY+ L +
Sbjct: 247 LGTVSTPIKEENGYFVSLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTILPKDVYSRLES 306
Query: 226 EFIDYFSQHDI----EKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVF-- 279
+D + ++ C + T +T HF G+++ + N F
Sbjct: 307 VVLDMVKLKRVKDPSQQFNLCYQTTSTTL-----LTKVLIITAHFSGSEVHLNALNTFYP 361
Query: 280 IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
I + F F G F+ I G Q N +DL+
Sbjct: 362 ITDEVICFAFVSGGNFS---SLAIFGNVVQQNFLVGFDLN 398
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 135/366 (36%), Gaps = 78/366 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+G+G P +LDT + + W QC PC+ CYEQ+ +++ R +SY + C
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPL 199
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C Y + YGD T + +T T V + GC +
Sbjct: 200 CRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFA-----GGARVARVALGCGHD 254
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-----SRLEFG 178
++ ++ G S SF Q+ R FS CLV S + S + FG
Sbjct: 255 NEGLFVAAAGLLGLGRG----SLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFG 310
Query: 179 DQI--------------------------------------IAGKSLNLPPNSFTIKLNG 200
+A L L P+S G
Sbjct: 311 SGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSS------G 364
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPAR- 255
+ G I D G+ +T + Y+ L F + + LF TC++L R
Sbjct: 365 RGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFD------TCYDLSGRK 418
Query: 256 FNSFPSMTYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
P+++ HF GA+ + PEN I + + +F F F G +I+G Q +
Sbjct: 419 VVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFA---GTDGGVSIIGNIQQQGFR 475
Query: 314 FVYDLD 319
V+D D
Sbjct: 476 VVFDGD 481
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/302 (24%), Positives = 124/302 (41%), Gaps = 42/302 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QC+PC CY+Q + +++ +Y + C
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAP 220
Query: 67 SCKSPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C + C G C YG+ YGD + ++DT TL D +++ RFGC +
Sbjct: 221 ACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AIKGFRFGCGERN 275
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQIIA 183
+ AG++GL TS VQ F+ C P +S + L+FG +
Sbjct: 276 EGLYGEA----AGLLGLGRGKTSLPVQAYDKYGGVFAHCF--PARSSGTGYLDFGPGSLP 329
Query: 184 GKSLNLP--------PNSFTIKLNGQR----------------GCINDCGSVLTVIECEV 219
S L P + + L G R G I D G+V+T +
Sbjct: 330 AVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITRLPPAA 389
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPEN 277
Y+ L + F ++ +K TC++ + P+++ FQ GA L V
Sbjct: 390 YSSLRSAFASAMAERGYKKAPALSLLD-TCYDFTGMSEVAIPTVSLLFQGGASLDVHASG 448
Query: 278 VF 279
+
Sbjct: 449 II 450
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 125/307 (40%), Gaps = 52/307 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QC+PC CYEQ + +++ + + C
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAP 245
Query: 67 SCKSPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C + C G C YG+ YGD + ++DT TL D +++ RFGC +
Sbjct: 246 ACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AIKGFRFGCGERN 300
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQI-- 181
+ AG++GL TS VQ F+ C P +S + L+FG
Sbjct: 301 EGLFGEA----AGLLGLGRGKTSLPVQAYDKYGGVFAHCF--PARSSGTGYLDFGPGSSP 354
Query: 182 ---------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+ GK L++PP+ FT G I D G+V+T
Sbjct: 355 AVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTA-----GTIVDSGTVITR 409
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLV 272
+ Y+ L + F + +K TC++ + P+++ FQ GA L
Sbjct: 410 LPPAAYSSLRSAFASAIAARGYKKAPALSLLD-TCYDFTGMSQVAIPTVSLLFQGGASLD 468
Query: 273 VEPENVF 279
V+ +
Sbjct: 469 VDASGII 475
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 130/343 (37%), Gaps = 57/343 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P K+ L+D+ + ++W QC+PC C+ Q DP+++ +Y C A+
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAA 190
Query: 68 CKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C G C Y + Y D T S DT L ++ N +FGCS
Sbjct: 191 CAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL------GSNTISNFQFGCSH 244
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI- 181
F + G+MGL + S Q FS CL P S L G
Sbjct: 245 VESGF----NDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCL-PPTPSSSGFLTLGAGTS 299
Query: 182 -------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+ G L++P + F+ G + D G+++T +
Sbjct: 300 GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSA------GMVMDSGTIITRLP 353
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEP 275
Y+ L++ F Q+ + R TCF+ + + PS+ F G +V
Sbjct: 354 RTAYSALSSAFKAGMKQY---RPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLD 410
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
N I + +F A + I+G Q + +YD+
Sbjct: 411 ANGIILGNCLAF-----AANSDDSSPGIVGNVQQRTFEVLYDV 448
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 88/347 (25%), Positives = 131/347 (37%), Gaps = 57/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ +LG+G P S ++DT + LTW QC PC SC+ Q P+++ R+ +Y + C +
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSS 190
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C + C Y +YGD + S DT + P +
Sbjct: 191 ECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFGSGSFP------GFYY 244
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------- 164
GC +++ AG++GL + S + QL + FS CL
Sbjct: 245 GCGQDNEGLFGRS----AGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTSSAAAGYLSIGS 300
Query: 165 ---------VQPDKSFHSRLEF---GDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
S + L F +AG L +PP+ + I D G+V+
Sbjct: 301 YNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYR-----SLPTIIDSGTVI 355
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADL 271
T + VY L+ + TCF A P + F GA L
Sbjct: 356 TRLPPNVYTALSRAVA--AAMASAAPRAPTYSILDTCFRGSAAGLRVPRVDMAFAGGATL 413
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ P NV I + DS AF P G I+G Q VYD+
Sbjct: 414 ALSPGNVLI-DVDDSTTCL---AFAPTGGTAIIGNTQQQTFSVVYDV 456
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 137/339 (40%), Gaps = 47/339 (13%)
Query: 12 LGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDASCK- 69
+G+G P ++DT + LTW QC PC SC+ Q+ P++N +S +Y + C C
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 70 ------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
+P C + C Y +YGD + S DT + S+ N +GC
Sbjct: 61 LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSLPNFYYGCGQ 114
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------VQPDK 169
+++ AG++GL + S + QL + F+ CL P +
Sbjct: 115 DNEGLFGRS----AGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNPGQ 170
Query: 170 SFHSRL---EFGDQI----IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAV 222
++ + D + ++G ++ P S + I D G+V+T + VY+
Sbjct: 171 YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSA 230
Query: 223 LTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVFIF 281
L+ ++ TCF A S P++T F GA L + +N+ +
Sbjct: 231 LSKAVAAAMKGTSRASAYSILD---TCFKGQASRVSAPAVTMSFAGGAALKLSAQNLLV- 286
Query: 282 NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ DS AF P + I+G Q VYD+ +
Sbjct: 287 DVDDSTTCL---AFAPARSAAIIGNTQQQTFSVVYDVKS 322
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 144/348 (41%), Gaps = 51/348 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++ +G P + ++DT + + W QC+PC+ CY+Q PI++ K+YK LPC +
Sbjct: 91 YLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNT 150
Query: 68 CKSPFHCF---EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+S + + C Y I YGD + S++T TL D S V GC +
Sbjct: 151 CESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDG-SSVHFPKTVIGCGHNN 209
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFGD-QI 181
+ Q++ I+GL S + QL + +FS CL + + S+L FGD +
Sbjct: 210 GG--TFQEEGSG-IVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAV 266
Query: 182 IAGKSLNLPPNSFTIKLNGQ-------------------------------RGCINDCGS 210
++G+ P LNGQ I D G+
Sbjct: 267 VSGRGTVSTPLD---PLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGT 323
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGAD 270
LT++ E Y L + D +E+ K C+ + P +T HF+GAD
Sbjct: 324 TLTLLPQEDYLNLESAVSDVIK---LERARDPSKLLSLCYKTTSDELDLPVITAHFKGAD 380
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + P + F+ + F AF K I G Q N YDL
Sbjct: 381 VELNPISTFVPVEKGVVCF----AFISSKIGAIFGNLAQQNLLVGYDL 424
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 102/253 (40%), Gaps = 48/253 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G+G P K + DT + LTWTQC+PC K+CY+Q +P + SYK + C A
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSA 192
Query: 67 SCK-----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
CK C C Y + YGD + + +T TL S +N FGC
Sbjct: 193 FCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTL-----SSSNVFKNFLFGCG 247
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI 181
++ + AG++GL S Q + FS CL S L FG Q+
Sbjct: 248 QQNSGLF----RGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSS-KGYLSFGGQV 302
Query: 182 ---------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+ G L++ + F+ G + D G+V+T
Sbjct: 303 SKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFST-----SGTVIDSGTVITR 357
Query: 215 IECEVYAVLTAEF 227
+ Y+ L++ F
Sbjct: 358 LPSTAYSALSSAF 370
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 108/281 (38%), Gaps = 55/281 (19%)
Query: 2 FTLNHT-YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSY 58
++L T Y++ + IG P + +DT + ++W QC PC +SC Q D +++ +Y
Sbjct: 122 YSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATY 181
Query: 59 KKLPCYDASCK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C A C C + C Y + YGD T DT +L D +V+
Sbjct: 182 SAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSD-----AVK 236
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR 174
+ +FGCS + FV + G+MGL D+ S + Q FS CL P S
Sbjct: 237 SFQFGCSHRAAGFVG----ELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGF 292
Query: 175 LEFGDQ-----------------------------IIAGKSLNLPPNSFTIKLNGQRGCI 205
L G +AG LN+P + F+ +
Sbjct: 293 LTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFS------GASV 346
Query: 206 NDCGSVLTVIECEVYAVLTAEFID----YFSQHDIEKLFTC 242
D G+V+T + Y L F Y S + L TC
Sbjct: 347 VDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTC 387
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 82/372 (22%), Positives = 139/372 (37%), Gaps = 83/372 (22%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC +C+EQ+ P Y+ + S++ + C+D
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 256
Query: 68 CK--------SPFHCFEGDCFYGITYGDVYETKEVDSLDTST--LLPPDEPSPVS-VQNI 116
C+ P C Y YGD T +L+T T L P+ S + V+N+
Sbjct: 257 CQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENV 316
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSR 174
FGC ++ ++ SF Q+ L FS CLV + S S+
Sbjct: 317 MFGCGHWNRGLFHGAAGLLGLGK----GPLSFASQMQSLYGQSFSYCLVDRNSNASVSSK 372
Query: 175 LEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNGQ 201
L FG+ ++ + L +P ++ + G
Sbjct: 373 LIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEGA 432
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHD-IEKLFTCRKCGVTCFNLPARFNSFP 260
G I D G+ LT Y ++ F+ + +E L + C
Sbjct: 433 GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPC--------------- 477
Query: 261 SMTYHFQGADLVVEPENVFIFNHQ-------DSFFFFFGPAFT-------PRKGKTILGA 306
Y+ G + + P+ +F + +++F + P PR +I+G
Sbjct: 478 ---YNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGN 534
Query: 307 RHQHNTQFVYDL 318
Q N +YD+
Sbjct: 535 YQQQNFHILYDM 546
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 121/318 (38%), Gaps = 58/318 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +G+G P ++DT + L W QC PC+ CY Q +++ R +Y+++PC
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145
Query: 68 CKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C++ G C Y + YGD + L T L ++ V N+ GC
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGD--GSSSTGDLATDKLAFAND---TYVNNVTLGC 200
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+++ AG++G+ S Q+ F CL D++ SR
Sbjct: 201 GRDNEGLF----DSAAGLLGVGRGKISISTQVAPAYGSVFEYCLG--DRT--SRSTRSSY 252
Query: 181 IIAGKSLNLPPNSFTIKLN------------------------------------GQRGC 204
++ G++ P +FT L+ G+ G
Sbjct: 253 LVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGV 312
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR-FNSFPSMT 263
+ D G+ ++ + YA L F + +L C++L R S P +
Sbjct: 313 VVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIV 372
Query: 264 YHFQ-GADLVVEPENVFI 280
HF GAD+ + PEN F+
Sbjct: 373 LHFAGGADMALPPENYFL 390
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 128/317 (40%), Gaps = 63/317 (19%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--P 71
+G P ++ + DT + L W QC PC CY Q PI++ +Y+ + C +
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 72 FHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVS 129
C EGD C Y TYGD TK S D P + V V + FGCS ++K +
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTR-TIVEVGYLTFGCSHDTKARLK 181
Query: 130 IQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP-DKSFHSRLEFGDQ--IIAGKS 186
+ AG++GLN S + QL +FS C+V P D SR+ FG + I+ GK+
Sbjct: 182 GHQ---AGVVGLNRHPNSLVSQLKV---KKFSYCMVIPDDHGSGSRMYFGSRAVILGGKT 235
Query: 187 LNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG 246
+L ++ YF T + G
Sbjct: 236 ----------------------------------PLLKGDYSHYF--------VTLK--G 251
Query: 247 VTCFNLPARFNSF----PSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT 302
++ R + P +T+HF GAD ++ ++ + + + + RK +
Sbjct: 252 ISVGEEKGRSDELASAGPDITFHFYGADFILTKXTTYVEVEKGLWCLAMLSSNSTRK-LS 310
Query: 303 ILGARHQHNTQFVYDLD 319
ILG Q N YDL+
Sbjct: 311 ILGNIQQQNYHVGYDLE 327
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 53/121 (43%), Gaps = 11/121 (9%)
Query: 35 QCQPCKSCYEQNDPIYNSRSFKSYKKLP-----CYDASCKSPFHCFEGDCFYGITYGDVY 89
+ Q C+ Q PI++ +Y +P CY A + H E DC Y I+YG
Sbjct: 327 EAQEVAQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYA-CHIDEEDCCYRISYGSGS 385
Query: 90 ETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQ-KKIIAGIMGLNWDSTSF 148
+ E + + + V V ++ FGCS D+ + K GI+GLN DS S
Sbjct: 386 TSTEGTISIDAFAFEDNRQNMVDVXHLVFGCS----DYTTGTFKGYEVGIVGLNQDSLSL 441
Query: 149 M 149
+
Sbjct: 442 V 442
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 126/304 (41%), Gaps = 49/304 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC +CYEQ + +++ S +Y + C
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 242
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C YG+ YGD + ++DT TL D +V+ RFGC E
Sbjct: 243 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCG-ER 296
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQI-- 181
D + + AG++GL TS VQ F+ CL P +S + L+FG
Sbjct: 297 NDGLFGEA---AGLLGLGRGKTSLPVQTYGKYGGVFAHCL--PARSTGTGYLDFGAGSPP 351
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ L + P+ F G I D G+V+T +
Sbjct: 352 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 406
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEP 275
Y+ L + F + K TC++ + P+++ FQ GA L V+
Sbjct: 407 AAYSSLRSAFAAAMAARGYRKAAAVSLLD-TCYDFTGMSQVAIPTVSLLFQGGAALDVDA 465
Query: 276 ENVF 279
+
Sbjct: 466 SGIM 469
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 136/361 (37%), Gaps = 73/361 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKK------ 60
Y +K+G+G P K ++DT + L+W QCQPC C+ Q DPI+ K+YK
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSS 166
Query: 61 --LPCYDASCKSPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
++ +P G C Y +YGD + S D TL P PS
Sbjct: 167 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSAAPS----SGFV 222
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-----VQPDKSFH 172
+GC +++ AGI+GL D S + QL + FS CL QP+ S
Sbjct: 223 YGCGQDNQGLFGRS----AGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVS 278
Query: 173 SRLEFGDQ-----------------------------IIAGKSLNLPPNSFTIKLNGQRG 203
L G +AGK L + +S+ +
Sbjct: 279 GFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVP------ 332
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEK-----LFTCRKCGVTCFNLPARFNS 258
I D G+V+T + +Y L F+ S+ + L TC K V ++
Sbjct: 333 TIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSV------KEMST 386
Query: 259 FPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P + F+ GA L ++ N + + + + P +I+G Q YD
Sbjct: 387 VPEIRIIFRGGAGLELKVHNSLVEIEKGTTCLAIAASSNPI---SIIGNYQQQTFTVAYD 443
Query: 318 L 318
+
Sbjct: 444 V 444
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 126/304 (41%), Gaps = 49/304 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC +CYEQ + +++ S +Y + C
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 238
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C YG+ YGD + ++DT TL D +V+ RFGC E
Sbjct: 239 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCG-ER 292
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEFGDQI-- 181
D + + AG++GL TS VQ F+ CL P +S + L+FG
Sbjct: 293 NDGLFGEA---AGLLGLGRGKTSLPVQTYGKYGGVFAHCL--PARSTGTGYLDFGAGSPP 347
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ L + P+ F G I D G+V+T +
Sbjct: 348 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 402
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEP 275
Y+ L + F + K TC++ + P+++ FQ GA L V+
Sbjct: 403 AAYSSLRSAFAAAMAARGYRKAAAVSLLD-TCYDFTGMSQVAIPTVSLLFQGGAALDVDA 461
Query: 276 ENVF 279
+
Sbjct: 462 SGIM 465
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/347 (25%), Positives = 141/347 (40%), Gaps = 58/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P KS ++DT + LTW QC PC SC+ Q+ P++N RS SY + C
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAP 180
Query: 67 SCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C + P C + C Y +YGD + S DT + SV N +
Sbjct: 181 QCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSVPNFYY 234
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV------------- 165
GC +++ AG++GL + S + QL + FS CL
Sbjct: 235 GCGQDNEGLFGQS----AGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSIGS 290
Query: 166 ----QPDKSFHSRLEFGDQI---------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
Q + ++ D + +AGK L++ ++++ I D G+V+
Sbjct: 291 YNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYS-----SLPTIIDSGTVI 345
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQG-ADL 271
T + +VY+ L+ F+ TCF A P ++ F G A L
Sbjct: 346 TRLPTDVYSALSKAVAGAMKGTPRASAFSILD---TCFQGQASRLRVPQVSMAFAGGAAL 402
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ N+ + DS AF P + I+G Q VYD+
Sbjct: 403 KLKATNLLV--DVDSATTCL--AFAPARSAAIIGNTQQQTFSVVYDV 445
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 121/318 (38%), Gaps = 58/318 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +G+G P ++DT + L W QC PC+ CY Q +++ R +Y+++PC
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145
Query: 68 CKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C++ G C Y + YGD + L T L ++ V N+ GC
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGD--GSSSTGELATDKLAFAND---TYVNNVTLGC 200
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+++ AG++G+ S Q+ F CL D++ SR
Sbjct: 201 GRDNEGLF----DSAAGLLGVARGKISISTQVAPAYGSVFEYCLG--DRT--SRSTRSSY 252
Query: 181 IIAGKSLNLPPNSFTIKLN------------------------------------GQRGC 204
++ G++ P +FT L+ G+ G
Sbjct: 253 LVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGV 312
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR-FNSFPSMT 263
+ D G+ ++ + YA L F + +L C++L R S P +
Sbjct: 313 VVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIV 372
Query: 264 YHFQ-GADLVVEPENVFI 280
HF GAD+ + PEN F+
Sbjct: 373 LHFAGGADMALPPENYFL 390
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 81/347 (23%), Positives = 141/347 (40%), Gaps = 50/347 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
+++ +G G P ++ +DT + ++W QC PC CY+Q+DP+++ +Y +PC
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGHP 220
Query: 67 SCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + G C Y +TYGD T V S +T +L S + FGC +
Sbjct: 221 QCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSL-----SSTRDLPGFAFGCGQTN 275
Query: 125 --------------KDFVSIQKKIIAGIMGL------NWDSTSFMVQLGRLVPD------ 158
+ +S+ + A ++D+T + +G P
Sbjct: 276 LGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDTTHGYLTMGSTTPAASNDDD 335
Query: 159 --RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+++ + + D +E I G L +PP FT + G + D G++LT +
Sbjct: 336 DVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT-----RDGTLFDSGTILTYLP 390
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTYHFQ-GADLVVE 274
E YA L F +Q+ + TC++ F P++ + F GA +
Sbjct: 391 PEAYASLRDRFKFTMTQYKPAPAYDPFD---TCYDFTGHNAIFMPAVAFKFSDGAVFDLS 447
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGK---TILGARHQHNTQFVYDL 318
P + I+ D+ AF PR I+G Q T+ +YD+
Sbjct: 448 PVAILIYP-DDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDV 493
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 135/359 (37%), Gaps = 62/359 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + + +LDT + L WTQC+PC C+ + + + ++ LPC
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPV 474
Query: 68 CKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C + + C Y Y D T +T T D +V ++ FGC
Sbjct: 475 CDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGC 534
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC----------------- 163
L + + + GI G + S QL D FS C
Sbjct: 535 GLFNNGIFTSNET---GIAGFGRGALSLPSQLKV---DNFSHCFTAITGSEPSSVLLGLP 588
Query: 164 ---------------LVQPDKSFHS-RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
LVQ S + L + L +P ++F +K +G G I D
Sbjct: 589 ANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIID 648
Query: 208 CGSVLTVIECEVYAVLTAEF-------IDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SF 259
G+ +T + + Y ++ F +D + + +L F++P R
Sbjct: 649 SGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRL-------CFSFSVPRRAKPDV 701
Query: 260 PSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P + HF+GA L + EN ++F +D+ A TI+G Q N +YDL
Sbjct: 702 PKLVLHFEGATLDLPREN-YMFEFEDAGGSVTCLAINAGDDLTIIGNYQQQNLHVLYDL 759
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 87/352 (24%), Positives = 130/352 (36%), Gaps = 66/352 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G+G P K + DT + LTWTQC+PC KSCY Q + I+N SY + C
Sbjct: 153 YFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGST 212
Query: 67 SCKS-------PFHCFEGDCFYGITYGDV-----YETKEVDSLDTSTLLPPDEPSPVSVQ 114
C S F+C C YGI YGD + KE SL + +
Sbjct: 213 LCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATDVF----------N 262
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL---------- 164
+ FGC +K ++ L D S + Q + FS CL
Sbjct: 263 DFYFGCGQNNKGLFGGAAGLLG----LGRDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFL 318
Query: 165 -----------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
+ SF+ L+ + G+ L + P+ F+ G I D
Sbjct: 319 TFGGSTSKSASFTPLATISGGSSFYG-LDLTGISVGGRKLAISPSVFSTA-----GTIID 372
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF 266
G+V+T + Y+ L++ F SQ+ + TCF+ S P + F
Sbjct: 373 SGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILD---TCFDFSNHDTISVPKIGLFF 429
Query: 267 QGADLV-VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
G +V ++ +F N F + I G Q + VYD
Sbjct: 430 SGGVVVDIDKTGIFYVNDLTQVCLAFA-GNSDASDVAIFGNVQQKTLEVVYD 480
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 135/349 (38%), Gaps = 62/349 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ +LG+G P S ++DT + LTW QC PC SC+ Q P+++ R+ +Y + C +
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSAS 193
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR- 117
C +P C + C Y +YGD + V SL T T VS + R
Sbjct: 194 QCDELQAATLNPSACSASNVCIYQASYGD--SSFSVGSLSTDT---------VSFGSTRY 242
Query: 118 ----FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
+GC +++ AG++GL + S + QL + FS CL P +
Sbjct: 243 PSFYYGCGQDNEGLFGRS----AGLIGLARNKLSLLYQLAPSLGYSFSYCL--PTAASTG 296
Query: 174 RLEFGDQ-----------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
L G ++G S+ P + + I D G+
Sbjct: 297 YLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGT 356
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GA 269
V+T + V+ L+ + F+ TCF A P++ F GA
Sbjct: 357 VITRLPTAVHTALSKAVAQAMAGAQRAPAFSILD---TCFEGQASQLRVPTVAMAFAGGA 413
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + NV I + DS AF P I+G Q +YD+
Sbjct: 414 SMKLTTRNVLI-DVDDSTTCL---AFAPTDSTAIIGNTQQQTFSVIYDV 458
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 76/160 (47%), Gaps = 21/160 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + L F+ DT + LTWTQC+PC + CY Q +PI+N SY + C
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 67 SC--------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
+C SP C C YGI YGD + + D L D N F
Sbjct: 198 TCDELKSGTGNSP-SCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDV-----FNNFLF 251
Query: 119 GCSLESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVP 157
GC ++ FV +AG++GL ++ S M + + P
Sbjct: 252 GCGQNNRGLFVG-----VAGLIGLGRNALSLMSKYPKAAP 286
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 137/367 (37%), Gaps = 63/367 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P + ++DT + L W QC PC C+EQ P+++ + SY+ + C D
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHR 210
Query: 68 C--------------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
C ++ E C Y YGD T +L++ T+ + V
Sbjct: 211 CGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV 270
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
+ FGC ++ ++ L SF QL + FS CLV S
Sbjct: 271 DGVVFGCGHRNRGLFHGAAGLLG----LGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGS 326
Query: 174 RLEFGDQ--------------------------------------IIAGKSLNLPPNSFT 195
++ FG+ ++ G+ LN+ +++
Sbjct: 327 KVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSDTWD 386
Query: 196 IKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQ-HDIEKLFTCRKCGVTCFNLPA 254
+ +G G I D G+ L+ Y V+ F+D S+ + + F C+N+
Sbjct: 387 VGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLS---PCYNVSG 443
Query: 255 -RFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHN 311
P ++ F GA EN FI D TPR G +I+G Q N
Sbjct: 444 VERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSIIGNFQQQN 503
Query: 312 TQFVYDL 318
VYDL
Sbjct: 504 FHVVYDL 510
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 131/350 (37%), Gaps = 46/350 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ +G+G P + L + DT + L+W QC PC S CY+Q DP++ ++ + C
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGA 213
Query: 66 ASCKSPFHC--FEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN------ 115
C++ C GD C Y + YGD T+ DT T L P+ S +N
Sbjct: 214 RECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLT-LGTMAPANASAENDNKLPG 272
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
FGC + G+ GL S Q + FS CL S L
Sbjct: 273 FVFGCGENNTGLFGQAD----GLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYL 328
Query: 176 EFGDQIIAGK--------SLNLPPNSFTIKLNGQRGC---------------INDCGSVL 212
G + A + P+ + +KL G R I D G+V+
Sbjct: 329 SLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVI 388
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN---SFPSMTYHFQ-G 268
T + Y L A F+ ++ ++ TC++ A N S P++ F G
Sbjct: 389 TRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILD-TCYDFTAHANATVSIPAVALVFAGG 447
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
A + V+ V F P R ILG Q VYD+
Sbjct: 448 ATISVDFSGVLYVAKVAQACLAFAPNGDGRSAG-ILGNTQQRTLAVVYDV 496
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 146/367 (39%), Gaps = 70/367 (19%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+ TLN+ + LG G+ ++DT + LTW QC PC+SC++Q P+++ S SY
Sbjct: 138 LRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAA 193
Query: 61 LPCYDASCKSPFH------------CFEGD---CFYGITYGDVYETKEVDSLDTSTLLPP 105
+PC SC + C G C Y ++Y D ++ V + D +L
Sbjct: 194 VPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL--- 250
Query: 106 DEPSPVSVQNIRFGCSLESKD--FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC 163
+ + FGC ++ F +G+MGL S + Q FS C
Sbjct: 251 ---AGEVIDGFVFGCGTSNQGPPFGG-----TSGLMGLGRSQLSLVSQTVDQFGGVFSYC 302
Query: 164 L-VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFT---------------------IKLNGQ 201
L + + L GD A + N P +T I + GQ
Sbjct: 303 LPLSRESDASGSLVLGDDPSAYR--NSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ 360
Query: 202 R--------GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP 253
I D G+V+T + VY + AEF+ +++ F+ TCFN+
Sbjct: 361 EVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILD---TCFNMT 417
Query: 254 A-RFNSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQH 310
+ PS+T F GA++ V+ V F DS A + +T I+G Q
Sbjct: 418 GLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQK 477
Query: 311 NTQFVYD 317
N + V+D
Sbjct: 478 NLRVVFD 484
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 143/351 (40%), Gaps = 63/351 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC---KSCYEQNDPIYNSRSFKSYKKLPCY 64
Y ++G+G PV+S +F+ DT + ++W QCQPC CY+Q PI++ +S SY L C
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 65 DASCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C C Y + YGD T V L T T S+ N+ GC
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFT--VGELATETF---SFRHSNSIPNLPIGCGH 298
Query: 123 ESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-GDQ 180
+++ FV I G ++ S +L FS CLV D S L+F DQ
Sbjct: 299 DNEGLFVGAAGLIGLGGGAISLSS--------QLEATSFSYCLVDLDSESSSTLDFNADQ 350
Query: 181 --------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+ GK L + +SF I +G G I D G+ +T
Sbjct: 351 PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITE 410
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGA 269
I +VY VL F+ GV TC++L ++ N P++ + G
Sbjct: 411 IPSDVYDVLRDAFVGLTKNLPPAP-------GVSPFDTCYDLSSQSNVEVPTIAFILPGE 463
Query: 270 DLVVEPEN--VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + P +F + +F F P+ P +I+G Q + YDL
Sbjct: 464 NSLQLPAKNCLFQVDSAGTFCLAFLPSTFPL---SIIGNVQQQGIRVSYDL 511
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 146/363 (40%), Gaps = 64/363 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
+ + + IG P ++ + DT + LTW QC+PC+ CY++N PI++ + +YK PC +
Sbjct: 85 FFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRN 144
Query: 68 C----KSPFHCFEGD--CFYGITYGDVYETK-----EVDSLDTSTLLPPDEPSPVSVQNI 116
C S C E C Y +YGD +K E S+D+++ SPVS
Sbjct: 145 CHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSAS------GSPVSFPGT 198
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD-------- 168
FGC + + +GI+GL S + QLG + +FS CL
Sbjct: 199 VFGCGYNNGGTF---DETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSV 255
Query: 169 -----KSFHSRLEFGDQIIAGKSLNLPPNSF------TIKLNGQR-------------GC 204
S S L +I+ ++ P ++ I + ++ G
Sbjct: 256 INLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGI 315
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRK-----CGVT--CFNLPARFN 257
++ + + +L + F D F +E+L T K G+ CF +
Sbjct: 316 FSETSGNIIIDSGTTLTLLDSGFFDKFGAA-VEELVTGAKRVSDPQGLLSHCFKSGSAEI 374
Query: 258 SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P +T HF GAD+ + P N F+ +D + P I G Q + YD
Sbjct: 375 GLPEITVHFTGADVRLSPINAFVKVSEDMVCL----SMVPTTEVAIYGNFAQMDFLVGYD 430
Query: 318 LDT 320
L+T
Sbjct: 431 LET 433
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 140/337 (41%), Gaps = 47/337 (13%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC-- 63
+ Y++KL IG P + +LDT + WTQC PC CY Q PI++ ++K++ C
Sbjct: 57 YEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT 116
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
+D SC Y + YG TK +T T + P + GC
Sbjct: 117 HDHSCP-----------YELVYGGKSYTKGTLVTETVT-IHSTSGQPFVMPETIIGCGRN 164
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-DQII 182
+ F K AG++GL+ S + Q+G P S C S++ FG + I+
Sbjct: 165 NSGF----KPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGT---SKINFGANAIV 217
Query: 183 AGKSL--------NLPPNSFTIKLNG---QRGCINDCGSVLTVIECEVYAVLTAEFIDYF 231
AG + P + + L+ I G+ ++ + + + + YF
Sbjct: 218 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNI-VIDSGSTLTYF 276
Query: 232 SQ-------HDIEKLFTCR---KCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVFI 280
+ +E++ T + + C+ + FP +T HF GADLV++ N+++
Sbjct: 277 PESYCNLVRKAVEQVVTAVRFPRSDILCY-YSKTIDIFPVITMHFSGGADLVLDKYNMYV 335
Query: 281 FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
++ F +P + + I G R Q+N YD
Sbjct: 336 ASNTGGVFCLAIICNSPIE-EAIFGNRAQNNFLVGYD 371
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 140/337 (41%), Gaps = 47/337 (13%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC-- 63
+ Y++KL IG P + +LDT + WTQC PC CY Q PI++ ++K++ C
Sbjct: 63 YEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT 122
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
+D SC Y + YG TK +T T + P + GC
Sbjct: 123 HDHSCP-----------YELVYGGKSYTKGTLVTETVT-IHSTSGQPFVMPETIIGCGRN 170
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG-DQII 182
+ F K AG++GL+ S + Q+G P S C S++ FG + I+
Sbjct: 171 NSGF----KPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGT---SKINFGANAIV 223
Query: 183 AGKSL--------NLPPNSFTIKLNG---QRGCINDCGSVLTVIECEVYAVLTAEFIDYF 231
AG + P + + L+ I G+ ++ + + + + YF
Sbjct: 224 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNI-VIDSGSTLTYF 282
Query: 232 SQ-------HDIEKLFTCR---KCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVFI 280
+ +E++ T + + C+ + FP +T HF GADLV++ N+++
Sbjct: 283 PESYCNLVRKAVEQVVTAVRFPRSDILCY-YSKTIDIFPVITMHFSGGADLVLDKYNMYV 341
Query: 281 FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
++ F +P + + I G R Q+N YD
Sbjct: 342 ASNTGGVFCLAIICNSPIE-EAIFGNRAQNNFLVGYD 377
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 93/184 (50%), Gaps = 12/184 (6%)
Query: 3 TLNH-TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
+ NH Y+++L IG P ++ DT + L W QC PC +CY+Q +P+++S+S ++ +
Sbjct: 53 SANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNI 112
Query: 62 PCYDASCKSPFHCF----EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C SC + + +C Y +Y D ET+ V + +T TL PV+ + +
Sbjct: 113 ACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLT-STTGEPVAFKGVI 171
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG-RLVPDRFSCCLV--QPDKSFHSR 174
FGC + + ++ GI+GL S + Q+G L + FS CLV + S S
Sbjct: 172 FGCGHNNNGAFNDKE---MGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSP 228
Query: 175 LEFG 178
+ FG
Sbjct: 229 MSFG 232
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 87/192 (45%), Gaps = 21/192 (10%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+ + Y++KLG+G P +DT + L WTQCQPC CY+Q DP++N + SY
Sbjct: 81 VLSAGGEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAV 140
Query: 61 LPCYDASCKS--PFHCF-EGD------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
+PC +C C +GD C Y +YG T+ + ++D L D+
Sbjct: 141 VPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVD--RLAIGDD---- 194
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF 171
+ + FGCS S V ++G++GL + S + QL RF CL P
Sbjct: 195 VFRGVVFGCSSSS---VGGPPPQVSGVVGLGRGALSLVSQLSV---RRFMYCLPPPVSRS 248
Query: 172 HSRLEFGDQIIA 183
RL G A
Sbjct: 249 AGRLVLGADAAA 260
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 132/359 (36%), Gaps = 61/359 (16%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y + L +G P + ++DT + LTWTQC PC +C+ Q P+Y+ ++ KLPC
Sbjct: 95 AYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCAS 154
Query: 66 ASCK---SPFH-CFEGDCFYGITYGDVYETK--EVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C+ S F C C Y Y + D+L + S S + FG
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGDGDGDASS-SFAGVAFG 213
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-- 177
CS + +GI+GL + S + Q+G RFS CL + S + F
Sbjct: 214 CSTANGG----DMDGASGIVGLGRSALSLLSQIGV---GRFSYCLRSDADAGASPILFGA 266
Query: 178 -----GDQI-------------------------IAGKSLNLPPNS--FTIKLNGQRGCI 205
GD++ IA S +LP S F G G I
Sbjct: 267 LANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVI 326
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-----CFNLPARFNSFP 260
D G+ T + Y +L F+ L T R G CF A P
Sbjct: 327 VDSGTTFTYLAEAGYTMLRQAFLS-----QTAGLLT-RVSGAQFDFDLCFEAGAADTPVP 380
Query: 261 SMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ + F G P + F+ D P +G +++G Q + +YDLD
Sbjct: 381 RLVFRFAGGAEYAVPRQSY-FDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLD 438
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 135/366 (36%), Gaps = 78/366 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+G+G P +LDT + + W QC PC+ CY+Q+ +++ R +SY + C
Sbjct: 142 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAPL 201
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C Y + YGD T + +T T V I GC +
Sbjct: 202 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-----GGARVARIALGCGHD 256
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH-----SRLEFG 178
++ ++ G S SF Q+ R FS CLV S + S + FG
Sbjct: 257 NEGLFVAAAGLLGLGRG----SLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFG 312
Query: 179 DQI--------------------------------------IAGKSLNLPPNSFTIKLNG 200
+A L L P+S G
Sbjct: 313 SGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSS------G 366
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPAR- 255
+ G I D G+ +T + Y+ L F + + LF TC++L R
Sbjct: 367 RGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFD------TCYDLSGRK 420
Query: 256 FNSFPSMTYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
P+++ HF GA+ + PEN I + + +F F F G +I+G Q +
Sbjct: 421 VVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGT---DGGVSIIGNIQQQGFR 477
Query: 314 FVYDLD 319
V+D D
Sbjct: 478 VVFDGD 483
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 132/344 (38%), Gaps = 52/344 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ +LG+G P S ++DT + LTW QC PC SC+ Q P+++ R+ +Y + C +
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSAS 193
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C + C Y +YGD + S DT + PS +
Sbjct: 194 QCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTSYPS------FYY 247
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC +++ AG++GL + S + QL + FS CL P + L G
Sbjct: 248 GCGQDNEGLFGRS----AGLIGLARNKLSLLYQLAPSLGYSFSYCL--PTAASTGYLSIG 301
Query: 179 DQ-----------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
++G S+ P + + I D G+V+T +
Sbjct: 302 PYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVE 274
V+ L+ + F+ TCF A P++ F GA + +
Sbjct: 362 PTAVHTALSKAVAQAMAGAQRAPAFSILD---TCFEGQASQLRVPTVVMAFAGGASMKLT 418
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
NV I + DS AF P I+G Q +YD+
Sbjct: 419 TRNVLI-DVDDSTTCL---AFAPTDSTAIIGNTQQQTFSVIYDV 458
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 137/345 (39%), Gaps = 62/345 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P K L + DT + LTW +C S E DP ++ SY + C
Sbjct: 134 YIVSIGLGSPKKDLMLIFDTGSDLTWARC----SAAETFDPTKST----SYANVSCSTPL 185
Query: 68 CKS-------PFHCFEGDCFYGITYGD-----VYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C S P C C YGI YGD + KE ++ ++ + N
Sbjct: 186 CSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTDIF----------NN 235
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
FGC +D + K AG++GL D S + Q FS CL P S L
Sbjct: 236 FYFGC---GQDVDGLFGK-AAGLLGLGRDKLSVVSQTAPKYNQLFSYCL--PSSSSTGFL 289
Query: 176 EFG-DQIIAGK--SLNLPPNSF------TIKLNGQR-----------GCINDCGSVLTVI 215
FG Q + K L+ P+SF I + GQ+ G I D G+V+T +
Sbjct: 290 SFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRL 349
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGA-DLVV 273
Y+ L + F + + + K + TC++ + P + F G D+ V
Sbjct: 350 PPAAYSALRSAFRKAMASYPMGKPLSILD---TCYDFSKYKTIKVPKIVISFSGGVDVDV 406
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +F+ N F T + I G Q N + VYD+
Sbjct: 407 DQAGIFVANGLKQVCLAFA-GNTGARDTAIFGNTQQRNFEVVYDV 450
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 136/361 (37%), Gaps = 60/361 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC C+ QN Y+ ++ S+K + C D
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219
Query: 68 CK------SPFHCFEGD---CFYGITYGDVYETK---EVDSLDTSTLLPPDEPSPVSVQN 115
C P C E D C Y YGD T V++ + S V N
Sbjct: 220 CSLISSPDPPVQC-ESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGN 278
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHS 173
+ FGC ++ S ++ SF QL L FS CLV + + S
Sbjct: 279 MMFGCGHWNRGLFSGASGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSS 334
Query: 174 RLEFGDQ---------------------------------IIAGKSLNLPPNSFTIKLNG 200
+L FG+ ++ GK+L++P ++ I +G
Sbjct: 335 KLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDG 394
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-- 258
G I D G+ L+ Y ++ +F + ++ +F CFN+ +
Sbjct: 395 DGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKEN--YPIFRDFPVLDPCFNVSGIEENNI 452
Query: 259 -FPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
P + F G EN FI+ +D TP+ +I+G Q N +Y
Sbjct: 453 HLPELGIAFVDGTVWNFPAENSFIWLSED--LVCLAILGTPKSTFSIIGNYQQQNFHILY 510
Query: 317 D 317
D
Sbjct: 511 D 511
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 130/349 (37%), Gaps = 79/349 (22%)
Query: 23 FLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEGD--- 78
+LDT + L+W QCQPC C+ Q DP+Y+ K+YKKL C C D
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 79 ------CFYGITYGDV-----YETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C Y +YGD Y ++++ +L +S LP +GC +++
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLP----------QFTYGCGQDNQGL 110
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR------------- 174
AGI+GL D S + QL FS CL +
Sbjct: 111 FGRA----AGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSY 166
Query: 175 ----------------LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
L ++G+ L+L + + + D G+V+T +
Sbjct: 167 KFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVP------TLIDSGTVITRLPMS 220
Query: 219 VYAVLTAEFIDYFSQHDIEK-----LFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLV 272
+YA L F+ S + L TC K + ++ P + FQ GADL
Sbjct: 221 MYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLK------SISAVPEIKMIFQGGADLT 274
Query: 273 VEPENVFIFNHQD-SFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ ++ I + + F G + T + I+G R Q YD+ T
Sbjct: 275 LRAPSILIEADKGITCLAFAGSSGTNQIA--IIGNRQQQTYNIAYDVST 321
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 142/346 (41%), Gaps = 52/346 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + L F+ DT + LTWTQC+PC CY+Q + I++ + SY + C
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 206
Query: 67 SCK--------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
SC+ SP C C YGI YGD + + + +L D N +F
Sbjct: 207 SCEKLESATGNSP-GCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD-----VFNNFQF 260
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ AG++GL + S + Q + FS CL S L FG
Sbjct: 261 GCGQNNRGLFG----GTAGLLGLARNPLSLVSQTAQKYGKVFSYCL-PSSSSSTGYLSFG 315
Query: 179 DQIIAGKSLNLPPNSFT-----------IKLN-GQR------------GCINDCGSVLTV 214
K++ P+ + ++ G+R G I D G+V++
Sbjct: 316 SGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDSGTVISR 375
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQ-GADLV 272
+ VY+ + F + S + K + TC++L + P + +F GA++
Sbjct: 376 LPPTVYSSVQKVFRELMSDYPRVKGVSILD---TCYDLSKYKTVKVPKIILYFSGGAEMD 432
Query: 273 VEPEN-VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ PE +++ F G + I+G Q VYD
Sbjct: 433 LAPEGIIYVLKVSQVCLAFAGNSDDDE--VAIIGNVQQKTIHVVYD 476
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/354 (22%), Positives = 132/354 (37%), Gaps = 51/354 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ +G P + ++DT + L W QC PC C++Q P+++ + SY+ + C D
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTR 209
Query: 68 C------KSPFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C C Y YGD T +L+ T + S V +
Sbjct: 210 CGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFT-VNLTASSSRRVDGVVL 268
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ ++ SF QL + FS CLV + S++ FG
Sbjct: 269 GCGHRNRGLFHGAAGLLGLGR----GPLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVFG 324
Query: 179 DQ-------------------------------IIAGKSLNLPPNSFTI-KLNGQRGCIN 206
D ++ G+ L++P N++ + K +G G I
Sbjct: 325 DDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTII 384
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYH 265
D G+ L+ Y + F+D + L C+N+ P +
Sbjct: 385 DSGTTLSYFPEPAYKAIRQAFVDRMDK--AYPLIADFPVLSPCYNVSGVERVEVPEFSLL 442
Query: 266 F-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GA EN FI + TPR +I+G Q N +YDL
Sbjct: 443 FADGAVWDFPAENYFIRLDTEG-IMCLAVLGTPRSAMSIIGNYQQQNFHVLYDL 495
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 149/367 (40%), Gaps = 66/367 (17%)
Query: 4 LNHTYMLKLGIGDPV-KSLWFL-LDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
L +TY + + IG KS +FL LDT + L W +C C Q P+++ SY+ L
Sbjct: 70 LEYTYGVAVTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPL 129
Query: 62 PCYDASCKSPFHCF-EGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C++P GD C + + E DT L P P + ++ FG
Sbjct: 130 HPTSPLCRAPNPVLPAGDKCSFHLP----GEAHGYVGTDTIILGNPTLP----IHSVAFG 181
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ------------- 166
C+ ++ F + K AG +G+ TS ++Q+ V RFS CL+
Sbjct: 182 CAQSTEGFDT--KGTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRF 239
Query: 167 ----PDKSF--HSRLEF-----------GDQI----IAGKSLNLPP------NSFTIKLN 199
PD + H R++ D + G SLN P F + +
Sbjct: 240 GADIPDPTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSD 299
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFN-LPARFN 257
G GC D G+ +T + YAV+ Q +++ R + CF P ++
Sbjct: 300 GSGGCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRV---RDPNFSLCFREHPGIWS 356
Query: 258 SFPSMTYHFQG------ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHN 311
P +T F+G A L + N+F+ + FG T R T++GA Q +
Sbjct: 357 HIPKLTLDFEGPASRTVAHLEIVSRNLFL-KVDNQPLVCFGVYRTSRGSPTVVGAMQQVD 415
Query: 312 TQFVYDL 318
T+F++DL
Sbjct: 416 TRFIFDL 422
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 100/247 (40%), Gaps = 36/247 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ + +G P + +DT + ++W QC+PC S CY Q DP+++ SY +PC
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 201
Query: 66 ASCKS----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
ASC C G C Y ++YGD T V S DT TL + +++ FGC
Sbjct: 202 ASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSN-----ALKGFLFGCG 256
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI 181
+ + + G++GL S + Q FS CL S G
Sbjct: 257 HAQQGLFA----GVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSS 312
Query: 182 IAGKS------LNLPPNSFTIKLNG---------------QRGCINDCGSVLTVIECEVY 220
AG S + P + + L G G + D G+V+T + Y
Sbjct: 313 TAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAY 372
Query: 221 AVLTAEF 227
+ L + F
Sbjct: 373 SALRSAF 379
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/356 (24%), Positives = 136/356 (38%), Gaps = 52/356 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G+P K + +DT + + W C PC C + +N S + ++P
Sbjct: 89 YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148
Query: 63 CYDASCKSPFHCFEG----------DCFYGITYGDVYETKEVDSLDT---STLLPPDEPS 109
C D C + E C Y TYGD T DT T++ +E +
Sbjct: 149 CSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVM-GNEQT 207
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL--- 164
S ++ FGCS + + + GI G S + QL L P FS CL
Sbjct: 208 ANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKGS 267
Query: 165 ------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC 204
V+P F H L ++G+ LP +S + +G
Sbjct: 268 DNGGGILVLGEIVEPGLVFTPLVPSQPHYNLNLESIAVSGQ--KLPIDSSLFATSNTQGT 325
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMT 263
I D G+ L + Y FI+ + + + G+ CF + + SFP+ T
Sbjct: 326 IVDSGTTLVYLVDGAY----DPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPTAT 381
Query: 264 YHFQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F+G + V+PEN + + + +G TILG + FVYDL
Sbjct: 382 LYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVYDL 437
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 100/247 (40%), Gaps = 36/247 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ + +G P + +DT + ++W QC+PC S CY Q DP+++ SY +PC
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 190
Query: 66 ASCKS----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
ASC C G C Y ++YGD T V S DT TL + +++ FGC
Sbjct: 191 ASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSN-----ALKGFLFGCG 245
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI 181
+ + + G++GL S + Q FS CL S G
Sbjct: 246 HAQQGLFA----GVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSS 301
Query: 182 IAGKS------LNLPPNSFTIKLNG---------------QRGCINDCGSVLTVIECEVY 220
AG S + P + + L G G + D G+V+T + Y
Sbjct: 302 TAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAY 361
Query: 221 AVLTAEF 227
+ L + F
Sbjct: 362 SALRSAF 368
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 136/348 (39%), Gaps = 64/348 (18%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
+ Y++KL +G P + +DT + L WTQC PC CY Q DPI++ ++ + C+
Sbjct: 80 NIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHG 139
Query: 66 ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
SC Y I Y D +K + + +T T + P + GC L +
Sbjct: 140 KSCH-----------YEIIYEDNTYSKGILATETVT-IHSTSGEPFVMAETTIGCGLHNT 187
Query: 126 DF-VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
D S +GI+GLN S + Q+ P S C S++ FG I
Sbjct: 188 DLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCF---SGQGTSKINFGTNAIVA 244
Query: 185 KSLNLPPNSFTIKLN-------------------------GQRGCIN-DCGSVLT----- 213
+ + F K N + G I D GS +T
Sbjct: 245 GDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFPVS 304
Query: 214 ---VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GA 269
++ V V+TA + S +D+ F+ + FP +T HF GA
Sbjct: 305 YCNLVRKAVEQVVTAVRVPDPSGNDMLCYFS------------ETIDIFPVITMHFSGGA 352
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
DLV++ N+++ ++ F +P + + I G R Q+N YD
Sbjct: 353 DLVLDKYNMYMESNSGGLFCLAIICNSPTQ-EAIFGNRAQNNFLVGYD 399
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 135/337 (40%), Gaps = 46/337 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KL +G P + +DT + + WTQC PC +CY Q PI++ ++++ C S
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNGNS 480
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C Y I Y D +K + + +T T +P P + + GC L++ +
Sbjct: 481 CH-----------YEIIYADKTYSKGILATETVT-IPSTSGEPFVMAETKIGCGLDNTNL 528
Query: 128 -VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKS 186
S +GI+GLN S + Q+ P S C S++ FG I
Sbjct: 529 QYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF---SGQGTSKINFGTNAIVAGD 585
Query: 187 LNLPPNSFTIK------LNGQRGCINDCGSVLTVIECEVYAVLTAEFID------YFS-- 232
+ + F K LN + D +++ + +A FID YF
Sbjct: 586 GTVAADMFIKKDNPFYYLNLDAVSVED--NLIATLGTPFHAEDGNIFIDSGTTLTYFPMS 643
Query: 233 -----QHDIEKLFTCRKC------GVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVFI 280
+ +E++ T K + C+ + FP +T HF GADLV++ N+++
Sbjct: 644 YCNLVREAVEQVVTAVKVPDMGSDNLLCY-YSDTIDIFPVITMHFSGGADLVLDKYNMYL 702
Query: 281 FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F P + G R Q+N YD
Sbjct: 703 ETITGGIFCLAIGCNDPSM-PAVFGNRAQNNFLVGYD 738
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 134/357 (37%), Gaps = 66/357 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++LG+G P +++ +LDT + + W QC PCK+CY Q D I++ + K++ +PC
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRL 194
Query: 68 CKSPFHCFE------GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+ E C Y ++YGD T+ S +T T V ++ GC
Sbjct: 195 CRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF------HGARVDHVPLGCG 248
Query: 122 LESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD-----KSFHSRL 175
+++ FV + SF Q +FS CLV S +
Sbjct: 249 HDNEGLFVGAAGLLGL-----GRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTI 303
Query: 176 EFGDQIIAGKSLNLP-----------------------------PNSFTIKLNGQRGCIN 206
FG+ + S+ P + F + G G I
Sbjct: 304 VFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 363
Query: 207 DCGSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSM 262
D G+ +T + Y L F + LF TCF+L P++
Sbjct: 364 DSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFD------TCFDLSGMTTVKVPTV 417
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
+HF G ++ + N I + + F F AF G +I+G Q + YDL
Sbjct: 418 VFHFGGGEVSLPASNYLIPVNTEGRFCF---AFAGTMGSLSIIGNIQQQGFRVAYDL 471
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 140/366 (38%), Gaps = 66/366 (18%)
Query: 8 YMLKLGIGDPVK---SLWFLL--DTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
Y+ K+ +G P + S LL D + +TW QC PC CY Q P+YN S +
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVG 184
Query: 63 CYDASCK---SPFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
CY +C+ S C F +C Y + YGD + ++T T P V V +
Sbjct: 185 CYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTF-----PPGVRVPGVA 239
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV-QPDKSFHSRLE 176
GC +++ AGI+GL S SF Q+ FS CL Q S L
Sbjct: 240 IGCGSDNQGLFPAPA---AGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLT 296
Query: 177 FGDQIIAGKSLNLPPNSFTIKLN-----------------------------------GQ 201
FG A + PP+ + N G
Sbjct: 297 FGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGH 356
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-----TCF-NLPAR 255
G I D G+ +T + YA F D F +++L G TC+ ++ R
Sbjct: 357 GGVIVDSGTAVTRLSGPAYAA----FRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGR 412
Query: 256 -FNSFPSMTYHFQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
P+++ HF G ++ + P+N I + F A + +G +I+G +
Sbjct: 413 VMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFR 472
Query: 314 FVYDLD 319
VYD+D
Sbjct: 473 VVYDVD 478
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 135/357 (37%), Gaps = 66/357 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++LG+G P +++ +LDT + + W QC PCK+CY Q+D I++ + K++ +PC
Sbjct: 138 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRL 197
Query: 68 CKSPFHCFE------GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+ E C Y ++YGD T+ S +T T V ++ GC
Sbjct: 198 CRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF------HGARVDHVPLGCG 251
Query: 122 LESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD-----KSFHSRL 175
+++ FV + SF Q +FS CLV S +
Sbjct: 252 HDNEGLFVGAAGLLGL-----GRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTI 306
Query: 176 EFGDQIIAGKSLNLP-----------------------------PNSFTIKLNGQRGCIN 206
FG+ + S+ P + F + G G I
Sbjct: 307 VFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 366
Query: 207 DCGSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSM 262
D G+ +T + Y L F + LF TCF+L P++
Sbjct: 367 DSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSLFD------TCFDLSGMTTVKVPTV 420
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
+HF G ++ + N I + + F F AF G +I+G Q + YDL
Sbjct: 421 VFHFGGGEVSLPASNYLIPVNTEGRFCF---AFAGTMGSLSIIGNIQQQGFRVAYDL 474
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 138/345 (40%), Gaps = 38/345 (11%)
Query: 8 YMLKLGIGDPV-KSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ GIG P + + +DT + + WTQC+PC C+ Q P +++ + + + C D
Sbjct: 92 YLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDP 151
Query: 67 SCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C++ P CF G C Y + YGD T + D+ T V+V ++ FGC +
Sbjct: 152 ICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFT-FDGKGGGKVTVPDLVFGCGQYN 210
Query: 125 KDFVSIQKKIIAGI----------MGLN---------WDSTSFMVQLGRLVPDRFSCCLV 165
+ IAG +G++ ++S S V LG D
Sbjct: 211 TGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESKSTPVFLGGAPADGLRAHAT 270
Query: 166 Q--------PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
P+ + L + L +P ++F +K +G G I D G+ +T
Sbjct: 271 GPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPR 330
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS----FPSMTYHFQGADLVV 273
V+ L F+ T + CF+ + ++ P MT H +GAD +
Sbjct: 331 AVFRSLWEAFVAQVPLPHTSYNDTGEPT-LQCFSTESVPDASKVPVPKMTLHLEGADWEL 389
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
EN ++ + DS +T++G Q N V+DL
Sbjct: 390 PREN-YMAEYPDSDQLCV-VVLAGDDDRTMIGNFQQQNMHIVHDL 432
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 96/210 (45%), Gaps = 24/210 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y L + +G P K +W +LDT + L+W QC PC C+EQN Y + +Y+ + CYD
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPR 230
Query: 68 CK-----SPF-HCFEGD--CFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQNI 116
C+ P HC + C Y Y D T + +T T+ P + V ++
Sbjct: 231 CQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDV 290
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSR 174
FGC +K F +G++GL SF Q+ + FS CL + S S+
Sbjct: 291 MFGCGHWNKGFF----YGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSK 346
Query: 175 LEFGD--QIIAGKSLNLPPNSFTIKLNGQR 202
L FG+ +++ +LN FT L G+
Sbjct: 347 LIFGEDKELLNNHNLN-----FTTLLAGEE 371
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 144/351 (41%), Gaps = 63/351 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC---KSCYEQNDPIYNSRSFKSYKKLPCY 64
Y ++G+G PV+S +F+ DT + ++W QCQPC CY+Q PI++ +S SY L C
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 65 DASCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C C Y + YGD T V L T T S+ N+ GC
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFT--VGELATETF---SFRHSNSIPNLPIGCGH 298
Query: 123 ESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-GDQ 180
+++ FV I G ++ S +L FS CLV D S L+F DQ
Sbjct: 299 DNEGLFVGADGLIGLGGGAISLSS--------QLEATSFSYCLVDLDSESSSTLDFNADQ 350
Query: 181 --------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+ GK L + +SF I +G G I D G+ +T
Sbjct: 351 PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITE 410
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPARFN-SFPSMTYHFQGA 269
I +VY VL F+ GV TC++L ++ N P++ + G
Sbjct: 411 IPSDVYDVLRDAFVGLTKNLPPAP-------GVSPFDTCYDLSSQSNVEVPTIAFILPGE 463
Query: 270 D-LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ L + +N I + +F F P+ P +I+G Q + YDL
Sbjct: 464 NSLQLPAKNCLIQVDSAGTFCLAFLPSTFPL---SIIGNVQQQGIRVSYDL 511
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 136/359 (37%), Gaps = 72/359 (20%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK---SCYEQNDPIYNSRSFKSYK 59
TLN Y++ +G P + +DT + L+W QC+PC SCY Q DP+++ SY
Sbjct: 137 TLN--YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYA 194
Query: 60 KLPCYDASCKS-----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
+PC C C C Y ++YGD T V S DT TL + +VQ
Sbjct: 195 AVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL-----SASSAVQ 249
Query: 115 NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFH 172
FGC +S F + G++GL + S + Q FS CL +P + +
Sbjct: 250 GFFFGCGHAQSGLFNGVD-----GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 304
Query: 173 SRLEFGDQIIAGKSLN----LP-PNSFT--------IKLNGQR----------GCINDCG 209
L G A + LP PN+ T I + GQ+ G + D G
Sbjct: 305 LTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG 364
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-ARFNSFPSMTYHFQG 268
+V+T + YA L + F + G+ + P A N Y+F G
Sbjct: 365 TVITRLPPTAYAALRSAF----------------RSGMASYGYPTAPSNGILDTCYNFAG 408
Query: 269 ADLVVEPENVFIFNH-------QDSFFFFFGPAFTPR---KGKTILGARHQHNTQFVYD 317
V P F D F AF P G ILG Q + + D
Sbjct: 409 YGTVTLPNVALTFGSGATVMLGADGILSFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 106/259 (40%), Gaps = 39/259 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC CY+Q + +++ +Y + C
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241
Query: 67 SCKSPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C + C G C Y + YGD + ++DT TL D +V+ RFGC +
Sbjct: 242 ACSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCGERN 296
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR-LEF--GDQI 181
+ AG++GL TS VQ F+ CL P +S + L+F G
Sbjct: 297 EGLFGEA----AGLLGLGRGKTSLPVQTYDKYGGVFAHCL--PARSSGTGYLDFGPGSPA 350
Query: 182 IAGKSLNLP------PNSFTIKLNGQR----------------GCINDCGSVLTVIECEV 219
G P P + + + G R G I D G+V+T +
Sbjct: 351 AVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAA 410
Query: 220 YAVLTAEFIDYFSQHDIEK 238
Y+ L + F + +K
Sbjct: 411 YSSLRSAFASAMAARGYKK 429
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 140/346 (40%), Gaps = 46/346 (13%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
T+++ IG P + ++DT + LTW QC+PC +C++Q P+YN S +Y +D
Sbjct: 109 TFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVSCSDFDR 168
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
+ + DC Y TY D T+ + + PD+ + + ++ FGC +
Sbjct: 169 TDTTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITI-MHDVIFGCGHNNTQ 227
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ---PDKSFHSRLEFGDQI-- 181
+G+ GL +S + +LG FS C+ P FH RL G+++
Sbjct: 228 LPG-PTGYASGVFGLGDSGSSIISKLGF----GFSYCIGNIGDPLYGFH-RLTLGNKLKI 281
Query: 182 ---------------------IAGKSLNLPPNSFT-IKLNG-QRGCINDCGSVLTVIECE 218
I + L++ P F + LNG + D G+ L+ I +
Sbjct: 282 EGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATLSYIPRQ 341
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYHF-QGADLVVEP 275
Y V+ + S + R + C+ L FP T+H GADLV +
Sbjct: 342 AYNVVRDKVSSILSGFLSRYRYIARHLSL-CYIGKLNQDLQGFPDATFHLADGADLVFQV 400
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQFVYDL 318
E +F F + D+ A P + ++G Q YDL
Sbjct: 401 EGLF-FQYTDNVLCL---ALVPTESDEETCLIGLLAQQYYNVAYDL 442
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 139/358 (38%), Gaps = 67/358 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y +L IG P + ++DT + +T+ C CK C + DP + SYK L C
Sbjct: 77 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN 136
Query: 65 -DASCKSPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
D +C EG C Y Y ++ + V S D L+ S ++ Q FGC +
Sbjct: 137 PDCNCDD-----EGKLCVYERRYAEMSSSSGVLSED---LISFGNESQLTPQRAVFGCEN 188
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------------- 164
+E+ D S + GIMGL S + QL ++ D FS C
Sbjct: 189 VETGDLFSQRAD---GIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKI 245
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + ++ +AGKSL L P F NG+ G + D G+
Sbjct: 246 SPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGT---- 297
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKC-------GVTCFNLPAR-----FNSFPSM 262
YA E I+++ + ++ CF+ R N FP +
Sbjct: 298 ----TYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 353
Query: 263 TYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G L++ PEN ++F H + F R T+LG NT YD +
Sbjct: 354 DMEFGNGQKLILSPEN-YLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 410
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 132/355 (37%), Gaps = 61/355 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P+ + + DT + L WTQC PC C++Q P + S ++ KLPC +
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ S C C Y YG Y L T TL D P ++ FGCS E
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGY---LATETLKVGDASFP----SVAFGCSTE 198
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD--QI 181
+ +GI GL + S + QLG RFS CL + S + FG +
Sbjct: 199 NG-----VGNSTSGIAGLGRGALSLIPQLGV---GRFSYCLRSGSAAGASPILFGSLANL 250
Query: 182 IAGKSLNLP--------PNSFTIKLNG----------------------QRGCINDCGSV 211
G + P P+ + + L G G I D G+
Sbjct: 251 TDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTT 310
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN---SFPSMTYHFQ- 267
LT + + Y ++ F+ D+ + R + CF + PS+ F
Sbjct: 311 LTYLAKDGYEMVKQAFLS--QTADVTTVNGTRGLDL-CFKSTGGGGGGIAVPSLVLRFDG 367
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK---TILGARHQHNTQFVYDLD 319
GA+ V + P KG +++G Q + +YDLD
Sbjct: 368 GAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLD 422
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 85/351 (24%), Positives = 134/351 (38%), Gaps = 66/351 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++++ G P ++DT + ++W QC+PC S C+ Q DP+Y+ +Y +PC
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 138
Query: 66 ASCKS------PFHCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
CK C G C + I+Y D T S D TL P VQN F
Sbjct: 139 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAP-----GAIVQNFYF 193
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------- 164
GC K V + + G++GL S + G + FS CL
Sbjct: 194 GCG-HGKHAV---RGLFDGVLGLGRLRESLGARYGGV----FSYCLPSVSSKPGFLALGA 245
Query: 165 -------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
P + S + + GK L+L P++F+ G I D G+V
Sbjct: 246 GKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------GGMIVDSGTV 299
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GA 269
+T ++ Y L + F + + TC+NL N P + F GA
Sbjct: 300 ITGLQSTAYRALRSAFRKAMEAYRLLPNGDLD----TCYNLTGYKNVVVPKIALTFTGGA 355
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ ++ N + N +F P +LG +Q + ++D T
Sbjct: 356 TINLDVPNGILVNGCLAF-----AESGPDGSAGVLGNVNQRAFEVLFDTST 401
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 75/160 (46%), Gaps = 12/160 (7%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G PV + DT + TW QCQPC CYEQ + +++ +Y + C
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C + C G C YG+ YGD + ++DT TL D +V+ RFGC +
Sbjct: 240 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCGERN 294
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
+ AG++GL TS VQ F+ CL
Sbjct: 295 EGLFGEA----AGLLGLGRGKTSLPVQTYDKYGGVFAHCL 330
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 136/352 (38%), Gaps = 68/352 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ LG G P L+DT + ++W QC PC S CY Q DP+++ +Y + C
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNT 190
Query: 66 ASCKS---PFH--CFEG--DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
+C+ +H C G C Y + Y D ++ V S +T TL P ++V++ F
Sbjct: 191 DACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLAP-----GITVEDFHF 245
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV------------Q 166
GC + + G++GL S +VQ + FS CL
Sbjct: 246 GCGRDQRG----PSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNSEAGFLVLGS 301
Query: 167 PDKSFHSRLEFGDQ-----------------IIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
P S F + GK L++P ++F + G I D G
Sbjct: 302 PPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAF------RGGMIIDSG 355
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDI--EKLFTCRKCGVTCFNLPARFN-SFPSMTYHF 266
+V T + Y L A + + F TC+N N + P + + F
Sbjct: 356 TVDTELPETAYNALEAALRKALKAYPLVPSDDFD------TCYNFTGYSNITVPRVAFTF 409
Query: 267 Q-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
GA + ++ N + N +F P G I+G +Q + +YD
Sbjct: 410 SGGATIDLDVPNGILVNDCLAF-----QESGPDDGLGIIGNVNQRTLEVLYD 456
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 85/351 (24%), Positives = 134/351 (38%), Gaps = 66/351 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++++ G P ++DT + ++W QC+PC S C+ Q DP+Y+ +Y +PC
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172
Query: 66 ASCKS------PFHCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
CK C G C + I+Y D T S D TL P VQN F
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAP-----GAIVQNFYF 227
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------- 164
GC K V + + G++GL S + G + FS CL
Sbjct: 228 GCG-HGKHAV---RGLFDGVLGLGRLRESLGARYGGV----FSYCLPSVSSKPGFLALGA 279
Query: 165 -------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
P + S + + GK L+L P++F+ G I D G+V
Sbjct: 280 GKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------GGMIVDSGTV 333
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GA 269
+T ++ Y L + F + + TC+NL N P + F GA
Sbjct: 334 ITGLQSTAYRALRSAFRKAMEAYRLLPNGDLD----TCYNLTGYKNVVVPKIALTFTGGA 389
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ ++ N + N +F P +LG +Q + ++D T
Sbjct: 390 TINLDVPNGILVNGCLAF-----AESGPDGSAGVLGNVNQRAFEVLFDTST 435
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 80/318 (25%), Positives = 131/318 (41%), Gaps = 44/318 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +K+ IG P+ + + DT + LTW QC PC CY Q P+++ SY+ + C
Sbjct: 94 YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRF 153
Query: 68 CK----SPFHCF--EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C S C C Y +YGD T + + T + PV + I FGC
Sbjct: 154 CNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFT-IGSTSSRPVHLSPIVFGCG 212
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFG- 178
+ ++ +GI+GL + S + QL ++ +FS CLV + S+++FG
Sbjct: 213 TGNG---GTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGT 269
Query: 179 DQIIAGKSL--------------------------NLPPNSFTIKLNGQRG-CINDCGSV 211
D +I+G + LP + + N ++G I D G+
Sbjct: 270 DSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTT 329
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADL 271
LT ++ E + L + E++ R CF + P + HF AD+
Sbjct: 330 LTFLDSEFFTELERVLEETVKA---ERVSDPRGLFSVCFRSAGDID-LPVIAVHFNDADV 385
Query: 272 VVEPENVFIFNHQDSFFF 289
++P N F+ +D F
Sbjct: 386 KLQPLNTFVKADEDLLCF 403
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 62/247 (25%), Positives = 103/247 (41%), Gaps = 37/247 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
+++ +G G P ++ + DT + L+W QCQPC CY+Q+DP+++ SY +PC
Sbjct: 112 FVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCGTT 171
Query: 67 SCKSP-FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES- 124
C + C C YG+ YGD T V + +T T E FGC +
Sbjct: 172 ECAAAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSE-----FTGFIFGCGETNL 226
Query: 125 KDFVSI----------------QKKIIAGIMGL---NWDSTSFMVQLGRL-----VPDRF 160
DF + GI ++++T + +G +P ++
Sbjct: 227 GDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPGYLSIGATPVTGQIPVQY 286
Query: 161 SCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVY 220
+ + +PD +E I G L +PP+ FT + G + D G++LT + Y
Sbjct: 287 TAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFT-----KTGTLLDSGTILTYLPPPAY 341
Query: 221 AVLTAEF 227
L F
Sbjct: 342 TALRDRF 348
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 142/358 (39%), Gaps = 71/358 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ LG G P L+DT + L+W QCQPC S CY Q DP+++ + +Y +PC
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGS 181
Query: 66 ASCK--SPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
+C+ P G C YGI YG+ T V S +T TL P E + V V
Sbjct: 182 EACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSP--EAATV-VN 238
Query: 115 NIRFGCSLESKDFVSIQ----------KKIIAGIMGL----------NWDSTSFMVQLGR 154
N FGC L K + + +++ G +ST+ + LG
Sbjct: 239 NFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGA 298
Query: 155 LVPD-----RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
F +Q ++ ++ + GK L++ P F G I D G
Sbjct: 299 PATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVF------AGGMIIDSG 352
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQH------DIEKLFTCRKCGVTCFNLPARFN-SFPSM 262
+++T + Y+ L F S + D E L TC++ N + P++
Sbjct: 353 TIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLD-------TCYDFTGNTNVTVPTV 405
Query: 263 TYHFQGA---DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F+G DL V P V + D F A G I+G +Q + +YD
Sbjct: 406 ALTFEGGVTIDLDV-PSGVLL----DGCLAFVAGASDGDTG--IIGNVNQRTFEVLYD 456
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 151/360 (41%), Gaps = 63/360 (17%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
+++ + IG P + +DT + L W QC PC +CY Q+ PI++ +++ C
Sbjct: 83 QAFLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRT 142
Query: 66 ASCKSP---FHCFEGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRFG 119
+ P F+ C Y + Y D +K + + L +T+ DE S ++ ++ FG
Sbjct: 143 SQYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIY--DESSSAALHDVVFG 200
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSF-HSRLEF 177
C ++ + + GI+GL + S + + G+ +FS C D S+ H+ L
Sbjct: 201 CGHDNYG----EPLVGTGILGLGYGEFSLVHRFGK----KFSYCFGSLDDPSYPHNVLVL 252
Query: 178 GD-----------------------QIIAGKSLNLPPNSFTIKLNGQR---GCINDCGSV 211
GD + I+ + LP + N Q G I D G+
Sbjct: 253 GDDGANILGDTTPLEIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNS 312
Query: 212 LTVIECEVYAVLTAEFIDYF---------SQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
LT + E Y L D F SQ D+ K+ +C F + FP +
Sbjct: 313 LTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKM----ECYNGNFERDLVESGFPIV 368
Query: 263 TYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
T+HF +GA+L ++ +++F+ + F A TP +I GA Q + YDL+
Sbjct: 369 TFHFSEGAELSLDVKSLFMKLSPNVFCL----AVTPGNLNSI-GATAQQSYNIGYDLEAM 423
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 80/177 (45%), Gaps = 16/177 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG P K ++ ++DT + + W QC PC CY+Q DPI+ SY L C
Sbjct: 53 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQ 112
Query: 68 CKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
CKS C C Y ++YGD T + +T TL S+ N+ GC +++
Sbjct: 113 CKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITL-----DGSASLNNVAIGCGHDNE 167
Query: 126 D-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI 181
FV + G L++ S ++ FS CLV D S LEF I
Sbjct: 168 GLFVGAAGLLGLGGGSLSFPS--------QINASSFSYCLVNRDTDSASTLEFNSPI 216
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 84/175 (48%), Gaps = 18/175 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP----- 62
Y + +G P + ++DT + LTW QC PCK C D IY++ SY+ +
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQ 159
Query: 63 -CYDASCKSPFHCFEGD-CFYGITYGD---VYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C ++S + +C G C + YGD Y + D+L T++ PV+VQ+
Sbjct: 160 LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVV---GGKPVTVQDFA 216
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
FGC+ + V +GI+GLN + +QLG+ +FS C PD+S H
Sbjct: 217 FGCAQGDLELVPTGA---SGILGLNAGKMALPMQLGQRFGWKFSHCF--PDRSSH 266
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 138/367 (37%), Gaps = 79/367 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+G+G P +LDT + + W QC PC+ CY+Q+ P+++ R SY + C
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAPL 199
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C Y + YGD T + +T T V + GC +
Sbjct: 200 CRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFA-----GGARVARVALGCGHD 254
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ----------------- 166
++ ++ G S SF Q+ R FS CLV
Sbjct: 255 NEGLFVAAAGLLGLGRG----SLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSST 310
Query: 167 -----PDKSFHS--------RLEF------------GDQI--IAGKSLNLPPNSFTIKLN 199
P S S R+E G ++ +A L L P++
Sbjct: 311 VTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPST------ 364
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPAR 255
G+ G I D G+ +T + Y+ L F + + LF TC++L R
Sbjct: 365 GRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFD------TCYDLGGR 418
Query: 256 -FNSFPSMTYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
P+++ HF GA+ + PEN I + + +F F F G +I+G Q
Sbjct: 419 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGT---DGGVSIIGNIQQQGF 475
Query: 313 QFVYDLD 319
+ V+D D
Sbjct: 476 RVVFDGD 482
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 146/354 (41%), Gaps = 58/354 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y++ IG+P + LDT GL W QC C S E +S+SF +Y+ P
Sbjct: 75 YLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSF-TYEMEP 133
Query: 63 CYDASCKSPFH---CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C C S C D C Y + YGD T + S D+ D V V +
Sbjct: 134 CGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSD-GMLVDVGFLN 192
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLE 176
FGC S+ ++ ++ G +GLN S + QLG +FS CLV + S++
Sbjct: 193 FGC---SEAPLTGDEQSYTGNVGLNQTPLSLISQLGI---KKFSYCLVPFNNLGSTSKMY 246
Query: 177 FGDQII--AGKSLNLPPNS--FTIKLNG-------------------QRGCINDCGSVLT 213
FG + G++ L PNS + +K+ G + G I D G +
Sbjct: 247 FGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRDGWIIDTGITYS 306
Query: 214 VIECEVYAVLTAEFI---DYFSQHDIEKLFTCRKCGVTCFNL--PARFNSFPSMTYHFQG 268
+E + + L A+F+ D+ + D K + CF L SFP +T HF G
Sbjct: 307 SLETDAFDSLLAKFLTLKDFPQRKDDPK-----ERFELCFELQNANDLESFPDVTVHFDG 361
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDLDT 320
ADL++ E+ F+ D F R G +ILG N YDL+
Sbjct: 362 ADLILNVESTFVKIEDDGIFCL----ALLRSGSPVSILGNFQLQNYHVGYDLEA 411
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 133/366 (36%), Gaps = 69/366 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD-- 65
Y++ + +G P + ++DT + L W QC PC C+EQ P+++ + SY+ L C D
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPR 205
Query: 66 ------------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
+C+ P E C Y YGD + +L++ T+ + V
Sbjct: 206 CGHVAPPEAPAPRACRRP---GEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRV 262
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR-FSCCLVQPDKSFH 172
+ FGC ++ ++ L SF QL + FS CLV
Sbjct: 263 DGVVFGCGHRNRGLFHGAAGLLG----LGRGPLSFASQLRAVYGGHTFSYCLVDHGSDVA 318
Query: 173 SRLEFGDQ----------------------------------IIAGKSLNLPPNSFTIKL 198
S++ FG+ ++ G+ LN+ +++
Sbjct: 319 SKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDASE 378
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQH-----DIEKLFTCRKCGVTCFNLP 253
G G I D G+ L+ Y V+ FID S D L C V+ P
Sbjct: 379 GGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYN--VSGVERP 436
Query: 254 ARFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
P ++ F GA EN FI D TPR G +I+G Q N
Sbjct: 437 ----EVPELSLLFADGAVWDFPAENYFIRLDPDG-IMCLAVLGTPRTGMSIIGNFQQQNF 491
Query: 313 QFVYDL 318
YDL
Sbjct: 492 HVAYDL 497
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 121/321 (37%), Gaps = 57/321 (17%)
Query: 20 SLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEG-- 77
+L ++DT + LTW QC+PC CY Q DP+++ SY +PC ++C++ G
Sbjct: 175 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 234
Query: 78 ----------------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+Y + YGD ++ V + DT L SV FGC
Sbjct: 235 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL------GGASVDGFVFGCG 288
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI 181
L ++ AG+MGL D + L P F F +
Sbjct: 289 LSNRGLFG----GTAGLMGLGPDGA--LAGLPDGAPPPF---------------YFMNVT 327
Query: 182 IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFS--QHDIEKL 239
A + N + D G+V+T + VY + AEF F ++
Sbjct: 328 GASVGGAAVAAAGLGAAN----VLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPP 383
Query: 240 FTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTP 297
F+ C+NL P +T + GAD+ V+ + +D A
Sbjct: 384 FSLLDA---CYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLS 440
Query: 298 RKGKT-ILGARHQHNTQFVYD 317
+ +T I+G Q N + VYD
Sbjct: 441 FEDQTPIIGNYQQKNKRVVYD 461
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 77/176 (43%), Gaps = 19/176 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P+ + + DT + L WTQC PC C++Q P + S ++ KLPC +
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ S C C Y YG Y L T TL D P ++ FGCS E
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGY---LATETLKVGDASFP----SVAFGCSTE 198
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
+ +GI GL + S + QLG RFS CL + S + FG
Sbjct: 199 NG-----VGNSTSGIAGLGRGALSLIPQLGV---GRFSYCLRSGSAAGASPILFGS 246
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/319 (24%), Positives = 121/319 (37%), Gaps = 58/319 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD-- 65
Y++ L IG P + + LDT + L WTQCQPC +C++Q P ++ + + C
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 66 ------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
ASC SP C Y +YGD T +D T + + SV + FG
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFV----GAGASVPGVAFG 197
Query: 120 CSLESKDFVSIQKKIIAG-----------------------IMGLN-----WDSTSFMVQ 151
C L + + IAG + GL D + + +
Sbjct: 198 CGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257
Query: 152 LGRLVPDRFSCCLVQ-PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
GR S L+Q P L + L +P + F +K NG G I D G+
Sbjct: 258 SGRGAVQ--STPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGTIIDSGT 314
Query: 211 VLTVIECEVYAVLTAEF-----IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTY 264
+T + VY ++ F + S + + F C + P R + P +
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------CLSAPLRAKPYVPKLVL 366
Query: 265 HFQGADLVVEPENVFIFNH 283
HF+GA + + EN H
Sbjct: 367 HFEGATMDLPRENYVWLKH 385
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 144/366 (39%), Gaps = 69/366 (18%)
Query: 8 YMLKLGIGDPV-KSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ IG P + + +DT + L WTQC PC C++Q P+++ +++ + C D
Sbjct: 87 YLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDP 146
Query: 67 SCKS---------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPP--DEPSPVSVQN 115
C+ F CFY +YGD T DT T + P + PV+V
Sbjct: 147 ICRPSSGLSVSACALKTFR--CFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSG 204
Query: 116 IRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS---- 170
+ FGC + F S + +GI G S QL R+ RFS CL D++
Sbjct: 205 LAFGCGDYNTGVFASNE----SGIAGFGRGPLSLPSQL-RV--GRFSYCLTSHDETESNK 257
Query: 171 --------------FHSRLEFG------------------DQIIAGKSLNLPPNS--FTI 196
HS F + I GK+ LP +S F +
Sbjct: 258 TSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKT-RLPVDSSVFAL 316
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG-VTCFNLP-- 253
K +G G + D G+ +T V+ L EF+ +Q + + + G + CF P
Sbjct: 317 KKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFV---AQLPLPRYDNTSEVGNLLCFQRPKG 373
Query: 254 ARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
+ P + +H AD+ + EN +I DS ++G Q N
Sbjct: 374 GKQVPVPKLIFHLASADMDLPREN-YIPEDTDSGVMCLM-INGAEVDMVLIGNFQQQNMH 431
Query: 314 FVYDLD 319
VYD++
Sbjct: 432 IVYDVE 437
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++GIG+P + +LDT + ++W QC PC CY Q DPI+ + SY L C A
Sbjct: 132 YFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASASYAPLSCEAAQ 191
Query: 68 CK--SPFHCFEGDCFYGITYGD 87
C+ C G+C Y ++YGD
Sbjct: 192 CRYLDQSQCRNGNCLYQVSYGD 213
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 140/361 (38%), Gaps = 74/361 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPC--- 63
Y +K+G+G P K ++DT + +W QCQPC C+ Q DP++N + K+YK +PC
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 64 -----YDASCKSPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
A+ P + + C Y +YGD + S D TL P ++ +
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQ-----TLSSFV 217
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------------- 164
+GC +++ GI+GL + S + QL + FS CL
Sbjct: 218 YGCGQDNQGLFGRTD----GIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEG 273
Query: 165 --------VQPDKSF----------HSRLEFGDQ---IIAGKSLNLPPNSFTIKLNGQRG 203
+ P S+ + L F D +AG+ L + +S+ +
Sbjct: 274 FLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP------ 327
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQH-----DIEKLFTCRKCGVTCFNLPARFNS 258
I D G+V+T + VY L ++ S+ I L TC K + + A
Sbjct: 328 TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA---- 383
Query: 259 FPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P + F+ GADL ++ N + A I+G Q + YD
Sbjct: 384 -PDIRIIFKGGADLQLKGHNSLVELETGITCL----AMAGSSSIAIIGNYQQQTVKVAYD 438
Query: 318 L 318
+
Sbjct: 439 V 439
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/359 (22%), Positives = 144/359 (40%), Gaps = 58/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+G+G+P K + +DT + + W C C C ++D +Y+ +S S ++
Sbjct: 82 YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIY 141
Query: 63 CYDASCKSPFH-CFEG-----DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSV 113
C D C + ++ +G C Y + YGD T D+L + + S +
Sbjct: 142 CDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSAN- 200
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--------------------- 152
++ FGC + + + + GI+G ++S + QL
Sbjct: 201 GSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGG 260
Query: 153 ----GRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
G +V + + + P++ H + + + G L LP + F +RG I D
Sbjct: 261 IFAIGEVVSPKVNTTPMVPNQP-HYNVVMKEIEVGGNVLELPTDIF--DTGDRRGTIIDS 317
Query: 209 GSVLTVIECEVYAVLTAEFIDY---FSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
G+ L + VY + + + H +E+ F TCF N FP + +
Sbjct: 318 GTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF-------TCFQYTGNVNEGFPVVKF 370
Query: 265 HFQGA-DLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
HF G+ L V P + H++ + F + + G+ T+LG N +YDL+
Sbjct: 371 HFNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLE 429
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 140/361 (38%), Gaps = 74/361 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPC--- 63
Y +K+G+G P K ++DT + +W QCQPC C+ Q DP++N + K+YK +PC
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 64 -----YDASCKSPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
A+ P + + C Y +YGD + S D TL P ++ +
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQ-----TLSSFV 217
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------------- 164
+GC +++ GI+GL + S + QL + FS CL
Sbjct: 218 YGCGQDNQGLFGRTD----GIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEG 273
Query: 165 --------VQPDKSF----------HSRLEFGDQ---IIAGKSLNLPPNSFTIKLNGQRG 203
+ P S+ + L F D +AG+ L + +S+ +
Sbjct: 274 FLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP------ 327
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQH-----DIEKLFTCRKCGVTCFNLPARFNS 258
I D G+V+T + VY L ++ S+ I L TC K + + A
Sbjct: 328 TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA---- 383
Query: 259 FPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P + F+ GADL ++ N + A I+G Q + YD
Sbjct: 384 -PDIRIIFKGGADLQLKGHNSLVELETGITCL----AMAGSSSIAIIGNYQQQTVKVAYD 438
Query: 318 L 318
+
Sbjct: 439 V 439
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 136/357 (38%), Gaps = 65/357 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y +L IG P + ++DT + +T+ C CK C + DP + SY+ L C
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132
Query: 65 -DASCKSPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
D +C EG C Y Y ++ + V S D L+ S +S Q FGC
Sbjct: 133 PDCNCDD-----EGKLCVYERRYAEMSSSSGVLSED---LISFGNESQLSPQRAVFGCEN 184
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL---------------- 164
E + Q+ GIMGL S + QL ++ D FS C
Sbjct: 185 EETGDLFSQRA--DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKIS 242
Query: 165 ---------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
P +S + ++ +AGKSL L P F NG+ G + D G+
Sbjct: 243 PPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGT----- 293
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKC-------GVTCFNLPAR-----FNSFPSMT 263
YA E I+++ + ++ CF+ R N FP +
Sbjct: 294 ---TYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIA 350
Query: 264 YHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G L++ PEN ++F H + F R T+LG NT YD +
Sbjct: 351 MEFGNGQKLILSPEN-YLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 406
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 122/317 (38%), Gaps = 56/317 (17%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y++ + +G P S+ + DT + L W QC PC CY+Q +P+++ + K+YK L Y +
Sbjct: 28 SYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTL-GYLS 86
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
S EGD P S + FGC +
Sbjct: 87 SETFTIGSTEGD-------------------------------PASFPGLAFGCGHSNGG 115
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFGDQIIAG 184
+ + + G+ S ++QL V +FS CLV D + S++ FG +
Sbjct: 116 TFNEKDSGLIGLG---GGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVS 172
Query: 185 KSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQH---DIEKLFT 241
S P + + I D G+ LT++ + Y + + D F+
Sbjct: 173 GSGTSSPAAAE-----ESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFS 227
Query: 242 CRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK 301
GV +P ++T HF GAD+ + P N F+ +D F + P
Sbjct: 228 LCYSGVKKLEIP-------TITAHFIGADVQLPPLNTFVQAQEDLVCF----SMIPSSNL 276
Query: 302 TILGARHQHNTQFVYDL 318
I G Q N YDL
Sbjct: 277 AIFGNLSQMNFLVGYDL 293
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/356 (20%), Positives = 130/356 (36%), Gaps = 45/356 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDP-IYNSRSFKSYKKLPCYDA 66
Y + L IG P +SL + DT + L W +C C++C + ++ R ++ CYD
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 142
Query: 67 SCK---SPFHC-------FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C+ P C Y Y D T + + +T++ L ++++
Sbjct: 143 VCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTS-LKTSSGKEAKLKSV 201
Query: 117 RFGCS--LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-----PDK 169
FGC + + G+MGL SF QLGR ++FS CL+ P
Sbjct: 202 AFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPT 261
Query: 170 SFHSRLEFGDQI--------------------------IAGKSLNLPPNSFTIKLNGQRG 203
S+ + GD + + G L + P+ + I +G G
Sbjct: 262 SYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGG 321
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMT 263
+ D G+ L + Y ++ A + ++L V + P +
Sbjct: 322 TVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRLK 381
Query: 264 YHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ F G + V P + ++ + P+ G +++G Q F +D D
Sbjct: 382 FEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRD 437
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 133/354 (37%), Gaps = 59/354 (16%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y +L IG P + ++DT + +T+ C CK C + DP + SY+ L C
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132
Query: 65 -DASCKSPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
D +C EG C Y Y ++ + V S D L+ S +S Q FGC
Sbjct: 133 PDCNCDD-----EGKLCVYERRYAEMSSSSGVLSED---LISFGNESQLSPQRAVFGCEN 184
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL---------------- 164
E + Q+ GIMGL S + QL ++ D FS C
Sbjct: 185 EETGDLFSQRA--DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKIS 242
Query: 165 ---------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
P +S + ++ +AGKSL L P F NG+ G + D G+
Sbjct: 243 PPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYF 298
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCR----KCGVTCFNLPAR-----FNSFPSMTYHF 266
E + + I +I L CF+ R N FP + F
Sbjct: 299 PKEAFIAIKDAVI-----KEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEF 353
Query: 267 -QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G L++ PEN ++F H + F R T+LG NT YD +
Sbjct: 354 GNGQKLILSPEN-YLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 406
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 68/150 (45%), Gaps = 13/150 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KLGIG P +DT + L WTQCQPC CY Q DP++N R +Y LPC +
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C D C Y TY T+ ++D + + + + FGCS
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI------GEDAFRGVAFGCST 202
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL 152
S + +G++GL S + QL
Sbjct: 203 SSTGGAPPPQA--SGVVGLGRGPLSLVSQL 230
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 132/358 (36%), Gaps = 52/358 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P + + +DT + + W C+PC +C + ++ R + L
Sbjct: 41 YYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLS 100
Query: 63 CYDASCKSPFHCFEGDCF------YGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSV 113
C D+ C S E C Y YGD T D D + + + S
Sbjct: 101 CIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASA 160
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSF 171
+ I FGCS ++ + + GI G + S + QL L P FS CL D
Sbjct: 161 K-ITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGG 219
Query: 172 -----------------------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
H L + G+ L++ P F RG I DC
Sbjct: 220 GILVLGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFAT--TNTRGTIIDC 277
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQG 268
G+ L + E Y I SQ + C +T ++ FPS+T +F+G
Sbjct: 278 GTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSID---EIFPSVTLYFEG 334
Query: 269 ADLVVEPENVFIFN-HQDSFFFF------FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
A + ++P++ I DS + G T TILG + FVYDL+
Sbjct: 335 APMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVYDLE 392
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 74/160 (46%), Gaps = 13/160 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QC+PC SCY+Q D +++ +Y + C D
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C C G C YGI YGD T + DT + + +++ +FGC ++
Sbjct: 223 ACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAV------AQDAIKGFKFGCGEKN 276
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
+ AG++GL TS VQ FS CL
Sbjct: 277 RGLFG----QTAGLLGLGRGPTSITVQAYEKYGGSFSYCL 312
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 127/326 (38%), Gaps = 55/326 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
+++ +G G P ++ +LDT + L+W QC+PC CY Q+DP ++ SY +PC
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTP 196
Query: 67 SCKSP-FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES- 124
C + C C YG+ YGD T V S DT T S FGC ++
Sbjct: 197 VCAAAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFN-----SSSKFTGFTFGCGEKNI 251
Query: 125 KDFVSIQKKI----------------IAGIMGL---NWDSTSFMVQLG-----RLVPDRF 160
DF + + G+ ++++T + +G VP ++
Sbjct: 252 GDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGATKPTSTVPVQY 311
Query: 161 SCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVY 220
+ + +P +E I G L +PP+ FT + G + D G++LT + Y
Sbjct: 312 TAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFT-----KTGTLLDSGTILTYLPPPAY 366
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVFI 280
L F FT + N PA Y F G +V P F
Sbjct: 367 TSLRDRF-----------KFTMQG------NKPAPPYEPLDTCYDFTGQGAIVIPAVSFN 409
Query: 281 FNHQDSF-FFFFGPAFTPRKGKTILG 305
F+ F F+G P K ++G
Sbjct: 410 FSDGAVFDLDFYGIMIFPDDAKPLIG 435
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 145/356 (40%), Gaps = 62/356 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G P + ++DT +G+TW QCQ C+ CYEQ PI++ K+YK LPC
Sbjct: 97 YLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSSNM 156
Query: 68 CKSPF---HCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+S C C Y I YGD ++ S++T TL + S V N GC
Sbjct: 157 CQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNG-SSVQFPNTVIGCGH 215
Query: 123 ESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV----QPDKSFHSRLEF 177
+K F ++ G + +G +FS CL Q + S S+L F
Sbjct: 216 NNKGTFQGEGSGVVGLGGGPVSLISQLSSSIG----GKFSYCLAPMFSQSNSS--SKLNF 269
Query: 178 GD-QIIAG-KSLNLPPNSFT------------IKLNGQR-----------------GCIN 206
GD +++G +++ P S T + +R I
Sbjct: 270 GDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIII 329
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEK----LFTCRKCGVTCFNLPARFNSFPSM 262
D G+ LT++ E Y+ L + D + + L C + P+ P +
Sbjct: 330 DSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQ------TTPSGQLDVPVI 383
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
T HF+GAD+ + P + F+ + F AF + +I G Q N YDL
Sbjct: 384 TAHFKGADVELNPISTFVQVAEGVVCF----AFHSSEVVSIFGNLAQLNLLVGYDL 435
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 70/147 (47%), Gaps = 12/147 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + DT + TW QCQPC CYEQ + +++ +Y + C
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAP 237
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
+C + C G C YG+ YGD + ++DT TL D +V+ RFGC +
Sbjct: 238 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-----AVKGFRFGCGERN 292
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQ 151
+ AG++GL TS VQ
Sbjct: 293 EGLFGEA----AGLLGLGRGKTSLPVQ 315
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 137/362 (37%), Gaps = 51/362 (14%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSF 55
+ T Y ++GIG P K + +DT + + W C C C +++ +Y+ R
Sbjct: 83 LATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGS 142
Query: 56 KSYKKLPCYDASCKS------PFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPD 106
+S + + C C + P C Y I+YGD T D L + +
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 107 EPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL 164
+ +P + ++ FGC + + + GI+G ++S + QL V F+ CL
Sbjct: 203 QTTPANA-SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL 261
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
VQP H + + G +L LP N F +
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIF--DSGNSK 319
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPS 261
G I D G+ L + VY L A D ++ L +CF + FP
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQD-----FSCFQYSGSVDDGFPE 374
Query: 262 MTYHFQG-ADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYD 317
+T+HF+G L+V P + N ++ + F G K +LG N +YD
Sbjct: 375 VTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYD 434
Query: 318 LD 319
L+
Sbjct: 435 LE 436
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 84/175 (48%), Gaps = 18/175 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP----- 62
Y + +G P + ++DT + LTW +C PCK C D IY++ SYK +
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQ 159
Query: 63 -CYDASCKSPFHCFEGD-CFYGITYGD---VYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C ++S + +C G C + YGD Y + D+L T++ PV+VQ+
Sbjct: 160 LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVV---GGKPVTVQDFA 216
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
FGC+ + V +GI+GLN + +QLG+ +FS C PD+S H
Sbjct: 217 FGCAQGDLELVPTGA---SGILGLNAGKMALPMQLGQRFGWKFSHCF--PDRSSH 266
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 134/361 (37%), Gaps = 67/361 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+G+G PV +LDT + + W QC PC+ CY+Q+ +++ R+ SY + C
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C Y + YGD T + +T T S V + GC +
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-----SGARVPRVALGCGHD 261
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIA 183
++ ++ G S SF Q+ R FS CLV S S +
Sbjct: 262 NEGLFVAAAGLLGLGRG----SLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTF 317
Query: 184 GKSLNLP--PNSFT------------------IKLNGQR------------------GCI 205
G P SFT I + G R G I
Sbjct: 318 GSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVI 377
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPA-RFNSFP 260
D G+ +T + YA L F + + LF TC++L + P
Sbjct: 378 VDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFD------TCYDLSGLKVVKVP 431
Query: 261 SMTYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+++ HF GA+ + PEN I + + +F F F G +I+G Q + V+D
Sbjct: 432 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFA---GTDGGVSIIGNIQQQGFRVVFDG 488
Query: 319 D 319
D
Sbjct: 489 D 489
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 150/358 (41%), Gaps = 64/358 (17%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+++ IG+P ++DT + LTW C PC SC +Q+ PI++ +Y L C +
Sbjct: 92 VFLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCSEC 151
Query: 67 S-CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC----S 121
+ C G+C Y + Y ++ + + + TL DE S + V ++ FGC S
Sbjct: 152 NKCD----VVNGECPYSVEYVGSGSSQGIYAREQLTLETIDE-SIIKVPSLIFGCGRKFS 206
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH--SRLEFGD 179
+ S + + I G+ GL S + G+ +FS C+ + + +RL GD
Sbjct: 207 ISSNGY---PYQGINGVFGLGSGRFSLLPSFGK----KFSYCIGNLRNTNYKFNRLVLGD 259
Query: 180 QI------------------------IAGKSLNLPPNSFTIKL-NGQRGCINDCGSVLTV 214
+ I G+ L++ P F + + G I D G+ T
Sbjct: 260 KANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTW 319
Query: 215 IECEVYAVLTAEF------IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF-Q 267
+ + VL+ E + +Q D +T GV +L FP +T+HF +
Sbjct: 320 LTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSG----FPLVTFHFAE 375
Query: 268 GADLVVEPENVFIFNHQDSFFF------FFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
GA L ++ ++FI ++ F +FG + + + +G Q N YDL+
Sbjct: 376 GAVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDY---ESFSSIGMLAQQNYNVGYDLN 430
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 137/362 (37%), Gaps = 51/362 (14%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSF 55
+ T Y ++GIG P K + +DT + + W C C C +++ +Y+ R
Sbjct: 83 LATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGS 142
Query: 56 KSYKKLPCYDASCKS------PFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPD 106
+S + + C C + P C Y I+YGD T D L + +
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 107 EPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL 164
+ +P + ++ FGC + + + GI+G ++S + QL V F+ CL
Sbjct: 203 QTTPANA-SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL 261
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
VQP H + + G +L LP N F +
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIF--DSGNSK 319
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPS 261
G I D G+ L + VY L A D ++ L +CF + FP
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQD-----FSCFQYSGSVDDGFPE 374
Query: 262 MTYHFQG-ADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYD 317
+T+HF+G L+V P + N ++ + F G K +LG N +YD
Sbjct: 375 VTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYD 434
Query: 318 LD 319
L+
Sbjct: 435 LE 436
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 138/353 (39%), Gaps = 51/353 (14%)
Query: 4 LNHTYMLKLGIGDP-VKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
+N Y++ L IG P + + LDT + + WTQC+PC C+ Q P +++ + + + +
Sbjct: 88 VNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVA 147
Query: 63 CYDASCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C D C S CF C Y YGD + D+ T V+V +I FGC
Sbjct: 148 CSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGC 207
Query: 121 SL-ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--------------------------- 152
+ + F+ + GI G S QL
Sbjct: 208 GMYNAGRFLQTE----TGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAKSSPVFLGGAGD 263
Query: 153 ------GRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
G ++ F L + H L F + L +P IK +G
Sbjct: 264 LKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVP----EIKADGSGATFI 319
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYH 265
D G+ +T V+ L + FI +Q + T + + CF+ + + P + +H
Sbjct: 320 DSGTDITTFPDAVFRQLKSAFI---AQAALPVNKTADEDDI-CFSWDGKKTAAMPKLVFH 375
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+GAD + EN ++ ++S + + + +T++G Q NT VYDL
Sbjct: 376 LEGADWDLPREN-YVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDL 427
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 127/351 (36%), Gaps = 58/351 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y L +G P L LDT + +W QC+PC CYEQ++ +++ +Y + C
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRE 193
Query: 68 CK----SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+ S H D C Y ITY D T + DT TL P D +V FGC
Sbjct: 194 CQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTD-----AVPGFVFGCG 248
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL----------------- 164
+ I G++GL S Q+ FS CL
Sbjct: 249 HNNAGSFG----EIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLSFSGAAA 304
Query: 165 VQPDKSFHSRLEFGDQ-----------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
P + + + G +AG+++ +PP+ F G I D G+ +
Sbjct: 305 AAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATAA----GTIIDSGTAFS 360
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADL 271
+ YA L + ++ T TC++L PS+ F GA +
Sbjct: 361 CLPPSAYAALRSSVRSAMGRYKRAPSSTIFD---TCYDLTGHETVRIPSVALVFADGATV 417
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQFVYDLD 319
+ P V S AF P T +LG Q +YD+D
Sbjct: 418 HLHPSGVLYTWSNVSQTCL---AFLPNPDDTSLGVLGNTQQRTLAVIYDVD 465
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 116/311 (37%), Gaps = 64/311 (20%)
Query: 15 GDP--VKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCKS 70
GDP V +DT + W QC PC CY Q DP+++ + + + C +C+S
Sbjct: 140 GDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRS 199
Query: 71 --PF------HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
P+ +C Y I Y D T DT T+ +V+N RFGCS
Sbjct: 200 LGPYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTLTI-----SGTTAVRNFRFGCSH 254
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ-- 180
+ S + AG M L + S + Q R + + FS C+ P S L G
Sbjct: 255 AVRGRFS---DLTAGTMSLGGGAQSLLAQTARSLGNAFSYCV--PQASASGFLSIGGPAT 309
Query: 181 -----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
++AG+ L +PP +F+ G + D +V
Sbjct: 310 TNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFSA------GAVMDSSAV 363
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGAD 270
+T + Y L F + + TC++ N P+++ F G
Sbjct: 364 ITQLPPTAYRALRRAFRNAMRAYPRSGATGTLD---TCYDFLGLTNVRVPAVSLVFGGGA 420
Query: 271 LVV-EPENVFI 280
+VV +P V I
Sbjct: 421 VVVLDPPAVMI 431
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 142/346 (41%), Gaps = 59/346 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQ-NDPIYNSRSFKSYKKLPCYDA 66
+++ +G P ++DT + L W QC PCKSC +Q P+++ +Y L C +
Sbjct: 102 FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNI 161
Query: 67 SCK-SPF-HC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ +P C C Y TY + + V + + DE +V N+ FGCS
Sbjct: 162 ICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRN-AVNNVLFGCSHR 220
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPDKSF--------- 171
+ ++ + + G+ GL TS + Q+G +FS C + PD S+
Sbjct: 221 NGNY---KDRRFTGVFGLGSGITSVVNQMG----SKFSYCIGNIADPDYSYNQLVLSEGV 273
Query: 172 --------------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
H ++ + L + P++F + QR I D G+ T +
Sbjct: 274 NMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFK-RTEKQRRVIIDSGTAPTWLAE 332
Query: 218 EVYAVLTAE---FIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF-QGADLVV 273
Y L E +D F + + F C K V FP++T+HF +GADLVV
Sbjct: 333 NEYRALEREVRNLLDRFLTPFMRESFLCYKGKVG-----QDLVGFPAVTFHFAEGADLVV 387
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ E Q S +G F K +++G Q YDL+
Sbjct: 388 DTE-----MRQAS---VYGKDF---KDFSVIGLMAQQYYNVAYDLN 422
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 89/352 (25%), Positives = 136/352 (38%), Gaps = 48/352 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P + ++DT + L W QC PC C+EQ+ PI++ + SY+ + C D
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDR 208
Query: 68 CK--------SPFHCFE---GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C+ +P C C Y YGD T +L+ T + + V +
Sbjct: 209 CRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFT-VNLTQSGTRRVDGV 267
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-GRLVPDRFSCCLVQPDKSFHSRL 175
FGC ++ ++ L SF QL G FS CLV+ + S++
Sbjct: 268 AFGCGHRNRGLFHGAAGLLG----LGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKI 323
Query: 176 EFG--DQIIAGKSLNLPPNSFT-------------IKLNGQR-----------GCINDCG 209
FG D ++A LN + T I + G+ G I D G
Sbjct: 324 IFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSG 383
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-ARFNSFPSMTYHFQ- 267
+ L+ Y + FID S L C+N+ A P ++ F
Sbjct: 384 TTLSYFPEPAYQAIRQAFIDRMSPS--YPLILGFPVLSPCYNVSGAEKVEVPELSLVFAD 441
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
GA EN FI + TPR G +I+G Q N +YDL+
Sbjct: 442 GAAWEFPAENYFI-RLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLE 492
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/352 (23%), Positives = 133/352 (37%), Gaps = 68/352 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ + +G P + +DT + ++W QC PC +SC Q D +++ +Y C
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSS 189
Query: 66 ASCK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
A C C C Y + Y D T DT L D +V+N +FGCS
Sbjct: 190 AQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSD-----AVKNFQFGCS 244
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ- 180
+ FV + G+MGL D+ S + Q FS CL S L G
Sbjct: 245 HRANGFVG----QLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLTLGAAA 300
Query: 181 -----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+AG LN+P + F+ + D G+V
Sbjct: 301 GGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFS------GASVVDSGTV 354
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPA-RFNSFPSMTYHF-Q 267
+T + Y L F + +++ + G+ TCF+ + P +T F +
Sbjct: 355 ITQLPPTAYQALRTAF-----KKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSR 409
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYDL 318
GA + ++ +F + F T + G T ILG Q + ++D+
Sbjct: 410 GAVMDLDVSGIF---YAGCLAF----TATAQDGDTGILGNVQQRTFEMLFDV 454
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/338 (23%), Positives = 127/338 (37%), Gaps = 64/338 (18%)
Query: 20 SLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEG-- 77
+L ++DT + LTW QC+PC CY Q DP+++ SY +PC ++C++ G
Sbjct: 121 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 180
Query: 78 ----------------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C+Y + YGD ++ V + DT L SV FGC
Sbjct: 181 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL------GGASVDGFVFGCG 234
Query: 122 LESKDFVSIQKKI-------------IAGIMGLNWDSTSFM----VQLGRLVPDRFSCCL 164
L ++ AG + L D++S+ V R++ D
Sbjct: 235 LSNRGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNATPVSYTRMIADP----- 289
Query: 165 VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLT 224
QP F + A + L + + D G+V+T + VY +
Sbjct: 290 AQPPFYFMNVTGASVGGAAVAAAGLGAANVLL----------DSGTVITRLAPSVYRAVR 339
Query: 225 AEFIDYFS--QHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPENVFI 280
AEF F ++ F+ C+NL P +T + GAD+ V+ +
Sbjct: 340 AEFARQFGAERYPAAPPFSLLDA---CYNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLF 396
Query: 281 FNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
+D A + +T I+G Q N + VYD
Sbjct: 397 MARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYD 434
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/342 (25%), Positives = 132/342 (38%), Gaps = 46/342 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ + +G P + DT + TW QCQPC + CY+Q +P++ +Y + C +
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSS 224
Query: 67 SCK--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C C G C Y + YGD T + DT TL +V++ RFGC ++
Sbjct: 225 YCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTL------GYDTVKDFRFGCGEKN 278
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
+ AG+MGL TS VQ F+ C + S L+FG A
Sbjct: 279 RGLFGKA----AGLMGLGRGKTSVPVQAYDKYSGVFAYC-IPATSSGTGFLDFGPGAPAA 333
Query: 185 KSLNLPP----NSFT--------IKLNGQ-----------RGCINDCGSVLTVIECEVYA 221
+ L P N T IK+ G G + D G+V+T + Y
Sbjct: 334 ANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPPSAYE 393
Query: 222 VLTAEFIDYFS--QHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHFQ-GADLVVEPE 276
L + F + F+ TC++L S P+++ FQ GA L V+
Sbjct: 394 PLRSAFAKGMEGLGYKTAPAFSILD---TCYDLTGYQGSIALPAVSLVFQGGACLDVDAS 450
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F A TI+G Q +YDL
Sbjct: 451 GILYVADVSQACLAFA-ANDDDTDMTIVGNTQQKTYSVLYDL 491
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 82/336 (24%), Positives = 130/336 (38%), Gaps = 70/336 (20%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSF-----K 56
+T Y ++ +G P + + DT + LTW +C+ ++ P+ + R F K
Sbjct: 104 YTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSK 163
Query: 57 SYKKLPCYDASCKS--PF---HCFEGD-----CFYGITYGDVYETKEVDSLDTSTLLPPD 106
S+ +PC +CKS PF +C G C Y Y D + V D +T+
Sbjct: 164 SWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSG 223
Query: 107 EPS--PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
S +Q + GC+ S D S Q G++ L + SF + RFS CL
Sbjct: 224 SGSDRKAKLQEVVLGCT-TSYDGQSFQSS--DGVLSLGNSNISFASRAAARFGGRFSYCL 280
Query: 165 VQ--PDKSFHSRLEFG---------------------------DQI-IAGKSLNLPPNSF 194
V ++ S L FG D + +AGK+LN+P +
Sbjct: 281 VDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVW 340
Query: 195 TIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT------ 248
+K NG G I D G+ LT++ Y + A + R VT
Sbjct: 341 DVKKNG--GAILDSGTSLTILATPAYKAVVAALSKQLA----------RVPRVTMDPFEY 388
Query: 249 CFNLPA--RFNSFPSMTYHFQGADLVVEPENVFIFN 282
C+N A R + P + F G+ + P ++ +
Sbjct: 389 CYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVID 424
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 78/166 (46%), Gaps = 19/166 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + L F+ DT + LTWTQC+PC CY+Q + I++ + SY + C
Sbjct: 89 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 148
Query: 67 SCK--------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
SC+ SP C C YGI YGD + + + +L D N +F
Sbjct: 149 SCEKLESATGNSP-GCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDV-----FNNFQF 202
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
GC ++ AG++GL + S + Q + FS CL
Sbjct: 203 GCGQNNRGLFG----GTAGLLGLARNPLSLVSQTAQKYGKVFSYCL 244
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 64/123 (52%), Gaps = 8/123 (6%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N ++++L IG P + +LDT + LTWTQC PC CY+Q PIY+ +Y + C
Sbjct: 18 NGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 65 DASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
+ C + C C Y TYGD T+ + S +T TL S S+ +I FGC
Sbjct: 78 SSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTL------SSQSIPHIAFGCGQ 131
Query: 123 ESK 125
+++
Sbjct: 132 DNE 134
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 139/357 (38%), Gaps = 53/357 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ IG P K +DT + + W C C C ++ +Y+ + S +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 63 CYDASCKSPF-------HCFEGD-CFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPV 111
C + C + + C G C Y YGD T DSL + L + +
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQ-LSGNAQTRH 205
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL----- 164
+ N+ FGC + + + + GI+G +TS + QL V FS CL
Sbjct: 206 AKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKG 265
Query: 165 ---------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
VQP H + +AG +L LPP+ F + + +RG I D
Sbjct: 266 GGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIF--ETSEKRGTIID 323
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF 266
G+ LT + VY + A F +H T + G CF + FP +T+HF
Sbjct: 324 SGTTLTYLPELVYKDILAAV---FQKHQDITFRTIQ--GFLCFEYSESVDDGFPKITFHF 378
Query: 267 Q-GADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
+ L V P + F N + + F F P+ K +LG N VYDL+
Sbjct: 379 EDDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLE 435
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 135/350 (38%), Gaps = 86/350 (24%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y++K+ IG P ++ + DT + L WTQC PC SCY+Q +P+++ S+K++ C
Sbjct: 21 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 80
Query: 65 DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ LDT P S+ NI FGC +
Sbjct: 81 SQQCRL--------------------------LDT----------PTSILNIVFGCGHNN 104
Query: 125 KDFVSIQKKIIAGIMGLNWDSTS-FMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFGDQI 181
+ + + G G TS M LG +FS CLV + D S S++ FG +
Sbjct: 105 SGTFNENEMGLFGTGGRPLSLTSQIMSTLGSG--RKFSQCLVPFRTDPSITSKIIFGPEA 162
Query: 182 IAGKS--LNLP------PNSFTIKLNG------------------QRGCINDCGSVLTVI 215
S ++ P P + + L+G + D G+ T++
Sbjct: 163 EVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLL 222
Query: 216 ECEVY-----AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGAD 270
+ Y V A ++ D++ R A P +T HF GAD
Sbjct: 223 PRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRS---------ATLIDGPILTAHFDGAD 273
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYDLD 319
+ ++P N FI + + F A P G T I G Q N +DLD
Sbjct: 274 VQLKPLNTFISPKEGVYCF----AMQPIDGDTGIFGNFVQMNFLIGFDLD 319
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 137/344 (39%), Gaps = 59/344 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P + L+DT + ++W QC+PC C+ Q D +++ S +Y C A+
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAA 186
Query: 68 CKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL-ES 124
C C C Y + YGD S DT L +V+N +FGCS ES
Sbjct: 187 CAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL------GSSTVENFQFGCSQSES 240
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI--- 181
+ + Q +MGL + S Q FS CL P L G
Sbjct: 241 GNLLQDQTAG---LMGLGGGAESLATQTAGTFGKAFSYCL-PPTPGSSGFLTLGASTSGF 296
Query: 182 ------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ G+ LN+P ++F+ G I D G+++T +
Sbjct: 297 VVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSA------GSIMDSGTIITRLPR 350
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGADLVVE 274
Y+ L++ F + +++ + G+ TCF+ + + S P++ F G +V
Sbjct: 351 TAYSALSSAF-----KAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAVVDL 405
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ I +F A + I+G Q + +YD+
Sbjct: 406 ASDGIILGSCLAF-----AANSDDTSLGIIGNVQQRTFEVLYDV 444
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 81/363 (22%), Positives = 144/363 (39%), Gaps = 62/363 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+G+G P K + +DT + + W C C +C +++ +Y+ K+ +P
Sbjct: 72 YYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVP 131
Query: 63 CYDASC----KSPFHCFEGD--CFYGITYGD--------VYETKEVDSLDTSTLLPPDEP 108
C D C P + D C Y ITYGD V ++ D + + PD
Sbjct: 132 CGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNS 191
Query: 109 SPVSVQNIRFGCSL-ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--------------- 152
S + FGC +S S + + GI+G ++S + QL
Sbjct: 192 SVI------FGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLD 245
Query: 153 ----------GRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
G+++ +F+ + P + H + D + G+ + LP + R
Sbjct: 246 SHHGGGIFSIGQVMEPKFNTTPLVP-RMAHYNVILKDMDVDGEPILLPL--YLFDSGSGR 302
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPS 261
G I D G+ L + +Y L + + Q ++ + + TCF+ + + FP
Sbjct: 303 GTIIDSGTTLAYLPLSIYNQLLPKVLG--RQPGLKLMIVEDQ--FTCFHYSDKLDEGFPV 358
Query: 262 MTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRK-GK--TILGARHQHNTQFVYDL 318
+ +HF+G L V P + +D + + + T K G+ ++G N VYDL
Sbjct: 359 VKFHFEGLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDL 418
Query: 319 DTF 321
+
Sbjct: 419 ENM 421
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 147/371 (39%), Gaps = 68/371 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + L +G P +++ +LDT + L+W C+ + + DP+ +S SY +
Sbjct: 57 FHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSS----SYSPI 112
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
PC +C++ P C + C I+Y D + +L + T + P ++
Sbjct: 113 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIE--GNLASDTFHIGNSAIPATI 170
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--- 170
FGC S + G++G+N S SF+ Q+G +FS C+ D S
Sbjct: 171 ----FGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGL---QKFSYCISGQDSSGIL 223
Query: 171 --------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNG 200
+ R+ + Q+ +A L LP + + G
Sbjct: 224 LFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTG 283
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPARF 256
+ D G+ T + VY L EF+ ++ ++ L F + C+ +P
Sbjct: 284 AGQTMVDSGTQFTFLLGPVYTALKNEFVRQ-TKASLKVLEDPNFVFQGAMDLCYRVPLTR 342
Query: 257 NS---FPSMTYHFQGADLVVEPENVF-----IFNHQDSFF-FFFGPAFTPRKGKTILGAR 307
+ P++T F+GA++ V E + + DS + F FG + I+G
Sbjct: 343 RTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHH 402
Query: 308 HQHNTQFVYDL 318
HQ N +DL
Sbjct: 403 HQQNVWMEFDL 413
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 153/364 (42%), Gaps = 52/364 (14%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKS 57
T+ Y K+G+G P K + +DT + + W C C C ++D +Y+ + K+
Sbjct: 64 TVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKT 123
Query: 58 YKKLPCYDASCKSPFHC------FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
+ + C C S + E C Y I+YGD T D T +
Sbjct: 124 SEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHT 183
Query: 112 SVQN--IRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL-- 164
+ QN I FGC + +S F S ++ + GI+G ++S + QL V FS CL
Sbjct: 184 ATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDT 243
Query: 165 ------------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC 204
V+P H + + + G L LP ++F + NG +G
Sbjct: 244 NVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSE-NG-KGT 301
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMT 263
+ D G+ L + VY L ++ + ++ K++ + +CF +S FP +
Sbjct: 302 VIDSGTTLAYLPRIVYDQLMSKVL---AKQPRLKVYLVEE-QYSCFQYTGNVDSGFPIVK 357
Query: 264 YHFQGA-DLVVEPENVFIFNHQDSFFFFFG---PAFTPRKGK--TILGARHQHNTQFVYD 317
HF+ + L V P + ++FN++ ++ G A + GK T+LG N VYD
Sbjct: 358 LHFEDSLSLTVYPHD-YLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYD 416
Query: 318 LDTF 321
L+
Sbjct: 417 LENM 420
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 79/338 (23%), Positives = 124/338 (36%), Gaps = 41/338 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P K + DT + L+W QC+PC CYEQ DP+++ +Y + C
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 68 CK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ + + C Y + YGD +T DT TL D ++ FGC ++
Sbjct: 209 CQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD-----TLPGFVFGCGDQN 263
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
+ G+ GL + S Q F+ CL S L G A
Sbjct: 264 AGLFG----QVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS-SSSGRGYLSLGGAPPAN 318
Query: 185 KSLNLPPNSFT----------IKLNGQR------------GCINDCGSVLTVIECEVYAV 222
+ T IK+ G+ G + D G+V+T + YA
Sbjct: 319 AQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAP 378
Query: 223 LTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQ-GADLVVEPENVFI 280
L A F +Q+ + TC++ R P++ F GA + ++ V
Sbjct: 379 LRAAFARSMAQYKKAPALSILD---TCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLY 435
Query: 281 FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F P ILG Q YD+
Sbjct: 436 VSKVSQACLAFAP-NADDSSIAILGNTQQKTFAVAYDV 472
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 147/371 (39%), Gaps = 68/371 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + L +G P +++ +LDT + L+W C+ + + DP+ +S SY +
Sbjct: 50 FHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSS----SYSPI 105
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
PC +C++ P C + C I+Y D + +L + T + P ++
Sbjct: 106 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIE--GNLASDTFHIGNSAIPATI 163
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--- 170
FGC S + G++G+N S SF+ Q+G +FS C+ D S
Sbjct: 164 ----FGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGL---QKFSYCISGQDSSGIL 216
Query: 171 --------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNG 200
+ R+ + Q+ +A L LP + + G
Sbjct: 217 LFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTG 276
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPARF 256
+ D G+ T + VY L EF+ ++ ++ L F + C+ +P
Sbjct: 277 AGQTMVDSGTQFTFLLGPVYTALKNEFVRQ-TKASLKVLEDPNFVFQGAMDLCYRVPLTR 335
Query: 257 NS---FPSMTYHFQGADLVVEPENVF-----IFNHQDSFF-FFFGPAFTPRKGKTILGAR 307
+ P++T F+GA++ V E + + DS + F FG + I+G
Sbjct: 336 RTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHH 395
Query: 308 HQHNTQFVYDL 318
HQ N +DL
Sbjct: 396 HQQNVWMEFDL 406
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 70/291 (24%), Positives = 110/291 (37%), Gaps = 39/291 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P K + DT + L+W QC+PC CYEQ DP+++ +Y + C
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 68 CK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C+ + + C Y + YGD +T DT TL D ++ FGC ++
Sbjct: 209 CQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD-----TLPGFVFGCGDQN 263
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG 184
+ G+ GL + S Q F+ CL S L G A
Sbjct: 264 AGLFG----QVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS-SSSGRGYLSLGGAPPAN 318
Query: 185 KSLNLPPNSFT----------IKLNGQR------------GCINDCGSVLTVIECEVYAV 222
+ T IK+ G+ G + D G+V+T + YA
Sbjct: 319 AQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAP 378
Query: 223 LTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLV 272
L A F +Q+ + TC++ R P++ F G V
Sbjct: 379 LRAAFARSMAQYKKAPALSILD---TCYDFTGHRTAQIPTVELAFAGGATV 426
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 138/337 (40%), Gaps = 34/337 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+G+G+P K + +DT + + W C C C ++D +Y+ S S ++
Sbjct: 27 YFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRVS 86
Query: 63 CYDASCKSPFHCFEGDCF------YGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSVQ 114
C D C S ++ DC Y + YGD T D + + + +S
Sbjct: 87 CDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSNG 146
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMG-----LNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
+ FGC + + + + GI+G L+ + + +G LV + + + P++
Sbjct: 147 TVTFGCGAQQSGGLGTSGEALDGILGAFAHCLDNVNGGGIFAIGELVSPKVNTTPMVPNQ 206
Query: 170 SFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFID 229
+ H + + + G L LP + F +RG I D G+ L + VY + E
Sbjct: 207 A-HYNVYMKEIEVGGTVLELPTDVF--DSGDRRGTIIDSGTTLAYLPEVVYDSMMNEIRS 263
Query: 230 Y---FSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGA-DLVVEPENVFIFNHQD 285
S H +E+ F C K + FP + +HF+ + L V P + +D
Sbjct: 264 QQPGLSLHTVEEQFICFKYSGNV------DDGFPDIKFHFKDSLTLTVYPHDYLFQISED 317
Query: 286 SF-FFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
+ F + + G+ T+LG N +YD++
Sbjct: 318 IWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIE 354
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 78/347 (22%), Positives = 131/347 (37%), Gaps = 67/347 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P + ++DT + ++W +C +++ +Y C A+
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGLT-----LFDPSKSTTYAPFSCSSAA 183
Query: 68 CKSPFH----CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + C C Y + YGD T S DT L D +V + FGCS
Sbjct: 184 CAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASD-----TVTDFHFGCSHH 238
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG----- 178
+DF + I G+MGL D+ S + Q FS CL P L FG
Sbjct: 239 EEDF---DGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCL-PPTNRTSGFLTFGAPNGT 294
Query: 179 ------------------------DQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
D + G L + P+ + G + D G+V+T
Sbjct: 295 SGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLS------NGSVMDSGTVITW 348
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGADL 271
+ Y+ L++ F ++ ++ G+ TC++ N S P+++ G +
Sbjct: 349 LPRRAYSALSSAFRSSMTRLRHQR---AAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAV 405
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V N + QD AF G +I+G Q + ++D+
Sbjct: 406 VDLDGNGIMI--QDCL------AFAATSGDSIIGNVQQRTFEVLHDV 444
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 83/191 (43%), Gaps = 21/191 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y +K+G G P + ++DT + L+W QC+PC C+ Q DP+++ + K+YK L C +
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S + C Y +YGD + S D TL P ++
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ-----TLPGFV 232
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
+GC +S AGI+GL + S + Q+ FS CL P + L
Sbjct: 233 YGCGQDSDGLFGRA----AGILGLGRNKLSMLGQVSSKFGYAFSYCL--PTRGGGGFLSI 286
Query: 178 GDQIIAGKSLN 188
G +AG + N
Sbjct: 287 GKASLAGSAYN 297
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 88/343 (25%), Positives = 139/343 (40%), Gaps = 55/343 (16%)
Query: 12 LGIGDPVKSLWFLLDTVAGLTWTQCQPC---KSCYEQNDPIYNSRSFKSYKKLPCYDASC 68
+ +G P + +F+LDT + +TW QC PC CYEQ PI++ SY + C C
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60
Query: 69 K--SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
+ C C Y + YGD T + +T T + + S+ NI GC +++
Sbjct: 61 QLLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSN-----SIPNISIGCGHDNEG 115
Query: 127 -FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF-----GDQ 180
FV I G ++ S +L FS CLV D S L+F D
Sbjct: 116 LFVGADGLIGLGGGAISISS--------QLKASSFSYCLVDIDSPSFSTLDFNTDPPSDS 167
Query: 181 IIA----------------------GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
+I+ GK L + + F I +G G I D G+ +T + +
Sbjct: 168 LISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSD 227
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGAD-LVVEPE 276
VY VL F+ + + TC++L ++ N P++ + G + L + +
Sbjct: 228 VYEVLREAFLGLTTNLPPAPEISPFD---TCYDLSSQSNVEVPTIAFILPGENSLQLPAK 284
Query: 277 NVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
N I + +F F A P +I+G Q + YDL
Sbjct: 285 NCLIQVDSAGTFCLAFVSATFPL---SIIGNFQQQGIRVSYDL 324
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 64/123 (52%), Gaps = 8/123 (6%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N ++++L IG P + +LDT + LTWTQC PC CY+Q PIY+ +Y + C
Sbjct: 18 NGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 65 DASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
+ C + C C Y TYGD T+ + S +T TL S S+ +I FGC
Sbjct: 78 SSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTL------SSQSIPHIAFGCGQ 131
Query: 123 ESK 125
+++
Sbjct: 132 DNE 134
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 136/352 (38%), Gaps = 71/352 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + +DT + ++W +C+ +Y+ + +Y C +
Sbjct: 131 YVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------SRLYDPGTSSTYAPFSCSAPA 181
Query: 68 C----KSPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C + C G C Y + YGD T DT TL EP + +FGCS
Sbjct: 182 CAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSEP---LISGFQFGCSA 238
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------------------ 164
F ++ G+MGL D+ SF+ Q FS CL
Sbjct: 239 VEHGF---EEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWNSSGFLTLGAPSSS 295
Query: 165 ------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
+ +F+ L G + GK+L +P + F+ G I D G+V+
Sbjct: 296 TSAAFSTTPMLRSKQAATFYGLLLRGIS-VGGKTLEIPSSVFSA------GSIVDSGTVI 348
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP--ARFNSF--PSMTYHFQG 268
T + Y L+A F D +++ + R TCF+ N+F PS+ G
Sbjct: 349 TRLPPTAYGALSAAFRDGMARYQYQPA-APRGLLDTCFDFTGHGEGNNFTVPSVALVLDG 407
Query: 269 ADLV-VEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYDL 318
+V + P + QD F A T G+T I+G Q + +YD+
Sbjct: 408 GAVVDLHPNGIV----QDGCLAF---AATDDDGRTGIIGNVQQRTFEVLYDV 452
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 83/194 (42%), Gaps = 21/194 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS-CYEQNDPIYNSRSFKSYKKLPCYDA 66
Y +K+G G P + ++DT + L+W QC+PC C+ Q DP+++ + K+YK L C +
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S + C Y +YGD + S D TL P ++
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ-----TLPGFV 232
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
+GC +S AGI+GL + S + Q+ FS CL P + L
Sbjct: 233 YGCGQDSDGLFGRA----AGILGLGRNKLSMLGQVSSKFGYAFSYCL--PTRGGGGFLSI 286
Query: 178 GDQIIAGKSLNLPP 191
G +AG + P
Sbjct: 287 GKASLAGSAYKFTP 300
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 144/354 (40%), Gaps = 50/354 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K + +DT + + W C C C ++D +Y+ ++ + +
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214
Query: 63 CYDASCK---SPF-HCFEG-DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQ 114
C D C P C G C Y + YGD T D + + + + +P +
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTN-G 273
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL-------- 164
+ FGC + + + + GI+G ++S + QL V FS CL
Sbjct: 274 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI 333
Query: 165 ------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
V+P + H + + + G L++P ++F + ++G I D G+
Sbjct: 334 FAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAF--ESGDRKGTIIDSGT 391
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF-QG 268
L EVY L + + SQ +L T + TCF+ + FP++T HF +
Sbjct: 392 TLAYFPQEVYVPLIEKIL---SQQPDLRLHTVEQA-FTCFDYTGNVDDGFPTVTLHFDKS 447
Query: 269 ADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
L V P ++F H+ + + G K T+LG N VYDL+
Sbjct: 448 ISLTVYPHE-YLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLE 500
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/368 (23%), Positives = 144/368 (39%), Gaps = 76/368 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + IG P ++ + DT + LTW QC+PC+ CY+QN P+++ + +YK C +
Sbjct: 85 YFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKT 144
Query: 68 CKSPFHCFEG------DCFYGITYGDVYETK---EVDSLDTSTLLPPDEPSPVSVQNIRF 118
C++ EG C Y +YGD TK +++ + P +V F
Sbjct: 145 CQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTV----F 200
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC--------------- 163
GC + ++ +GI+GL S + QLG + +FS C
Sbjct: 201 GCGYNNGGTF---EETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVIN 257
Query: 164 --------------------LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
L+Q D + L + + GK+ LP LNG+
Sbjct: 258 LGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTL-EAVTVGKT-KLPYTGGGYGLNGKSS 315
Query: 204 -----CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG------VTCFNL 252
I D G+ LT+++ + F D F E + ++ CF
Sbjct: 316 KRTGNIIIDSGTTLTLLD--------SGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKS 367
Query: 253 PARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
+ P++T HF AD+ + P N F+ ++D+ + P I G Q +
Sbjct: 368 GDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVCL----SMIPTTEVAIYGNMVQMDF 423
Query: 313 QFVYDLDT 320
YDL+T
Sbjct: 424 LVGYDLET 431
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 141/371 (38%), Gaps = 58/371 (15%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-PIYNSRSFKSYKKLP 62
+ + Y++ L +G P + + LDT + L WTQC PC +C++Q P+ + + ++ +
Sbjct: 90 VTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVR 149
Query: 63 CYDASCKS-PF-HCFEG-------DCFYGITYGDVYETKEVDSLDTSTLLPPD--EPSPV 111
C C++ PF C G C Y YGD T + D T P D + V
Sbjct: 150 CDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGV 209
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNW-------------------DSTSFMVQL 152
S + + FGC +K + IAG W +STS +V L
Sbjct: 210 SERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSSLVTL 269
Query: 153 GRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR----GCINDC 208
G + VQ + ++ K++ + I QR I D
Sbjct: 270 GVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASAIIDS 329
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA----------RFNS 258
G+ +T + +VY + AEF+ +Q + CF LP+ R+
Sbjct: 330 GASITTLPEDVYEAVKAEFV---AQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWRG 386
Query: 259 --------FPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARH 308
P + +H GAD + EN ++ A T +T ++G
Sbjct: 387 RGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIGNYQ 446
Query: 309 QHNTQFVYDLD 319
Q NT VYDL+
Sbjct: 447 QQNTHVVYDLE 457
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/354 (23%), Positives = 143/354 (40%), Gaps = 51/354 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K + +DT + + W C PC C + D +Y+S++ + K +
Sbjct: 77 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVG 136
Query: 63 CYDASC----KSPFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSVQN 115
C DA C +S + C Y + YGD + D++ + +P++ Q
Sbjct: 137 CEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLA-QE 195
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------- 164
+ FGC + + + GIMG +TS + QL G V FS CL
Sbjct: 196 VVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGIF 255
Query: 165 --------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ P++ H + + G+ ++LPP+ NG G I D G+
Sbjct: 256 AIGEVESPVVKTTPLVPNQ-VHYNVILKGMDVDGEPIDLPPS--LASTNGDGGTIIDSGT 312
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA 269
L + +Y L I+ + KL ++ CF+ + + +FP + HF+ +
Sbjct: 313 TLAYLPQNLYNSL----IEKITAKQQVKLHMVQET-FACFSFTSNTDKAFPVVNLHFEDS 367
Query: 270 -DLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTI--LGARHQHNTQFVYDLD 319
L V P + +D + F + T + G + LG N VYDL+
Sbjct: 368 LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLE 421
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 79/186 (42%), Gaps = 17/186 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KLG G P +DT + L W QCQPC SCY Q DP++N + SY +PC +
Sbjct: 92 YLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDT 151
Query: 68 CKS--PFHCFE---GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C E G C Y Y TK ++D + + FGCS
Sbjct: 152 CAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI------GGDVFHAVVFGCSD 205
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQII 182
S + Q +G++GL S + QL RF CL P +L G
Sbjct: 206 SSVGGPAAQA---SGLVGLGRGPLSLVSQLS---VHRFMYCLPPPMSRTSGKLVLGAGAD 259
Query: 183 AGKSLN 188
A ++++
Sbjct: 260 AVRNMS 265
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 137/357 (38%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++GIG P K + +DT + + W C C C ++ +Y+ + + K+
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63
Query: 63 CYDASCKSPFHCFEGDCF------YGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
C C + + C Y +TYGD T V L + D + +
Sbjct: 64 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 123
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL-------- 164
+ FGC + + + + GI+G +TS + QL V F+ CL
Sbjct: 124 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGI 183
Query: 165 ------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
VQP H + + G +L LP + F ++G I D G+
Sbjct: 184 FAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMF--DTGEKKGTIIDSGT 241
Query: 211 VLTVIECEVYA-VLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF 266
LT + VY ++ A F + + H++++ CF R + FP +T+HF
Sbjct: 242 TLTYLPEIVYKEIMLAVFAKHKDITFHNVQEFL--------CFQYVGRVDDDFPKITFHF 293
Query: 267 QG-ADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ L V P + F N + + F G KG +LG N VYDL+
Sbjct: 294 ENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLE 350
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 137/356 (38%), Gaps = 52/356 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C PC C + +N + + K+P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 63 CYDASCKSPFHCFEG--------DCFYGITYGDVYETKEVDSLDT---STLLPPDEPSPV 111
C D C + E C Y TYGD T DT T++ +E +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVM-GNEQTAN 209
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL----- 164
S +I FGCS ++ + + GI G S + QL L P FS CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 269
Query: 165 ----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
V+P + H L ++ G+ L + + FT + +G I
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTIV 327
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYH 265
D G+ L + Y F++ + + + G CF + + SFP+++ +
Sbjct: 328 DSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 383
Query: 266 FQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDL 318
F G + V+PEN + + + +G+ TILG + FVYDL
Sbjct: 384 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDL 439
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/293 (26%), Positives = 128/293 (43%), Gaps = 48/293 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCY-- 64
Y++ +G+G P K L + DT + +TWTQCQPC +SCY+Q + I++ SY + C
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSS 208
Query: 65 ------DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
A+ +P C C YGI YGD + + TL D + NI F
Sbjct: 209 ICNSLTSATGNTP-GCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTD-----AFNNIYF 262
Query: 119 GCSLES--------------KDFVSI-----QK--KIIAGIMGLNWDSTSFMVQLGRLVP 157
GC + +D +S+ QK KI + + + ST F+ G
Sbjct: 263 GCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYCLPSSSSSTGFLTFGGSASK 322
Query: 158 D-RFS-CCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ +F+ + SF+ L+F + GK L + + F+ G I D G+V+T +
Sbjct: 323 NAKFTPLSTISAGPSFYG-LDFTGISVGGKKLAISASVFSTA-----GAIIDSGTVITRL 376
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ 267
Y+ L A F + S++ + K + TC++ + S P + + F
Sbjct: 377 PPAAYSALRASFRNLMSKYPMTKALSILD---TCYDFSSYTTISVPKIGFSFS 426
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 51/84 (60%), Gaps = 4/84 (4%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
+++ + IG P ++ + DT + LTWTQC PC+ C+ Q+ PI+N R SY+K+ C +
Sbjct: 90 FLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDT 149
Query: 68 CKS--PFHCFEG--DCFYGITYGD 87
C+S +HC C YG +YGD
Sbjct: 150 CRSLESYHCGPDLQSCSYGYSYGD 173
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 85/367 (23%), Positives = 132/367 (35%), Gaps = 77/367 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + +LDT + + W QC PC+ CY Q+ +++ R +SY + C
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPI 181
Query: 68 CKS--PFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C C Y + YGD T + +T T VQ + GC +
Sbjct: 182 CRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFA-----RGARVQRVAIGCGHD 236
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------VQPDKSFHSRLE 176
++ F++ + SF Q+ R FS CL V+P + S +
Sbjct: 237 NEGLFIAASGLLGL-----GRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 291
Query: 177 F-----------------------------------GDQIIAGKS---LNLPPNSFTIKL 198
F G + G S L L P +
Sbjct: 292 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT----- 346
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPA 254
G+ G I D G+ +T + VY + F + LF TC+NL
Sbjct: 347 -GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFD------TCYNLSG 399
Query: 255 -RFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
R P+++ H G V P ++ S F F A T G +I+G Q +
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD-GGVSIIGNIQQQGFR 458
Query: 314 FVYDLDT 320
V+D D
Sbjct: 459 VVFDGDA 465
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 137/357 (38%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++GIG P K + +DT + + W C C C ++ +Y+ + + K+
Sbjct: 89 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 148
Query: 63 CYDASCKSPFHCFEGDCF------YGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
C C + + C Y +TYGD T V L + D + +
Sbjct: 149 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 208
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL-------- 164
+ FGC + + + + GI+G +TS + QL V F+ CL
Sbjct: 209 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGI 268
Query: 165 ------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
VQP H + + G +L LP + F ++G I D G+
Sbjct: 269 FAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMF--DTGEKKGTIIDSGT 326
Query: 211 VLTVIECEVYA-VLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF 266
LT + VY ++ A F + + H++++ CF R + FP +T+HF
Sbjct: 327 TLTYLPEIVYKEIMLAVFAKHKDITFHNVQEFL--------CFQYVGRVDDDFPKITFHF 378
Query: 267 QG-ADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ L V P + F N + + F G KG +LG N VYDL+
Sbjct: 379 ENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLE 435
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 143/357 (40%), Gaps = 62/357 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN--DPIYNSRSFKSYKKLPCYD 65
+ + +G P + ++DT + L W QC PCK C + P++N ++ + C D
Sbjct: 68 FFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDD 127
Query: 66 ASCK-SP-FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ +P HC C Y Y +K V + + T P+ + V+ Q I FGC E
Sbjct: 128 RFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVT-QPIAFGCGHE 186
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSF-HSRLEFGDQI 181
+ + + + GI+GL TS VQLG +FS C+ +K++ +++L G+
Sbjct: 187 NGEQLESE---FTGILGLGAKPTSLAVQLG----SKFSYCIGDLANKNYGYNQLVLGEDA 239
Query: 182 --------------------------IAGKSLNLPPNSFTIKLNGQR-GCINDCGSVLTV 214
+ K LN+ P F K G R G I D G++ T
Sbjct: 240 DILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVF--KRRGSRTGVILDTGTLYTW 297
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPARFNSFPSMTYHFQ-GA 269
+ Y L E I +E+ F C V FP +T+HF GA
Sbjct: 298 LADIAYRELYNE-IKSILDPKLERFWFRDFLCYHGRVN-----EELIGFPVVTFHFAGGA 351
Query: 270 DLVVEPENVFI-FNHQDSFFFFFGPAFTPR-------KGKTILGARHQHNTQFVYDL 318
+L +E ++F D++ F + P K T +G Q YDL
Sbjct: 352 ELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDL 408
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 134/344 (38%), Gaps = 51/344 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P KS ++DT + LTW QC PC SC+ Q+ P++N ++ SY + C
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C SP C + C Y +YGD + S DT + SV N +
Sbjct: 189 QCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSVPNFYY 242
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------VQPDKS 170
GC +++ AG++GL + S + QL + FS CL
Sbjct: 243 GCGQDNEGLFGQS----AGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIG 298
Query: 171 FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC----------------INDCGSVLTV 214
++ ++ +A SL+ + + IK+ G + I D G+V+T
Sbjct: 299 SYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 356
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
+ VY+ L+ F+ TCF A P +T F G +
Sbjct: 357 LPTGVYSALSKAVAGAMKGTPRASAFSILD---TCFQGQAARLRVPEVTMAFAGGAALKL 413
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + + AF P + I+G Q VYD+
Sbjct: 414 AARNLLVDVDSATTCL---AFAPARSAAIIGNTQQQTFSVVYDV 454
>gi|361067983|gb|AEW08303.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 44/83 (53%), Gaps = 7/83 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ G+G P K+ ++DT + LTW QC+PC CY Q DPI+ R SYK LPC A+
Sbjct: 71 YIVTAGLGTPTKNFLLIIDTGSDLTWIQCKPCLDCYSQVDPIFEPRQSSSYKSLPCLSAT 130
Query: 68 CKSPF-------HCFEGDCFYGI 83
C C G C Y I
Sbjct: 131 CTELLISESNLTPCLLGGCSYEI 153
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 127/350 (36%), Gaps = 72/350 (20%)
Query: 23 FLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK----SPFHCFEGD 78
+LDT + + W QC PC+ CYEQ+ P+++ R SY + C A C+ G
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 79 CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD-FVSIQKKIIAG 137
C Y + YGD T +T T V + GC +++ FV+ +
Sbjct: 61 CMYQVAYGDGSVTAGDFVTETLTF-----AGGARVARVALGCGHDNEGLFVAAAGLLG-- 113
Query: 138 IMGLNWDSTSFMVQLGRLVPDRFSCCLV---------QPDKSFHSRLEFGDQIIAGKSLN 188
L SF Q+ R FS CLV P S + FG + S +
Sbjct: 114 ---LGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSAS 170
Query: 189 LPP-------------NSFTIKLNGQR------------------GCINDCGSVLTVIEC 217
P I + G R G I D G+ +T +
Sbjct: 171 FTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLAR 230
Query: 218 EVYAVLTAEFIDYFSQH-----DIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQ-GAD 270
Y+ L F + LF TC++L R P+++ HF GA+
Sbjct: 231 ASYSALRDAFRAAAAGGLRLSPGGFSLFD------TCYDLGGRRVVKVPTVSMHFAGGAE 284
Query: 271 LVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ PEN I + + +F F F G +I+G Q + V+D D
Sbjct: 285 AALPPENYLIPVDSRGTFCFAFA---GTDGGVSIIGNIQQQGFRVVFDGD 331
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/356 (21%), Positives = 128/356 (35%), Gaps = 45/356 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDP-IYNSRSFKSYKKLPCYDA 66
Y + L IG P +SL + DT + L W +C C++C + ++ R ++ CYD
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 143
Query: 67 SC-------KSPF--HC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C ++P H C Y Y D T + + +T++ L ++++
Sbjct: 144 VCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTS-LKTSSGKEARLKSV 202
Query: 117 RFGCS--LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-----PDK 169
FGC + + G+MGL SF QLGR ++FS CL+ P
Sbjct: 203 AFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPT 262
Query: 170 SFHSRLEFGDQI--------------------------IAGKSLNLPPNSFTIKLNGQRG 203
S+ GD I + G L + P+ + I +G G
Sbjct: 263 SYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGG 322
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMT 263
+ D G+ L + Y + A + L V + P +
Sbjct: 323 TVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLK 382
Query: 264 YHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ F G + V P + ++ + P+ G +++G Q F +D D
Sbjct: 383 FEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRD 438
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/335 (24%), Positives = 127/335 (37%), Gaps = 48/335 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +G+G P +LDT + + W QC PC+ CY Q+ +++ R +SY + C
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPP 201
Query: 68 C-------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C G C Y + YGD T L T TL V + GC
Sbjct: 202 CRGLDAGGGGGCDRRRGTCLYQVAYGDGSVT--AGDLATETLW---FARGARVPRVAVGC 256
Query: 121 SLESKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
+++ FV+ + S Q R RFS C D + +
Sbjct: 257 GHDNEGLFVAAAGLLGL-----GRGRLSLPTQTARRYGRRFSYCFQGSDLDHRTIIRTVH 311
Query: 180 QIIAG--------KSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYF 231
Q + G +SL L P++ G+ G I D G+ +T + VY + F
Sbjct: 312 QHVGGARVRGVGERSLRLDPST------GRGGVILDSGTSVTRLARPVYVAVREAFRAAA 365
Query: 232 SQHDIE----KLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQ-GADLVVEPENVFI-FNHQ 284
+ LF TC++L R P+++ H GA++ + PEN I + +
Sbjct: 366 GGLRLAPGGFSLFD------TCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTR 419
Query: 285 DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+F G +I+G Q + V+D D
Sbjct: 420 GTFCLALA---GTDGGVSIVGNIQQQGFRVVFDGD 451
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 137/356 (38%), Gaps = 52/356 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C PC C + +N + + K+P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 63 CYDASCKSPFHCFEG--------DCFYGITYGDVYETKEVDSLDT---STLLPPDEPSPV 111
C D C + E C Y TYGD T DT T++ +E +
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVM-GNEQTAN 235
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL----- 164
S +I FGCS ++ + + GI G S + QL L P FS CL
Sbjct: 236 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 295
Query: 165 ----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
V+P + H L ++ G+ L + + FT + +G I
Sbjct: 296 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTIV 353
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYH 265
D G+ L + Y F++ + + + G CF + + SFP+++ +
Sbjct: 354 DSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 409
Query: 266 FQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDL 318
F G + V+PEN + + + +G+ TILG + FVYDL
Sbjct: 410 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDL 465
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/367 (22%), Positives = 150/367 (40%), Gaps = 65/367 (17%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N T + L +G P +++ +LDT + L+W C+ + +P+ +S SY
Sbjct: 54 FHHNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSS----SYTPT 109
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
PC + C + P C + C ++Y D + + +T +L +P +
Sbjct: 110 PCNSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTL- 168
Query: 113 VQNIRFGCSLESKDFVS--IQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD-- 168
FGC ++S + S + G+MG+N S S + Q+ +P +FS C+ D
Sbjct: 169 -----FGC-MDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMS--LP-KFSYCISGEDAL 219
Query: 169 --------------------------KSFHSRLEFGDQI----IAGKSLNLPPNSFTIKL 198
+ +R+ + Q+ ++ K L LP + F
Sbjct: 220 GVLLLGDGTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDH 279
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPA 254
G + D G+ T + VY+ L EF++ ++ + ++ F C++ PA
Sbjct: 280 TGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQ-TKGVLTRIEDPNFVFEGAMDLCYHAPA 338
Query: 255 RFNSFPSMTYHFQGADLVVEPENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHN 311
F + P++T F GA++ V E + + S + F FG + ++G HQ N
Sbjct: 339 SFAAVPAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQN 398
Query: 312 TQFVYDL 318
+DL
Sbjct: 399 VWMEFDL 405
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 119/312 (38%), Gaps = 67/312 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
Y++ +G+G P + ++DT + ++W QC+PC + C+ +++ + +Y C
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194
Query: 65 DASCKSPFHCFEGD-------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
A+C E + C Y + YGD T S D TL D V+ +
Sbjct: 195 AAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSD-----VVRGFQ 249
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKS------ 170
FGCS + + G++GL D+ S + Q FS CL P S
Sbjct: 250 FGCS--HAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTLG 307
Query: 171 ---------------------------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
+ + LE D + GK L L P+ F G
Sbjct: 308 APASGGGGGASRFATTPMLRSKKVPTYYFAALE--DIAVGGKKLGLSPSVFAA------G 359
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFP 260
+ D G+V+T + YA L++ F + + + G+ TCFN S P
Sbjct: 360 SLVDSGTVITRLPPAAYAALSSAF-----RAGMTRYARAEPLGILDTCFNFTGLDKVSIP 414
Query: 261 SMTYHFQGADLV 272
++ F G +V
Sbjct: 415 TVALVFAGGAVV 426
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 136/335 (40%), Gaps = 36/335 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-------PCKSCYEQNDPIYNSRSFKSYKK 60
Y++ + +G P +S+ + DT + L W +C+ + Q DP SRS +Y +
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP---SRS-STYGR 156
Query: 61 LPCYDASCKS--PFHCFEG-DCFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQ 114
+ C +C++ C +G +C Y YGD T V S +T T P V +
Sbjct: 157 VSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSPRQVRIG 216
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSFH 172
++FGCS + + + S + QLG + RFS CLV +
Sbjct: 217 GVKFGCSTATAGSFPADGLVGL-----GGGAVSLVTQLGGATSLGRRFSYCLVPHSVNAS 271
Query: 173 SRLEFG---DQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFID 229
S L FG D G + + T+ I D G+ LT ++ + + E
Sbjct: 272 SALNFGALADVTEPGAASTPLVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSR 331
Query: 230 YFSQHDIEKLFTCRKCGVTCFNLPAR----FNSFPSMTYHF-QGADLVVEPENVFIFNHQ 284
+ ++ + C+N+ R S P +T F GA + ++PEN F+ +
Sbjct: 332 RITLPPVQSPDGLLQL---CYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQE 388
Query: 285 DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ A T ++ +ILG Q N YDLD
Sbjct: 389 GTLCLAI-VATTEQQPVSILGNLAQQNIHVGYDLD 422
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 134/357 (37%), Gaps = 52/357 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G+P K + +DT + + W C PC C + +N S + ++
Sbjct: 5 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64
Query: 63 CYDASCKSPFHCFEG----------DCFYGITYGDVYETKEVDSLDTS--TLLPPDEPSP 110
C D C + F E C Y TYGD T DT + +E +
Sbjct: 65 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 124
Query: 111 VSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL---- 164
S +I FGCS ++ + + GI G S + QL L P FS CL
Sbjct: 125 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSD 184
Query: 165 -----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
V+P + H L + IA LP +S + +G I
Sbjct: 185 NGGGILVLGEIVEPGLVYTPLVPSQPHYNLNL--ESIAVNGQKLPIDSSLFTTSNTQGTI 242
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+ L + Y F+ + + + G CF + + SFP++T
Sbjct: 243 VDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTL 298
Query: 265 HFQGA-DLVVEPENVFI--FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F G + V+PEN + + +S + G + TILG + FVYDL
Sbjct: 299 YFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDL 355
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 135/355 (38%), Gaps = 50/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C PC C + +N + + K+P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 63 CYDASCKSPFHCFEG--------DCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVS 112
C D C + E C Y TYGD T DT + +E + S
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANS 210
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL------ 164
+I FGCS ++ + + GI G S + QL L P FS CL
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNG 270
Query: 165 ---------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
V+P + H L ++ G+ L + + FT + +G I D
Sbjct: 271 GGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTIVD 328
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHF 266
G+ L + Y F++ + + + G CF + + SFP+++ +F
Sbjct: 329 SGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLYF 384
Query: 267 QGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDL 318
G + V+PEN + + + +G+ TILG + FVYDL
Sbjct: 385 MGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDL 439
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/331 (22%), Positives = 126/331 (38%), Gaps = 48/331 (14%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSF 55
+ T Y ++GIG P K + +DT + + W C C C +++ +Y+ R
Sbjct: 83 LATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGS 142
Query: 56 KSYKKLPCYDASCKS------PFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPD 106
+S + + C C + P C Y I+YGD T D L + +
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 107 EPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL 164
+ +P + ++ FGC + + + GI+G ++S + QL V F+ CL
Sbjct: 203 QTTPANA-SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL 261
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
VQP H + + G +L LP N F +
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIF--DSGNSK 319
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPS 261
G I D G+ L + VY L A D ++ L +CF + FP
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQD-----FSCFQYSGSVDDGFPE 374
Query: 262 MTYHFQG-ADLVVEPENVFIFNHQDSFFFFF 291
+T+HF+G L+V P + N ++ + F
Sbjct: 375 VTFHFEGDVSLIVSPHDYLFQNGKNLYCMGF 405
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 136/356 (38%), Gaps = 69/356 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
YM L IG P + ++ WTQC PC+ C++Q+ P++N + +Y+ PC A
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87
Query: 68 CKS-PFHCFEGD--CFYGI--TYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+S P GD C Y + +GD D+ T + ++ FGC++
Sbjct: 88 CESVPASTCSGDGVCSYEVETMFGDTSGIGGTDTFAIGT----------ATASLAFGCAM 137
Query: 123 ESKDFVSIQKKIIA-GIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLEFG 178
+S I++ + A G++GL S +G++ FS CL K L
Sbjct: 138 DSN----IKQLLGASGVVGLGRTPWSL---VGQMNATAFSYCLAPHGAAGKKSALLLGAS 190
Query: 179 DQIIAGKSLNLPP--------NSFTIKLNGQRGCINDC-------GSVLTVIECEVYAVL 223
++ GKS P + + I L G + D GSV+ V +
Sbjct: 191 AKLAGGKSAATTPLVNTSDDSSDYMIHLEGIK--FGDVIIAPPPNGSVVL-----VDTIF 243
Query: 224 TAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR--------------FNS---FPSMTYHF 266
F+ + I+K T G P + NS P + F
Sbjct: 244 GVSFLVDAAFQAIKKAVTV-AVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTF 302
Query: 267 QGADLVVEPENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
QGA + P + ++++ + +ILG HQ N F++DLD
Sbjct: 303 QGAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLD 358
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/352 (23%), Positives = 135/352 (38%), Gaps = 45/352 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K +DT + + W C C +C + + +++ + +P
Sbjct: 78 YYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIP 137
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGDVYETKEVDSLDTS--TLLPPDEPSPVSV 113
C D C S + C Y YGD T D +L+ P+ S
Sbjct: 138 CSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSS 197
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
I FGCS+ ++ K + GI G S + QL + P FS CL
Sbjct: 198 ATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGG 257
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
++P + H L + G+ L + P F+I N + G I DC
Sbjct: 258 GVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSIS-NNRGGTIVDC 316
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ- 267
G+ L + E Y L SQ + +C + ++ + FPS++ +F+
Sbjct: 317 GTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQCYLVSTSIG---DIFPSVSLNFEG 373
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTP-RKGKTILGARHQHNTQFVYDL 318
GA +V++PE + N + F ++G +ILG + VYD+
Sbjct: 374 GASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDI 425
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 135/357 (37%), Gaps = 52/357 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G+P K + +DT + + W C PC C + +N S + ++
Sbjct: 91 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150
Query: 63 CYDASCKSPFHCFEG----------DCFYGITYGDVYETKEVDSLDT---STLLPPDEPS 109
C D C + F E C Y TYGD T DT T++ +E +
Sbjct: 151 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVM-GNEQT 209
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL--- 164
S +I FGCS ++ + + GI G S + QL L P FS CL
Sbjct: 210 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGS 269
Query: 165 ------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC 204
V+P + H L + IA LP +S + +G
Sbjct: 270 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNL--ESIAVNGQKLPIDSSLFTTSNTQGT 327
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTY 264
I D G+ L + Y + S + +C +T ++ + SFP++T
Sbjct: 328 IVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDS---SFPTVTL 384
Query: 265 HFQGA-DLVVEPENVFIFNHQ--DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F G + V+PEN + +S + G + TILG + FVYDL
Sbjct: 385 YFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDL 441
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 135/357 (37%), Gaps = 52/357 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G+P K + +DT + + W C PC C + +N S + ++
Sbjct: 89 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148
Query: 63 CYDASCKSPFHCFEG----------DCFYGITYGDVYETKEVDSLDT---STLLPPDEPS 109
C D C + F E C Y TYGD T DT T++ +E +
Sbjct: 149 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVM-GNEQT 207
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL--- 164
S +I FGCS ++ + + GI G S + QL L P FS CL
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGS 267
Query: 165 ------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC 204
V+P + H L + IA LP +S + +G
Sbjct: 268 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNL--ESIAVNGQKLPIDSSLFTTSNTQGT 325
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTY 264
I D G+ L + Y + S + +C +T ++ + SFP++T
Sbjct: 326 IVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDS---SFPTVTL 382
Query: 265 HFQGA-DLVVEPENVFIFNHQ--DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F G + V+PEN + +S + G + TILG + FVYDL
Sbjct: 383 YFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDL 439
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 137/364 (37%), Gaps = 68/364 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + L IG P + L DT + L WTQC PC C + P + S ++ KLPC +
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 68 CK---SPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ SP+ C C Y YG + L T TL S + FGCS E
Sbjct: 150 CQFLTSPYLTCNATGCVYYYPYGMGFTAGY---LATETL----HVGGASFPGVAFGCSTE 202
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD--QI 181
+ +GI+GL S + Q+G RFS CL + S + FG ++
Sbjct: 203 NG-----VGNSSSGIVGLGRSPLSLVSQVGV---GRFSYCLRSDADAGDSPILFGSLAKV 254
Query: 182 IAGKSLNLP-------PNS--FTIKLNG-------------------------QRGCIND 207
G + P P+S + + L G G I D
Sbjct: 255 TGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVD 314
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNS----FPSM 262
G+ LT + E YA++ F+ + ++ + G CF+ A P++
Sbjct: 315 SGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTL 374
Query: 263 TYHFQ-GADLVVEPEN---VFIFNHQDSFF---FFFGPAFTPRKGKTILGARHQHNTQFV 315
F GA+ V + V + Q PA + + +I+G Q + +
Sbjct: 375 VLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPA-SEKLSISIIGNVMQMDLHVL 433
Query: 316 YDLD 319
YDLD
Sbjct: 434 YDLD 437
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/367 (23%), Positives = 132/367 (35%), Gaps = 77/367 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + +LDT + + W QC PC+ CY Q+ +++ R +SY + C
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPI 181
Query: 68 CKS--PFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C C Y + YGD T + +T T VQ + GC +
Sbjct: 182 CRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFA-----RGARVQRVAIGCGHD 236
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------VQPDKSFHSRLE 176
++ F++ + SF Q+ R FS CL V+P + S +
Sbjct: 237 NEGLFIAASGLLGL-----GRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 291
Query: 177 F-----------------------------------GDQIIAGKS---LNLPPNSFTIKL 198
F G + G S L L P +
Sbjct: 292 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT----- 346
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPA 254
G+ G I D G+ +T + VY + F + LF TC+NL
Sbjct: 347 -GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFD------TCYNLSG 399
Query: 255 -RFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
R P+++ H G V P ++ S F F A T G +I+G Q +
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD-GGVSIIGNIQQQGFR 458
Query: 314 FVYDLDT 320
V+D D
Sbjct: 459 VVFDGDA 465
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 145/355 (40%), Gaps = 66/355 (18%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F + +++ + G P + +LDT + +TWTQC+ C +C + ++ ++S + +Y
Sbjct: 121 LFDEDGNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSF 180
Query: 61 LPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C ++ ++ Y +TYGD + DT TL EPS V Q +FGC
Sbjct: 181 GSCIPSTVENN---------YNMTYGDDSTSVGNYGCDTMTL----EPSDV-FQKFQFGC 226
Query: 121 SLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK---------- 169
+K DF S + G++GL S + Q FS CL + D
Sbjct: 227 GRNNKGDFGS----GVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKA 282
Query: 170 -SFHSRLEF--------------------GDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
S S L+F D + + LN+P + F G I D
Sbjct: 283 TSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFA-----SPGTIIDS 337
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG---VTCFNLPARFNS-FPSMTY 264
+V+T + Y+ L A F +++ + RK G TC+NL R + P +
Sbjct: 338 RTVITRLPQRAYSALKAAFKKAMAKYPLSN--GRRKKGDILDTCYNLSGRKDVLLPEIVL 395
Query: 265 HF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF GAD+ + N+ + AF TI+G R Q + +YD+
Sbjct: 396 HFGGGADVRLNGTNIVWGSDASRLCL----AFAGTSELTIIGNRQQLSLTVLYDI 446
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/363 (23%), Positives = 133/363 (36%), Gaps = 63/363 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P + ++DT + L W QC PC C++Q P+++ + SY+ + C D
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQR 210
Query: 68 C------KSPFHCF---EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C + P C E C Y YGD T +L++ T+ + V ++ F
Sbjct: 211 CGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVF 270
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ ++ SF QL + FS CLV S++ FG
Sbjct: 271 GCGHWNRGLFHGAAGLLGLGR----GPLSFASQLRAVYGHTFSYCLVDHGSDVASKVVFG 326
Query: 179 DQIIAGKSLNLPPNSFT--------------IKLNG-----------------------Q 201
+ + P ++T +KL G
Sbjct: 327 EDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGS 386
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQH-----DIEKLFTCRKCGVTCFNLPARF 256
G I D G+ L+ Y V+ FID + D L C V+ + P
Sbjct: 387 GGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYN--VSGVDRP--- 441
Query: 257 NSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
P ++ F GA EN FI D TPR G +I+G Q N V
Sbjct: 442 -EVPELSLLFADGAVWDFPAENYFIRLDPDG-IMCLAVLGTPRTGMSIIGNFQQQNFHVV 499
Query: 316 YDL 318
YDL
Sbjct: 500 YDL 502
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 111/297 (37%), Gaps = 55/297 (18%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASC--- 68
I DP+ + +DT L W QC PC CY Q + +++ R ++ +PC A+C
Sbjct: 139 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 198
Query: 69 -KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
+ C C Y + YGD T +D TL PS V V N RFGCS +
Sbjct: 199 GRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL----NPSTV-VMNFRFGCSHAVRGN 253
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS----------------- 170
S +G M L S + Q + FS C+ P S
Sbjct: 254 FSAST---SGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRF 310
Query: 171 FHSRLEFGDQII-------------AGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ L II G+ LN+PP F G + D ++T +
Sbjct: 311 ARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPP 364
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHFQGADLV 272
Y L F + + ++ R TC++ RF S P+++ F G +V
Sbjct: 365 TAYRALRLAFRSAMAAY--PRVAGGRAGLDTCYDF-VRFTSVTVPAVSLVFDGGAVV 418
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 111/297 (37%), Gaps = 55/297 (18%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASC--- 68
I DP+ + +DT L W QC PC CY Q + +++ R ++ +PC A+C
Sbjct: 155 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 214
Query: 69 -KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
+ C C Y + YGD T +D TL PS V V N RFGCS +
Sbjct: 215 GRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL----NPSTV-VMNFRFGCSHAVRGN 269
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS----------------- 170
S +G M L S + Q + FS C+ P S
Sbjct: 270 FSAST---SGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRF 326
Query: 171 FHSRLEFGDQII-------------AGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ L II G+ LN+PP F G + D ++T +
Sbjct: 327 ARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPP 380
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHFQGADLV 272
Y L F + + ++ R TC++ RF S P+++ F G +V
Sbjct: 381 TAYRALRLAFRSAMAAY--PRVAGGRAGLDTCYDF-VRFTSVTVPAVSLVFDGGAVV 434
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/367 (23%), Positives = 132/367 (35%), Gaps = 77/367 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + +LDT + + W QC PC+ CY Q+ +++ R +SY + C
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPI 187
Query: 68 CKS--PFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C C Y + YGD T + +T T VQ + GC +
Sbjct: 188 CRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFA-----RGARVQRVAIGCGHD 242
Query: 124 SKD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL------VQPDKSFHSRLE 176
++ F++ + SF Q+ R FS CL V+P + S +
Sbjct: 243 NEGLFIAASGLLGL-----GRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 297
Query: 177 F-----------------------------------GDQIIAGKS---LNLPPNSFTIKL 198
F G + G S L L P +
Sbjct: 298 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT----- 352
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE----KLFTCRKCGVTCFNLPA 254
G+ G I D G+ +T + VY + F + LF TC+NL
Sbjct: 353 -GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFD------TCYNLSG 405
Query: 255 -RFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
R P+++ H G V P ++ S F F A T G +I+G Q +
Sbjct: 406 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD-GGVSIIGNIQQQGFR 464
Query: 314 FVYDLDT 320
V+D D
Sbjct: 465 VVFDGDA 471
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 136/356 (38%), Gaps = 49/356 (13%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDP----IYNSRSFK 56
+ T + Y++ + +G P L + DT + L W C D ++
Sbjct: 96 IITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSS 155
Query: 57 SYKKLPCYDASCK--SPFHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
+Y +L C +C+ S C + +C Y +YGD T V S +T + + V V
Sbjct: 156 TYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRV 215
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQP-DKS 170
+ FGCS S G++GL + S + QLG + + S CL+ D +
Sbjct: 216 PRVNFGCSTASAGTFRSD-----GLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDAN 270
Query: 171 FHSRLEFGDQII----AGKSLNLPPNSF---------TIKLNGQRGC------INDCGSV 211
S L FG + + S L P+ ++ + GQ I D G+
Sbjct: 271 SSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHDSRIIVDSGTT 330
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDI---EKLFTCRKCGVTCFNLPARFNS----FPSMTY 264
LT ++ + L E + E+L C+++ + + P +T
Sbjct: 331 LTFLDPALLGPLVTELERRIKLQRVQPPEQLLQL------CYDVQGKSETDNFGIPDVTL 384
Query: 265 HF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F GA + + PEN F + + P + + +ILG Q N YDLD
Sbjct: 385 RFGGGAAVTLRPENTFSLLQEGTLCLVLVP-VSESQPVSILGNIAQQNFHVGYDLD 439
>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 67/341 (19%), Positives = 132/341 (38%), Gaps = 39/341 (11%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
H + +G G + LDT A +W C+PC+ Q +++ +++ + D
Sbjct: 66 HGVFVSIGTGQGGRRKILALDTAASTSWVMCEPCRPPLHQLGRLFSPAESPTFRGVRRDD 125
Query: 66 ASCKSPFHCFEG--DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C P+H C + Y + DT L + S+ + FGC+
Sbjct: 126 PVCVPPYHRLHSTNGCSFAFPSAIGYLAR-----DTFHLRHSERSVVKSISGVAFGCAHT 180
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR--LEFGDQI 181
+ F + + I+ G++ L+ SF+ Q G RFS CL P S + ++FG ++
Sbjct: 181 TTGFYN--EDILGGVLSLSPSPLSFLTQFGSRAGGRFSYCLPDPTTSHNPSGFIQFGIEV 238
Query: 182 IA------GKSLNLPPNSFTIKLNG----------------QRGCINDCGSVLTVIECEV 219
+ +L + + + + L G GC + +T I
Sbjct: 239 PSLPRHAHTTTLTVSASGYHLSLIGISLGNKRLDIDRHILTSHGCSINPAETITKIAEPA 298
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMTYHFQ-GADLVVEPEN 277
Y ++ E + ++ +++ + + R + P+M +HF G D+
Sbjct: 299 YIIVARELMAQMNELGSKQVKGPPSSPLVFNKISRRVRARLPNMVFHFADGGDMWFTAGK 358
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F + F G +T++GA Q N +F++++
Sbjct: 359 LFQVIGTTARFLVEGHG----SHRTVIGAAQQVNARFIFNV 395
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 72/176 (40%), Gaps = 17/176 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++KLGIG P +DT + L W QCQPC SCY Q DPI+N R SY +PC +
Sbjct: 88 YLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDT 147
Query: 68 CKS--PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C E D C Y Y T ++D + + GCS
Sbjct: 148 CSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV------GGNVFHAVVLGCSD 201
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
S V +G++GL S + QL RF CL P +L G
Sbjct: 202 SS---VGGPPPQASGLVGLARGPLSLLSQLSV---RRFMYCLPPPMSRTPGKLVLG 251
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 142/360 (39%), Gaps = 53/360 (14%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDP--IYNSRSFKSY 58
+ T + Y++ + +G P + + DT + L W C +D +++ +Y
Sbjct: 93 IITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTY 152
Query: 59 KKLPCYDASCK--SPFHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSP--VSV 113
L C A+C+ S C + +C Y YGD T V S +T + V V
Sbjct: 153 SLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRV 212
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSF 171
+ FGCS S S + G++GL + S + QLG + RFS CLV P +
Sbjct: 213 PRVSFGCSTGSAG--SFRSD---GLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAA 267
Query: 172 HSR--LEFGDQII----AGKSLNLPPNS----FTIKL-------------NGQRGCINDC 208
+S L FG + + S L P+ +T+ L N R I D
Sbjct: 268 NSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSR-IIVDS 326
Query: 209 GSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNS----FPS 261
G+ LT ++ + L AE I E+L C+++ + + P
Sbjct: 327 GTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQL------CYDVQGKSQAEDFGIPD 380
Query: 262 MTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+T F GA + + PEN F + + P + + +ILG Q N YDLD
Sbjct: 381 VTLRFGGGASVTLRPENTFSLLEEGTLCLVLVP-VSESQPVSILGNIAQQNFHVGYDLDA 439
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/334 (23%), Positives = 131/334 (39%), Gaps = 45/334 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L IG P + + LDT + L WTQCQPC +C++Q P ++ + + C
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 148
Query: 68 CKS-PFHCF----------EGDCFYGITYG------DVYETKEVDSLDTSTLLPPDEPSP 110
C+ P G G+ +G V+++ E + P PS
Sbjct: 149 CQGLPVASLPRSDKFTFVGAGASVPGVAFGCGLFNNGVFKSNET-GIAGFGRGPLSLPSQ 207
Query: 111 VSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS 170
+ V N S F +I I + ++ L+ + F G + + + + P
Sbjct: 208 LKVGNF-------SHCFTTITGAIPSTVL-LDLPADLFSNGQGAV---QTTPLIQNPANP 256
Query: 171 FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEF--- 227
L + L +P + F +K NG G I D G+ +T + VY ++ F
Sbjct: 257 TFYYLSLKGITVGSTRLPVPESEFALK-NGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQ 315
Query: 228 --IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF-PSMTYHFQGADLVVEPENVFIFNHQ 284
+ S + + F C + P R + P + HF+GA + + EN ++F +
Sbjct: 316 VKLPVVSGNTTDPYF--------CLSAPLRAKPYVPKLVLHFEGATMDLPREN-YVFEVE 366
Query: 285 DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
D+ A T +G Q N +YDL
Sbjct: 367 DAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDL 400
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 82/363 (22%), Positives = 133/363 (36%), Gaps = 67/363 (18%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
TY++ IG P +L +LDT + L WTQC PC+ C+ Q P+Y +Y + C
Sbjct: 99 TYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGS 158
Query: 66 ASCKS---------------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSP 110
C + G C Y +YGD T V + +T T +
Sbjct: 159 RLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF-----GAG 213
Query: 111 VSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG----------------- 153
+V ++ FGC ++ +G++G+ S + QLG
Sbjct: 214 TTVHDLAFGCGTDNLGGTDNS----SGLVGMGRGPLSLVSQLGVTKFSYCFTPFNDTTTS 269
Query: 154 ---------RLVPDRFSCCLV----QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNG 200
L P S V P +S + L + L + P F + +G
Sbjct: 270 SPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASG 329
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPARFN-- 257
+ G I D G+ T +E + VL + L + G++ CF P
Sbjct: 330 RGGLIIDSGTTFTALEERAFVVLARA----VAARVALPLASGAHLGLSVCFAAPQGRGPE 385
Query: 258 --SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
P + HF GAD+ + P + + + + G +G ++LG+ Q N
Sbjct: 386 AVDVPRLVLHFDGADMEL-PRSSAVVEDRVAGVACLG--IVSARGMSVLGSMQQQNMHVR 442
Query: 316 YDL 318
YD+
Sbjct: 443 YDV 445
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 80/339 (23%), Positives = 128/339 (37%), Gaps = 70/339 (20%)
Query: 24 LLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFH---CFE 76
++D+ + ++W QC+PC C+ Q DP+++ +Y +PC A+C P+
Sbjct: 171 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 230
Query: 77 GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIA 136
C +GI YGD S D TL P D ++ RFGC+ D S +A
Sbjct: 231 AQCQFGINYGDGSTATGTYSFDDLTLGPYDV-----IRGFRFGCA--HADRGSAFDYDVA 283
Query: 137 GIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------VQPDK-----SFHS----- 173
G + L S S + Q FS CL V P++ SF S
Sbjct: 284 GSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLS 343
Query: 174 --------RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTA 225
R+ I+AG+ L +PP F+ + D ++++ + Y L A
Sbjct: 344 SSMAPTFYRVLLRAIIVAGRPLAVPPAVFSAS------SVIDSSTIISRLPPTAYQALRA 397
Query: 226 EFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLVVEPENVFIFNHQ 284
F + + + TC++ R + PS+ F G V +
Sbjct: 398 AFRSAMTMYRAAPPVSILD---TCYDFTGVRSITLPSIALVFDGGATV----------NL 444
Query: 285 DSFFFFFGP--AFTPRKGKTI---LGARHQHNTQFVYDL 318
D+ G AF P + +G Q + VYD+
Sbjct: 445 DAAGILLGSCLAFAPTASDRMPGFIGNVQQKTLEVVYDV 483
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 134/344 (38%), Gaps = 51/344 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P KS ++DT + LTW QC PC SC+ Q+ P++N ++ SY + C
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C + C Y +YGD + S DT + SV N +
Sbjct: 189 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSVPNFYY 242
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------VQPDKS 170
GC +++ AG++GL + S + QL + FS CL
Sbjct: 243 GCGQDNEGLFGQS----AGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIG 298
Query: 171 FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC----------------INDCGSVLTV 214
++ ++ +A SL+ + + IK+ G + I D G+V+T
Sbjct: 299 SYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 356
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
+ VY+ L+ F+ TCF A P +T F G +
Sbjct: 357 LPTGVYSALSKAVAGAMKGTPRASAFSILD---TCFQGQAARLRVPEVTMAFAGGAALKL 413
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + + AF P + I+G Q VYD+
Sbjct: 414 AARNLLVDVDSATTCL---AFAPARSAAIIGNTQQQTFSVVYDV 454
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 141/349 (40%), Gaps = 57/349 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++K+ +G P SL LDT + +TWTQC+PC SCY Q ++ R SYK + C +
Sbjct: 45 YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSS 104
Query: 67 SCK------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
SC+ C C Y + YGD + + + T+ P D + N FGC
Sbjct: 105 SCRIITDSGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDV-----ISNFLFGC 159
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
++ + IAG++GL S +Q + F+ CL S L G Q
Sbjct: 160 GQQNAG----RFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQ 215
Query: 181 IIAGKSLNLPPNS--------FTIKLNG----------------QRGCINDCGSVLTVIE 216
+ KS+ P S + I + G G I D G+V+T ++
Sbjct: 216 VP--KSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQ 273
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEP 275
VY+ L+++F + F+ TC++ + S P +++ F+G VE
Sbjct: 274 PTVYSALSSKFQQLMKDYPKTDGFSILD---TCYDFSGNESISVPRISFFFKGG---VEV 327
Query: 276 ENVF-----IFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
+ F + N D F P G + G Q V+DL
Sbjct: 328 DIKFFGILTVINAWDKVCLAFAP--NDDDGDFVVFGNSQQQTYDVVHDL 374
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 137/350 (39%), Gaps = 64/350 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P + + DT +G+TWTQCQPC SCY Q + ++ SY + C A
Sbjct: 135 YVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSA 194
Query: 67 SCK----SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
SC S C + C Y I YGD ++ + +T T+ D N FGC
Sbjct: 195 SCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSD-----VFTNFLFGC 249
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+ AG++GL+ S S Q +FS CL S L FG +
Sbjct: 250 GQSNNGLFGQA----AGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSS-TGYLNFGGK 304
Query: 181 I-------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ +AG L + P+ FT G I D G+V+T +
Sbjct: 305 VSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTT-----SGAIIDSGTVITRL 359
Query: 216 ECEVYAVLTAEFIDYFSQH---DIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA-D 270
Y L F + S + + ++L TC++ SFP ++ F+G +
Sbjct: 360 PPTAYKALKEAFDEKMSNYPKTNGDELLD------TCYDFSNYTTVSFPKVSVSFKGGVE 413
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQFVYD 317
+ ++ + + AF K + I G Q + VYD
Sbjct: 414 VDIDASGILYLVNGVKMVCL---AFAANKDDSEFGIFGNHQQKTYEVVYD 460
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 132/356 (37%), Gaps = 69/356 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + + ++D L WTQC Q C+ C++Q+ P++++ + +++ PC A
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 67 SCKS-----------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C+S +E +G T G + D++ T +
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGRTVGRI----GTDAVAIGT---------AATAR 157
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
+ FGC++ S+ +G +GL + S Q+ FS CL PD S L
Sbjct: 158 LAFGCAVASEMDTMWGS---SGSVGLGRTNLSLAAQMNATA---FSYCLAPPDTGKSSAL 211
Query: 176 EFGDQII---AGKSL--------NLPPN-----SFTIKLNGQRG-----CINDCGSVLTV 214
G AGK + PPN S+ ++L R + G+ +TV
Sbjct: 212 FLGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQSGNTITV 271
Query: 215 IEC--------EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
VY L D + CF + P + F
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDL---CFPKASASGGAPDLVLAF 328
Query: 267 QGADLVVEPENVFIF---NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
QG + P + ++F N PA G +ILG+ Q N ++DLD
Sbjct: 329 QGGAEMTVPVSSYLFDAGNDTACVAILGSPAL---GGVSILGSLQQVNIHLLFDLD 381
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 139/355 (39%), Gaps = 65/355 (18%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYN---SRSFKSYKKLP 62
T M + IG P ++DT + + W C PC +C +++ S +F K P
Sbjct: 99 RTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTP 158
Query: 63 CYDASCKS---PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C C+ PF +TY D DT DE + + ++ FG
Sbjct: 159 CDFEGCRCDPIPFT---------VTYADNSTASGTFGRDTVVFETTDEGTS-RISDVLFG 208
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPDKSFHSRLE 176
C + GI+GLN S + +LG+ +FS C L P ++H +L
Sbjct: 209 C---GHNIGHDTDPGHNGILGLNNGPDSLVTKLGQ----KFSYCIGNLADPYYNYH-QLI 260
Query: 177 FGDQI------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
G+ + K L++ P +F +K N G I D GS +
Sbjct: 261 LGEGADLEGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTI 320
Query: 213 TVIECEVYAVLTAEFIDY----FSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYHF 266
T + V+ +L+ E + F Q IEK + CF ++ FP +T+HF
Sbjct: 321 TFLVDSVHKLLSKEVRNLLGWSFRQATIEK-----SPWMQCFYGSISRDLVGFPVVTFHF 375
Query: 267 Q-GADLVVEPENVFIFNHQDSFFFFFGP--AFTPRKGKTILGARHQHNTQFVYDL 318
GADL ++ + F + + F GP + + +++G Q + YDL
Sbjct: 376 SDGADLALDSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDL 430
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/367 (22%), Positives = 147/367 (40%), Gaps = 65/367 (17%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N T + L IG P +++ +LDT + L+W C+ + +P+ +S SY
Sbjct: 53 FQHNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSS----SYTPT 108
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
PC + C + P C + C ++Y D + + +T +L +P +
Sbjct: 109 PCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTL- 167
Query: 113 VQNIRFGCSLESKDFVS--IQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD-- 168
FGC ++S + S + G+MG+N S S + Q +V +FS C+ D
Sbjct: 168 -----FGC-MDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQ---MVLPKFSYCISGEDAF 218
Query: 169 --------------------------KSFHSRLEFGDQI----IAGKSLNLPPNSFTIKL 198
+ R+ + Q+ ++ K L LP + F
Sbjct: 219 GVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDH 278
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPA 254
G + D G+ T + VY L EF++ ++ + ++ F C++ PA
Sbjct: 279 TGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQ-TKGVLTRIEDPNFVFEGAMDLCYHAPA 337
Query: 255 RFNSFPSMTYHFQGADLVVEPENVF--IFNHQDSFF-FFFGPAFTPRKGKTILGARHQHN 311
+ P++T F GA++ V E + + +D + F FG + ++G HQ N
Sbjct: 338 SLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQN 397
Query: 312 TQFVYDL 318
+DL
Sbjct: 398 VWMEFDL 404
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 100/248 (40%), Gaps = 28/248 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ L +G P + + LDT + L WTQC PC+ C++Q P+ + + +Y LPC
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPCGAPR 145
Query: 68 CKS-PF-HCFEGDCFYGITYGDVYETKEVDSLDTSTL----LPPDEPSPVSVQNIRFGCS 121
C++ PF C C Y YGD T + D T + S + + + FGC
Sbjct: 146 CRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTFGCG 205
Query: 122 LESKDFVSIQKKIIAGIMGLNW-------------------DSTSFMVQLGRLVPDRFS- 161
+K + IAG W DS S +V LG +S
Sbjct: 206 HFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKSSIVTLGGAPAALYSH 265
Query: 162 --CCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
V+ F + + ++ K +++ + R I D G+ +T + EV
Sbjct: 266 AHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDSGASITTLPEEV 325
Query: 220 YAVLTAEF 227
Y + AEF
Sbjct: 326 YEAVKAEF 333
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 6/86 (6%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++LG+G P +++ +LDT + + W QC PCK+CY Q D I++ + K++ +PC
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRL 194
Query: 68 CKSPFHCFE------GDCFYGITYGD 87
C+ E C Y ++YGD
Sbjct: 195 CRRLDDSSECVTRRSKTCLYQVSYGD 220
>gi|383156234|gb|AFG60356.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156236|gb|AFG60358.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156239|gb|AFG60361.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ G G P K ++DT + LTW QC+PC CY Q DPI+ SYK LPC A+
Sbjct: 71 YIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSLPCLSAT 130
Query: 68 CKSPFH-------CFEGDCFYGI 83
C CF G C Y I
Sbjct: 131 CTELLTSESNLTPCFLGGCSYEI 153
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 141/346 (40%), Gaps = 50/346 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
+++ +G+G P + + DT + L+W QCQPC S C+ Q DP+++ +Y + C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203
Query: 65 DASCKSPFH-CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+ C + C E + C Y + YGD T V S DT L S ++ FGC
Sbjct: 204 EPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALT-----SSRALTGFPFGCG 258
Query: 122 LES-KDFVSIQKKI------------IAGIMGL-------NWDSTSFMVQLGRLVPD--- 158
+ DF + + A G + +ST+ + +G
Sbjct: 259 TRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATPATDTG 318
Query: 159 --RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+++ L +P +E I G L +PP FT + G + D G+VLT +
Sbjct: 319 AAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGGTLLDSGTVLTYLP 373
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHF-QGADLV 272
+ YA+L F + +E+ V C++ P++++ F GA
Sbjct: 374 AQAYALLRDRF-----RLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAVFE 428
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ V IF ++ F T +I+G Q + + +YD+
Sbjct: 429 LDFFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDV 474
>gi|357116104|ref|XP_003559824.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 489
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 94/405 (23%), Positives = 145/405 (35%), Gaps = 96/405 (23%)
Query: 7 TYMLKLGIGDPVKSLWFLL--DTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
TY +++G+G ++ L D V LTW QC P +Q+ PI++ ++ YK +
Sbjct: 73 TYSVRVGVGSGDTQHFYRLAVDMVGNLTWMQCLPSNPKLKQDAPIFDPKTSHRYKNVGHD 132
Query: 65 DASCKSPF--HCFEGDCFYGITY-------------------GDVYETKEVDSL------ 97
D CK+PF E C + I + G T VD L
Sbjct: 133 DPLCKAPFTPRPTEHRCGFNIRFRAEAMATGYLGKDEFAFGAGSGSRTTNVDGLVFGCAH 192
Query: 98 --------DTSTLLPPDEPSPVS-------------VQNIRFGCSLESKDFVSIQKKIIA 136
D +P P S V + FGC+ + + + ++A
Sbjct: 193 RINGWNNKDVLAGIPSLNRRPTSFVRQLSTHGGGGAVDGLVFGCAHAINGWKN--QDVLA 250
Query: 137 GIMGLNWDSTSFMVQL---GRLVPDRFSCCLVQPDK--SFHSRLEFGDQI---------- 181
GI+ LN TSF+ QL G RFS CLV K + H L FG +
Sbjct: 251 GILSLNRRPTSFVRQLSVHGGGTTPRFSYCLVDHKKYPNKHGFLRFGADVPDHSHAQSTA 310
Query: 182 ---------------------IAGKSL-NLPPNSFTIKLNGQ-RGCINDCGSVLTVIECE 218
+AG+ L + P F + GC D G+ T
Sbjct: 311 LLYGEPDGGFGMYYVRLVGVSVAGRKLTGITPKMFQRDRRSRLGGCYVDVGNPTTRFAEA 370
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF---QGADLVVEP 275
Y +L A + + H + + P PS+T HF + A L ++
Sbjct: 371 PYDILEAGVAAHMASHGLHRTPVPGHRLCVRGTSPEVMPKLPSITLHFAEDEAAGLEIKS 430
Query: 276 ENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+F H + + F P T++G Q +T+F +DL+
Sbjct: 431 RLLFATVKHAGADYVCFIVQRAPV--TTVIGGHQQVDTRFTFDLE 473
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 134/344 (38%), Gaps = 51/344 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P KS ++DT + LTW QC PC SC+ Q+ P++N ++ SY + C
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C + C Y +YGD + S DT + SV N +
Sbjct: 187 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSVPNFYY 240
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------VQPDKS 170
GC +++ AG++GL + S + QL + FS CL
Sbjct: 241 GCGQDNEGLFGQS----AGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIG 296
Query: 171 FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC----------------INDCGSVLTV 214
++ ++ +A SL+ + + IK+ G + I D G+V+T
Sbjct: 297 SYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 354
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
+ VY+ L+ F+ TCF A P +T F G +
Sbjct: 355 LPTGVYSALSKAVAGAMKGTPRASAFSILD---TCFQGQAARLRVPEVTMAFAGGAALKL 411
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + + AF P + I+G Q VYD+
Sbjct: 412 AARNLLVDVDSATTCL---AFAPARSAAIIGNTQQQTFSVVYDV 452
>gi|361067981|gb|AEW08302.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156226|gb|AFG60348.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156228|gb|AFG60350.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156229|gb|AFG60351.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156230|gb|AFG60352.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156231|gb|AFG60353.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156232|gb|AFG60354.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156233|gb|AFG60355.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156235|gb|AFG60357.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156237|gb|AFG60359.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156238|gb|AFG60360.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156240|gb|AFG60362.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156241|gb|AFG60363.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ G G P K ++DT + LTW QC+PC CY Q DPI+ SYK LPC A+
Sbjct: 71 YIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSLPCLSAT 130
Query: 68 CKSPFH-------CFEGDCFYGI 83
C CF G C Y I
Sbjct: 131 CTELLTSESNLTPCFLGGCSYEI 153
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 134/344 (38%), Gaps = 51/344 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P KS ++DT + LTW QC PC SC+ Q+ P++N ++ SY + C
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 67 SCK-------SPFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C +P C + C Y +YGD + S DT + SV N +
Sbjct: 187 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSVPNFYY 240
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------VQPDKS 170
GC +++ AG++GL + S + QL + FS CL
Sbjct: 241 GCGQDNEGLFGQS----AGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIG 296
Query: 171 FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC----------------INDCGSVLTV 214
++ ++ +A SL+ + + IK+ G + I D G+V+T
Sbjct: 297 SYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 354
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
+ VY+ L+ F+ TCF A P +T F G +
Sbjct: 355 LPTGVYSALSKAVAGAMKGTPRASAFSILD---TCFQGQAARLRVPEVTMAFAGGAALKL 411
Query: 275 PENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + + AF P + I+G Q VYD+
Sbjct: 412 AARNLLVDVDSATTCL---AFAPARSAAIIGNTQQQTFSVVYDV 452
>gi|383156225|gb|AFG60347.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156227|gb|AFG60349.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ G G P K ++DT + LTW QC+PC CY Q DPI+ SYK LPC A+
Sbjct: 71 YIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSLPCLSAT 130
Query: 68 CKSPFH-------CFEGDCFYGI 83
C CF G C Y I
Sbjct: 131 CTELLTSESNLTPCFLGGCSYEI 153
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 137/343 (39%), Gaps = 47/343 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + DT + L W QC PC CY+Q+ PI++ S+ +PC +
Sbjct: 92 YLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQN 151
Query: 68 CKS--PFHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
CK+ HC +G C Y TYGD TK + T+ SV+++ GC ES
Sbjct: 152 CKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITI------GSSSVKSV-IGCGHES 204
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSFHSRLEFG-DQI 181
+I GL S + Q+ + + RFS CL + ++ FG + +
Sbjct: 205 GGGFGFASGVI----GLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAV 260
Query: 182 IAGKSLNLPP----NSFT--------IKLNGQR--------GCINDCGSVLTVIECEVYA 221
++G + P N T I + +R I D G+ L+ + E+Y
Sbjct: 261 VSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKELYD 320
Query: 222 VLTAEFIDYFSQHDIEKLFT----CRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPE 276
+ + + ++ C G+ A + P +T F GA++ + P
Sbjct: 321 GVVSSLLKVVKAKRVKDPGNFWDLCFDDGINV----ATSSGIPIITAQFSGGANVNLLPV 376
Query: 277 NVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
N F + PA +P I+G N YDL+
Sbjct: 377 NTFQKVANNVNCLTLTPA-SPTDEFGIIGNLALANFLIGYDLE 418
>gi|376337722|gb|AFB33417.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337724|gb|AFB33418.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337726|gb|AFB33419.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337728|gb|AFB33420.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337730|gb|AFB33421.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337732|gb|AFB33422.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
Length = 154
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L G G P K ++DT + LTW QC+PC CY Q DPI++ SYK LPC A+
Sbjct: 71 YILTAGFGTPTKKFLLVIDTGSDLTWIQCKPCLGCYSQVDPIFDPSQSSSYKSLPCLSAT 130
Query: 68 CKSPFH-------CFEGDCFYGI 83
C C G C Y I
Sbjct: 131 CTELLTSESNLTPCLLGGCSYEI 153
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 112/288 (38%), Gaps = 55/288 (19%)
Query: 24 LLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFH---CFE 76
++D+ + ++W QC+PC C+ Q DP+++ +Y +PC A+C P+
Sbjct: 80 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 139
Query: 77 GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIA 136
C +GI YGD S D TL P D ++ RFGC+ D S +A
Sbjct: 140 AQCQFGINYGDGSTATGTYSFDDLTLGPYDV-----IRGFRFGCA--HADRGSAFDYDVA 192
Query: 137 GIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------VQPDK-----SFHS----- 173
G + L S S + Q FS CL V P++ SF S
Sbjct: 193 GSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLS 252
Query: 174 --------RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTA 225
R+ I+AG+ L +PP F+ + D ++++ + Y L A
Sbjct: 253 SSMAPTFYRVLLRAIIVAGRPLAVPPAVFSAS------SVIDSSTIISRLPPTAYQALRA 306
Query: 226 EFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLV 272
F + + + TC++ R + PS+ F G V
Sbjct: 307 AFRSAMTMYRAAPPVSILD---TCYDFTGVRSITLPSIALVFDGGATV 351
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/358 (21%), Positives = 139/358 (38%), Gaps = 61/358 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y+ IG P +++ ++D L WTQC C+S C++Q P+++ + +Y+ C
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 66 ASCKS--PFHC-FEGDCFYGI--TYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
CKS +C +G+C Y +GD + D++ + + FGC
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGDTFGIASTDAIAIGN----------AEGRLAFGC 171
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+ S + +G +GL S +G+ FS CL S L G
Sbjct: 172 VVASDGSIDGAMDGPSGFVGLGRTPWSL---VGQSNVTAFSYCLALHGPGKKSALFLGAS 228
Query: 181 I-IAGKSLNLPPNS-----------------FTIKLNGQR------GCINDCGSVLTVIE 216
+AG + PP +T++L G + + G +TV++
Sbjct: 229 AKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITVLQ 288
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP---------ARFNSFPSMTYHFQ 267
E + L+ ++ + +EK+ T + N P A + P + + FQ
Sbjct: 289 LETFRPLS--YLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQ 346
Query: 268 -GADLVVEPENVFIFNHQDSFFFFFGPAFTPR-----KGKTILGARHQHNTQFVYDLD 319
GA L +P + + + + R G +ILG+ Q N F++DL+
Sbjct: 347 GGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/274 (25%), Positives = 108/274 (39%), Gaps = 56/274 (20%)
Query: 24 LLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFH---CFE 76
++D+ + ++W QC+PC C+ Q DP+++ +Y +PC A+C P+
Sbjct: 171 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 230
Query: 77 GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIA 136
C +GI YGD S D TL P D ++ RFGC+ D S +A
Sbjct: 231 AQCQFGINYGDGSTATGTYSFDDLTLGPYDV-----IRGFRFGCA--HADRGSAFDYDVA 283
Query: 137 GIMGLNWDSTSFMVQLGRLVPDRFSCCL-------------VQPDK-----SFHS----- 173
G + L S S + Q FS CL V P++ SF S
Sbjct: 284 GSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLS 343
Query: 174 --------RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTA 225
R+ I+AG+ L +PP F+ + D ++++ + Y L A
Sbjct: 344 SSMAPTFYRVLLRAIIVAGRPLAVPPAVFSAS------SVIDSSTIISRLPPTAYQALRA 397
Query: 226 EFID----YFSQHDIEKLFTCRK-CGVTCFNLPA 254
F Y + + L TC GV LP+
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPS 431
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 135/356 (37%), Gaps = 54/356 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C C C + + P+ ++ S + +
Sbjct: 68 YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 127
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C D C + C Y YGD T S LL D SV N
Sbjct: 128 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTS---GYYVSDLLNFDAIVGSSVTN 184
Query: 116 ----IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCC------ 163
I FGCS+ ++ + + GI G S + Q+ + P FS C
Sbjct: 185 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 244
Query: 164 ---------LVQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
+V+ D + H L + GKSL + P F N RG I
Sbjct: 245 GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTN--RGTIV 302
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMTYH 265
D G+ L + E Y + + SQ + L + G C+ + + FP+++ +
Sbjct: 303 DSGTTLAYLAEEAYDPFVSAITEAVSQS-VRPLLS---KGTQCYLITSSVKGIFPTVSLN 358
Query: 266 FQGA-DLVVEPENVFIFNHQ--DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F G + ++PE+ + + D+ + G +G TILG + FVYDL
Sbjct: 359 FAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDL 414
>gi|376337718|gb|AFB33415.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337720|gb|AFB33416.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
Length = 154
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+L G G P K ++DT + LTW QC+PC CY Q DPI+ SYK LPC A+
Sbjct: 71 YILTAGFGTPTKKFLLVIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSLPCLSAT 130
Query: 68 CKSPFH-------CFEGDCFYGI 83
C C G C Y I
Sbjct: 131 CTELLTSESNLTPCLLGGCSYEI 153
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 143/354 (40%), Gaps = 49/354 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K + +DT + + W C C C ++D +Y+ ++ + +
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214
Query: 63 CYDASCK---SPF-HCFEG-DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQ 114
C D C P C G C Y + YGD T D + + + + +P +
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTN-G 273
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL-------- 164
+ FGC + + + + GI+G ++S + QL V FS CL
Sbjct: 274 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI 333
Query: 165 ------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
V+P + H + + + G L++P ++F + ++G I D G+
Sbjct: 334 FAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAF--ESGDRKGTIIDSGT 391
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF-QG 268
L EVY L + + SQ +L T + TCF+ + FP++T HF +
Sbjct: 392 TLAYFPQEVYVPLIEKIL---SQQPDLRLHTVEQA-FTCFDYTGNVDDGFPTVTLHFDKS 447
Query: 269 ADLVVEP-ENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
L V P E +F + + + GK T+LG N VYDL+
Sbjct: 448 ISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLE 501
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 143/354 (40%), Gaps = 49/354 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K + +DT + + W C C C ++D +Y+ ++ + +
Sbjct: 74 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 133
Query: 63 CYDASCK---SPF-HCFEG-DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQ 114
C D C P C G C Y + YGD T D + + + + +P +
Sbjct: 134 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTN-G 192
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL-------- 164
+ FGC + + + + GI+G ++S + QL V FS CL
Sbjct: 193 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI 252
Query: 165 ------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
V+P + H + + + G L++P ++F + ++G I D G+
Sbjct: 253 FAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAF--ESGDRKGTIIDSGT 310
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF-QG 268
L EVY L + + SQ +L T + TCF+ + FP++T HF +
Sbjct: 311 TLAYFPQEVYVPLIEKIL---SQQPDLRLHTVEQA-FTCFDYTGNVDDGFPTVTLHFDKS 366
Query: 269 ADLVVEP-ENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
L V P E +F + + + GK T+LG N VYDL+
Sbjct: 367 ISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLE 420
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 142/365 (38%), Gaps = 70/365 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K + +DT + + W C C+ C + +YN + S K +P
Sbjct: 86 YYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVP 145
Query: 63 CYDASC----KSPFHCFEGD--CFYGITYGDVYET-----KEVDSLDTSTLLPPDEPSPV 111
C + C P + C Y YGD T K+V D + D +
Sbjct: 146 CDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDR---VSGDLQTTS 202
Query: 112 SVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL---- 164
S ++ FGC + +S D ++ + GI+G ++S + QL R V F+ CL
Sbjct: 203 SNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGIN 262
Query: 165 ----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
VQP + H + + L+LP F + ++G I
Sbjct: 263 GGGIFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEF--EAGDRKGAII 320
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYH 265
D G+ L + VY L ++ I SQ K+ R TCF + FP++T+H
Sbjct: 321 DSGTTLAYLPEIVYEPLVSKII---SQQPDLKVHIVRD-EYTCFQYSGSVDDGFPNVTFH 376
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFF-----------GPAFTPRKGKTILGARHQHNTQF 314
F+ +VF+ H + F F G R+ T+LG N
Sbjct: 377 FE--------NSVFLKVHPHEYLFPFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLV 428
Query: 315 VYDLD 319
+YDL+
Sbjct: 429 LYDLE 433
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 145/359 (40%), Gaps = 58/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K + +DT + + W C C+ C + + +YN + K +P
Sbjct: 78 YYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVP 137
Query: 63 CYDASC------KSPFHCFEGDCFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
C C + P C Y YGD T V + + D + +
Sbjct: 138 CDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANG 197
Query: 115 NIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL------- 164
++ FGC + +S D S ++ + GI+G ++S + QL V F+ CL
Sbjct: 198 SVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGG 257
Query: 165 -------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
VQP + H + + + L+LP + F + ++G I D G
Sbjct: 258 IFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVF--EAGDRKGAIIDSG 315
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQG 268
+ L + VY L ++ I SQ K+ T R TCF + FP++T+HF+
Sbjct: 316 TTLAYLPEMVYKPLVSKII---SQQPDLKVHTVRD-EYTCFQYSDSLDDGFPNVTFHFEN 371
Query: 269 ADLV-VEP-------ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ ++ V P E ++ Q+S G R+ T+LG N +YDL+
Sbjct: 372 SVILKVYPHEYLFPFEGLWCIGWQNS-----GVQSRDRRNMTLLGDLVLSNKLVLYDLE 425
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/342 (22%), Positives = 127/342 (37%), Gaps = 47/342 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++L +G P + +DT + L WTQC PC +CY Q PI++ ++K+ C+ S
Sbjct: 61 YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEKRCHGNS 120
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C Y I Y D + + + +T T + P + GC L + +
Sbjct: 121 CP-----------YEIIYADESYSTGILATETVT-IQSTSGEPFVMAETSIGCGLNNSNL 168
Query: 128 VSIQ-KKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKS 186
++ +GI+GLN +S + Q+ +P S C S++ FG +
Sbjct: 169 MTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCF---SSQGTSKINFGTNAVVAGD 225
Query: 187 LNLPPNSFTIK------LNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLF 240
+ + F K LN + D + + +A FID + +
Sbjct: 226 GTVAADMFIKKDQPFYYLNLDAVSVGD--KRIETLGTPFHAQDGNIFIDSGTTYTYLPTS 283
Query: 241 TCRKC--------------------GVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVF 279
C + C+N FP +T HF GADLV++ N++
Sbjct: 284 YCNLVREAVAASVVAANQVPDPSSENLLCYNW-DTMEIFPVITLHFAGGADLVLDKYNMY 342
Query: 280 IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
+ F P I G R +N YD T
Sbjct: 343 VETITGGTFCLAIGCVDPSM-PAIFGNRAHNNLLVGYDSSTL 383
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 144/359 (40%), Gaps = 59/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C C+ C ++ +Y+ ++ + +
Sbjct: 86 YYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVM 145
Query: 63 CYDASCKSPF-----HCFEG-DCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSV 113
C A C + F C C Y +TYGD T D+L + + P +
Sbjct: 146 CDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANA 205
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL------- 164
I FGC + + + + GI+G +TS + QL V F+ CL
Sbjct: 206 SVI-FGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGG 264
Query: 165 -------VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
VQP DK H + + G +L LP + F + ++G I D
Sbjct: 265 IFSIGDVVQPKVKTTPLVADKP-HYNVNLKTIDVGGTTLQLPAHIF--EPGEKKGTIIDS 321
Query: 209 GSVLTVIECEVYA-VLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTY 264
G+ LT + V+ V+ A F + + HD++ G CF P + FP++T+
Sbjct: 322 GTTLTYLPELVFKEVMLAVFNKHQDITFHDVQ--------GFLCFQYPGSVDDGFPTITF 373
Query: 265 HFQ-GADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTI--LGARHQHNTQFVYDLD 319
HF+ L V P F N D + F A + GK I +G N +YDL+
Sbjct: 374 HFEDDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLE 432
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 135/356 (37%), Gaps = 54/356 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C C C + + P+ ++ S + +
Sbjct: 83 YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 142
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C D C + C Y YGD T S LL D SV N
Sbjct: 143 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTS---GYYVSDLLNFDAIVGSSVTN 199
Query: 116 ----IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCC------ 163
I FGCS+ ++ + + GI G S + Q+ + P FS C
Sbjct: 200 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 259
Query: 164 ---------LVQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
+V+ D + H L + GKSL + P F N RG I
Sbjct: 260 GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTN--RGTIV 317
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMTYH 265
D G+ L + E Y + + SQ + L + G C+ + + FP+++ +
Sbjct: 318 DSGTTLAYLAEEAYDPFVSAITEAVSQ-SVRPLLS---KGTQCYLITSSVKGIFPTVSLN 373
Query: 266 FQGA-DLVVEPENVFIFNHQ--DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F G + ++PE+ + + D+ + G +G TILG + FVYDL
Sbjct: 374 FAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDL 429
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 140/346 (40%), Gaps = 51/346 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N TY+++ IG P +++ +DT + + W C C C + ++NS + +YK L C
Sbjct: 98 NPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQ 154
Query: 65 DASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-- 120
A CK C G C + +TYG + S DT TL + +V FGC
Sbjct: 155 AAQCKQVPKPTCGGGVCSFNLTYGGSSLAANL-SQDTITL------ATDAVPGYSFGCIQ 207
Query: 121 -----SLESKD------------------FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVP 157
SL ++ + S + LN+ + + +G+
Sbjct: 208 KATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR 267
Query: 158 DRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+++ L P + + + + +++PP SFT + G I D G+V T +
Sbjct: 268 IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVT 327
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTYHFQGADLVVEP 275
Y + F ++ + + T G TC+ +P + P++T+ F G ++ + P
Sbjct: 328 PAYIAVRDAF-----RNRVGRNLTVTSLGGFDTCYTVPI---AAPTITFMFTGMNVTLPP 379
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
+N+ I + S A P ++L Q N + +YD+
Sbjct: 380 DNLLIHSTAGS-TTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDV 424
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 109/257 (42%), Gaps = 41/257 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P K + DT + +TWTQC+PC K+CY+Q +P N + SYK + C A
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 178
Query: 67 SCK-----SPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK F C C Y + YGD + + +T TL S +N FG
Sbjct: 179 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL-----SSSNVFKNFLFG 233
Query: 120 CSLESK---------------------DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD 158
C ++ KK+ + + + S ++ G++
Sbjct: 234 CGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKS 293
Query: 159 -RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+F+ D + L+ + G+ L++ ++F+ G + D G+V+T +
Sbjct: 294 VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSA------GTVIDSGTVITRLSP 347
Query: 218 EVYAVLTAEFIDYFSQH 234
Y+ L++ F + + +
Sbjct: 348 TAYSELSSAFQNLMTDY 364
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 109/257 (42%), Gaps = 41/257 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P K + DT + +TWTQC+PC K+CY+Q +P N + SYK + C A
Sbjct: 71 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 130
Query: 67 SCK-----SPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK F C C Y + YGD + + +T TL S +N FG
Sbjct: 131 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL-----SSSNVFKNFLFG 185
Query: 120 CSLESK---------------------DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD 158
C ++ KK+ + + + S ++ G++
Sbjct: 186 CGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKS 245
Query: 159 -RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+F+ D + L+ + G+ L++ ++F+ G + D G+V+T +
Sbjct: 246 VKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSA------GTVIDSGTVITRLSP 299
Query: 218 EVYAVLTAEFIDYFSQH 234
Y+ L++ F + + +
Sbjct: 300 TAYSELSSAFQNLMTDY 316
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 140/346 (40%), Gaps = 51/346 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N TY+++ IG P +++ +DT + + W C C C + ++NS + +YK L C
Sbjct: 33 NPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQ 89
Query: 65 DASCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-- 120
A CK C G C + +TYG + S DT TL + +V FGC
Sbjct: 90 AAQCKQVPKPTCGGGVCSFNLTYGGSSLAANL-SQDTITL------ATDAVPGYSFGCIQ 142
Query: 121 -----SLESKD------------------FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVP 157
SL ++ + S + LN+ + + +G+
Sbjct: 143 KATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR 202
Query: 158 DRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+++ L P + + + + +++PP SFT + G I D G+V T +
Sbjct: 203 IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVT 262
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTYHFQGADLVVEP 275
Y + F ++ + + T G TC+ +P + P++T+ F G ++ + P
Sbjct: 263 PAYIAVRDAF-----RNRVGRNLTVTSLGGFDTCYTVPI---AAPTITFMFTGMNVTLPP 314
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
+N+ I + S A P ++L Q N + +YD+
Sbjct: 315 DNLLIHSTAGS-TTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDV 359
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/354 (23%), Positives = 139/354 (39%), Gaps = 51/354 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K + +DT + + W C PC C + D +Y+S++ + K +
Sbjct: 78 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVG 137
Query: 63 CYDASC----KSPFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSVQN 115
C D C +S + C Y + YGD + D++ + +P++ Q
Sbjct: 138 CEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA-QE 196
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------- 164
+ FGC + + GIMG +TS + QL G FS CL
Sbjct: 197 VVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIF 256
Query: 165 --------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ P++ H + + G ++LPP+ NG G I D G+
Sbjct: 257 AVGEVESPVVKTTPIVPNQ-VHYNVILKGMDVDGDPIDLPPS--LASTNGDGGTIIDSGT 313
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA 269
L + +Y L I+ + KL ++ CF+ + + +FP + HF+ +
Sbjct: 314 TLAYLPQNLYNSL----IEKITAKQQVKLHMVQET-FACFSFTSNTDKAFPVVNLHFEDS 368
Query: 270 -DLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTI--LGARHQHNTQFVYDLD 319
L V P + +D + F + T + G + LG N VYDL+
Sbjct: 369 LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLE 422
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/358 (20%), Positives = 138/358 (38%), Gaps = 61/358 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y+ IG P +++ ++D L WTQC C+S C++Q P+++ + +Y+ C
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 66 ASCKS--PFHC-FEGDCFYGI--TYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
CKS +C +G+C Y +GD + D++ + + FGC
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGDTFGIASTDAIAIGN----------AEGRLAFGC 171
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+ S + +G +GL S +G+ FS CL S L G
Sbjct: 172 VVASDGSIDGAMDGPSGFVGLGRTPWSL---VGQSNVTAFSYCLAPHGPGKKSALFLGAS 228
Query: 181 I-IAGKSLNLPPNS-----------------FTIKLNGQR------GCINDCGSVLTVIE 216
+AG + PP +T++L G + + G +T+++
Sbjct: 229 AKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITILQ 288
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP---------ARFNSFPSMTYHFQ 267
E + L+ ++ + +EK+ T + N P A + P + + FQ
Sbjct: 289 LETFRPLS--YLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQ 346
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRK------GKTILGARHQHNTQFVYDLD 319
G + P + ++ + + + G +ILG+ Q N F++DL+
Sbjct: 347 GGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/374 (22%), Positives = 147/374 (39%), Gaps = 70/374 (18%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F N + + L +G P +++ +LDT + L+W C+ + + ++N S K+Y K
Sbjct: 62 LFHHNVSLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKTQFL----NSVFNPLSSKTYSK 117
Query: 61 LPCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
+PC +CK+ P C C ++Y D + + +T L +P+ +
Sbjct: 118 VPCLSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATI- 176
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
FGC S + G++G+N S SF+ Q+G +FS C+ D +
Sbjct: 177 -----FGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGY---PKFSYCISGFDSAGV 228
Query: 171 ---------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLN 199
+ R+ + Q+ + K L+LP + F
Sbjct: 229 LLLGNASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHT 288
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL-----FTCRKCGVTCFNLPA 254
G + D G+ T + VY L EF+ I K+ F + C+ L +
Sbjct: 289 GAGQTMVDSGTQFTFLLGPVYTALKNEFLS--QTRGILKVLNDDNFVFQGAMDLCYLLDS 346
Query: 255 ---RFNSFPSMTYHFQGADLVVEPENVFI-----FNHQDSFF-FFFGPAFTPRKGKTILG 305
+ P ++ FQGA++ V E + +DS + F FG + ++G
Sbjct: 347 SRPNLQNLPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIG 406
Query: 306 ARHQHNTQFVYDLD 319
HQ N +DL+
Sbjct: 407 HHHQQNVWMEFDLE 420
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/354 (23%), Positives = 139/354 (39%), Gaps = 51/354 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K + +DT + + W C PC C + D +Y+S++ + K +
Sbjct: 74 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVG 133
Query: 63 CYDASC----KSPFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSVQN 115
C D C +S + C Y + YGD + D++ + +P++ Q
Sbjct: 134 CEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA-QE 192
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------- 164
+ FGC + + GIMG +TS + QL G FS CL
Sbjct: 193 VVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIF 252
Query: 165 --------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ P++ H + + G ++LPP+ NG G I D G+
Sbjct: 253 AVGEVESPVVKTTPIVPNQ-VHYNVILKGMDVDGDPIDLPPS--LASTNGDGGTIIDSGT 309
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGA 269
L + +Y L I+ + KL ++ CF+ + + +FP + HF+ +
Sbjct: 310 TLAYLPQNLYNSL----IEKITAKQQVKLHMVQET-FACFSFTSNTDKAFPVVNLHFEDS 364
Query: 270 -DLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTI--LGARHQHNTQFVYDLD 319
L V P + +D + F + T + G + LG N VYDL+
Sbjct: 365 LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLE 418
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 131/350 (37%), Gaps = 76/350 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P K +LDT + L W QC PC C++QND + S
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND-----------------NQS 212
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C P++ + GD T GD V++ + +V+N+ FGC ++
Sbjct: 213 C--PYYYWYGDS--SNTTGDF----AVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGL 264
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--QPDKSFHSRLEFGDQ----- 180
++ SF QL L FS CLV D + S+L FG+
Sbjct: 265 FHGAAGLLGLGR----GPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLS 320
Query: 181 ----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
++AG+ LN+P ++ I +G G I D G+ L
Sbjct: 321 HPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTL 380
Query: 213 TVIECEVYAVLTAEFI-DYFSQHDIEKLFTCRKCGVT--CFNLPARFN-SFPSMTYHF-Q 267
+ Y EFI + ++ K R + CFN+ N P + F
Sbjct: 381 SYFAEPAY-----EFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFAD 435
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
GA EN FI+ ++D TP+ +I+G Q N +YD
Sbjct: 436 GAVWNFPTENSFIWLNED--LVCLAMLGTPKSAFSIIGNYQQQNFHILYD 483
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 109/257 (42%), Gaps = 41/257 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ +G+G P K + DT + +TWTQC+PC K+CY+Q +P N + SYK + C A
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 190
Query: 67 SCK-----SPF--HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK F C C Y + YGD + + +T TL S +N FG
Sbjct: 191 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL-----SSSNVFKNFLFG 245
Query: 120 CSLESK---------------------DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD 158
C ++ KK+ + + + S ++ G++
Sbjct: 246 CGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKS 305
Query: 159 -RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+F+ D + L+ + G+ L++ ++F+ G + D G+V+T +
Sbjct: 306 VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSA------GTVIDSGTVITRLSP 359
Query: 218 EVYAVLTAEFIDYFSQH 234
Y+ L++ F + + +
Sbjct: 360 TAYSELSSAFQNLMTDY 376
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 74/176 (42%), Gaps = 19/176 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + L IG P + L DT + L WTQC PC C + P + S ++ KLPC +
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 68 CK---SPFH-CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ SP+ C C Y YG + L T TL S + FGCS E
Sbjct: 150 CQFLTSPYRTCNATGCVYYYPYGMGFTAGY---LATETL----HVGGASFPGVTFGCSTE 202
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
+ +GI+GL S + Q+G RFS CL + S + FG
Sbjct: 203 NG-----VGNSSSGIVGLGRSPLSLVSQVGV---ARFSYCLRSNADAGDSPILFGS 250
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 127/346 (36%), Gaps = 65/346 (18%)
Query: 15 GDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCK--S 70
G S ++D+ + + W QCQPC C+ Q DP+++ + +Y +PC A+C
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 71 PFH--CFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
P+ C C +GITY + S D TL P D V+ FGC+ D
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDV-----VRGFLFGCA--HADQ 187
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG--------- 178
S +AG + L S SF+ Q FS C V P S + FG
Sbjct: 188 GSTFSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGVPPQRAALV 246
Query: 179 ------------------------DQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
I+AG+ L +PP F+ + D +V++
Sbjct: 247 PTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSAS------SVIDSATVISR 300
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQGADLV- 272
I Y L A F + + + TC++ R + PS+ F G V
Sbjct: 301 IPPTAYQALRAAFRSAMTMYRPAPPVSILD---TCYDFSGVRSITLPSIALVFDGGATVN 357
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ + + F P + R +G Q + VYD+
Sbjct: 358 LDAAGILLQG-----CLAFAPTASDRM-PGFIGNVQQRTLEVVYDV 397
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 132/360 (36%), Gaps = 67/360 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
+T Y+ IG P + ++D L WTQC+ C C+EQ+ P+++ + +Y+
Sbjct: 45 WTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAE 104
Query: 62 PCYDASCKS----PFHCFEGDCFY--GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
PC C+S +C C Y GD D+ T + +
Sbjct: 105 PCGTPLCESIPSDSRNCSGNVCAYQASTNAGDTGGKVGTDTFAVGT----------AKAS 154
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
+ FGC + S D ++ +GI+GL S + Q G FS CL D +S L
Sbjct: 155 LAFGCVVAS-DIDTMGGP--SGIVGLGRTPWSLVTQTGVAA---FSYCLAPHDAGRNSAL 208
Query: 176 EFGDQII---AGKSLNLP-----------PNSFTIKLNGQRGCINDCGSVLTVIECEVYA 221
G GK+ + P N + ++L G + G + +
Sbjct: 209 FLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLK-----AGDAMIPLPPSGST 263
Query: 222 VLTAEFIDYFSQHD--IEKLFTCRKCGVT-----------------CFNLPARFNSFPSM 262
VL +D FS ++ + K VT CF + P +
Sbjct: 264 VL----LDTFSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDL 319
Query: 263 TYHFQGADLVVEPENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ F+G + P ++ ++++ ++LG+ Q N F++DLD
Sbjct: 320 VFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLD 379
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 89/219 (40%), Gaps = 44/219 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +LG+G P K ++ +LDT + + W QC PC+ CY Q DP+++ + S+ + C
Sbjct: 174 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPL 233
Query: 68 C---KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C SP C Y + YGD T S +T T V + GC ++
Sbjct: 234 CLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTF------RGTRVPKVALGCGHDN 287
Query: 125 KD-FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIA 183
+ FV AG++G LGR QP +R G +A
Sbjct: 288 EGLFVG-----AAGLLG-----------LGR-----------QPR---LNRPPVGGARVA 317
Query: 184 GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAV 222
G + +L F + G G I D G+ +T + Y
Sbjct: 318 GITASL----FKLDTAGNGGVIIDSGTSVTRLTRRAYGT 352
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 137/349 (39%), Gaps = 62/349 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P + +DT + ++W QC+PC C+ + D +++ S +Y C A
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAP 181
Query: 68 CKSPFHCFEGD------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
C EG+ C Y + YGD T S DT TL ++ + +FGCS
Sbjct: 182 CAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTL------GSSAMTDFQFGCS 235
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG--- 178
+ Q G+MGL + S Q FS CL P L G
Sbjct: 236 QSESGGFNDQTD---GLMGLGGGAQSLASQTAGTFGTAFSYCL-PPTSGSSGFLTLGTGS 291
Query: 179 -----------DQI------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
QI + + LNLP + F+ G + D G+++T +
Sbjct: 292 SGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFS------AGSLMDSGTIITRL 345
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGADLV 272
Y+ L++ F + +++ G+ TCF+ + + S P++T F G V
Sbjct: 346 PPTAYSALSSAF-----KAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAV 400
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQFVYDL 318
+ + S AFTP + I+G Q + +YD+
Sbjct: 401 DLAFDGIMLEISSSIRCL---AFTPNGDDSSLGIIGNVQQRTFEVLYDV 446
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/360 (21%), Positives = 141/360 (39%), Gaps = 65/360 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N TY+++ IG P +++ +DT + + W C C C + ++NS + +YK L C
Sbjct: 98 NPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQ 154
Query: 65 DASCKSPFH----------------CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEP 108
A CK H C G C + +TYG + S DT TL
Sbjct: 155 AAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANL-SQDTITL------ 207
Query: 109 SPVSVQNIRFGC-------SLESKD------------------FVSIQKKIIAGIMGLNW 143
+ +V FGC SL ++ + S + LN+
Sbjct: 208 ATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF 267
Query: 144 DSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
+ + +G+ +++ L P + + + + +++PP SFT + G
Sbjct: 268 SGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAG 327
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPS 261
I D G+V T + Y + F ++ + + T G TC+ +P + P+
Sbjct: 328 TIFDSGTVFTRLVTPAYIAVRDAF-----RNRVGRNLTVTSLGGFDTCYTVPI---AAPT 379
Query: 262 MTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
+T+ F G ++ + P+N+ I + S A P ++L Q N + +YD+
Sbjct: 380 ITFMFTGMNVTLPPDNLLIHSTAGS-TTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDV 438
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 140/379 (36%), Gaps = 81/379 (21%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N T + L +GDP +++ +LDT + L+W C+ + ++N S +Y +
Sbjct: 59 FRHNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPV 114
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
PC C++ P C C I+Y D + + +T + P +
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTL- 173
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
FGC S + G+MG+N S SF+ QLG +FS C+ D S
Sbjct: 174 -----FGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGF---SKFSYCISGSDSSVF 225
Query: 171 ---------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLN 199
+ R+ + Q+ + K L+LP + F
Sbjct: 226 LLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHT 285
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFI-------------DYFSQHDIEKLFTCRKCG 246
G + D G+ T + VY L EFI D+ Q ++ C K G
Sbjct: 286 GAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMD---LCYKVG 342
Query: 247 VTCFNLPARFNSFPSMTYHFQGADLVVEPENVFIF-------NHQDSFFFFFGPAFTPRK 299
T F+ P ++ F+GA++ V + + ++ + F FG +
Sbjct: 343 STT---RPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGI 399
Query: 300 GKTILGARHQHNTQFVYDL 318
++G HQ N +DL
Sbjct: 400 EAFVIGHHHQQNVWMEFDL 418
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 144/357 (40%), Gaps = 62/357 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN--DPIYNSRSFKSYKKLPCYD 65
+++ +G P ++DT + L W QCQPCK C + P++N ++ + C D
Sbjct: 96 FLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDD 155
Query: 66 ASCK-SP-FHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C+ +P HC + C Y Y +K V + + T P+ + V+ Q I FGC
Sbjct: 156 RFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVT-QPIAFGCGY 214
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSF-HSRLEFGDQ 180
E+ + + GI+GL TS VQLG +FS C+ +K++ +++L G+
Sbjct: 215 ENGEQLESH---FTGILGLGAKPTSLAVQLG----SKFSYCIGDLANKNYGYNQLVLGED 267
Query: 181 I--------------------------IAGKSLNLPPNSFTIKLNGQR-GCINDCGSVLT 213
+ LN+ P F K G R G I D G++ T
Sbjct: 268 ADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVF--KRRGPRTGVILDSGTLYT 325
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPARFNSFPSMTYHFQ-G 268
+ Y L E I +E+ F C V+ FP +T+HF G
Sbjct: 326 WLADIAYRELYNE-IKSILDPKLERFWFRDFLCYHGRVS-----EELIGFPVVTFHFAGG 379
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-------TILGARHQHNTQFVYDL 318
A+L +E ++F + + F F + P K T +G Q YDL
Sbjct: 380 AELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDL 436
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 134/355 (37%), Gaps = 58/355 (16%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
+N Y +L IG P + ++DT + +T+ C C+ C DP + ++Y+ + C
Sbjct: 85 INGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC 144
Query: 64 Y-DASCKSPFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
D +C +GD C Y Y ++ + V D + E +P Q FG
Sbjct: 145 TPDCNC-------DGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAP---QRAVFG 194
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL------------- 164
C E+ + + + GIMGL S M QL +++ D FS C
Sbjct: 195 C--ENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILG 252
Query: 165 ------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
PD+S + + + +AGK L L P F +G+ G + D G+
Sbjct: 253 GISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVF----DGKHGTVLDSGTTY 308
Query: 213 TVI-ECEVYAVLTAEFIDYFSQHDIE------KLFTCRKCGVTCFNLPARFNSFPSMTYH 265
+ E A A + S I K G+ L SFP +
Sbjct: 309 AYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLA---KSFPVVDMV 365
Query: 266 FQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F+ G L + PEN + + + G R T+LG NT +YD +
Sbjct: 366 FENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRE 420
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 137/357 (38%), Gaps = 66/357 (18%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
LN Y +L IG P + ++DT + +T+ C C+ C DP ++ S +YK + C
Sbjct: 79 LNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC 138
Query: 64 -YDASCKSPFHCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC- 120
D C S +G C Y Y ++ + V D + E P Q FGC
Sbjct: 139 NIDCICDS-----DGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIP---QRAVFGCE 190
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCC--------------- 163
++E+ D S + GIMGL S + QL + D FS C
Sbjct: 191 NMETGDLFSQRAD---GIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGG 247
Query: 164 ----------LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
P +S + ++ + +AGK L L F +G+ G + D G+
Sbjct: 248 ISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGT--- 300
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPS 261
YA L AE F ++++ + +K CF + N FP+
Sbjct: 301 -----TYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ F+ G L + PEN F + + + G T+LG NT +YD
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYD 412
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 137/357 (38%), Gaps = 66/357 (18%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
LN Y +L IG P + ++DT + +T+ C C+ C DP ++ S +YK + C
Sbjct: 79 LNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC 138
Query: 64 -YDASCKSPFHCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC- 120
D C S +G C Y Y ++ + V D + E P Q FGC
Sbjct: 139 NIDCICDS-----DGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIP---QRAVFGCE 190
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCC--------------- 163
++E+ D S + GIMGL S + QL + D FS C
Sbjct: 191 NMETGDLFSQRAD---GIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGG 247
Query: 164 ----------LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
P +S + ++ + +AGK L L F +G+ G + D G+
Sbjct: 248 ISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGT--- 300
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPS 261
YA L AE F ++++ + +K CF + N FP+
Sbjct: 301 -----TYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
+ F+ G L + PEN F + + + G T+LG NT +YD
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYD 412
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 85/356 (23%), Positives = 138/356 (38%), Gaps = 66/356 (18%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYN---SRSFKSYKKLP 62
T M + IG P ++DT + + W C PC +C +++ S +F K P
Sbjct: 99 RTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCKTP 158
Query: 63 CYDASCKS----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C C PF +TY D + DT DE + + ++ F
Sbjct: 159 CDFKGCSRCDPIPFT---------VTYADNSTASGMFGRDTVVFETTDEGTS-RIPDVLF 208
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPDKSFHSRL 175
GC + GI+GLN S ++G+ +FS C L P ++H +L
Sbjct: 209 GC---GHNIGQDTDPGHNGILGLNNGPDSLATKIGQ----KFSYCIGDLADPYYNYH-QL 260
Query: 176 EFGDQI------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
G+ + K L++ P +F +K N G I D GS
Sbjct: 261 ILGEGADLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGST 320
Query: 212 LTVIECEVYAVLTAEFIDY----FSQHDIEKLFTCRKCGVTCF--NLPARFNSFPSMTYH 265
+T + V+ +L+ E + F Q IEK + CF ++ FP +T+H
Sbjct: 321 ITFLVDSVHRLLSKEVRNLLGWSFRQTTIEK-----SPWMQCFYGSISRDLVGFPVVTFH 375
Query: 266 F-QGADLVVEPENVFIFNHQDSFFFFFGP--AFTPRKGKTILGARHQHNTQFVYDL 318
F GADL ++ + F + + F GP + + +++G Q + YDL
Sbjct: 376 FADGADLALDSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDL 431
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 140/379 (36%), Gaps = 81/379 (21%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N T + L +GDP +++ +LDT + L+W C+ + ++N S +Y +
Sbjct: 59 FRHNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPV 114
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
PC C++ P C C I+Y D + + +T + P +
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTL- 173
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
FGC S + G+MG+N S SF+ QLG +FS C+ D S
Sbjct: 174 -----FGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGF---SKFSYCISGSDSSGF 225
Query: 171 ---------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLN 199
+ R+ + Q+ + K L+LP + F
Sbjct: 226 LLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHT 285
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFI-------------DYFSQHDIEKLFTCRKCG 246
G + D G+ T + VY L EFI D+ Q ++ C K G
Sbjct: 286 GAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMD---LCYKVG 342
Query: 247 VTCFNLPARFNSFPSMTYHFQGADLVVEPENVFIF-------NHQDSFFFFFGPAFTPRK 299
T F+ P ++ F+GA++ V + + ++ + F FG +
Sbjct: 343 STT---RPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGI 399
Query: 300 GKTILGARHQHNTQFVYDL 318
++G HQ N +DL
Sbjct: 400 EAFVIGHHHQQNVWMEFDL 418
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/374 (22%), Positives = 149/374 (39%), Gaps = 70/374 (18%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN-DPIYNSRSFKSYK 59
+F N T + L G P++++ +LDT + L+W C+ E N + I+N + K+Y
Sbjct: 60 LFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKK-----EPNFNSIFNPLASKTYT 114
Query: 60 KLPCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
K+PC +C++ P C C + I+Y D + + +T + P+ V
Sbjct: 115 KIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATV 174
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS- 170
FGC S + G+MG+N S SF+ Q+G +FS C+ D S
Sbjct: 175 ------FGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGF---RKFSYCISDRDSSG 225
Query: 171 ----------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKL 198
+ R+ + Q+ ++ K L+LP + F
Sbjct: 226 VLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDH 285
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-----P 253
G + D G+ T + VY+ L EF+ ++ + L R +L P
Sbjct: 286 TGAGQTMVDSGTQFTFLLGPVYSALKQEFL-LQTKGVLRVLNEPRYVFQGAMDLCYLIEP 344
Query: 254 AR--FNSFPSMTYHFQGADLVVEPENVFI-----FNHQDSFF-FFFGPAFTPRKGKTILG 305
R + P + F+GA++ V + + +DS + F FG + + ++G
Sbjct: 345 TRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIG 404
Query: 306 ARHQHNTQFVYDLD 319
Q N YDL+
Sbjct: 405 HHQQQNVWMEYDLE 418
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/373 (23%), Positives = 130/373 (34%), Gaps = 78/373 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN-DPIYNSRSFKSYKKLPCYDA 66
Y + L IG P ++L + DT + L W +C PC++C ++ + +R +Y + CY
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSP 145
Query: 67 SCK-----SPFHC----FEGDCFYGITYGDVYET-----KEVDSLDTSTLLPPDEPSPVS 112
C+ P C C Y TY D T KE +L+TST
Sbjct: 146 QCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTST------GKVKK 199
Query: 113 VQNIRFGC-------SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
+ + FGC SL F Q G+MGL SF QLGR +FS CL+
Sbjct: 200 LNGLSFGCGFRISGPSLTGASFEGAQ-----GVMGLGRAPISFSSQLGRRFGSKFSYCLM 254
Query: 166 Q-----PDKSFHS------------------------------RLEFGDQIIAGKSLNLP 190
P SF + + + G L +
Sbjct: 255 DYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPIN 314
Query: 191 PNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGV 247
P+ ++I G G I D G+ LT I Y + F + S + F
Sbjct: 315 PSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDL----- 369
Query: 248 TCFNLPARFN-SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGA 306
C N+ + P M+++ G + P + D + G ++LG
Sbjct: 370 -CMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGN 428
Query: 307 RHQHNTQFVYDLD 319
Q +D D
Sbjct: 429 LMQQGFLLEFDRD 441
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 130/325 (40%), Gaps = 59/325 (18%)
Query: 31 LTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFHCFEGDCFYGITYGDVYE 90
+TWTQC+PC C + + ++ + +Y C ++ + Y +TYGD
Sbjct: 98 ITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPSTVGN---------TYNMTYGDKST 148
Query: 91 TKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK-DFVSIQKKIIAGIMGLNWDSTSFM 149
+ DT TL EPS V +FGC ++ DF S G++GL S +
Sbjct: 149 SVGNYGCDTMTL----EPSDV-FPKFQFGCGRNNEGDFGSGAD----GMLGLGQGQLSTV 199
Query: 150 VQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSL------NLPPNS---------- 193
Q FS CL P++ L FG++ + SL N P S
Sbjct: 200 SQTASKFKKVFSYCL--PEEDSIGSLLFGEKATSQSSLKFTSLVNGPGTSGLEESGYYFV 257
Query: 194 --FTIKLNGQR-----------GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLF 240
I + +R G I D G+V+T + Y+ LTA F +++ +
Sbjct: 258 KLLDISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSN-- 315
Query: 241 TCRKCG---VTCFNLPARFNS-FPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFG--P 293
RK G TC+NL R + P + HF +GAD+ + + V N F
Sbjct: 316 GRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDASRLCLAFAGNS 375
Query: 294 AFTPRKGKTILGARHQHNTQFVYDL 318
T TI+G R Q + +YD+
Sbjct: 376 KSTMNSELTIIGNRQQVSLTVLYDI 400
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 134/337 (39%), Gaps = 47/337 (13%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--P 71
IG P + DT + LTW QC PC CY+Q PI+N S+ +PC +C +
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD 145
Query: 72 FHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSI 130
HC +G C Y TYGD +K + T+ SV+++ GC S
Sbjct: 146 GHCGVQGVCDYSYTYGDRTYSKGDLGFEKITI------GSSSVKSV-IGCGHASSGGFGF 198
Query: 131 QKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSFHSRLEFG-DQIIAGKSL 187
+G++GL S + Q+ + + RFS CL + ++ FG + +++G +
Sbjct: 199 A----SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGV 254
Query: 188 NLPP----NSFT--------IKLNGQR--------GCINDCGSVLTVIECEVYAVLTAEF 227
P N+ T I + +R I D G+ L+ + E+Y + +
Sbjct: 255 VSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSL 314
Query: 228 IDYFSQHDIEKLFT----CRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVFIFN 282
+ ++ C G+ A + P +T F GA++ + P N F
Sbjct: 315 LKVVKAKRVKDPGNFWDLCFDDGINV----ATSSGIPIITAQFSGGANVNLLPVNTFQKV 370
Query: 283 HQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ PA +P I+G N YDL+
Sbjct: 371 ANNVNCLTLTPA-SPTDEFGIIGNLALANFLIGYDLE 406
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 109/268 (40%), Gaps = 42/268 (15%)
Query: 13 GIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASC-- 68
I DP+ + +DT L W QC PC CY Q + +++ R ++ +PC A+C
Sbjct: 156 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 215
Query: 69 --KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
+ C C Y + YGD T +D TL PS V V N RFGCS
Sbjct: 216 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL----NPSTV-VMNFRFGCSH---- 266
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKS 186
+++ A G + T +V+ ++P + L +E G G+
Sbjct: 267 --AVRGNFSASTSGTMFARTP-LVRNPSIIPTLYLVRL--------RGIEVG-----GRR 310
Query: 187 LNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG 246
LN+PP F G + D ++T + Y L F + + ++ R
Sbjct: 311 LNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY--PRVAGGRAGL 362
Query: 247 VTCFNLPARFNS--FPSMTYHFQGADLV 272
TC++ RF S P+++ F G +V
Sbjct: 363 DTCYDF-VRFTSVTVPAVSLVFDGGAVV 389
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 83/357 (23%), Positives = 130/357 (36%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC---------YEQNDPIYNSRSFKSY 58
Y ++ IG P K + +DT + + W C C C Q DP + +
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGTTVGCE 143
Query: 59 KKLPCYDASCKSPFHC--FEGDCFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
++ +++ P C C + ITYGD T V + + + S
Sbjct: 144 QEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNA 203
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL-------- 164
+I FGC + + + + GI+G +S + QL R V F+ CL
Sbjct: 204 SITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGI 263
Query: 165 ------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
VQP H + + G +L LP ++F +G I D G+
Sbjct: 264 FAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTF--DSGDSKGTIIDSGT 321
Query: 211 VLTVIECEVYAVLTAEFIDYFSQ---HDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF 266
L + EVY L A D + H+ + CF + FP +T+ F
Sbjct: 322 TLAYLPREVYRTLLAAVFDKYQDLPLHNYQDF--------VCFQFSGSIDDGFPVITFSF 373
Query: 267 QG-ADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKT--ILGARHQHNTQFVYDLD 319
+G L V P++ N D + F + GK +LG N VYDL+
Sbjct: 374 EGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLE 430
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 137/374 (36%), Gaps = 81/374 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PIYNSRSFKSYKKLPCY 64
Y ++L +G P K ++DT + LTW QC P + + P Y+ S SY+++PC
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 86
Query: 65 DASC-------------KSPFHCFEGDCFYGITYGD--------VYETKEVDSLDTSTLL 103
D C KSP C D YG Y D YET + S S
Sbjct: 87 DDECLFLPAPIGSSCSIKSPSPC---DYTYG--YSDQSRTTGILAYETISMKSRKRSGKR 141
Query: 104 PPDEPS-PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQ-----LGRLVP 157
+ + + ++N+ GCS ES V +G++GL S Q LG +
Sbjct: 142 AGNHKTRTIRIKNVALGCSRES---VGASFLGASGVLGLGQGPISLATQTRHTALGGI-- 196
Query: 158 DRFSCCLV-----------------------------QPDKSFHSRLEFGDQIIAGKSLN 188
FS CLV P + + GK ++
Sbjct: 197 --FSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVD 254
Query: 189 -LPPNSFTIKLNGQRGCINDCGSVLTVIECEVYA-VLTAEFIDYF--SQHDIEKLFTCRK 244
+ + + I +G +G I D G+ L+ + Y+ VL A + +I + F
Sbjct: 255 GIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFEL-- 312
Query: 245 CGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL 304
C+N+ P + FQG ++ P N ++ ++ T G IL
Sbjct: 313 ----CYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNIL 368
Query: 305 GARHQHNTQFVYDL 318
G Q + YDL
Sbjct: 369 GNLLQQDHHIEYDL 382
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 84/358 (23%), Positives = 134/358 (37%), Gaps = 74/358 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
+++ +G+G P + + DT + L+W QCQPC S C+ Q DP+++ +Y + C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208
Query: 65 DASCKSPFH-CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+ C + C E + C Y + YGD T V S DT L S ++ FGC
Sbjct: 209 EPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALT-----SSRALAGFPFGCG 263
Query: 122 LES-KDFVSIQKKI------------IAGIMGL-------NWDSTSFMVQLGRLVPD--- 158
+ DF + + A G + +ST+ + +G
Sbjct: 264 TRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATPATDTG 323
Query: 159 --RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
+++ L +P +E I G L +PP FT + G + D G+VLT +
Sbjct: 324 AAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGGTLLDSGTVLTYLP 378
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPE 276
+ Y +L F ++ PA N Y F G V+ P
Sbjct: 379 AQAYELLRDRFRLTMERYT-----------------PAPPNDVLDACYDFAGESEVIVPA 421
Query: 277 NVFIFNHQDSFFF-FFGP-----------AFTPRKGK----TILGARHQHNTQFVYDL 318
F F F FFG AF +I+G Q + + +YD+
Sbjct: 422 VSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDV 479
>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 410
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 78/347 (22%), Positives = 141/347 (40%), Gaps = 59/347 (17%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+ +G G+ + LDT A +W C+PC+ Q +++ + +++ + C
Sbjct: 72 VSIGTGEGTRRKVLALDTGASTSWLMCEPCQPPLPQVGHLFSPAASPTFQGVRGDGPVCT 131
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVS 129
P+ + C + + Y +++ L + E SV I FGC+ F +
Sbjct: 132 VPYRHTDKGCSFRFPFAAGYLSRDTFHLRSGRSGTVME----SVPGIMFGCAHSVTGFHN 187
Query: 130 IQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP-DKSFHSRLEFGDQIIAGKSLN 188
++G++ L+ SF+ LG RFS CL +P + S L FG + +
Sbjct: 188 --DGTLSGVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTHNPDSFLRFGADVP-----S 240
Query: 189 LPPNSFT-----------------IKLNGQR------------GC-INDCGSVLTVIECE 218
LPP++ T I L +R GC IN ++ ++E
Sbjct: 241 LPPHAHTTTLVHAGVPGYHLNIVGISLGNKRLHIDRHVFAAGGGCSINPAVTITRIMELA 300
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGV----TCFNLPAR--FNSFPSMTYHFQ-GADL 271
AV ++ +++L + R G+ CF+ R P M++HF+ GA+L
Sbjct: 301 YLAV------EHALVAHMKELGSGRVKGMPGRSLCFDHMDRSVRVQLPGMSFHFEDGAEL 354
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
E +F + F G +T++GA Q +T+F +D+
Sbjct: 355 RFAAEQLFDVRVMAACFLVVGRGHH----QTVIGAAQQVDTRFTFDI 397
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 109/268 (40%), Gaps = 42/268 (15%)
Query: 13 GIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASC-- 68
I DP+ + +DT L W QC PC CY Q + +++ R ++ +PC A+C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 69 --KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
+ C C Y + YGD T +D TL PS V V N RFGCS
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL----NPSTV-VMNFRFGCSH---- 248
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKS 186
+++ A G + T +V+ ++P + L +E G G+
Sbjct: 249 --AVRGNFSASTSGTMFARTP-LVRNPSIIPTLYLVRL--------RGIEVG-----GRR 292
Query: 187 LNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG 246
LN+PP F G + D ++T + Y L F + + ++ R
Sbjct: 293 LNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY--PRVAGGRAGL 344
Query: 247 VTCFNLPARFNS--FPSMTYHFQGADLV 272
TC++ RF S P+++ F G +V
Sbjct: 345 DTCYDF-VRFTSVTVPAVSLVFDGGAVV 371
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 109/267 (40%), Gaps = 42/267 (15%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASC--- 68
I DP+ + +DT L W QC PC CY Q + +++ R ++ +PC A+C
Sbjct: 139 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 198
Query: 69 -KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
+ C C Y + YGD T +D TL PS V V N RFGCS
Sbjct: 199 GRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL----NPSTV-VMNFRFGCSH----- 248
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSL 187
+++ A G + T +V+ ++P + L +E G G+ L
Sbjct: 249 -AVRGNFSASTSGTMFARTP-LVRNPSIIPTLYLVRL--------RGIEVG-----GRRL 293
Query: 188 NLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV 247
N+PP F G + D ++T + Y L F + + ++ R
Sbjct: 294 NVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY--PRVAGGRAGLD 345
Query: 248 TCFNLPARFNS--FPSMTYHFQGADLV 272
TC++ RF S P+++ F G +V
Sbjct: 346 TCYDF-VRFTSVTVPAVSLVFDGGAVV 371
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 83/357 (23%), Positives = 130/357 (36%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC---------YEQNDPIYNSRSFKSY 58
Y ++ IG P K + +DT + + W C C C Q DP + +
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGTTVGCE 143
Query: 59 KKLPCYDASCKSPFHC--FEGDCFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
++ +++ P C C + ITYGD T V + + + S
Sbjct: 144 QEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNA 203
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL-------- 164
+I FGC + + + + GI+G +S + QL R V F+ CL
Sbjct: 204 SITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGI 263
Query: 165 ------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
VQP H + + G +L LP ++F +G I D G+
Sbjct: 264 FAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTF--DSGDSKGTIIDSGT 321
Query: 211 VLTVIECEVYAVLTAEFIDYFSQ---HDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF 266
L + EVY L A D + H+ + CF + FP +T+ F
Sbjct: 322 TLAYLPREVYRTLLAAVFDKYQDLPLHNYQDF--------VCFQFSGSIDDGFPVITFSF 373
Query: 267 QG-ADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKT--ILGARHQHNTQFVYDLD 319
+G L V P++ N D + F + GK +LG N VYDL+
Sbjct: 374 KGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLE 430
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 131/360 (36%), Gaps = 67/360 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
+T Y+ IG P + ++D L WTQC+ C C+EQ P+++ + +Y+
Sbjct: 45 WTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAE 104
Query: 62 PCYDASCKS----PFHCFEGDCFY--GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
PC C+S +C C Y GD D+ T + +
Sbjct: 105 PCGTPLCESIPSDVRNCSGNVCAYEASTNAGDTGGKVGTDTFAVGT----------AKAS 154
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
+ FGC + S D ++ +GI+GL S + Q G FS CL D +S L
Sbjct: 155 LAFGCVVAS-DIDTMGGP--SGIVGLGRTPWSLVTQTGVAA---FSYCLAPHDAGKNSAL 208
Query: 176 EFGDQII---AGKSLNLP-----------PNSFTIKLNGQRGCINDCGSVLTVIECEVYA 221
G GK+ + P N + ++L G + G + +
Sbjct: 209 FLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLK-----AGDAMIPLPPSGST 263
Query: 222 VLTAEFIDYFSQHD--IEKLFTCRKCGVT-----------------CFNLPARFNSFPSM 262
VL +D FS ++ + K VT CF + P +
Sbjct: 264 VL----LDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDL 319
Query: 263 TYHFQGADLVVEPENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ F+G + P ++ ++++ ++LG+ Q N F++DLD
Sbjct: 320 VFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLD 379
>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
Length = 278
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 38/145 (26%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++KL IG P ++ ++DT + L WTQC+PCK C++Q PI++ + S+ KLPC
Sbjct: 87 NGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPC- 145
Query: 65 DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS--- 121
D +Y T G L T T D SV I FGC
Sbjct: 146 -----------SSDLYYSSTQG---------VLATETFAFGD----ASVSKIGFGCGEDN 181
Query: 122 ----------LESKDFVSIQKKIIA 136
LE F +++K+ I+
Sbjct: 182 DGNSGTTITYLEDSAFAALKKEFIS 206
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 139/359 (38%), Gaps = 59/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C C C ++ +Y+ ++ + +
Sbjct: 88 YYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVM 147
Query: 63 CYDASCKSPF-----HCFEG-DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSV 113
C C F C C Y +TYGD T D+L + + P +
Sbjct: 148 CDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANA 207
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL------- 164
I FGC + + + + GI+G +TS + QL V F+ CL
Sbjct: 208 SVI-FGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGG 266
Query: 165 -------VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
VQP DK H + + G +L LP + F K +RG I D
Sbjct: 267 IFAIGDVVQPKVKTTPLVADKP-HYNVNLKTIDVGGTTLELPADIF--KPGEKRGTIIDS 323
Query: 209 GSVLTVIECEVY-AVLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTY 264
G+ LT + V+ V+ A F + + HD++ CF + FP++T+
Sbjct: 324 GTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFL--------CFEYSGSVDDGFPTLTF 375
Query: 265 HFQ-GADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTI--LGARHQHNTQFVYDLD 319
HF+ L V P F N D + F A + GK I +G N VYDL+
Sbjct: 376 HFEDDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLE 434
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 77/356 (21%), Positives = 128/356 (35%), Gaps = 69/356 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + + ++D L WTQC Q C+ C++Q+ P++++ + +++ PC A
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 67 SCKS-----------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C+S +E +G T G + D++ T +
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGRTVGRI----GTDAVAIGT---------AATAR 157
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
+ FGC++ S+ +G +GL + S Q+ FS CL PD S L
Sbjct: 158 LAFGCAVASEMDTMWGS---SGSVGLGRTNLSLAAQMNATA---FSYCLAPPDTGKSSAL 211
Query: 176 EFGDQII---AGKSLNLPP-------------NSFTIKLNGQRG-------------CIN 206
G AGK P S+ ++L R +
Sbjct: 212 FLGASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIMV 271
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
+ +T + VY L D + CF + P + F
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDL---CFPKASASGGAPDLVLAF 328
Query: 267 QGADLVVEPENVFIF---NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
QG + P + ++F N PA G +ILG+ Q N ++DLD
Sbjct: 329 QGGAEMTVPVSSYLFDAGNDTACVAILGSPAL---GGVSILGSLQQVNIHLLFDLD 381
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 141/357 (39%), Gaps = 68/357 (19%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F + +++ + G P + +LDT + +TWTQC+PC C + + ++ + +Y
Sbjct: 155 LFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSL 214
Query: 61 LPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C ++ + Y +TYGD + DT TL E S V +FGC
Sbjct: 215 GSCIPSTVGNT---------YNMTYGDKSTSVGNYGCDTMTL----EHSDV-FPKFQFGC 260
Query: 121 SLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK---------- 169
++ DF S G++GL S + Q FS CL + D
Sbjct: 261 GRNNEGDFGSGAD----GMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKA 316
Query: 170 -SFHSRLEFG----------------------DQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
S S L+F D + K LN+P + F G I
Sbjct: 317 TSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFA-----SPGTII 371
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG---VTCFNLPARFNS-FPSM 262
D G+V+T + Y+ L A F +++ + RK G TC+NL R + P +
Sbjct: 372 DSGTVITRLPQRAYSALKAAFKKAMAKYPLSN--GRRKKGDILDTCYNLSGRKDVLLPEI 429
Query: 263 TYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF +GAD+ + + V N AF TI+G R Q + +YD+
Sbjct: 430 VLHFGEGADVRLNGKRVIWGNDASRLCL----AFAGNSELTIIGNRQQVSLTVLYDI 482
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 72/312 (23%), Positives = 118/312 (37%), Gaps = 67/312 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
Y++ +G+G P + ++DT + ++W QC+PC + C+ +++ + +Y C
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167
Query: 65 DASCKSPFHCFEGD-------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
A+C E + C Y + YGD T S D TL D V+ +
Sbjct: 168 AAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDV-----VRGFQ 222
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQ-LGRLVPDRFSCCLVQPDKS------ 170
FGCS + + G++GL D+ S + Q R F C P S
Sbjct: 223 FGCS--HAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTLG 280
Query: 171 ---------------------------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
+ + LE D + GK L L P+ F G
Sbjct: 281 APASGGGGGASRFATTPMLRSKKVPTYYFAALE--DIAVGGKKLGLSPSVFAA------G 332
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN-SFP 260
+ D G+V+T + YA L++ F + + + G+ TCFN S P
Sbjct: 333 SLVDSGTVITRLPPAAYAALSSAF-----RAGMTRYARAEPLGILDTCFNFTGLDKVSIP 387
Query: 261 SMTYHFQGADLV 272
++ F G +V
Sbjct: 388 TVALVFAGGAVV 399
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 147/358 (41%), Gaps = 56/358 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P KS + +DT + + W C CK C ++ +YN S K +
Sbjct: 80 YYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVS 139
Query: 63 CYDASC----KSPFHCFEGD--CFYGITYGDVYET-----KEVDSLDTSTLLPPDEPSPV 111
C D C P + + C Y YGD T K+V D+ + D +
Sbjct: 140 CDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDS---VAGDLKTQT 196
Query: 112 SVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---GRLVPDRFSCCL--- 164
+ ++ FGC + +S D S ++ + GI+G ++S + QL GR V F+ CL
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR-VKKIFAHCLDGR 255
Query: 165 -----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
VQP + H + + + L +P + F + ++G I
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLF--QPGDRKGAI 313
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+ L + +Y L + SQ K+ K CF R + FP++T+
Sbjct: 314 IDSGTTLAYLPEIIYEPLVKKIT---SQEPALKVHIVDK-DYKCFQYSGRVDEGFPNVTF 369
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTP---RKGKTILGARHQHNTQFVYDLD 319
HF+ + + + ++F H+ + + + R+ T+LG N +YDL+
Sbjct: 370 HFENSVFLRVYPHDYLFPHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLE 427
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 148/358 (41%), Gaps = 56/358 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P KS + +DT + + W C CK C ++ +YN S K +
Sbjct: 80 YYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVS 139
Query: 63 CYDASC----KSPFHCFEGD--CFYGITYGDVYET-----KEVDSLDTSTLLPPDEPSPV 111
C D C P + + C Y YGD T K+V D+ + D +
Sbjct: 140 CDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDS---VAGDLKTQT 196
Query: 112 SVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---GRLVPDRFSCCL--- 164
+ ++ FGC + +S D S ++ + GI+G ++S + QL GR V F+ CL
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR-VKKIFAHCLDGR 255
Query: 165 -----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
VQP + H + + + LN+P + F + ++G I
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLF--QPGDRKGAI 313
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+ L + +Y L + SQ K+ K CF R + FP++T+
Sbjct: 314 IDSGTTLAYLPEIIYEPLVKKIT---SQEPALKVHIVDK-DYKCFQYSGRVDEGFPNVTF 369
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTP---RKGKTILGARHQHNTQFVYDLD 319
HF+ + + + ++F ++ + + + R+ T+LG N +YDL+
Sbjct: 370 HFENSVFLRVYPHDYLFPYEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLE 427
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 84/381 (22%), Positives = 143/381 (37%), Gaps = 86/381 (22%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN-DPIYNSRSFKSYKK 60
F N T + L +G P +S+ +LDT + L+W C+ +QN + ++N SY
Sbjct: 64 FYHNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKK-----QQNINSVFNPHLSSSYTP 118
Query: 61 LPCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
+PC CK+ P C + C ++Y D + + DT + +P
Sbjct: 119 IPCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQP---- 174
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
I FG + + G+MG+N S SF+ Q+G +FS C+ D S
Sbjct: 175 --GIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGF---PKFSYCISGKDASGV 229
Query: 171 ---------------------FHSRLEFGDQI----------IAGKSLNLPPNSFTIKLN 199
++ L + D++ + K L +P F
Sbjct: 230 LLFGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHT 289
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFI-------------DYFSQHDIEKLFTCRKCG 246
G + D G+ T + VY L EF+ ++ + ++ F R+ G
Sbjct: 290 GAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGG 349
Query: 247 VTCFNLPARFNSFPSMTYHFQGADLVVEPENVF---------IFNHQDSFFFFFGPAFTP 297
V +PA P++T F+GA++ V E + + D + FG +
Sbjct: 350 V----VPA----VPAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLL 401
Query: 298 RKGKTILGARHQHNTQFVYDL 318
++G HQ N +DL
Sbjct: 402 GIEAYVIGHHHQQNVWMEFDL 422
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 134/358 (37%), Gaps = 57/358 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C C+ C ++ Y+ ++ S +
Sbjct: 84 YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVS 143
Query: 63 CYDASC------KSPFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSV 113
C C K P C Y + YGD T D+L + + P +
Sbjct: 144 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNA 203
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL------- 164
+ FGC + + + + GI+G +TS + QL V F+ CL
Sbjct: 204 -TVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGG 262
Query: 165 -------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
VQP H + + G +L LP + F + ++G I D G
Sbjct: 263 IFAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVF--ETGERKGTIIDSG 320
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQ---HDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYH 265
+ LT + V+ + A + H+++ CF P + FP++T+H
Sbjct: 321 TTLTYLPELVFKEVMAAIFNKHQDIVFHNVQDFM--------CFQYPGSVDDGFPTITFH 372
Query: 266 FQ-GADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTI--LGARHQHNTQFVYDLD 319
F+ L V P F N D + F A + GK I +G N +YDL+
Sbjct: 373 FEDDLALHVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLE 430
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/374 (22%), Positives = 139/374 (37%), Gaps = 80/374 (21%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQC-------QPCKSCYEQNDPIYNSRSFKS 57
+ + L +GIG P + ++DT + L WTQC + S Q +P+Y R S
Sbjct: 81 DQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSS 140
Query: 58 YKKLPCYDASCKSPFHCFEGDCFYG--ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
+ LPC D C+ ++ +C Y ++Y + E + S + VS+
Sbjct: 141 FAYLPCSDRLCQEGQFSYK-NCARNNRCMYDELYGSAEAGGVLASETFTFGVNAKVSLP- 198
Query: 116 IRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR 174
+ FGC +L + D V +G+MGL+ S + QL VP RFS CL + S
Sbjct: 199 LGFGCGALSAGDLVG-----ASGLMGLSPGIMSLVSQLS--VP-RFSYCLTPFAERKTSP 250
Query: 175 LEFGDQI-----------------------------------IAGKSLNLPPNSF-TIKL 198
L FG + K L++P S IK
Sbjct: 251 LLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKP 310
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYF---------SQHDIEKLFTCRKCGVTC 249
+G G I D GS ++ +E + + ++ +D +L C
Sbjct: 311 DGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDDYEL---------C 361
Query: 250 FNLPARFN----SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRK-GKTIL 304
F LP P + HF G + P + + F + +P G +I+
Sbjct: 362 FALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNY-FQEPRAGLMCLAVGTSPDGFGVSII 420
Query: 305 GARHQHNTQFVYDL 318
G Q N ++D+
Sbjct: 421 GNVQQQNMHVLFDV 434
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 144/356 (40%), Gaps = 65/356 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ IG P + ++D L WTQC C+ C++Q+ P++ + ++K PC A
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121
Query: 68 CKS-PFHCFEGD-CFY--------GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C+S P GD C Y G T G + + ++ T+T+ +
Sbjct: 122 CESIPTRSCSGDVCSYKGPPTQLRGNTSG--FAATDTFAIGTATV------------RLA 167
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
FGC + S D ++ +G +GL S + Q+ +L RFS CL + SRL
Sbjct: 168 FGCVVAS-DIDTMDGP--SGFIGLGRTPWSLVAQM-KLT--RFSYCLSPRNTGKSSRLFL 221
Query: 178 GD--QIIAGKSLNLPP-----------NSFTIKLNGQRG-----CINDCGSVLTVIECEV 219
G ++ G+S + P + + + L+ R G +L +
Sbjct: 222 GSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSP 281
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVT--------CFNLPARFN--SFPSMTYHFQGA 269
+++L F + E + + CF A F+ + P + + FQGA
Sbjct: 282 FSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGA 341
Query: 270 DLVVEPENVFIFN---HQDSF-FFFFGPAFTPR---KGKTILGARHQHNTQFVYDL 318
+ P ++ + +D+ A+ R +G ++LG+ Q + F+YDL
Sbjct: 342 AALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDL 397
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 136/359 (37%), Gaps = 70/359 (19%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
LN Y +L IG P + ++DT + +T+ C C+ C DP + S +Y+ + C
Sbjct: 108 LNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC 167
Query: 64 -YDASCKSPFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
D +C +GD C Y Y ++ + V D + E +P Q FG
Sbjct: 168 TIDCNC-------DGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAP---QRAVFG 217
Query: 120 C-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCC------------- 163
C ++E+ D S GIMGL S M QL +++ D FS C
Sbjct: 218 CENVETGDLYSQHAD---GIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVL 274
Query: 164 ------------LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
PD+S + ++ + +AGK L L N F +G+ G + D G+
Sbjct: 275 GGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVF----DGKHGTVLDSGT- 329
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSF 259
YA L F +++L + ++ CF ++ SF
Sbjct: 330 -------TYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSF 382
Query: 260 PSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P + F G + PEN + + + G T+LG NT +YD
Sbjct: 383 PVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYD 441
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/355 (23%), Positives = 137/355 (38%), Gaps = 52/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++GIG P KS + +DT + + W C C +C ++ +Y+ S +
Sbjct: 81 YFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVT 140
Query: 63 CYDASCKS------PFHCFEGDCFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
C C + P C Y I+YGD T V + + + ++
Sbjct: 141 CGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANT 200
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL-------- 164
+I FGC + + + + GI+G ++S + QL R F+ CL
Sbjct: 201 SITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTINGGGI 260
Query: 165 ------VQPDKS----------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
VQP S ++ LE D + G L LP N F I + +G I D
Sbjct: 261 FAIGDVVQPKVSTTPLVPGMPHYNVNLEAID--VGGVKLQLPTNIFDIGES--KGTIIDS 316
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ 267
G+ L + VY + ++ F+Q+ L + CF + FP +T+HF+
Sbjct: 317 GTTLAYLPGVVYNAIMSKV---FAQYGDMPLKNDQD--FQCFRYSGSVDDGFPIITFHFE 371
Query: 268 GA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
G L + P + N + F + GK +LG N +YDL+
Sbjct: 372 GGLPLNIHPHDYLFQNGELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLE 426
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 133/356 (37%), Gaps = 62/356 (17%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++DT + +T+ C CK C DP + + ++Y+ + C
Sbjct: 90 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKCT 149
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
+ +C C Y Y ++ + V D + E SP Q FGC E
Sbjct: 150 WQCNCDDD----RKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSP---QRAIFGC--E 200
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCLV---------------- 165
+ + I + GIMGL S M QL +++ D FS C
Sbjct: 201 NDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISP 260
Query: 166 ---------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
P +S + ++ + +AGK L+L P F +G+ G + D G+
Sbjct: 261 PADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVF----DGKHGTVLDSGT------ 310
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPSMTY 264
YA L F +++ + ++ CF N+ SFP +
Sbjct: 311 --TYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEM 368
Query: 265 HF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G L + PEN + + + G T+LG NT +YD +
Sbjct: 369 VFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDRE 424
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 142/380 (37%), Gaps = 82/380 (21%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN-DPIYNSRSFKSYK 59
+F N T L IG P +++ +LDT + L+W +C+ E N I+N + K+Y
Sbjct: 60 LFHHNVTLTASLTIGTPPQNITMVLDTGSELSWLRCKK-----EPNFTSIFNPLASKTYT 114
Query: 60 KLPCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
K+PC +CK+ P C C + I+Y D + + +T P+ V
Sbjct: 115 KIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATV 174
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS- 170
FGC + + G+MG+N S SF+ Q+G +FS C+ D +
Sbjct: 175 ------FGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGF---RKFSYCISGLDSTG 225
Query: 171 ----------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKL 198
+ R+ + Q+ + K L LP + F
Sbjct: 226 FLLLGEARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDH 285
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFI-------------DYFSQHDIEKLFTCRKC 245
G + D G+ T + VY+ L EF+ Y Q ++ +
Sbjct: 286 TGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDST 345
Query: 246 GVTCFNLPARFNSFPSMTYHFQGADLVVEPENVFI-----FNHQDSFF-FFFGPAFTPRK 299
T NLP + F+GA++ V + + +DS + F FG +
Sbjct: 346 SSTLPNLPV-------VKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGI 398
Query: 300 GKTILGARHQHNTQFVYDLD 319
++G Q N YDL+
Sbjct: 399 SSFLIGHHQQQNVWMEYDLE 418
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 69/162 (42%), Gaps = 13/162 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+G+G PV +LDT + + W QC PC+ CY+Q+ +++ R+ SY + C
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ C Y + YGD T + +T T S V + GC +
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-----SGARVPRVALGCGHD 261
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
++ ++ G S SF Q+ R FS CLV
Sbjct: 262 NEGLFVAAAGLLGLGRG----SLSFPSQISRRFGRSFSYCLV 299
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 129/348 (37%), Gaps = 50/348 (14%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK---SCYEQNDPIYNSRSFKSYK 59
TLN Y++ +G P + +DT + L+W QC+PC SCY Q DP+++ SY
Sbjct: 45 TLN--YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYA 102
Query: 60 KLPCYDASCKS-----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
+PC C C C Y ++YGD T V S DT TL + +VQ
Sbjct: 103 AVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL-----SASSAVQ 157
Query: 115 NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFH 172
FGC +S F + G++GL + S + Q FS CL +P + +
Sbjct: 158 GFFFGCGHAQSGLFNGVD-----GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 212
Query: 173 SRLEFGDQIIAGKSLN----LP----PNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLT 224
L G A + LP P + + L G I+ G L+V +A T
Sbjct: 213 LTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTG----ISVGGQQLSV-PASAFAGGT 267
Query: 225 AEFIDYFSQH----DIEKLFTCRKCGVTCFNLP-ARFNSFPSMTYHFQGADLVVEPENVF 279
L + + G+ + P A N Y+F G V P
Sbjct: 268 VVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVAL 327
Query: 280 IFNH-------QDSFFFFFGPAFTPR---KGKTILGARHQHNTQFVYD 317
F D F AF P G ILG Q + + D
Sbjct: 328 TFGSGATVTLGADGILSFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 375
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 110/300 (36%), Gaps = 63/300 (21%)
Query: 23 FLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYDASCK---------SP 71
LLDT + + W QC PC + CY Q D +Y+ +S + C +C+ S
Sbjct: 184 MLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCSS 243
Query: 72 FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQ 131
G C Y + Y D T D +L P + V FGCS ++ S
Sbjct: 244 SSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQ-----VPKFEFGCSHAARG--SFS 296
Query: 132 KKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC------------LVQPDKS--------- 170
+ AGIM L S + Q FS C L P +S
Sbjct: 297 RSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRSSSRYAVTPM 356
Query: 171 ------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLT 224
+ RLE +AG+ L++PP F G D +V+T + Y L
Sbjct: 357 LKTPMLYQVRLEA--IAVAGQRLDVPPTVFAA------GAALDSRTVITRLPPTAYQALR 408
Query: 225 AEFIDYFSQHDIE----KLFTCRK-CGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVF 279
+ F D S + +L TC GV+ LP S+ + GA + ++P V
Sbjct: 409 SAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTI-----SLVFDRTGAGVQLDPSGVL 463
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 143/356 (40%), Gaps = 65/356 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ IG P + ++D L WTQC C+ C++Q+ P++ + ++K PC A
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104
Query: 68 CKS-PFHCFEGD-CFY--------GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C+S P GD C Y G T G + + ++ T+T+ +
Sbjct: 105 CESIPTRSCSGDVCSYKGPPTQLRGNTSG--FAATDTFAIGTATV------------RLA 150
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
FGC + S D ++ +G +GL S + Q+ +L RFS CL + SRL
Sbjct: 151 FGCVVAS-DIDTMDGP--SGFIGLGRTPWSLVAQM-KLT--RFSYCLSPRNTGKSSRLFL 204
Query: 178 GD--QIIAGKSLNLPP-----------NSFTIKLNGQRG-----CINDCGSVLTVIECEV 219
G ++ +S + P N + + L+ R G +L +
Sbjct: 205 GSSAKLAGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIATAQSGGILVMHTVSP 264
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVT--------CFNLPARFN--SFPSMTYHFQGA 269
+++L F + E + + CF A F+ + P + + FQGA
Sbjct: 265 FSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGA 324
Query: 270 DLVVEPENVFIFN---HQDSF-FFFFGPAFTPR---KGKTILGARHQHNTQFVYDL 318
+ P ++ + +D+ A+ R +G ++LG+ Q + F+YDL
Sbjct: 325 AALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDL 380
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 144/347 (41%), Gaps = 56/347 (16%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNS-RSFKSYKKLPC 63
N Y++KL +G P ++ L+DT + L W QC PC+ CY+Q +P+++ + S+
Sbjct: 28 NGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKECNSF----- 82
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
+D SC SP E C Y Y D TK + + + +T D P+ V++I FGC
Sbjct: 83 FDHSC-SP----EKACDYVYAYADDSATKGMLAKEIATFSSTDG-KPI-VESIIFGCGHN 135
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLV-PDRFSCCLV--QPDKSFHSRLEFGDQ 180
+ + + G+ G S Q+G L RFS CLV D + G+
Sbjct: 136 NTGVFNENDMGLIGLGGGPLSLVS---QMGNLYGSKRFSQCLVPFHADPHTSGTISLGEA 192
Query: 181 I-IAGKSLNLPP-------NSFTIKLNG----------------QRGCIN-DCGSVLTVI 215
++G+ + P + + L G +G I D G+ T +
Sbjct: 193 SDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGTPETYL 252
Query: 216 ECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLV 272
E Y L E I+ H ++ + C + NL P +T HF+GAD+
Sbjct: 253 PQEFYDRLVEELKVQINLPPIH-VDPDLGTQLCYKSETNLEG-----PILTAHFEGADVK 306
Query: 273 VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ P FI +D F F T G I G Q N +DLD
Sbjct: 307 LLPLQTFI-PPKDGVFCFAMTGTT--DGLYIFGNFAQSNVLIGFDLD 350
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 128/347 (36%), Gaps = 48/347 (13%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK---SCYEQNDPIYNSRSFKSYK 59
TLN Y++ +G P + +DT + L+W QC+PC SCY Q DP+++ SY
Sbjct: 137 TLN--YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYA 194
Query: 60 KLPCYDASCKS-----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
+PC C C C Y ++YGD T V S DT TL + +VQ
Sbjct: 195 AVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL-----SASSAVQ 249
Query: 115 NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFH 172
FGC +S F + G++GL + S + Q FS CL +P + +
Sbjct: 250 GFFFGCGHAQSGLFNGVD-----GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 304
Query: 173 SRLEFGDQIIAGKSLN----LP----PNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLT 224
L G A + LP P + + L G I+ G L+V
Sbjct: 305 LTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTG----ISVGGQQLSVPASAFAGGTV 360
Query: 225 AEFIDYFSQHDIEKLFTCR---KCGVTCFNLP-ARFNSFPSMTYHFQGADLVVEPENVFI 280
+ ++ R + G+ + P A N Y+F G V P
Sbjct: 361 VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALT 420
Query: 281 FNH-------QDSFFFFFGPAFTPR---KGKTILGARHQHNTQFVYD 317
F D F AF P G ILG Q + + D
Sbjct: 421 FGSGATVTLGADGILSFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 133/359 (37%), Gaps = 87/359 (24%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y+ + G P ++DT + LTW QC+PC S C Q DP+++ +Y +PC
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171
Query: 66 ASCKS------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
CK C G C + I+Y D T V D TL P V++ F
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLAP-----GAIVKDFYF 226
Query: 119 GCSLESKD-------------------------------FVSIQKKIIAGIMGLNWDSTS 147
GC ++ K G + +
Sbjct: 227 GCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSKPGFLAFGAGRNPSG 286
Query: 148 FM-VQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
F+ +GR+ P + S + + GK L+L P++F+ G I
Sbjct: 287 FVFTPMGRV-----------PGQPTFSTVTLAGITVGGKKLDLRPSAFS------GGMIV 329
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQH-----DIEKLFTCRKCGVTCFNLPARFN-SFP 260
D G+V+TV++ VY L A F + + D++ TC++L N P
Sbjct: 330 DSGTVVTVLQSTVYRALRAAFREAMKAYRLVHGDLD----------TCYDLTGYKNVVVP 379
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT-ILGARHQHNTQFVYD 317
+ F GA + ++ N + N +F A T + G +LG +Q + ++D
Sbjct: 380 KIALTFSGGATINLDVPNGILVNGCLAF------AETGKDGTAGVLGNVNQRTFEVLFD 432
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/374 (23%), Positives = 137/374 (36%), Gaps = 81/374 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PIYNSRSFKSYKKLPCY 64
Y ++L +G P K ++DT + LTW QC P + + P Y+ S SY+++PC
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 118
Query: 65 DASCK-------------SPFHCFEGDCFYGITYGD--------VYETKEVDSLDTSTLL 103
D C+ SP C D YG Y D YET + S S
Sbjct: 119 DDECQFLPAPIGSSCSITSPSPC---DYTYG--YSDQSRTTGILAYETISMKSRKRSGKR 173
Query: 104 PPDEPS-PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQ-----LGRLVP 157
+ + + ++N+ GCS ES V +G++GL S Q LG +
Sbjct: 174 AGNHKTRRIRIKNVALGCSRES---VGASFLGASGVLGLGQGPISLATQTRHTALGGI-- 228
Query: 158 DRFSCCLV-----------------------------QPDKSFHSRLEFGDQIIAGKSLN 188
FS CLV P + + GK ++
Sbjct: 229 --FSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVD 286
Query: 189 -LPPNSFTIKLNGQRGCINDCGSVLTVIECEVYA-VLTAEFIDYF--SQHDIEKLFTCRK 244
+ + + I +G +G I D G+ L+ + Y+ VL A + +I + F
Sbjct: 287 GIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFEL-- 344
Query: 245 CGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL 304
C+N+ P + FQG ++ P N ++ ++ T G IL
Sbjct: 345 ----CYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNIL 400
Query: 305 GARHQHNTQFVYDL 318
G Q + YDL
Sbjct: 401 GNLLQQDHHIEYDL 414
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/367 (21%), Positives = 145/367 (39%), Gaps = 66/367 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQ---NDPI--YNSRSFKSYKKLP 62
Y ++G+G+PVK +DT + + W C+PC C + N P+ Y+ R + +
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61
Query: 63 CYDASCK-----SPFHCFEG--DCFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C + C + +C Y +YGD V + + + + ++ L
Sbjct: 62 CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLAN--- 118
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL- 164
+ + FGCS+ +S ++ + GI+G S QL + +P FS CL
Sbjct: 119 ----TTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLE 174
Query: 165 --------------VQPDKSFHSRLEFG---DQIIAGKSLN---LPPNSFTIKLNGQRGC 204
+P ++ + + ++ G S+N LP ++ G
Sbjct: 175 GEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGV 234
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMT 263
I D G+ L Y V F+ + + CF + R + FP++T
Sbjct: 235 IMDSGTTLAYFPSGAYNV----FVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVT 290
Query: 264 YHFQGADLVVEPENVFIFNH------QDSFFFFFGPAFT---PRKGK--TILGARHQHNT 312
+F+G + ++P+N ++ D + + + + P+ G TILG +
Sbjct: 291 LNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDK 350
Query: 313 QFVYDLD 319
VYDLD
Sbjct: 351 LVVYDLD 357
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/357 (22%), Positives = 132/357 (36%), Gaps = 75/357 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ + +G P S +DT + ++W QC+PC +C Q D +++ +Y +PC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 66 ASCKS----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+C C C Y ++YGD T V DT L P + +V FGC
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGN-----TVGTFLFGCG 257
Query: 122 -LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK-SFHSRLEFGD 179
++ F I G++ L S S Q FS CL P K S L G
Sbjct: 258 HAQAGMFAGID-----GLLALGRQSMSLKSQAAGAYGGVFSYCL--PSKQSAAGYLTLGG 310
Query: 180 QI----------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ G+ + +P ++F G + D G+V
Sbjct: 311 PTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTV 364
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFN--SFPSMTYHFQ 267
+T + YA L + F + + + G+ TC++ +R+ + P++ F
Sbjct: 365 ITRLPPTAYAALRSAFRGAIAPYGYP---SAPANGILDTCYDF-SRYGVVTLPTVALTFS 420
Query: 268 -GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKG---KTILGARHQHNTQFVYDLDT 320
GA L +E + AF P G ILG Q + +D T
Sbjct: 421 GGATLALEAPGILSSGCL---------AFAPNGGDGDAAILGNVQQRSFAVRFDGST 468
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 86/356 (24%), Positives = 133/356 (37%), Gaps = 64/356 (17%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
LN Y +L IG P + ++DT + +T+ C C+ C DP + S +Y+ + C
Sbjct: 80 LNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC 139
Query: 64 -YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
D +C S C Y Y ++ + V D + E +P Q FGC +
Sbjct: 140 TIDCNCDSD----RMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAP---QRAVFGCEN 192
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCC---------------- 163
+E+ D S GIMGL S M QL ++ D FS C
Sbjct: 193 VETGDLYSQHAD---GIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLGGI 249
Query: 164 ---------LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + ++ + +AGK L L N F +G+ G + D G+
Sbjct: 250 SPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVF----DGKHGTVLDSGT---- 301
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPSM 262
YA L F +++L + +K CF ++ SFP +
Sbjct: 302 ----TYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVV 357
Query: 263 TYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F+ G + PEN + + + G T+LG NT VYD
Sbjct: 358 DMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYD 413
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/367 (21%), Positives = 143/367 (38%), Gaps = 66/367 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQ---NDPI--YNSRSFKSYKKLP 62
Y ++G+G+PVK +DT + + W C+PC C + N P+ Y+ R + +
Sbjct: 29 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 88
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C E +C Y +YGD V + + + + ++ L
Sbjct: 89 CSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLAN--- 145
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL- 164
+ + FGCS+ +S ++ + GI+G S QL + +P FS CL
Sbjct: 146 ----TTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLE 201
Query: 165 --------------VQPDKSFHSRLEFG---DQIIAGKSLN---LPPNSFTIKLNGQRGC 204
+P ++ + + ++ G S+N LP ++ G
Sbjct: 202 GEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGV 261
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMT 263
I D G+ L Y V F+ + + CF + R + FP++T
Sbjct: 262 IMDSGTTLAYFPSGAYNV----FVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVT 317
Query: 264 YHFQGADLVVEPENVFIFNH------QDSFFFFFGPAFT---PRKGK--TILGARHQHNT 312
+F+G + ++P+N ++ D + + + + P+ G TILG +
Sbjct: 318 LNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDK 377
Query: 313 QFVYDLD 319
VYDLD
Sbjct: 378 LVVYDLD 384
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 140/358 (39%), Gaps = 66/358 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y +L IG P + ++DT + +T+ C C+ C + DP + S +YK + C
Sbjct: 85 NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQC- 143
Query: 65 DASCKSPFHC-FEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
+ SC +C EG C Y Y ++ + + + D + E +P Q FGC +
Sbjct: 144 NPSC----NCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTP---QRAIFGCET 196
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------------- 164
+E+ + S + GIMGL S + QL +V + FS C
Sbjct: 197 VETGELFSQRAD---GIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNI 253
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + +E + +AGK L L P F +G+ G + D G+
Sbjct: 254 PPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVF----DGKHGTVLDSGT---- 305
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKC-------GVTCFNLPARFNS-----FPSM 262
YA L E F I+++ ++ CF+ R S FP +
Sbjct: 306 ----TYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEV 361
Query: 263 TYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G L + PEN + + S + G + T+LG NT YD D
Sbjct: 362 NMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRD 419
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 128/347 (36%), Gaps = 48/347 (13%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK---SCYEQNDPIYNSRSFKSYK 59
TLN Y++ +G P + +DT + L+W QC+PC SCY Q DP+++ SY
Sbjct: 137 TLN--YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYA 194
Query: 60 KLPCYDASCKS-----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
+PC C C C Y ++YGD T V S DT TL + +VQ
Sbjct: 195 AVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL-----SASSAVQ 249
Query: 115 NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFH 172
FGC +S F + G++GL + S + Q FS CL +P + +
Sbjct: 250 GFFFGCGHAQSGLFNGVD-----GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 304
Query: 173 SRLEFGDQIIAGKSLN----LP----PNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLT 224
L G A + LP P + + L G I+ G L+V
Sbjct: 305 LTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTG----ISVGGQQLSVPASAFAGGTV 360
Query: 225 AEFIDYFSQHDIEKLFTCR---KCGVTCFNLP-ARFNSFPSMTYHFQGADLVVEPENVFI 280
+ ++ R + G+ + P A N Y+F G V P
Sbjct: 361 VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALT 420
Query: 281 FNH-------QDSFFFFFGPAFTPR---KGKTILGARHQHNTQFVYD 317
F D F AF P G ILG Q + + D
Sbjct: 421 FGSGATVTLGADGILSFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/360 (21%), Positives = 132/360 (36%), Gaps = 67/360 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
+T Y+ IG P + ++D L WTQC+ C C+EQ+ P+++ + +Y+
Sbjct: 45 WTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAE 104
Query: 62 PCYDASCKS----PFHCFEGDCFY--GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
PC C+S +C C Y GD D+ T + +
Sbjct: 105 PCGTPLCESIPSDSRNCSGNVCAYQASTNAGDTGGKVGTDTFAVGT----------AKAS 154
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
+ FGC + S D ++ +GI+GL S + Q G FS CL D +S L
Sbjct: 155 LAFGCVVAS-DIDTMGGP--SGIVGLGRTPWSLVTQTGVAA---FSYCLAPHDAGKNSAL 208
Query: 176 EFGDQII---AGKSLNLP-----------PNSFTIKLNGQRGCINDCGSVLTVIECEVYA 221
G GK+ + P N + ++L G + G + +
Sbjct: 209 FLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLK-----AGDAMIPLPPSGST 263
Query: 222 VLTAEFIDYFSQHD--IEKLFTCRKCGVT-----------------CFNLPARFNSFPSM 262
VL +D FS ++ + K VT CF + P +
Sbjct: 264 VL----LDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDL 319
Query: 263 TYHFQGADLVVEPENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ F+G + + ++ ++++ ++LG+ Q N F++DLD
Sbjct: 320 VFTFRGGAAMTVAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLD 379
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 139/352 (39%), Gaps = 53/352 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-------PCKSCYEQNDPIYNSRSFKSYKK 60
Y++ + +G P +S+ + DT + L W +C+ + Q DP SRS +Y +
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP---SRS-STYGR 156
Query: 61 LPCYDASCKS--PFHCFEG-DCFYGITYGDVYETKEVDSLDTSTL---LPPDEPSPVSVQ 114
+ C +C++ C +G +C Y YGD T V S +T T P V V
Sbjct: 157 VSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQVRVG 216
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSFH 172
++FGCS + + + S + QLG + RFS CLV +
Sbjct: 217 GVKFGCSTATAGSFPADGLVGL-----GGGAVSLVTQLGGATSLGRRFSYCLVPHSVNAS 271
Query: 173 SRLEFG------------DQIIAGKS--------LNLPPNSFTIKLNGQRGCINDCGSVL 212
S L FG ++AG ++ + T+ I D G+ L
Sbjct: 272 SALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRIIVDSGTTL 331
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR----FNSFPSMTYHF-Q 267
T ++ + + E + ++ + C+N+ R S P +T F
Sbjct: 332 TFLDPSLLGPIVDELSRRITLPPVQSPDGLLQL---CYNVAGREVEAGESIPDLTLEFGG 388
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
GA + ++PEN F+ + + A T ++ +ILG Q N YDLD
Sbjct: 389 GAAVALKPENAFVAVQEGTLCLAI-VATTEQQPVSILGNLAQQNIHVGYDLD 439
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 112/315 (35%), Gaps = 60/315 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+++ IG+P +W +DT + L W +C PC C P+Y+ +S KLPC
Sbjct: 87 YIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQL 146
Query: 68 CKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C++ C + C Y YG + L T T D N+ F
Sbjct: 147 CQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDG---YVANNVSF 203
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV------------- 165
G S Q AG++GL S + QLG RF+ CL
Sbjct: 204 G---RSDTIDGSQFGGTAGLVGLGRGHLSLVSQLG---AGRFAYCLAADPNVYSTILFGS 257
Query: 166 --------------------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
+PD+ H + + G L + +F I +G G
Sbjct: 258 LAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVF 317
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR--FNSFPSMT 263
D G++ T ++ Y V+ +I++L TCF + P +
Sbjct: 318 FDSGAIDTSLKDAAYQVVRQAIT-----SEIQRL-GYDAGDDTCFVAANQQAVAQMPPLV 371
Query: 264 YHF-QGADLVVEPEN 277
HF GAD+ + N
Sbjct: 372 LHFDDGADMSLNGRN 386
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 144/360 (40%), Gaps = 62/360 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P + + +DT + + W C C C +++ +Y+ + + K +
Sbjct: 98 YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157
Query: 63 -----CYDASCKSPFHCFEG-DCFYGITYGD-----VYETKEVDSLDTSTLLPPDEPSPV 111
CY + P +C C Y Y D Y +++ D + D +
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQ---VSGDLETTS 214
Query: 112 SVQNIRFGCSL-ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---- 164
+ ++ FGCS +S D S ++ + GI+G +TS + QL V F+ CL
Sbjct: 215 ANGSVIFGCSATQSGDLSS--EEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN 272
Query: 165 ----------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
VQP H + + G LNLP + F + ++G I
Sbjct: 273 GGGIFAIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDV--GDKKGTII 330
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQ---HDIEKLFTCRKCGVTCFNLPARF-NSFPSM 262
D G+ L + VY L ++ + S H I F TCF + FP++
Sbjct: 331 DSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF-------TCFQYSESLDDGFPAV 383
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
T+HF+ + + + ++F++ + + G R+ T+LG N +YDL+
Sbjct: 384 TFHFENSLYLKVHPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLE 443
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 126/337 (37%), Gaps = 45/337 (13%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPFH 73
+G P + L+ L W P C+EQ P + +F + LP ASC SP
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFS--RGLPF--ASCGSPKF 56
Query: 74 CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKK 133
C Y +YGD T +D T + + SV + FGC L + +
Sbjct: 57 WPNQTCVYTYSYGDKSVTTGFLEVDKFTFVG----AGASVPGVAFGCGLFNNGVFKSNET 112
Query: 134 IIAGIMGLNWDSTSFMVQLGR-------------------LVPDRFS--------CCLVQ 166
IAG G S +++G L D FS L+Q
Sbjct: 113 GIAG-FGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQ 171
Query: 167 PDKSFHSR----LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAV 222
K+ + L + L +P ++F + NG G I D G+ +T + +VY V
Sbjct: 172 YAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVYQV 230
Query: 223 LTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPENVFIF 281
+ EF +Q + + TCF+ P++ P + HF+GA + + EN
Sbjct: 231 VRDEFA---AQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFE 287
Query: 282 NHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
D+ A TI+G Q N +YDL
Sbjct: 288 VPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDL 324
>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
Length = 343
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 39/61 (63%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++G+G P + L+ +LDT + +TW QCQPC CY+Q+DP+++ SY + C +
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPR 226
Query: 68 C 68
C
Sbjct: 227 C 227
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 144/360 (40%), Gaps = 62/360 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P + + +DT + + W C C C +++ +Y+ + + K +
Sbjct: 98 YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157
Query: 63 -----CYDASCKSPFHCFEG-DCFYGITYGD-----VYETKEVDSLDTSTLLPPDEPSPV 111
CY + P +C C Y Y D Y +++ D + D +
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQ---VSGDLETTS 214
Query: 112 SVQNIRFGCSL-ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---- 164
+ ++ FGCS +S D S ++ + GI+G +TS + QL V F+ CL
Sbjct: 215 ANGSVIFGCSATQSGDLSS--EEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN 272
Query: 165 ----------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
VQP H + + G LNLP + F + ++G I
Sbjct: 273 GGGIFAIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDV--GDKKGTII 330
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQ---HDIEKLFTCRKCGVTCFNLPARF-NSFPSM 262
D G+ L + VY L ++ + S H I F TCF + FP++
Sbjct: 331 DSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF-------TCFQYSESLDDGFPAV 383
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
T+HF+ + + + ++F++ + + G R+ T+LG N +YDL+
Sbjct: 384 TFHFENSLYLKVHPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLE 443
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 87/363 (23%), Positives = 135/363 (37%), Gaps = 57/363 (15%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC---------YEQNDPIYNSR 53
T Y ++ IG P K + +DT + + W C C C Q DP +
Sbjct: 80 TATGLYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT 139
Query: 54 SFKSYKKLPCYDASCKSPFHC--FEGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEP 108
+ ++ ++ P C C + I YGD T DS+ + + +
Sbjct: 140 TVGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQT 199
Query: 109 SPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL-- 164
+P S +I FGC + + + + GI+G +S + QL R V F+ CL
Sbjct: 200 TP-SNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDT 258
Query: 165 ------------VQP--------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC 204
VQP H + + G +L LP ++F +G
Sbjct: 259 VHGGGIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTF--DSGDSKGT 316
Query: 205 INDCGSVLTVIECEVY-AVLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARF-NSFP 260
I D G+ L + EVY +LTA F Y + H+ + CF + FP
Sbjct: 317 IIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQDF--------VCFQFSGSIDDGFP 368
Query: 261 SMTYHFQGA-DLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGK--TILGARHQHNTQFVY 316
+T+ F+G L V P + N D + F + GK +LG N VY
Sbjct: 369 VVTFSFEGEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVY 428
Query: 317 DLD 319
DL+
Sbjct: 429 DLE 431
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/353 (22%), Positives = 130/353 (36%), Gaps = 45/353 (12%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKL 61
TL + ++LG P KS L+DT + ++W +C+PC + C Q DP+++ +Y
Sbjct: 137 TLEYVITVRLG-SPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPF 195
Query: 62 PCYDASCKSPFH-------CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C A+C F G C Y YGD + + TL + V V
Sbjct: 196 SCSSAACAQLFQEGNANGCSSSGQCQYIAMYGD-GSVGTTGTYSSDTLALGSNSNTVVVS 254
Query: 115 NIRFGCSLESKDFVSIQKKI-------------IAGIMGLNW--------DSTSFMVQLG 153
RFGCS + + AG G S+S + LG
Sbjct: 255 KFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSSSGFLTLG 314
Query: 154 RLVPDRFSCCLVQPDKSFHSRLEFGDQI----IAGKSLNLPPNSFTIKLNGQRGCINDCG 209
+S +G ++ + G+ L++P F+ G I D G
Sbjct: 315 AAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFSA------GMIMDSG 368
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQG 268
+V+T + Y+ L++ F Q+ TCF++ + + S P++ F G
Sbjct: 369 TVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVFSG 428
Query: 269 ADLVV---EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
A V + + + S F A + I+G Q Q +YD+
Sbjct: 429 AGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDV 481
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 132/353 (37%), Gaps = 75/353 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + +G P K ++DT + LTW +C PC C D + ++ +YK L C D
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASN----TYKALTCAD- 57
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
D YG YGD T+ S+DT + FGC K
Sbjct: 58 -----------DYSYG--YGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSLLKG 104
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--------------------- 165
+S + GI+ L+ S SF Q+G ++FS CL+
Sbjct: 105 LISGE----VGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVE 160
Query: 166 --QPDKSFHSRLEFG-------------DQIIAG-KSLNLPPNSFTIKLNGQ-RGCINDC 208
+P L++ D I G + L+L P++F LNGQ + I D
Sbjct: 161 LKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAF---LNGQDKPTIFDS 217
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PARFNSFPSMTYHFQ 267
G+ LT++ V + S + F K CF + P+ P +T+HF
Sbjct: 218 GTTLTMLPPGVCDSIKQSLASMVSGAE----FVAIKGLDACFRVPPSSGQGLPDITFHFN 273
Query: 268 -GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
GAD V P N I F P +I G Q + ++D+D
Sbjct: 274 GGADFVTRPSNYVIDLGSLQCLI-----FVPTNEVSIFGNLQQQDFFVLHDMD 321
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 84/188 (44%), Gaps = 24/188 (12%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
TLN Y++ +G+G K++ ++DT + LTW QC+PC SCY Q PI+ + SY+ +
Sbjct: 62 TLN--YIVTMGLGS--KNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117
Query: 63 CYDASCKS-------PFHCFEGD---CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C ++C+S C + C Y + YGD T ++ + VS
Sbjct: 118 CNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSF------GGVS 171
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFH 172
V + FGC +K ++G+MGL S + Q FS CL +
Sbjct: 172 VSDFVFGCGRNNKGLFG----GVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSS 227
Query: 173 SRLEFGDQ 180
L G++
Sbjct: 228 GSLVMGNE 235
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 129/353 (36%), Gaps = 62/353 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ IG P + + ++D L WTQC PC+ C+EQ+ P+++ +++ LPC
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 68 C----KSPFHCFEGDCFY--GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC- 120
C +S +C C Y GD D+ + + + FGC
Sbjct: 117 CESIPESSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAIG----------AAKETLGFGCV 166
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--------------Q 166
+ K +I +GI+GL S + Q+ FS CL Q
Sbjct: 167 VMTDKRLKTIGGP--SGIVGLGRTPWSLVTQMNVTA---FSYCLAGKSSGALFLGATAKQ 221
Query: 167 PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVL--- 223
+S F + AG S N + +KL G I G+ L VL
Sbjct: 222 LAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAG----IKTGGAPLQAASSSGSTVLLDT 277
Query: 224 --TAEFIDYFSQHDIEKLFTCRKCGVTCFNLP----------ARFNSFPSMTYHFQ-GAD 270
A ++ + ++K T GV P A P + + F GA
Sbjct: 278 VSRASYLADGAYKALKKALTA-AVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAA 336
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFT-----PRKGKTILGARHQHNTQFVYDL 318
L V P N + + + G + + +G +ILG+ Q N ++DL
Sbjct: 337 LTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDL 389
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 122/341 (35%), Gaps = 62/341 (18%)
Query: 23 FLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYDASCKS--PFH---CF 75
++DT + + W QC PC + C+ Q D +Y+ S PC +C++ P+
Sbjct: 158 MVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTP 217
Query: 76 EGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKI 134
GD C Y + Y D + D TL P S +S RFGCS S K
Sbjct: 218 AGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAIS--EFRFGCSHALLQPGSFSNK- 274
Query: 135 IAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS--------------------- 173
+GIM L + S Q D FS CL P HS
Sbjct: 275 TSGIMALGRGAQSLPTQTKATYGDVFSYCL--PPTPVHSGFFILGVPRVAASRYAVTPML 332
Query: 174 RLEFGDQI---------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLT 224
R + + +AGK L +PP F G + D +++T + Y L
Sbjct: 333 RSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAA------GAVMDSRTIVTRLPPTAYMALR 386
Query: 225 AEFI----DYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE--PENV 278
A F+ Y + E L TC P +T F G + VE P V
Sbjct: 387 AAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVK-LPKITLVFDGPNGAVELDPSGV 445
Query: 279 FIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ F P T + I+G Q + +Y++D
Sbjct: 446 LLDG-----CLAFAP-NTDDQMTGIIGNVQQQALEVLYNVD 480
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 79/368 (21%), Positives = 126/368 (34%), Gaps = 80/368 (21%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC--- 63
+Y+++ G+G P + + LDT A TW C PC +C ++ + SY LPC
Sbjct: 76 SYVVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGS-LFAPANSTSYAPLPCSST 134
Query: 64 ---------------YDASCKSPFHCFE---GDCFYGITYGDVYETKEVDSLDTSTLLPP 105
YD+S P F D + + + D
Sbjct: 135 MCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWLHLGKD---------- 184
Query: 106 DEPSPVSVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
++ N FGC S S ++ K+ G++GL + + Q+G + FS CL
Sbjct: 185 ------AIPNYAFGCVSAVSGPTANLPKQ---GLLGLGRGPMALLSQVGNMYNGVFSYCL 235
Query: 165 V-----------------------------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFT 195
P++S + + + +P SF
Sbjct: 236 PSYKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFA 295
Query: 196 IKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PA 254
G + D G+V+T VYA L EF + + +T TCFN
Sbjct: 296 FDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAA---PSGYTSLGAFDTCFNTDEV 352
Query: 255 RFNSFPSMTYHFQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQH 310
P++T H G DL + EN I + A P+ +L Q
Sbjct: 353 AAGVAPAVTVHMDGGLDLALPMENTLIHSSATP-LACLAMAEAPQNVNAVVNVLANLQQQ 411
Query: 311 NTQFVYDL 318
N + V+D+
Sbjct: 412 NLRVVFDV 419
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 127/353 (35%), Gaps = 62/353 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ IG P + + ++D L WTQC PC+ C+EQ+ P+++ +++ LPC
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 68 C----KSPFHCFEGDCFYG--ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC- 120
C +S +C C Y GD D+ + + + FGC
Sbjct: 117 CESIPESSRNCTSDVCIYEAPTKAGDTGGMAGTDTFAIG----------AAKETLGFGCV 166
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV--------------Q 166
+ K +I +GI+GL S + Q+ FS CL Q
Sbjct: 167 VMTDKRLKTIGGP--SGIVGLGRTPWSLVTQMNVTA---FSYCLAGKSSGALFLGATAKQ 221
Query: 167 PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVL--T 224
+S F + AG S N + +KL G I G+ L VL T
Sbjct: 222 LAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAG----IKAGGAPLQAASSSGSTVLLDT 277
Query: 225 AEFIDYFSQHDIEKLFTCRKCGVT-------------CFNLPARFNSFPSMTYHFQ-GAD 270
Y + + L V CF+ A P + + F GA
Sbjct: 278 VSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFS-KAVAGDAPELVFTFDGGAA 336
Query: 271 LVVEPENVFIFNHQDSFFFFFGPAFT-----PRKGKTILGARHQHNTQFVYDL 318
L V P N + + + G + + +G +ILG+ Q N ++DL
Sbjct: 337 LTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDL 389
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 84/366 (22%), Positives = 134/366 (36%), Gaps = 69/366 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS----CYEQNDPIYNSRSFKSYKK 60
+ + L +GIG P + ++DT + L WTQC+ S + P+Y+ ++
Sbjct: 88 DQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAF 147
Query: 61 LPCYDASCKSPFHCFEGDCFYG--ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
LPC D C+ F+ +C Y DVY + + S VS++ + F
Sbjct: 148 LPCSDRLCQEGQFSFK-NCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLR-LGF 205
Query: 119 GC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
GC +L + + GI+GL+ +S S + QL RFS CL S L F
Sbjct: 206 GCGALSAGSLIG-----ATGILGLSPESLSLITQLKI---QRFSYCLTPFADKKTSPLLF 257
Query: 178 GDQI----------------------------------IAGKSLNLPPNSFTIKLNGQRG 203
G + K L +P S ++ +G G
Sbjct: 258 GAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGG 317
Query: 204 CINDCGSVLT-VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG---VTCFNLPARFNS- 258
I D GS + ++E AV A D+ +L + CF LP R +
Sbjct: 318 TIVDSGSTVAYLVEAAFEAVKEAVM-------DVVRLPVANRTVEDYELCFVLPRRTAAA 370
Query: 259 ------FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
P + HF G +V P + + + T G +I+G Q N
Sbjct: 371 AMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNM 430
Query: 313 QFVYDL 318
++D+
Sbjct: 431 HVLFDV 436
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 75/170 (44%), Gaps = 13/170 (7%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC-YEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + +G P +SL + DT + L W +C C++C + + R S+ C+D
Sbjct: 88 YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 147
Query: 67 SCK----SPFHC-----FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C+ +P H C + +Y D + S +T+TL S + ++ +
Sbjct: 148 HCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG-SEIHLKGLS 206
Query: 118 FGCS--LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
FGC + Q G+MGL S SF QLGR ++FS CL+
Sbjct: 207 FGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLM 256
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 84/370 (22%), Positives = 132/370 (35%), Gaps = 79/370 (21%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
L+HT L + IG P + +LDT + L WTQC+ + + P+Y+ S+ PC
Sbjct: 87 LHHT--LTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPC 144
Query: 64 YDASCKSPF----HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C++ +C C Y YG E+ S +T T E VSV ++ FG
Sbjct: 145 DGRLCETGSFNTKNCSRNKCIYTYNYGSATTKGELAS-ETFTF---GEHRRVSV-SLDFG 199
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFG 178
C S +GI+G++ D S + QL +P RFS CL D++ S + FG
Sbjct: 200 CG----KLTSGSLPGASGILGISPDRLSLVSQL--QIP-RFSYCLTPFLDRNTTSHIFFG 252
Query: 179 DQI-----------------------------------IAGKSLNLPPNSFTIKLNGQRG 203
+ K LN+P +SF I +G G
Sbjct: 253 AMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGG 312
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYF-------SQHDIEKLFTCRKCGVTCFNLPARF 256
D G ++ V L ++ + H E CF LP
Sbjct: 313 TFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYEL--------CFQLPRNG 364
Query: 257 NS-------FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQ 309
P + YHF G ++ + ++ + I+G Q
Sbjct: 365 GGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSAGRMCL---VISSGARGAIIGNYQQ 421
Query: 310 HNTQFVYDLD 319
N ++D++
Sbjct: 422 QNMHVLFDVE 431
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 76/354 (21%), Positives = 126/354 (35%), Gaps = 70/354 (19%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P +++ LDT W C C C + +++ S + L C
Sbjct: 87 TYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAP 144
Query: 67 SCK---SPFHCFEGDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK +P C + +TYG + Y T++ +L T + N FG
Sbjct: 145 QCKQAPNPSCTVSKSCGFNMTYGGSAIEAYLTQDTLTLATDV-----------IPNYTFG 193
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-FHSRLEFG 178
C + S+ + G+MGL S + Q L FS CL S F L G
Sbjct: 194 C-INKASGTSLPAQ---GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLG 249
Query: 179 DQ----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ + K +++P ++ G I D G+
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGT 309
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLF---TCRKCGVTCFNLPARFNSFPSMTYHFQ 267
V T + Y + EF + L TC V FPS+T+ F
Sbjct: 310 VYTRLVEPAYVAMRNEFRRRVKNANATSLGGFDTCYSGSVV----------FPSVTFMFA 359
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
G ++ + P+N+ I + + A P ++L + Q N + + D+
Sbjct: 360 GMNVTLPPDNLLIHSSAGN-LSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDV 412
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 61/140 (43%), Gaps = 16/140 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y +G+G P ++DT + L W QC PC+ CY Q +++ R +Y+++PC
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145
Query: 68 CKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C++ G C Y + YGD + L T L ++ V N+ GC
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGD--GSSSTGDLATDKLAFAND---TYVNNVTLGC 200
Query: 121 SLESKDFVSIQKKIIAGIMG 140
+++ AG++G
Sbjct: 201 GRDNEGLF----DSAAGLLG 216
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 75/349 (21%), Positives = 133/349 (38%), Gaps = 48/349 (13%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
T + TY++K IG P ++L +DT +W C C C P ++S ++KK+
Sbjct: 92 ITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGC-STTTPFAPAKS-TTFKKV 149
Query: 62 PCYDASCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C + CK + C C + TYG + SL T+ +P P FG
Sbjct: 150 GCGASQCKQVRNPTCDGSACAFNFTYG---TSSVAASLVQDTVTLATDPVPAYA----FG 202
Query: 120 C-------------------------SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR 154
C + K + S + LN+ + + + +
Sbjct: 203 CIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGSLRLGPVAQ 262
Query: 155 LVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+F+ L P +S + + + +++PP + N G + D G+V T
Sbjct: 263 PKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTVFTR 322
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNSFPSMTYHFQGADLVV 273
+ Y + EF + H +KL G TC+ P P++T+ F G ++ +
Sbjct: 323 LVEPAYNAVRNEFRRRIAVH--KKLTVTSLGGFDTCYTAPI---VAPTITFMFSGMNVTL 377
Query: 274 EPENVFIFNHQDSFF-FFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
P+N+ I + S PA P ++L Q N + ++D+
Sbjct: 378 PPDNILIHSTAGSVTCLAMAPA--PDNVNSVLNVIANMQQQNHRVLFDV 424
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 143/355 (40%), Gaps = 50/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K+ + +DT + + W C CK C +++ +Y+ + S K +P
Sbjct: 85 YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVP 144
Query: 63 CYDASCKSPFHCFEGDCFYGIT------YGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
C CK C I+ YGD T V + + D + +
Sbjct: 145 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 204
Query: 115 NIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
+I FGC + +S D S ++ + GI+G ++S + QL V F+ CL
Sbjct: 205 SIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 264
Query: 165 -------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
VQP + H + + L+L ++ T ++G I D G
Sbjct: 265 IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTST--QGDRKGTIIDSG 322
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ- 267
+ L + +Y L + I SQH K+ T TCF + FP++T++F+
Sbjct: 323 TTLAYLPEGIYEPLVYKII---SQHPDLKVRTLHD-EYTCFQYSESVDDGFPAVTFYFEN 378
Query: 268 GADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
G L V P + ++F D + + G K T+LG N YDL+
Sbjct: 379 GLSLKVYPHD-YLFPSGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLE 432
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 118/322 (36%), Gaps = 65/322 (20%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y+++ G+G PV+ L LDT A TW+ C PC +C + I S S SY LPC
Sbjct: 78 SYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSS--SYASLPCASD 135
Query: 67 SCKSPFHCFEGD--------------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C FEG C + + D T SL + TL + +
Sbjct: 136 WCP----LFEGQPCPANQDASAPLPACAFSKPFAD---TSFQASLGSDTLRLGKD----A 184
Query: 113 VQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF 171
+ FGC + ++ K+ G++GL S + Q G FS CL +
Sbjct: 185 IAGYAFGCVGAVAGPTTNLPKQ---GLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYY 241
Query: 172 HS---RLEFGDQ-------------------------IIAGKS-LNLPPNSFTIKLNGQR 202
S RL Q + G++ + +P SF
Sbjct: 242 FSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGA 301
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PARFNSFPS 261
G + D G+V+T VYA L EF Q +T TCFN P
Sbjct: 302 GTVIDSGTVITRWTAPVYAALREEF---RRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPP 358
Query: 262 MTYHFQGA-DLVVEPENVFIFN 282
+T H G DL + EN I +
Sbjct: 359 VTLHMDGGVDLTLPMENTLIHS 380
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 82/355 (23%), Positives = 138/355 (38%), Gaps = 50/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + +DT + + W C C +C + + +++ + +P
Sbjct: 84 YTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVP 143
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPV- 111
C D C S C Y Y D T V D++ +L P+ V
Sbjct: 144 CSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVA 203
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL----- 164
S I FGCS ++ K + GI+G S + QL + P FS CL
Sbjct: 204 SSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGN 263
Query: 165 ----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
++P + H L + G+ L++ P F + +RG I
Sbjct: 264 GGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFAT--SDKRGTII 321
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
D G+ L+ + E Y L SQ + +C + L + +SFP+++++F
Sbjct: 322 DSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQCYLV---LTSIDDSFPTVSFNF 378
Query: 267 Q-GADLVVEPENVFIFNH--QDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ GA + ++P ++ N QD + ++G TILG + VYDL
Sbjct: 379 EGGASMDLKPSQ-YLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDL 432
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 62/281 (22%), Positives = 107/281 (38%), Gaps = 62/281 (22%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
Y++ +G+G P + ++DT + ++W QC+PC + C+ +++ + +Y C
Sbjct: 106 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 165
Query: 65 DASCKSPFHCFEGD-------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
A+C E + C Y + YGD T +
Sbjct: 166 AAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGT--------------------GFQ 205
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG---RLVPDRFSCCLVQPDKSFHSR 174
FGCS + + G++GL D+ S + Q + VP + L
Sbjct: 206 FGCS--HAELGAGMDDKTDGLIGLGGDAQSLVSQTAARSKKVPTYYFAALE--------- 254
Query: 175 LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQH 234
D + GK L L P+ F G + D G+V+T + YA L++ F +
Sbjct: 255 ----DIAVGGKKLGLSPSVFAA------GSLVDSGTVITRLPPAAYAALSSAF-----RA 299
Query: 235 DIEKLFTCRKCGV--TCFNLPARFN-SFPSMTYHFQGADLV 272
+ + G+ TCFN S P++ F G +V
Sbjct: 300 GMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVV 340
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 56/121 (46%), Gaps = 11/121 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + + +G P+ + + DT + L WTQC PC C++Q P + S ++ KLPC +
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 68 CK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C+ S C C Y YG Y L T TL D P ++ FGCS E
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGY---LATETLKVGDASFP----SVAFGCSTE 198
Query: 124 S 124
+
Sbjct: 199 N 199
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 118/322 (36%), Gaps = 65/322 (20%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y+++ G+G PV+ L LDT A TW+ C PC +C + I S S SY LPC
Sbjct: 78 SYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSS--SYASLPCASD 135
Query: 67 SCKSPFHCFEGD--------------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C FEG C + + D T SL + TL + +
Sbjct: 136 WCP----LFEGQPCPANQDASAPLPACAFSKPFAD---TSFQASLGSDTLRLGKD----A 184
Query: 113 VQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF 171
+ FGC + ++ K+ G++GL S + Q G FS CL +
Sbjct: 185 IAGYAFGCVGAVAGPTTNLPKQ---GLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYY 241
Query: 172 HS---RLEFGDQ-------------------------IIAGKS-LNLPPNSFTIKLNGQR 202
S RL Q + G++ + +P SF
Sbjct: 242 FSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGA 301
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PARFNSFPS 261
G + D G+V+T VYA L EF Q +T TCFN P
Sbjct: 302 GTVIDSGTVITRWTAPVYAALREEF---RRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPP 358
Query: 262 MTYHFQGA-DLVVEPENVFIFN 282
+T H G DL + EN I +
Sbjct: 359 VTLHMDGGVDLTLPMENTLIHS 380
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 132/364 (36%), Gaps = 68/364 (18%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y+++ G+G P + L LDT A TW C PC +C + ++ + SY LPC +
Sbjct: 80 SYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSS 137
Query: 67 SC------KSPFHCFEGD----------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSP 110
C P GD C + + D + S DT L
Sbjct: 138 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALAS-DTLRL------GK 190
Query: 111 VSVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
++ N FGC S + ++ ++ G++GL + + Q G L FS CL
Sbjct: 191 DAIPNYTFGCVSSVTGPTTNMPRQ---GLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRS 247
Query: 170 SFHS---RLEFGD--------------------------QIIAGKS-LNLPPNSFTIKLN 199
+ S RL G + G++ + +P SF
Sbjct: 248 YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAA 307
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PARFNS 258
G + D G+V+T VYA L EF Q +T TCFN
Sbjct: 308 TGAGTVVDSGTVITRWTAPVYAALREEF---RRQVAAPSGYTSLGAFDTCFNTDEVAAGG 364
Query: 259 FPSMTYHFQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQF 314
P++T H G DL + EN I + A P+ + ++ Q N +
Sbjct: 365 APAVTVHMDGGVDLALPMENTLIHSSATP-LACLAMAEAPQNVNSVVNVIANLQQQNIRV 423
Query: 315 VYDL 318
V+D+
Sbjct: 424 VFDV 427
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 82/354 (23%), Positives = 132/354 (37%), Gaps = 56/354 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ L +G P L LDT + +W QC+PC CYEQ DP+++ + +Y +PC
Sbjct: 139 YVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARE 198
Query: 68 CKS---------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV-SVQNIR 117
C+ +C Y ++Y D T + DT TL P PSP +V
Sbjct: 199 CQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFV 258
Query: 118 FGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLE 176
FGC + F + + G+ S Q+ FS CL S L
Sbjct: 259 FGCGHSNAGTFGEVDGLLGLGLG-----KASLPSQVAARYGAAFSYCLPS-SPSAAGYLS 312
Query: 177 FGDQ---------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
FG ++AG+++ +P ++F G I D G
Sbjct: 313 FGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAA----GTIIDSG 368
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ- 267
+ + + YA L + F ++ ++ + TC++ P++ F
Sbjct: 369 TAFSRLPPSAYAALRSSFRSAMGRYRYKRAPS-SPIFDTCYDFTGHETVRIPAVELVFAD 427
Query: 268 GADLVVEPENV-FIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
GA + + P V + +N AF P ILG Q +YD+ +
Sbjct: 428 GATVHLHPSGVLYTWNDVAQTCL----AFVPNHDLGILGNTQQRTLAVIYDVGS 477
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 138/359 (38%), Gaps = 66/359 (18%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F + +++ + G P + +LDT + +TWTQC+ C C + + ++S + +Y
Sbjct: 120 LFDEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSF 179
Query: 61 LPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C ++ + Y +TYGD + DT TL P D Q +FGC
Sbjct: 180 GSCIPSTVGNT---------YNMTYGDKSTSVGNYGCDTMTLEPSD-----VFQKFQFGC 225
Query: 121 SLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
++ DF S G++GL S + Q FS CL P+++ L FG+
Sbjct: 226 GRNNEGDFGSGAD----GMLGLGQGQLSTVSQTASKFKKVFSYCL--PEENSIGSLLFGE 279
Query: 180 QI-----------------------------------IAGKSLNLPPNSFTIKLNGQRGC 204
+ + K LN+P + F G
Sbjct: 280 KATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGT 334
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNS-FPSM 262
I D G+V+T + Y+ L A F +++ + + TC+NL R + P
Sbjct: 335 IIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEX 394
Query: 263 TYHF-QGADLVVEPENVFIFNHQDSFFFFFG--PAFTPRKGKTILGARHQHNTQFVYDL 318
HF GAD+ + + V N F T TI+G R Q + +YD+
Sbjct: 395 VLHFGDGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDI 453
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 134/352 (38%), Gaps = 55/352 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++DT + +T+ C C+ C + DP + +Y + C
Sbjct: 85 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN 144
Query: 64 YDASCKSPFHCFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
D +C +G +C Y Y ++ + V D + E P Q FGC +
Sbjct: 145 MDCNCD-----HDGVNCVYERRYAEMSSSSGVLGEDIISFGNQSEVVP---QRAVFGCEN 196
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------------- 164
+E+ D S + GIMGL S + QL ++ D FS C
Sbjct: 197 VETGDLYSQRAD---GIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVLGGI 253
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + +E + +AGK L L P++F K G + D G+
Sbjct: 254 PPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRK----HGTVLDSGTTYAY 309
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCR-KCGVTCFNLPAR-----FNSFPSMTYHF-Q 267
+ E + I H+++++ CF+ R +FP + F
Sbjct: 310 LPEEAFVAFRDAIIK--KSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSN 367
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G L + PEN ++F H + F T+LG NT YD +
Sbjct: 368 GQKLSLTPEN-YLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRE 418
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 118/322 (36%), Gaps = 65/322 (20%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y+++ G+G PV+ L LDT A TW+ C PC +C + I S S SY LPC
Sbjct: 78 SYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSS--SYASLPCASD 135
Query: 67 SCKSPFHCFEGD--------------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C FEG C + + D T SL + TL + +
Sbjct: 136 WCP----LFEGQPCPANQDASAPLPACAFSKPFAD---TSFQASLGSDTLRLGKD----A 184
Query: 113 VQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF 171
+ FGC + ++ K+ G++GL S + Q G FS CL +
Sbjct: 185 IAGYAFGCVGAVAGPTTNLPKQ---GLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYY 241
Query: 172 HS---RLEFGDQ-------------------------IIAGKS-LNLPPNSFTIKLNGQR 202
S RL Q + G++ + +P SF
Sbjct: 242 FSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGA 301
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PARFNSFPS 261
G + D G+V+T VYA L EF Q +T TCFN P
Sbjct: 302 GTVIDSGTVITRWTAPVYAALREEF---RRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPP 358
Query: 262 MTYHFQGA-DLVVEPENVFIFN 282
+T H G DL + EN I +
Sbjct: 359 VTLHMDGGVDLTLPMENTLIHS 380
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 135/355 (38%), Gaps = 51/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y ++ +G+P K + +DT + + W C C C + P+ ++ S + +
Sbjct: 83 YYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVS 142
Query: 63 CYDASC-----KSPFHCF--EGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSV 113
C D C S CF C Y YGD T +D L + + S
Sbjct: 143 CSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSS 202
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
++ FGCS ++ + + GI G S + QL + P FS CL
Sbjct: 203 ASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGG 262
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
V+P+ + H L + G+ L + P F + +G I D
Sbjct: 263 GILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFAT--SSSQGTIIDS 320
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ- 267
G+ L + E Y + SQ + +C VT ++ + FP ++ +F
Sbjct: 321 GTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVS---DIFPQVSLNFAG 377
Query: 268 GADLVVEPENVFIFNHQDSF----FFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GA LV+ ++ I Q+S + G P +G TILG + F+YDL
Sbjct: 378 GASLVLGAQDYLI--QQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYDL 430
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 121/305 (39%), Gaps = 53/305 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K + +DT + + W C C C ++D +Y+ ++ + +
Sbjct: 78 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 137
Query: 63 CYDASCK---SPF-HCFEG-DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQ 114
C D C P C G C Y + YGD T D + + + + +P +
Sbjct: 138 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTN-G 196
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL-------- 164
+ FGC + + + + GI+G ++S + QL V FS CL
Sbjct: 197 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI 256
Query: 165 ------VQPDKSF----------------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
V+P F H + + + G L++P ++F + ++
Sbjct: 257 FAIGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAF--ESGDRK 314
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPS 261
G I D G+ L EVY L + + SQ +L T + TCF+ + FP+
Sbjct: 315 GTIIDSGTTLAYFPQEVYVPLIEKIL---SQQPDLRLHTVEQA-FTCFDYTGNVDDGFPT 370
Query: 262 MTYHF 266
+T HF
Sbjct: 371 VTLHF 375
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/373 (23%), Positives = 145/373 (38%), Gaps = 70/373 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + L +G P +++ +LDT + L+W +C ++ DP +S SY +
Sbjct: 79 FHHNVSLTVSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQTTFDPNRSS----SYSPV 134
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
PC +C P C C ++Y D ++ + DT + D P +
Sbjct: 135 PCSSLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSDMPGTI-- 192
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
FGC S + + G+MG+N S SF+ Q+ +FS C+ D F
Sbjct: 193 ----FGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDF---PKFSYCI--SDSDFSG 243
Query: 174 RLEFGD---------------QI--------------------IAGKSLNLPPNSFTIKL 198
L GD QI ++ K L LP + F
Sbjct: 244 VLLLGDANFSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDH 303
Query: 199 NGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQ--HDIEKLFTCRKCGV-TCFNLPAR 255
G + D G+ T + VY+ L EF++ SQ +E + G+ C+ +P
Sbjct: 304 TGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLS 363
Query: 256 FNS---FPSMTYHFQGADLVVEPENVFI-----FNHQDSFF-FFFGPAFTPRKGKTILGA 306
S P+++ F+GA++ V + + DS + F FG + ++G
Sbjct: 364 QTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGH 423
Query: 307 RHQHNTQFVYDLD 319
HQ N +DL+
Sbjct: 424 HHQQNVWMEFDLE 436
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 97/256 (37%), Gaps = 54/256 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYD 65
Y++ + +G P S +DT + ++W QC+PC +C Q D +++ +Y +PC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 66 ASCKS----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+C C C Y ++YGD T V DT L P + +V FGC
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGN-----TVGTFLFGCG 257
Query: 122 -LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK-SFHSRLEFGD 179
++ F I G++ L S S Q FS CL P K S L G
Sbjct: 258 HAQAGMFAGID-----GLLALGRQSMSLKSQAAGAYGGVFSYCL--PSKQSAAGYLTLGG 310
Query: 180 Q----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ G+ + +P ++F G + D G+V
Sbjct: 311 PSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTV 364
Query: 212 LTVIECEVYAVLTAEF 227
+T + YA L + F
Sbjct: 365 ITRLPPTAYAALRSAF 380
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 143/365 (39%), Gaps = 55/365 (15%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKS 57
T N Y K+G+G K + +DT + W C C +C +++ +Y+ K+
Sbjct: 71 TSNGLYYTKIGLGP--KDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKT 128
Query: 58 YKKLPCYDASCKSPFH-----CFEG-DCFYGITYGDVYETKEV---DSLDTSTLLPPDEP 108
K +PC D C S + C +G C Y ITYGD T D L ++
Sbjct: 129 SKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRT 188
Query: 109 SPVSVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL- 164
P + I FGC S +S S + GI+G ++S + QL V FS CL
Sbjct: 189 VPDNTSVI-FGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLD 247
Query: 165 -------------VQP--------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
VQP H + D +AG + LP + + + RG
Sbjct: 248 SISGGGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSD--ILDSSSGRG 305
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP---ARFNSFP 260
I D G+ L + +Y L + + +Q KL+ TCF+ + + FP
Sbjct: 306 TIIDSGTTLAYLPVSIYDQLLEKIL---AQRSGMKLYLVED-QFTCFHYSDEESVDDLFP 361
Query: 261 SMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRK-GK--TILGARHQHNTQFVY 316
++ + F +G L P + +D + + + K GK +LG N VY
Sbjct: 362 TVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVY 421
Query: 317 DLDTF 321
DLD
Sbjct: 422 DLDNM 426
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 76/335 (22%), Positives = 126/335 (37%), Gaps = 45/335 (13%)
Query: 25 LDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLPCYDASCKSPFHCFEGD- 78
+DT + + W C C +C + + +++ + +PC D C S +
Sbjct: 85 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144
Query: 79 ------CFYGITYGDVYETKEVDSLDTS--TLLPPDEPSPVSVQNIRFGCSLESKDFVSI 130
C Y YGD T D L+ P+ S I FGCS+ ++
Sbjct: 145 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGDLTK 204
Query: 131 QKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---------------VQPDKSF-- 171
K + GI G S + QL + P FS CL ++P +
Sbjct: 205 TDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGILVLGEILEPSIVYSP 264
Query: 172 ------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTA 225
H L + G+ L + P F+I N + G I DCG+ L + E Y L
Sbjct: 265 LVPSQPHYNLNLQSIAVNGQPLPINPAVFSIS-NNRGGTIVDCGTTLAYLIQEAYDPLVT 323
Query: 226 EFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ-GADLVVEPENVFIFNHQ 284
SQ + +C + ++ + FP ++ +F+ GA +V++PE + N
Sbjct: 324 AINTAVSQSARQTNSKGNQCYLVSTSIG---DIFPLVSLNFEGGASMVLKPEQYLMHNGY 380
Query: 285 DSFFFFFGPAFTP-RKGKTILGARHQHNTQFVYDL 318
+ F ++G +ILG + VYD+
Sbjct: 381 LDGAEMWCVGFQKLQEGASILGDLVLKDKIVVYDI 415
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 134/349 (38%), Gaps = 45/349 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI-----YNSRSFKSYKKLP 62
Y +GIG P + LDT + W CK C ++D + Y+ RS S K++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 142
Query: 63 CYDASCKSPFHC-FEGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRF 118
C D C S C C Y Y D T + D L L + P S ++ F
Sbjct: 143 CDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTST-SVTF 201
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL------------ 164
GC L+ ++ I GI+G + + + QL + FS CL
Sbjct: 202 GCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIG 261
Query: 165 --VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
V+P + + + +AG +L LP N F +G D GS L
Sbjct: 262 EVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIF--GTTKTKGTFIDSGSTLV 319
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN-LPARFNSFPSMTYHFQG-ADL 271
+ +Y+ L + F++H + CF+ L + + FP +T+HF+ L
Sbjct: 320 YLPEIIYSEL---ILAVFAKH--PDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTL 374
Query: 272 VVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFVYDLD 319
V P + + + + F F A K ILG N VYD++
Sbjct: 375 DVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDME 423
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 134/349 (38%), Gaps = 45/349 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI-----YNSRSFKSYKKLP 62
Y +GIG P + LDT + W CK C ++D + Y+ RS S K++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 142
Query: 63 CYDASCKSPFHC-FEGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRF 118
C D C S C C Y Y D T + D L L + P S ++ F
Sbjct: 143 CDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTST-SVTF 201
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL------------ 164
GC L+ ++ I GI+G + + + QL + FS CL
Sbjct: 202 GCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIG 261
Query: 165 --VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
V+P + + + +AG +L LP N F +G D GS L
Sbjct: 262 EVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIF--GTTKTKGTFIDSGSTLV 319
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN-LPARFNSFPSMTYHFQG-ADL 271
+ +Y+ L + F++H + CF+ L + + FP +T+HF+ L
Sbjct: 320 YLPEIIYSEL---ILAVFAKH--PDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTL 374
Query: 272 VVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFVYDLD 319
V P + + + + F F A K ILG N VYD++
Sbjct: 375 DVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDME 423
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 135/359 (37%), Gaps = 59/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + + +DT + + W C C C + + ++ S + +
Sbjct: 81 YYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVS 140
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C + C Y YGD V + + D + S+L+ P+
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV-PNS 199
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
+PV FGCS + + + GI G S + QL L P FS CL
Sbjct: 200 TAPV-----VFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLK 254
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
V+P+ F H + + G++L + P+ F+ NGQ
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTS-NGQ- 312
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + Y + SQ + +C V ++ + FP +
Sbjct: 313 GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVIATSVA---DIFPPV 369
Query: 263 TYHFQ-GADLVVEPENVFI--FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +F GA + + P++ I N + + G +G TILG + FVYDL
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDL 428
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 134/347 (38%), Gaps = 43/347 (12%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-PIYNSRSFKSYKKLPC 63
N +++K+ IG P L + T + L W C K C D ++ +YK +PC
Sbjct: 95 NGDFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPC 154
Query: 64 YDASCK--SPFHCFEGDCFYGITYGDVYETKEVD-SLDTSTLLPPDEPSPVSVQNIRFGC 120
C+ + C DCFY + D ++DT TL S + N F C
Sbjct: 155 DSYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKS-FMLPNTGFIC 213
Query: 121 SLE-SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
D+ + GI+GL S S + ++ L+ +FS C+V + S+L FGD
Sbjct: 214 GNRIGGDYPGV------GILGLGHGSLSLLNRISHLIDGKFSHCIVPYSSNQTSKLSFGD 267
Query: 180 QIIAGKS--------LNLPPNSFTIKLNG------------------QRGCINDCGSVLT 213
+ + S + P S+T+ G G D G++ T
Sbjct: 268 KAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFT 327
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVV 273
Y+ L + + Y Q + R+ + C+ F S P++T HF+G + +
Sbjct: 328 YFPEYFYSQLEYD-VRYAIQQEPLYPDPTRRLRL-CYRYSPDF-SPPTITMHFEGGSVEL 384
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
N FI +D F A + + + G Q N YDLD
Sbjct: 385 SSSNSFIRMTEDIVCLAF--ATSSSEQDAVFGYWQQTNLLIGYDLDA 429
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 136/350 (38%), Gaps = 47/350 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI-----YNSRSFKSYKKLP 62
Y +GIG P + LDT + W CK C ++D + Y+ RS S K++
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 118
Query: 63 CYDASCKSPFHC-FEGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRF 118
C D C S C C Y Y D T + D L L + P S ++ F
Sbjct: 119 CDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTST-SVTF 177
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL------------ 164
GC L+ ++ I GI+G + + + QL + FS CL
Sbjct: 178 GCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIG 237
Query: 165 --VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSF-TIKLNGQRGCINDCGSVL 212
V+P + + + +AG +L LP N F T K +G D GS L
Sbjct: 238 EVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK---TKGTFIDSGSTL 294
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN-LPARFNSFPSMTYHFQG-AD 270
+ +Y+ L + F++H + CF+ L + + FP +T+HF+
Sbjct: 295 VYLPEIIYSEL---ILAVFAKH--PDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLT 349
Query: 271 LVVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFVYDLD 319
L V P + + + + F F A K ILG N VYD++
Sbjct: 350 LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDME 399
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 83/374 (22%), Positives = 140/374 (37%), Gaps = 75/374 (20%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N T + L +G P + + +LDT + L+W C+ + ++N S SY +
Sbjct: 34 FHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPI 89
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDT----STLLPPDEPS 109
PC C++ P C C ++Y D + + D S+ LP
Sbjct: 90 PCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALP----- 144
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
FGC S + G+MG+N S SF+ QLG +FS C+ D
Sbjct: 145 -----GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGL---PKFSYCISGRDS 196
Query: 170 S-----------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTI 196
S + R+ + Q+ + K L LP + F
Sbjct: 197 SGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAP 256
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNL 252
G + D G+ T + VY L EF++ ++ + L F + C+ +
Sbjct: 257 DHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQ-TKGVLAPLGDPNFVFQGAMDLCYRV 315
Query: 253 PA--RFNSFPSMTYHFQGADLVVEPENVF------IFNHQDSFFFFFGPAFTPRKGKTIL 304
PA + P+++ F+GA++VV E + + + + FG + ++
Sbjct: 316 PAGGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVI 375
Query: 305 GARHQHNTQFVYDL 318
G HQ N +DL
Sbjct: 376 GHHHQQNVWMEFDL 389
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 78/359 (21%), Positives = 135/359 (37%), Gaps = 60/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P + +DT + + W C C +C + + +++ S + + +P
Sbjct: 81 YFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVP 140
Query: 63 CYDASCKSPFHCF-------EGDCFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C C S C Y YGD V +T D++ +L+
Sbjct: 141 CSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSS 200
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLV 165
+ I FGCS ++ K + GI G S + QL + P FS CL
Sbjct: 201 AA------IVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLK 254
Query: 166 QPDKSF-----------------------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
D H L+ ++G+ L + P +F N R
Sbjct: 255 GEDSGGGILVLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSN--R 312
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + E Y + SQ + +C + ++ FP +
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKGNQCYLVSNSVS---EVFPPV 369
Query: 263 TYHFQ-GADLVVEPEN--VFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+++F GA ++++PE +++ N+ + + G + G TILG + FVYDL
Sbjct: 370 SFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQ-KIQGGITILGDLVLKDKIFVYDL 427
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 136/350 (38%), Gaps = 47/350 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI-----YNSRSFKSYKKLP 62
Y +GIG P + LDT + W CK C ++D + Y+ RS S K++
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 118
Query: 63 CYDASCKSPFHC-FEGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRF 118
C D C S C C Y Y D T + D L L + P S ++ F
Sbjct: 119 CDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTST-SVTF 177
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL------------ 164
GC L+ ++ I GI+G + + + QL + FS CL
Sbjct: 178 GCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIG 237
Query: 165 --VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSF-TIKLNGQRGCINDCGSVL 212
V+P + + + +AG +L LP N F T K +G D GS L
Sbjct: 238 EVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK---TKGTFIDSGSTL 294
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN-LPARFNSFPSMTYHFQG-AD 270
+ +Y+ L + F++H + CF+ L + + FP +T+HF+
Sbjct: 295 VYLPEIIYSEL---ILAVFAKH--PDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLT 349
Query: 271 LVVEPENVFIFNHQDSFFFFFGPA-FTPRKGKTILGARHQHNTQFVYDLD 319
L V P + + + + F F A K ILG N VYD++
Sbjct: 350 LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDME 399
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 107/298 (35%), Gaps = 45/298 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + IG P + L L DT + L W +C C C Q P Y S+ KLPC +
Sbjct: 82 YDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSL 141
Query: 68 CKS--PFHCFEG--DCFYGITYGDVYE----TKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C C G +C Y +YG + T+ +T TL +V I FG
Sbjct: 142 CSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTL------GSDAVPGIGFG 195
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
C+ S+ ++ G S + QL FS CL D + S L FG
Sbjct: 196 CTTMSEGGYGSGSGLVGLGRG----PLSLVSQLNV---GAFSYCLTS-DAAKTSPLLFGS 247
Query: 180 QIIAGKSLNLPP------------------NSFTIKLNGQRGCINDCGSVLTVIECEVYA 221
+ G + P + T G G I D G+ + + Y
Sbjct: 248 GALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAY- 306
Query: 222 VLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPENVF 279
L E + SQ + + R CF FPSM HF G D+ + EN F
Sbjct: 307 TLAKEAV--LSQTTNLTMASGRDGYEVCFQTSGAV--FPSMVLHFDGGDMDLPTENYF 360
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 82/373 (21%), Positives = 135/373 (36%), Gaps = 69/373 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N T + L +G P +++ +LDT + L+W C+ + ++N S +Y +
Sbjct: 55 FRHNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPV 110
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
PC C++ P C C I+Y D + + DT + P +
Sbjct: 111 PCSSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTRPGTL- 169
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
FGC S + G+MG+N S SF+ QLG +FS C+ D S
Sbjct: 170 -----FGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGF---SKFSYCISGSDSSGI 221
Query: 171 ---------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLN 199
+ R+ + Q+ + K L+LP + F
Sbjct: 222 LLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHT 281
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQ-----HDIEKLF--TCRKCGVTCFNL 252
G + D G+ T + VY L EFI D +F T C +
Sbjct: 282 GAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSST 341
Query: 253 PARFNSFPSMTYHFQGADLVVEPENVFIF-------NHQDSFFFFFGPAFTPRKGKTILG 305
F P ++ F+GA++ V + + ++ + F FG + ++G
Sbjct: 342 RPNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIG 401
Query: 306 ARHQHNTQFVYDL 318
HQ N +DL
Sbjct: 402 HHHQQNVWMEFDL 414
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 131/364 (35%), Gaps = 68/364 (18%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y+++ G+G P + L LDT A TW C PC +C + ++ + SY LPC +
Sbjct: 78 SYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSS 135
Query: 67 SC------KSPFHCFEGD----------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSP 110
C P GD C + + D + S DT L
Sbjct: 136 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALAS-DTLRL------GK 188
Query: 111 VSVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
++ N FGC S + ++ ++ G++GL + + Q G L FS CL
Sbjct: 189 DAIPNYTFGCVSSVTGPTTNMPRQ---GLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRS 245
Query: 170 SFHS---RLEFGDQI----------------------IAGKSLN-----LPPNSFTIKLN 199
+ S RL G + G S+ +P SF
Sbjct: 246 YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAA 305
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PARFNS 258
G + D G+V+T VYA L EF Q +T TCFN
Sbjct: 306 TGAGTVVDSGTVITRWTAPVYAALREEF---RRQVAAPSGYTSLGAFDTCFNTDEVAAGG 362
Query: 259 FPSMTYHFQGA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKT---ILGARHQHNTQF 314
P++T H G DL + EN I + A P+ + ++ Q N +
Sbjct: 363 APAVTVHMDGGVDLALPMENTLIHSSATP-LACLAMAEAPQNVNSVVNVIANLQQQNIRV 421
Query: 315 VYDL 318
V+D+
Sbjct: 422 VFDV 425
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 135/353 (38%), Gaps = 56/353 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y +L IG P + ++DT + +T+ C C C DP + +Y+ + C
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC- 144
Query: 65 DASCKSPFHCFEG--DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
+A C +C E C Y Y ++ + V + D + E P Q FGC +
Sbjct: 145 NADC----NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVP---QRAVFGCET 197
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-GR-LVPDRFSCCL--------------- 164
+ES D + + GIMGL + S M QL G+ +V + FS C
Sbjct: 198 MESGDLYTQRAD---GIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGI 254
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + +E + +AGK L L P +F +G+ G I D G+
Sbjct: 255 SSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAY 310
Query: 215 IECEVY-----AVLTA-EFIDYFSQHDIEKLFTC-RKCGVTCFNLPARFNSFPSMTYHF- 266
+ Y A++ F+ S D C G LP FP + F
Sbjct: 311 FPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELP---KVFPEVDMVFA 367
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G + + PEN + + S + G T+LG NT Y+ +
Sbjct: 368 NGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRE 420
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 135/353 (38%), Gaps = 56/353 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y +L IG P + ++DT + +T+ C C C DP + +Y+ + C
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC- 144
Query: 65 DASCKSPFHCFEG--DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
+A C +C E C Y Y ++ + V + D + E P Q FGC +
Sbjct: 145 NADC----NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVP---QRAVFGCET 197
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-GR-LVPDRFSCCL--------------- 164
+ES D + + GIMGL + S M QL G+ +V + FS C
Sbjct: 198 MESGDLYTQRAD---GIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGI 254
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + +E + +AGK L L P +F +G+ G I D G+
Sbjct: 255 SSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAY 310
Query: 215 IECEVY-----AVLTA-EFIDYFSQHDIEKLFTC-RKCGVTCFNLPARFNSFPSMTYHF- 266
+ Y A++ F+ S D C G LP FP + F
Sbjct: 311 FPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELP---KVFPEVDMVFA 367
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G + + PEN + + S + G T+LG NT Y+ +
Sbjct: 368 NGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRE 420
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/346 (21%), Positives = 125/346 (36%), Gaps = 56/346 (16%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P +++ +DT W C C C + ++N+ ++K + C
Sbjct: 95 TYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVKSTTFKTVGCEAP 151
Query: 67 SCKS--PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
CK C C + +TYG + S D TL PS FGC E+
Sbjct: 152 QCKQVPNSKCGGSACAFNMTYGSSSIAANL-SQDVVTLATDSIPS------YTFGCLTEA 204
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC--------------------- 163
SI + G++GL S + Q L FS C
Sbjct: 205 TG-SSIPPQ---GLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQP 260
Query: 164 --------LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
L P +S + + + +++PP++ G I D G+V T +
Sbjct: 261 KRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRL 320
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
Y + D F + T TC+ P P++T+ F G ++ + P
Sbjct: 321 VAPAYTAVR----DAFRKRVGNATVTSLGGFDTCYTSPI---VAPTITFMFSGMNVTLPP 373
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
+N+ I + S A P ++L Q N + ++D+
Sbjct: 374 DNLLIHSTASS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDV 418
>gi|326531368|dbj|BAK05035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/340 (22%), Positives = 129/340 (37%), Gaps = 45/340 (13%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+ LG G+ + LDT A +W C+PC Q +++ + ++ + D C
Sbjct: 74 VSLGTGEGTRLKVLALDTEASTSWVMCKPCHPSPPQVGNLFSPGASPTFHGVHSNDPVCT 133
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVS 129
P+ C + + Y +++ L T E S+ + FGC+ S F
Sbjct: 134 VPYRKTANGCSFHFSSITGYLSRDTFHLRTGRAGAVRE----SIPRVVFGCAHSSTGF-- 187
Query: 130 IQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNL 189
+ G++ L+ S + QLG RFS CL P + H+ G + +
Sbjct: 188 HNDNTLGGVLSLSHLPLSLLTQLGAHASGRFSYCL--PKSTGHN--PHGSLFLGADVPSP 243
Query: 190 PPNSFTIKLNGQRGC----INDCG------------SVLTVIECEVYAVLTAEFIDYFSQ 233
PP+S T L G +N G VL C + T I
Sbjct: 244 PPHSHTTNLVIHPGVSGYHLNLIGITRGYKRLKIDKRVLVSHSCSINPAETITHIAEPIY 303
Query: 234 HDIEKLFTCRKCGVTCFNL------PARFN--------SFPSMTYHFQ-GADLVVEPENV 278
+EK R + + P F+ P+M +HF+ GA+L + +
Sbjct: 304 LVVEKALVARMKELGSDRVKGPPGGPLWFDRMYQSVKEQLPNMAFHFEGGAELWFTSDRL 363
Query: 279 FIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F + ++ F G + +T++GA Q NT+F +D+
Sbjct: 364 FEVHGMNARFMVAGRGYR----RTVIGAAQQVNTRFTFDV 399
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 135/359 (37%), Gaps = 59/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y KL +G P + + +DT + + W C C C + + ++ S + +
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C + C Y YGD V + + D + S+L+ P+
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV-PNS 199
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
+PV FGCS + + + GI G S + QL + P FS CL
Sbjct: 200 TAPVV-----FGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
V+P+ F H + + G++L + P+ F+ NGQ
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTS-NGQ- 312
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + Y + SQ + +C V ++ + FP +
Sbjct: 313 GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTSVG---DIFPPV 369
Query: 263 TYHFQ-GADLVVEPENVFI--FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +F GA + + P++ I N + + G +G TILG + FVYDL
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDL 428
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 135/359 (37%), Gaps = 59/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y KL +G P + + +DT + + W C C C + + ++ S + +
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C + C Y YGD V + + D + S+L+ P+
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV-PNS 199
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
+PV FGCS + + + GI G S + QL + P FS CL
Sbjct: 200 TAPV-----VFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
V+P+ F H + + G++L + P+ F+ NGQ
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTS-NGQ- 312
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + Y + SQ + +C V ++ + FP +
Sbjct: 313 GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTSVG---DIFPPV 369
Query: 263 TYHFQ-GADLVVEPENVFI--FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +F GA + + P++ I N + + G +G TILG + FVYDL
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDL 428
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/373 (22%), Positives = 144/373 (38%), Gaps = 67/373 (17%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N T + L +G P + + +LDT + L+W C+ + ++N S SY +
Sbjct: 994 FHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPI 1049
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
PC C++ P C C ++Y D + + D + P +
Sbjct: 1050 PCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTL-- 1107
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--- 170
FGC S + G+MG+N S SF+ QLG +P +FS C+ D S
Sbjct: 1108 ----FGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLG--LP-KFSYCISGRDSSGVL 1160
Query: 171 --------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNG 200
+ R+ + Q+ + K L LP + F G
Sbjct: 1161 LFGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTG 1220
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPA-- 254
+ D G+ T + VY L EF++ ++ + L F + C+++ A
Sbjct: 1221 AGQTMVDSGTQFTFLLGPVYTALRNEFLEQ-TKGVLAPLGDPNFVFQGAMDLCYSVAAGG 1279
Query: 255 RFNSFPSMTYHFQGADLVVEPENVF------IFNHQDSFFFFFGPAFTPRKGKTILGARH 308
+ + PS++ F+GA++VV E + + ++ + FG + ++G H
Sbjct: 1280 KLPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHH 1339
Query: 309 QHNTQFVYDLDTF 321
Q N +DL F
Sbjct: 1340 QQNVWMEFDLVAF 1352
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 128/337 (37%), Gaps = 45/337 (13%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--P 71
IG P ++ +D L WTQC C C++Q+ P++ + ++K PC CKS
Sbjct: 60 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 119
Query: 72 FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQ 131
C C Y G T + + DT + +P S + FGC + S D ++
Sbjct: 120 PKCASDVCAYDGVTGLGGHTVGIVATDTFAI---GTAAPAS---LGFGCVVAS-DIDTMG 172
Query: 132 KKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI-IAGKSLNLP 190
+G +GL S + Q+ +L RFS CL D +SRL G +AG P
Sbjct: 173 GP--SGFIGLGRTPWSLVAQM-KLT--RFSYCLAPHDTGKNSRLFLGASAKLAGGGAWTP 227
Query: 191 -----PNS-----FTIKLN----GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDI 236
PN + I+L G G +++ V V + +D Q
Sbjct: 228 FVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAV--VRVSLLVDSVYQEFK 285
Query: 237 EKLFTCRKCGVTCFNLPARF---------NSFPSMTYHFQ-GADLVVEPENVFIFNHQD- 285
+ + T + A F + P + + FQ GA L V P N D
Sbjct: 286 KAVMASVGAAPTATPVGAPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDVGNDT 345
Query: 286 ---SFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
S T G ILG+ Q N ++DLD
Sbjct: 346 VCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLD 382
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 132/374 (35%), Gaps = 82/374 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ + IGDP + ++DT + L WTQC C+ +C+ QN P Y+ ++ + + C DA
Sbjct: 71 YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDA 130
Query: 67 SCK--SPFHCFEGD----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
+C S C + G G++ T ++L + ++ FGC
Sbjct: 131 ACALGSETQCLSDNKTCAVVTGYGAGNIAGTLATENLTFQS----------ETVSLVFGC 180
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLEFG 178
+ +K +GI+GL S QLG RFS CL D S + G
Sbjct: 181 IVVTK-LSPGSLNGASGIIGLGRGKLSLPSQLGD---TRFSYCLTPYFEDTIEPSHMVVG 236
Query: 179 DQ-------------------------------------IIAGK-SLNLPPNSFTIKLNG 200
I AGK L +P +F ++
Sbjct: 237 ASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVA 296
Query: 201 Q---RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN 257
G D G+ LT + Y L AE ++ L G T F+L
Sbjct: 297 PGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPL-----AGTTGFDLCVALK 351
Query: 258 S----FPSMTYHFQGA-----DLVVEPENVFI-FNHQDSFFFFFGPA---FTPRKGKTIL 304
P + HF G DLVV P N + + + F P T++
Sbjct: 352 DAERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVI 411
Query: 305 GARHQHNTQFVYDL 318
G Q N +YDL
Sbjct: 412 GNYMQQNMHVLYDL 425
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 47/88 (53%), Gaps = 3/88 (3%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + DT + L W QC PC CY+Q+ PI++ S+ +PC +
Sbjct: 92 YLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQN 151
Query: 68 CKS--PFHC-FEGDCFYGITYGDVYETK 92
CK+ HC +G C Y TYGD TK
Sbjct: 152 CKAIDDSHCGAQGVCDYSYTYGDQTYTK 179
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 85/361 (23%), Positives = 138/361 (38%), Gaps = 57/361 (15%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
+ Y K+ +G P +DT + + W C C +C + + F + L
Sbjct: 101 MTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTA 160
Query: 64 YDASCKSPF----------HCFEGD-CFYGITYGDVYETKEVDSLDT-------STLLPP 105
+C P C E + C Y YGD T DT L
Sbjct: 161 GSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA 220
Query: 106 DEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCC 163
+ +P I FGCS ++ K + GI G S + QL + P FS C
Sbjct: 221 NSSAP-----IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC 275
Query: 164 LVQPDKSFHSRLEFGDQIIAGKSLN-LPP-------NSFTIKLNGQ-------------- 201
L + D S G+ ++ G + L P N +I +NGQ
Sbjct: 276 L-KGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT 334
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPS 261
RG I D G+ LT + E Y + + SQ + +C + ++ + FPS
Sbjct: 335 RGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSIS---DMFPS 391
Query: 262 MTYHFQ-GADLVVEPENVFIFN---HQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
++ +F GA +++ P++ ++F+ + + + G P + +TILG + FVYD
Sbjct: 392 VSLNFAGGASMMLRPQD-YLFHYGIYDGASMWCIGFQKAPEE-QTILGDLVLKDKVFVYD 449
Query: 318 L 318
L
Sbjct: 450 L 450
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 74/171 (43%), Gaps = 15/171 (8%)
Query: 15 GDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCK--S 70
G S ++D+ + + W QCQPC C+ Q DP+++ + +Y +PC A+C
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 71 PFH--CF-EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
P+ C C +GITY + S D TL P D V+ FGC+ D
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDV-----VRGFLFGCA--HADQ 187
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
S +AG + L S SF+ Q FS C V P S + FG
Sbjct: 188 GSTFSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFG 237
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 137/355 (38%), Gaps = 50/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+GIG P K + +DT + + W C C+ C + + + S
Sbjct: 87 YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146
Query: 68 CKSPFHCFEGD------------CFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSV 113
C F C E + C Y YGD T V + D + +
Sbjct: 147 CDEQF-CLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 114 QNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL------ 164
+I+FGC + +S D S ++ + GI+G ++S + QL R V F+ CL
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
VQP + H + + LN+ + F + ++G I D
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVF--EAGDRKGTIIDS 323
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ 267
G+ L + +Y L A+ + QH++E CF R + FP + +HF+
Sbjct: 324 GTTLAYLPELIYEPLVAKILS--QQHNLE--VQTIHGEYKCFQYSERVDDGFPPVIFHFE 379
Query: 268 GADLVVEPENVFIFNHQDSFFFFF---GPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ L+ + ++F +++ + + G RK T+ G N +YDL+
Sbjct: 380 NSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLE 434
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 76/331 (22%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASC- 68
++ IG P + + L+DT + LTW Q C +C P +N S+ PC + C
Sbjct: 1 MQTKIGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCL 60
Query: 69 -------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+S + G C + + Y D E V + + +L D + ++ ++ FGC+
Sbjct: 61 GRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAAS-TLGDVIFGCA 119
Query: 122 LESKDFVSIQKKI--IAGIMGLNWDSTSFMVQLGRL----VPDRFSCCLVQPDKSFH--- 172
SKD +Q+ + +G +GLN S SF Q+G + DRFS C P+++ H
Sbjct: 120 --SKD---LQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCF--PNRAEHLNS 172
Query: 173 -SRLEFGDQII--------------------------------AGKSLNLPPNSFTIKLN 199
+ FGD I G+ L++P ++F I
Sbjct: 173 SGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRL 232
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEF------IDYFSQHDIEKLFTCRKCGVTCFNLP 253
G G D G+ ++ + + L F ++ S D K C+++
Sbjct: 233 GNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKEL--------CYDVA 284
Query: 254 ---ARFNSFPSMTYHFQ-GADLVVEPENVFI 280
AR + P +T HF+ D+ + +V++
Sbjct: 285 AGDARLPTAPLVTLHFKNNVDMELREASVWV 315
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 134/346 (38%), Gaps = 56/346 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + +G P ++L L DT + L W +C CK C + Y S+ KLPC A
Sbjct: 81 YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140
Query: 68 CKSPFHCFEGDC--------------FYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
C++ C YG++ + T+ +T TL +V
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTL------GSDAV 194
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
Q I FGC+ S+ ++ G + + +L FS CL D S S
Sbjct: 195 QGIGFGCTTMSEGGYGSGSGLVGLGRGK-------LSLVRQLKVGAFSYCLTS-DPSTSS 246
Query: 174 RLEFGDQIIAGKS------LNLPPNSF-TIKLN------------GQRGCINDCGSVLTV 214
L FG + G +NL ++F T+ L+ G+ G I D G+ LT
Sbjct: 247 PLLFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKTPGTGRHGIIFDSGTTLTF 306
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
+ Y + A + + ++ ++ V CF FPSM HF G D+ ++
Sbjct: 307 LAEPAYTLAEAGLLSQTT--NLTRVPGTDGYEV-CFQTSGG-AVFPSMVLHFDGGDMALK 362
Query: 275 PENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
EN F N S + +P + +I+G Q + YDLD
Sbjct: 363 TENYFGAVNDSVSCWLV---QKSPSE-MSIVGNIMQMDYHIRYDLD 404
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/306 (24%), Positives = 119/306 (38%), Gaps = 54/306 (17%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y++K +G P ++L LD W C+ C C + ++N+ ++K L C
Sbjct: 34 SYIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVKSTTFKTLGCGAP 90
Query: 67 SCK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
CK +P C C + TYG + + +L T+ +P P FGC ++
Sbjct: 91 QCKQVPNPI-CGGSTCTWNTTYG---SSTILSNLTRDTIALSMDPVPYYA----FGC-IQ 141
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFG---- 178
S+ + G++G SF+ Q L FS CL +F L G
Sbjct: 142 KATGSSVPPQ---GLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQ 198
Query: 179 -DQIIAGKSLNLPPNS--FTIKLNGQR---------------------GCINDCGSVLTV 214
+I L P S + +KLNG R G I D G+V T
Sbjct: 199 PPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTR 258
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVE 274
+ Y + EF + L TC+++P P++T+ F G ++ +
Sbjct: 259 LVAPAYIAVRNEFRKRVGNATVSSLGGFD----TCYSVPI---VPPTITFMFSGMNVTMP 311
Query: 275 PENVFI 280
PEN+ I
Sbjct: 312 PENLLI 317
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 117/333 (35%), Gaps = 63/333 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSF-----K 56
+T Y ++L +G P + + DT + LTW +C S R F K
Sbjct: 98 YTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSK 157
Query: 57 SYKKLPCYDASCKS--PFHCFE-----GDCFYGITYGDVYETKEVDSLDTST--LLPPDE 107
S+ LPC +CKS PF C Y Y D + V LD++T L D
Sbjct: 158 SWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDG 217
Query: 108 PSPVSVQNIRFGC--SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
+Q + GC S + + F S G++ L + SF + RFS CLV
Sbjct: 218 TRKAKLQEVVLGCTTSYDGQSFKSSD-----GVLSLGNSNISFASRAASRFGGRFSYCLV 272
Query: 166 Q--PDKSFHSRLEFGDQ----------------------------------IIAGKSLNL 189
++ S L FG+ +AG+ L +
Sbjct: 273 DHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEI 332
Query: 190 PPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTC 249
P+ + + NG G I D G+ LT++ Y + F+ + C
Sbjct: 333 LPDVWDFRKNG--GAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDPFEY----C 386
Query: 250 FNLPARFNSFPSMTYHFQGADLVVEPENVFIFN 282
+N P M F GA + P ++ +
Sbjct: 387 YNWTGVSAEIPRMELRFAGAATLAPPGKSYVID 419
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/357 (23%), Positives = 137/357 (38%), Gaps = 57/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+ +G P +DT + + W C C +C + + F + L +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 68 CKSPF----------HCFEGD-CFYGITYGDVYETKEVDSLDT-------STLLPPDEPS 109
C P C E + C Y YGD T DT L + +
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQP 167
P I FGCS ++ K + GI G S + QL + P FS CL +
Sbjct: 220 P-----IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KG 273
Query: 168 DKSFHSRLEFGDQIIAGKSLN-LPP-------NSFTIKLNGQ--------------RGCI 205
D S G+ ++ G + L P N +I +NGQ RG I
Sbjct: 274 DGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTI 333
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
D G+ LT + E Y + + SQ + +C + ++ + FPS++ +
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSIS---DMFPSVSLN 390
Query: 266 FQ-GADLVVEPENVFIFN---HQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GA +++ P++ ++F+ + + + G P + +TILG + FVYDL
Sbjct: 391 FAGGASMMLRPQD-YLFHYGIYDGASMWCIGFQKAPEE-QTILGDLVLKDKVFVYDL 445
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/348 (21%), Positives = 126/348 (36%), Gaps = 63/348 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + IG P + ++DT + ++W C + ++ +Y C A+
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSLFFDPGKSSTYTPFSCSSAA 182
Query: 68 C-----KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C + C Y + YGD T DT L ++ V+N +FGCS
Sbjct: 183 CTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEK-----VENFQFGCSE 237
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS------------ 170
S + + G+MGL + S + Q FS CL +S
Sbjct: 238 TSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRSSGFLTLGASTGT 297
Query: 171 --------FHSRLE-------FGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
F SR + G + + P F G I D G+++T +
Sbjct: 298 SGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA------GSIMDSGTIITRL 351
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVE 274
Y+ L+A F ++ + F+ TCF+ + N S P++ F G +V
Sbjct: 352 PPRAYSALSAAFRAGMRRYPRARAFSILD---TCFDFTGQDNVSIPAVELVFSGGAVV-- 406
Query: 275 PENVFIFNHQDSFFFFFGP--AFTPRKG--KTILGARHQHNTQFVYDL 318
D+ +G AF P G +I+G Q + ++D+
Sbjct: 407 --------DLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDV 446
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/350 (22%), Positives = 128/350 (36%), Gaps = 62/350 (17%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P + + LDT W C C C + +++ S + L C
Sbjct: 90 TYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGC--ASSVLFDPSKSSSSRNLQCDAP 147
Query: 67 SCKSPFH--CFEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
CK + C G C + +TYG T E SL TL ++ +++ FGC +
Sbjct: 148 QCKQAPNPTCTAGKSCGFNMTYGG--STIEA-SLTQDTLTLAND----VIKSYTFGC-IS 199
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-FHSRLEFGDQI- 181
S+ + G+MGL S + Q L FS CL S F L G +
Sbjct: 200 KATGTSLPAQ---GLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLRLGPKYQ 256
Query: 182 ---------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
+ K +++P ++ + G I D G+V T
Sbjct: 257 PVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTR 316
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLF---TCRKCGVTCFNLPARFNSFPSMTYHFQGADL 271
+ Y + EF + L TC V +PS+T+ F G ++
Sbjct: 317 LVEPAYVAVRNEFRRRIKNANATSLGGFDTCYSGSVV----------YPSVTFMFAGMNV 366
Query: 272 VVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
+ P+N+ I + S A P ++L + Q N + + DL
Sbjct: 367 TLPPDNLLIHSSSGS-TSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDL 415
>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
Length = 299
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 25/121 (20%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N +++ L IG P ++ ++DT + L WTQC+PCK C++Q PI++ S+ KLPC
Sbjct: 94 NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCS 153
Query: 65 DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
D+Y + L T T D SV I FGC ++
Sbjct: 154 S---------------------DLYHSSTQGVLATETFTFGD----ASVSKIGFGCGEDN 188
Query: 125 K 125
+
Sbjct: 189 R 189
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 73/163 (44%), Gaps = 28/163 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + L IG+P K+ F +DT + LTW QC PCK C + D +Y ++ +PC ++
Sbjct: 54 YSVILNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLYKPKN----NLVPCSNS 109
Query: 67 SCKS-----PFHCFEGD--CFYGITYGDVYETKEV---DS----LDTSTLLPPDEPSPVS 112
C++ +HC D C Y I Y D+ + V DS L TLL P
Sbjct: 110 LCQAVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQP------- 162
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL 155
+ FGC + K AGI+GL S + QL L
Sbjct: 163 --KMAFGCGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTL 203
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/345 (22%), Positives = 140/345 (40%), Gaps = 53/345 (15%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS-PF 72
IG P + ++D L WTQC C C++Q+ P++ + +++ PC +CKS P
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108
Query: 73 HCFEGD-CFYGITYG---DVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFV 128
GD C Y T D + T + +T + + ++ FGC + S D
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAI-------GTATASLAFGCVVAS-DID 160
Query: 129 SIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD--QIIAGKS 186
++ +G +GL S + Q+ +L +FS CL SRL G ++ G+S
Sbjct: 161 TMDG--TSGFIGLGRTPRSLVAQM-KLT--KFSYCLSPRGTGKSSRLFLGSSAKLAGGES 215
Query: 187 LNLPP-----------NSFTIKLNGQRG-----CINDCGSVLTVIECEVYAVLTAEFIDY 230
+ P + + + L+ R G +L + +++L
Sbjct: 216 TSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSLLVDSAYRA 275
Query: 231 FSQHDIEKLFTCRKCGVT--------CFNLPARFN--SFPSMTYHFQGADLVVEPENVFI 280
F + E + + + CF A F+ + P + + FQGA + P ++
Sbjct: 276 FKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYL 335
Query: 281 FN---HQDSF-FFFFGPAFTPR---KGKTILGARHQHNTQFVYDL 318
+ +D+ A+ R +G ++LG+ Q + F+YDL
Sbjct: 336 IDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDL 380
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/357 (23%), Positives = 137/357 (38%), Gaps = 57/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+ +G P +DT + + W C C +C + + F + L +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 68 CKSPF----------HCFEGD-CFYGITYGDVYETKEVDSLDT-------STLLPPDEPS 109
C P C E + C Y YGD T DT L + +
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQP 167
P I FGCS ++ K + GI G S + QL + P FS CL +
Sbjct: 220 P-----IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KG 273
Query: 168 DKSFHSRLEFGDQIIAGKSLN-LPP-------NSFTIKLNGQ--------------RGCI 205
D S G+ ++ G + L P N +I +NGQ RG I
Sbjct: 274 DGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTI 333
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
D G+ LT + E Y + + SQ + +C + ++ + FPS++ +
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSIS---DMFPSVSLN 390
Query: 266 FQ-GADLVVEPENVFIFN---HQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GA +++ P++ ++F+ + + + G P + +TILG + FVYDL
Sbjct: 391 FAGGASMMLRPQD-YLFHYGIYDGASMWCIGFQKAPEE-QTILGDLVLKDKVFVYDL 445
>gi|222634868|gb|EEE65000.1| hypothetical protein OsJ_19937 [Oryza sativa Japonica Group]
Length = 402
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/267 (24%), Positives = 98/267 (36%), Gaps = 64/267 (23%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASC--- 68
I DP+ + +DT L W QC PC CY Q + +++ R ++ +PC A+C
Sbjct: 139 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 198
Query: 69 -KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
+ C C Y + YGD T STL PS V V N RFGCS +
Sbjct: 199 GRYGAGCSNNQCQYFVDYGDGRATSGRTWWTPSTL----NPSTV-VMNFRFGCSHAVRGN 253
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSL 187
S +G MG+ + G+ L
Sbjct: 254 FSAST---SGTMGIE---------------------------------------VGGRRL 271
Query: 188 NLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV 247
N+PP F G + D ++T + Y L F + + ++ R
Sbjct: 272 NVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY--PRVAGGRAGLD 323
Query: 248 TCFNLPARFNS--FPSMTYHFQGADLV 272
TC++ RF S P+++ F G +V
Sbjct: 324 TCYDF-VRFTSVTVPAVSLVFDGGAVV 349
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 129/331 (38%), Gaps = 66/331 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ----PCKSCYEQNDPIYNSRSFKSYKKLPC 63
+ + + IG+P K + +DT + LTW +C PCK+C + P+Y + K +PC
Sbjct: 40 FYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPK-----KLVPC 94
Query: 64 YDASCKSPFH--------CFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
D C + H C E C Y I Y D + V LD +L S
Sbjct: 95 ADPLCDA-LHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSL------PTGSA 147
Query: 114 QNIRFGCS---LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---GRLVPDRFSCCLVQP 167
+NI FGC ++ + +K + GI+GL S + QL G + + CL
Sbjct: 148 RNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIGHCLSSK 207
Query: 168 DKSFHSRLEFGDQIIAGKSLNL-------------PPNSFTIKLN----GQR--GCINDC 208
+ L G++ + L++ P T+ L G + I D
Sbjct: 208 GGGY---LFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKAIFDS 264
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIE-------KLFTCRKCG---VTCFNLPARFNS 258
GS T + ++A L + + ++ +L C K T +LP F S
Sbjct: 265 GSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDLPKEFKS 324
Query: 259 FPSMTYHFQGADLVVEPENVFIF-NHQDSFF 288
++ + G + + PEN I H ++ F
Sbjct: 325 LVTLKFD-HGVTMTIPPENYLIITGHGNACF 354
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 135/357 (37%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC--YEQNDPIYNSRSFKSYKKLPCYD 65
YM++L IG P + + ++DT + L W +C C C + I+ S + SYKKLPC
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 66 ASCKSPFHCF-----EGDCFYGITYGDVYETKEVDSLDTSTLLP--PDEPSPVSVQNIRF 118
C E C Y YGD T D + E F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK--SFHSRLE 176
GC+ + K G++GL S S + QLG + +FS CLV D S S L
Sbjct: 125 GCARKLKG----DWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLF 180
Query: 177 FGDQ------------IIAGKSLN-----LPPNSFTI---------KLNGQRGCINDCGS 210
G I+ G L+ + S TI K +G + +
Sbjct: 181 LGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFLA 240
Query: 211 VLTVIEC-EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-----CFNLPARFN-SFPSMT 263
TVI+ Y +LT + + IE+ G + CFN + FPS+T
Sbjct: 241 NKTVIDSGTTYTLLTPPVYEAMRK-SIEEQVILPTLGNSAGLDLCFNSSGDTSYGFPSVT 299
Query: 264 YHFQGADLVVEP-ENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
++F +V P EN+F +D + G +I+G Q N +YDL
Sbjct: 300 FYFANQVQLVLPFENIFQVTSRDVVCL----SMDSSGGDLSIIGNMQQQNFHILYDL 352
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 134/339 (39%), Gaps = 45/339 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P K + DT + L W Q +PC C I++ R +++++ C
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSSQL 112
Query: 68 CKS-PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C P C G C Y YG ET+ + DT +L + S + GC + +
Sbjct: 113 CAELPGSCEPGSSTCSYSYEYGS-GETEGEFARDTISLGTTSDGSQ-KFPSFAVGCGMVN 170
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD-KSFHSRLEFG-DQII 182
F + G++GL S QL + +FS CLV + +S S L FG +
Sbjct: 171 SGFDGVD-----GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAAL 225
Query: 183 AG---KSLNLPPNSFT-----------IKLNGQR-----GCINDCGSVLTVIECEVYAVL 223
G +S + P S T I + GQ I D G+ LT + VY +
Sbjct: 226 HGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRV 285
Query: 224 TAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPARFN-SFPSMTYHFQGADLVVEPENVFIF 281
+ + ++ G+ C++ + N FP++T GA + N F+
Sbjct: 286 LSRMESMVTLPRVDG----SSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLV 341
Query: 282 --NHQDSFFFFFGPAFT-PRKGKTILGARHQHNTQFVYD 317
+ D+ G A P +I+G Q +YD
Sbjct: 342 VDDSGDTVCLAMGSASGLP---VSIIGNVMQQGYHILYD 377
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 142/357 (39%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ IG P K +DT + + W C C C ++D +Y+ + S +
Sbjct: 83 YYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGSTVS 142
Query: 63 CYDASCKSPF-----HCFEG-DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSV 113
C C + + C + C Y + YGD T DSL + + D + +
Sbjct: 143 CDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQ-VSGDGQTRHAN 201
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL------- 164
++ FGC + + + + GI+G +TS + QL V FS CL
Sbjct: 202 ASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIKGGG 261
Query: 165 -------VQPD-KSF-------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
VQP KS H + + G +L LP + F + ++G I D G
Sbjct: 262 IFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSHMF--ETGEKKGTIIDSG 319
Query: 210 SVLTVIECEVYA-VLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
+ LT + VY VL A F + + H ++ F C + + + FP +T+HF
Sbjct: 320 TTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD-FLCIQY------FQSVDDGFPKITFHF 372
Query: 267 Q-GADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
+ L V P + F N + + F F + GK +LG N VYDL+
Sbjct: 373 EDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNKVVVYDLE 429
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 84/354 (23%), Positives = 141/354 (39%), Gaps = 48/354 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+G+G PV+ + +DT + + W C C +C +++D +Y+ S + ++
Sbjct: 74 YFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRVT 133
Query: 63 CYDASCKSPFH------CFEGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVSVQ 114
C C S + E C Y + YGD T D L + + + +
Sbjct: 134 CNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNG 193
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL-------- 164
+I FGC + + + GI+G ++S + QL V F+ CL
Sbjct: 194 SIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGI 253
Query: 165 ------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
VQP + H + + + LNLP + F L ++G I D G+
Sbjct: 254 FAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDL--RKGTIIDSGT 311
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQGA 269
L +Y L ++ F++ KL T + TCF + FP++T+HF+ +
Sbjct: 312 TLAYFPDVIYEPLISKI---FARQSTLKLHTVEE-QFTCFEYDGNVDDGFPTVTFHFEDS 367
Query: 270 -DLVVEP-ENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDLD 319
L V P E +F + + R GK +LG N +YDL+
Sbjct: 368 LSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYDLE 421
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 139/371 (37%), Gaps = 65/371 (17%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + L +G P +++ +LDT + L+W C ++ D + R+ ++ +
Sbjct: 55 FHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAV 113
Query: 62 PCYDASCKS-----PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
PC A C S P C C ++Y D + +L T D P S
Sbjct: 114 PCGSARCSSRDLPAPPSCDAASRRCRVSLSYAD--GSASDGALATDVFAVGDAPPLRSA- 170
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS---- 170
FGC + D S AG++G+N + SF+ Q RFS C+ D +
Sbjct: 171 ---FGCMSAAYD-SSPDAVATAGLLGMNRGALSFVTQAST---RRFSYCISDRDDAGVLL 223
Query: 171 ------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNGQR 202
+ R+ + Q+ + GK L +PP+ G
Sbjct: 224 LGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAG 283
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNL----PA 254
+ D G+ T + + Y+ + AEF+ ++ + L F ++ TCF + P
Sbjct: 284 QTMVDSGTQFTFLLGDAYSAVKAEFLKQ-TKPLLPALEDPSFAFQEAFDTCFRVPKGRPP 342
Query: 255 RFNSFPSMTYHFQGADLVVEPENVFI-----FNHQDSFF-FFFGPAFTPRKGKTILGARH 308
P +T F GA + V + + D + FG A ++G H
Sbjct: 343 PSARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIGHHH 402
Query: 309 QHNTQFVYDLD 319
Q N YDL+
Sbjct: 403 QMNLWVEYDLE 413
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/354 (21%), Positives = 127/354 (35%), Gaps = 70/354 (19%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P + + LDT W C C C + +++ S + L C
Sbjct: 87 TYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAP 144
Query: 67 SCK---SPFHCFEGDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK +P C + +TYG + Y T++ +L S ++P N FG
Sbjct: 145 QCKQAPNPSCTVSKSCGFNMTYGGSTIEAYLTQDTLTL-ASDVIP----------NYTFG 193
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-FHSRLEFG 178
C + S+ + G+MGL S + Q L FS CL S F L G
Sbjct: 194 C-INKASGTSLPAQ---GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLG 249
Query: 179 DQ----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ + K +++P ++ G I D G+
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGT 309
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLF---TCRKCGVTCFNLPARFNSFPSMTYHFQ 267
V T + Y + EF + L TC V FPS+T+ F
Sbjct: 310 VYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVV----------FPSVTFMFA 359
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
G ++ + P+N+ I + + A P ++L + Q N + + D+
Sbjct: 360 GMNVTLPPDNLLIHSSAGN-LSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDV 412
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 136/346 (39%), Gaps = 54/346 (15%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS-PF 72
IG P + ++D L WTQC C C++Q+ P++ + +++ PC +CKS P
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108
Query: 73 HCFEGD-CFYGITYG---DVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFV 128
GD C Y T D + T + +T + + ++ FGC + S D
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAI-------GTATASLAFGCVVAS-DID 160
Query: 129 SIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD--QIIAGKS 186
++ +G +GL S + Q+ +L +FS CL SRL G ++ G+S
Sbjct: 161 TMDG--TSGFIGLGRTPRSLVAQM-KLT--KFSYCLSPRGTGKSSRLFLGSSAKLAGGES 215
Query: 187 LNLPP-----------NSFTIKLNGQRG-----CINDCGSVLTVIECEVYAVLTAEFIDY 230
+ P + + + L+ R G +L + +++L
Sbjct: 216 TSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSLLVDSAYRA 275
Query: 231 FSQHDIEKLFTCRKCGVT--------CFNLPARFN--SFPSMTYHFQ--GADLVVEPENV 278
F + E + + CF A F+ + P + + FQ GA L V P
Sbjct: 276 FKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKY 335
Query: 279 FIFNHQD------SFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
I ++ + T +G ++LG+ Q N F+YDL
Sbjct: 336 LIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDL 381
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/354 (21%), Positives = 127/354 (35%), Gaps = 70/354 (19%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P + + LDT W C C C + +++ S + L C
Sbjct: 87 TYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAP 144
Query: 67 SCK---SPFHCFEGDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
CK +P C + +TYG + Y T++ +L S ++P N FG
Sbjct: 145 QCKQAPNPSCTVSKSCGFNMTYGGSTIEAYLTQDTLTL-ASDVIP----------NYTFG 193
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-FHSRLEFG 178
C + S+ + G+MGL S + Q L FS CL S F L G
Sbjct: 194 C-INKASGTSLPAQ---GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLG 249
Query: 179 DQ----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ + K +++P ++ G I D G+
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGT 309
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLF---TCRKCGVTCFNLPARFNSFPSMTYHFQ 267
V T + Y + EF + L TC V FPS+T+ F
Sbjct: 310 VYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVV----------FPSVTFMFA 359
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
G ++ + P+N+ I + + A P ++L + Q N + + D+
Sbjct: 360 GMNVTLPPDNLLIHSSAGN-LSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDV 412
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 74/175 (42%), Gaps = 13/175 (7%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN-DPIYNSRSFKSYKKLPCYDA 66
Y + L +G P + L + DT + L W +C C++C + +R ++ CYD+
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDS 148
Query: 67 SCK---SPFHC------FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
+C+ P H C Y +YGD +T S +T+T L ++ I
Sbjct: 149 ACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTT-LNTSSGREAKLKGIA 207
Query: 118 FGCS--LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS 170
FGC+ + G+MGL S QLG ++FS CL+ D S
Sbjct: 208 FGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDIS 262
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 76/355 (21%), Positives = 132/355 (37%), Gaps = 60/355 (16%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++DT + +T+ C C+ C DP + ++Y+ + C
Sbjct: 90 NGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCT 149
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
+ +C + C Y Y ++ + D + E SP Q FGC E
Sbjct: 150 WQCNCDND----RKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSP---QRAIFGC--E 200
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCLV---------------- 165
+ + I + GIMGL S M QL +++ D FS C
Sbjct: 201 NDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISP 260
Query: 166 ---------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
P +S + ++ + +AGK L+L P F +G+ G + D G+ +
Sbjct: 261 PADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVF----DGKHGTVLDSGTTYAYLP 316
Query: 217 CEVYAVLTAEFIDYFSQHDIEKL-----------FTCRKCGVTCFNLPARFNSFPSMTYH 265
+ + H ++++ F+ + V+ + SFP +
Sbjct: 317 ESAFLAFKHAIMK--ETHSLKRISGPDPRYNDICFSGAEIDVSQIS-----KSFPVVEMV 369
Query: 266 F-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G L + PEN + + + G T+LG NT +YD +
Sbjct: 370 FGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDRE 424
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 133/359 (37%), Gaps = 59/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G+P + +DT + + W C PC C + + ++++ S + LP
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 63 CYDASCKSPF----HCF--EGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSV 113
C D C + C C Y Y D T DS+ LL + S
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSA 203
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
I FGCS+ ++ K + GI G S + QL + P FS CL
Sbjct: 204 -TIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGG 262
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
++P + H L+ ++G+ L PN ++ I D
Sbjct: 263 GILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQ---LFPNPTMFPISNAGETIIDS 319
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ 267
G+ L + EVY + + SQ + G CF + + FP + ++F+
Sbjct: 320 GTTLAYLVEEVYDWIVSVITSAVSQSATPTI----SRGSQCFRVSMSVADIFPVLRFNFE 375
Query: 268 G-ADLVVEPENVFIFNHQDSFFFF-------FGPAFTPRKGKTILGARHQHNTQFVYDL 318
G A +VV PE F+ S + F F A G ILG + VYDL
Sbjct: 376 GIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKA---EDGLNILGDLVLKDKIIVYDL 431
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 134/357 (37%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC--YEQNDPIYNSRSFKSYKKLPCYD 65
YM++L IG P + + ++DT + L W +C C C + I+ S + SYKKLPC
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 66 ASCKSPFHCF-----EGDCFYGITYGDVYETKEVDSLDTSTLLP--PDEPSPVSVQNIRF 118
C E C Y YGD T D + E F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK--SFHSRLE 176
GC + K G++GL S S + QLG + +FS CLV D S S L
Sbjct: 125 GCGRKLKG----DWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLF 180
Query: 177 FGDQ------------IIAGKSLN-----LPPNSFTI---------KLNGQRGCINDCGS 210
G I+ G L+ + S T+ K +G + +
Sbjct: 181 LGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFLA 240
Query: 211 VLTVIEC-EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-----CFNLPARFN-SFPSMT 263
TVI+ Y +LT + + IE+ G + CFN + FPS+T
Sbjct: 241 NKTVIDSGTTYTLLTPPVYEAMRK-SIEEQVILPTLGNSAGLDLCFNSSGDTSYGFPSVT 299
Query: 264 YHFQGADLVVEP-ENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
++F +V P EN+F +D + G +I+G Q N +YDL
Sbjct: 300 FYFANQVQLVLPFENIFQVTSRDVVCL----SMDSSGGDLSIIGNMQQQNFHILYDL 352
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 119/325 (36%), Gaps = 46/325 (14%)
Query: 25 LDTVAGLTWTQCQPCK---SCYEQNDPIYNSRSFKSYKKLPCYDASCKS-----PFHCFE 76
+DT + L+W QC+PC SCY Q DP+++ SY +PC C C
Sbjct: 3 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSA 62
Query: 77 GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS-LESKDFVSIQKKII 135
C Y ++YGD T V S DT TL + +VQ FGC +S F +
Sbjct: 63 AQCGYVVSYGDGSNTTGVYSSDTLTL-----SASSAVQGFFFGCGHAQSGLFNGVD---- 113
Query: 136 AGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPDKSFHSRLEFGDQIIAGKSLN----LP 190
G++GL + S + Q FS CL +P + + L G A + LP
Sbjct: 114 -GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLP 172
Query: 191 ----PNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQ---HDIEKLFTCR 243
P + + L G I+ G L+V + ++ L +
Sbjct: 173 SPNAPTYYVVMLTG----ISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAF 228
Query: 244 KCGVTCFNLP-ARFNSFPSMTYHFQGADLVVEPENVFIFNH-------QDSFFFFFGPAF 295
+ G+ + P A N Y+F G V P F D F AF
Sbjct: 229 RSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFGCLAF 288
Query: 296 TPR---KGKTILGARHQHNTQFVYD 317
P G ILG Q + + D
Sbjct: 289 APSGSDGGMAILGNVQQRSFEVRID 313
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 84/189 (44%), Gaps = 21/189 (11%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F+ + Y++KL IG P + +DT + + W C CK C+ Q+ I+N + +Y+
Sbjct: 91 IFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQD 150
Query: 61 LPCYDASCKSPFHCFEGD--CFYGITYGDVYETKEVD------SLDTSTLLPPD-EPSPV 111
PC C++ + D C Y E +++ ++DT TL D P P+
Sbjct: 151 APCDSYQCETTSSSCQSDNVCLYSCD-----EKHQLNCPNGRIAVDTMTLTSSDGRPFPL 205
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF 171
+ G S+ K F + G++GL + S +L L +FS CL
Sbjct: 206 PYSDFVCGNSIY-KTFAGV------GVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQ 258
Query: 172 HSRLEFGDQ 180
S++ FG Q
Sbjct: 259 PSKINFGLQ 267
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 144/358 (40%), Gaps = 51/358 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y KLG+G P K + +DT + + W C C C ++D +Y+ + ++ + +
Sbjct: 70 YFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELIS 129
Query: 63 CYDASCKSPFHC------FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN- 115
C C + + E C Y ITYGD T D T ++ + QN
Sbjct: 130 CDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNS 189
Query: 116 -IRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
I FGC +++S S ++ + GI+G ++S + QL V FS CL
Sbjct: 190 SIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGG 249
Query: 165 -------VQPDKS--------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
V+P S H + + L LP + F NG +G I D G
Sbjct: 250 IFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFD-SGNG-KGTIIDSG 307
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQG 268
+ L + VY L + + ++ KL+ + +CF + FP + HF+
Sbjct: 308 TTLAYLPAIVYDELIPKVM---ARQPRLKLYLVEQ-QFSCFQYTGNVDRGFPVVKLHFED 363
Query: 269 A-DLVVEPENVFIFNHQDSFF--FFFGPAFTPRKGK--TILGARHQHNTQFVYDLDTF 321
+ L V P + ++F +D + + + GK T+LG N +YDL+
Sbjct: 364 SLSLTVYPHD-YLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENM 420
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 80/374 (21%), Positives = 139/374 (37%), Gaps = 69/374 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + L +G P +++ +LDT + L+W C P + + + + R+ ++ +
Sbjct: 79 FHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAV 138
Query: 62 PCYDASCK-----SPFHC--FEGDCFYGITYGDVYETKEVDSLDTSTL--LPPDEPSPVS 112
PC A C+ SP C C ++Y D + + D + PP
Sbjct: 139 PCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPP------- 191
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
FGC + D S AG++G+N + SF+ Q RFS C+ D +
Sbjct: 192 -LRAAFGCMSSAFD-SSPDGVASAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV 246
Query: 171 ---------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLN 199
+ R+ + Q+ + GK L +P +
Sbjct: 247 LLLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHT 306
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLPAR 255
G + D G+ T + + Y+ L AEF ++ + L F ++ TCF +P
Sbjct: 307 GAGQTMVDSGTQFTFLLGDAYSALKAEFTRQ-ARPLLPALDDPSFAFQEAFDTCFRVPQG 365
Query: 256 FN----SFPSMTYHFQGADLVVEPENVFI------FNHQDSFFFFFGPAFTPRKGKTILG 305
+ P +T F GA++ V + + + FG A ++G
Sbjct: 366 RSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIG 425
Query: 306 ARHQHNTQFVYDLD 319
HQ N YDL+
Sbjct: 426 HHHQMNVWVEYDLE 439
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 141/360 (39%), Gaps = 55/360 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+G+G + +DT + W C C +C +++ +Y+ S K+ K +P
Sbjct: 77 YYTKIGLGP--NDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVP 134
Query: 63 CYDASCKS----PFHCFEGD--CFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSV 113
C D C S P + D C Y ITYGD T D L ++ P +
Sbjct: 135 CDDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNT 194
Query: 114 QNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL------ 164
I FGC S +S S + GI+G ++S + QL V FS CL
Sbjct: 195 SVI-FGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGG 253
Query: 165 --------VQPD--------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
VQP + H + D +AG + LP + F RG I D
Sbjct: 254 GIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIF--DSTSGRGTIIDS 311
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHD---IEKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
G+ L + +Y L + + S + +E FTC + + ++FP++ +
Sbjct: 312 GTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFTCFHYS----DEKSLDDAFPTVKFT 367
Query: 266 F-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRK-GK--TILGARHQHNTQFVYDLDTF 321
F +G L P + +D + + + K GK +LG N F+YDLD
Sbjct: 368 FEEGLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIYDLDNM 427
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 133/339 (39%), Gaps = 45/339 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ + +G P K + DT + L W Q +PC C I++ R +++++ C
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSSQL 112
Query: 68 CKS-PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C P C G C Y YG ET+ + DT +L S + GC + +
Sbjct: 113 CTELPGSCEPGSSACSYSYEYGS-GETEGEFARDTISLGTTSGGSQ-KFPSFAVGCGMVN 170
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD-KSFHSRLEFG-DQII 182
F + G++GL S QL + +FS CLV + +S S L FG +
Sbjct: 171 SGFDGVD-----GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAAL 225
Query: 183 AG---KSLNLPPNSFT-----------IKLNGQR-----GCINDCGSVLTVIECEVYAVL 223
G +S + P S T I + GQ I D G+ LT + VY +
Sbjct: 226 HGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRV 285
Query: 224 TAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPARFN-SFPSMTYHFQGADLVVEPENVFIF 281
+ + ++ G+ C++ + N FP++T GA + N F+
Sbjct: 286 LSRMESMVTLPRVDG----SSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLV 341
Query: 282 --NHQDSFFFFFGPAFT-PRKGKTILGARHQHNTQFVYD 317
+ D+ G A P +I+G Q +YD
Sbjct: 342 VDDSGDTVCLAMGSAGGLP---VSIIGNVMQQGYHILYD 377
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 131/353 (37%), Gaps = 50/353 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G+P + +DT + + W C PC C + + ++++ S + LP
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 63 CYDASCKSPF----HCF--EGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSV 113
C D C + C C Y Y D T DS+ LL + S
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSA 203
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
I FGCS+ ++ K + GI G S + QL + P FS CL
Sbjct: 204 -TIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGG 262
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
++P + H L+ ++G+ L PN ++ I D
Sbjct: 263 GILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQ---LFPNPTMFPISNAGETIIDS 319
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ 267
G+ L + EVY + + SQ + G CF + + FP + ++F+
Sbjct: 320 GTTLAYLVEEVYDWIVSVITSAVSQSATPTI----SRGSQCFRVSMSVADIFPVLRFNFE 375
Query: 268 G-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRK-GKTILGARHQHNTQFVYDL 318
G A +VV PE F+ + F + G ILG + VYDL
Sbjct: 376 GIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDL 428
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 135/357 (37%), Gaps = 57/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + +DT + + W C C +C + ++S S + ++
Sbjct: 66 YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVR 125
Query: 63 CYDASCKSPFHCFE-------GDCFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C S C Y YGD V +T D++ +L+ D
Sbjct: 126 CSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLI--DN 183
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLV 165
S + I FGCS ++ K + GI G S + QL + P FS CL
Sbjct: 184 SSAL----IVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL- 238
Query: 166 QPDKSFHSRLEFGDQIIAGKSLN-LPP-------NSFTIKLNGQ--------------RG 203
+ D S L G+ + G + L P N +I +NGQ +G
Sbjct: 239 KGDGSGGGILVLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQG 298
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSM 262
I D G+ L + E Y F+ + + G C+ + + FP
Sbjct: 299 TIVDSGTTLAYLVAEAY----DPFVSAVNAIVSPSVTPITSKGNQCYLVSTSVSQMFPLA 354
Query: 263 TYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+++F GA +V++PE+ I + F +G TILG + FVYDL
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQGVTILGDLVLKDKIFVYDL 411
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 119/316 (37%), Gaps = 55/316 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + GIG P L DT + L WT+C C C + P Y S S + C D +
Sbjct: 92 YAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRT 151
Query: 68 CK----------SPFHCFEGDCFYGITYGDVYETKEVDS--LDTSTLLPPDEPSPVSVQN 115
C + G+C Y YG+ +T L T T D+ + +
Sbjct: 152 CGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAA--AFPG 209
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG-RLVPDRFSCCLVQPDK-SFHS 173
I FGC+L S+ +G++GL S + QL R S L P SF S
Sbjct: 210 IAFGCTLRSEGGFGTG----SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGS 265
Query: 174 RLEF----GDQI-------------------------IAGKSLNLPPNSFTI-KLNGQRG 203
+ GD + GK + +P +F+ + G G
Sbjct: 266 LADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGG 325
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEK-LFTCRKCGVTCFNLPARFNSFPSM 262
I D G+ LT++ Y ++ E + SQ +K + CF + +FPSM
Sbjct: 326 VIFDSGTTLTMLPDPAYTLVRDELL---SQMGFQKPPPAANDDDLICFTGGSSTTTFPSM 382
Query: 263 TYHFQ-GADLVVEPEN 277
HF GAD+ + EN
Sbjct: 383 VLHFDGGADMDLSTEN 398
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 127/355 (35%), Gaps = 64/355 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++D+ + +T+ C C+ C DP + +Y + C
Sbjct: 88 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCN 147
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SL 122
D +C + C Y Y ++ + V D + E P Q FGC +
Sbjct: 148 VDCTCDNE----RSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP---QRAVFGCENT 200
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---------------- 164
E+ D S GIMGL S M QL ++ D FS C
Sbjct: 201 ETGDLFSQHAD---GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMP 257
Query: 165 VQPDKSF-HS--------RLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
PD F HS +E + +AGK+L L P F N + G + D G+
Sbjct: 258 APPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF----NSKHGTVLDSGT----- 308
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCFNLPARFNS-----FPSMT 263
YA L + F K+ + +K CF R S FP +
Sbjct: 309 ---TYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVD 365
Query: 264 YHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F G L + PEN + + + G + T+LG NT YD
Sbjct: 366 MVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 420
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 128/338 (37%), Gaps = 47/338 (13%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--P 71
IG P ++ +D L WTQC C C++Q+ P++ + ++K PC CKS
Sbjct: 30 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 89
Query: 72 FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQ 131
C C + G T + + DT + +P S + FGC + S D ++
Sbjct: 90 PKCASDVCAFDGVTGLGGHTVGIVATDTFAI---GTAAPAS---LGFGCVVAS-DIDTMG 142
Query: 132 KKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI-IAGKSLNLP 190
+G +GL S + Q+ +L RFS CL D +SRL G +AG P
Sbjct: 143 GP--SGFIGLGRTPWSLVAQM-KLT--RFSYCLAPHDTGKNSRLFLGASAKLAGGGAWTP 197
Query: 191 -----PNS-----FTIKLN----GQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDI 236
PN + I+L G G +++ V V + +D Q
Sbjct: 198 FVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAV--VRVSLLVDSVYQEFK 255
Query: 237 EKLFTCRKCGVT----------CFNLPARFNSFPSMTYHFQ-GADLVVEPENVFIFNHQD 285
+ + T CF A + P + + FQ GA L V P N D
Sbjct: 256 KAVMASVGAAPTATPVGEPFEVCFP-KAGVSGAPDLVFTFQAGAALTVPPANYLFDVGND 314
Query: 286 ----SFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
S T G ILG+ Q N ++DLD
Sbjct: 315 TVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLD 352
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 82/349 (23%), Positives = 132/349 (37%), Gaps = 54/349 (15%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYN---SRSFKSYKKLP 62
T ++ L IG P ++DT + + W C PC +C +++ S +F K P
Sbjct: 99 RTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCKTP 158
Query: 63 CYDASCKS---PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C CK PF I+Y D D DE + + ++ G
Sbjct: 159 CGFKGCKCDPIPFT---------ISYVDNSSASGTFGRDILVFETTDEGTS-QISDVIIG 208
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC---LVQPDKSFHS-RL 175
C + F S GI+GLN S Q+GR +FS C L P +++ RL
Sbjct: 209 CG-HNIGFNS--DPGYNGILGLNNGPNSLATQIGR----KFSYCIGNLADPYYNYNQLRL 261
Query: 176 EFGDQI----------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
G + + K L++ +F +K NG G I D G+ +T
Sbjct: 262 GEGADLEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTIT 321
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR-FNSFPSMTYHF-QGADL 271
+ + +L E + + +F + + + +R FP +T+HF GADL
Sbjct: 322 YLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADL 381
Query: 272 VVEPENVFIFNHQDSFFFFFGPA--FTPRKGKTILGARHQHNTQFVYDL 318
++ + F D F PA +++G Q + YDL
Sbjct: 382 ALDTGS-FFSQRDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDL 429
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 115/311 (36%), Gaps = 58/311 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y ++ +G P ++ +DT + L W C PC C +D PI Y+ ++ S K+P
Sbjct: 36 YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95
Query: 63 CYDASCKSPFHCFEG------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C D SC E C Y YGD T D + + +
Sbjct: 96 CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVN------ATATV 149
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL---------- 164
FGC + +S ++ + GI+G SF QL + P+ F+ CL
Sbjct: 150 IFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGIL 209
Query: 165 -----VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
++PD + H + + +L + P F+ + +G I D G+
Sbjct: 210 VLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDV--MQGTIFDSGTT 267
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF--NSFPSMTYHFQGA 269
L + E Y T + + + C +RF FP++ +F+GA
Sbjct: 268 LAYLPDEAYQAFT------------QAVSLVVAPFLLCDTRLSRFIYKLFPNVVLYFEGA 315
Query: 270 DLVVEPENVFI 280
+ + P I
Sbjct: 316 SMTLTPAEYLI 326
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/295 (24%), Positives = 114/295 (38%), Gaps = 43/295 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI-----YNSRSFKSYKKLP 62
Y +GIG P + LDT + W CK C ++D + Y+ RS S K++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 142
Query: 63 CYDASCKSPFHC-FEGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRF 118
C D C S C C Y Y D T + D L L + P S ++ F
Sbjct: 143 CDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTST-SVTF 201
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL------------ 164
GC L+ ++ I GI+G + + + QL + FS CL
Sbjct: 202 GCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIG 261
Query: 165 --VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLT 213
V+P + + + +AG +L LP N F +G D GS L
Sbjct: 262 EVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIF--GTTKTKGTFIDSGSTLV 319
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN-LPARFNSFPSMTYHFQ 267
+ +Y+ L + F++H + CF+ L + + FP +T+HF+
Sbjct: 320 YLPEIIYSEL---ILAVFAKH--PDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 369
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/366 (22%), Positives = 133/366 (36%), Gaps = 72/366 (19%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ----PCKSCYEQNDPIYNSRSFKSYKK 60
+ + L +GI P K ++DT + L WTQC+ + + P+Y+ ++
Sbjct: 13 DQGHSLTVGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAF 69
Query: 61 LPCYDASCKSPFHCFEGDCFYG--ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
LPC D C+ F+ +C Y DVY + + S VS++ + F
Sbjct: 70 LPCSDRLCQEGQFSFK-NCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLR-LGF 127
Query: 119 GC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
GC +L + + GI+GL+ +S S + QL RFS CL S L F
Sbjct: 128 GCGALSAGSLIG-----ATGILGLSPESLSLITQLKI---QRFSYCLTPFADKKTSPLLF 179
Query: 178 GDQI----------------------------------IAGKSLNLPPNSFTIKLNGQRG 203
G + K L +P S ++ +G G
Sbjct: 180 GAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGG 239
Query: 204 CINDCGSVLT-VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG---VTCFNLPARFNS- 258
I D GS + ++E AV A D+ +L + CF LP R +
Sbjct: 240 TIVDSGSTVAYLVEAAFEAVKEAVM-------DVVRLPVANRTVEDYELCFVLPRRTAAA 292
Query: 259 ------FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
P + HF G +V P + + + T G +I+G Q N
Sbjct: 293 AMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNM 352
Query: 313 QFVYDL 318
++D+
Sbjct: 353 HVLFDV 358
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/339 (23%), Positives = 133/339 (39%), Gaps = 45/339 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ G P +S++ L+DT + + W C+ C+ C+ PI++ SYK C
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHS-TAPIFDPAKSSSYKPFACDSQP 173
Query: 68 CKSPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ G+ C + ++YGD + + D TL P N FGC+
Sbjct: 174 CQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQYLP------NFSFGCAESLS 227
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGK 185
+ S ++ G T L FS CL S S + + ++
Sbjct: 228 EDTSPSPGLMGLGGGSLSLLT--QAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSS 285
Query: 186 SL---------NLPPNSF----TIKLNGQR------------GCINDCGSVLTVIECEVY 220
SL ++P F I + R G I D G+ +T + Y
Sbjct: 286 SLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVPSAY 345
Query: 221 AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF-QGADLVVEPENVF 279
L D F Q T + TC++L + P++T H + DLV+ EN+
Sbjct: 346 TALR----DAFRQQLSSLQPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENIL 401
Query: 280 IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
I Q+S AF+ ++I+G Q N + V+D+
Sbjct: 402 I--TQESGLACL--AFSSTDSRSIIGNVQQQNWRIVFDV 436
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 115/311 (36%), Gaps = 58/311 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y ++ +G P ++ +DT + L W C PC C +D PI Y+ ++ S K+P
Sbjct: 36 YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95
Query: 63 CYDASCKSPFHCFEG------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C D SC E C Y YGD T D + + +
Sbjct: 96 CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVN------ATATV 149
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL---------- 164
FGC + +S ++ + GI+G SF QL + P+ F+ CL
Sbjct: 150 IFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGIL 209
Query: 165 -----VQPDKS--------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
++PD +H + + +L + P F+ + +G I D G+
Sbjct: 210 VLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDV--MQGTIFDSGTT 267
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF--NSFPSMTYHFQGA 269
L + E Y T + + + C +RF FP++ +F+GA
Sbjct: 268 LAYLPDEAYQAFT------------QAVSLVVAPFLLCDTRLSRFIYKLFPNVVLYFEGA 315
Query: 270 DLVVEPENVFI 280
+ + P I
Sbjct: 316 SMTLTPAEYLI 326
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 135/358 (37%), Gaps = 64/358 (17%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
LN Y +L IG P + ++DT + +T+ C C+ C DP + +Y+ + C
Sbjct: 77 LNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC 136
Query: 64 -YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
D +C + C Y Y ++ + V D + E +P Q FGC +
Sbjct: 137 TLDCNCDND----RMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAP---QRAVFGCEN 189
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------------- 164
+E+ D S GIMGL S M QL +V D FS C
Sbjct: 190 VETGDLYSQHAD---GIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGI 246
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + ++ + +AGK L L P+ F +G+ G + D G+
Sbjct: 247 SPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVF----DGKHGSVLDSGT---- 298
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPSM 262
YA L E F + +++L + + CF ++ +FP +
Sbjct: 299 ----TYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVV 354
Query: 263 TYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G + PEN + + + G + T+LG NT +YD +
Sbjct: 355 DMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRE 412
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 138/359 (38%), Gaps = 58/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K+ + +DT + + W C CK C ++ +Y+ + S K +P
Sbjct: 83 YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVP 142
Query: 63 CYDASCKSPFHCFEGDCFYGIT------YGDVYETKE--VDSLDTSTLLPPDEPSPVSVQ 114
C CK C I+ YGD T V + + D + +
Sbjct: 143 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 202
Query: 115 NIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
+I FGC + +S D S ++ + GI+G ++S + QL V F+ CL
Sbjct: 203 SIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 262
Query: 165 -------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
VQP + H + + L+L + T ++G I D G
Sbjct: 263 IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTD--TSAQGDRKGTIIDSG 320
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ- 267
+ L + +Y L + I SQH K+ T TCF + FP++T+ F+
Sbjct: 321 TTLAYLPEGIYEPLVYKMI---SQHPDLKVQTLHD-EYTCFQYSESVDDGFPAVTFFFEN 376
Query: 268 GADLVVEPE-------NVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G L V P N + Q+S G K T+LG N YDL+
Sbjct: 377 GLSLKVYPHDYLFPSVNFWCIGWQNS-----GTQSRDSKNMTLLGDLVLSNKLVFYDLE 430
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 93/208 (44%), Gaps = 23/208 (11%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQP--CKSCYEQNDPIYNSRSFKSY 58
M + Y++K IG P + + D+ + L W QC C++CY Q P++N +Y
Sbjct: 94 MSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTY 153
Query: 59 KKLPCYDASCKSP-----FHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
K C A C+ + C + + C Y Y D T+ V S D T P+ S
Sbjct: 154 MKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTF--PEHISGF 211
Query: 112 SVQNIR--FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL-VQPD 168
+R FGC + D Q G++GL + S +G++ D+FS C+ + +
Sbjct: 212 GNYTLRIIFGCGYNNSD---PQHFYPPGLVGLTNNKASL---VGQMDVDQFSYCVSIDTE 265
Query: 169 KSFHSRLE--FG-DQIIAGKSLNLPPNS 193
++ +E FG I+G S L PNS
Sbjct: 266 QNLKGSMEIRFGLAASISGHSTQLVPNS 293
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 131/319 (41%), Gaps = 57/319 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P KS + +DT + + W C CK C ++ +YN S K +
Sbjct: 80 YYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVS 139
Query: 63 CYDASC----KSPFHCFEGD--CFYGITYGDVYET-----KEVDSLDTSTLLPPDEPSPV 111
C D C P + + C Y YGD T K+V D+ + D +
Sbjct: 140 CDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDS---VAGDLKTQT 196
Query: 112 SVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---GRLVPDRFSCCL--- 164
+ ++ FGC + +S D S ++ + GI+G ++S + QL GR V F+ CL
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR-VKKIFAHCLDGR 255
Query: 165 -----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
VQP + H + + + L +P + F + ++G I
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLF--QPGDRKGAI 313
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTY 264
D G+ L + +Y L + H ++K + CF R + FP++T+
Sbjct: 314 IDSGTTLAYLPEIIYEPLVKKE-PALKVHIVDKDY-------KCFQYSGRVDEGFPNVTF 365
Query: 265 HFQGADLVVEPENVFIFNH 283
HF+ + + + ++F H
Sbjct: 366 HFENSVFLRVYPHDYLFPH 384
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 119/316 (37%), Gaps = 55/316 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y + GIG P L DT + L WT+C C C + P Y S S + C D +
Sbjct: 92 YAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRT 151
Query: 68 CK----------SPFHCFEGDCFYGITYGDVYETKEVDS--LDTSTLLPPDEPSPVSVQN 115
C + G+C Y YG+ +T L T T D+ + +
Sbjct: 152 CGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAA--AFPG 209
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG-RLVPDRFSCCLVQPDK-SFHS 173
I FGC+L S+ +G++GL S + QL R S L P SF S
Sbjct: 210 IAFGCTLRSEGGFGTG----SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGS 265
Query: 174 RLEF----GDQI-------------------------IAGKSLNLPPNSFTI-KLNGQRG 203
+ GD + GK + +P +F+ + G G
Sbjct: 266 LADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGG 325
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEK-LFTCRKCGVTCFNLPARFNSFPSM 262
I D G+ LT++ Y ++ E + SQ +K + CF + +FPSM
Sbjct: 326 VIFDSGTTLTMLPDPAYTLVRDELL---SQMGFQKPPPAANDDDLICFTGGSSTTTFPSM 382
Query: 263 TYHFQ-GADLVVEPEN 277
HF GAD+ + EN
Sbjct: 383 VLHFDGGADMDLSTEN 398
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 130/355 (36%), Gaps = 64/355 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++D+ + +T+ C C+ C DP + SY + C
Sbjct: 85 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKCN 144
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SL 122
D +C S + C Y Y ++ + V D + E P Q+ FGC +
Sbjct: 145 VDCTCDSD----KKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKP---QHAIFGCENS 197
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCC----------------L 164
E+ D S GIMGL S M QL ++ D FS C L
Sbjct: 198 ETGDLFSQHAD---GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGML 254
Query: 165 VQPD---------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
PD +S + +E + +AGK+L + F N + G + D G+
Sbjct: 255 APPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIF----NSKHGTVLDSGT----- 305
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKC-------GVTCF-----NLPARFNSFPSMT 263
YA L + F + K+ + +K CF N+ FP +
Sbjct: 306 ---TYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVD 362
Query: 264 YHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F G L + PEN + + + G + T+LG NT YD
Sbjct: 363 MVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYD 417
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 102/274 (37%), Gaps = 40/274 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND------PIYNSRSFKSYKKL 61
Y K+ +G P + +DT + +TW C PC SC + Y+ + L
Sbjct: 37 YYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGAL 96
Query: 62 PCYDASCKSPFHCFE------GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS-VQ 114
C D++C + E G C Y TYGD T+ D T + V+
Sbjct: 97 SCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGTA 156
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL-------- 164
++ FGC + + + + G++G + S QL + V +RF+ CL
Sbjct: 157 SVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGG 216
Query: 165 -------VQPDKSF-----HSRLEFGDQIIAGKSLNL-PPNSFTIKLNGQRGCINDCGSV 211
+P+ S+ + G Q IA N+ P SF G I D G+
Sbjct: 217 TIVIGSVSEPNISYTPIVSRNHYAVGMQNIAVNGRNVTTPASFDTTSTSAGGVIMDSGTT 276
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKC 245
L + Y +F++ S + + +C
Sbjct: 277 LAYLVDPAY----TQFVNAVSTFESSMFSSHSQC 306
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/359 (22%), Positives = 129/359 (35%), Gaps = 58/359 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + L+ +DT + + W C C C + + ++ S + +
Sbjct: 77 YYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLIS 136
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C+S + C Y YGD V + S+ TL
Sbjct: 137 CLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSS 196
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
S V FGCS+ ++ ++ + GI G S + QL + P FS CL
Sbjct: 197 ASVV------FGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLK 250
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
V+P+ + H L + G+ + + P+ F N R
Sbjct: 251 GDNSGGGVLVLGEIVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNN--R 308
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + E Y Q L +C + + FP +
Sbjct: 309 GTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQC--YLITTSSNVDIFPQV 366
Query: 263 TYHFQ-GADLVVEPENVFIFNH--QDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ +F GA LV+ P++ + + + + G + TILG + FVYDL
Sbjct: 367 SLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYDL 425
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 133/357 (37%), Gaps = 58/357 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + +DT + + W C C C + ++ ++ S +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 63 CYDASCKSPFHCFEG-----DCFYGITYGD--------VYETKEVDSLDTSTLLPPDEPS 109
C D C S F G C Y YGD + + D++ TST L + +
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITST-LAINSSA 202
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL--- 164
P FGCS + ++ + GI GL S S + QL L P FS CL
Sbjct: 203 P-----FVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGD 257
Query: 165 ------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC 204
+PD + H + + G+ L + P+ FTI G
Sbjct: 258 KSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATG--DGT 315
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMT 263
I D G+ L + E Y+ + SQ+ + + CF + A + FP ++
Sbjct: 316 IIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ----CFEITAGDVDVFPEVS 371
Query: 264 YHFQ-GADLVVEPENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F GA +V+ P IF+ S + G + TILG + VYDL
Sbjct: 372 LSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDL 428
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 136/355 (38%), Gaps = 66/355 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC-YDA 66
Y ++ IG P ++ ++DT + LT+ C C+ C + DP + +Y+ L C +
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMEC 151
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLESK 125
+C S C Y Y ++ + V D + E P Q FGC ++E+
Sbjct: 152 TCDSEMM----HCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP---QRTVFGCENVETG 204
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------------------- 164
D S + GIMGL S + QL ++ + FS C
Sbjct: 205 DIYSQRAD---GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPA 261
Query: 165 ------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
P +S + ++ + IAGK L + P F +G+ G I D G+
Sbjct: 262 GMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVF----DGKYGTILDSGT-------- 309
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTC-------RKCGVTCF-----NLPARFNSFPSMTYHF 266
YA L F +++L + R CF ++ +FP++ F
Sbjct: 310 TYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVF 369
Query: 267 -QGADLVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G L + PEN ++F H + + G T+LG NT +YD +
Sbjct: 370 SNGNRLSLSPEN-YLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDRE 423
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/344 (18%), Positives = 129/344 (37%), Gaps = 50/344 (14%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
T++++ IG P ++L LDT W C C C +++S S++ LPC
Sbjct: 25 TFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSDKSSSFRPLPCQSP 82
Query: 67 SCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + C C + +TYG T D + + L D SV + FGC ++
Sbjct: 83 QCNQVPNPSCSGSACGFNLTYGS--STVAADLVQDNLTLATD-----SVPSYTFGCIRKA 135
Query: 125 KD-------------------------FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR 159
+ S + +N+ + + + + + +
Sbjct: 136 TGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIK 195
Query: 160 FSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
++ L P +S + + K +++PP++ G + D G+ T +
Sbjct: 196 YTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPA 255
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
Y + EF + + + T G TC+ +P P++T+ F G ++ + P+N
Sbjct: 256 YTAVRDEF-----RRRVGRNVTVSSLGGFDTCYTVPI---ISPTITFMFAGMNVTLPPDN 307
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
F+ + A P ++L + Q N + ++D+
Sbjct: 308 -FLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDI 350
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 75/173 (43%), Gaps = 20/173 (11%)
Query: 4 LNHTYMLKLGIGDPV-KSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLP 62
++ Y++ L IG P + + LDT + L WTQC C C+ Q P +++ + ++ +P
Sbjct: 96 IDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVP 154
Query: 63 CYDASCKSPFHCFEG------DCFYGITYGDVYETKEVDSLDTSTLLPPD------EPSP 110
C D C S + G CFY Y D T DT T P +
Sbjct: 155 CSDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAG 214
Query: 111 VSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC 163
V+V N+RFGC +K I K +GI G + S QL RFS C
Sbjct: 215 VAVPNVRFGCGQYNK---GIFKSNESGIAGFSRGPMSLPSQLKVA---RFSHC 261
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 103/290 (35%), Gaps = 24/290 (8%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
+Y+++ G+G PV+ L LDT A TW+ C PC +C + I S S SY LPC
Sbjct: 78 SYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSS--SYASLPCASD 135
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLP------------PDEPSPVSVQ 114
C F G G V +V L ++ P PSP +
Sbjct: 136 WCP----LFRRPAVPG-EPGRVGAAADVRLLQAASRTPRSGVLAATRCGWARTPSPATRS 190
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR 174
S + + + + + + G+ R++ L P +
Sbjct: 191 GPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYY 250
Query: 175 LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQH 234
+ + + P SF + G + D G+V+T VYA L EF Q
Sbjct: 251 VNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAPVYAALRDEF---RRQV 307
Query: 235 DIEKLFTCRKCGVTCFNL-PARFNSFPSMTYHFQGA-DLVVEPENVFIFN 282
+T TCFN P +T H G DL + EN I +
Sbjct: 308 AAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPMENTLIHS 357
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 135/358 (37%), Gaps = 60/358 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + +DT + + W C C C + ++ ++ S +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 63 CYDASCKSPFHCFEG-----DCFYGITYGDVYETK--------EVDSLDTSTLLPPDEPS 109
C D C S F G C Y YGD T D++ TST L + +
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITST-LAINSSA 202
Query: 110 PVSVQNIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL-- 164
P FGCS L+S D ++ + GI GL S S + QL L P FS CL
Sbjct: 203 P-----FVFGCSNLQSGDLQR-PRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKG 256
Query: 165 -------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
+PD + H + + G+ L + P+ FTI G
Sbjct: 257 DKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATG--DG 314
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR-FNSFPSM 262
I D G+ L + E Y+ + SQ+ + + CF + A + FP +
Sbjct: 315 TIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ----CFEITAGDVDVFPQV 370
Query: 263 TYHFQ-GADLVVEPENVF-IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F GA +V+ P IF+ S + G + TILG + VYDL
Sbjct: 371 SLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDL 428
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/261 (22%), Positives = 107/261 (40%), Gaps = 35/261 (13%)
Query: 5 NHTYM-LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
H+Y L +G P ++ ++DT + +T+ C+ C C + ++ + KKL C
Sbjct: 9 RHSYFYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLAC 68
Query: 64 YDA--SCKSPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
D +C +P C C+Y TY + ++ DT PD SPV + FGC
Sbjct: 69 GDPLCNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGF--PDSDSPV---RLVFGC 123
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCLVQPDKSF------- 171
E+ + I +++ GIMG+ + +F QL +++ D FS C P
Sbjct: 124 --ENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLGDVT 181
Query: 172 --------------HSRLEFGDQIIAGKSLNLPPNSFTIKL-NGQRGCINDCGSVLTVIE 216
H L + + + G ++N +F + + G + D G+ T +
Sbjct: 182 LPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFTYLP 241
Query: 217 CEVYAVLTAEFIDYFSQHDIE 237
+ + + DY + ++
Sbjct: 242 TDAFKAMAKAVGDYVEKKGLQ 262
>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
Length = 392
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 80/198 (40%), Gaps = 25/198 (12%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
+T Y+ IG P + ++D L WTQC+ C C+EQ+ P+++ + +Y+
Sbjct: 45 WTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAE 104
Query: 62 PCYDASCKS----PFHCFEGDCFY--GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
PC C+S +C C Y GD D+ T + +
Sbjct: 105 PCGTPLCESIPSDSRNCSGNVCAYQASTNAGDTGGKVGTDTFAVGT----------AKAS 154
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
+ FGC + S D ++ +GI+GL S + Q G FS CL D +S L
Sbjct: 155 LAFGCVVAS-DIDTMGGP--SGIVGLGRTPWSLVTQTGVAA---FSYCLAPHDAGRNSAL 208
Query: 176 EFGDQII---AGKSLNLP 190
G GK+ + P
Sbjct: 209 FLGSSAKLAGGGKAASTP 226
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 136/355 (38%), Gaps = 66/355 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC-YDA 66
Y ++ IG P ++ ++DT + LT+ C C+ C + DP + +Y+ L C +
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMEC 151
Query: 67 SCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLESK 125
+C S C Y Y ++ + V D + E P Q FGC ++E+
Sbjct: 152 TCDSEMM----HCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP---QRTVFGCENVETG 204
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------------------- 164
D S + GIMGL S + QL ++ + FS C
Sbjct: 205 DIYSQRAD---GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPA 261
Query: 165 ------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
P +S + ++ + IAGK L + P F +G+ G I D G+
Sbjct: 262 GMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVF----DGKYGTILDSGT-------- 309
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTC-------RKCGVTCF-----NLPARFNSFPSMTYHF 266
YA L F +++L + R CF ++ +FP++ F
Sbjct: 310 TYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVF 369
Query: 267 -QGADLVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G L + PEN ++F H + + G T+LG NT +YD +
Sbjct: 370 SNGNRLSLSPEN-YLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDRE 423
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/344 (19%), Positives = 129/344 (37%), Gaps = 50/344 (14%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
T++++ IG P ++L LDT W C C C +++S S++ LPC
Sbjct: 102 TFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSDKSSSFRPLPCQSP 159
Query: 67 SCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + C C + +TYG T D + + L D SV + FGC ++
Sbjct: 160 QCNQVPNPSCSGSACGFNLTYGS--STVAADLVQDNLTLATD-----SVPSYTFGCIRKA 212
Query: 125 KD-------------------------FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR 159
+ S + +N+ + + + + + +
Sbjct: 213 TGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIK 272
Query: 160 FSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
++ L P +S + + K +++PP++ G + D G+ T +
Sbjct: 273 YTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPA 332
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
Y + EF + + + T G TC+ +P P++T+ F G ++ + P+N
Sbjct: 333 YTAVRDEF-----RRRVGRNVTVSSLGGFDTCYTVPI---ISPTITFMFAGMNVTLPPDN 384
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
I + S A P ++L + Q N + ++D+
Sbjct: 385 FLIHSTAGS-TTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDI 427
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 139/358 (38%), Gaps = 59/358 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P +DT + + W C C +C + +++ + +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVT 159
Query: 63 CYDASCKSPF-----HCFEGD-CFYGITYGDVYETKEVDSLDT-------STLLPPDEPS 109
C D C S F C E + C Y YGD T DT L + +
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQP 167
P I FGCS ++ K + GI G S + QL + P FS CL +
Sbjct: 220 P-----IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KG 273
Query: 168 DKSFHSRLEFGDQIIAGKSLN--LPP------NSFTIKLNGQ--------------RGCI 205
D S G+ ++ G + LP N +I +NGQ RG I
Sbjct: 274 DGSGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTI 333
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMTY 264
D G+ LT + E Y F++ S + + G C+ + + FP ++
Sbjct: 334 VDTGTTLTYLVKEAY----DPFLNAISNSVSQLVTLIISNGEQCYLVSTSISDMFPPVSL 389
Query: 265 HFQ-GADLVVEPENVFIFN---HQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F GA +++ P++ ++F+ + + + G P + +TILG + FVYDL
Sbjct: 390 NFAGGASMMLRPQD-YLFHYGFYDGASMWCIGFQKAPEE-QTILGDLVLKDKVFVYDL 445
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/354 (22%), Positives = 129/354 (36%), Gaps = 57/354 (16%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQ---NDPIYNSRSFKSYKKL 61
+ Y + + +G P +DT + L+W QC+ C+ CY+Q I+N + +Y K+
Sbjct: 23 NKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKV 82
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C +C + C E D C Y + YG + D TL S S
Sbjct: 83 GCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTL-----ASNRS 137
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVP-DRFSCCLVQPDKSF 171
+ N FGC + ++ + AGI+G S SF Q+ + FS C + D
Sbjct: 138 IDNFIFGCGED-----NLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR-DHEN 191
Query: 172 HSRLEFG---------------------------DQIIAGKSLNLPPNSFTIKLNGQRGC 204
L G D ++ G L + P + K+
Sbjct: 192 EGSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT----- 246
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTY 264
I D G+ T I V+ L + + R+ + A +N FP++
Sbjct: 247 IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEM 306
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ L + EN F + + F P +G +LG R + + V+D+
Sbjct: 307 KLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDI 360
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/275 (24%), Positives = 97/275 (35%), Gaps = 66/275 (24%)
Query: 18 VKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASCKS----P 71
+ S +DT + W QC PC CY Q + ++ R + + C +C++
Sbjct: 156 ILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYA 215
Query: 72 FHCFE----GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
C + GDC Y I Y D T DT T+ P + N RFGCS +
Sbjct: 216 NGCSKPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISP-----STTFLNFRFGCSHAVRGK 270
Query: 128 VSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI------ 181
S Q +G M L S + Q R + FS C+ P S L G +
Sbjct: 271 FSAQA---SGTMSLGGGPQSLLSQTARAYGNAFSYCV--PGPSAAGFLSIGGPVNGDDGG 325
Query: 182 ------------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+AG+ LN+PP F+ G + D +V
Sbjct: 326 GSGAFATTPLVRSANVINPTIYVVRLQGIEVAGRRLNVPPVVFS------GGTVMDSSAV 379
Query: 212 LTVIECEVYAVLTAEFID----YFSQHDIEKLFTC 242
+T + Y L F + Y ++ L TC
Sbjct: 380 ITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLDTC 414
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 115/315 (36%), Gaps = 54/315 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ +G P ++++ +LDT W C C C +++++ ++ L C
Sbjct: 95 YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTT--TFSAQNSSTFATLDCSKPE 152
Query: 68 CKSP--FHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPD-EPSPVSVQNIRFGCS 121
C C DC + TYG DS ++TL+ P + N FGC
Sbjct: 153 CTQARGLSCPTTGNVDCLFNQTYGG-------DSTFSATLVQDSLHLGPNVIPNFSFGC- 204
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFG-- 178
+ S SI + G+MGL S + Q G L FS CL F L+ G
Sbjct: 205 ISSASGSSIPPQ---GLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPV 261
Query: 179 -------------------------DQIIAGKSL-NLPPNSFTIKLNGQRGCINDCGSVL 212
I G+ L + P N G I D G+V+
Sbjct: 262 GQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVI 321
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLV 272
T +Y + EF + + F+ TCF S P++T H G DL
Sbjct: 322 TRFVPAIYTAVRDEF-----RKQVGGSFSPLGAFDTCFATNNEV-SAPAITLHLSGLDLK 375
Query: 273 VEPENVFIFNHQDSF 287
+ EN I + S
Sbjct: 376 LPMENSLIHSSAGSL 390
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/321 (22%), Positives = 126/321 (39%), Gaps = 63/321 (19%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
LN Y ++ IG P ++ ++DT + +T+ C C+ C DP + +Y+ + C
Sbjct: 86 LNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSC 145
Query: 64 -YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
D +C + C Y Y ++ + V D + E P Q FGC
Sbjct: 146 NIDCTCDNE----RKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVP---QRAIFGC-- 196
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---------------- 164
E+++ + + GIMGL S + QL ++ D FS C
Sbjct: 197 ENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGIS 256
Query: 165 ---------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
P +S + ++ +AGK L+L P+ F +G+ G + D G+
Sbjct: 257 PPSGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIF----DGKHGTVLDSGT----- 307
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKC-------GVTCF-----NLPARFNSFPSMT 263
YA L F +++L + ++ CF ++ N+FP++
Sbjct: 308 ---TYAYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVE 364
Query: 264 YHF-QGADLVVEPENVFIFNH 283
F G L + PEN ++F +
Sbjct: 365 MVFSNGQKLSLSPEN-YLFQY 384
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/357 (21%), Positives = 129/357 (36%), Gaps = 57/357 (15%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQ---NDPIYNSRSFKSYKKL 61
+ Y + + +G P +DT + L+W QC+ C+ CY+Q I+N + +Y K+
Sbjct: 4 NKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKV 63
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C +C + C E D C Y + YG + D TL S S
Sbjct: 64 GCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTL-----ASNRS 118
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVP-DRFSCCLVQPDKSF 171
+ N FGC + ++ + AGI+G S SF Q+ + FS C + D
Sbjct: 119 IDNFIFGCGED-----NLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR-DHEN 172
Query: 172 HSRLEFG---------------------------DQIIAGKSLNLPPNSFTIKLNGQRGC 204
L G D ++ G L + P + K+
Sbjct: 173 EGSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT----- 227
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTY 264
I D G+ T I V+ L + + R+ + A +N FP++
Sbjct: 228 IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEM 287
Query: 265 HFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
+ L + EN F + + F P +G +LG R + + V+D+
Sbjct: 288 KLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAM 344
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/357 (22%), Positives = 136/357 (38%), Gaps = 66/357 (18%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
+N Y +L IG P + ++DT + +T+ C C+ C DP + +Y+ + C
Sbjct: 9 INGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKC 68
Query: 64 -YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
D +C + C Y Y ++ + V D + S ++ Q FGC +
Sbjct: 69 NIDCNCDDE----KQQCVYERQYAEMSTSSGVLGEDIISF---GNLSALAPQRAVFGCEN 121
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCLV-------------- 165
+E+ D S GIMG+ S + L ++ D FS C
Sbjct: 122 METGDLYSQHAD---GIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGI 178
Query: 166 -----------QPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + ++ + +AGK L L P F +G+ G I D G+
Sbjct: 179 SPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVF----DGKHGTILDSGT---- 230
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKC-------GVTCF-----NLPARFNSFPSM 262
YA L F +++L + + CF ++ +SFP++
Sbjct: 231 ----TYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAV 286
Query: 263 TYHF-QGADLVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHNTQFVYD 317
F G L++ PEN ++F H + G + T+LG NT +YD
Sbjct: 287 EMVFGNGQKLLLSPEN-YLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYD 342
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 5/84 (5%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y+ + +GDP + L+DT + L WTQC C K C Q+ P +N+ S S+ +PC D
Sbjct: 86 YIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQD 145
Query: 66 ASCKSPFHCF---EGDCFYGITYG 86
+C + F +G C + +TYG
Sbjct: 146 KACAGNYLHFCALDGTCTFRVTYG 169
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 72/179 (40%), Gaps = 8/179 (4%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ--PCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y + +G P + + +DT + TW QC PC SC + P+Y R ++ LP D
Sbjct: 160 YYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLY--RPARTADALPASD 217
Query: 66 ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ H C Y I+Y D + V D+ + E +I FGC + +
Sbjct: 218 PLCEGAQHENPNQCDYEISYADGSSSMGVYVRDSMQFV--GEDGERENADIVFGCGYDQQ 275
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRLEFGDQII 182
+ + G++GL + S QL ++ + F C+ L GD I
Sbjct: 276 GVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYI 334
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/355 (22%), Positives = 135/355 (38%), Gaps = 50/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + + +DT + + W C C C + + ++ RS + +
Sbjct: 77 YYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSLIS 136
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSV 113
C D C+S + C Y YGD T V L + + S
Sbjct: 137 CSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTNSS 196
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL------- 164
++ FGCS+ ++ ++ + GI G S + QL + P FS CL
Sbjct: 197 ASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNSGG 256
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
V+P+ + H L + G+ + + P F N RG I D
Sbjct: 257 GVLVLGEIVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNN--RGTIVDS 314
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS--FPSMTYHF 266
G+ L + E Y F++ + + + + G C+ + N FP ++ +F
Sbjct: 315 GTTLAYLAEEAY----NPFVNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNF 370
Query: 267 Q-GADLVVEPENVFIFNH--QDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GA LV+ P++ + + + + G P + TILG + FVYDL
Sbjct: 371 AGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDKIFVYDL 425
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/360 (22%), Positives = 137/360 (38%), Gaps = 56/360 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C C C + P+ ++ S + +
Sbjct: 84 YFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVS 143
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGD-----VYETKEVDSLDTSTLLPPDEPSP 110
C D C + + C Y YGD Y ++ LDT LL E S
Sbjct: 144 CSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDT-LLLSSGELSQ 202
Query: 111 VSV---QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
+ ++ F CS ++ + + GI G S + QL + P FS CL
Sbjct: 203 ICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLK 262
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
V+P+ + H L +AG++L + P+ F N +
Sbjct: 263 GDDSGGGVLVLGEIVEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSN--Q 320
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPS 261
G I D G+ L + Y + S + L G C+ + + N FP
Sbjct: 321 GTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYL----SKGNQCYLVTSSVNDVFPQ 376
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQ--DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
++ +F GA L++ P++ + + + + G TP + TILG + FVYD+
Sbjct: 377 VSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDI 436
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 107/292 (36%), Gaps = 48/292 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C PC C + +N + + K+P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 63 CYDASCKSPFHCFEG--------DCFYGITYGDVYETKEVDSLDT---STLLPPDEPSPV 111
C D C + E C Y TYGD T DT T++ +E +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVM-GNEQTAN 209
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL----- 164
S +I FGCS ++ + + GI G S + QL L P FS CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 269
Query: 165 ----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
V+P + H L ++ G+ L + + FT + +G I
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTIV 327
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS 258
D G+ L + Y F++ + + + G CF +R S
Sbjct: 328 DSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSRLAS 375
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 87/201 (43%), Gaps = 31/201 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +DT + LTW QC PC+SC + P+Y + + +PC +A
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRP---TANRLVPCANA 109
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + H +G C Y I Y D ++ V D+ +L P+ NIR
Sbjct: 110 LCTA-LHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSL-------PMRSSNIR 161
Query: 118 ----FGCSLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKS 170
FGC + + + I G++GL S S + QL + + + CL
Sbjct: 162 PGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 221
Query: 171 FHSRLEFGDQIIAGKSLNLPP 191
F L FGD ++ + P
Sbjct: 222 F---LFFGDDVVPSSRVTWVP 239
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 87/201 (43%), Gaps = 31/201 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +DT + LTW QC PC+SC + P+Y + + +PC +A
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRP---TANRLVPCANA 109
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + H +G C Y I Y D ++ V D+ +L P+ NIR
Sbjct: 110 LCTA-LHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSL-------PMRSSNIR 161
Query: 118 ----FGCSLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKS 170
FGC + + + I G++GL S S + QL + + + CL
Sbjct: 162 PGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 221
Query: 171 FHSRLEFGDQIIAGKSLNLPP 191
F L FGD ++ + P
Sbjct: 222 F---LFFGDDVVPSSRVTWVP 239
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 80/372 (21%), Positives = 140/372 (37%), Gaps = 65/372 (17%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQP--CKSCYEQNDPIYNSRSFKSYK 59
F N + + L +G P +++ +LDT + L+W C P ++ + R+ ++
Sbjct: 59 FHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFA 118
Query: 60 KLPCYDASCK-----SPFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
+PC A C+ SP C C ++Y D + + + T+ + P+
Sbjct: 119 SVPCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV---GQGPPLR 175
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
FGC + D S AG++G+N + SF+ Q RFS C+ D +
Sbjct: 176 AA---FGCMATAFD-TSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV 228
Query: 171 --------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNG 200
+ R+ + Q+ + GK L +P + G
Sbjct: 229 LLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTG 288
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLP--- 253
+ D G+ T + + Y+ L AEF ++ + L F ++ TCF +P
Sbjct: 289 AGQTMVDSGTQFTFLLGDAYSALKAEF-SRQTKPWLPALNDPNFAFQEAFDTCFRVPQGR 347
Query: 254 ARFNSFPSMTYHFQGADLVVEPENVFI------FNHQDSFFFFFGPAFTPRKGKTILGAR 307
A P++T F GA + V + + + FG A ++G
Sbjct: 348 APPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHH 407
Query: 308 HQHNTQFVYDLD 319
HQ N YDL+
Sbjct: 408 HQMNVWVEYDLE 419
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/268 (22%), Positives = 103/268 (38%), Gaps = 49/268 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDP--IYNSRSFKSYKKLPCYD 65
Y + + +G P ++L + DT + LTW +C CK+ + P + +R ++ C+
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFS 142
Query: 66 ASCK-----SPFHC----FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
+ C+ +P C C Y Y D +T S +T+T L + +++I
Sbjct: 143 SLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTT-LNTSSGREMKLKSI 201
Query: 117 RFGCSLESK--DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL---------- 164
FGC + + +G+MGL SF QLGR FS CL
Sbjct: 202 AFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPT 261
Query: 165 --------VQPDKSFHSRLEFGDQIIA-----------------GKSLNLPPNSFTIKLN 199
V K S + F +I G L++ P+ +++
Sbjct: 262 SYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDEL 321
Query: 200 GQRGCINDCGSVLTVIECEVYAVLTAEF 227
G G + D G+ LT + Y + + F
Sbjct: 322 GNGGTVIDSGTTLTFLTEPAYREILSAF 349
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/357 (23%), Positives = 131/357 (36%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K + +DT + + W C C +C + +++ + +
Sbjct: 83 YFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVS 142
Query: 63 CYDASCKSPFHCF-------EGDCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVS 112
C D C C Y YGD T D++ T+L S
Sbjct: 143 CADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANS 202
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------ 164
I FGCS ++ K + GI G + S + QL + P FS CL
Sbjct: 203 SSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENG 262
Query: 165 ---------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
++P + H L + G+ L + N F N +G I D
Sbjct: 263 GGVLVLGEILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNN--QGTIVD 320
Query: 208 CGSVLTVIECEVYA----VLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMT 263
G+ L + E Y +TA + FS+ I K C + ++ FP ++
Sbjct: 321 SGTTLAYLVQEAYNPFVDAITAA-VSQFSKPIISKGNQCYLVSNSVGDI------FPQVS 373
Query: 264 YHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+F GA +V+ PE+ + + DS + +G TILG + FVYDL
Sbjct: 374 LNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDL 430
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 129/351 (36%), Gaps = 56/351 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++D+ + +T+ C C+ C DP + +Y + C
Sbjct: 85 NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SL 122
D +C S + C Y Y ++ + V D + E P Q FGC +
Sbjct: 145 VDCTCDSD----KNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP---QRAVFGCENS 197
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCLVQPD------------ 168
E+ D S GIMGL S M QL ++ D FS C D
Sbjct: 198 ETGDLFSQHAD---GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMP 254
Query: 169 -------------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+S + +E + +AGK+L + P F +G+ G + D G+ +
Sbjct: 255 APPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF----DGKHGTVLDSGTTYAYL 310
Query: 216 ECEVYAVLTAEFIDYFSQ--HDIEKLFTC-RKCGVTCFNLPARFNS-----FPSMTYHF- 266
+ + F D S H ++K+ CF R S FP + F
Sbjct: 311 PEQAFVA----FKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFG 366
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
G L + PEN + + + G + T+LG NT YD
Sbjct: 367 NGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 81/175 (46%), Gaps = 25/175 (14%)
Query: 5 NHTYMLKLGIGDPVKSL---WFLLDTVAGLTWTQCQPCKSC-----YEQNDPIYNSRSFK 56
TY+++L IG P + + L DT + L+WTQC+PC +C Y +DP S+S +
Sbjct: 119 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP---SKS-R 174
Query: 57 SYKKLPCYDASCKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
++++L C+D C+ +G C + YGD D +
Sbjct: 175 TFRRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGY 234
Query: 112 SVQ-NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
++ ++ FGC+ +E V + GI+ L SF+ QLG DRFS C+
Sbjct: 235 QLERDVAFGCAHVEDSKAV---RGYSTGILALGIGKPSFVTQLG---VDRFSYCI 283
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/362 (23%), Positives = 131/362 (36%), Gaps = 65/362 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K + +DT + + W C C +C + +++ + +
Sbjct: 83 YFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVS 142
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGDVYETKEVDSLDT---STLLPPDEPSPVS 112
C D C + C Y YGD T DT T+L S
Sbjct: 143 CGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANS 202
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------ 164
I FGCS ++ K + GI G + S + QL + P FS CL
Sbjct: 203 SSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENG 262
Query: 165 ---------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
++P + H L + G+ L + N F N +G I D
Sbjct: 263 GGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNN--QGTIVD 320
Query: 208 CGSVLTVIECEVY----AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMT 263
G+ L + E Y +TA + FS+ I K C + ++ FP ++
Sbjct: 321 SGTTLAYLVQEAYNPFVKAITAA-VSQFSKPIISKGNQCYLVSNSVGDI------FPQVS 373
Query: 264 YHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAF------TPRKGKTILGARHQHNTQFVY 316
+F GA +V+ PE+ + + F G A +G TILG + FVY
Sbjct: 374 LNFMGGASMVLNPEHYLMH-----YGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVY 428
Query: 317 DL 318
DL
Sbjct: 429 DL 430
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/372 (21%), Positives = 140/372 (37%), Gaps = 65/372 (17%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQP--CKSCYEQNDPIYNSRSFKSYK 59
F N + + L +G P +++ +LDT + L+W C P ++ + R+ ++
Sbjct: 60 FHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFA 119
Query: 60 KLPCYDASCK-----SPFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
+PC A C+ SP C C ++Y D + + + T+ + P+
Sbjct: 120 SVPCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV---GQGPPLR 176
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-- 170
FGC + D S AG++G+N + SF+ Q RFS C+ D +
Sbjct: 177 AA---FGCMATAFD-TSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV 229
Query: 171 --------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNG 200
+ R+ + Q+ + GK L +P + G
Sbjct: 230 LLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTG 289
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLP--- 253
+ D G+ T + + Y+ L AEF ++ + L F ++ TCF +P
Sbjct: 290 AGQTMVDSGTQFTFLLGDAYSALKAEF-SRQTKPWLPALNDPNFAFQEAFDTCFRVPQGR 348
Query: 254 ARFNSFPSMTYHFQGADLVVEPENVFI------FNHQDSFFFFFGPAFTPRKGKTILGAR 307
A P++T F GA + V + + + FG A ++G
Sbjct: 349 APPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHH 408
Query: 308 HQHNTQFVYDLD 319
HQ N YDL+
Sbjct: 409 HQMNVWVEYDLE 420
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 81/175 (46%), Gaps = 25/175 (14%)
Query: 5 NHTYMLKLGIGDPVKSL---WFLLDTVAGLTWTQCQPCKSC-----YEQNDPIYNSRSFK 56
TY+++L IG P + + L DT + L+WTQC+PC +C Y +DP S+S +
Sbjct: 98 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP---SKS-R 153
Query: 57 SYKKLPCYDASCKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
++++L C+D C+ +G C + YGD D +
Sbjct: 154 TFRRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGY 213
Query: 112 SVQ-NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
++ ++ FGC+ +E V + GI+ L SF+ QLG DRFS C+
Sbjct: 214 QLERDVAFGCAHVEDSKAV---RGYSTGILALGIGKPSFVTQLG---VDRFSYCI 262
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/348 (22%), Positives = 121/348 (34%), Gaps = 57/348 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ +G P + ++ +LDT W C C C + P ++ + +Y L C
Sbjct: 99 YVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGC---SSPTFSPNTSSTYASLQCSVPQ 155
Query: 68 CKS--PFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C C CF+ TYG + S D+ L PS FGC
Sbjct: 156 CTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDTLPS------YSFGC-- 207
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFG--- 178
+ VS G++GL S + Q G L FS C F L G
Sbjct: 208 --VNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGPLG 265
Query: 179 ------------------------DQIIAGKSL-NLPPNSFTIKLNGQRGCINDCGSVLT 213
+ G+ L + P N G I D G+V+T
Sbjct: 266 QPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVIT 325
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVV 273
VYA + EF + ++ F TCF + P +T+HF G DL +
Sbjct: 326 RFVEPVYAAIRDEF-----RKQVKGPFATIGAFDTCFAA-TNEDIAPPVTFHFTGMDLKL 379
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
EN I + S A P ++L Q N + ++D+
Sbjct: 380 PLENTLIHSSAGS-LACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDV 426
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 81/175 (46%), Gaps = 25/175 (14%)
Query: 5 NHTYMLKLGIGDPVKSL---WFLLDTVAGLTWTQCQPCKSC-----YEQNDPIYNSRSFK 56
TY+++L IG P + + L DT + L+WTQC+PC +C Y +DP S+S +
Sbjct: 120 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP---SKS-R 175
Query: 57 SYKKLPCYDASCKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
++++L C+D C+ +G C + YGD D +
Sbjct: 176 TFRRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGY 235
Query: 112 SVQ-NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
++ ++ FGC+ +E V + GI+ L SF+ QLG DRFS C+
Sbjct: 236 QLERDVAFGCAHVEDSKAV---RGYSTGILALGIGKPSFVTQLG---VDRFSYCI 284
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 81/175 (46%), Gaps = 25/175 (14%)
Query: 5 NHTYMLKLGIGDPVKSL---WFLLDTVAGLTWTQCQPCKSC-----YEQNDPIYNSRSFK 56
TY+++L IG P + + L DT + L+WTQC+PC +C Y +DP S+S +
Sbjct: 101 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP---SKS-R 156
Query: 57 SYKKLPCYDASCKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
++++L C+D C+ +G C + YGD D +
Sbjct: 157 TFRRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGY 216
Query: 112 SVQ-NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
++ ++ FGC+ +E V + GI+ L SF+ QLG DRFS C+
Sbjct: 217 QLERDVAFGCAHVEDSKAV---RGYSTGILALGIGKPSFVTQLG---VDRFSYCI 265
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 81/175 (46%), Gaps = 25/175 (14%)
Query: 5 NHTYMLKLGIGDPVKSL---WFLLDTVAGLTWTQCQPCKSC-----YEQNDPIYNSRSFK 56
TY+++L IG P + + L DT + L+WTQC+PC +C Y +DP S+S +
Sbjct: 99 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP---SKS-R 154
Query: 57 SYKKLPCYDASCKSPFHCFEG-----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
++++L C+D C+ +G C + YGD D +
Sbjct: 155 TFRRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGY 214
Query: 112 SVQ-NIRFGCS-LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
++ ++ FGC+ +E V + GI+ L SF+ QLG DRFS C+
Sbjct: 215 QLERDVAFGCAHVEDSKAV---RGYSTGILALGIGKPSFVTQLG---VDRFSYCI 263
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 129/351 (36%), Gaps = 56/351 (15%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++D+ + +T+ C C+ C DP + +Y + C
Sbjct: 85 NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SL 122
D +C S + C Y Y ++ + V D + E P Q FGC +
Sbjct: 145 VDCTCDSD----KNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP---QRAVFGCENS 197
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCLVQPD------------ 168
E+ D S GIMGL S M QL ++ D FS C D
Sbjct: 198 ETGDLFSQHAD---GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMP 254
Query: 169 -------------KSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+S + +E + +AGK+L + P F +G+ G + D G+ +
Sbjct: 255 APPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF----DGKHGTVLDSGTTYAYL 310
Query: 216 ECEVYAVLTAEFIDYFSQ--HDIEKLFTCR-KCGVTCFNLPARFNS-----FPSMTYHF- 266
+ + F D S H ++K+ CF R S FP + F
Sbjct: 311 PEQAFVA----FKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFG 366
Query: 267 QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
G L + PEN + + + G + T+LG NT YD
Sbjct: 367 NGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 72/179 (40%), Gaps = 8/179 (4%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ--PCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y + +G P + + +DT + TW QC PC SC + P+Y R ++ LP D
Sbjct: 160 YYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLY--RPARTADALPASD 217
Query: 66 ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK 125
C+ H C Y I+Y D + V D+ + E +I FGC + +
Sbjct: 218 PLCEGAQHENPNQCDYEISYADGSSSMGVYVRDSMQFV--GEDGERENADIVFGCGYDQQ 275
Query: 126 DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRLEFGDQII 182
+ + G++GL + S QL ++ + F C+ L GD I
Sbjct: 276 GVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYI 334
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 78/371 (21%), Positives = 139/371 (37%), Gaps = 63/371 (16%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + L +G P +++ ++DT + L+W C + +RS SY+ +
Sbjct: 25 FRHNISLTVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSI-SYRPI 83
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
PC ++C + P C C ++Y D ++ + DT + D P V
Sbjct: 84 PCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIPGMV-- 141
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--- 170
FGC S + G+MG+N S SF+ Q+G +FS C+ D S
Sbjct: 142 ----FGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGF---PKFSYCISGTDFSGML 194
Query: 171 --------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNG 200
+ R+ + Q+ ++ + L +P + F G
Sbjct: 195 LLGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTG 254
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDY---FSQHDIEKLFTCRKCGVTCFNLPAR-- 255
+ D G+ T + Y L +EF++ F + + F + C+ +P
Sbjct: 255 AGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQR 314
Query: 256 -FNSFPSMTYHFQGADLVVEPENVF------IFNHQDSFFFFFGPAFTPRKGKTILGARH 308
P+++ F GA++ V E V I + FG + ++G H
Sbjct: 315 VLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHH 374
Query: 309 QHNTQFVYDLD 319
Q N +DL+
Sbjct: 375 QQNVWMEFDLE 385
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 137/345 (39%), Gaps = 51/345 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVA-GLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPC 63
Y + G G PV+ DT G T QC+PC + C+ DP +S S +PC
Sbjct: 145 YHVTAGFGTPVQQFTVGFDTTTTGATQLQCKPCAADEPCHHAFDPSASS----SIAHVPC 200
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C C C ++ + D TL P + V + RF C LE
Sbjct: 201 GSPDCPFNKGCSGHSCTLSVSINNTLLGNATFFTDKLTLTPWN-----IVDDFRFVC-LE 254
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCLVQ--PDKSFHSRLEFGD 179
+ F GI+ L+ +S S + PD FS CL D F S
Sbjct: 255 AG-FRPDDDST--GILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYPSDVGFLSLGATKP 311
Query: 180 QIIAGKSLNLPP--------NSFTIKLNG----------QRGCINDCGSVL------TVI 215
+++ G+ ++ P N + ++L G R I G++L T +
Sbjct: 312 ELL-GRKVSYTPLRSNRHNGNLYVVELVGLGLGGVDLPVPRAAIAGGGTILELHTTFTYL 370
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQG-ADLVV 273
+ +VYA L EF SQ+ + TC+N A + S P++T F G A+ +
Sbjct: 371 KPKVYAALRDEFRKSMSQYPVAPPQGSLD---TCYNFTALSSYSVPAVTLKFDGGAEFDL 427
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ + F S+F AF + G ++G+ Q +T+ VYD+
Sbjct: 428 WIDEMMYFPEPGSYFSVGCLAFVAQDGGAVIGSMAQMSTEVVYDV 472
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 131/356 (36%), Gaps = 58/356 (16%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQND---PIYNSRSFKSYKKL 61
+ + + + +G P +DT + ++W QCQ C CY Q+ P +N+ S +Y+++
Sbjct: 21 NQFFMGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRV 80
Query: 62 PCYDASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C P C E + C Y + Y + S D TL + S
Sbjct: 81 GCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLTL-----ANSYS 135
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVP-DRFSCCL--VQPDK 169
+Q FGC +++ AGI+G S SF Q+ +L FS C Q ++
Sbjct: 136 IQKFIFGCGSDNR-----YNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQENE 190
Query: 170 SFHS--------------------------RLEFGDQIIAGKSLNLPPNSFTIKLNGQRG 203
F S L+ D ++ G L + P +T ++
Sbjct: 191 GFLSIGPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMT---- 246
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMT 263
+ D G+V T + V+ L + ++ ++ P +
Sbjct: 247 -VVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSNGDSVDWSKLPVVE 305
Query: 264 YHFQGADLVVEPENVFIFNHQD-SFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F + L + ENVF + D S F P G ILG R + + V+D+
Sbjct: 306 IKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATRSFRVVFDI 361
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 88/389 (22%), Positives = 139/389 (35%), Gaps = 85/389 (21%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + + +G P +++ +LDT + L+W C S P +N+ SY +
Sbjct: 49 FRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCN--GSYAPPLTPAFNASGSSSYGAV 106
Query: 62 PCYDASCK-----SPFHCF-----EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
PC +C+ P F C ++Y D V L T T L PV
Sbjct: 107 PCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGV--LATDTFLLTGGAPPV 164
Query: 112 SVQNIRFGC--------SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC 163
+V FGC + S + + G++G+N + SF+ Q G RF+ C
Sbjct: 165 AV-GAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFAYC 220
Query: 164 LVQPDKSFHSRLEFGDQIIAGKSLNLPP-------------NSFTIKLNGQR-GC----- 204
+ + L GD LN P +++++L G R GC
Sbjct: 221 IAPGEGP--GVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPI 278
Query: 205 ---------------INDCGSVLTVIECEVYAVLTAEFIDYFSQHDI------EKLFTCR 243
+ D G+ T + + YA L AEF SQ + E F +
Sbjct: 279 PKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFT---SQARLLLAPLGEPGFVFQ 335
Query: 244 KCGVTCFNLP-ARFNS----FPSMTYHFQGADLVVEPENVFIF---------NHQDSFFF 289
CF P AR + P + +GA++ V E + + +
Sbjct: 336 GAFDACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCL 395
Query: 290 FFGPAFTPRKGKTILGARHQHNTQFVYDL 318
FG + ++G HQ N YDL
Sbjct: 396 TFGNSDMAGMSAYVIGHHHQQNVWVEYDL 424
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 80/187 (42%), Gaps = 20/187 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P + + +DT + LTW QC PC SC + P+Y R K+ K +PC D
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY--RPTKN-KLVPCVDQ 114
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + G C Y I Y D + V D+ L + S + +
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLAN--SSIVRPGLA 172
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRL 175
FGC + + S + G++GL S S + QL + + + CL F L
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGF---L 229
Query: 176 EFGDQII 182
FGD I+
Sbjct: 230 FFGDDIV 236
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 123/318 (38%), Gaps = 47/318 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y K+GIG P K + +DT + + W C C+ C + + + S
Sbjct: 87 YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146
Query: 68 CKSPFHCFEGD------------CFYGITYGDVYETKE--VDSLDTSTLLPPDEPSPVSV 113
C F C E + C Y YGD T V + D + +
Sbjct: 147 CDEQF-CLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 114 QNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCL------ 164
+I+FGC + +S D S ++ + GI+G ++S + QL R V F+ CL
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
VQP + H + + LN+ + F + ++G I D
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVF--EAGDRKGTIIDS 323
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQ 267
G+ L + +Y L A+ + QH++E CF R + FP + +HF+
Sbjct: 324 GTTLAYLPELIYEPLVAKILS--QQHNLE--VQTIHGEYKCFQYSERVDDGFPPVIFHFE 379
Query: 268 GADLVVEPENVFIFNHQD 285
+ L+ + ++F +++
Sbjct: 380 NSLLLKVYPHEYLFQYEN 397
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 140/356 (39%), Gaps = 58/356 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++G+G+PV+ L ++DT + + W +C PC+SC + D IYN + +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSS 142
Query: 63 CYDASCKSPFHC-----FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C D C C YGI+Y D ++ + + + + + +I
Sbjct: 143 CSDPLCTGEQAVCSRSGSNSACAYGISYQD--KSTSIGAYVKDDMHYVLQGGNATTSHIF 200
Query: 118 FGCSLESKDFVSIQKKIIA-GIMGLNWDSTSFMVQLG--RLVPDRFSCCLVQPDKSFHSR 174
FGC+ ++I A GIMG S + Q+ R + FS CL +K
Sbjct: 201 FGCA------INITGSWPADGIMGFGQISKTVPNQIATQRNMSRVFSHCL-GGEKHGGGI 253
Query: 175 LEFGDQ-----------------------IIAGKSLNLPPN----SFTIKLNGQRGCIND 207
LEFG++ I+ S LP + S+ + G I D
Sbjct: 254 LEFGEEPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIID 313
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF---NSFPSMTY 264
G+ ++ + +L +E + + KL G+ CF L + SFP++T
Sbjct: 314 SGTSFALLATKANRILFSEIKNLTTAKLGPKLE-----GLQCFYLKSGLTVETSFPNVTL 368
Query: 265 HFQGAD-LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G + ++P+N + + A++ G TI G + YD++
Sbjct: 369 TFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSADGLTIFGEIVLKDKLVFYDVE 424
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 135/366 (36%), Gaps = 61/366 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC---------YEQNDPIYNSR 53
T Y ++ IG P K + +DT + + W C C Q DP +
Sbjct: 80 TATGLYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT 139
Query: 54 SFKSYKKLPCYDASCKS--PFHC--FEGDCFYGITYGDVYETKE---VDSLDTSTLLPPD 106
+ ++ C S S P C C + ITYGD T D + + +
Sbjct: 140 TVGCEQEF-CVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNG 198
Query: 107 EPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL 164
+ +P +V +I FGC + + + + GI+G S + QL R V F+ CL
Sbjct: 199 QTTPSNV-SITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCL 257
Query: 165 --------------VQP---------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQ 201
VQP + H + + G +L LP ++F
Sbjct: 258 DTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTF--DSGDS 315
Query: 202 RGCINDCGSVLTVIECEVY-AVLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARFN- 257
+G I D G+ L + EVY +LTA F + + + E CF +
Sbjct: 316 KGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYEDFI--------CFQFSGSLDE 367
Query: 258 SFPSMTYHFQG-ADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGK--TILGARHQHNTQ 313
FP +T+ F+G L V P + N D + F + GK +LG N
Sbjct: 368 EFPVITFSFEGDLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKL 427
Query: 314 FVYDLD 319
VYDL+
Sbjct: 428 VVYDLE 433
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/309 (21%), Positives = 115/309 (37%), Gaps = 48/309 (15%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY++K +G P ++ LDT W C C C + ++NS + ++K L C
Sbjct: 89 TYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCDAP 145
Query: 67 SCKSPFH--CFEGDCFYGITYGD--VYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
CK + C C + TYG + D++ ST + P FGC
Sbjct: 146 QCKQVPNPTCGGSTCTWNTTYGGSTILSNLTRDTIALSTDIVP---------GYTFGCIQ 196
Query: 123 ESK-----------------DFVSIQKKI--------IAGIMGLNWDSTSFMVQLGRLVP 157
++ F+S + + + LN+ T + G+ +
Sbjct: 197 KTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLR 256
Query: 158 DRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ + L P +S + + K +++P ++ G I D G+V T +
Sbjct: 257 IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVA 316
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
VY + EF + L TC+ P P+MT+ F G ++ + P+N
Sbjct: 317 PVYTAVRDEFRKRVGNAIVSSLGGFD----TCYTGPI---VAPTMTFMFSGMNVTLPPDN 369
Query: 278 VFIFNHQDS 286
+ I + S
Sbjct: 370 LLIRSTAGS 378
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 85/209 (40%), Gaps = 24/209 (11%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
++ Y L IG P + ++DT + + W C C C QN ++ + S KL C
Sbjct: 78 ISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGCPLQNVTFFDPGASSSAVKLAC 137
Query: 64 YDASCKSPFHCFEG--DCFYGITYGD-----VYETKEVDSLDT--STLLPPDEPSPVSVQ 114
D C S H G Y + Y D Y ++ S +T S+ L +P
Sbjct: 138 SDKRCFSDLHKKSGCSPLEYKVEYSDGSFTSGYYISDLISFETVMSSNLTVKSSAP---- 193
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCLVQPDKSFH 172
FGCS +S+ + I GI+GL + QL RL P+ FS CL
Sbjct: 194 -FVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLSGGQ---- 248
Query: 173 SRLEFGDQIIAGKSLNLPPNSFTIKLNGQ 201
E G II G++ LP +T + Q
Sbjct: 249 ---EGGGVIILGEN-RLPNTVYTPLVRSQ 273
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 129/350 (36%), Gaps = 68/350 (19%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY-DASCKSPF 72
IG P + ++DT + +T+ C C C DP + +Y + C D +C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT-- 59
Query: 73 HCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLESKDFVSI 130
E D C Y Y ++ + + D + E P Q FGC + E+ D S
Sbjct: 60 ---ENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP---QRAVFGCENAETGDLFSQ 113
Query: 131 QKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------------------------ 164
GIMGL S + QL ++ D FS C
Sbjct: 114 HAD---GIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFS 170
Query: 165 -VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVL 223
PD+S + +E +AGK L++ P F +G+ G I D G+ YA L
Sbjct: 171 HSDPDRSPYYNIELRGLHVAGKKLDINPQVF----DGKHGTILDSGT--------TYAYL 218
Query: 224 TAEFIDYFSQHDIEKLFTCRKC-------GVTCFN-----LPARFNSFPSMTYHF-QGAD 270
F Q +L ++ CF+ +P + +FPS+ F G
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEK 278
Query: 271 LVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ PEN ++F H + G + T+LG NT YD +
Sbjct: 279 YSLSPEN-YLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRE 327
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 129/350 (36%), Gaps = 68/350 (19%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY-DASCKSPF 72
IG P + ++DT + +T+ C C C DP + +Y + C D +C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT-- 59
Query: 73 HCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SLESKDFVSI 130
E D C Y Y ++ + + D + E P Q FGC + E+ D S
Sbjct: 60 ---ENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP---QRAVFGCENAETGDLFSQ 113
Query: 131 QKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------------------------ 164
GIMGL S + QL ++ D FS C
Sbjct: 114 HAD---GIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFS 170
Query: 165 -VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVL 223
PD+S + +E +AGK L++ P F +G+ G I D G+ YA L
Sbjct: 171 HSDPDRSPYYNIELRGLHVAGKKLDINPQVF----DGKHGTILDSGT--------TYAYL 218
Query: 224 TAEFIDYFSQHDIEKLFTCRKC-------GVTCFN-----LPARFNSFPSMTYHF-QGAD 270
F Q +L ++ CF+ +P + +FPS+ F G
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEK 278
Query: 271 LVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ PEN ++F H + G + T+LG NT YD +
Sbjct: 279 YSLSPEN-YLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRE 327
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 127/355 (35%), Gaps = 64/355 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++D+ + +T+ C C+ C DP + +Y + C
Sbjct: 82 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCS 141
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SL 122
D +C S + C Y Y ++ + V D + E P Q FGC +
Sbjct: 142 ADCTCDSD----KSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP---QRAVFGCENS 194
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL---------------- 164
E+ D S GIMGL S M QL ++ D FS C
Sbjct: 195 ETGDLFSQHAD---GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMP 251
Query: 165 ---------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
P +S + +E + +AGK+L L P F + + G + D G+
Sbjct: 252 APPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIF----DSKHGTVLDSGT----- 302
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPSMT 263
YA L + F K+ +K CF N+ +FP +
Sbjct: 303 ---TYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVD 359
Query: 264 YHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F G L + PEN + + + G + T+LG NT YD
Sbjct: 360 MVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 414
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/354 (23%), Positives = 130/354 (36%), Gaps = 62/354 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN---DPIYNSRSFKSYKKLPCY 64
Y ++ IG P + ++DT + +T+ C C C DP + + SY+ + C
Sbjct: 99 YTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSCN 158
Query: 65 DASCKSPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C + C Y Y ++ +K V D LL S + + FGC E
Sbjct: 159 SPDCITKMCDARVHQCKYERVYAEMSSSKGVLGKD---LLGFGNGSRLQPHPLLFGC--E 213
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL----------------- 164
+ + + + GIMGL S + QL + D FS C
Sbjct: 214 TAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGAIPP 273
Query: 165 --------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIE 216
P++S + LE + + G SLN+P F NG+ G + D G+
Sbjct: 274 PPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF----NGRLGTVLDSGT------ 323
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPSMTY 264
YA L + D F ++L + + CF + A FP + +
Sbjct: 324 --TYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDF 381
Query: 265 HFQGADLV-VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F G V + PEN ++F H + F + T+LG NT YD
Sbjct: 382 VFSGNQKVFLAPEN-YLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYD 434
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 84/204 (41%), Gaps = 20/204 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P + + +DT + LTW QC PC SC + P+Y R K+ K +PC D
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY--RPTKN-KLVPCVDQ 114
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + G C Y I Y D + V D+ L + S + +
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLAN--SSIVRPGLA 172
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRL 175
FGC + + S + G++GL S S + QL + + + CL F L
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGF---L 229
Query: 176 EFGDQIIAGKSLNLPPNSFTIKLN 199
FGD I+ P + + N
Sbjct: 230 FFGDDIVPYSRATWAPMARSTSRN 253
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 138/358 (38%), Gaps = 67/358 (18%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
+N Y +L IG P + ++D+ + +T+ C C+ C + DP + +Y+ + C
Sbjct: 89 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKC 148
Query: 64 -YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
D +C C Y Y + +K V D L+ S ++ Q FGC +
Sbjct: 149 NMDCNCDDDRE----QCVYEREYAEHSSSKGVLGED---LISFGNESQLTPQRAVFGCET 201
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------------- 164
+E+ D S + GI+GL S + QL L+ + F C
Sbjct: 202 VETGDLYSQRAD---GIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGF 258
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
PD+S + ++ +AGK L+L F +G+ G + D G+
Sbjct: 259 DYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVF----DGEHGAVLDSGT---- 310
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCFNLPAR------FNSFPS 261
YA L F + + ++ T ++ TCF + A FPS
Sbjct: 311 ----TYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPS 366
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHNTQFVYD 317
+ F+ G ++ PEN ++F H + G + T+LG NT VYD
Sbjct: 367 VEMVFKSGQSWLLSPEN-YMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYD 423
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 80/187 (42%), Gaps = 20/187 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P + + +DT + LTW QC PC SC + P+Y R K+ K +PC D
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY--RPTKN-KLVPCVDQ 114
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + G C Y I Y D + V D+ L + S + +
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLAN--SSIVRPGLA 172
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRL 175
FGC + + S + G++GL S S + QL + + + CL F L
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGF---L 229
Query: 176 EFGDQII 182
FGD I+
Sbjct: 230 FFGDDIV 236
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 88/207 (42%), Gaps = 28/207 (13%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++ IG+ + + L+DT + L WTQC C C+ + P Y ++++++ C D
Sbjct: 81 VYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHCHIGDVPPYGRSQSRTFQEVSCGDD 140
Query: 67 S-------------CKSPFH---CFEGDCFYGITYGDVYETKEVD---SLDTSTLLPPDE 107
K P + C G C + Y + + V S+DT +
Sbjct: 141 DDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTFHFIDDRR 200
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--V 165
+ + FGC+ ++ V K GI+GL SF+ Q G +FS C+
Sbjct: 201 FDYQAKFRMVFGCA-HQENIVLTAVKECTGILGLGMGDASFLRQTGIT---KFSYCVPPR 256
Query: 166 QPDKSF--HSRLEFGDQI-IAGKSLNL 189
P S+ HS L FG I+GK + L
Sbjct: 257 MPGYSYRRHSWLRFGSHAQISGKKVPL 283
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 80/187 (42%), Gaps = 20/187 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P + + +DT + LTW QC PC SC + P+Y R K+ K +PC D
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY--RPTKN-KLVPCVDQ 114
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + G C Y I Y D + V D+ L + S + +
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLAN--SSIVRPGLA 172
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRL 175
FGC + + S + G++GL S S + QL + + + CL F L
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGF---L 229
Query: 176 EFGDQII 182
FGD I+
Sbjct: 230 FFGDDIV 236
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 124/350 (35%), Gaps = 78/350 (22%)
Query: 15 GDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYDASCKS-- 70
G + +LDT + W +C PC C + DP +S +Y PC ++CK
Sbjct: 157 GSSSPPVTVVLDTAGDVPWMRCVPCTFAQCADY-DPTRSS----TYSAFPCNSSACKQLG 211
Query: 71 --PFHC-FEGDCFYG-ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKD 126
C G C Y +T GD + T S D T+ D V+ RFGCS +
Sbjct: 212 RYANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGDR-----VEGFRFGCSQNEQG 266
Query: 127 FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIA--- 183
Q GIM L S M Q D FS CL P ++ + G I A
Sbjct: 267 SFENQAD---GIMALGRGVQSLMAQTSSTYGDAFSYCL-PPTETTKGFFQIGVPIGASYR 322
Query: 184 -------------------------------GKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
GK LN+P F G + D +++
Sbjct: 323 FVTTPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVFAA------GTVMDSRTII 376
Query: 213 TVIECEVYAVLTAEF---IDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQG 268
T + Y L A F + Y E+L TC++L R+ P + F G
Sbjct: 377 TRLPVTAYGALRAAFRNRMRYRVAPPQEELD-------TCYDLTGVRYPRLPRIALVFDG 429
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ VVE + I + F +P +ILG Q Q ++D+
Sbjct: 430 -NAVVEMDRSGILLNGCLAFASNDDDSSP----SILGNVQQQTIQVLHDV 474
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 85/187 (45%), Gaps = 21/187 (11%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F + +++ + G P ++ +LDT + +TWTQC+ C +C + + +N + +Y
Sbjct: 121 LFDEDGNFLVDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSS 180
Query: 61 LPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C + ++ Y +TYGD + DT TL EPS V Q +FGC
Sbjct: 181 GSCIPGTVENN---------YNMTYGDDSTSVGNYGCDTMTL----EPSDV-FQKFQFGC 226
Query: 121 SLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
+K DF S + G++GL S + Q FS CL P++ L FG+
Sbjct: 227 GRNNKGDFGS----GVDGMLGLGQGQLSTVSQTASKFNKVFSYCL--PEEDSIGSLLFGE 280
Query: 180 QIIAGKS 186
+ + S
Sbjct: 281 KATSQSS 287
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 141/360 (39%), Gaps = 66/360 (18%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKSPFH--------CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C P + C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL----V 165
++ FGCS++ K + AGI G S SF QL PD FS CL
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKAFSYCLPTDET 167
Query: 166 QPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVL 212
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 168 KPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGAQR 227
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSF 259
T + +A+L S + R+ C+ + +++
Sbjct: 228 TSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSAL 287
Query: 260 PSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P + F GA L + P NVF + F A P ILG R + +D+
Sbjct: 288 PLLEIGFAGGAALALSPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 88/389 (22%), Positives = 139/389 (35%), Gaps = 85/389 (21%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + + +G P +++ +LDT + L+W C S P +N+ SY +
Sbjct: 49 FRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCN--GSYAPPLTPAFNASGSSSYGAV 106
Query: 62 PCYDASCK-----SPFHCF-----EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
PC +C+ P F C ++Y D V L T T L PV
Sbjct: 107 PCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGV--LATDTFLLTGGAPPV 164
Query: 112 SVQNIRFGC--------SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC 163
+V FGC + S + + G++G+N + SF+ Q G RF+ C
Sbjct: 165 AV-GAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFAYC 220
Query: 164 LVQPDKSFHSRLEFGDQIIAGKSLNLPP-------------NSFTIKLNGQR-GC----- 204
+ + L GD LN P +++++L G R GC
Sbjct: 221 IAPGEGP--GVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPI 278
Query: 205 ---------------INDCGSVLTVIECEVYAVLTAEFIDYFSQHDI------EKLFTCR 243
+ D G+ T + + YA L AEF SQ + E F +
Sbjct: 279 PKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFT---SQARLLLAPLGEPGFVFQ 335
Query: 244 KCGVTCFNLP-ARFNS----FPSMTYHFQGADLVVEPENVFIF---------NHQDSFFF 289
CF P AR + P + +GA++ V E + + +
Sbjct: 336 GAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCL 395
Query: 290 FFGPAFTPRKGKTILGARHQHNTQFVYDL 318
FG + ++G HQ N YDL
Sbjct: 396 TFGNSDMAGMSAYVIGHHHQQNVWVEYDL 424
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 64/295 (21%), Positives = 109/295 (36%), Gaps = 50/295 (16%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKS 57
T Y ++GIG P K + +DT + + W C C C ++ +Y+ + +
Sbjct: 28 TATRLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSST 87
Query: 58 YKKLPCYDASCKSPFHCFEGDCF------YGITYGDVYETKE--VDSLDTSTLLPPDEPS 109
K+ C C + + C Y +TYGD T V L + D +
Sbjct: 88 GSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 147
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL--- 164
+ + FGC + + + + GI+G +TS + QL V F+ CL
Sbjct: 148 RPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTI 207
Query: 165 -----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
VQP H + + G +L LP + F ++G I
Sbjct: 208 NGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMF--DTGEKKGTI 265
Query: 206 NDCGSVLTVIECEVYA-VLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARFN 257
D G+ LT + VY ++ A F + + H++++ CF R+
Sbjct: 266 IDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEFL--------CFQYVGRYT 312
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 80/377 (21%), Positives = 140/377 (37%), Gaps = 70/377 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPI------YNSRSF 55
F N + + L +G P +++ +LDT + L+W C + + R+
Sbjct: 57 FHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRAS 116
Query: 56 KSYKKLPCYDASCKS-----PFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEP 108
++ +PC C S P C C ++Y D + + D + E
Sbjct: 117 ATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAV---GEA 173
Query: 109 SPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD 168
P+ FGC + D S AG++G+N + SF+ Q RFS C+ D
Sbjct: 174 PPLRSA---FGCMSTAYD-SSPDGVATAGLLGMNRGTLSFVTQAST---RRFSYCISDRD 226
Query: 169 KS----------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTI 196
+ + R+ + Q+ + GK+L +P +
Sbjct: 227 DAGVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAP 286
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNL 252
G + D G+ T + + Y+ L AEF+ ++ + L F ++ TCF +
Sbjct: 287 DHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQ-TKPLLRALDDPSFAFQEALDTCFRV 345
Query: 253 PA----RFNSFPSMTYHFQGADLVVEPENVFI---FNHQDS---FFFFFGPAFTPRKGKT 302
PA P +T F GA++ V + + H+ + + FG A
Sbjct: 346 PAGRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTAY 405
Query: 303 ILGARHQHNTQFVYDLD 319
++G HQ N YDL+
Sbjct: 406 VIGHHHQMNLWVEYDLE 422
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 77/353 (21%), Positives = 127/353 (35%), Gaps = 57/353 (16%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQ---NDPIYNSRSFKSYKKLPCYD 65
+ + +G P +DT + L+W QC+ C+ CY+Q I+N + +Y K+ C
Sbjct: 1 MGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCST 60
Query: 66 ASCKS-------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
+C + C E D C Y + YG + D TL S S+ N
Sbjct: 61 EACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTL-----ASNRSIDNF 115
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVP-DRFSCCLVQPDKSFHSRL 175
FGC + ++ + AGI+G S SF Q+ + FS C + D L
Sbjct: 116 IFGCGED-----NLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR-DHENEGSL 169
Query: 176 EFG---------------------------DQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
G D ++ G L + P + K+ I D
Sbjct: 170 TIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT-----IVDS 224
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQG 268
G+ T I V+ L + + R+ + A +N FP++
Sbjct: 225 GTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIR 284
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDTF 321
+ L + EN F + + F P +G +LG R + + V+D+
Sbjct: 285 STLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAM 337
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 76/354 (21%), Positives = 128/354 (36%), Gaps = 49/354 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P +DT + + W C C C + + ++ S + +
Sbjct: 75 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 134
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV--SV 113
C D C + + C Y YGD T D L E S S
Sbjct: 135 CSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 194
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
+ FGCS + ++ + + GI G S + QL + P FS CL
Sbjct: 195 APVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGG 254
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
V+P+ + H L + G++L + + F + RG I D
Sbjct: 255 GILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFAT--SNSRGTIVDS 312
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMTYHFQ 267
G+ L + E Y F+ + + + T G C+ + + FP ++ +F
Sbjct: 313 GTTLAYLAEEAY----DPFVSAITASIPQSVHTVVSRGNQCYLITSSVTEVFPQVSLNFA 368
Query: 268 -GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDL 318
GA +++ P++ I + + F +G+ TILG + VYDL
Sbjct: 369 GGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 422
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 78/365 (21%), Positives = 134/365 (36%), Gaps = 61/365 (16%)
Query: 6 HTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD 65
++ + +G G ++ LD L W QC+P + + Q P + S+++LP +
Sbjct: 84 YSVVTSVGTGAGRRTYVLALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFRRLPGNN 143
Query: 66 ASC----KSPFHCFEGDC-FYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
A C + + C F+ I + + V S +T + V + GC
Sbjct: 144 AFCLPAPRGHRRTVQDPCKFHSIRLDGSADARGVLSNETLAFAASGQ-QQTEVTGVVIGC 202
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR-----LVPDRFSCCL---VQPDKSFH 172
+ SK F ++AG++GL + S + LG+ + RFS CL H
Sbjct: 203 THNSKGFNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPSHGSSSSDHH 262
Query: 173 SRLEFGDQI---------------------------------IAGKSLNLPPNSFTIKLN 199
+ L F D + +AGK L F ++
Sbjct: 263 TFLRFDDDVPNTQHMVSTKIMYMDSTTSRDFRAYFVSLTGISVAGKPLQDVKELFKRHVH 322
Query: 200 GQ---RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR- 255
GQ GC D G+ V+ Y L + + ++ + CF ++
Sbjct: 323 GQVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPLGLQIVSGQYHL---CFRATSQL 379
Query: 256 FNSFPSMTYHFQ--GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQ 313
+ P++ F A LV+ P+ +F+ D A TI+GA Q + +
Sbjct: 380 WQHLPTVMLQFAETEARLVLPPQRLFVAVGYDICL-----AVVRSYDITIIGAMQQVDKR 434
Query: 314 FVYDL 318
FVYD+
Sbjct: 435 FVYDV 439
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 89/391 (22%), Positives = 139/391 (35%), Gaps = 93/391 (23%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWT------QCQPCKSCYEQNDPIYNSRSFKSYKKL 61
Y +G P + L LLDT + LTW +C+ C S P+++ ++ S + +
Sbjct: 99 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158
Query: 62 PCYDASCK-------SPFHCFEGDCFYGIT------------YGDVYETKEVDSLDTS-T 101
C + SC+ C C G Y VY + L + T
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 218
Query: 102 LLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR------L 155
L P +V GCSL VS+ + +G+ G + S QLG L
Sbjct: 219 LRAPGR----AVPGFVLGCSL-----VSVHQPP-SGLAGFGRGAPSVPAQLGLPKFSYCL 268
Query: 156 VPDRF------SCCLVQPDKSFHSRLEF--------GDQI--------------IAGKSL 187
+ RF S LV +++ GD++ + GK++
Sbjct: 269 LSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAV 328
Query: 188 NLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFI-----DYFSQHDIEKLFTC 242
LP +F G G I D G+ T ++ V+ + + Y D E
Sbjct: 329 RLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGL 388
Query: 243 RKCGVTCFNLP--ARFNSFPSMTYHFQGADLVVEP-ENVFIFNHQDS-----------FF 288
CF LP AR + P +++HF+G ++ P EN F+ + + F
Sbjct: 389 HP----CFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFG 444
Query: 289 FFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G ILG+ Q N YDL+
Sbjct: 445 GGSGAGNEGSGPAIILGSFQQQNYLVEYDLE 475
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 137/356 (38%), Gaps = 58/356 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K +DT + + W C+PC C + + +++ + + KK+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 63 CYDASC---KSPFHCFEG-DCFYGITYGD--VYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C D C C C Y I Y D + K + + T + D + Q +
Sbjct: 134 CDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEV 193
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL---------- 164
FGC + + + G+MG +TS + QL + FS CL
Sbjct: 194 VFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFA 253
Query: 165 -------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ P++ ++ + G + G SL+LP +I NG G I D G+
Sbjct: 254 VGVVDSPKVKTTPMVPNQMHYNVMLMG-MDVDGTSLDLPR---SIVRNG--GTIVDSGTT 307
Query: 212 LTVIECEVYAVLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQG 268
L +Y L + H +E+ F CF+ + +FP +++ F+
Sbjct: 308 LAYFPKVLYDSLIETILARQPVKLHIVEETF-------QCFSFSTNVDEAFPPVSFEFED 360
Query: 269 A-DLVVEPENVFIFNHQDSFFFF----FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ L V P + ++F ++ + F G R +LG N VYDLD
Sbjct: 361 SVKLTVYPHD-YLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLD 415
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 80/357 (22%), Positives = 136/357 (38%), Gaps = 60/357 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K +DT + + W C+PC C + + +++ + + KK+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVG 133
Query: 63 CYDASC---KSPFHCFEG-DCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSVQN 115
C D C C C Y I Y D ++ D L + + P+ Q
Sbjct: 134 CDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLG-QE 192
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL--------- 164
+ FGC + + + G+MG +TS + QL + FS CL
Sbjct: 193 VVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIF 252
Query: 165 --------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
+ P++ ++ + G + G +L+LPP +I NG G I D G+
Sbjct: 253 AVGVVDSPKVKTTPMVPNQMHYNVMLMG-MDVDGTALDLPP---SIMRNG--GTIVDSGT 306
Query: 211 VLTVIECEVYAVLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQ 267
L +Y L + H +E F CF+ + +FP +++ F+
Sbjct: 307 TLAYFPKVLYDSLIETILARQPVKLHIVEDTF-------QCFSFSENVDVAFPPVSFEFE 359
Query: 268 GA-DLVVEPENVFIFNHQDSFFFF----FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ L V P + ++F + + F G R +LG N VYDL+
Sbjct: 360 DSVKLTVYPHD-YLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLE 415
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 141/360 (39%), Gaps = 66/360 (18%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKSPFH--------CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C P + C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL----V 165
++ FGCS++ K + AGI G S SF QL PD FS CL
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKAFSYCLPTDET 167
Query: 166 QPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVL 212
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 168 KPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGAQR 227
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSF 259
T + +A+L S + R+ C+ + +++
Sbjct: 228 TSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSAL 287
Query: 260 PSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P + F GA L + P NVF + F A P ILG R + +D+
Sbjct: 288 PLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 97/267 (36%), Gaps = 44/267 (16%)
Query: 23 FLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFH--CFE 76
+LDT + +TW QC PC + CY Q D +Y+ S C +C P+ C
Sbjct: 146 MVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTN 205
Query: 77 GD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKII 135
+ C Y + Y D T D T+ P +V++ +FGCS + S
Sbjct: 206 NNQCQYRVRYPDGTSTAGTYISDLLTITP-----ATAVRSFQFGCSHGVQGSFSFGSS-A 259
Query: 136 AGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG---------KS 186
AGIM L S + Q FS C P + G +A K+
Sbjct: 260 AGIMALGGGPESLVSQTAATYGRVFSHCFPPPTR--RGFFTLGVPRVAAWRYVLTPMLKN 317
Query: 187 LNLPPNSFTIKLN-----GQR----------GCINDCGSVLTVIECEVYAVLTAEFIDYF 231
+PP + ++L GQR G D + +T + Y L F D
Sbjct: 318 PAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRM 377
Query: 232 SQHDIE----KLFTC-RKCGVTCFNLP 253
+ + L TC GV F LP
Sbjct: 378 AMYQPAPPKGPLDTCYDMAGVRSFALP 404
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 83/363 (22%), Positives = 130/363 (35%), Gaps = 68/363 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P +DT + + W C C C + +++ S S +
Sbjct: 79 YFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVS 138
Query: 63 CYDASCKSPFHCFEGDCF-------YGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C S F C Y YGD V E+ D + +++
Sbjct: 139 CSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIANSS 198
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
S V FGCS ++ I GI G S + QL + P FS CL
Sbjct: 199 ASVV------FGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLK 252
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
++P + H L + G++L + P+ F +N R
Sbjct: 253 GEGNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSIN--R 310
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + E Y + SQ + +C + ++ FP +
Sbjct: 311 GTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQCYLVSTSVG---EIFPLV 367
Query: 263 TYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPAF------TPRKGKTILGARHQHNTQFV 315
+ +F G A +V++PE + F+ G A ++G TILG + FV
Sbjct: 368 SLNFAGSASMVLKPEEYLMH-----LGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFV 422
Query: 316 YDL 318
YDL
Sbjct: 423 YDL 425
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 138/364 (37%), Gaps = 66/364 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+GIG P K + +DT + W C CK C +++ +YN + S K +P
Sbjct: 73 YYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVP 132
Query: 63 CYDASCKSPFHCF--------EGDCFYGITYGDVYET-----KEVDSLDTSTLLPPDEPS 109
C CK C Y YGD T K+V D + D +
Sbjct: 133 CDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQ---VSGDLKT 189
Query: 110 PVSVQNIRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL-- 164
+ ++ FGC + +S D ++ + GI+G + S + QL V F+ CL
Sbjct: 190 ASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNG 249
Query: 165 ------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGC 204
VQP + H + + LNL ++ + +G
Sbjct: 250 VNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQR--DSKGT 307
Query: 205 INDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMT 263
I D G+ L + +Y L + + SQ K+ T TCF + FP++T
Sbjct: 308 IIDSGTTLAYLPDGIYQPLVYKIL---SQQPNLKVQTLHD-EYTCFQYSGSVDDGFPNVT 363
Query: 264 YHFQ-GADLVVEP-------ENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
++F+ G L V P EN++ Q+S G K T+LG N
Sbjct: 364 FYFENGLSLKVYPHDYLFLSENLWCIGWQNS-----GAQSRDSKNMTLLGDLVLSNKLVF 418
Query: 316 YDLD 319
YDL+
Sbjct: 419 YDLE 422
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 137/356 (38%), Gaps = 58/356 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P K +DT + + W C+PC C + + +++ + + KK+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 63 CYDASC---KSPFHCFEG-DCFYGITYGD--VYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C D C C C Y I Y D + K + + T + D + Q +
Sbjct: 134 CDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEV 193
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL---------- 164
FGC + + + G+MG +TS + QL + FS CL
Sbjct: 194 VFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFA 253
Query: 165 -------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSV 211
+ P++ ++ + G + G SL+LP +I NG G I D G+
Sbjct: 254 VGVVDSPKVKTTPMVPNQMHYNVMLMG-MDVDGTSLDLPR---SIVRNG--GTIVDSGTT 307
Query: 212 LTVIECEVYAVLTAEFIDY--FSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQG 268
L +Y L + H +E+ F CF+ + +FP +++ F+
Sbjct: 308 LAYFPKVLYDSLIETILARQPVKLHIVEETF-------QCFSFSTNVDEAFPPVSFEFED 360
Query: 269 A-DLVVEPENVFIFNHQDSFFFF----FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ L V P + ++F ++ + F G R +LG N VYDLD
Sbjct: 361 SVKLTVYPHD-YLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLD 415
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/344 (20%), Positives = 124/344 (36%), Gaps = 47/344 (13%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P ++L +DT W C C C ++ ++K + C
Sbjct: 96 TYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTST---LFAPEKSTTFKNVSCGSP 152
Query: 67 SCK---SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C SP C C + +TYG V DT TL +P P FGC +
Sbjct: 153 ECNKVPSP-SCGTSACTFNLTYGSSSIAANVVQ-DTVTLA--TDPIP----GYTFGCVAK 204
Query: 124 SKD-------------------------FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD 158
+ + S + LN+ + + + + +
Sbjct: 205 TTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRI 264
Query: 159 RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECE 218
+++ L P +S + + K +++PP + G + D G+V T +
Sbjct: 265 KYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLVAP 324
Query: 219 VYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
VY + EF + L G TC+ +P P++T+ F G ++ + +N
Sbjct: 325 VYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPI---VAPTITFMFSGMNVTLPQDN 381
Query: 278 VFIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
+ I + S A P ++L Q N + +YD+
Sbjct: 382 ILIHSTAGS-TSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDV 424
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 139/359 (38%), Gaps = 64/359 (17%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKSPFH--------CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C P + C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR----LVPDRFSCCL----VQ 166
++ FGCS++ K + AGI G S SF QL L FS CL +
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETK 168
Query: 167 PDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVLT 213
P R + G +S+N P S T ++ NGQR I D G+ T
Sbjct: 169 PGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGAQRT 228
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSFP 260
+ +A+L S + R+ C+ + +++ P
Sbjct: 229 SLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSALP 288
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F GA L + P NVF + F A P ILG R + +D+
Sbjct: 289 LLEIGFAGGAALALSPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 68/166 (40%), Gaps = 25/166 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND--PIYNSRSFKSYKKLPCYD 65
Y + + +G P ++DT + L W QC PC C+ + P+ ++ +LPC
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 66 ASCK------SPFHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
+ C+ P C C Y TYG Y L T TL D P + F
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTAGY---LATETLTVGDGTFP----KVAF 203
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
GCS E+ S +GI+GL S + Q L RFS CL
Sbjct: 204 GCSTENGVDNS------SGIVGLGRGPLSLVSQ---LAVGRFSYCL 240
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/332 (22%), Positives = 124/332 (37%), Gaps = 58/332 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y KL +G P + + +DT + + W C C C + + ++ S + +
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 63 CYDASCKSPFHCFEGD-------CFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C + C Y YGD V + + D + S+L+ P+
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV-PNS 199
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
+PV FGCS + + + GI G S + QL + P FS CL
Sbjct: 200 TAPV-----VFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
V+P+ F H + + G++L + P+ F+ NGQ
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTS-NGQ- 312
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + Y + SQ + +C V ++ + FP +
Sbjct: 313 GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTSVG---DIFPPV 369
Query: 263 TYHFQ-GADLVVEPENVFI-FNHQDSFFFFFG 292
+ +F GA + + P++ I N+ S F G
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVASALCFLG 401
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 68/166 (40%), Gaps = 25/166 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND--PIYNSRSFKSYKKLPCYD 65
Y + + +G P ++DT + L W QC PC C+ + P+ ++ +LPC
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 66 ASCK------SPFHC-FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
+ C+ P C C Y TYG Y L T TL D P + F
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTAGY---LATETLTVGDGTFP----KVAF 203
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
GCS E+ S +GI+GL S + Q L RFS CL
Sbjct: 204 GCSTENGVDNS------SGIVGLGRGPLSLVSQ---LAVGRFSYCL 240
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/343 (19%), Positives = 123/343 (35%), Gaps = 49/343 (14%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P ++L +DT W C C C ++ ++K + C
Sbjct: 77 TYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPEKSTTFKNVSCAAP 133
Query: 67 SCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC---- 120
CK + C C + +TYG + +L T+ +P P + FGC
Sbjct: 134 ECKQVPNPGCGVSSCNFNLTYG---SSSIAANLVQDTITLATDPVP----SYTFGCVSKT 186
Query: 121 ---------------------SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR 159
S + S + LN+ + + + + +
Sbjct: 187 TGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIK 246
Query: 160 FSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
++ L P +S + + K +++PP + G I D G+V T + V
Sbjct: 247 YTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPV 306
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
Y + EF + + T G TC+N+P P++T+ F G ++ + +N
Sbjct: 307 YVAVRDEF-----RRRVGPKLTVTSLGGFDTCYNVPI---VVPTITFIFTGMNVTLPQDN 358
Query: 278 VFIFNHQDSF--FFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ I + S G ++ Q N + +YD+
Sbjct: 359 ILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDV 401
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 77/353 (21%), Positives = 135/353 (38%), Gaps = 63/353 (17%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
+ Y + +GIG P + + DT + LTWTQC +Q +P+++ S+ + C
Sbjct: 88 DEGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCS 147
Query: 65 DASCK----SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C C C Y Y V E V + ++ TL ++ +S FGC
Sbjct: 148 SKLCTEDNPGTKRCSNKTCRYVYPYVSV-EAAGVLAYESFTLSDNNQHICMS---FGFGC 203
Query: 121 -SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--------------- 164
+L + + +GI+G+ S + + + +L +FS CL
Sbjct: 204 GALTDGNLLGA-----SGILGM---SPAILSMVSQLAIPKFSYCLTPYTDRKSSPLFFGA 255
Query: 165 ------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
+Q +F+ + + + L++P +F +K Q G + D G +
Sbjct: 256 WADLGRYKTTGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALK---QGGTVVDLGCTV 312
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCR--KCGVTCFNLPA-----RFNSFPSMTYH 265
+ + L + H + T R K CF LP+ + P + Y
Sbjct: 313 GQLAEPAFTALKEAVL-----HTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYF 367
Query: 266 FQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GAD+V+ +N F Q+ A P G +I+G Q N ++D+
Sbjct: 368 DGGADMVLPRDNYF----QEPTAGLMCLALVPGGGMSIIGNVQQQNFHLLFDV 416
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 136/363 (37%), Gaps = 74/363 (20%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y +L IG P + ++DT + +T+ C C+ C + DP + +Y+ + C
Sbjct: 74 NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKC- 132
Query: 65 DASCKSPFHC-FEG-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
+ SC +C EG C Y Y ++ + V + D + E P Q FGC +
Sbjct: 133 NPSC----NCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKP---QRAVFGCEN 185
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------------- 164
+E+ D S + GIMGL S + QL ++ D FS C
Sbjct: 186 VETGDLYSQRAD---GIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQI 242
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
P +S + +E + +AGK L L P F K G + D G+
Sbjct: 243 SPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEK----HGTVLDSGTTYAY 298
Query: 215 I-ECEVYAVLTA--EFIDYFSQ--------HDIEKLFTCRKCGVTCFNLPARFNS----- 258
E +A+ A + I + Q HDI CF+ R S
Sbjct: 299 FPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDI------------CFSGAGREVSHLSKV 346
Query: 259 FPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
FP + F G L + PEN + + S + G T+LG NT YD
Sbjct: 347 FPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYD 406
Query: 318 LDT 320
+
Sbjct: 407 REN 409
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 97/267 (36%), Gaps = 44/267 (16%)
Query: 23 FLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFH--CFE 76
+LDT + +TW QC PC + CY Q D +Y+ S C +C P+ C
Sbjct: 171 MVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTN 230
Query: 77 GD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKII 135
+ C Y + Y D T D T+ P +V++ +FGCS + S
Sbjct: 231 NNQCQYRVRYPDGTSTAGTYISDLLTITP-----ATAVRSFQFGCSHGVQGSFSFGSS-A 284
Query: 136 AGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAG---------KS 186
AGIM L S + Q FS C P + G +A K+
Sbjct: 285 AGIMALGGGPESLVSQTAATYGRVFSHCFPPPTR--RGFFTLGVPRVAAWRYVLTPMLKN 342
Query: 187 LNLPPNSFTIKLN-----GQR----------GCINDCGSVLTVIECEVYAVLTAEFIDYF 231
+PP + ++L GQR G D + +T + Y L F D
Sbjct: 343 PAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRM 402
Query: 232 SQHDIE----KLFTC-RKCGVTCFNLP 253
+ + L TC GV F LP
Sbjct: 403 AMYQPAPPKGPLDTCYDMAGVRSFALP 429
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/373 (20%), Positives = 142/373 (38%), Gaps = 69/373 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN---DPIYNSRSFKSY 58
F N + + L +G P +++ ++DT + L+W C ++ + +P+++S SY
Sbjct: 67 FRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS----SY 122
Query: 59 KKLPCYDASCKSPFHCFEGD--------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSP 110
+PC ++C F C ++Y D ++ + DT +
Sbjct: 123 SPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYI------GS 176
Query: 111 VSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS 170
+ N+ FGC S + G+MG+N S SF+ Q+G +FS C+ + D S
Sbjct: 177 SGIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGF---PKFSYCISEYDFS 233
Query: 171 -----------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIK 197
+ R+ + Q+ +A K L +P + F
Sbjct: 234 GLLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPD 293
Query: 198 LNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFS---QHDIEKLFTCRKCGVTCFNLP- 253
G + D G+ T + Y L F++ + + + F + C+ +P
Sbjct: 294 HTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPT 353
Query: 254 --ARFNSFPSMTYHFQGADLVVEPENVFI-----FNHQDSFF-FFFGPAFTPRKGKTILG 305
R PS+T F+GA++ V + + DS F FG + ++G
Sbjct: 354 NQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIG 413
Query: 306 ARHQHNTQFVYDL 318
HQ N +DL
Sbjct: 414 HLHQQNVWMEFDL 426
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 77/362 (21%), Positives = 144/362 (39%), Gaps = 79/362 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P K+ +DT + +W C+ C C+ N R+F + C S
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHT------NPRTFLQSRSTTCAKVS 134
Query: 68 CKSPF--------HCFEG----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C + HC + DC + ++Y D + + DT T + + +
Sbjct: 135 CGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK-----IPS 189
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC--LVQPDKSFHS 173
FGC+L+S F + + + G++G+ S + Q D FS C L + ++ F S
Sbjct: 190 FTFGCNLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPRF-DGFSYCLPLQKSERGFFS 246
Query: 174 R--------------------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQ 201
+ ++ + G+ L L P+ F+ +
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----R 301
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFP 260
+G + D GS L+ I +VL+ + + + + R C+++ + P
Sbjct: 302 KGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERN----CYDMRSVDEGDMP 357
Query: 261 SMTYHF-QGADLVVEPENVFI---FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+++ HF GA + VF+ QD + AF P + +I+G+ Q + + VY
Sbjct: 358 AISLHFDDGARFDLGSHGVFVERSVQEQDVWCL----AFAPTESVSIIGSLMQTSKEVVY 413
Query: 317 DL 318
DL
Sbjct: 414 DL 415
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 83/194 (42%), Gaps = 35/194 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +DT + LTW QC PC+SC + P+Y + +PC +A
Sbjct: 54 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRP---TANSLVPCANA 110
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + H G C Y I Y D ++ V D +L P+ NIR
Sbjct: 111 LCTA-LHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSL-------PMRSSNIR 162
Query: 118 ----FGCSLESKDFVSIQKKIIA---GIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPD 168
FGC + + V + A G++GL S S + QL + + + CL
Sbjct: 163 PGLTFGCGYDQQ--VGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLSTNG 220
Query: 169 KSFHSRLEFGDQII 182
F L FGD I+
Sbjct: 221 GGF---LFFGDDIV 231
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/357 (22%), Positives = 135/357 (37%), Gaps = 65/357 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
T + TY+++ G P ++L +DT W C C C P +S ++KK+
Sbjct: 100 ITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGC-STTTPFAPPKS-TTFKKV 157
Query: 62 PCYDASCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C + CK + C C + TYG + SL T+ +P P FG
Sbjct: 158 GCGASQCKQVRNPTCDGSACAFNFTYG---TSSVAASLVQDTVTLATDPVPAYT----FG 210
Query: 120 CSLESKDFVSIQKKIIAGI-----MGLNWDSTSFMVQLGRLVPDRFSCCL---------- 164
C IQK + + +GL S + Q +L FS CL
Sbjct: 211 C---------IQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSG 261
Query: 165 -------VQPDKSFH--------SRLEFGDQI---IAGKSLNLPPNSFTIKLNGQRGCIN 206
QP + S L + + + + + +++PP + G +
Sbjct: 262 HXDLXPVAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVF 321
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNSFPSMTYH 265
D G+V T + Y + EF S H +KL G TC+ +P P++T+
Sbjct: 322 DSGTVFTRLVEPAYTAVRNEFRRRVSVH--KKLTVTSLGGFDTCYTVPI---VAPTITFM 376
Query: 266 FQGADLVVEPENVFIFNHQDSFF-FFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
F G ++ + P+N+ I + S PA P ++L Q N + ++D+
Sbjct: 377 FSGMNVTLPPDNILIHSTAGSVTCLAMAPA--PDNVNSVLNVIANMQQQNHRVLFDV 431
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 130/357 (36%), Gaps = 66/357 (18%)
Query: 1 MFTLNHT--YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSY 58
M TLN +++ +G G P + ++DT + TW QC C N +N SY
Sbjct: 120 MDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSY 179
Query: 59 KKLPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C ++ D Y + Y D +K V D TL P P +F
Sbjct: 180 SNRSCIPST----------DTNYTMKYEDNSYSKGVFVCDEVTLKPDVFP------KFQF 223
Query: 119 GCSLESKDFVSIQKKIIAGIMGL-NWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
GC D + +G++GL + S + Q +FS C + + S L F
Sbjct: 224 GCG----DSGGGEFGTASGVLGLAKGEQYSLISQTASKFKKKFSYCFPPKEHTLGSLL-F 278
Query: 178 GDQII-AGKSLNL-----PPNSF-------TIKLNGQR-----------GCINDCGSVLT 213
G++ I A SL PP+ I + +R G I D G+V+T
Sbjct: 279 GEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLFASPGTIIDSGTVIT 338
Query: 214 VIECEVYAVLTAEFIDYF------SQHDIEKLFTCRKCGVTCFNLP---ARFNSFPSMTY 264
+ Y L F S EKL TC+NL R P +
Sbjct: 339 RLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLD------TCYNLKGCGGRNIKLPEIVL 392
Query: 265 HFQG-ADLVVEPENVFIFNHQ-DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
HF G D+ + P + N F P TI+G R Q + + VYD++
Sbjct: 393 HFVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSH-VTIIGNRQQVSLKVVYDIE 448
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 142/362 (39%), Gaps = 66/362 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLP 62
+++ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++
Sbjct: 116 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVR 174
Query: 63 CYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----S 228
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL--- 164
++ FGCS++ K + AGI G S SF QL PD FS CL
Sbjct: 229 FMDLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKAFSYCLPTD 282
Query: 165 -VQPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGS 210
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 283 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 342
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFN 257
T + +A+L S + R+ C+ + ++
Sbjct: 343 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 402
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+ P + F GA L + P NVF + F A P ILG R + +
Sbjct: 403 ALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTF 460
Query: 317 DL 318
D+
Sbjct: 461 DI 462
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/351 (22%), Positives = 139/351 (39%), Gaps = 46/351 (13%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDP--IYNSRSFKSYKKL 61
N +++ + +G P +DT A L++ QC+PC C++Q D I++ +S+ ++
Sbjct: 203 NFLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPCTLRCHKQTDAGEIFDPSKSESFSRV 262
Query: 62 PCYDASCKS---PFH-----CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
C + C++ H C E + C Y +T+G + V L L
Sbjct: 263 GCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGT-SSYSVGKLVRDRLAIGKYAKGY 321
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR-FSCCLVQPDKS 170
S + FGCSL+++ + AG++G + SF Q+ LV + FS C D+
Sbjct: 322 SFPDFLFGCSLDTE-----YHQYEAGLVGFADEPFSFFEQVAPLVNYKAFSYCF-PSDRR 375
Query: 171 FHSRLEFGDQIIAGKS-----LNLPPNSFTIKL-----NGQ------RGCINDCGSVLTV 214
L GD + L + + +KL NG I D GS T+
Sbjct: 376 KTGYLSIGDYTRVNSTYTPLFLARQQSRYALKLDEVLVNGMALVTTPSEMIVDSGSRWTI 435
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSF------PSMTYHF-Q 267
+ + + L A + + + R CF A F F P + F
Sbjct: 436 LLSDTFTQLDAAITEAMRPLGYNRNYY-RGSDYICFE-DAHFQQFSDWAALPVVELKFDM 493
Query: 268 GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
G +V++P++ F FN+ +F + G +LG + +D+
Sbjct: 494 GVKMVLQPQSSFHFNNDYGLCTYFMRDASLGSGVQLLGNTMTRSVGITFDI 544
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 126/355 (35%), Gaps = 64/355 (18%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++D+ + +T+ C C+ C DP + SY + C
Sbjct: 86 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCN 145
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SL 122
D +C S + C Y Y ++ + V D + E P Q FGC +
Sbjct: 146 VDCTCDSD----KKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKP---QRAVFGCENS 198
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---------------- 164
E+ D S GIMGL S M QL ++ D FS C
Sbjct: 199 ETGDLFSQHAD---GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVP 255
Query: 165 ---------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
P +S + +E + +AGK+L + F N + G + D G+
Sbjct: 256 APSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVF----NSKHGTVLDSGT----- 306
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKC-------GVTCF-----NLPARFNSFPSMT 263
YA L + F K+ + +K CF N+ FP +
Sbjct: 307 ---TYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVD 363
Query: 264 YHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
F G L + PEN + + + G + T+LG NT YD
Sbjct: 364 MVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYD 418
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 65/141 (46%), Gaps = 10/141 (7%)
Query: 182 IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFT 241
+ G L++ + F + +G G I D G+ +T +E V+ L EFI SQ +++ L
Sbjct: 57 VGGTQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFI---SQSNLQ-LDK 112
Query: 242 CRKCGV-TCFNLPARFN--SFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPR 298
G+ CF+LP+ P + +HF+G DL + E+ I DS A
Sbjct: 113 SSSTGLDVCFSLPSETTQVEVPKLVFHFKGGDLELPAESYMI---ADSKLGVACLAMGAS 169
Query: 299 KGKTILGARHQHNTQFVYDLD 319
G +I G Q N +DL+
Sbjct: 170 NGMSIFGNVQQQNILVNHDLE 190
>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
Length = 642
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/243 (25%), Positives = 99/243 (40%), Gaps = 45/243 (18%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYD- 65
T+ ++ +G P + ++DT + LT C C+ C + DP+++ + K L C+D
Sbjct: 94 THYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGCGQHTDPLFDVSKSTTAKYLACHDF 153
Query: 66 ASCKSPFHCFEGDCFYGITY--GDVYETKEVDSL--DTSTLLPPDEPSPV-SVQNIRF-- 118
SC+S C + C+ +Y G ++E VD L P DE V RF
Sbjct: 154 DSCRS---CEQDRCYISQSYMEGSMWEAVMVDELVWVGGFSSPADEMEGVLKTFGFRFPV 210
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDST---SFMVQLGRLVPDRFSCCLVQ--------- 166
GC + QK+ GIMGL + S+M+ GR+ + F+ C
Sbjct: 211 GCQTKETGLFITQKE--NGIMGLGRHRSTVMSYMLNAGRVTQNLFTLCFAGDGGELVFGG 268
Query: 167 ----------------PDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGS 210
DKS + + D ++ G SL + T +N RG I D G+
Sbjct: 269 VDYSHHTSDVGYTPLLSDKSAYYPVHVKDILLNGVSLGID----TGTINSGRGVIVDSGT 324
Query: 211 VLT 213
T
Sbjct: 325 TDT 327
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/361 (23%), Positives = 131/361 (36%), Gaps = 75/361 (20%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKK 60
FT Y + +G P K ++DT + LTW +C PC C D + ++ +YK
Sbjct: 118 FTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASN----TYKA 173
Query: 61 LPCYDASCKSP--FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
L C D + P + G + D + S + E P V F
Sbjct: 174 LTCAD-DLRLPVLLRLWRRLFHSGRSLRDTLKMAGAASDEL-------EEFPGFV----F 221
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP----------- 167
GC K +S + GI+ L+ S SF Q+G ++FS CL++
Sbjct: 222 GCGSLLKGLISGE----VGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPM 277
Query: 168 --------------------------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQ 201
+ S + + + + L+L P++F LNGQ
Sbjct: 278 VFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTF---LNGQ 334
Query: 202 -RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL-PARFNSF 259
+ I D G+ LT++ V + S + F K CF + P+
Sbjct: 335 DKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAE----FVAIKGLDACFRVPPSSGQGL 390
Query: 260 PSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P +T+HF GAD V P N I F P +I G Q + ++D+
Sbjct: 391 PDITFHFNGGADFVTRPSNYVIDLGSLQCLI-----FVPTNEVSIFGNLQQQDFFVLHDM 445
Query: 319 D 319
D
Sbjct: 446 D 446
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 84/195 (43%), Gaps = 31/195 (15%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSPF 72
IG+P K + +DT + LTW QC PC+SC + P+Y + + +PC +A C +
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRP---TANRLVPCANALCTA-L 56
Query: 73 HCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR----FG 119
H +G C Y I Y D ++ V D+ +L P+ NIR FG
Sbjct: 57 HSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSL-------PMRSSNIRPGLTFG 109
Query: 120 CSLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRLE 176
C + + + I G++GL S S + QL + + + CL F L
Sbjct: 110 CGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGF---LF 166
Query: 177 FGDQIIAGKSLNLPP 191
FGD ++ + P
Sbjct: 167 FGDDVVPSSRVTWVP 181
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 78/197 (39%), Gaps = 25/197 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ IG P + + ++D L WTQC PC+ C+EQ+ P+++ +++ LPC
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 68 C----KSPFHCFEGDCFY--GITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC- 120
C +S +C C Y GD D+ + + + FGC
Sbjct: 117 CESIPESSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAIG----------AAKETLGFGCV 166
Query: 121 SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+ K +I +GI+GL S + Q+ FS CL Q
Sbjct: 167 VMTDKRLKTIGGP--SGIVGLGRTPWSLVTQMNVTA---FSYCLAGKSSGALFLGATAKQ 221
Query: 181 IIAGKSLNLPPNSFTIK 197
+ GK+ + P F IK
Sbjct: 222 LAGGKNSSTP---FVIK 235
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 97/253 (38%), Gaps = 42/253 (16%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYK 59
++ L + Y+L L IG+P K +DT + LTW QC PC C + Y ++
Sbjct: 61 VYPLGYYYVL-LNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKP----NHN 115
Query: 60 KLPCYDASCKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
LPC C P E C Y I Y D + +L T + P + + S
Sbjct: 116 TLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSD--HASSIGALVTDEV--PLKLANGS 171
Query: 113 VQNIR--FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPD 168
+ N+R FGC + ++ AGI+GL QL L + CL
Sbjct: 172 IMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTG 231
Query: 169 KSFHSRLEFGDQIIAGKSL---NLPPNS-------------FTIKLNGQRG--CINDCGS 210
K F L GD+++ + +L NS F K G +G + D GS
Sbjct: 232 KGF---LSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGS 288
Query: 211 VLTVIECEVYAVL 223
T E Y +
Sbjct: 289 SYTYFNAEAYQAI 301
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 78/197 (39%), Gaps = 17/197 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSC-YEQNDPIYNSRSFKSYKKLPCYDA 66
+ + L +G P F + + W C PC C NDP+++S S SY ++PC
Sbjct: 88 FAMNLNLGTPPVQHNFTMALNSEFFWAACSPCVDCNVSTNDPLFSSASSTSYTRIPCTSP 147
Query: 67 SCK-SPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C SP C Y +Y Y + + D + P + +
Sbjct: 148 FCSTSPGFSTNACGSSAVGSTTCLYNFSYSTDYSSAGEMASDVVAMKTPRKTRGNKSLRM 207
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL-VPDRFSCCLVQPDKSFHSRL 175
GC ES + I +G++G SF+ QL + +F C+ P +F ++
Sbjct: 208 SLGCGRESTTLLGILNT--SGLVGFAKTDKSFIGQLAEMDYTSKFIYCV--PSDTFSGKI 263
Query: 176 EFGD-QIIAGKSLNLPP 191
G+ +I + SL+ P
Sbjct: 264 VLGNYKISSHSSLSYTP 280
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 142/362 (39%), Gaps = 66/362 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLP 62
+++ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVR 172
Query: 63 CYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----S 226
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL--- 164
++ FGCS++ K + AGI G S SF QL PD FS CL
Sbjct: 227 FMDLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKAFSYCLPTD 280
Query: 165 -VQPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGS 210
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 281 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFN 257
T + +A+L S + R+ C+ + ++
Sbjct: 341 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 400
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+ P + F GA L + P NVF + F A P ILG R + +
Sbjct: 401 ALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTF 458
Query: 317 DL 318
D+
Sbjct: 459 DI 460
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 97/253 (38%), Gaps = 42/253 (16%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYK 59
++ L + Y+L L IG+P K +DT + LTW QC PC C + Y ++
Sbjct: 61 VYPLGYYYVL-LNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKP----NHN 115
Query: 60 KLPCYDASCKS-------PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
LPC C P E C Y I Y D + +L T + P + + S
Sbjct: 116 TLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSD--HASSIGALVTDEV--PLKLANGS 171
Query: 113 VQNIR--FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPD 168
+ N+R FGC + ++ AGI+GL QL L + CL
Sbjct: 172 IMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTG 231
Query: 169 KSFHSRLEFGDQIIAGKSL---NLPPNS-------------FTIKLNGQRG--CINDCGS 210
K F L GD+++ + +L NS F K G +G + D GS
Sbjct: 232 KGF---LSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGS 288
Query: 211 VLTVIECEVYAVL 223
T E Y +
Sbjct: 289 SYTYFNAEAYQAI 301
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/372 (22%), Positives = 134/372 (36%), Gaps = 69/372 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++ +G P K + +DT + + W C C C ++ Y+ ++ S +
Sbjct: 87 YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVS 146
Query: 63 CYDASC------KSPFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVSV 113
C C K P C Y + YGD T D+L + + P +
Sbjct: 147 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNA 206
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR--FSCCL------- 164
I FGC + + + + GI+G +TS + QL + F+ CL
Sbjct: 207 -TITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGG 265
Query: 165 -------VQPDKSF------------------------HSRLEFGDQIIAGKSLNLPPNS 193
VQP F H + + G +L LP +
Sbjct: 266 IFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHV 325
Query: 194 FTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFID-YFSQHDIEKLFTCRKCGVTCFNL 252
F + ++G I D G+ LT + V+ + +D FS+H + CF
Sbjct: 326 F--ETGEKKGTIIDSGTTLTYLPELVF----KQVMDVVFSKHRDIAFHNLQD--FLCFQY 377
Query: 253 PARF-NSFPSMTYHFQ-GADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTI--LGAR 307
+ FP++T+HF+ L V P F N D + F A + GK I +G
Sbjct: 378 SGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDL 437
Query: 308 HQHNTQFVYDLD 319
N VYDL+
Sbjct: 438 VLSNKLVVYDLE 449
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 142/358 (39%), Gaps = 51/358 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y KLG+G P + + +DT + + W C C C ++D +Y+ + ++ +
Sbjct: 70 YFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVS 129
Query: 63 CYDASCKSPFHC------FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN- 115
C C + F E C Y ITYGD T D T + S QN
Sbjct: 130 CDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNS 189
Query: 116 -IRFGC-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
I FGC +++S S ++ + GI+G ++S + QL V FS CL
Sbjct: 190 SIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGG 249
Query: 165 -------VQPDKS--------FHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
V+P S H + + L LP + F +NG +G + D G
Sbjct: 250 IFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFD-SVNG-KGTVIDSG 307
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQG 268
+ L + VY L + + ++ KL+ + CF + FP + HF+
Sbjct: 308 TTLAYLPDIVYDELIQKVL---ARQPGLKLYLVEQ-QFRCFLYTGNVDRGFPVVKLHFKD 363
Query: 269 A-DLVVEPENVFIFNHQDSFF--FFFGPAFTPRKGK--TILGARHQHNTQFVYDLDTF 321
+ L V P + ++F +D + + + GK T+LG N +YDL+
Sbjct: 364 SLSLTVYPHD-YLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENM 420
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 142/362 (39%), Gaps = 66/362 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLP 62
+++ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVR 172
Query: 63 CYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----S 226
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL--- 164
++ FGCS++ K + AGI G S SF QL PD FS CL
Sbjct: 227 FMDLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKAFSYCLPTD 280
Query: 165 -VQPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGS 210
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 281 ETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFN 257
T + +A+L S + R+ C+ + ++
Sbjct: 341 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 400
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+ P + F GA L + P NVF + F A P ILG R + +
Sbjct: 401 ALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTF 458
Query: 317 DL 318
D+
Sbjct: 459 DI 460
>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
Length = 357
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 140/360 (38%), Gaps = 66/360 (18%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKSPFH--------CFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C P + C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL----V 165
++ FGCS++ K + AGI G S SF QL PD S CL
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKALSYCLPTDET 167
Query: 166 QPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVL 212
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 168 KPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGAQR 227
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSF 259
T + +A+L S + R+ C+ + +++
Sbjct: 228 TSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSAL 287
Query: 260 PSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P + F GA L + P NVF + F A P ILG R + +D+
Sbjct: 288 PLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 90/391 (23%), Positives = 142/391 (36%), Gaps = 93/391 (23%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWT------QCQPCKSCYEQNDPIYNSRSFKSYKKL 61
Y +G P + L LLDT + LTW +C+ C S P+++ ++ S + +
Sbjct: 67 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 126
Query: 62 PCYDASCK-------SPFHCFEGDCFYGIT------------YGDVYETKEVDSLDTS-T 101
C + SC+ C C G Y VY + L + T
Sbjct: 127 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 186
Query: 102 LLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR------L 155
L P +V GCSL VS+ + +G+ G + S QLG L
Sbjct: 187 LRAPGR----AVPGFVLGCSL-----VSVHQPP-SGLAGFGRGAPSVPAQLGLPKFSYCL 236
Query: 156 VPDRF------SCCLVQPDKSFHSRLEF--------GDQI--------------IAGKSL 187
+ RF S LV +++ GD++ + GK++
Sbjct: 237 LSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAV 296
Query: 188 NLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFI-----DYFSQHDIEKLFTC 242
LP +F G G I D G+ T ++ V+ + + Y D E
Sbjct: 297 RLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGL 356
Query: 243 RKCGVTCFNLP--ARFNSFPSMTYHFQGADLVVEP-ENVFIFNHQDSFFF--------FF 291
CF LP AR + P +++HF+G ++ P EN F+ + + F
Sbjct: 357 HP----CFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFS 412
Query: 292 GPAFTPRKGK---TILGARHQHNTQFVYDLD 319
G + +G ILG+ Q N YDL+
Sbjct: 413 GGSGAGNEGSGPAIILGSFQQQNYLVEYDLE 443
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 68/156 (43%), Gaps = 16/156 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +D+ + LTW QC PC+SC E P+Y R KS K +PC
Sbjct: 64 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY--RPTKS-KLVPCVHR 120
Query: 67 SCKSPFHCFEG----------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C S + G C Y I Y D + V D+ L + V+ ++
Sbjct: 121 LCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLTN--GSVARPSV 178
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL 152
FGC + + G++GL S S + QL
Sbjct: 179 AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQL 214
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 79/353 (22%), Positives = 124/353 (35%), Gaps = 65/353 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ + G+G P ++L +D W C C C + P ++ +Y+ +PC
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSPQ 160
Query: 68 CK---SPFHCFEG---DCFYGITYGDV--YETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C SP C G C + +TY DSL + V + FG
Sbjct: 161 CAQVPSP-SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNV---------VVSYTFG 210
Query: 120 C-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-FHSRLEF 177
C + S + V Q G++G SF+ Q FS CL S F L+
Sbjct: 211 CLRVVSGNSVPPQ-----GLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKL 265
Query: 178 G-------------------------DQI---IAGKSLNLPPNSFTIKLNGQRGCINDCG 209
G + I + K + +P ++ G I D G
Sbjct: 266 GPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAG 325
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGA 269
++ T + VYA + F L TC+N+ S P++T+ F GA
Sbjct: 326 TMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGGFD----TCYNVTV---SVPTVTFMFAGA 378
Query: 270 DLVVEP-ENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V P ENV I + GP+ +L + Q N + ++D+
Sbjct: 379 VAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDV 431
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 83/379 (21%), Positives = 125/379 (32%), Gaps = 90/379 (23%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ------PCKSCY-----EQNDPIYNSRSFK 56
Y + +G P + + +LDT + L WT C C++C PIY
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 57 SYKKLPCYDASCK----SPFHCFEGD--CFYGITYGDVYETKEV--DSLDTSTLLPPDEP 108
+ + LPC C S +C +YG+ YG T ++ D L S L
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVSDVLGLSKL------ 187
Query: 109 SPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR------LVPDRFSC 162
+ + FGCSL + + GI G S QLG LV RF
Sbjct: 188 --NRIPDFLFGCSL-------VSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDD 238
Query: 163 CLVQPDKSFHSRLEFGDQ------------------------------IIAGKSLNLPPN 192
D H D ++ GK + +PP
Sbjct: 239 TPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPR 298
Query: 193 SFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL 252
G G I D GS T +E ++ + E + +++ K C+N+
Sbjct: 299 YLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNI 358
Query: 253 PARFN-SFPSMTYHFQ-GADLVVEPENVF-----------IFNHQDSFFFFFGPAFTPRK 299
+ P +T+ F+ GA++ + + F + D GPA
Sbjct: 359 TGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAI---- 414
Query: 300 GKTILGARHQHNTQFVYDL 318
ILG Q N YDL
Sbjct: 415 ---ILGNYQQQNFYIEYDL 430
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 142/362 (39%), Gaps = 66/362 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLP 62
+++ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVR 172
Query: 63 CYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----S 226
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL--- 164
++ FGCS++ K + AGI G S SF QL PD FS CL
Sbjct: 227 FMDLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKAFSYCLPTD 280
Query: 165 -VQPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGS 210
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 281 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFN 257
T + +A+L S + R+ C+ + ++
Sbjct: 341 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 400
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+ P + F GA L + P NVF + F A P ILG R + +
Sbjct: 401 ALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTF 458
Query: 317 DL 318
D+
Sbjct: 459 DI 460
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 79/357 (22%), Positives = 131/357 (36%), Gaps = 57/357 (15%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+ L +G P + L F L +G +W C + ++ S+ KLPC SC
Sbjct: 1 MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSC- 59
Query: 70 SPFHCF------EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV-QNIRFGCSL 122
S F C Y +YG + + D +T+ D V N+ GC
Sbjct: 60 SAFSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATM---DSVRNRKVAANLSLGCGR 116
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL-VPDRFSCCLVQPDKSFHSRLEFGDQI 181
+S + + +G +G + + SFM QL L +F CL P +F +L G+
Sbjct: 117 DSGGLLELLDT--SGFVGFDKGNVSFMGQLSALGYRSKFIYCL--PSDTFRGKLVIGNYK 172
Query: 182 IAGKS--------------------------LNLPPNSFTIKL-----NGQRGCINDCGS 210
+ S +++ N F + + NG G + D +
Sbjct: 173 LRNASISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGGTVIDTTT 232
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-CFNLPARFNSFP---SMTYHF 266
L+ + + Y L +Y + GV C+N+ A + FP ++TYHF
Sbjct: 233 FLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISAN-SDFPPPATLTYHF 291
Query: 267 QGADLVVEPENVFIFNHQDS----FFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G VE F+ + DS G + + ++G Q + YDL+
Sbjct: 292 LGG-AGVEVSTWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDLE 347
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 79/353 (22%), Positives = 124/353 (35%), Gaps = 65/353 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ + G+G P ++L +D W C C C + P ++ +Y+ +PC
Sbjct: 83 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSPQ 141
Query: 68 CK---SPFHCFEG---DCFYGITYGDV--YETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C SP C G C + +TY DSL + V + FG
Sbjct: 142 CAQVPSP-SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNV---------VVSYTFG 191
Query: 120 C-SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS-FHSRLEF 177
C + S + V Q G++G SF+ Q FS CL S F L+
Sbjct: 192 CLRVVSGNSVPPQ-----GLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKL 246
Query: 178 G-------------------------DQI---IAGKSLNLPPNSFTIKLNGQRGCINDCG 209
G + I + K + +P ++ G I D G
Sbjct: 247 GPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAG 306
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGA 269
++ T + VYA + F L TC+N+ S P++T+ F GA
Sbjct: 307 TMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGGFD----TCYNVTV---SVPTVTFMFAGA 359
Query: 270 DLVVEP-ENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
V P ENV I + GP+ +L + Q N + ++D+
Sbjct: 360 VAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDV 412
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/343 (19%), Positives = 123/343 (35%), Gaps = 45/343 (13%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P ++L +DT W C C C ++ ++K + C
Sbjct: 97 TYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTST---LFAPEKSTTFKNVSCGSP 153
Query: 67 SCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C + C C + +TYG V DT TL +P P + FGC ++
Sbjct: 154 QCNQVPNPSCGTSACTFNLTYGSSSIAANVVQ-DTVTLA--TDPIP----DYTFGCVAKT 206
Query: 125 KD-------------------------FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR 159
+ S + LN+ + + + + + +
Sbjct: 207 TGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIK 266
Query: 160 FSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
++ L P +S + + K +++PP + G + D G+V T +
Sbjct: 267 YTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLVAPA 326
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNSFPSMTYHFQGADLVVEPENV 278
Y + EF + L G TC+ +P P++T+ F G ++ + +N+
Sbjct: 327 YTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPI---VAPTITFMFSGMNVTLPEDNI 383
Query: 279 FIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYDL 318
I + S A P ++L Q N + +YD+
Sbjct: 384 LIHSTAGS-TTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDV 425
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 142/362 (39%), Gaps = 66/362 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLP 62
+++ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVR 172
Query: 63 CYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----S 226
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL--- 164
++ FGCS++ K + AGI G S SF QL PD FS CL
Sbjct: 227 FMDLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKAFSYCLPTD 280
Query: 165 -VQPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGS 210
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 281 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFN 257
T + +A+L S + R+ C+ + ++
Sbjct: 341 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 400
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+ P + F GA L + P NVF + F A P ILG R + +
Sbjct: 401 ALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTF 458
Query: 317 DL 318
D+
Sbjct: 459 DI 460
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/259 (25%), Positives = 101/259 (38%), Gaps = 37/259 (14%)
Query: 89 YETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSF 148
Y K+V LD L+ + + + I FGC + + + + GIMG ++SF
Sbjct: 11 YLVKDVVHLD---LVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSF 67
Query: 149 MVQLGRL--VPDRFSCCL--------------VQP--------DKSFHSRLEFGDQIIAG 184
+ QL V F+ CL V P KS H + +
Sbjct: 68 ISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGN 127
Query: 185 KSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRK 244
L L N+F +G I D G+ L + VY L E + + H L T ++
Sbjct: 128 SVLELSSNAF--DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEIL---ASHPELTLHTVQE 182
Query: 245 CGVTCFNLPARFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-- 301
TCF+ + + FP++T+ F + L V P +D++ F + KG
Sbjct: 183 S-FTCFHYTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGAS 241
Query: 302 -TILGARHQHNTQFVYDLD 319
TILG N VYD++
Sbjct: 242 LTILGDMALSNKLVVYDIE 260
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 131/366 (35%), Gaps = 80/366 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++K+ +G P + L D LTW C+ C+ C + + S S +Y C
Sbjct: 97 YLIKISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFFPSES-STYTSAACESYQ 155
Query: 68 CKSPFHCFEGDCFYGITYGDVYETK-----------EVDSLDTSTLLPPDEPS------- 109
C+ IT G V +TK + S L+ D S
Sbjct: 156 CQ-------------ITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQ 202
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDK 169
+S N F C F+ I AGI+GL S Q+ L+ FS CLV
Sbjct: 203 ALSYPNTNFICGT----FIDNWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSS 258
Query: 170 SFHSRLEFG-DQIIAGKSLNLPPNS------------FTIKLNGQRGCIN---------- 206
S++ FG +++G+ + P + + + G R N
Sbjct: 259 KQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSNIY 318
Query: 207 -DCGSVLTV--------IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN 257
D + T +E EV + I+Y ++ KL C K F+
Sbjct: 319 IDWRTTFTSLPHDFYENVEAEVRKAINLTPINY---NNERKLSLCYKS-----ESDHDFD 370
Query: 258 SFPSMTYHFQGADLVVEPENVFIFNHQDSF-FFFFGPAF--TPRKGKTILGARHQHNTQF 314
+ P +T HF AD+ + P N F+ + F F F T R + G+ Q N
Sbjct: 371 A-PPITMHFTNADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIV 429
Query: 315 VYDLDT 320
YDL +
Sbjct: 430 GYDLKS 435
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 124/333 (37%), Gaps = 62/333 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDP----IYNSRSFKS 57
+T Y ++ +G P + + DT + LTW +C+ + ++ + + KS
Sbjct: 95 YTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKS 154
Query: 58 YKKLPCYDASCKS--PFHCFE-----GDCFYGITYGDVYETKEVDSLDTSTLL------- 103
+ + C +C S PF C Y Y D + V D++T+
Sbjct: 155 WAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGR 214
Query: 104 ---PPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRF 160
+Q + GC+ + D S Q G++ L + SF + RF
Sbjct: 215 GGGDSSGGRRAKLQGVVLGCA-ATYDGQSFQSS--DGVLSLGNSNISFASRAAARFGGRF 271
Query: 161 SCCLVQ--PDKSFHSRLEFG--------------------------DQI-IAGKSLNLPP 191
S CLV ++ S L FG D + +AG++L++P
Sbjct: 272 SYCLVDHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPA 331
Query: 192 NSFTIKLNGQRGCINDCGSVLTVIECEVY-AVLTAEFIDYFSQHDIEKLFTCRKCGVTCF 250
+ + + NG G I D G+ LT++ Y AV+TA S+H C+
Sbjct: 332 DVWDVDRNG--GAILDSGTSLTILATPAYRAVVTA-----LSKHLAGLPRVTMDPFEYCY 384
Query: 251 NLP-ARFNSFPSMTYHFQGADLVVEPENVFIFN 282
N A P M HF G+ + P ++ +
Sbjct: 385 NWTDAGALEIPKMEVHFAGSARLEPPAKSYVID 417
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/358 (22%), Positives = 129/358 (36%), Gaps = 58/358 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P + +DT + + W C C +C + ++S S + +
Sbjct: 66 YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVH 125
Query: 63 CYDASCKSPFHCF-------EGDCFYGITYGD--------VYETKEVDSLDTSTLLPPDE 107
C D C S C Y Y D V +T D++ +L+
Sbjct: 126 CSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVN-- 183
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL- 164
S I FGCS +++ K + GI G S + QL + P FS CL
Sbjct: 184 ----SSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239
Query: 165 --------------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQR 202
++P + H L + GK L + P+ F + +
Sbjct: 240 GEGIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFAT--SNSQ 297
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ L + E Y + S + +C + ++ FP
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQCYLVSTSVSQM---FPLA 354
Query: 263 TYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+++F GA +V++PE+ I F + F +G TILG + FVYDL
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQGVTILGDLVLKDKIFVYDL 412
>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 47/93 (50%), Gaps = 9/93 (9%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+KL IG P + +LDT + L WTQC PC CY+Q PI++ ++K + C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFK-----ETRCN 55
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL 102
+P H C Y I Y D T+ + +T T+
Sbjct: 56 TPDH----SCXYKIVYDDKSYTQGTLATETVTI 84
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/373 (21%), Positives = 141/373 (37%), Gaps = 70/373 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + ++ L +G P +++ ++DT + L+W C S DP ++ SY+ +
Sbjct: 25 FHHNVSLIVSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTTFDPTRST----SYQTI 80
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
PC +C + P C + C ++Y D + + D + D +
Sbjct: 81 PCSSPTCTNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD------I 134
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKS--- 170
+ FGC S + G+MG+N S SF+ QLG +FS C+ D S
Sbjct: 135 SGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGF---PKFSYCISGTDFSGLL 191
Query: 171 --------------------------FHSRLEFGDQI----IAGKSLNLPPNSFTIKLNG 200
+ R+ + Q+ + K L +P ++F G
Sbjct: 192 LLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTG 251
Query: 201 QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQ-----HDIEKLFTCRKCGVTCFNLPAR 255
+ D G+ T + VY L + F++ S D + +F + C+ +P
Sbjct: 252 AGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVF--QGAMDLCYLVPLS 309
Query: 256 ---FNSFPSMTYHFQGADLVVEPENVFI-----FNHQDSFF-FFFGPAFTPRKGKTILGA 306
P++T F+GA++ V + V DS FG + ++G
Sbjct: 310 QRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGH 369
Query: 307 RHQHNTQFVYDLD 319
HQ N +DL+
Sbjct: 370 HHQQNVWMEFDLE 382
>gi|222623568|gb|EEE57700.1| hypothetical protein OsJ_08178 [Oryza sativa Japonica Group]
Length = 441
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++G+G P ++DT + LTW QC PC SC+ Q+ P++N +S +Y + C
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQ 181
Query: 67 SCK 69
C
Sbjct: 182 QCS 184
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 73/330 (22%), Positives = 118/330 (35%), Gaps = 41/330 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ + G+G P ++L +D W C C C + P ++ +Y+ +PC
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSPQ 160
Query: 68 CK---SPFHCFEG---DCFYGITYGDV--YETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C SP C G C + +TY DSL + V + FG
Sbjct: 161 CAQVPSP-SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNV---------VVSYTFG 210
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMV----QLGRL-VPDRFSCC--LVQPDKSFH 172
C V+ + AG L + +V LG + P R L P +
Sbjct: 211 C----LRVVNGNSRAAAGAHRLRPRAALLLVADQGHLGPIGQPKRIKTTPLLYNPHRPSL 266
Query: 173 SRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFS 232
+ + K + +P ++ G I D G++ T + VYA + F
Sbjct: 267 YYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVR 326
Query: 233 QHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP-ENVFIFNHQDS---FF 288
L TC+N+ S P++T+ F GA V P ENV I +
Sbjct: 327 TPVAPPLGGFD----TCYNVTV---SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLA 379
Query: 289 FFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
GP+ +L + Q N + ++D+
Sbjct: 380 MAAGPSDGVNAALNVLASMQQQNQRVLFDV 409
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 68/155 (43%), Gaps = 15/155 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +D+ + LTW QC PC+SC E P+Y R KS K +PC
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY--RPTKS-KLVPCVHR 122
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S + G C Y I Y D + V D+ L + V+ ++
Sbjct: 123 LCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN--GSVARPSVA 180
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL 152
FGC + + G++GL S S + QL
Sbjct: 181 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQL 215
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 68/155 (43%), Gaps = 15/155 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +D+ + LTW QC PC+SC E P+Y R KS K +PC
Sbjct: 57 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY--RPTKS-KLVPCVHR 113
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S + G C Y I Y D + V D+ L + V+ ++
Sbjct: 114 LCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN--GSVARPSVA 171
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL 152
FGC + + G++GL S S + QL
Sbjct: 172 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQL 206
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 66/149 (44%), Gaps = 14/149 (9%)
Query: 23 FLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFH--CFE 76
++D+ + + W QCQPC C+ Q DP+++ + +Y +PC A+C P+ C
Sbjct: 163 VIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGPYRRGCSA 222
Query: 77 G-DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKII 135
C +G TY D S D TL P D V+ FGC+ D S +
Sbjct: 223 NVQCQFGFTYTDGATATGTYSSDDLTLGPYDV-----VRGFLFGCA--HADRGSTFSFDV 275
Query: 136 AGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
+G + L + SF+ Q FS C+
Sbjct: 276 SGTLALGGGAQSFVQQTATQYGRVFSYCI 304
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 68/155 (43%), Gaps = 15/155 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +D+ + LTW QC PC+SC E P+Y R KS K +PC
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY--RPTKS-KLVPCVHR 122
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S + G C Y I Y D + V D+ L + V+ ++
Sbjct: 123 LCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN--GSVARPSVA 180
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL 152
FGC + + G++GL S S + QL
Sbjct: 181 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQL 215
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 64/309 (20%), Positives = 114/309 (36%), Gaps = 48/309 (15%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY++K +G P ++ LDT W C C C + ++NS + ++K L C
Sbjct: 89 TYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCDAP 145
Query: 67 SCKSPFH--CFEGDCFYGITYGD--VYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
CK + C C + TYG + D++ ST + P FGC
Sbjct: 146 QCKQVPNPTCGGSTCTWNTTYGGSTILSNLTRDTIALSTDIVP---------GYTFGCIQ 196
Query: 123 ESK-----------------DFVSIQKKI--------IAGIMGLNWDSTSFMVQLGRLVP 157
++ F+S + + + LN+ T + G+ +
Sbjct: 197 KTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLR 256
Query: 158 DRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIEC 217
+ + L P +S + + K +++P ++ G I D G+V T +
Sbjct: 257 IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVA 316
Query: 218 EVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
VY + EF + L TC+ P P+MT+ F G ++ + +N
Sbjct: 317 PVYTAVRDEFRKRVGNAIVSSLGGFD----TCYTGPI---VAPTMTFMFSGMNVTLPTDN 369
Query: 278 VFIFNHQDS 286
+ I + S
Sbjct: 370 LLIRSTAGS 378
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 80/187 (42%), Gaps = 20/187 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P + + +DT + LTW QC PC SC + P+Y R K+ K +PC D
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLY--RPTKN-KIVPCVDQ 114
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S G C Y I Y D + + L T + S + ++
Sbjct: 115 LCSSLHGGLSGKHKCDSPKQQCDYEIKYAD--QGSSLGVLLTDSFAVRLANSSIVRPSLA 172
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRL 175
FGC + + S + G++GL S S + QL + + + CL F L
Sbjct: 173 FGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLSIRGGGF---L 229
Query: 176 EFGDQII 182
FGD ++
Sbjct: 230 FFGDNLV 236
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 69/330 (20%), Positives = 132/330 (40%), Gaps = 47/330 (14%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + L +G P +++ +LDT + L+W C+ + + DP+ +S SY +
Sbjct: 369 FHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSS----SYSPI 424
Query: 62 PCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
PC +C++ H + G+ G + + + +Q +F
Sbjct: 425 PCTSPTCRTRTHS-KTTGLIGMNRGSLSFVTQ-----------------MGLQ--KFSYC 464
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI 181
+ +D +GI+ S S++ L + S L D+ ++ ++
Sbjct: 465 ISGQDS--------SGILLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYT-VQLEGIK 515
Query: 182 IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL-- 239
+A L LP + + G + D G+ T + VY L EF+ ++ ++ L
Sbjct: 516 VANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQ-TKASLKVLED 574
Query: 240 --FTCRKCGVTCFNLPARFNS---FPSMTYHFQGADLVVEPENVF-----IFNHQDSFF- 288
F + C+ +P + P++T F+GA++ V E + + DS +
Sbjct: 575 PNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYC 634
Query: 289 FFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
F FG + I+G HQ N +DL
Sbjct: 635 FTFGNSELLGVESYIIGHHHQQNVWMEFDL 664
>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 47/93 (50%), Gaps = 9/93 (9%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+KL IG P + +LDT + L WTQC PC CY+Q PI++ ++K + C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFK-----ETRCN 55
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL 102
+P H C Y I Y D T+ + +T T+
Sbjct: 56 TPDH----SCSYKIVYDDKSYTQGTLATETVTI 84
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 104/270 (38%), Gaps = 47/270 (17%)
Query: 79 CFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKII 135
C Y I YGD T+ + L T+L V++ FGC +K +
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGTIL---------VKDFIFGCGRNNKGLFGG----V 179
Query: 136 AGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ---------IIAGKS 186
+G+MGL S + Q + FS CL ++ L G I K
Sbjct: 180 SGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKM 239
Query: 187 LNLPP--NSFTIKLNG--------------QRGCINDCGSVLTVIECEVYAVLTAEFIDY 230
+ P N + I L G + D G+V+T + +Y L AEF+
Sbjct: 240 IENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQ 299
Query: 231 FSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQG-ADLVVEPENVFIFNHQDSFF 288
F+ F+ TCFNL A + P++ HF+G A+L V+ VF F D+
Sbjct: 300 FTGFPPAPAFSILD---TCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQ 356
Query: 289 FFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
A + + ILG Q N + +YD
Sbjct: 357 VCLALASLEYQDEVAILGNYQQKNLRVIYD 386
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 134/355 (37%), Gaps = 51/355 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P +DT + + W C C C + + ++ S + +
Sbjct: 78 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIA 137
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGDVYETKE---VDSLDTSTLLPPDEPSPVS 112
C D C + + C Y YGD T D + +T+ + S
Sbjct: 138 CSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTN-S 196
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKS 170
+ FGCS + ++ + + GI G S + QL + P FS CL + D S
Sbjct: 197 TAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL-KGDSS 255
Query: 171 FHSRLEFGDQI---IAGKSLNLPP------NSFTIKLNGQ--------------RGCIND 207
L G+ + I SL +P N +I +NGQ RG I D
Sbjct: 256 GGGILVLGEIVEPNIVYTSL-VPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVD 314
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHF 266
G+ L + E Y F+ + + + T G C+ + + + FP ++ +F
Sbjct: 315 SGTTLAYLAEEAY----DPFVSAITAAIPQSVRTVVSRGNQCYLITSSVTDVFPQVSLNF 370
Query: 267 Q-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDL 318
GA +++ P++ I + + F +G+ TILG + VYDL
Sbjct: 371 AGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 425
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 119/315 (37%), Gaps = 58/315 (18%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY++K G P ++L LDT + W C C C S+ F K +
Sbjct: 96 TYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-------STSKPFAPIKSTSFRNV 148
Query: 67 SCKSPFHCFE--------GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
SC SP HC + C + TYG V DT TL +P P F
Sbjct: 149 SCGSP-HCKQVPNPTCGGSACAFNFTYGSSSIAASVVQ-DTLTLA--TDPIP----GYTF 200
Query: 119 GCSLESKDFVSIQKKIIAGIMGL--------------------NWDSTSFMVQLGRLVPD 158
GC ++ + Q+ ++ G ++ S +F L RL P
Sbjct: 201 GCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSL-RLGPV 259
Query: 159 ------RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
+++ L P +S + + K +++PP + G I D G+V
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNSFPSMTYHFQGADL 271
T + VY + E F + KL G TC+N+P P++T+ F G ++
Sbjct: 320 TRLAEPVYTAVRNE----FRRRVGPKLPVTTLGGFDTCYNVPI---VVPTITFLFSGMNV 372
Query: 272 VVEPENVFIFNHQDS 286
+ P+N+ I + S
Sbjct: 373 TLPPDNIVIHSTAGS 387
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 104/270 (38%), Gaps = 47/270 (17%)
Query: 79 CFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKII 135
C Y I YGD T+ + L T+L V++ FGC +K +
Sbjct: 76 CNYAINYGDGSFTRGELGHEKLKFGTIL---------VKDFIFGCGRNNKGLFGG----V 122
Query: 136 AGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ---------IIAGKS 186
+G+MGL S + Q + FS CL ++ L G I K
Sbjct: 123 SGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKM 182
Query: 187 LNLPP--NSFTIKLNG--------------QRGCINDCGSVLTVIECEVYAVLTAEFIDY 230
+ P N + I L G + D G+V+T + +Y L AEF+
Sbjct: 183 IENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQ 242
Query: 231 FSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSMTYHFQG-ADLVVEPENVFIFNHQDSFF 288
F+ F+ TCFNL A + P++ HF+G A+L V+ VF F D+
Sbjct: 243 FTGFPPAPAFSILD---TCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQ 299
Query: 289 FFFGPAFTPRKGK-TILGARHQHNTQFVYD 317
A + + ILG Q N + +YD
Sbjct: 300 VCLALASLEYQDEVAILGNYQQKNLRVIYD 329
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 80/373 (21%), Positives = 143/373 (38%), Gaps = 69/373 (18%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F N + + + +G P +++ ++DT + L+W C + P +N SY +
Sbjct: 60 FHHNVSLTISITVGTPPQNMSMVIDTGSELSWLHCN-TNTTATIPYPFFNPNISSSYTPI 118
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
C +C + P C + C ++Y D ++ + DT P
Sbjct: 119 SCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPG---- 174
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
I FGC S S G+MG+N S S + QL +P +FS C+ D F
Sbjct: 175 --IVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLK--IP-KFSYCISGSD--FSG 227
Query: 174 RLEFGDQIIA-GKSLNLPP-------------NSFTIKLNGQR----------------- 202
L G+ + G SLN P +++T++L G +
Sbjct: 228 ILLLGESNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDH 287
Query: 203 ----GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL----FTCRKCGVTCFNLP- 253
+ D G+ + + VY L EF++ + + L F + C+ +P
Sbjct: 288 TGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQ-TNGTLRALDDPNFVFQIAMDLCYRVPV 346
Query: 254 --ARFNSFPSMTYHFQGADLVVEPENV------FIFNHQDSFFFFFGPAFTPRKGKTILG 305
+ PS++ F+GA++ V + + F++ + + F FG + I+G
Sbjct: 347 NQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIG 406
Query: 306 ARHQHNTQFVYDL 318
HQ + +DL
Sbjct: 407 HHHQQSMWMEFDL 419
>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 47/93 (50%), Gaps = 9/93 (9%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+KL IG P + +LDT + L WTQC PC CY+Q PI++ ++K + C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFK-----ETRCN 55
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL 102
+P H C Y I Y D T+ + +T T+
Sbjct: 56 TPDH----SCPYKIVYDDKSYTQGTLATETVTI 84
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 116/309 (37%), Gaps = 54/309 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y ++ +G P + L L DT + L W +C SC Q P Y + ++ KLPC D
Sbjct: 91 YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150
Query: 66 ASCK-----SPFHCFE--GDCFYGITYG----DVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C S C +C Y +YG D + T+ + +T TL PS
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAVPS----- 205
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSR 174
+RFGC+ S+ ++ G S + QL F CL D S S
Sbjct: 206 -VRFGCTTASEGGYGSGSGLVGLGRG----PLSLVSQLN---ASTFMYCLTS-DASKASP 256
Query: 175 LEFGD------------QIIAGK---SLNLPPNSF----TIKLNGQRGCINDCGSVLTVI 215
L FG ++A ++NL S T + G + D G+ LT +
Sbjct: 257 LLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPEGVVFDSGTTLTYL 316
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN----SFPSMTYHFQGADL 271
Y+ A F+ S +E CF PA + P+M HF GAD+
Sbjct: 317 AEPAYSEAKAAFLSQTSLDQVEDTDGFEA----CFQKPANGRLSNAAVPTMVLHFDGADM 372
Query: 272 VVEPENVFI 280
+ N +
Sbjct: 373 ALPVANYVV 381
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 119/315 (37%), Gaps = 58/315 (18%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY++K G P ++L LDT + W C C C S+ F K +
Sbjct: 96 TYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-------STSKPFAPIKSTSFRNV 148
Query: 67 SCKSPFHCFE--------GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
SC SP HC + C + TYG V DT TL +P P F
Sbjct: 149 SCGSP-HCKQVPNPTCGGSACAFNFTYGSSSIAASVVQ-DTLTLA--ADPIP----GYTF 200
Query: 119 GCSLESKDFVSIQKKIIAGIMGL--------------------NWDSTSFMVQLGRLVPD 158
GC ++ + Q+ ++ G ++ S +F L RL P
Sbjct: 201 GCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSL-RLGPV 259
Query: 159 ------RFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVL 212
+++ L P +S + + K +++PP + G I D G+V
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNSFPSMTYHFQGADL 271
T + VY + E F + KL G TC+N+P P++T+ F G ++
Sbjct: 320 TRLAEPVYTAVRNE----FRRRVGPKLPVTTLGGFDTCYNVPI---VVPTITFLFSGMNV 372
Query: 272 VVEPENVFIFNHQDS 286
+ P+N+ I + S
Sbjct: 373 ALPPDNIVIHSTAGS 387
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 76/361 (21%), Positives = 127/361 (35%), Gaps = 61/361 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +DT + LTW QC PC+SC +Y+ K + + C
Sbjct: 31 YYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDP---KRARVVDCRRP 87
Query: 67 SCKSPFHCFEGDCFYGITYGDV----YETKEVDSLDTSTLLPPDEPSPVSVQNIRF---- 118
+C + C GDV YE VD T +L D + V RF
Sbjct: 88 TCAQVQRGGQFTC-----SGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRA 142
Query: 119 --GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSR 174
GC + + ++ + G++GL+ S QL + + CL
Sbjct: 143 VIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNG-GGY 201
Query: 175 LEFGDQIIAGKSLNLPP------------NSFTIKLNGQ-----------RGCINDCGSV 211
L FGD ++ + P +IK G+ G + D G+
Sbjct: 202 LFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGAMFDSGTS 261
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ---- 267
T + Y + + + + +E++ T C+ P+ F S ++ +F+
Sbjct: 262 FTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPF-CWRGPSPFESVADVSAYFKTVTL 320
Query: 268 ----------GADLVVEPENVFIFNHQDSF-FFFFGPAFTPRKGKTILGARHQHNTQFVY 316
G L + PE I + Q + + + ILG VY
Sbjct: 321 DFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEVTNILGDISMRGYLVVY 380
Query: 317 D 317
D
Sbjct: 381 D 381
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 76/362 (20%), Positives = 143/362 (39%), Gaps = 79/362 (21%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P K+ +DT + +W C+ C C+ N R+F + C S
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHT------NPRTFLQSRSTTCAKVS 134
Query: 68 CKSPF--------HCFEG----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C + HC + DC + ++Y D + + DT T + +
Sbjct: 135 CGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK-----IPG 189
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC--LVQPDKSFHS 173
FGC+++S F + + + G++G+ S + Q D FS C L + ++ F S
Sbjct: 190 FSFGCNMDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DCFSYCLPLQKSERGFFS 246
Query: 174 R--------------------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQ 201
+ ++ + G+ L L P+ F+ +
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS-----R 301
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFP 260
+G + D GS L+ I +VL+ + + + + R C+++ + P
Sbjct: 302 KGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN----CYDMRSVDEGDMP 357
Query: 261 SMTYHF-QGADLVVEPENVFI---FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+++ HF GA + VF+ QD + AF P + +I+G+ Q + + VY
Sbjct: 358 AISLHFDDGARFDLGSHGVFVERSVQEQDVWCL----AFAPTESVSIIGSLMQTSKEVVY 413
Query: 317 DL 318
DL
Sbjct: 414 DL 415
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 82/361 (22%), Positives = 140/361 (38%), Gaps = 67/361 (18%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC 63
+N Y +L IG P + ++D+ + +T+ C C+ C + DP + +Y+ + C
Sbjct: 90 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKC 149
Query: 64 -YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-S 121
D +C + C Y Y + +K V D L+ S ++ Q FGC +
Sbjct: 150 NMDCNCDDD----KEQCVYEREYAEHSSSKGVLGED---LISFGNESQLTPQRAVFGCET 202
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------------- 164
+E+ D S + GI+GL S + QL L+ + F C
Sbjct: 203 VETGDLYSQRAD---GIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGF 259
Query: 165 ----------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTV 214
PD+S + ++ +AGK L+L F +G+ G + D G+
Sbjct: 260 DYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVF----DGEHGAVLDSGT---- 311
Query: 215 IECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCFNLPARFNS------FPS 261
YA L F + + ++ ++ TCF + A + FPS
Sbjct: 312 ----TYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPS 367
Query: 262 MTYHFQ-GADLVVEPENVFIFNHQDSFFFF-FGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ F+ G ++ PEN ++F H + G + T+LG NT VYD +
Sbjct: 368 VEMIFKSGQSWLLSPEN-YMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRE 426
Query: 320 T 320
Sbjct: 427 N 427
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 69/320 (21%), Positives = 115/320 (35%), Gaps = 51/320 (15%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
++ Y +KL +G PV+ + DT + LTW +C + ++ ++ +S+ +
Sbjct: 110 YSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKC----AGASPPGRVFRPKTSRSWAPI 165
Query: 62 PCYDASCK--SPF---HCFE--GDCFYGITY--GDVYETKEVDSLDTSTLLPPDEPSPVS 112
PC +CK PF +C C Y Y G V + + LP + +
Sbjct: 166 PCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVA--Q 223
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKS 170
++++ GCS S D S + G++ L SF Q FS CLV ++
Sbjct: 224 LKDVVLGCS-SSHDGQSFRSA--DGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRN 280
Query: 171 FHSRLEFG---------------------------DQI-IAGKSLNLPPNSFTIKLNGQR 202
L FG D I +AGK+L++P + K
Sbjct: 281 ATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSG--- 337
Query: 203 GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSM 262
G I D G+ LTV+ Y + A + C P P +
Sbjct: 338 GVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFPPFEHCYNWTARRPGAPEIIPKL 397
Query: 263 TYHFQGADLVVEPENVFIFN 282
F G+ + P ++ +
Sbjct: 398 AVQFAGSARLEPPAKSYVID 417
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 125/352 (35%), Gaps = 77/352 (21%)
Query: 12 LGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKSP 71
LGIG P ++ + DT + L WTQCQPC SC Q +Y+ ++Y L
Sbjct: 92 LGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSSS------ 145
Query: 72 FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQ 131
Y TY T + +T L V+V NI FGC ++ +
Sbjct: 146 ---------YNYTYSKQSFTSGYFATETFAL------GNVTVANITFGCGTRNQGYYD-N 189
Query: 132 KKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ----------- 180
+ G+ S + QLG DRFS C S + G
Sbjct: 190 VAGVFGVGRGGRGGVSLLNQLGI---DRFSYCFSSSGAPGSSAVFLGGSPELATNATTTP 246
Query: 181 -----IIAGKSLNLPPNSFTIKL-------------------NGQRGCINDCGSVLTVIE 216
++A L + + +KL G R + D S +TV++
Sbjct: 247 AASTPMVADPVLK---SGYFVKLVGVTVGATLVDVAGASSAEGGGRALVIDSTSPVTVLD 303
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNSFP-----SMTYHFQG-- 268
Y + + + G+ CF L A + P +MT HF G
Sbjct: 304 EATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAG-GATPTPPNVTMTLHFDGGA 362
Query: 269 ADLVVEPENVFIFNHQDSFFFFFGPAFTP--RKGKTILGARHQHNTQFVYDL 318
ADLV+ P + + +DS TP G +LG+ +T +YDL
Sbjct: 363 ADLVLPPAS---YLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDL 411
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 70/320 (21%), Positives = 124/320 (38%), Gaps = 46/320 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++GIG P +DT + + W C C +C +++D +YN +S + +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 63 CYDASCK----SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN- 115
C C +P + D C Y + YGD T D L S N
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192
Query: 116 -IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---------------------- 152
I FGC + + + + GI+G ++S + QL
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252
Query: 153 ---GRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
G +V + V P+++ H + + +L+LP F +RG I D G
Sbjct: 253 FAIGEVVEPKLKTTPVVPNQA-HYNVVLNGVKVGDTALDLPLGLFETSY--KRGAIIDSG 309
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQG 268
+ L + +Y L + + +Q D+ KL T TCF + FP++T+ F+
Sbjct: 310 TTLAYLPDSIYLPLMEKILG--AQPDL-KLRTVDD-QFTCFVFDKNVDDGFPTVTFKFEE 365
Query: 269 ADLVVEPENVFIFNHQDSFF 288
+ ++ + ++F +D +
Sbjct: 366 SLILTIYPHEYLFQIRDDVW 385
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 71/326 (21%), Positives = 127/326 (38%), Gaps = 51/326 (15%)
Query: 25 LDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--PFHCFEGDCFYG 82
+DT + + W C C C ++NS + +YK L C A CK C G C +
Sbjct: 1 MDTSSDVAWIPCNGCLGCSST---LFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFN 57
Query: 83 ITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-------SLESKD--------- 126
+TYG + S DT TL + +V FGC SL ++
Sbjct: 58 LTYGGSSLAANL-SQDTITL------ATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPL 110
Query: 127 ---------FVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEF 177
+ S + LN+ + + +G+ +++ L P + +
Sbjct: 111 SLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNL 170
Query: 178 GDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIE 237
+ + +++PP SFT + G I D G+V T + Y + F ++ +
Sbjct: 171 MAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAF-----RNRVG 225
Query: 238 KLFTCRKCGV--TCFNLPARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAF 295
+ T G TC+ +P + P++T+ F G ++ + P+N+ I + S A
Sbjct: 226 RNLTVTSLGGFDTCYTVPI---AAPTITFMFTGMNVTLPPDNLLIHSTAGS-TTCLAMAA 281
Query: 296 TPRKGKTILGA---RHQHNTQFVYDL 318
P ++L Q N + +YD+
Sbjct: 282 APDNVNSVLNVIANLQQQNHRLLYDV 307
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 76/349 (21%), Positives = 117/349 (33%), Gaps = 66/349 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ---------PCKSCYEQNDPIYNSRSFKSY 58
Y++ + G P + + + DT + L W QC P K+C + P + + +
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 111
Query: 59 KKLPCYDASC-KSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEP 108
+PC A C P G C Y Y D T + DT+T +
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTAT-ISNGTS 170
Query: 109 SPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD 168
+V+ + FGC ++ G++GL SF Q G L FS CL+ +
Sbjct: 171 GGAAVRGVAFGCGTRNQGG---SFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLE 227
Query: 169 KSFHSR----LEFGDQ----------------------------IIAGKSLNLPPNSFTI 196
R L G + + L +P + + I
Sbjct: 228 GGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAI 287
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL---- 252
+ G G + D GS LT + Y L + F I T + C+N+
Sbjct: 288 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSS 347
Query: 253 ---PARFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTP 297
PA FP +T F QG L + N + D P +P
Sbjct: 348 SLAPAN-GGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSP 395
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 113/286 (39%), Gaps = 35/286 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYK-KLP-CY 64
Y + + IG P K + +DT + LTW QC PC+SC +Y+ + + ++P C
Sbjct: 23 YYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPKKARLVDCRVPLCA 82
Query: 65 DASCKSPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
+ C C Y + Y D T V DT TLL + + I GC
Sbjct: 83 LVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTNGTRSKTTAII--GCGY 140
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+ + ++ G+MGL+ S QL + +V + CL L FGD
Sbjct: 141 DQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAGGSNG-GGYLFFGDS 199
Query: 181 IIAGKSLNLPP---NSFTIKLNGQRGCIN-----------DCGSVLTVIECEVY-AVLTA 225
++ + P S T + G+ G + D G+ T + E Y AVL+A
Sbjct: 200 LVPALGMTWTPIMGKSITGNIGGKSGDADDKTGDIGGVMFDSGTSFTYLVPEAYNAVLSA 259
Query: 226 EFIDYFSQHDIEKLFTCR-KCGVT---CFNLPARFNSFPSMTYHFQ 267
+ +EK R K T C+ P+ F S + +F+
Sbjct: 260 M------EMQVEKSGLVRIKTDNTLPFCWRGPSPFESVADVQRYFK 299
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 72/170 (42%), Gaps = 15/170 (8%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--P 71
IG P + ++D L WTQC C C++Q+ P++ + +++ PC +CKS
Sbjct: 73 IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPT 132
Query: 72 FHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESK-DFVSI 130
+C C Y T + + T T + ++ FGC + S D +
Sbjct: 133 SNCSSNMCTYEGTINSKLGGHTLGIVATDTFA-----IGTATASLGFGCVVASGIDTMGG 187
Query: 131 QKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+G++GL +S + Q+ +FS CL D +SRL G
Sbjct: 188 P----SGLIGLGRAPSSLVSQMNI---TKFSYCLTPHDSGKNSRLLLGSS 230
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 99/250 (39%), Gaps = 41/250 (16%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYN----SRSF 55
++ L + Y+L L IG+P K +DT + LTW QC PC C + P +N S
Sbjct: 61 VYPLGYYYVL-LNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKYK-PNHNTLPCSHIL 118
Query: 56 KSYKKLPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
S LP D C P E C Y I Y D + +L T + P + + S+ N
Sbjct: 119 CSGLDLP-QDRPCADP----EDQCDYEIGYSD--HASSIGALVTDEV--PLKLANGSIMN 169
Query: 116 IR--FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSF 171
+R FGC + ++ AGI+GL QL L + CL K F
Sbjct: 170 LRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGF 229
Query: 172 HSRLEFGDQIIAGKSL---NLPPNS-------------FTIKLNGQRG--CINDCGSVLT 213
L GD+++ + +L NS F K G +G + D GS T
Sbjct: 230 ---LSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYT 286
Query: 214 VIECEVYAVL 223
E Y +
Sbjct: 287 YFNAEAYQAI 296
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 71/363 (19%), Positives = 128/363 (35%), Gaps = 63/363 (17%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ +G P + L +DT W C C C P +N S +++ +PC
Sbjct: 93 TYLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPCGAP 151
Query: 67 SCK-------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS----VQN 115
C + + C + ++YGD SLD + L D + + ++
Sbjct: 152 PCSQAPNPSCTSLAKSKNSCGFSLSYGD-------SSLDAT--LSQDNLAVTANGGVIKG 202
Query: 116 IRFGCSLESKD-----------------FVSIQKKIIAGIM----------GLNWDSTSF 148
FGC +S FV+ K I G N+ +
Sbjct: 203 YTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLT 262
Query: 149 MVQLGRLVPDRFSCC--LVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIN 206
+ + G+ P++ L P + + I KS+ +PP++ G +
Sbjct: 263 LGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVL 322
Query: 207 DCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCFNLPARFNSF 259
D G++ + YA + E + + V TC+N+ ++
Sbjct: 323 DSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNV--STVAW 380
Query: 260 PSMTYHFQGADLVVEP-ENVFI---FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
P++T F G V P ENV I + PA ++G+ Q N + +
Sbjct: 381 PAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVL 440
Query: 316 YDL 318
+D+
Sbjct: 441 FDV 443
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 90/374 (24%), Positives = 130/374 (34%), Gaps = 81/374 (21%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK- 60
F + ++ L IG P + +LDT + L+W QC K+ ++ P +S
Sbjct: 76 FKYSMALVVTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSF 135
Query: 61 --LPCYDASCKS--PFHCFEGDC----------FYG-ITYGDVYETKEVDSLDTSTLLPP 105
LPC CK P DC FY TY + +E + S PP
Sbjct: 136 FVLPCNHPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPP 195
Query: 106 DEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL- 164
I GC+ +S D GI+G+N F Q ++ +FS C+
Sbjct: 196 ----------IILGCATQSDD--------ARGILGMNLGRLGFPSQ-AKIT--KFSYCVP 234
Query: 165 ---VQP-DKSFH-------------SRLEFGDQ------------------IIAGKSLNL 189
QP SF+ + L FG I GK LN+
Sbjct: 235 TKQAQPASGSFYLGNNPASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNI 294
Query: 190 PPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTC 249
PP+ F G + D GS T + E Y V+ E + I+K + C
Sbjct: 295 PPSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNVIREELVKKVGPK-IKKGYMYGGVADIC 353
Query: 250 FNLPA----RFNSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTIL 304
F+ A R M + F+ G +V+ E V G + G I+
Sbjct: 354 FDGDAIEIGRL--VGDMVFEFEKGVQIVIPKERVLATVDGGVHCLGMGRSERLGAGGNII 411
Query: 305 GARHQHNTQFVYDL 318
G HQ N +DL
Sbjct: 412 GNFHQQNLWVEFDL 425
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 78/346 (22%), Positives = 134/346 (38%), Gaps = 62/346 (17%)
Query: 20 SLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASC----KSPFHCF 75
++ FLL L T C P KS + +Y+ K+ +PC D C P
Sbjct: 24 AMVFLLQ----LGCTAC-PKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGC 78
Query: 76 EGD--CFYGITYGD--------VYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL-ES 124
+ D C Y ITYGD V ++ D + + PD S + FGC +S
Sbjct: 79 KQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVI------FGCGAKQS 132
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQL-------------------------GRLVPDR 159
S + + GI+G ++S + QL G+++ +
Sbjct: 133 GSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPK 192
Query: 160 FSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
F+ + P + H + D + G+ + LP + RG I D G+ L + +
Sbjct: 193 FNTTPLVP-RMAHYNVILKDMDVDGEPILLPL--YLFDSGSGRGTIIDSGTTLAYLPLSI 249
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPENV 278
Y L + + Q ++ + + TCF+ + + FP + +HF+G L V P +
Sbjct: 250 YNQLLPKVLG--RQPGLKLMIVEDQ--FTCFHYSDKLDEGFPVVKFHFEGLSLTVHPHDY 305
Query: 279 FIFNHQDSFFFFFGPAFTPRK-GK--TILGARHQHNTQFVYDLDTF 321
+D + + + T K G+ ++G N VYDL+
Sbjct: 306 LFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENM 351
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 76/349 (21%), Positives = 117/349 (33%), Gaps = 66/349 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ---------PCKSCYEQNDPIYNSRSFKSY 58
Y++ + G P + + + DT + L W QC P K+C + P + + +
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 110
Query: 59 KKLPCYDASC-KSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEP 108
+PC A C P G C Y Y D T + DT+T +
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTAT-ISNGTS 169
Query: 109 SPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPD 168
+V+ + FGC ++ G++GL SF Q G L FS CL+ +
Sbjct: 170 GGAAVRGVAFGCGTRNQGG---SFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLE 226
Query: 169 KSFHSR----LEFGDQ----------------------------IIAGKSLNLPPNSFTI 196
R L G + + L +P + + I
Sbjct: 227 GGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAI 286
Query: 197 KLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL---- 252
+ G G + D GS LT + Y L + F I T + C+N+
Sbjct: 287 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSS 346
Query: 253 ---PARFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTP 297
PA FP +T F QG L + N + D P +P
Sbjct: 347 SSAPAN-GGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSP 394
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 82/202 (40%), Gaps = 37/202 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + L IG+P K+ +DT + LTW QC PCK C + D +Y ++ ++PC +
Sbjct: 68 YSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLYKPKN----NRVPCASS 123
Query: 67 SCKSPFH----CFEGDCFYGITYGDVYETKEV-------DSLDTSTLLPPDEPSPVSVQN 115
C++ + C Y + Y D+ + V L+ +LL P
Sbjct: 124 LCQAIQNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQP---------R 174
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSFHS 173
I FGC + K AGI+GL S + QL L + C + F
Sbjct: 175 IAFGCGYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGF-- 232
Query: 174 RLEFGDQIIAGKSLNLPPNSFT 195
L FGD + LPP+ T
Sbjct: 233 -LFFGDHL-------LPPSGIT 246
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 129/356 (36%), Gaps = 52/356 (14%)
Query: 8 YMLKLGIGDPVKSLWFLL-DTVAGLTWTQCQP-CKSCYEQN---DPIYNSRSFKSYKKLP 62
Y + + IG P + L+ DT + LTW C+ CKSC + N ++ + S++ +P
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIP 178
Query: 63 CYDASCKSPFHCF---------EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
C CK + C + Y + V + +T T+ D + +
Sbjct: 179 CSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKK-IRL 237
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHS 173
++ GC+ + G+MGL + S ++L + ++FS CLV S +
Sbjct: 238 FDVLIGCTESFNE----TNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSNH 293
Query: 174 R--LEFGD----------------------------QIIAGKSLNLPPNSFTIKLNGQRG 203
+ L FGD I G S+ L +S + G G
Sbjct: 294 KNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSM-LSISSDIWNVTGVGG 352
Query: 204 CINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFPSM 262
I D G+ LT++ E Y + F +H + CF + P +
Sbjct: 353 MIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFEDKGFDRAAVPRL 412
Query: 263 TYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
HF + P +I + + G G +ILG Q N + YDL
Sbjct: 413 LIHFADGAIFKPPVKSYIIDVAEG-IKCLGIIKADFPGSSILGNVMQQNHLWEYDL 467
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 126/362 (34%), Gaps = 70/362 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN-----------DPIYNSRSFK 56
Y ++ IG P ++DT + +T+ C C C DP + +
Sbjct: 40 YTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSS 99
Query: 57 SYKKLPCYDASCKSPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
SY+K+ C + C + C Y Y ++ +K V D LL S + Q
Sbjct: 100 SYQKIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLGKD---LLDFGPASRLQSQL 156
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCCL--------- 164
+ FGC E+ + + ++ GIMGL S + QL + D FS C
Sbjct: 157 LSFGC--ETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGS 214
Query: 165 ----------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
P +S + LE + + G SL L N F NG+ G I D
Sbjct: 215 MVLGAIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDSNVF----NGKFGTILDS 270
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-----ARFNSFPSMT 263
G+ YA L + F+ + +L + + N P +
Sbjct: 271 GT--------TYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELG 322
Query: 264 YHFQGADLV--------VEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFV 315
HF D V + PEN ++F H + F + T+LG N
Sbjct: 323 KHFPLVDFVFAENQKVSLAPEN-YLFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVT 381
Query: 316 YD 317
YD
Sbjct: 382 YD 383
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 75/356 (21%), Positives = 126/356 (35%), Gaps = 63/356 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
+ L +G+G P + +LD + L WTQC +Q +P++++ S+ LPC
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKL 166
Query: 68 CKSPF----HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLE 123
C++ C + C Y YG + T V + +T T S N+ FGC
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYGIMTATG-VLATETFTFGAHHGVS----ANLTFGCGKL 221
Query: 124 SKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQI-- 181
+ ++ +GI+GL+ S + QL +FS CL S + FG
Sbjct: 222 ANGTIAEA----SGILGLSPGPLSMLKQLAI---TKFSYCLTPFADRKTSPVMFGAMADL 274
Query: 182 --------------------------------IAGKSLNLPPNSFTIKLNGQRGCINDCG 209
+ K L++P + IK +G G + D
Sbjct: 275 GKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSA 334
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCG--VTCFNLPARFN----SFPSMT 263
+ L + + L ++ I+ R CF LP + P +
Sbjct: 335 TTLAYLVEPAFTELKKAVME-----GIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLV 389
Query: 264 YHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNTQFVYDL 318
HF G + P + + F P +G ++G Q N +YD+
Sbjct: 390 LHFDGDAEMSLPRDNY-FQEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDV 444
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 139/359 (38%), Gaps = 64/359 (17%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR----LVPDRFSCCL----VQ 166
++ FGCS++ K + AGI G S SF QL L FS CL +
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETK 168
Query: 167 PDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVLT 213
P R + G +S+N P S T+++ NGQR I D G+ T
Sbjct: 169 PGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGAQRT 228
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSFP 260
+ +A+L S + R+ C+ + +++ P
Sbjct: 229 SLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSALP 288
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F GA L + P NVF + F A P ILG R + +D+
Sbjct: 289 LLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/346 (20%), Positives = 128/346 (36%), Gaps = 51/346 (14%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ +G P + L +DT W C C C + P ++ + SY+ +PC
Sbjct: 109 TYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSP 168
Query: 67 SC-KSP-FHCFEGD--CFYGITYGD--VYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C ++P C G C + +TY D + DSL + +V+ FGC
Sbjct: 169 LCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAG---------DAVKTYTFGC 219
Query: 121 SLESK-----------------DFVSIQKKIIAGIM--------GLNWDSTSFMVQLGRL 155
++ F+S + + G LN+ T + + G+
Sbjct: 220 LQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQP 279
Query: 156 VPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ + L P +S + + K + +PP + G + D G++ T +
Sbjct: 280 PRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRL 339
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
Y + E + + + TCFN A ++P +T F G + +
Sbjct: 340 VAPAYVAVRDEV-----RRRVGAPVSSLGGFDTCFNTTAV--AWPPVTLLFDGMQVTLPE 392
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
ENV I + + A P T+L + Q N + ++D+
Sbjct: 393 ENVVIHSTYGT-ISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDV 437
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 95/219 (43%), Gaps = 25/219 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYNSRSFKSYKKLPC 63
+++ + +G P +DT + L+W QC+PC C+ Q PI++ + +++ + C
Sbjct: 53 FLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVGC 112
Query: 64 YDASCK--------SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEP--SPV 111
+ C C E + C Y ++YG + ++ +L E + +
Sbjct: 113 STSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTTL 172
Query: 112 SVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSF 171
S+ N FGCS++++ S K+ AGI GL + SF L FS CL D++
Sbjct: 173 SLANFVFGCSMDTQ--YSTHKE--AGIFGLGTSNYSFEQIAPLLSYKAFSYCLPS-DEAH 227
Query: 172 HSRLEFGDQIIAGKSLNLPPNS----FTIKLNGQRGCIN 206
L G G ++ P + ++I + G +N
Sbjct: 228 QGYLSIGPDSSGGVPTSMFPGTPRPVYSIGMTGLTVTVN 266
>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
Length = 398
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 81/187 (43%), Gaps = 21/187 (11%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKK 60
+F + +++ + G P + +LDT + +TWTQC+ C +C + + +B + +Y
Sbjct: 121 LFDEDGNFLVDVAFGTPPQXFXLILDTGSSITWTQCKACVNCLQDSXRYFBXSASSTYSX 180
Query: 61 LPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
C + ++ Y +TYGD + T TL EPS V Q +FG
Sbjct: 181 GSCIPXTVENN---------YNMTYGDDSTSVGNYGCXTMTL----EPSDV-FQKFQFGX 226
Query: 121 SLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGD 179
+K DF S G++GL S + Q FS CL P++ L FG+
Sbjct: 227 GRNNKGDFGSGAD----GMLGLGQGQLSTVSQTASKFXKVFSYCL--PEEDSIGSLLFGE 280
Query: 180 QIIAGKS 186
+ + S
Sbjct: 281 KATSQSS 287
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/365 (21%), Positives = 134/365 (36%), Gaps = 64/365 (17%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ----PCKSCYEQNDPIYNSRSFK 56
++ + H Y + + IG+P + + +DT + TW +C PCK+C + P+Y
Sbjct: 33 VYPVGHFY-VTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLYR---LT 88
Query: 57 SYKKLPCYDASC----------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPD 106
K +PC D C K + C Y + Y D + V LD +L
Sbjct: 89 RKKLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKFSL---- 144
Query: 107 EPSPVSVQNIRFGCS---LESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---GRLVPDRF 160
+NI FGC ++ + +K + GI+GL S QL G + +
Sbjct: 145 --PTGGARNIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNVI 202
Query: 161 SCCLVQPDKSFHSRLEFGDQII--------------AGKSLNLPPNSFTIKLNGQR---- 202
CL + L G++ + G+ + P T+ L+
Sbjct: 203 GHCLSSKGGGY---LFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTK 259
Query: 203 --GCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKL--------FTCRKCGVTCFNL 252
I D GS T + ++A L + S+ ++++ + K T +
Sbjct: 260 PLKAIFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQVSDPALPLCWKGPKPFKTVHDT 319
Query: 253 PARFNSFPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNT 312
P F S ++ + G +++ PEN I + FG P + I+G
Sbjct: 320 PKEFKSLVTLKFDL-GVTMIIPPENYLIITGHGN--ACFGILDMPGLDQYIIGDITMQEQ 376
Query: 313 QFVYD 317
+YD
Sbjct: 377 LVIYD 381
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/320 (21%), Positives = 124/320 (38%), Gaps = 46/320 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++GIG P +DT + + W C C +C +++D +YN +S + +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 63 CYDASCK----SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN- 115
C C +P + D C Y + YGD T D L S N
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192
Query: 116 -IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---------------------- 152
I FGC + + + + GI+G ++S + QL
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252
Query: 153 ---GRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCG 209
G +V + V P+++ H + + +L+LP F +RG I D G
Sbjct: 253 FAIGEVVEPKLXNTPVVPNQA-HYNVVLNGVKVGDTALDLPLGLFETSY--KRGAIIDSG 309
Query: 210 SVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF-NSFPSMTYHFQG 268
+ L + +Y L + + +Q D+ KL T TCF + FP++T+ F+
Sbjct: 310 TTLAYLPESIYLPLMEKILG--AQPDL-KLRTVDD-QFTCFVFDKNVDDGFPTVTFKFEE 365
Query: 269 ADLVVEPENVFIFNHQDSFF 288
+ ++ + ++F +D +
Sbjct: 366 SLILTIYPHEYLFQIRDDVW 385
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 65/159 (40%), Gaps = 17/159 (10%)
Query: 35 QCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCKS--PFHCFE---GDCFYGITYGDVY 89
QCQPC SCY Q DP++N + SY +PC +C C E G C Y Y
Sbjct: 2 QCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHG 61
Query: 90 ETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFM 149
TK ++D + + FGCS S + Q +G++GL S +
Sbjct: 62 VTKGTLAIDKLAI------GGDVFHAVVFGCSDSSVGGPAAQA---SGLVGLGRGPLSLV 112
Query: 150 VQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLN 188
QL RF CL P +L G A ++++
Sbjct: 113 SQLSV---HRFMYCLPPPMSRTSGKLVLGAGADAVRNMS 148
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/354 (21%), Positives = 129/354 (36%), Gaps = 49/354 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y +L +G P + + +DT + + W C C C + P+ ++ S + +
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVS 112
C D C + C Y YGD T D L T+L S
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------ 164
I FGCS ++ + + GI G S + QL + P FS CL
Sbjct: 210 AP-IVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSG 268
Query: 165 ---------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCIND 207
V+P+ + H L + G++L + P+ F N +G I D
Sbjct: 269 GGILVLGEIVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSN--QGTIID 326
Query: 208 CGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQ 267
G+ L + Y + S L +C +T ++ + FP ++ +F
Sbjct: 327 SGTTLAYLTEAAYDPFISAITSTVSPSVSPYLSKGNQCYLTSSSIN---DVFPQVSLNFA 383
Query: 268 GA-DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDL 318
G +++ P++ I + + F +G+ TILG + FVYD+
Sbjct: 384 GGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYDI 437
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 66/314 (21%), Positives = 113/314 (35%), Gaps = 47/314 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y K+ +G P +DT + + W C C C + + ++ S + +
Sbjct: 25 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84
Query: 63 CYDASCKSPFHCFEG-------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV--SV 113
C D C + + C Y YGD T D L E S S
Sbjct: 85 CSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 144
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL------- 164
+ FGCS + ++ + + GI G S + QL + P FS CL
Sbjct: 145 APVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGG 204
Query: 165 --------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDC 208
V+P+ + H L + G++L + + F + RG I D
Sbjct: 205 GILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVF--ATSNSRGTIVDS 262
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMTYHFQ 267
G+ L + E Y F+ + + + T G C+ + + FP ++ +F
Sbjct: 263 GTTLAYLAEEAY----DPFVSAITASIPQSVHTAVSRGNQCYLITSSVTEVFPQVSLNFA 318
Query: 268 -GADLVVEPENVFI 280
GA +++ P++ I
Sbjct: 319 GGASMILRPQDYLI 332
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 138/359 (38%), Gaps = 64/359 (17%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR----LVPDRFSCCL----VQ 166
++ FGCS++ K + AGI G S SF QL L FS CL +
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETK 168
Query: 167 PDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVLT 213
P R + G +S+N P S T ++ NGQR I D G+ T
Sbjct: 169 PGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGAQRT 228
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSFP 260
+ +A+L S + R+ C+ + +++ P
Sbjct: 229 SLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSALP 288
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F GA L + P NVF + F A P ILG R + +D+
Sbjct: 289 LLEIGFAGGAALALSPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|255685718|gb|ACU28348.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685720|gb|ACU28349.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685724|gb|ACU28351.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 9/93 (9%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+KL IG P + +LDT + L WTQC PC CY+Q PI++ ++K + C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFK-----ETRCN 55
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL 102
+P H C Y I Y D T + +T T+
Sbjct: 56 TPNH----SCPYKIVYDDKSYTLGTLATETVTI 84
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 141/362 (38%), Gaps = 66/362 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLP 62
+++ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVR 172
Query: 63 CYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----S 226
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL--- 164
++ FGCS++ K + AGI G S SF QL PD S CL
Sbjct: 227 FMDLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKALSYCLPTD 280
Query: 165 -VQPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGS 210
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 281 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFN 257
T + +A+L S + R+ C+ + ++
Sbjct: 341 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 400
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+ P + F GA L + P NVF + F A P ILG R + +
Sbjct: 401 ALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTF 458
Query: 317 DL 318
D+
Sbjct: 459 DI 460
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 141/362 (38%), Gaps = 66/362 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLP 62
+++ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++
Sbjct: 116 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVR 174
Query: 63 CYDASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
C C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----S 228
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL--- 164
++ FGCS++ K + AGI G S SF QL PD S CL
Sbjct: 229 FMDLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKALSYCLPTD 282
Query: 165 -VQPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGS 210
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 283 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 342
Query: 211 VLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFN 257
T + +A+L S + R+ C+ + ++
Sbjct: 343 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 402
Query: 258 SFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVY 316
+ P + F GA L + P NVF + F A P ILG R + +
Sbjct: 403 ALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTF 460
Query: 317 DL 318
D+
Sbjct: 461 DI 462
>gi|222617032|gb|EEE53164.1| hypothetical protein OsJ_35998 [Oryza sativa Japonica Group]
Length = 384
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 100/253 (39%), Gaps = 33/253 (13%)
Query: 81 YGITYGDVYETKEVDSLDTSTLLPPDEPS--PVSVQNIRFGCSLES-KDFVSIQKKIIAG 137
Y +TYG + +TS L D + +V + FGCS S DF +G
Sbjct: 120 YSLTYGG-------SAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGA-----SG 167
Query: 138 IMGLNWDSTSFMVQLGRLVPDRFSCCLVQP----DKSFHSRLEFGDQII----AGKSLNL 189
++G+ + S + QL +FS L+ P D S S + FGD + G+ +
Sbjct: 168 VIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRLDAI 224
Query: 190 PPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTC 249
P +F ++ NG G I + +T +E Y V+ A + C
Sbjct: 225 PAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELD--LC 282
Query: 250 FNLPARFN-SFPSMTYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGAR 307
+N + P +T F G AD+ + N F ++ P +G ++LG
Sbjct: 283 YNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECL---TMLPSQGGSVLGTL 339
Query: 308 HQHNTQFVYDLDT 320
Q T +YD+D
Sbjct: 340 LQTGTNMIYDVDA 352
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/369 (21%), Positives = 127/369 (34%), Gaps = 70/369 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFKSYKKLPCYD 65
Y+ + IGDP + ++DT + L WTQC C+ C+ QN Y+ ++ + + C D
Sbjct: 71 YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACND 130
Query: 66 ASCK--SPFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCS 121
+C S C + C YG V + T P E ++ FGC
Sbjct: 131 TACALGSETRCARDNKACAVLTAYG-AGVIGGVLGTEAFTFQPQSENV-----SLAFGC- 183
Query: 122 LESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL----------------- 164
+ + +GI+GL + S + QLG ++FS CL
Sbjct: 184 IAATRLTPGSLDGASGIIGLGRGNLSLVSQLGD---NKFSYCLTPYFSQSTNTSRLFVGA 240
Query: 165 ---------------------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQ-- 201
V P +F+ L + L +P +F ++
Sbjct: 241 SAGLSSGGAPATSVPFLKNPDVDPFSTFY-YLPLTGITVGDAKLAVPEAAFDLRQVATGL 299
Query: 202 -RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRK---CGVTCFNLPARFN 257
G + D GS T + Y L E + + C +
Sbjct: 300 WAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKL- 358
Query: 258 SFPSMTYHF--QGADLVVEPENVFIFNHQDS----FFFFFGPAFT-PRKGKTILGARHQH 310
P + HF G D+ V PEN + + F GP T P TI+G Q
Sbjct: 359 -VPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQ 417
Query: 311 NTQFVYDLD 319
+ +YDL+
Sbjct: 418 DMHLLYDLE 426
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 77/188 (40%), Gaps = 22/188 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IGDP K + +DT + LTW QC PC+SC + P+Y K +PC +
Sbjct: 52 YYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKP---TKNKLVPCAAS 108
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C + H + C Y I Y D + V D TL P S +
Sbjct: 109 ICTT-LHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTL--PLRNSSSVRPSFT 165
Query: 118 FGCSLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPDKSFHSR 174
FGC + + + + G++GL S S + QL L + CL F
Sbjct: 166 FGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNGGGF--- 222
Query: 175 LEFGDQII 182
L FGD ++
Sbjct: 223 LFFGDNVV 230
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/351 (21%), Positives = 124/351 (35%), Gaps = 90/351 (25%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
YM L IG P + ++ WTQC PC+ C++Q+ P++N ++
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRYEVETM--------- 78
Query: 68 CKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDF 127
+GD D+ T + ++ FGC+++S
Sbjct: 79 -----------------FGDTSGIGGTDTFAIGT----------ATASLAFGCAMDS--- 108
Query: 128 VSIQKKIIA-GIMGLNWDSTSFMVQLGRLVPDRFSCCLV---QPDKSFHSRLEFGDQIIA 183
+I++ + A G++GL S +G++ FS CL K L ++
Sbjct: 109 -NIKQLLGASGVVGLGRTPWSL---VGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAG 164
Query: 184 GKSLNLPP--------NSFTIKLNGQRGCINDC-------GSVLTVIECEVYAVLTAEFI 228
GKS P + + I L G + D GSV+ V + F+
Sbjct: 165 GKSAATTPLVNTSDDSSDYMIHLEGIK--FGDVIIEPPPNGSVVL-----VDTIFGVSFL 217
Query: 229 DYFSQHDIEKLFTCRKCGVTCFNLPAR--------------FNS---FPSMTYHFQGADL 271
+ H I+K T G P + NS P + FQGA
Sbjct: 218 VDAAFHAIKKAVTV-AVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAA 276
Query: 272 VVEPENVFIFNHQDS---FFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
+ P + ++++ + +ILG HQ N F++DLD
Sbjct: 277 LTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLD 327
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/346 (19%), Positives = 128/346 (36%), Gaps = 53/346 (15%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ +G P + L +DT W C C C +N + KSY+ +PC
Sbjct: 107 TYVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSP 164
Query: 67 SC-KSPF---HCFEGDCFYGITYGD--VYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
+C ++P C + +TY D + DSL + + V++ FGC
Sbjct: 165 ACSRAPNPSCSLNTKSCGFSLTYADSSLEAALSQDSLAVANDV---------VKSYTFGC 215
Query: 121 SLESK-----------------DFVSIQKKIIAGIM--------GLNWDSTSFMVQLGRL 155
++ F+S K + G LN+ T + + G+
Sbjct: 216 LQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQP 275
Query: 156 VPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
+ + + LV P +S + + K + +PP + G + D G++ T +
Sbjct: 276 LRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRL 335
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVVEP 275
Y + E + L TC+N + +P +T+ F G + +
Sbjct: 336 VAPAYVAVRDEVRRRIRGAPLSSLGGFD----TCYNTTVK---WPPVTFMFTGMQVTLPA 388
Query: 276 ENVFIFNHQDSFFFFFGPAFTPRKGKTIL---GARHQHNTQFVYDL 318
+N+ I + + A P T+L + Q N + ++D+
Sbjct: 389 DNLVIHSTYGT-TSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDV 433
>gi|242084790|ref|XP_002442820.1| hypothetical protein SORBIDRAFT_08g003350 [Sorghum bicolor]
gi|241943513|gb|EES16658.1| hypothetical protein SORBIDRAFT_08g003350 [Sorghum bicolor]
Length = 194
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 65/160 (40%), Gaps = 20/160 (12%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKK-LPCYDAS 67
+ + +G P +DT + L+W C+ C C+EQ P + +Y+ + C D
Sbjct: 1 MAISLGTPAVFNLVAIDTGSTLSWVNCERCLIRCHEQAGPKLDPHRSATYRHVVGCSDED 60
Query: 68 C-------KSPFHCFEGD----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
C P+ C + D C YG+ YG Y ++ P + V
Sbjct: 61 CLDVQGDNGVPYGCVDDDETDTCLYGLRYGSQYSVGKLGR--DREARPGRQLHYTIVDGF 118
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLV 156
FGCS E F ++ AG++GL SF Q+ RL
Sbjct: 119 VFGCS-EDDRFYGLE----AGVIGLGDKRYSFFNQMARLT 153
>gi|38345728|emb|CAE03531.2| OSJNBa0061C06.12 [Oryza sativa Japonica Group]
Length = 183
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 4/67 (5%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS----CYEQNDPIYNSRSFKSYKKLPC 63
Y + IGDP + ++DT + L WTQC C + C++Q+ P+YN + +S K +PC
Sbjct: 81 YFTEYLIGDPPQHAEAIVDTGSNLVWTQCTDCLAVADRCFKQHLPLYNYSASRSAKPVPC 140
Query: 64 YDASCKS 70
DA C++
Sbjct: 141 TDALCQA 147
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 96/253 (37%), Gaps = 42/253 (16%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYK 59
++ L + Y+L L IG+P K +DT + LTW QC PC C + Y ++
Sbjct: 62 VYPLGYYYVL-LNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKP----NHN 116
Query: 60 KLPC-------YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVS 112
LPC D + P E C Y I Y D + +L T P + + S
Sbjct: 117 TLPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGYSD--HASSIGALVTDEF--PLKLANGS 172
Query: 113 VQN--IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCLVQPD 168
+ N + FGC + ++ AGI+GL QL L + CL
Sbjct: 173 IMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTG 232
Query: 169 KSFHSRLEFGDQIIAGKSL---NLPPNS-------------FTIKLNGQRG--CINDCGS 210
K F L GD+++ + +L NS F K G +G + D GS
Sbjct: 233 KGF---LSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGINVVFDSGS 289
Query: 211 VLTVIECEVYAVL 223
T E Y +
Sbjct: 290 SYTYFNAEAYQAI 302
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 87/191 (45%), Gaps = 28/191 (14%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IGDP K + +DT + LTW QC PC+SC + P+Y R K+ K +PC ++
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLY--RPTKN-KLVPCANS 113
Query: 67 SC------KSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C SP + C Y I Y D + V +D+ + LP S V ++ F
Sbjct: 114 ICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFS-LPLRNKSNVR-PSLSF 171
Query: 119 GCSLESKDFVSIQKKIIA-----GIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSF 171
GC + + + K A G++GL S S + QL + + + CL F
Sbjct: 172 GCGYDQQ----VGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGF 227
Query: 172 HSRLEFGDQII 182
L FGD ++
Sbjct: 228 ---LFFGDDMV 235
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/339 (20%), Positives = 121/339 (35%), Gaps = 71/339 (20%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQNDPIYNSRSFKSYKK 60
+T Y ++ +G P + + DT + LTW +C + ++ + + +S+
Sbjct: 106 YTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAP 165
Query: 61 LPCYDASCKS--PFHCFE-----GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS---- 109
+ C +C S PF C Y Y D + V D++T+ S
Sbjct: 166 IACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGG 225
Query: 110 --PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ- 166
+Q + GC+ S D S Q G++ L + SF + RFS CLV
Sbjct: 226 GRRAKLQGVVLGCT-ASYDGQSFQSSD--GVLSLGNSNISFASRAAARFGGRFSYCLVDH 282
Query: 167 -PDKSFHSRLEFG-------------------------DQIIA-------------GKSL 187
++ S L FG D+ ++ G++L
Sbjct: 283 LAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEAL 342
Query: 188 NLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV 247
++P + + + G G I D G+ LTV+ Y + A E+L + +
Sbjct: 343 DIPADVWDVARGG--GAILDSGTSLTVLATPAYRAVVAAL--------SERLAGLPRVSM 392
Query: 248 T----CFNLPARFNSFPSMTYHFQGADLVVEPENVFIFN 282
C+N A P + F G+ + P ++ +
Sbjct: 393 DPFEYCYNWTAAALEIPGLEVRFAGSARLQPPAKSYVVD 431
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 138/359 (38%), Gaps = 64/359 (17%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR----LVPDRFSCCL----VQ 166
++ FGCS++ K + AGI G S SF QL L S CL +
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETK 168
Query: 167 PDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVLT 213
P R + G +S+N P S T+++ NGQR I D G+ T
Sbjct: 169 PGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGAQRT 228
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSFP 260
+ +A+L S + R+ C+ + +++ P
Sbjct: 229 SLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSALP 288
Query: 261 SMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ F GA L + P NVF + F A P ILG R + +D+
Sbjct: 289 LLEIGFAGGAALALSPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 78/186 (41%), Gaps = 21/186 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKL-PCYD 65
Y + L IG+P K +DT + LTW QC PC+ C I +R +K L C D
Sbjct: 64 YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCT-----IPRNRLYKPNGNLVKCGD 118
Query: 66 ASCK----SPFHCFEG---DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
CK +P H G C Y + Y D + V D L + ++ + F
Sbjct: 119 PLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTN--GSLARPILAF 176
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRLE 176
GC + K AG++GL TS + QL L+ + CL + F L
Sbjct: 177 GCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERGGGF---LF 233
Query: 177 FGDQII 182
FGDQ++
Sbjct: 234 FGDQLV 239
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 116/321 (36%), Gaps = 65/321 (20%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPC- 63
N Y +L IG P + ++D+ + +T+ C C+ C DP + SY + C
Sbjct: 86 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCN 145
Query: 64 YDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC-SL 122
D +C S + C Y Y ++ + V D + S + Q FGC +
Sbjct: 146 VDCTCDSD----KKQCTYERQYAEMSSSSGVLGEDIVSF---GRESELKAQRAVFGCENS 198
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---------------- 164
E+ D S GIMGL S M QL ++ D FS C
Sbjct: 199 ETGDLFSQHAD---GIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVP 255
Query: 165 ---------VQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVI 215
P +S + +E + +AGK+L + F K G + D G+
Sbjct: 256 TPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSK----HGTVLDSGT----- 306
Query: 216 ECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-------TCF-----NLPARFNSFPSMT 263
YA L + F K+ + +K CF N+ FP +
Sbjct: 307 ---TYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVD 363
Query: 264 YHF-QGADLVVEPENVFIFNH 283
F G L + PEN ++F H
Sbjct: 364 MVFGNGQKLSLTPEN-YLFRH 383
>gi|125589489|gb|EAZ29839.1| hypothetical protein OsJ_13898 [Oryza sativa Japonica Group]
Length = 278
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC----KSCYEQNDPIYNSRSFKSYKKLPC 63
Y + IGDP + ++DT + L WTQC C C++Q+ P+YN + +S K +PC
Sbjct: 81 YFTEYLIGDPPQHAEAIVDTGSNLVWTQCTDCLAVADRCFKQHLPLYNYSASRSAKPVPC 140
Query: 64 YDASCKS 70
DA C++
Sbjct: 141 TDALCQA 147
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/343 (18%), Positives = 122/343 (35%), Gaps = 49/343 (14%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
TY+++ IG P ++L +DT W C C C ++ ++K + C
Sbjct: 92 TYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPEKSTTFKNVSCAAP 148
Query: 67 SCKSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC---- 120
CK + C + +TYG + +L T+ +P P + FGC
Sbjct: 149 ECKQVPNPGCGVSSRNFNLTYG---SSSIAANLVQDTITLATDPVP----SYTFGCVSKT 201
Query: 121 ---------------------SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR 159
S + S + LN+ + + + + +
Sbjct: 202 TGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIK 261
Query: 160 FSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEV 219
++ L P +S + + K +++PP + G I D G+V T + V
Sbjct: 262 YTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPV 321
Query: 220 YAVLTAEFIDYFSQHDIEKLFTCRKCGV--TCFNLPARFNSFPSMTYHFQGADLVVEPEN 277
Y + EF + + T G TC+N+P P++T+ F G ++ + +N
Sbjct: 322 YVAVRDEF-----RRRVGPKLTVTSLGGFDTCYNVPI---VVPTITFIFTGMNVTLPQDN 373
Query: 278 VFIFNHQDSF--FFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ I + S G ++ Q N + +YD+
Sbjct: 374 ILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDV 416
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 83/370 (22%), Positives = 139/370 (37%), Gaps = 76/370 (20%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKK 60
F + ++ L IG P ++ +LDT + L+W QC+ P K+ DP+ +S S+
Sbjct: 72 FKYSMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSS----SFSV 127
Query: 61 LPCYDASCKS-------PFHCFEGD-CFYGI-----TYGDVYETKEVDSLDTSTLLPPDE 107
LPC + CK P C + C Y TY + +E + +S PP
Sbjct: 128 LPCNHSLCKPRVPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPP-- 185
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSF--MVQLGRL---VPDRFSC 162
+ GC+ +S D GI+G+N SF + ++ + VP R S
Sbjct: 186 --------LILGCATDSSD--------TQGILGMNLGRLSFSSLAKISKFSYCVPPRRSQ 229
Query: 163 CLVQPDKSFH---------------------SRLEFGDQI----------IAGKSLNLPP 191
P SF+ R+ D + I GK LN+
Sbjct: 230 SGSSPTGSFYLGPNPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNIST 289
Query: 192 NSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFN 251
++F +G + D G+ T + E Y+ + E + + ++K + CF+
Sbjct: 290 SAFRADPSGAGQTLIDSGTWFTFLVDEAYSKVKEEIVK-LAGPKLKKGYVYGGSLDMCFD 348
Query: 252 LPARF--NSFPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARH 308
A +M + F+ G ++VVE E + G + I+G H
Sbjct: 349 GDAMVIGRMIGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVASNIIGNFH 408
Query: 309 QHNTQFVYDL 318
Q + +DL
Sbjct: 409 QQDLWVEFDL 418
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 67/160 (41%), Gaps = 19/160 (11%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKLPC 63
N Y + L IG P K + +DT + LTW QC PC C E P Y R+ +PC
Sbjct: 31 NGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRN----NLVPC 86
Query: 64 YDASCKSPF----HCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEP--SPVSVQN 115
D C+S H E G C Y + Y D + V DT L E SP+
Sbjct: 87 MDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPL---- 142
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL 155
+ GC + F I G++GL +S + QL L
Sbjct: 143 LALGCGYD--QFPGGSHHPIDGVLGLGKGKSSIVSQLSSL 180
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 80/357 (22%), Positives = 130/357 (36%), Gaps = 55/357 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND---PI--YNSRSFKSYKKLP 62
Y +L +G P + + +DT + + W C C C + P+ ++ S + +
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLIS 111
Query: 63 CYDASCKSPFHCFEGDCF-------YGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C D C + C Y YGD T S LL D SV N
Sbjct: 112 CSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTS---GYYVSDLLHFDTVLGGSVMN 168
Query: 116 -----IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCL---- 164
I FGCS ++ + + GI G S + QL + P FS CL
Sbjct: 169 NSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDD 228
Query: 165 -----------VQPDKSF--------HSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
V+P+ + H L + G++L + P+ F + +G I
Sbjct: 229 SGGGILVLGEIVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVF--GTSSSQGTI 286
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPSMTY 264
D G+ L + Y FI + + G C+ + + N FP ++
Sbjct: 287 IDSGTTLAYLAEAAY----DPFISAITSIVSPSVRPYLSKGNHCYLISSSINDIFPQVSL 342
Query: 265 HFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK--TILGARHQHNTQFVYDL 318
+F GA +++ P++ I + F +G+ TILG + FVYD+
Sbjct: 343 NFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDI 399
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 83/363 (22%), Positives = 127/363 (34%), Gaps = 83/363 (22%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK----------SCYEQNDPIYNSRSFKS 57
Y+ GIGDP + ++DT + L WTQC C+ C+ QN P YN ++
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 58 YKKLPCYD------------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPP 105
+ +PC D A C + C +YG V D T
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVALG-VLGTDAFTF--- 193
Query: 106 DEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
PS SV + FGC +++ I + G G+ + LG R + L
Sbjct: 194 --PSSSSV-TLAFGCVSQTR----ISPGALTGASGI--------IGLG-----RGALSLN 233
Query: 166 QPDKSFHS-------RLEFGDQIIAGKSLNLPPNSFTIKLNGQR----GCINDCGSVLTV 214
D F + L G+ +A LP +F ++ + G + D GS T
Sbjct: 234 PKDSPFSTFYYLPLVGLAAGNATVA-----LPAGAFDLREAAPKVWAGGALIDSGSPFTR 288
Query: 215 IECEVYAVLTAEFIDYFSQHDI---------EKLFTCRKCGVTCFNLPARFNSFPSMTYH 265
+ + LT E L C + G +L A + PS+
Sbjct: 289 LVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAA--AAVPSLVLR 346
Query: 266 FQ-----GADLVVEPENVFIFNHQDSFFFFF-----GPAFTPRKGKTILGARHQHNTQFV 315
F G +LV+ E + ++ G A P TI+G Q + + +
Sbjct: 347 FDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVL 406
Query: 316 YDL 318
YDL
Sbjct: 407 YDL 409
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 75/161 (46%), Gaps = 15/161 (9%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---CYEQNDPIYNSRSFKSYKKLPCY 64
Y KL IG P + ++DT + +T+ C C S C + DP + + S +Y+ + C+
Sbjct: 50 YATKLYIGTPPQEFTLVVDTGSNMTFVPC--CGSEEYCGKHEDPAFQTESSSTYQPVNCH 107
Query: 65 DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
SC + C Y + YGD ++ V + D + E +P Q + FGC L++
Sbjct: 108 -PSCDCDY--LRSQCSYKMHYGDGSYSRGVLAEDIISFGNESEFAP---QRLVFGCELDA 161
Query: 125 KDFVSIQKKIIAGIMGLNWDSTSFMVQL--GRLVPDRFSCC 163
S+ GI+GL ++ + QL ++ D FS C
Sbjct: 162 --IGSLYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLC 200
>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
Length = 357
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 139/360 (38%), Gaps = 66/360 (18%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYEQN---DPIYN-SRSFKSYKKLPCY 64
+ + +G P +DT + L+W QCQPC C+ Q+ PI++ RS+ S +++ C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTS-RRVRCS 59
Query: 65 DASCKS--------PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQ 114
C +C E + C Y +TYG+ + V + T TL D S
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW-AYSVGKMVTDTLRIGD-----SFM 113
Query: 115 NIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPD-----RFSCCL----V 165
++ FGCS++ K + AGI G S SF QL PD S CL
Sbjct: 114 DLMFGCSMDVK-----YSEFEAGIFGFGSSSFSFFEQLAGY-PDILSYKALSYCLPTDET 167
Query: 166 QPDKSFHSRLEF----GDQIIAGKSLNLPPNSFTIKL---NGQR------GCINDCGSVL 212
+P R + G +S+N P S T+++ NGQR I D G+
Sbjct: 168 KPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGAQR 227
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP-------------ARFNSF 259
T + +A+L S + R+ C+ + +++
Sbjct: 228 TSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWSAL 287
Query: 260 PSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P + F GA L + P NVF + F A P ILG R + +D+
Sbjct: 288 PLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF--AQNPALRSQILGNRVTRSFGTTFDI 345
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 64/157 (40%), Gaps = 16/157 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC----QPCKSCYEQNDPIYNSRSFKSYKKLPC 63
Y + + IG+P +DT + LTW QC PCK C D +Y + K C
Sbjct: 62 YTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPCKGCTLPKDKLYKPNGNQLVK---C 118
Query: 64 YDASCKS---PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPD-----EPSPVSVQN 115
D C + PF F C I VY+ + D+ +++ L D PS +V
Sbjct: 119 SDPICAAVQPPFSTFGQKCAKPIPPC-VYKVEYADNAESTGALARDYMHIGSPSGSNVPL 177
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL 152
+ FGC E K G++GL S + QL
Sbjct: 178 VVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQL 214
>gi|255685722|gb|ACU28350.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 9/93 (9%)
Query: 10 LKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDASCK 69
+KL IG P +LDT + L WTQC PC CY+Q PI++ ++K + C
Sbjct: 1 MKLQIGTPPFEXEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFK-----ETRCN 55
Query: 70 SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTL 102
+P H C Y J Y D T + +T T+
Sbjct: 56 TPBH----SCPYKJVYDDKSYTXGTLATETVTI 84
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 78/182 (42%), Gaps = 21/182 (11%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-PIYNSRSFKSYKKLPCYD 65
+M ++ G P K + +DT + LTWTQC PC CY Q P Y + +Y+ C D
Sbjct: 57 AFMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAMCED 116
Query: 66 ASCKSPFHCFEGD-----CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGC 120
+ KS H F D C Y Y D K + + T+ D V + FGC
Sbjct: 117 SHPKSNPH-FAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFK-RVHGVYFGC 174
Query: 121 -SLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL--VQPDKSFHSRLEF 177
+L + + GI+GL S + + G +FS CL + K+ H+ L
Sbjct: 175 NTLSDGSYFT-----GTGILGLGVGKYSIIGEFG----SKFSFCLGEISEPKASHN-LIL 224
Query: 178 GD 179
GD
Sbjct: 225 GD 226
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 73/359 (20%), Positives = 122/359 (33%), Gaps = 75/359 (20%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y ++ +G P + W ++DT + TW C KS++ + C
Sbjct: 113 YFAEVKVGSPGQRFWLVVDTGSEFTWLNCS------------------KSFEAVTCASRK 154
Query: 68 CKSPFHCF---------EGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
CK C Y I+Y D K D+ T+ + + N+
Sbjct: 155 CKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQG-KLNNLTI 213
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--PDKSFHSRLE 176
GC+ + V+ ++ GI+GL + SF+ + +FS CLV +S S L
Sbjct: 214 GCTKSMLNGVNFNEE-TGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLT 272
Query: 177 FGDQ----------------------------IIAGKSLNLPPNSFTIKLNGQRGCINDC 208
G I G+ L +PP + N + G + D
Sbjct: 273 IGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVW--DFNAEGGTLIDS 330
Query: 209 GSVLTVIEC----EVYAVLTAEF--IDYFSQHDIEKLFTCRKCGVTCFNLPARFNS-FPS 261
G+ LT + V+ LT + + D + L CF+ +S P
Sbjct: 331 GTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDAL-------EFCFDAEGFDDSVVPR 383
Query: 262 MTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ +HF G P +I + G +++G Q N + +DL T
Sbjct: 384 LVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLST 442
>gi|224028837|gb|ACN33494.1| unknown [Zea mays]
Length = 209
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 32/47 (68%), Gaps = 1/47 (2%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC-KSCYEQNDPIYNSR 53
Y+ ++G+G P KS ++DT + LTW QC PC SC+ Q+ P++N +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPK 173
>gi|218201673|gb|EEC84100.1| hypothetical protein OsI_30414 [Oryza sativa Indica Group]
Length = 366
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 87/213 (40%), Gaps = 33/213 (15%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y+ ++ IG+ + + L+DT + L WTQC C C+ + P Y ++++++ C D
Sbjct: 81 AYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHCHIGDVPPYGRSQSRTFQEVSCGDD 140
Query: 67 S-------------CKSPFH---CFEGDCFYGITYGDVYETKEVD---SLDTSTLLPPDE 107
K P + C G C + Y + + V S+DT +
Sbjct: 141 DDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTFHFIDDRR 200
Query: 108 PSPVSVQNIRFGCSLESKDFVSIQK------KIIAGIMGLNWDSTSFMVQLGRLVPDRFS 161
+ + FGC+ + ++ K K GI+GL SF+ Q G +FS
Sbjct: 201 FDYQAKFRMVFGCAHQENIVLTAVKECTTAVKECTGILGLGMGDASFLRQTGIT---KFS 257
Query: 162 CCL--VQPDKSFH--SRLEFGDQI-IAGKSLNL 189
C P S+ S L FG I+GK + L
Sbjct: 258 YCAPPRMPGYSYRRDSWLRFGSHAQISGKKVPL 290
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 82/369 (22%), Positives = 126/369 (34%), Gaps = 75/369 (20%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F + ++ L IG P ++ +LDT + L+W QC + DP +S ++ L
Sbjct: 69 FKYSMALIINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTASFDPSLSS----TFSIL 124
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGI-----TYGDVYETKEVDSLDTSTLLPPDEP 108
PC CK P C + C Y TY + +E + S PP
Sbjct: 125 PCTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPP--- 181
Query: 109 SPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSF-----MVQLGRLVPDRFSCC 163
+ GC+ ES D GI+G+N SF + + VP R +
Sbjct: 182 -------LILGCATESTD--------PRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRP 226
Query: 164 LVQPDKSFH--------------------SRLEFGDQI----------IAGKSLNLPPNS 193
P SF+ R+ D + IAGK LN+ P
Sbjct: 227 GFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAV 286
Query: 194 FTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP 253
F G + D GS T + E Y + A+ + ++K + CF+
Sbjct: 287 FRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPR-LKKGYVYGGVADMCFDSV 345
Query: 254 ARFNS---FPSMTYHFQ-GADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQ 309
M + F+ G ++V+ E V G + I+G HQ
Sbjct: 346 KAVEIGRLIGEMVFEFERGVEVVIPKERVLADVGGGVHCVGIGSSDKLGAASNIIGNFHQ 405
Query: 310 HNTQFVYDL 318
N +DL
Sbjct: 406 QNLWVEFDL 414
>gi|125595845|gb|EAZ35625.1| hypothetical protein OsJ_19916 [Oryza sativa Japonica Group]
Length = 152
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 58/138 (42%), Gaps = 13/138 (9%)
Query: 23 FLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYDASCK--SPFH--CFE 76
+LDT + +TW QC PC + CY Q D +Y+ S C +C P+ C
Sbjct: 19 MVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTN 78
Query: 77 GD-CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKII 135
+ C Y + Y D T D T+ P +V++ +FGCS K S
Sbjct: 79 NNQCQYRVRYPDGTSTAGTYISDLLTITP-----ATAVRSFQFGCSKGVKGSFSFGSS-A 132
Query: 136 AGIMGLNWDSTSFMVQLG 153
AGIM L S + Q G
Sbjct: 133 AGIMALGGGPESLVSQTG 150
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 98/251 (39%), Gaps = 34/251 (13%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFK-SYKKLPCYD 65
Y + IG+P + + +DT + TW C PC +C + P+Y K + + P +
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHPRDPLCE 75
Query: 66 ASCKSPFHCFE-GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR--FGCSL 122
+ +C C Y ITY D +K V + D L D ++N+ FGC+
Sbjct: 76 ELQGNQNYCETCKQCDYEITYADRSSSKGVLARDNMQLTTADG----EMKNVDFVFGCAH 131
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRLEFGDQ 180
+ + GI+GL+ + S QL ++ + F C+ D S + GD
Sbjct: 132 NQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMAT-DPSSGGYMFLGDD 190
Query: 181 IIAGKSL------NLPPNSFT------------IKLNGQRG----CINDCGSVLTVIECE 218
+ + N P N ++ + L GQ G I D GS T E
Sbjct: 191 YVPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFDSGSSYTYFPHE 250
Query: 219 VYAVLTAEFID 229
+Y L A D
Sbjct: 251 IYTNLIALLED 261
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 72/144 (50%), Gaps = 17/144 (11%)
Query: 182 IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFI----DYFSQHDIE 237
+ G+ L +P SF + G G I D G+ +T ++ +VY V+ F+ D + +++
Sbjct: 20 VGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFVKGTKDLLATNEV- 78
Query: 238 KLFTCRKCGVTCFNLPARFN-SFPSMTYHF-QGADLVVEPENVFI-FNHQDSFFFFFGPA 294
LF TC++L ++ + P++ +HF +G LV+ +N + + +F F F P
Sbjct: 79 SLFD------TCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSVGTFCFAFAPT 132
Query: 295 FTPRKGKTILGARHQHNTQFVYDL 318
+ +I+G Q T+ +DL
Sbjct: 133 MSSL---SIIGNIQQQGTRVSFDL 153
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 5/70 (7%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK---SCYEQNDPIYNSRSFKSYK 59
TLN Y++ +G P + +DT + L+W QC+PC SCY Q DP+++ SY
Sbjct: 137 TLN--YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYA 194
Query: 60 KLPCYDASCK 69
+PC C
Sbjct: 195 AVPCGGPVCA 204
>gi|297794561|ref|XP_002865165.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297795163|ref|XP_002865466.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311000|gb|EFH41424.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311301|gb|EFH41725.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 134
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 30/46 (65%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSR 53
Y + + IG P + + DT + LTW QC+PC+ CY+QN P+++ +
Sbjct: 79 YFMSISIGTPPSKVLAIADTGSDLTWVQCKPCQQCYKQNSPLFDKK 124
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 5/70 (7%)
Query: 3 TLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK---SCYEQNDPIYNSRSFKSYK 59
TLN Y++ +G P + +DT + L+W QC+PC SCY Q DP+++ SY
Sbjct: 137 TLN--YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYA 194
Query: 60 KLPCYDASCK 69
+PC C
Sbjct: 195 AVPCGGPVCA 204
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 79/187 (42%), Gaps = 22/187 (11%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + IG+P + + +DT + LTW QC PC +C + P+Y +P D+
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKP---AKENIVPPRDS 185
Query: 67 SCKSPFHCFEGD---------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C+ +G+ C Y I Y D + V + D L+ D ++
Sbjct: 186 HCQE----LQGNQNYCDTCKQCDYEIAYADRSSSAGVLARDNMELITAD--GERENMDLV 239
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRL 175
FGC+ + + + GI+GL+ + S QL + ++ + F C+ D S + +
Sbjct: 240 FGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIAT-DPSGSAYM 298
Query: 176 EFGDQII 182
GD +
Sbjct: 299 FLGDDYV 305
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 83/187 (44%), Gaps = 20/187 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IGDP K + +DT + LTW QC PC+SC + P+Y R K+ K +PC ++
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLY--RPTKN-KLVPCANS 113
Query: 67 SC------KSPFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C SP + C Y I Y D + V D+ + LP S V ++ F
Sbjct: 114 ICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFS-LPLRNKSNVR-PSLSF 171
Query: 119 GCSLESK-DFVSIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKSFHSRL 175
GC + + G++GL S S + QL + + + CL F L
Sbjct: 172 GCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGF---L 228
Query: 176 EFGDQII 182
FGD ++
Sbjct: 229 FFGDDMV 235
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 78/355 (21%), Positives = 129/355 (36%), Gaps = 56/355 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y ++G+G+PV+ L ++DT + + W +C PC+SC + D IYN + +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSS 142
Query: 63 CYDASCKS-----PFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C D C C Y +Y D ++ V + + + I
Sbjct: 143 CSDPLCTGEEVVCSRSGNNSACAYVSSYQD--KSASVGAYVRDDMHYVLHGGNATTSRIF 200
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCLVQPDKSFHSRL 175
FGC+ + GIMG S + Q+ R + FS CL +K L
Sbjct: 201 FGCATNITGSWPVD-----GIMGFGLISKTVPNQIATQRNMSRVFSHCL-GGEKHGGGIL 254
Query: 176 EFGDQ-------------------------IIAGKSLNLPPNSFTIKLN--GQRGCINDC 208
EFG+ + K L + P F+ N G I D
Sbjct: 255 EFGEAPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDS 314
Query: 209 GSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARF---NSFPSMTYH 265
G+ ++ + +L E + KL G+ CF L + SFP++T
Sbjct: 315 GTTFVLLTTKANRMLFQEIKSLTTAKLGPKLE-----GLECFYLKSGLTMETSFPNVTLT 369
Query: 266 FQGAD-LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
F G + ++P+N + + A++ G TI G + YD++
Sbjct: 370 FSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADGLTIFGEIVLKDKLVFYDVE 424
>gi|125575543|gb|EAZ16827.1| hypothetical protein OsJ_32299 [Oryza sativa Japonica Group]
Length = 207
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 40/64 (62%), Gaps = 1/64 (1%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y++ L IG P + + ++D L WTQC Q C+ C++Q+ P++++ + +++ PC A
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 67 SCKS 70
C+S
Sbjct: 111 VCES 114
>gi|222613194|gb|EEE51326.1| hypothetical protein OsJ_32296 [Oryza sativa Japonica Group]
Length = 309
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y+ IG P + ++D L WTQC C+ C++Q+ P++ + ++K PC A
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121
Query: 68 CKS-PFHCFEGD-CFY 81
C+S P GD C Y
Sbjct: 122 CESIPTRSCSGDVCSY 137
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 73/360 (20%), Positives = 120/360 (33%), Gaps = 67/360 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC--KSCYEQNDPIYNSRSFKSYKKLPCYD 65
Y+ + IGDP + L+DT + L WTQC C K C Q P YNS + ++ +PC
Sbjct: 90 YVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAA 149
Query: 66 ASCKSP-----FHCFEGDC--FYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRF 118
C + F C G G V T ++ + + F
Sbjct: 150 RICAANDDIIHFCDLAAGCSVIAGYGAGVVAGTLGTEAFAFQS----------GTAELAF 199
Query: 119 GCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFG 178
GC ++ V +G++GL S + Q G +FS CL FH+ G
Sbjct: 200 GCVTFTR-IVQGALHGASGLIGLGRGRLSLVSQTG---ATKFSYCLT---PYFHNNGATG 252
Query: 179 DQIIAGKS------------------------------------LNLPPNSFTIKLNG-- 200
+ + L +P F ++
Sbjct: 253 HLFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPG 312
Query: 201 --QRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNS 258
G I D GS T + + Y L +E + + G C
Sbjct: 313 LFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADD-GALCVARRDVGRV 371
Query: 259 FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
P++ +HF+G + P + + + P + ++++G Q N + +YDL
Sbjct: 372 VPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDL 431
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 74/180 (41%), Gaps = 17/180 (9%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSF 55
+ T Y ++GIG P K + +DT + + W C C C +++ +Y+ R
Sbjct: 83 LATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGS 142
Query: 56 KSYKKLPCYDASCKS------PFHCFEGDCFYGITYGDVYETKE---VDSLDTSTLLPPD 106
+S + + C C + P C Y I+YGD T D L + +
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 107 EPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL--VPDRFSCCL 164
+ +P + ++ FGC + + + GI+G ++S + QL V F+ CL
Sbjct: 203 QTTPANA-SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL 261
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 75/181 (41%), Gaps = 22/181 (12%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQP--CKSCYEQNDPIYNSRSFKSYKKL 61
++ Y++K IG P + + DT + + W QC C +CY+Q P++N +Y
Sbjct: 104 IDKVYVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIR 163
Query: 62 PCYDASCKSPFHCFEGD----------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPV 111
C CK G+ C Y I+Y D ++ S D T P+ +
Sbjct: 164 LCGHRECKQALWGL-GEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITF--PEHIAEF 220
Query: 112 SVQNIR--FGCSLESKDFVSIQKK--IIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQP 167
++R FGC + + G++GL + S +G+L +FS C+ P
Sbjct: 221 GNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASL---VGQLTLGQFSYCISTP 277
Query: 168 D 168
D
Sbjct: 278 D 278
>gi|125575536|gb|EAZ16820.1| hypothetical protein OsJ_32292 [Oryza sativa Japonica Group]
Length = 253
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 35/69 (50%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
+T Y+ IG P + ++D L WTQC+ C C+EQ P+++ + +Y+
Sbjct: 45 WTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAE 104
Query: 62 PCYDASCKS 70
PC C+S
Sbjct: 105 PCGTPLCES 113
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 127/364 (34%), Gaps = 63/364 (17%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKS---CYEQNDPIYNSRSF-----KSY 58
Y + +G P + + DT + LTW C+ C+S + I + R F S+
Sbjct: 12 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 71
Query: 59 KKLPCYDASCK-------SPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
K +PC CK S +C C Y Y D + +T T+ E
Sbjct: 72 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTV-ELKEGR 130
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--P 167
+ + N+ GCS + F + G+MGL + SF ++ +FS CLV
Sbjct: 131 KMKLHNVLIGCS---ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLS 187
Query: 168 DKSFHSRLEFGDQ-------------------------------IIAGKSLNLPPNSFTI 196
K+ + L FG I G L +P + +
Sbjct: 188 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV 247
Query: 197 KLNGQRGCINDCGSVLTVIECEVYA-VLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR 255
K G G I D GS LT + Y V+ A + +E + CFN
Sbjct: 248 K--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE---YCFNSTGF 302
Query: 256 FNSF-PSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQF 314
S P + +HF P ++ + D G G +++G Q N +
Sbjct: 303 EESLVPRLVFHFADGAEFEPPVKSYVISAADG-VRCLGFVSVAWPGTSVVGNIMQQNHLW 361
Query: 315 VYDL 318
+DL
Sbjct: 362 EFDL 365
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 83/361 (22%), Positives = 133/361 (36%), Gaps = 68/361 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK-SCYE---QNDPIYNSRSFKSYKKLPC 63
+ + + +G P + +DT + L+W CQ C+ SC+ + +++ +Y+ + C
Sbjct: 75 FFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELVGC 134
Query: 64 YDASCK-------SPFHCFE--GDCFYGITYGDVYETK-EVDSLDTSTLLPPDEPSPVSV 113
C +PF C E C Y + YG + L T L S +
Sbjct: 135 SSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSSS--II 192
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDR-FSCCL-------- 164
FGCS + K +G++G + SF Q+ R R FS C
Sbjct: 193 DGFIFGCSGDDS-----FKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFPGDHTAEG 247
Query: 165 -----VQP--------------DKSFHSRLEFGDQIIAGKSLNLPPNSFTIKLNGQRGCI 205
P D+S +S L+ D ++ G L + + +T +R +
Sbjct: 248 FLSIGAYPKDELVYTNLIPHFGDRSVYS-LQQIDMMVDGNRLQVDQSEYT-----KRMMV 301
Query: 206 NDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGV-TCFNLPARFNS-----F 259
D G+V T + V+ + K F G TCF P +S
Sbjct: 302 VDSGTVDTFLLGPVFDAFSKAMASAMQ----AKGFLSDTVGTETCFR-PNGGDSVDSGDL 356
Query: 260 PSMTYHFQGADLVVEPENVF--IFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
P++ F G L + PENVF + D F P + ILG + + + VYD
Sbjct: 357 PTVEMRFIGTTLKLPPENVFHDLLPSHDKICLAFKPDVAGVRNVQILGNKATXSFRVVYD 416
Query: 318 L 318
L
Sbjct: 417 L 417
>gi|357512163|ref|XP_003626370.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355501385|gb|AES82588.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 223
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 63/141 (44%), Gaps = 9/141 (6%)
Query: 183 AGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTC 242
+GK L + P F +K + G + D GS T + + VL IDYF FT
Sbjct: 74 SGKILPINPELFKLKPGIEGGFVFDSGSPATYLVEGAFNVLKKSVIDYFGMKYNLHPFTE 133
Query: 243 RKCGVTC--FNLPARFNSFPSMTYHFQ-GADLVVEPENVFI-FNHQDSFFFFFGPAFTPR 298
RK +P R P M Y+FQ G+ V+P ++F+ F ++ F P
Sbjct: 134 RKLDYDLCYLGVPKRVVK-PGMAYYFQGGSKFEVDPNSLFLTFKAKEGNMFCMNVMAVPN 192
Query: 299 KGKTILGARHQHNTQFVYDLD 319
ILGA HQ N ++D++
Sbjct: 193 ----ILGAYHQSNHVLLFDVE 209
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 83/366 (22%), Positives = 129/366 (35%), Gaps = 67/366 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKS---CYEQNDPIYNSRSF-----KSY 58
Y + +G P + + DT + LTW C+ C+S + I + R F S+
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 59 KKLPCYDASCK-------SPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
K +PC CK S +C C Y Y D + +T T+ E
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTV-ELKEGR 201
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--P 167
+ + N+ GCS + F + G+MGL + SF ++ +FS CLV
Sbjct: 202 KMKLHNVLIGCS---ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLS 258
Query: 168 DKSFHSRLEFGDQ-------------------------------IIAGKSLNLPPNSFTI 196
K+ + L FG I G L +P + +
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV 318
Query: 197 KLNGQRGCINDCGSVLTVIECEVYA-VLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR 255
K G G I D GS LT + Y V+ A + +E + CFN
Sbjct: 319 K--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE---YCFNSTGF 373
Query: 256 FNSF-PSMTYHFQGADLVVEPENVFIFNHQDSF--FFFFGPAFTPRKGKTILGARHQHNT 312
S P + +HF P ++ + D F A+ G +++G Q N
Sbjct: 374 EESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP---GTSVVGNIMQQNH 430
Query: 313 QFVYDL 318
+ +DL
Sbjct: 431 LWEFDL 436
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 86/209 (41%), Gaps = 21/209 (10%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYK 59
++ + H Y + + IGDP K + +DT + LTW QC PC+SC + P Y K
Sbjct: 67 VYPIGH-YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKP---TKNK 122
Query: 60 KLPCYDASCKS--PFH--CFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
+PC + C S P C Y I Y D + V D TL + S N
Sbjct: 123 IVPCAASLCTSLTPNKKCAVPQQCDYQIKYTDKASSLGVLIADNFTLSLRN--SSTVRAN 180
Query: 116 IRFGCSLESKDFVSIQKKIIA---GIMGLNWDSTSFMVQLGR--LVPDRFSCCLVQPDKS 170
+ FGC + + V + A G++GL + S + QL + + + C
Sbjct: 181 LTFGCGYDQQ--VGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGG 238
Query: 171 FHSRLEFGDQIIAGKSLNLPPNSFTIKLN 199
F L FGD I+ + P + T N
Sbjct: 239 F---LFFGDDIVPTSRVTWVPMARTTSGN 264
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 110/318 (34%), Gaps = 67/318 (21%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCY 64
N Y+++L IG P + DT + W QC PC++C
Sbjct: 75 NGEYLMRLYIGTPPVERLVIADTGSDFIWVQCSPCQNC---------------------- 112
Query: 65 DASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLES 124
C Y Y + T EV +T + VS N FGC +
Sbjct: 113 -------------QCVYLNIYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGANN 159
Query: 125 K-DFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIA 183
F S K G++GL S + QLG + +FS + + +I
Sbjct: 160 NLTFRSSDKA--TGLVGLVAGQLSLVSQLGAQIGYKFSYLKFGSEAIITTNGVVSTPLII 217
Query: 184 GKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCR 243
SL L + + GQ+ V+ E + S D+ F
Sbjct: 218 KPSLPLYFLNLEVVTIGQK-------------------VVPTETLGVESVQDLPFPFK-- 256
Query: 244 KCGVTCFNLPARFN-SFPSMTYHFQGADLVVEPENVFI-FNHQDSFFFFFGPAFTPRKGK 301
F P R N + P++ + F GA + + P+N+ I ++ P+ +
Sbjct: 257 ------FCFPYRDNMTVPAIAFQFTGASVALRPKNLLIKLQDRNMLXLAVVPSASSLSVI 310
Query: 302 TILGARHQHNTQFVYDLD 319
+I G Q + Q +YDLD
Sbjct: 311 SIFGIIAQFDFQVLYDLD 328
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/348 (22%), Positives = 131/348 (37%), Gaps = 56/348 (16%)
Query: 14 IGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQN--DPIYNSRSFKSYKKLPCYDA--SCK 69
+G P K ++DT + +T+ C C S N D ++ + + ++ C SC
Sbjct: 84 LGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEASSTASRISCTSPKCSCG 143
Query: 70 SPF-HCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFV 128
SP C C Y +Y + + + D L +P I FGC E+++
Sbjct: 144 SPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALHDGLPGAP-----IIFGC--ETRETG 196
Query: 129 SIQKKIIAGIMGLNWDSTSFMVQLGR--LVPDRFSCC--LVQPDKSFHSRLEFGDQIIAG 184
I ++ G+ GL S + QL + ++ D FS C +V+ D + L GD + G
Sbjct: 197 EIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGA----LLLGDAEVPG 252
Query: 185 K-SLNLPP-------------NSFTIKLNGQR------------GCINDCGSVLTVIECE 218
SL P ++ + GQ G + D G+ T +
Sbjct: 253 SISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGYGTVLDSGTTFTYMPSP 312
Query: 219 VYAVLTAEFIDYFSQHDIEKL--------FTCRKCGVTCFNLPARFNSFPSMTYHF-QGA 269
V+ Y H ++++ C + +L A + FPSM F QG
Sbjct: 313 VFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSVFPSMEVQFDQGT 372
Query: 270 DLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYD 317
LV+ P N + +S + G F + T+LG N YD
Sbjct: 373 SLVLGPLNYLFVHTFNSGKYCLG-VFDNGRAGTLLGGITFRNVLVRYD 419
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 65/156 (41%), Gaps = 14/156 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG ++ F +D+ + LTW QC PC C + + +Y + L C++
Sbjct: 55 YSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNN----NALNCFEP 110
Query: 67 SCKS-----PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C S HC D C Y I Y D + V D L + ++ I FG
Sbjct: 111 LCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTN--GSLAAPRIAFG 168
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL 155
C + K V AG++GL SF+ QL +
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSM 204
>gi|325190367|emb|CCA24840.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
Length = 603
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 138/358 (38%), Gaps = 66/358 (18%)
Query: 7 TYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYN--SRSFKSYKKL--- 61
TY + L IG P++ LLDT + T C+ C +C + DP Y+ ++ K
Sbjct: 119 TYYIDLYIGIPLQKASLLLDTTSQHTVFPCKNCVACADHMDPYYDIAKSQTSNFTKCGAE 178
Query: 62 ----PCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN-- 115
C D C+ +G + G+ D+ + D E + ++N
Sbjct: 179 NVCNSCEDEKCRVEQSYSDGSFWSGLVVEDLVWVASPKTGDI-------EMTSGIIRNFG 231
Query: 116 --IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTS---FMVQLGRLVPDRFSCCLVQPDKS 170
+RF C S Q++ GI+GL+ + S FMVQ R+ FS CL +
Sbjct: 232 FPMRFACETSEDGIFSQQRE--NGILGLDRSNHSILNFMVQAKRIDHRIFSYCLHDTGGT 289
Query: 171 F---------HSRLEFGDQIIAGKSLNLPPNSFT-IKLNGQRGCIND----CGSVLTVIE 216
F H+ +I+A ++ +L I++N + I++ G + +
Sbjct: 290 FVLGGFDSMHHTSDMIYTRIVANQNDSLHGVYLKDIQINNRSIGIDEKQYNSGRGMVIAS 349
Query: 217 CEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFN---------SFPSMTYHFQ 267
V + F + K+F +T F+ N + P++T F
Sbjct: 350 SSVES-----FFPSVAGEAFRKVFKS----ITGFDFEQEANMIFDKKTKQALPTITLVFA 400
Query: 268 GAD------LVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
G D L + + I + D FF G FT R G + G+R + ++DLD
Sbjct: 401 GIDEEHDIKLTIPASSYLIPSDNDR--FFAGIQFTERTGG-VFGSRILSDYNVIFDLD 455
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 83/366 (22%), Positives = 129/366 (35%), Gaps = 67/366 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKS---CYEQNDPIYNSRSF-----KSY 58
Y + +G P + + DT + LTW C+ C+S + I + R F S+
Sbjct: 83 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 59 KKLPCYDASCK-------SPFHCFE--GDCFYGITYGDVYETKEVDSLDTSTLLPPDEPS 109
K +PC CK S +C C Y Y D + +T T+ E
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTV-ELKEGR 201
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ--P 167
+ + N+ GCS + F + G+MGL + SF ++ +FS CLV
Sbjct: 202 KMKLHNVLIGCS---ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLS 258
Query: 168 DKSFHSRLEFGDQ-------------------------------IIAGKSLNLPPNSFTI 196
K+ + L FG I G L +P + +
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV 318
Query: 197 KLNGQRGCINDCGSVLTVIECEVYA-VLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPAR 255
K G G I D GS LT + Y V+ A + +E + CFN
Sbjct: 319 K--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE---YCFNSTGF 373
Query: 256 FNSF-PSMTYHFQGADLVVEPENVFIFNHQDSF--FFFFGPAFTPRKGKTILGARHQHNT 312
S P + +HF P ++ + D F A+ G +++G Q N
Sbjct: 374 EESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP---GTSVVGNIMQQNH 430
Query: 313 QFVYDL 318
+ +DL
Sbjct: 431 LWEFDL 436
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/357 (22%), Positives = 129/357 (36%), Gaps = 71/357 (19%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQND-----PIYNSRSFKSYKKLP 62
Y + + +G+P + + T + + W C C C +D +Y+ ++ + ++
Sbjct: 76 YCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDCPTPDDIGFSLDLYDPKNSSTSSEIS 135
Query: 63 CYDASCKSPFHCFEGDCFYGITYGDV--YETKEVDSLDTST-----------LLPPDEPS 109
C D C C + GD Y D + +T + +E
Sbjct: 136 CSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSDDIHFDIFMGNESF 195
Query: 110 PVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL-GRLVPDRFSCCLVQPD 168
S ++ FGCS + G++G D+ S + QL + V FS CL D
Sbjct: 196 ASSSASVIFGCSKSRSGHLQAD-----GVIGFGKDAPSLISQLNSQGVSHAFSRCLDDSD 250
Query: 169 KS---------FHSRLEFGDQI------------IAGKSLNLPPNSFTIKLNGQRGCIND 207
LEF + IA + N+P +S + +G D
Sbjct: 251 DGGGVLILDEVGEPGLEFTSLVASRPCYNLNMKSIAVNNQNVPIDSSLFTTSSTQGTFLD 310
Query: 208 CGSVLTVIECEVY-AVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHF 266
G+ L VY V+ A YFS F+SFP++T +F
Sbjct: 311 SGTSLAYFPDGVYDPVIRAILFIYFSTR--------------------SFSSFPTVTXYF 350
Query: 267 Q-GADLVVEPENVFI----FNHQDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDL 318
+ GA + V PEN + +++ F + K TILG H+ FVY+L
Sbjct: 351 EGGAAMKVGPENYLLRRGSYDNDSYMCIAFQRSEGDYKQTTILGDLILHDKIFVYNL 407
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 65/156 (41%), Gaps = 14/156 (8%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG ++ F +D+ + LTW QC PC C + + +Y + L C++
Sbjct: 55 YSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNN----NALNCFEP 110
Query: 67 SCKS-----PFHCFEGD--CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
C S HC D C Y I Y D + V D L + ++ I FG
Sbjct: 111 LCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTN--GSLAAPRIAFG 168
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRL 155
C + K V AG++GL SF+ QL +
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSM 204
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 70/179 (39%), Gaps = 33/179 (18%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK----------SCYEQNDPIYNSRSFKS 57
Y+ GIGDP + ++DT + L WTQC C+ C+ QN P YN ++
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 58 YKKLPCYD------------ASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPP 105
+ +PC D A C + C +YG V D T
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVALG-VLGTDAFTF--- 193
Query: 106 DEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
PS SV + FGC +++ +GI+GL + S + QL FS CL
Sbjct: 194 --PSSSSV-TLAFGCVSQTR-ISPGALNGASGIIGLGRGALSLVSQLNA---TEFSYCL 245
>gi|413936471|gb|AFW71022.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 315
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 36/64 (56%), Gaps = 2/64 (3%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS--CYEQNDPIYNSRSFKSYKKLPCYD 65
Y++++ G P ++DT + ++W QC+PC S C+ Q DP+Y+ +Y +PC
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172
Query: 66 ASCK 69
CK
Sbjct: 173 DVCK 176
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/362 (19%), Positives = 120/362 (33%), Gaps = 101/362 (27%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKS---------------CYEQN 46
+T Y ++ +G P + + DT + LTW +C+ S
Sbjct: 104 YTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAP 163
Query: 47 DPIYNSRSFKSYKKLPCYDASCKS--PFHCFE-----GDCFYGITYGDVYETKEVDSLDT 99
++ K++ +PC +CKS PF C Y Y D + V D+
Sbjct: 164 PRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDS 223
Query: 100 STLL-------PPDEPSPVSVQNIRFGCS-------LESKDFVSIQKKIIAGIMGLNWDS 145
+T+ +Q + GC+ E+ D G++ L + +
Sbjct: 224 ATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASD----------GVLSLGYSN 273
Query: 146 TSFMVQLGRLVPDRFSCCLVQ----------------PDKSFHSRLEFGDQI-------- 181
SF + RFS CLV PD + S G +
Sbjct: 274 ISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARV 333
Query: 182 ------------IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFID 229
+ G +L++P + + NG G I D G+ LTV+ Y + A
Sbjct: 334 RPFYAVAVDSVSVDGVALDIPAEVWDVGSNG--GTIIDSGTSLTVLATPAYKAVVAAL-- 389
Query: 230 YFSQHDIEKLFTCRKCGVT----CFNLPARFN-----SFPSMTYHFQGADLVVEPENVFI 280
E+L + + C+N AR + + P + F G+ + P ++
Sbjct: 390 ------SEQLAGLPRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYV 443
Query: 281 FN 282
+
Sbjct: 444 ID 445
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 68/154 (44%), Gaps = 16/154 (10%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQC-QPCKSCYEQNDPIYNSRSFKSYKKLPCYDA 66
Y + + IG+P K + +DT + LTW QC PC+SC + P+Y R K+ K +PC D
Sbjct: 66 YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY--RPTKN-KLVPCVDQ 122
Query: 67 SCKSPFHCFEG---------DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIR 117
C S + C Y I Y D + V D+ L + V ++
Sbjct: 123 LCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLAN--GSVVRPSLA 180
Query: 118 FGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQ 151
FGC + + S + G++GL S S + Q
Sbjct: 181 FGCGYD-QQVSSGEMSPTDGVLGLGTGSVSLLSQ 213
>gi|414589629|tpg|DAA40200.1| TPA: hypothetical protein ZEAMMB73_727364, partial [Zea mays]
Length = 201
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 69/157 (43%), Gaps = 19/157 (12%)
Query: 175 LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQH 234
+ F + + L +P ++F ++ +G G I D G+ LT++ V A E + F Q
Sbjct: 43 VHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLA----EVVRAFRQQ 98
Query: 235 DIEKLFTCR---KCGVTCFNLPARFNS--------FPSMTYHFQGADLVVEPENVFIFNH 283
+ F + GV CF +PA + P M HFQGADL + N + +H
Sbjct: 99 -LRLPFANGGNPEDGV-CFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLDDH 156
Query: 284 QDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ A + G TI G Q + + +YDL+
Sbjct: 157 RRGRLCLL-LADSGDDGSTI-GNLVQQDMRVLYDLEA 191
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 20/166 (12%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC---KSCYEQNDPIYNSRSFKSYKKLPCY 64
Y+ IG P + L+DT + L WTQC KSC +Q P YN ++ +PC
Sbjct: 86 YIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCA 145
Query: 65 DAS---CKSPFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
D + + H +G C + +YG + + SL T + ++ FG
Sbjct: 146 DKAGFCAANGVHLCGLDGSCTFIASYG---AGRVIGSLGTESFAFES-----GTTSLAFG 197
Query: 120 CSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
C + S +G++GL S + Q+G RFS CL
Sbjct: 198 C-VSLTRITSGALNDASGLIGLGRGRLSLVSQIG---ATRFSYCLT 239
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 81/202 (40%), Gaps = 21/202 (10%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYK 59
++ + H + + + IGDP K + +DT + LTW QC PC +C + +Y + K
Sbjct: 32 VYPIGH-FFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVK 90
Query: 60 --KLPCYD--ASCKSPFHCF-EGDCFYGITY--GDVYETKEVDSLDTSTLLPPDEPSPVS 112
+ C D A + P C + C YGI Y G VDS +L + +P S
Sbjct: 91 CTEQRCADLYADLRKPMKCGPKNQCHYGIQYVGGSSIGVLIVDSF---SLPASNGTNPTS 147
Query: 113 VQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQL---GRLVPDRFSCCLVQPDK 169
I FGC + GI+GL + + QL G + C+ K
Sbjct: 148 ---IAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204
Query: 170 SFHSRLEFGDQIIAGKSLNLPP 191
F L FGD + + P
Sbjct: 205 GF---LFFGDAKVPTSGVTWSP 223
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 69/168 (41%), Gaps = 26/168 (15%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPC---KSCYEQNDPIYNSRSFKSYKKLPCY 64
Y+ + IGDP + L+DT + L WTQC K+C +Q+ P YN ++ +PC
Sbjct: 84 YIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCA 143
Query: 65 DAS---CKSPFHC--FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFG 119
D++ + H +G C + +YG SL T + FG
Sbjct: 144 DSAKLCAANGVHLCGLDGSCTFAASYG---AGSVFGSLGTEAFTFQS-----GAAKLGFG 195
Query: 120 CSLESKDFVSIQKKII---AGIMGLNWDSTSFMVQLGRLVPDRFSCCL 164
C I K + +G++GL S + Q G +FS CL
Sbjct: 196 C----VSLTRITKGALNGASGLIGLGRGRLSLVSQTG---ATKFSYCL 236
>gi|413923781|gb|AFW63713.1| hypothetical protein ZEAMMB73_300584, partial [Zea mays]
Length = 190
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +GIG P + +DT + ++W QC+PC C+ + D +++ S +Y C A
Sbjct: 82 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAP 141
Query: 68 CKSPFHCFEGD------CFYGITYG 86
C EG+ C Y + YG
Sbjct: 142 CAQLSQSQEGNGCMSSQCQYIVNYG 166
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/245 (25%), Positives = 94/245 (38%), Gaps = 40/245 (16%)
Query: 79 CFYGITYGDVYETKEV---DSLDTSTLLPPDEPSPVSVQNIRFGCSLESKDFVSIQKKII 135
C Y I YGD T+ + L T+L V++ FGC +K +
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGTIL---------VKDFIFGCGRNNKGLFGG----V 179
Query: 136 AGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRLEFGDQIIAGKSLNLPPNSFT 195
+G+MGL S + Q P + I G +L P
Sbjct: 180 SGLMGLGRSDLSLISQTSE-----------NPQLYNFYFINLTGISIGGVALQAPSV--- 225
Query: 196 IKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA- 254
G + D G+V+T + +Y L AEF+ F+ F+ TCFNL A
Sbjct: 226 ----GPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILD---TCFNLSAY 278
Query: 255 RFNSFPSMTYHFQG-ADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGK-TILGARHQHNT 312
+ P++ HF+G A+L V+ VF F D+ A + + ILG Q N
Sbjct: 279 QEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNL 338
Query: 313 QFVYD 317
+ +YD
Sbjct: 339 RVIYD 343
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 128/371 (34%), Gaps = 75/371 (20%)
Query: 2 FTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKL 61
F + ++ L IG P ++ +LDT + L+W QC +++ S+ L
Sbjct: 76 FKYSMILLVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVL 135
Query: 62 PCYDASCKS-------PFHCFEGD-CFYGITYGD--VYE---TKEVDSLDTSTLLPPDEP 108
PC CK P C + C Y Y D + E +E + S PP
Sbjct: 136 PCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPP--- 192
Query: 109 SPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSF-----MVQLGRLVPDRFSCC 163
+ GC+ ES D GI+G+N SF + + VP R
Sbjct: 193 -------LILGCAEESSD--------AKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRP 237
Query: 164 LVQPDKSFH---------------------SRLEFGDQI----------IAGKSLNLPPN 192
P SF+ R+ D + I + LN+P +
Sbjct: 238 GFTPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPIS 297
Query: 193 SFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNL 252
+F +G + D GS T + E Y + E + ++K + CFN
Sbjct: 298 AFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGAR-LKKGYVYGGVSDMCFNG 356
Query: 253 PA----RFNSFPSMTYHF-QGADLVVEPENVFIFNHQDSFFFFFGPAFTPRKGKTILGAR 307
A R +M + F +G ++VVE E V G + I+G
Sbjct: 357 NAIEIGRL--IGNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASNIIGNF 414
Query: 308 HQHNTQFVYDL 318
HQ N +DL
Sbjct: 415 HQQNIWVEFDL 425
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 136/384 (35%), Gaps = 89/384 (23%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQP---CKSC-YEQNDPIYNSR----SFKSYK 59
Y + L G P +++ F+ DT + L W C C C + DP R + S K
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 60 KLPCYDASCK--------------SPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPP 105
+ C C+ + +C G Y + YG + T V L T L P
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYG-LGSTAGV--LITEKLDFP 206
Query: 106 DEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLV 165
D ++V + GCS+ I + AGI G S Q+ RFS CLV
Sbjct: 207 D----LTVPDFVVGCSI-------ISTRQPAGIAGFGRGPVSLPSQMNL---KRFSHCLV 252
Query: 166 -------------------------------------QPDKSFHSRLEFG----DQIIAG 184
P+ S + LE+ +I G
Sbjct: 253 SRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVG 312
Query: 185 -KSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCR 243
K + +P NG G I D GS T +E V+ ++ EF S + EK
Sbjct: 313 RKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKE 372
Query: 244 KCGVTCFNLPARFN-SFPSMTYHFQ-GADLVVEPENVFIF-NHQDS--FFFFFGPAFTPR 298
CFN+ + + + P + + F+ GA L + N F F + D+ P
Sbjct: 373 TGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPS 432
Query: 299 KG---KTILGARHQHNTQFVYDLD 319
G ILG+ Q N YDL+
Sbjct: 433 GGTGPAIILGSFQQQNYLVEYDLE 456
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 70/157 (44%), Gaps = 19/157 (12%)
Query: 175 LEFGDQIIAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQH 234
+ F + + L +P ++F ++ +G G I D G+ LT++ A + AE + F Q
Sbjct: 146 VHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLP----AAVLAEVVRAFRQQ 201
Query: 235 DIEKLFTCR---KCGVTCFNLPARFNS--------FPSMTYHFQGADLVVEPENVFIFNH 283
+ F + GV CF +PA + P M HFQGADL + N + +H
Sbjct: 202 -LRLPFANGGNPEDGV-CFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLDDH 259
Query: 284 QDSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLDT 320
+ A + G TI G Q + + +YDL+
Sbjct: 260 RRGRLCLL-LADSGDDGSTI-GNLVQQDMRVLYDLEA 294
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/265 (23%), Positives = 99/265 (37%), Gaps = 38/265 (14%)
Query: 5 NHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSRSFKSYKKLPC 63
N Y + +G P + + +DT + LTW QC PC SC + +P+Y K +P
Sbjct: 98 NGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKP---KKGNLVPL 154
Query: 64 YDASC-------KSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
D+ C K+ + C Y I Y D + V + D L+ + ++ I
Sbjct: 155 KDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLAN--GSLTKLGI 212
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG--RLVPDRFSCCLVQPDKSFHSR 174
FGC+ + + + GI+GL+ S QL R++ + CL D +
Sbjct: 213 MFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS-DATGGGY 271
Query: 175 LEFGDQIIA------------------GKSLNLPPNSFTIKLNGQRG----CINDCGSVL 212
+ GD + + + + S + L Q G + D GS
Sbjct: 272 MFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSY 331
Query: 213 TVIECEVYAVLTAEFIDYFSQHDIE 237
T E Y L A D + I+
Sbjct: 332 TYFPKEAYYALVASLKDVSDEGLIQ 356
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 71/160 (44%), Gaps = 20/160 (12%)
Query: 4 LNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQ-PCKSCYEQNDPIYNSR--SFKSYKK 60
L +TY++ +G+P + + +DT + LTW QC PC SC + P+Y R + S+K
Sbjct: 198 LYYTYIM---VGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVSFK- 253
Query: 61 LPCYDASCKSPFHCFEGD-------CFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSV 113
D+ C ++GD C Y + Y D + V D TL + ++
Sbjct: 254 ----DSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSN--GSLTK 307
Query: 114 QNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLG 153
N FGC+ + + + GI+GL+ S QL
Sbjct: 308 LNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLA 347
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 68/336 (20%), Positives = 118/336 (35%), Gaps = 67/336 (19%)
Query: 1 MFTLNHTYMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRS--FKSY 58
+T Y ++ +G P + + DT + LTW +C+ + P R+ +S+
Sbjct: 98 AYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSW 157
Query: 59 KKLPCYDASCKS--PFHCFE-----GDCFYGITYGDVYETKEVDSLDTSTLL-------- 103
L C +C S PF C Y Y D + V D +T+
Sbjct: 158 APLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSED 217
Query: 104 -PPDEPSPVSVQNIRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSC 162
+Q + GC+ + D S Q G++ L + SF + RFS
Sbjct: 218 GSGGGGRRAKLQGVVLGCT-ATYDGQSFQSSD--GVLSLGNSNISFASRAAARFGGRFSY 274
Query: 163 CLVQ--PDKSFHSRLEFG-----------------DQIIA-------------GKSLNLP 190
CLV ++ S L FG D+ ++ G++L++P
Sbjct: 275 CLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIP 334
Query: 191 PNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVT-- 248
+ + + G G I D G+ LTV+ Y + A +L + +
Sbjct: 335 ADVWDVGRGG--GAILDSGTSLTVLATPAYRAVVAALGG--------RLAALPRVAMDPF 384
Query: 249 --CFNLPARFNSFPSMTYHFQGADLVVEPENVFIFN 282
C+N A P + F G+ + P ++ +
Sbjct: 385 EYCYNWTAGAPEIPKLEVSFAGSARLEPPAKSYVID 420
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 84/353 (23%), Positives = 132/353 (37%), Gaps = 56/353 (15%)
Query: 1 MFTLNHT--YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCK--SCYEQNDPIYNSRSFK 56
M +LN +++ +G G P ++L ++DT + TW +C C +C+ + P +N
Sbjct: 120 MHSLNEDGFFLVNVGFGKPQQNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSS 179
Query: 57 SYKKLPCYDASCKSPFHCFEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNI 116
SY C ++ + Y + Y D +K V D TL P P
Sbjct: 180 SYSNRSCIPSTKTN----------YTMNYEDNSYSKGVFVCDEVTLKPDVFP-------- 221
Query: 117 RFGCSLESKDFVSIQKKIIAGIMGL-NWDSTSFMVQLGRLVPDRFSCCLVQPDKSFHSRL 175
+F D +G++GL + S + Q +FS C + + S L
Sbjct: 222 KF--QFGCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFSYCFPHNENTRGSLL 279
Query: 176 EFGDQII-AGKSLN----LPPNSFTI-------------KLN------GQRGCINDCGSV 211
FG++ I A SL L P+S ++ +LN G I D G+V
Sbjct: 280 -FGEKAISASPSLKFTRLLNPSSGSVYFVELIGISVAKKRLNVSSSLFASPGTIIDSGTV 338
Query: 212 LTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLP---ARFNSFPSMTYHFQG 268
+T + Y L F K TC+NL R P + HF G
Sbjct: 339 ITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVG 398
Query: 269 -ADLVVEPENVFIFNHQ-DSFFFFFGPAFTPRKGKTILGARHQHNTQFVYDLD 319
D+ + P + N F P TI+G R Q + + VYD++
Sbjct: 399 EVDVSLHPSGILWANGDLTQACLAFARKSHPSH-VTIIGNRQQVSLKVVYDIE 450
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 73/349 (20%), Positives = 136/349 (38%), Gaps = 79/349 (22%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++ +G+G P K+ +DT + TW C+ C C+ N R+F + C S
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHT------NPRTFLQSRSTTCAKVS 53
Query: 68 CKSPF--------HCFEG----DCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQN 115
C + HC + DC + ++Y D + + DT T + + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK-----IPS 108
Query: 116 IRFGCSLESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCC--LVQPDKSFHS 173
FGC+L+S F + + + G++G+ S + Q D FS C L + ++ F S
Sbjct: 109 FTFGCNLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DGFSYCLPLQKSERGFFS 165
Query: 174 R--------------------------------LEFGDQIIAGKSLNLPPNSFTIKLNGQ 201
+ ++ + G+ L L P+ F+ +
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----R 220
Query: 202 RGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPA-RFNSFP 260
+G + D GS L+ I +VL+ + + + + R C+++ + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERN----CYDMRSVDEGDMP 276
Query: 261 SMTYHF-QGADLVVEPENVFI---FNHQDSFFFFFGPAFTPRKGKTILG 305
+++ HF GA + VF+ QD + AF P + +I+G
Sbjct: 277 AISLHFDDGARFDLGSRGVFVERSVQEQDVWCL----AFAPTESVSIIG 321
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 74/347 (21%), Positives = 118/347 (34%), Gaps = 57/347 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ +G P + ++ +LDT W C C C + + + L C A
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSGAQ 154
Query: 68 CKS--PFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C F C C + +YG D TL P FGC +
Sbjct: 155 CSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIP------GFTFGC-I 207
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFG--- 178
+ SI + G++GL S + Q G + FS CL F L+ G
Sbjct: 208 NAVSGGSIPPQ---GLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVG 264
Query: 179 ------------------------DQIIAGK-SLNLPPNSFTIKLNGQRGCINDCGSVLT 213
+ G+ + +P N G I D G+V+T
Sbjct: 265 QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 324
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVV 273
VY + EF + + + TCF + P++T HF+G +LV+
Sbjct: 325 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTCFAATNEAEA-PAITLHFEGLNLVL 378
Query: 274 EPENVFIFNHQDSFFFFFGPAFTPRKGKTILGA---RHQHNTQFVYD 317
EN I + S A P ++L Q N + ++D
Sbjct: 379 PMENSLIHSSSGS-LACLSMAAAPNNVNSVLNVIANLQQQNLRIMFD 424
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 68/313 (21%), Positives = 108/313 (34%), Gaps = 53/313 (16%)
Query: 8 YMLKLGIGDPVKSLWFLLDTVAGLTWTQCQPCKSCYEQNDPIYNSRSFKSYKKLPCYDAS 67
Y++++ +G P + ++ +LDT W C C C + + + L C +A
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSEAQ 101
Query: 68 CKS--PFHC---FEGDCFYGITYGDVYETKEVDSLDTSTLLPPDEPSPVSVQNIRFGCSL 122
C F C C + +YG D TL P FGC +
Sbjct: 102 CSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIP------GFTFGC-I 154
Query: 123 ESKDFVSIQKKIIAGIMGLNWDSTSFMVQLGRLVPDRFSCCLVQ-PDKSFHSRLEFG--- 178
+ SI + G++GL S + Q G + FS CL F L+ G
Sbjct: 155 NAVSGGSIPPQ---GLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVG 211
Query: 179 ------------------------DQIIAGK-SLNLPPNSFTIKLNGQRGCINDCGSVLT 213
+ G+ + +P N G I D G+V+T
Sbjct: 212 QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 271
Query: 214 VIECEVYAVLTAEFIDYFSQHDIEKLFTCRKCGVTCFNLPARFNSFPSMTYHFQGADLVV 273
VY + EF + + + TCF + P++T HF+G +LV+
Sbjct: 272 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTCFAATNEAEA-PAVTLHFEGLNLVL 325
Query: 274 EPENVFIFNHQDS 286
EN I + S
Sbjct: 326 PMENSLIHSSSGS 338
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 58/143 (40%), Gaps = 16/143 (11%)
Query: 182 IAGKSLNLPPNSFTIKLNGQRGCINDCGSVLTVIECEVYAVLTAEFIDYFSQHDIEKLFT 241
+ L++ ++F + +G G I D G+ +T IE + L EF +
Sbjct: 46 VGDTKLSIEQSTFEVSDDGSGGVIIDSGTTITYIEENAFDSLKKEFTS-------QTKLP 98
Query: 242 CRKCGVT----CFNLPARFNS--FPSMTYHFQGADLVVEPENVFIFNHQDSFFFFFGPAF 295
K G T CF+LP+ P + +HF+G DL + EN I DS A
Sbjct: 99 VDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGGDLELPGENYMI---ADSSLGVACLAM 155
Query: 296 TPRKGKTILGARHQHNTQFVYDL 318
G +I G Q N +DL
Sbjct: 156 GASNGMSIFGNIQQQNILVNHDL 178
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.141 0.455
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,281,749,664
Number of Sequences: 23463169
Number of extensions: 221884899
Number of successful extensions: 591536
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 599
Number of HSP's successfully gapped in prelim test: 726
Number of HSP's that attempted gapping in prelim test: 588781
Number of HSP's gapped (non-prelim): 1888
length of query: 321
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 179
effective length of database: 9,027,425,369
effective search space: 1615909141051
effective search space used: 1615909141051
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)