BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 045061
(359 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 142/362 (39%), Positives = 199/362 (54%), Gaps = 17/362 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V+ G+P +L+ DTGS L WTQC PC F Q PIFN AS TY+ +PC
Sbjct: 91 YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQHQF 150
Query: 67 C--RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C + F+C + +CV+RI YAGG++ +G+ + + +++ FGCS DN++
Sbjct: 151 CTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRI----PFYFGCSRDNQN 206
Query: 125 FS---FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY--AYREMEATSILRFGKDA 179
FS G GI+G ++SP SLL Q+ + FSYCL ATS+LRFG D
Sbjct: 207 FSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRFGNDI 266
Query: 180 NIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
R+ + R +Y+L+L D+SVA +R+ PGTFAL+ +GTGG +ID+G
Sbjct: 267 RKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIIDSGTAV 326
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FRAYASMTFHFDRADF 297
T+I + Y V+ F +F G QR+ N CY+ F Y SM FHF ADF
Sbjct: 327 TYISQTAYFPVITAFKNYFDQHGFQRV-NIQLSGYICYKQQGHTFHNYPSMAFHFQGADF 385
Query: 298 KVEPTYMYFIFQNEGYFCVAIS--FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
VEP Y+Y Q+ G FCVA+ + +++GA Q +T+F+YD + F PENC
Sbjct: 386 FVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFTPENC-Q 444
Query: 356 DH 357
DH
Sbjct: 445 DH 446
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/359 (35%), Positives = 186/359 (51%), Gaps = 46/359 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G P +L+ DTGS LIWT VN NQ+
Sbjct: 91 YLVKVRIGNPGIPLYLVPDTGSALIWT-----VN--NQNI-------------------- 123
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFS 126
F+C N +C + Y G+ +G+ + + ++ FGCS DN++FS
Sbjct: 124 -----FQCRNNKCSYTRRYDDGSITTGVAAQDILQSEGSERIP----FYFGCSRDNQNFS 174
Query: 127 F---DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL-VYAY-REMEATSILRFGKDANI 181
G G++G + SP SLL QL Q FSYCL Y + E +S+LRFG D
Sbjct: 175 VFEHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPPPSSLLRFGNDIRK 234
Query: 182 QRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
R+ ++ + DR + Y+L+L D++VA R+ PGTFALR++GTGG +ID+G T
Sbjct: 235 GRRRFQSTPLMSSPDRPN-YFLNLLDMTVAGQRLHLPPGTFALRQDGTGGTIIDSGTGLT 293
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
FI + Y ++ F +F G QR+H D Y +R + F +ASMTFHF+RADF V
Sbjct: 294 FITQTAYPRLISAFQNYFDHRGFQRVHIPEFDLCYSFRGNHTFHDHASMTFHFERADFTV 353
Query: 300 EPTYMYFIFQNEGYFCVAISFS--DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
+ Y+Y +++ FCVA+ + + +V+GA Q +TRF+YD + F+ ENC ND
Sbjct: 354 QADYVYLPMEDDNAFCVALQPTPPQQRTVIGAINQGNTRFIYDAAAHQLLFIAENCRND 412
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 170/353 (48%), Gaps = 17/353 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++V GTP S + DTGS LIWTQC PC CF+Q PIFNP SS++ +PC+
Sbjct: 96 YLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQY 155
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C+ P C N +C + Y G++ G ++TETFTF + VP + FGC DN+ F
Sbjct: 156 CQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSS----VPNIAFGCGEDNQGF 211
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
GN AG++G P SL QL G FSYC+ +T L +
Sbjct: 212 G-QGNGAGLIGMGWGPLSLPSQL---GVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSP 267
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
T+ ++YY++LQ I+V +G TF L+ +GTGG +ID+G T++ +
Sbjct: 268 STTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDA 327
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASMTFHFDRADFKVEPTY 303
Y V + F + + +S C++ D ++ FD +
Sbjct: 328 YNAVAQAFTDQIN---LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQN 384
Query: 304 MYFIFQNEGYFCVAISFSDR--NSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ I EG C+A+ S + S+ G QQQ+T+ +YDL + FVP C
Sbjct: 385 I-LISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 436
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 169/352 (48%), Gaps = 16/352 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++V GTP+ S + DTGS LIWTQC PC CF+Q PIFNP SS++ +PC+
Sbjct: 96 YLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQY 155
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFS 126
C+ P C + Y G+S G ++TETFTF + VP + FGC DN+ F
Sbjct: 156 CQDLPSESCYNDCQYTYGYGDGSSTQGYMATETFTFETSS----VPNIAFGCGEDNQGFG 211
Query: 127 FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDM 186
GN AG++G P SL QL G FSYC+ + +T L +
Sbjct: 212 -QGNGAGLIGMGWGPLSLPSQL---GVGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPS 267
Query: 187 KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
T+ ++YY++LQ I+V +G TF L+ +GTGG +ID+G T++ + Y
Sbjct: 268 TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAY 327
Query: 247 EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASMTFHFDRADFKVEPTYM 304
V + F + + +S C++ D ++ FD + +
Sbjct: 328 NAVAQAFTDQIN---LSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEENV 384
Query: 305 YFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
I EG C+A+ S + S+ G QQQ+T+ +YDL + FVP C
Sbjct: 385 -LISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 435
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 187/355 (52%), Gaps = 14/355 (3%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + + + GTPS S + DTGS L WTQC PC +C+ Q PI++P+ SSTY ++PC
Sbjct: 112 NGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCS 171
Query: 64 DLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C+ P + C C + +Y +S G++S E+FT ++ +P + FGC +N
Sbjct: 172 SSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQS----LPHIAFGCGQEN 227
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ G++GF P SL+ QL + FSYCLV TS L GK A++
Sbjct: 228 -EGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLN 286
Query: 183 RKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K + + + RS + YYLSL+ ISV + A GTF L+ +GTGG +ID+G T+
Sbjct: 287 AKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTY 346
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHFDRADFK 298
+++ Y+VV + +S ++ ++ + C+ + S + ++TFHF+ ADF
Sbjct: 347 LEQSGYDVVKKAV---ISSINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHFEGADFN 403
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y + G C+A+ S+ S+ G QQQ+ + +YD + F P C
Sbjct: 404 L-PKENYIYTDSSGIACLAMLPSNGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 123/375 (32%), Positives = 186/375 (49%), Gaps = 42/375 (11%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +D+ GTP+ + + DTGS L+WTQC PCV CFNQS P+F+P++SSTY +
Sbjct: 96 HAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAAL 155
Query: 61 PCDDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
PC +C P +C + +C + Y +S G+++ ETFT K KL P V FGC
Sbjct: 156 PCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLA-KTKL---PDVAFGCG 211
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSILRFGK 177
+ N F AG++G P SL+ QL GL FSYCL + + S L G
Sbjct: 212 DTNEGDGFTQG-AGLVGLGRGPLSLVSQL-----GLNKFSYCLT--SLDDTSKSPLLLGS 263
Query: 178 DANI-----QRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
A I ++T + + S S YY++L+ ++V I FA++ +GTGG
Sbjct: 264 LATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGV 323
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS--- 287
++D+G T+++ Y + + F +M + D D+ F A AS
Sbjct: 324 IVDSGTSITYLELQGYRALKKAF--------AAQMKLPAADGSG-IGLDTCFEAPASGVD 374
Query: 288 ------MTFHFDRADFKVEPTYMYFIFQN-EGYFCVAISFSDRNSVVGAWQQQDTRFVYD 340
+ FH D AD + P Y + + G C+ + S S++G +QQQ+ +FVYD
Sbjct: 375 QVEVPKLVFHLDGADLDL-PAENYMVLDSGSGALCLTVMGSRGLSIIGNFQQQNIQFVYD 433
Query: 341 LNTGTIQFVPENCAN 355
+ T+ F P CA
Sbjct: 434 VGENTLSFAPVQCAK 448
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 186/376 (49%), Gaps = 39/376 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H + + +++ G P+ + DTGS LIWTQC PC CF+Q PIF+P SS+Y ++
Sbjct: 102 HGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKV 161
Query: 61 PCDDLICRR-PPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C +C P C + C + Y +S GL++TETFTF +N + G+ FG
Sbjct: 162 GCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENS---ISGIGFG 218
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C +N F +G++G P SL+ QLK T FSYCL + + EA+S L G
Sbjct: 219 CGVENEGDGFSQG-SGLVGLGRGPLSLISQLKETK---FSYCLT-SIEDSEASSSLFIGS 273
Query: 178 ---------DANIQRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
AN+ + KT+ + D+ S YYL LQ I+V R+ TF L +G
Sbjct: 274 LASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDG 333
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-----NASEDWEYCYRYDSR 281
TGG +ID+G T+++ E + E FTS RM + S + C++ +
Sbjct: 334 TGGMIIDSGTTITYLE----ETAFKVLKEEFTS----RMSLPVDDSGSTGLDLCFKLPNA 385
Query: 282 FR--AYASMTFHFDRADFKVEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFV 338
+ A + FHF AD ++ P Y + + G C+A+ S+ S+ G QQQ+ +
Sbjct: 386 AKNIAVPKLIFHFKGADLEL-PGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVL 444
Query: 339 YDLNTGTIQFVPENCA 354
+DL T+ FVP C
Sbjct: 445 HDLEKETVTFVPTECG 460
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 182/361 (50%), Gaps = 21/361 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + + + G+P +S + DTGS LIWTQC PC CF+QS PIF+P SS++ +I C
Sbjct: 108 NGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCS 167
Query: 64 DLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSND 121
+C P C + C + Y +S G+++ ETFTF + + +PG+ FGC ND
Sbjct: 168 SELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGND 227
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F AG++G P SL+ QLK F+YCL A + + +S+L G ANI
Sbjct: 228 NNGDGFSQG-AGLVGLGRGPLSLVSQLKEQK---FAYCLT-AIDDSKPSSLL-LGSLANI 281
Query: 182 QRK----DMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
K +MKT + + S S YYLSLQ ISV ++ TF L +G+GG +ID+G
Sbjct: 282 TPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSG 341
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR--AYASMTFHFD 293
T+++ + + F + + + C+ + +TFHF
Sbjct: 342 TTITYVENSAFTSLKNEF---IAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK 398
Query: 294 RADFKVEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
AD ++ P Y I ++ G C+AI S S+ G QQQ+ V+DL T+ F+P
Sbjct: 399 GADLEL-PGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQ 457
Query: 353 C 353
C
Sbjct: 458 C 458
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 182/369 (49%), Gaps = 39/369 (10%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR 68
+++ G P+ + DTGS LIWTQC PC CF+Q PIF+P SS+Y ++ C +C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 69 R-PPFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
P C + C + Y +S GL++TETFTF +N + G+ FGC +N
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS---ISGIGFGCGVENEGD 117
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK-------- 177
F +G++G P SL+ QLK T FSYCL + + EA+S L G
Sbjct: 118 GFSQG-SGLVGLGRGPLSLISQLKETK---FSYCLT-SIEDSEASSSLFIGSLASGIVNK 172
Query: 178 -DANIQRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
A++ + KT+ + D+ S YYL LQ I+V R+ TF L +GTGG +ID+
Sbjct: 173 TGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDS 232
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-----NASEDWEYCYRYDSRFR--AYAS 287
G T+++ E + E FTS RM + S + C++ + A
Sbjct: 233 GTTITYLE----ETAFKVLKEEFTS----RMSLPVDDSGSTGLDLCFKLPDAAKNIAVPK 284
Query: 288 MTFHFDRADFKVEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
M FHF AD ++ P Y + + G C+A+ S+ S+ G QQQ+ ++DL T+
Sbjct: 285 MIFHFKGADLEL-PGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETV 343
Query: 347 QFVPENCAN 355
FVP C
Sbjct: 344 SFVPTECGK 352
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 186/376 (49%), Gaps = 39/376 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H + + +++ G P+ + DTGS LIWTQC PC CF+Q PIF+P SS+Y ++
Sbjct: 101 HGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKV 160
Query: 61 PCDDLICRR-PPFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C +C P C + C + Y +S GL++TETFTF +N + G+ FG
Sbjct: 161 GCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS---ISGIGFG 217
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C +N F +G++G P SL+ QLK T FSYCL + + EA+S L G
Sbjct: 218 CGVENEGDGFSQG-SGLVGLGRGPLSLISQLKETK---FSYCLT-SIEDSEASSSLFIGS 272
Query: 178 ---------DANIQRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
A++ + KT+ + D+ S YYL LQ I+V R+ TF L +G
Sbjct: 273 LASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDG 332
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-----NASEDWEYCYRYDSR 281
TGG +ID+G T+++ ++V E FTS RM + S + C++
Sbjct: 333 TGGMIIDSGTTITYLEETAFKV----LKEEFTS----RMSLPVDDSGSTGLDLCFKLPDA 384
Query: 282 FR--AYASMTFHFDRADFKVEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFV 338
+ A M FHF AD ++ P Y + + G C+A+ S+ S+ G QQQ+ +
Sbjct: 385 AKNIAVPKMIFHFKGADLEL-PGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVL 443
Query: 339 YDLNTGTIQFVPENCA 354
+DL T+ FVP C
Sbjct: 444 HDLEKETVSFVPTECG 459
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 178/356 (50%), Gaps = 16/356 (4%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +++ GTP+++ + DTGS LIWTQC PC CF+Q PIF+P SS++ ++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKL 150
Query: 61 PCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
PC +C P + C +R +Y +S G+++TETFTF + V + FGC
Sbjct: 151 PCSSDLCVALPISSCSDGCEYRYSYGDHSSTQGVLATETFTFGDAS----VSKIGFGCGE 206
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DNR ++ AG++G P SL+ QL FSYCL + + + S L G +A
Sbjct: 207 DNRGRAYSQG-AGLVGLGRGPLSLISQLGVPK---FSYCLT-SIDDSKGISTLLVGSEAT 261
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
++ + R S YYLSL+ ISV D + TF+++ +G+GG +ID+G T+
Sbjct: 262 VKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASMTFHFDRADFK 298
++ + + + F + + S + E C+ D + FHF+ D K
Sbjct: 322 LKDNAFAALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLK 378
Query: 299 VEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y I + C+ + S S+ G +QQQ+ ++DL TI F P C
Sbjct: 379 L-PKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 169/353 (47%), Gaps = 17/353 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP++ + DTGS LIWTQC PC CFNQS PIFNP SS++ +PC +
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C+ C N C + Y G+ G + TET TF V +P + FGC +N+ F
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGCGENNQGF 210
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
GN AG++G P SL QL T FSYC+ + +L ++
Sbjct: 211 G-QGNGAGLVGMGRGPLSLPSQLDVTK---FSYCMTPIGSSTPSNLLLGSLANSVTAGSP 266
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAIATFIQRG 244
T+ + YY++L +SV R+ P FAL NGTGG +ID+G T+
Sbjct: 267 NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNN 326
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF--HFDRADFKVEPT 302
Y+ V + F + ++ +S ++ C++ S TF HFD D ++ P+
Sbjct: 327 AYQSVRQEF---ISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLEL-PS 382
Query: 303 YMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
YFI + G C+A+ S + S+ G QQQ+ VYD + F C
Sbjct: 383 ENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 178/356 (50%), Gaps = 16/356 (4%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +++ GTP+++ + DTGS LIWTQC PC CF+Q PIF+P SS++ ++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKL 150
Query: 61 PCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
PC +C P + C +R +Y +S G+++TETFTF + V + FGC
Sbjct: 151 PCSSDLCVALPISSCSDGCEYRYSYGDHSSTQGVLATETFTFGDAS----VSKIGFGCGE 206
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DNR ++ AG++G P SL+ QL FSYCL + + + S L G +A
Sbjct: 207 DNRGRAYSQG-AGLVGLGRGPLSLISQLGVPK---FSYCLT-SIDDSKGISTLLVGSEAT 261
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
++ + R S YYLSL+ ISV D + TF+++ +G+GG +ID+G T+
Sbjct: 262 VKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASMTFHFDRADFK 298
++ + + + F + + S + E C+ D + FHF+ D K
Sbjct: 322 LKDSAFAALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLK 378
Query: 299 VEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y I + C+ + S S+ G +QQQ+ ++DL TI F P C
Sbjct: 379 L-PKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 182/361 (50%), Gaps = 21/361 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + + + G+P +S + DTGS LIWTQC PC CF+QS PIF+P SS++ +I C
Sbjct: 363 NGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCS 422
Query: 64 DLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSND 121
+C P C + C + Y +S G+++ ETFTF + + +PG+ FGC ND
Sbjct: 423 SELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGND 482
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F AG++G P SL+ QLK F+YCL A + + +S+L G ANI
Sbjct: 483 NNGDGFSQG-AGLVGLGRGPLSLVSQLKEQK---FAYCLT-AIDDSKPSSLL-LGSLANI 536
Query: 182 QRK----DMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
K +MKT + + S S YYLSLQ ISV ++ TF L +G+GG +ID+G
Sbjct: 537 TPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSG 596
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR--AYASMTFHFD 293
T+++ + + F + + + C+ + +TFHF
Sbjct: 597 TTITYVENSAFTSLKNEF---IAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK 653
Query: 294 RADFKVEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
AD ++ P Y I ++ G C+AI S S+ G QQQ+ V+DL T+ F+P
Sbjct: 654 GADLEL-PGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQ 712
Query: 353 C 353
C
Sbjct: 713 C 713
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 185/369 (50%), Gaps = 32/369 (8%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +DV GTP+ + + DTGS L+WTQC PCV+CF QS P+F+P++SSTY +
Sbjct: 99 HAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATV 158
Query: 61 PCDDLICRR-PPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
PC C P +C + +C + Y +S G+++TETFT K+KL PGV+FGC
Sbjct: 159 PCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLA-KSKL---PGVVFGC 214
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSILRFG 176
+ N F AG++G P SL+ QL GL FSYCL + S L G
Sbjct: 215 GDTNEGDGFSQG-AGLVGLGRGPLSLVSQL-----GLDKFSYCLT--SLDDTNNSPLLLG 266
Query: 177 KDANI-----QRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
A I ++T + + S S YY+SL+ I+V RI FA++ +GTGG
Sbjct: 267 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 326
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF---RAYA 286
++D+G T+++ Y + + F + + C+R ++
Sbjct: 327 VIVDSGTSITYLEVQGYRALKKAFAAQMA---LPAADGSGVGLDLCFRAPAKGVDQVEVP 383
Query: 287 SMTFHFD-RADFKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTG 344
+ FHFD AD + P Y + G C+ + S S++G +QQQ+ +FVYD+
Sbjct: 384 RLVFHFDGGADLDL-PAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHD 442
Query: 345 TIQFVPENC 353
T+ F P C
Sbjct: 443 TLSFAPVQC 451
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 19/354 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP++ + DTGS LIWTQC PC CFNQS PIFNP SS++ +PC +
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 67 CR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ + P C N C + Y G+ G + TET TF V +P + FGC +N+
Sbjct: 155 CQALQSP-TCSNNSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGCGENNQG 209
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F GN AG++G P SL QL T FSYC+ +T +L ++
Sbjct: 210 FG-QGNGAGLVGMGRGPLSLPSQLDVTK---FSYCMTPIGSSNSSTLLLGSLANSVTAGS 265
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAIATFIQR 243
T+ + YY++L +SV + P F L NGTGG +ID+G T+
Sbjct: 266 PNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVD 325
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF--HFDRADFKVEP 301
Y+ V + F + ++ +S ++ C++ S TF HFD D V P
Sbjct: 326 NAYQAVRQAF---ISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDL-VLP 381
Query: 302 TYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ YFI + G C+A+ S + S+ G QQQ+ VYD + F+ C
Sbjct: 382 SENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQCG 435
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 185/369 (50%), Gaps = 32/369 (8%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +DV GTP+ + + DTGS L+WTQC PCV+CF QS P+F+P++SSTY +
Sbjct: 89 HAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATV 148
Query: 61 PCDDLICRR-PPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
PC C P +C + +C + Y +S G+++TETFT K+KL PGV+FGC
Sbjct: 149 PCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLA-KSKL---PGVVFGC 204
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSILRFG 176
+ N F AG++G P SL+ QL GL FSYCL + S L G
Sbjct: 205 GDTNEGDGFSQG-AGLVGLGRGPLSLVSQL-----GLDKFSYCLT--SLDDTNNSPLLLG 256
Query: 177 KDANI-----QRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
A I ++T + + S S YY+SL+ I+V RI FA++ +GTGG
Sbjct: 257 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 316
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF---RAYA 286
++D+G T+++ Y + + F + + C+R ++
Sbjct: 317 VIVDSGTSITYLEVQGYRALKKAFAAQMA---LPAADGSGVGLDLCFRAPAKGVDQVEVP 373
Query: 287 SMTFHFD-RADFKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTG 344
+ FHFD AD + P Y + G C+ + S S++G +QQQ+ +FVYD+
Sbjct: 374 RLVFHFDGGADLDL-PAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHD 432
Query: 345 TIQFVPENC 353
T+ F P C
Sbjct: 433 TLSFAPVQC 441
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 177/356 (49%), Gaps = 16/356 (4%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + + + GTP+++ + DTGS LIWTQC PC +CF+Q PIF+P SS++ ++
Sbjct: 91 HAGNGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKL 150
Query: 61 PCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
PC +C P + C + +Y +S G+++TETF F + V + FGC
Sbjct: 151 PCSSDLCAALPISSCSDGCEYLYSYGDYSSTQGVLATETFAFGDAS----VSKIGFGCGE 206
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN F AG++G P SL+ QL + FSYCL + + + S L G +A
Sbjct: 207 DNDGSGFSQG-AGLVGLGRGPLSLISQL---GEPKFSYCLT-SMDDSKGISSLLVGSEAT 261
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
++ + + S YYLSL+ ISV D + TF+++ +G+GG +ID+G T+
Sbjct: 262 MKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITY 321
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASMTFHFDRADFK 298
++ + + + F + + S + C+ D+ + FHF+ AD K
Sbjct: 322 LEDSAFAALKKEF---ISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGADLK 378
Query: 299 VEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y I + G C+ + S S+ G +QQQ+ ++DL TI F P C
Sbjct: 379 L-PAENYIIADSGLGVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 185/369 (50%), Gaps = 32/369 (8%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +DV GTP+ + + DTGS L+WTQC PCV+CF QS P+F+P++SSTY +
Sbjct: 68 HAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATV 127
Query: 61 PCDDLICRR-PPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
PC C P +C + +C + Y +S G+++TETFT K+KL PGV+FGC
Sbjct: 128 PCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLA-KSKL---PGVVFGC 183
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSILRFG 176
+ N F AG++G P SL+ QL GL FSYCL + S L G
Sbjct: 184 GDTNEGDGFSQG-AGLVGLGRGPLSLVSQL-----GLDKFSYCLT--SLDDTNNSPLLLG 235
Query: 177 KDANI-----QRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
A I ++T + + S S YY+SL+ I+V RI FA++ +GTGG
Sbjct: 236 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 295
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF---RAYA 286
++D+G T+++ Y + + F + + C+R ++
Sbjct: 296 VIVDSGTSITYLEVQGYRALKKAFAAQMA---LPAADGSGVGLDLCFRAPAKGVDQVEVP 352
Query: 287 SMTFHFD-RADFKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTG 344
+ FHFD AD + P Y + G C+ + S S++G +QQQ+ +FVYD+
Sbjct: 353 RLVFHFDGGADLDL-PAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHD 411
Query: 345 TIQFVPENC 353
T+ F P C
Sbjct: 412 TLSFAPVQC 420
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 19/354 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP++ + DTGS LIWTQC PC CFNQS PIFNP SS++ +PC +
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 67 CR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ + P C N C + Y G+ G + TET TF V +P + FGC +N+
Sbjct: 155 CQALQSP-TCSNNSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGCGENNQG 209
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F GN AG++G P SL QL T FSYC+ +T +L ++
Sbjct: 210 FG-QGNGAGLVGMGRGPLSLPSQLDVTK---FSYCMTPIGSSTSSTLLLGSLANSVTAGS 265
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAIATFIQR 243
T+ + YY++L +SV + P F L NGTGG +ID+G T+
Sbjct: 266 PNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFAD 325
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF--HFDRADFKVEP 301
Y+ V + F + ++ +S ++ C++ S TF HFD D V P
Sbjct: 326 NAYQAVRQAF---ISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDL-VLP 381
Query: 302 TYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ YFI + G C+A+ S + S+ G QQQ+ VYD + F+ C
Sbjct: 382 SENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQCG 435
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 182/363 (50%), Gaps = 28/363 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV G+P + + DTGS LIWTQC PC+ C Q P F P S++Y +PC +
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAM 144
Query: 67 CRR--PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C P C CV++ Y AS++G+++ ETFTF + V VP V FGC N N
Sbjct: 145 CNALYSPL-CFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAG 203
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F+G +G++GF SL+ QL S FSYCL ATS L FG A +
Sbjct: 204 TLFNG--SGMVGFGRGALSLVSQLGSPR---FSYCLTSFMS--PATSRLYFGAYATLNST 256
Query: 185 D------MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRR-NGTGGCMIDTG 235
+ +++ V+ + + Y+L++ ISVA + P FA+ +GTGG +ID+G
Sbjct: 257 NTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSG 316
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDS---RFRAYASMTFH 291
TF+ + Y +V F G R + S+ ++ C+++ R M H
Sbjct: 317 TTVTFLAQPAYAMVQGAF---VAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLH 373
Query: 292 FDRADFKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
FD AD ++ P Y + G C+A+ SD S++G++Q Q+ +YDL + FVP
Sbjct: 374 FDGADMEL-PLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFVP 432
Query: 351 ENC 353
C
Sbjct: 433 APC 435
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 182/363 (50%), Gaps = 28/363 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV G+P + + DTGS LIWTQC PC+ C Q P F P S++Y +PC +
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAM 147
Query: 67 CRR--PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C P C CV++ Y AS++G+++ ETFTF + V VP V FGC N N
Sbjct: 148 CNALYSPL-CFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAG 206
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F+G+ G++GF SL+ QL S FSYCL ATS L FG A +
Sbjct: 207 TLFNGS--GMVGFGRGALSLVSQLGSPR---FSYCLTSFMS--PATSRLYFGAYATLNST 259
Query: 185 D------MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRR-NGTGGCMIDTG 235
+ +++ V+ + + Y+L++ ISVA + P FA+ +GTGG +ID+G
Sbjct: 260 NTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSG 319
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDS---RFRAYASMTFH 291
TF+ + Y +V F G R + S+ ++ C+++ R M H
Sbjct: 320 TTVTFLAQPAYAMVQGAF---VAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLH 376
Query: 292 FDRADFKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
FD AD ++ P Y + G C+A+ SD S++G++Q Q+ +YDL + FVP
Sbjct: 377 FDGADMEL-PLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFVP 435
Query: 351 ENC 353
C
Sbjct: 436 APC 438
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 184/356 (51%), Gaps = 23/356 (6%)
Query: 18 KSEFLLFDTGSYLIWTQCLPCVN----CFNQSAPIFNPNASSTYKRIPCDD-LICRRPPF 72
K+ + DTG+ L W QC C N CF P + + S +YK + C+ C P
Sbjct: 99 KTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCNQHSFCE--PN 156
Query: 73 RCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNRD----FSF 127
+C+ G C + + Y G+ SG ++ ETFTF+ + K + + FGCS D+R+ F
Sbjct: 157 QCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLL 216
Query: 128 DGN-IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDM 186
D N ++G+LG P S L QL S + G FSYC+ + LRFGK ++ K++
Sbjct: 217 DKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTH---NTYLRFGKHV-VKSKNL 272
Query: 187 KTIR-MFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+T + M V S+ Y+++L ISV ++ A+R++G+ GC+ID G +AT + +
Sbjct: 273 QTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCIIDAGTLATLLVKPI 332
Query: 246 YEVVMRHFDEHFTS---FGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
++ + H +S R +H +D Y D+ + +TFH + AD +V+P
Sbjct: 333 FDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLENADLEVKPE 392
Query: 303 YMYFI--FQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
++ F+ + FC+++ D +++GA+QQ +FVYD + F PE+C +
Sbjct: 393 AIFLFREFEGKNVFCLSMLSDDSKTIIGAYQQMKQKFVYDTKARVLSFGPEDCEKN 448
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 187/376 (49%), Gaps = 41/376 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +D+ GTP+ + + DTGS L+WTQC PCV CFNQS P+F+P++SSTY +
Sbjct: 112 HAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTL 171
Query: 61 PCDDLICRR-PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
PC +C P C + C + Y +S G+++ ETFT K KL PGV FG
Sbjct: 172 PCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLA-KTKL---PGVAFG 227
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C + N F AG++G P SL+ QL G FSYCL + + S L G
Sbjct: 228 CGDTNEGDGFTQG-AGLVGLGRGPLSLVSQL---GLGKFSYCLT--SLDDTSKSPLLLGS 281
Query: 178 DANIQRKD-----MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
A I ++T + + S S YY++L+ ++V RI FA++ +GTGG
Sbjct: 282 LAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGV 341
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS--- 287
++D+G T+++ Y R + F + + + + S D F+A AS
Sbjct: 342 IVDSGTSITYLELQGY----RPLKKAFAAQMKLPVADGSA-----VGLDLCFKAPASGVD 392
Query: 288 ------MTFHFD-RADFKVEPTYMYFIFQN-EGYFCVAISFSDRNSVVGAWQQQDTRFVY 339
+ HFD AD + P Y + + G C+ + S S++G +QQQ+ +FVY
Sbjct: 393 DVEVPKLVLHFDGGADLDL-PAENYMVLDSASGALCLTVMGSRGLSIIGNFQQQNIQFVY 451
Query: 340 DLNTGTIQFVPENCAN 355
D++ T+ F P CA
Sbjct: 452 DVDKDTLSFAPVQCAK 467
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 120/358 (33%), Positives = 176/358 (49%), Gaps = 14/358 (3%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N Y +++ GTP S + DTGS LIWTQC PC C+ Q PIF+P SS++ ++
Sbjct: 102 HAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKV 161
Query: 61 PCDDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C +C P C +G C + +Y + G+++TETFTF V V + FGC
Sbjct: 162 SCGSSLCSALPSSTCSDG-CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCG 220
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
DN F+ +G++G P SL+ QLK + FSYCL E S+L G
Sbjct: 221 EDNEGDGFE-QASGLVGLGRGPLSLVSQLK---EQRFSYCLTPIDDTKE--SVLLLGSLG 274
Query: 180 NIQ-RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
++ K++ T + + + S YYLSL+ ISV D R+ TF + +G GG +ID+G
Sbjct: 275 KVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGT 334
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
T++Q+ YE + + F T + + D + S + FHF D
Sbjct: 335 TITYVQQKAYEALKKEFISQ-TKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGD 393
Query: 297 FKVEPTYMYFIF-QNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ P Y I N G C+A+ S S+ G QQQ+ +DL TI FVP +C
Sbjct: 394 LEL-PAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 116/369 (31%), Positives = 185/369 (50%), Gaps = 29/369 (7%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +DV GTP+ S + DTGS L+WTQC PCV+CF QS P+F+P++SSTY +
Sbjct: 94 HAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATV 153
Query: 61 PCDDLICRR-PPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
PC +C P C + +C + Y +S G++++ETFT + K +PGV FGC
Sbjct: 154 PCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKK--LPGVAFGC 211
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSILRFG 176
+ N F AG++G P SL+ QL GL FSYCL + + + S L G
Sbjct: 212 GDTNEGDGFTQG-AGLVGLGRGPLSLVSQL-----GLDKFSYCLT-SLDDGDGKSPLLLG 264
Query: 177 -----KDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
+ ++T + + S S YY+SL ++V RI FA++ +GTGG
Sbjct: 265 GSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGG 324
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF---RAYA 286
++D+G T+++ Y + + F + + + C++ ++
Sbjct: 325 VIVDSGTSITYLELQGYRALKKAF---VAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVP 381
Query: 287 SMTFHFD-RADFKVEPTYMYFIFQN-EGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTG 344
+ HFD AD + P Y + + G C+ ++ S S++G +QQQ+ +FVYD+
Sbjct: 382 KLVLHFDGGADLDL-PAENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVAGD 440
Query: 345 TIQFVPENC 353
T+ F P C
Sbjct: 441 TLSFAPVQC 449
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 181/360 (50%), Gaps = 21/360 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + + DTGS LIWTQC PC+ C +Q P F+P S +Y ++PC+ +
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPM 148
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + C CV++ Y A+ +G++S ETFTF + V VP + FGC N N
Sbjct: 149 CNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGS 208
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR-- 183
F+G +G++GF P SL+ QL S FSYCL + S L FG A +
Sbjct: 209 LFNG--SGMVGFGRGPLSLVSQLGSPR---FSYCLTSFMSPVP--SRLYFGAYATLNSTS 261
Query: 184 ----KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGA 236
+ +++ V+ + YYL++ ISV + P FA+ +GTGG +ID+G+
Sbjct: 262 ASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGS 321
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM---TFHFD 293
T++ R Y++V + F + + ++ + C+ + R +M FHF+
Sbjct: 322 TITYLARAAYDMVHQAFADQ-VGLPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHFE 380
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
A+ ++ I + G C+AI+ SD S++G++Q Q+ +YD + F P C
Sbjct: 381 GANMELPLENYMLIDGDTGNLCLAIAASDDGSIIGSFQHQNFHVLYDNENSLLSFTPATC 440
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 120/358 (33%), Positives = 176/358 (49%), Gaps = 14/358 (3%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N Y +++ GTP S + DTGS LIWTQC PC C+ Q PIF+P SS++ ++
Sbjct: 102 HAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKV 161
Query: 61 PCDDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C +C P C +G C + +Y + G+++TETFTF V V + FGC
Sbjct: 162 SCGSSLCSAVPSSTCSDG-CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCG 220
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
DN F+ +G++G P SL+ QLK + FSYCL E SIL G
Sbjct: 221 EDNEGDGFE-QASGLVGLGRGPLSLVSQLK---EPRFSYCLTPMDDTKE--SILLLGSLG 274
Query: 180 NIQ-RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
++ K++ T + + + S YYLSL+ ISV D R+ TF + +G GG +ID+G
Sbjct: 275 KVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGT 334
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
T+I++ +E + + F T + + D + S + FHF D
Sbjct: 335 TITYIEQKAFEALKKEFISQ-TKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGD 393
Query: 297 FKVEPTYMYFIF-QNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ P Y I N G C+A+ S S+ G QQQ+ +DL TI FVP +C
Sbjct: 394 LEL-PAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 186/358 (51%), Gaps = 23/358 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + + + GTP ++ + DTGS LIWTQC PC CF+Q PIF+P SS++ ++ C
Sbjct: 94 NGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCS 153
Query: 64 DLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C P C +G C + Y +S G++++ET TF V VP V FGC DN
Sbjct: 154 SKLCEALPQSTCSDG-CEYLYGYGDYSSTQGMLASETLTF----GKVSVPEVAFGCGEDN 208
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F +G++G P SL+ QLK FSYCL + + +A+++L G A+++
Sbjct: 209 EGSGFSQG-SGLVGLGRGPLSLVSQLKEPK---FSYCLT-SVDDTKASTLL-MGSLASVK 262
Query: 183 RKD--MKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
D +KT + + + S YYLSL+ ISV D + TF+L+ +G+GG +ID+G
Sbjct: 263 ASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTI 322
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR--AYASMTFHFDRAD 296
T++++ +++V + F ++ S E C+ S + FHFD AD
Sbjct: 323 TYLEQSAFDLVAKEFTSQIN---LPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGAD 379
Query: 297 FKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ P Y I + G C+A+ S S+ G QQQ+ ++DL T+ F+P C
Sbjct: 380 LEL-PAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 119/358 (33%), Positives = 180/358 (50%), Gaps = 32/358 (8%)
Query: 12 LFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
+ GTP+ + + DTGS L+WTQC PCV+CF QS P+F+P++SSTY +PC C P
Sbjct: 172 VIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP 231
Query: 71 PFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDG 129
+C + +C + Y +S G+++TETFT K+KL PGV+FGC + N F
Sbjct: 232 TSKCTSASKCGYTYTYGDSSSTQGVLATETFTLA-KSKL---PGVVFGCGDTNEGDGFS- 286
Query: 130 NIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSILRFGKDANI-----Q 182
AG++G P SL+ QL GL FSYCL + S L G A I
Sbjct: 287 QGAGLVGLGRGPLSLVSQL-----GLDKFSYCLTSL--DDTNNSPLLLGSLAGISEASAA 339
Query: 183 RKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
++T + + S S YY+SL+ I+V RI FA++ +GTGG ++D+G T+
Sbjct: 340 ASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITY 399
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF---RAYASMTFHFD-RAD 296
++ Y + + F + + C+R ++ + FHFD AD
Sbjct: 400 LEVQGYRALKKAFAAQMA---LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGAD 456
Query: 297 FKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y + G C+ + S S++G +QQQ+ +FVYD+ T+ F P C
Sbjct: 457 LDL-PAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 175/361 (48%), Gaps = 20/361 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+K ++ DTGS LIW QC PC CFNQ PIF+P SS+Y + C D +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRDF 125
C P + + C + Y G+ G +S+ET T + + + + FGC + NR
Sbjct: 100 CDSLPRKSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRG- 158
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
SF+ + +G++G S + QL FSYCLV TS + FG +++
Sbjct: 159 SFN-DASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSG 217
Query: 186 MKTIRMFVD------RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
K F S YY+ L+DIS+A + G+F ++ +G+GG + D+G T
Sbjct: 218 KKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLT 277
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY----ASMTFHFDRA 295
+ PY++V+R SF ++ +S + CY +Y +M FHF+ A
Sbjct: 278 LLPDAPYQIVLRALRSKI-SF--PKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFEGA 334
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAW---QQQDTRFVYDLNTGTIQFVPEN 352
D+++ P YFI N+ V ++ N +G + QQ+ R +YD+ + I + P
Sbjct: 335 DYQL-PVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQ 393
Query: 353 C 353
C
Sbjct: 394 C 394
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 127/372 (34%), Positives = 183/372 (49%), Gaps = 41/372 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDD- 64
Y + + GTP S + DTGS LIWTQC PC CF Q+ +NP++S+T+ +PC+
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSS 147
Query: 65 ------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFG 117
L PP C C++ Y G +A G+ S ETFTF VPG+ FG
Sbjct: 148 VSMCAALAGPSPPPGCS---CMYNQTYGTGWTA-GIQSVETFTFGSTPADQTRVPGIAFG 203
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
CSN + D ++G+ AG++G SL+ QL + G+FSYCL +++ +TS L G
Sbjct: 204 CSNASSD-DWNGS-AGLVGLGRGSMSLVSQLGA---GMFSYCLT-PFQDANSTSTLLLGP 257
Query: 178 DANIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
A + + T S++YYL+L IS+ + P FALR +GTGG +I
Sbjct: 258 SAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLII 317
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMT 289
D+G T + Y+ V R E + + S + C+ S + SMT
Sbjct: 318 DSGTTITSLVDAAYQQV-RAAIESLVTLPVADGSD-STGLDLCFALTSETSTPPSMPSMT 375
Query: 290 FHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGA------WQQQDTRFVYDLNT 343
FHFD AD V P Y I G +C+A+ RN VGA +QQQ+ +YD++
Sbjct: 376 FHFDGADM-VLPVDNYMIL-GSGVWCLAM----RNQTVGAMSTFGNYQQQNVHLLYDIHE 429
Query: 344 GTIQFVPENCAN 355
T+ F P C+
Sbjct: 430 ETLSFAPAKCST 441
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 176/356 (49%), Gaps = 19/356 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + + + GTP ++ + DTGS LIWTQC PC CF+QS PIF+P SS++ ++ C
Sbjct: 94 NGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCS 153
Query: 64 DLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C P C NG C + +Y +S G++++ET TF + VP V FGC DN
Sbjct: 154 SQLCEALPQSSCNNG-CEYLYSYGDYSSTQGILASETLTFGKAS----VPNVAFGCGADN 208
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F AG++G P SL+ QLK FSYCL +T ++ N
Sbjct: 209 EGSGFSQG-AGLVGLGRGPLSLVSQLKEPK---FSYCLTTVDDTKTSTLLMGSLASVNAS 264
Query: 183 RKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+KT + + S YYLSL+ ISV D R+ TF+L+ +G+GG +ID+G T+
Sbjct: 265 SSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITY 324
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR--AYASMTFHFDRADFK 298
++ + +V + F + S + C+ S + FHFD AD +
Sbjct: 325 LEESAFNLVAKEFTAKIN---LPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGADLE 381
Query: 299 VEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y I + G C+A+ S S+ G QQQ+ ++DL T+ F+P C
Sbjct: 382 L-PAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 114/356 (32%), Positives = 177/356 (49%), Gaps = 17/356 (4%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + +++ GTP ++ + DTGS LIWTQC PC CF+Q +PIF+P SS++ ++ C
Sbjct: 97 NGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCS 156
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
+C+ P + C + Y +S G ++TETFTF V +P V FGC DN
Sbjct: 157 SQLCKALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTF----GKVSIPNVGFGCGEDNE 212
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
F +G++G P SL+ QLK + FSYCL +T ++ N
Sbjct: 213 GDGFTQG-SGLVGLGRGPLSLVSQLK---EAKFSYCLTSIDDTKTSTLLMGSLASVNGTS 268
Query: 184 KDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
++T + + + S YYLSL+ ISV R+ TF L+ +GTGG +ID+G T++
Sbjct: 269 AAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYL 328
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASMTFHFDRADFKV 299
+ +++V + F G ++ + E CY D+ + HF AD ++
Sbjct: 329 EESAFDLVKKEFTSQ---MGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFTGADLEL 385
Query: 300 EPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
P Y I + G C+A+ S S+ G QQQ+ +DL T+ F+P NC
Sbjct: 386 -PGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCG 440
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 117/363 (32%), Positives = 177/363 (48%), Gaps = 25/363 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +D+ GTP + DTGS LIWTQC PCV C +Q P F P S+TY+ +PC +
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 67 CRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNR 123
C P+ + CV++ Y AS +G++++ETFTF N V V V FGC N N
Sbjct: 152 CAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINS 211
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF-------G 176
N +G++G P SL+ QL + FSYCL ++ E S L F G
Sbjct: 212 GQL--ANSSGMVGLGRGPLSLVSQLGPSR---FSYCLT-SFLSPEP-SRLNFGVFATLNG 264
Query: 177 KDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+A+ +++ + V+ + S Y++SL+ IS+ R+ P FA+ +GTGG ID+
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY---DSRFRAYASMTFH 291
G T++Q+ Y+ V R ++ E C+ + S M H
Sbjct: 325 GTSLTWLQQDAYDAVRRELVSVLRPL--PPTNDTEIGLETCFPWPPPPSVAVTVPDMELH 382
Query: 292 FD-RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
FD A+ V P I G+ C+A+ S +++G +QQQ+ +YD+ + FVP
Sbjct: 383 FDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSFVP 442
Query: 351 ENC 353
C
Sbjct: 443 APC 445
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 173/361 (47%), Gaps = 20/361 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+K ++ DTGS LIW QC PC CFNQ PIF+P SS+Y + C D +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRDF 125
C P + + C + Y G+ G +S+ET T + + + + FGC + NR
Sbjct: 100 CDSLPRKSCSPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRG- 158
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
SF+ + +G++G S + QL FSYCLV TS + FG +++
Sbjct: 159 SFN-DASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSG 217
Query: 186 MKTIRMFVD------RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
K F S YY+ L+DIS+A + G+F ++ +G+GG + D+G T
Sbjct: 218 KKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLT 277
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY----ASMTFHFDRA 295
+ PY++V+R SF + +S + CY +Y +M FHF+ A
Sbjct: 278 LLPDAPYQIVLRALRSK-VSF--PEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEGA 334
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAW---QQQDTRFVYDLNTGTIQFVPEN 352
D ++ P YFI N+ V ++ N +G + QQ+ R +YD+ + I + P
Sbjct: 335 DHQL-PVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQ 393
Query: 353 C 353
C
Sbjct: 394 C 394
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 181/364 (49%), Gaps = 27/364 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +D+ GTP + DTGS LIWTQC PCV C +Q P F P S+TY+ +PC +
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 67 CRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNR 123
C P+ + CV++ Y AS +G++++ETFTF N V V V FGC N N
Sbjct: 152 CAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINS 211
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF-------G 176
N +G++G P SL+ QL + FSYCL ++ E S L F G
Sbjct: 212 GQL--ANSSGMVGLGRGPLSLVSQLGPSR---FSYCLT-SFLSPEP-SRLNFGVFATLNG 264
Query: 177 KDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+A+ +++ + V+ + S Y++SL+ IS+ R+ P FA+ +GTGG ID+
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-DWEYCYRY---DSRFRAYASMTF 290
G T++Q+ Y+ V RH E + N +E E C+ + S M
Sbjct: 325 GTSLTWLQQDAYDAV-RH--ELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMEL 381
Query: 291 HFD-RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
HFD A+ V P I G+ C+A+ S +++G +QQQ+ +YD+ + FV
Sbjct: 382 HFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSFV 441
Query: 350 PENC 353
P C
Sbjct: 442 PAPC 445
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 114/364 (31%), Positives = 181/364 (49%), Gaps = 31/364 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD+ GTP + DTGS LIWTQC PC+ C +Q P F+ S+TY+ +PC
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSR 148
Query: 67 C---RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDN 122
C P C CV++ Y AS +G+++ ETFTF N V + FGC + N
Sbjct: 149 CASLSSP--SCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLN 206
Query: 123 R-DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT-SILRFGKDAN 180
D + N +G++GF P SL+ QL + FSYCL + AT S L FG AN
Sbjct: 207 AGDLA---NSSGMVGFGRGPLSLVSQLGPSR---FSYCLT---SYLSATPSRLYFGVYAN 257
Query: 181 IQRKD------MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
+ + +++ ++ + + Y+LSL+ IS+ + P FA+ +GTGG +I
Sbjct: 258 LSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVII 317
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS---MT 289
D+G T++Q+ YE V R ++ M++ + C+++ + +
Sbjct: 318 DSGTSITWLQQDAYEAVRRGL---VSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLV 374
Query: 290 FHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
FHFD A+ + P I GY C+ ++ + +++G +QQQ+ +YD+ + FV
Sbjct: 375 FHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFV 434
Query: 350 PENC 353
P C
Sbjct: 435 PAPC 438
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 112/355 (31%), Positives = 181/355 (50%), Gaps = 24/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP+ S + DTGS L+WT+C PC +C ++ I++P++SSTY ++ C +
Sbjct: 42 YLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLCQSSL 99
Query: 67 CRRPP-FRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ P F C N G C + Y +S SG++S ETF+ ++ +P + FGC +DN+
Sbjct: 100 CQPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQS----LPNITFGCGHDNQG 155
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F + G++GF SL+ QL + FSYCLV + + TS L G A+++
Sbjct: 156 FD---KVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLV-SRTDSSKTSPLFIGNTASLEAT 211
Query: 185 DMKTIRMFVDRSS-HYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + + S+ HYYLSL+ ISV + GTF ++ +G+GG +ID+G TF+Q+
Sbjct: 212 TVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQ 271
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-RYDSRFRAYASMTFHFDRADFKVEPT 302
Y+ V + A + C+ + S + SMTFHF AD+ V
Sbjct: 272 TAYDAVKEAMVSSI------NLPQADGQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKE 325
Query: 303 YMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
F C+A+ ++ N ++ G QQQ+ + +YD + F P C
Sbjct: 326 NYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 166/346 (47%), Gaps = 12/346 (3%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR--RPP 71
GTP + + DTGS ++W QC PC C+NQ+ PIFNP+ SS+YK IPC +C R
Sbjct: 94 GTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDT 153
Query: 72 FRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRDFSFDGN 130
+ C ++I+Y + + G +S +T + V P ++ GC DN +F G
Sbjct: 154 SCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAG-TFGGA 212
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDANIQRKDMKTI 189
+GI+G P SL+ QL S+ G FSYCLV +E A+SIL FG A + + +
Sbjct: 213 SSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVST 272
Query: 190 RMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVV 249
+ Y+L+LQ SV + R+ F G + + G +ID+G T I P +V
Sbjct: 273 PLIKKDPVFYFLTLQAFSVGNKRVEF--GGSSEGGDDEGNIIIDSGTTLTLI---PSDVY 327
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQ 309
R+ + ++ + CY S + +T HF AD ++ + F+
Sbjct: 328 TNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITVHFKGADVELH-SISTFVPI 386
Query: 310 NEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+G C A S + S+ G QQ+ YDL T+ F P +C
Sbjct: 387 TDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCT 432
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 112/345 (32%), Positives = 163/345 (47%), Gaps = 15/345 (4%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFR 73
GTP + + + DTGS ++W QC PC C+ Q+ PIFNP+ SS+YK IPC +C+ +
Sbjct: 94 GTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRYT 153
Query: 74 CENGQ--CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
N Q C + IN++ + + G +S ET T V P + GC ++NR F G
Sbjct: 154 SCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGM-FQGE 212
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIR 190
+GI+G + P SL QLKS+ G FSYCL+ + TS L FG DA + D
Sbjct: 213 TSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFG-DAAVVSGDGVVST 271
Query: 191 MFV--DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEV 248
FV D + YYL+L+ SV + RI F L + G ++D+G T + P V
Sbjct: 272 PFVKKDPQAFYYLTLEAFSVGNKRIEFE----VLDDSEEGNIILDSGTTLTLL---PSHV 324
Query: 249 VMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFIF 308
R+ + ++ CY S + +T HF AD K+ P F
Sbjct: 325 YTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPIITAHFKGADIKLNPIST-FAH 383
Query: 309 QNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+G C+A + S + G Q + YDL + F P +C
Sbjct: 384 VADGVVCLAFTSSQTGPIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 174/357 (48%), Gaps = 17/357 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP++ + DTGS LIWTQC PC+ C +Q P F+P SSTY+ + C
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPA 151
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + C CV++ Y AS +G+++ ETFTF + V +P + FGC N N
Sbjct: 152 CNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNAGS 211
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
+G +G++GF SL+ QL S FSYCL + S L FG A + +
Sbjct: 212 LANG--SGMVGFGRGSLSLVSQLGSPR---FSYCLTSFLSPVR--SRLYFGAYATLNSTN 264
Query: 186 MKTIR-----MFVDRSSHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAIAT 239
T++ + + Y+L++ ISV +R+ P A+ +GTGG +ID+G T
Sbjct: 265 ASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTIT 324
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMTFHFDRAD 296
++ Y V F + S + + C+++ R + HFD AD
Sbjct: 325 YLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGAD 384
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+++ + + G C+A++ S S++G++Q Q+ +YDL + FVP C
Sbjct: 385 WELPLQNYMLVDPSTGGLCLAMATSSDGSIIGSYQHQNFNVLYDLENSLLSFVPAPC 441
>gi|255563737|ref|XP_002522870.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537954|gb|EEF39568.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 341
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 98/250 (39%), Positives = 142/250 (56%), Gaps = 10/250 (4%)
Query: 116 FGCSNDNRDFSF---DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
FGCS DNR+FS G GI+G ++SP S+L QL++ FSYCL ATS+
Sbjct: 91 FGCSKDNRNFSAFSRTGKTDGIMGLNMSPVSILQQLRNVTNQRFSYCLTPYGSRPPATSL 150
Query: 173 LRFGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
LRFG D + + + FVD +Y+L+L D+SVA R+ P TFAL+R+GTGG
Sbjct: 151 LRFGNDISTWGRGFYST-PFVDPPDMPNYFLNLLDLSVAGQRLRLPPETFALKRDGTGGT 209
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASM 288
+ID+G T + + Y ++ HF G R+H + E Y + + F+ +AS+
Sbjct: 210 IIDSGTGLTLVVQPAYRHLLGALQNHFDHHGFHRVHIPDTNLELRYNFAQNRTFQNHASL 269
Query: 289 TFHFDRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
T+HF ADF VEP Y Y ++ +E FCVA+ S + +++GA Q +TRFVY+ +
Sbjct: 270 TYHFQGADFTVEPRYAYVVYNDENAFCVALLASHIEGRAIIGALHQANTRFVYNAAKRRL 329
Query: 347 QFVPENCAND 356
+F EN ND
Sbjct: 330 KFKAENFQND 339
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 177/362 (48%), Gaps = 27/362 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD+ GTP + DTGS LIWTQC PC+ C Q P F+ S+TY+ +PC
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSR 148
Query: 67 C---RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDN 122
C P C CV++ Y AS +G+++ ETFTF + V + FGC + N
Sbjct: 149 CAALSSP--SCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSLN 206
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
N +G++GF P SL+ QL + FSYCL S L FG AN+
Sbjct: 207 AGEL--ANSSGMVGFGRGPLSLVSQLGPSR---FSYCLTSYLSPTP--SRLYFGVFANLN 259
Query: 183 RKD------MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ +++ ++ + + Y+LS++ IS+ R+ P FA+ +GTGG +ID+
Sbjct: 260 STNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDS 319
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM---TFH 291
G T++Q+ YE V R ++ M++ + C+++ ++ FH
Sbjct: 320 GTSITWLQQDAYEAVRRGLA---STIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFH 376
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
FD A+ + P I GY C+A++ + +++G +QQQ+ +YD+ + FVP
Sbjct: 377 FDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLHLLYDIANSFLSFVPA 436
Query: 352 NC 353
C
Sbjct: 437 PC 438
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 112/351 (31%), Positives = 165/351 (47%), Gaps = 12/351 (3%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N Y +D+ FG+P + ++ DTGS LIWTQCLPC C ++ IF+P SSTY + C
Sbjct: 77 NGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCA 136
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C PF+ C + Y G+S SG +STET T +P V FGC + N
Sbjct: 137 SNFCSSLPFQSCTTSCKYDYMYGDGSSTSGALSTETVTVGTGT----IPNVAFGCGHTNL 192
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
SF G AGI+G P SL+ Q S FSYCLV TS + G A
Sbjct: 193 G-SFAG-AAGIVGLGQGPLSLISQASSITSKKFSYCLV--PLGSTKTSPMLIGDSAAAGG 248
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + YY L ISV+ + + GTF++ +G GG ++D+G T+++
Sbjct: 249 VAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLET 308
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFKVEPT 302
G + ++ + +YC+ Y +MTFHF AD+++ P
Sbjct: 309 GAFNALVAALKAEVP---FPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPE 365
Query: 303 YMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ G C+A++ S S++G QQQ+ V+DL + F NC
Sbjct: 366 NVFVALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 165/346 (47%), Gaps = 12/346 (3%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR--RPP 71
GTP + + DTGS ++W QC PC C+NQ+ PIFNP+ SS+YK IPC +C R
Sbjct: 94 GTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDT 153
Query: 72 FRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRDFSFDGN 130
+ C ++I+Y + + G +S +T + V P + GC DN +F G
Sbjct: 154 SCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAG-TFGGA 212
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDANIQRKDMKTI 189
+GI+G P SL+ QL S+ G FSYCLV +E A+SIL FG A + + +
Sbjct: 213 SSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVST 272
Query: 190 RMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVV 249
+ Y+L+LQ SV + R+ F G + + G +ID+G T I P +V
Sbjct: 273 PLIKKDPVFYFLTLQAFSVGNKRVEF--GGSSEGGDDEGNIIIDSGTTLTLI---PSDVY 327
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQ 309
R+ + ++ + CY S + +T HF AD ++ + F+
Sbjct: 328 TNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITAHFKGADIELH-SISTFVPI 386
Query: 310 NEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+G C A S + S+ G QQ+ YDL T+ F P +C
Sbjct: 387 TDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCT 432
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 105/351 (29%), Positives = 167/351 (47%), Gaps = 6/351 (1%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + ++ DTGS L W QC PC C++Q+ +F PN S+++ ++ C +
Sbjct: 13 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSAL 72
Query: 67 CRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRD 124
C PF C CV+ +Y G+ +G +T T + + VP FGC +DN
Sbjct: 73 CNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEG 132
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
SF G GILG P S QLKS G FSYCLV TS L FG A
Sbjct: 133 -SFAG-ADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGDAAVPILP 190
Query: 185 DMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
D+K + + + ++YY+ L ISV D+ + + F + G G + D+G T +
Sbjct: 191 DVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTTVTQLA 250
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
Y+ V+ + ++ R+ + D + +MTFHF+ D + P+
Sbjct: 251 EAAYKEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTVPAMTFHFEGGDMVLPPS 310
Query: 303 YMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ ++ +C A++ S +++G+ QQQ+ + YD + FVP++C
Sbjct: 311 NYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYYDTAGRKLGFVPKDC 361
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 167/351 (47%), Gaps = 20/351 (5%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFR 73
GTP + + DTGS ++W QC PC C+NQ+ P+FNP+ SS+YK IPC +C+
Sbjct: 94 GTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDT 153
Query: 74 CENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDNRDFSFDGN 130
N + C + Y + + G +S +T T N L V P ++ GC +N S++G
Sbjct: 154 SCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNI-LSYEGA 212
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYR----EMEATSILRFGKDANIQRKDM 186
+GI+GF P S + QL S+ G FSYCL + + ATS L FG A + +
Sbjct: 213 SSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGV 272
Query: 187 KTIRMF-VDRSSHYYLSLQDISVADHR--IGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T + D + YYL+L+ SV + R IG P + G +ID+G T + +
Sbjct: 273 VTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVP-----NGDNEGNIIIDSGTTLTSLTK 327
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTY 303
Y + + +R+ + ++ CY + + +T HF AD + P
Sbjct: 328 DDYSFLESAVVDLVK---LERVDDPTQTLNLCYSVKAEGYDFPIITMHFKGADVDLHPIS 384
Query: 304 MYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
F+ +G FC+A S +++ G QQ+ YDL + F P +C
Sbjct: 385 T-FVSVADGVFCLAFESSQDHAIFGNLAQQNLMVGYDLQQKIVSFKPSDCT 434
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 172/360 (47%), Gaps = 18/360 (5%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
++ + V+ G P + + DTGS L+W QC PC +CF QS PIF+P+ SSTY +
Sbjct: 86 DRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLS 145
Query: 62 CDDLICRRPPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGC 118
D IC P + N QC++ +YA G+++SG ++TE F ++ V V V+FGC
Sbjct: 146 YDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGC 205
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ NR FDG +GILG S S++ +L S FSYC+ + + L G
Sbjct: 206 GHSNRG-RFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLGDG 260
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
++ F + YY++L+ ISV + R+ P F +G GG ++D+G A
Sbjct: 261 VKMEGSSTP----FHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 316
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHFDRAD 296
TF+ + ++ + +Q ++ W CY R + R + + FHF
Sbjct: 317 TFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGA 375
Query: 297 FKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V F+ +N+ FC+A+ + + SV+G QQ YDL + F +C
Sbjct: 376 DLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 172/360 (47%), Gaps = 18/360 (5%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
++ + V+ G P + + DTGS L+W QC PC +CF QS PIF+P+ SSTY +
Sbjct: 54 DRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLS 113
Query: 62 CDDLICRRPPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGC 118
D IC P + N QC++ +YA G+++SG ++TE F ++ V V V+FGC
Sbjct: 114 YDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGC 173
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ NR FDG +GILG S S++ +L S FSYC+ + + L G
Sbjct: 174 GHSNRG-RFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLGDG 228
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
++ F + YY++L+ ISV + R+ P F +G GG ++D+G A
Sbjct: 229 VKMEGSSTP----FHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 284
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHFDRAD 296
TF+ + ++ + +Q ++ W CY R + R + + FHF
Sbjct: 285 TFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGA 343
Query: 297 FKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V F+ +N+ FC+A+ + + SV+G QQ YDL + F +C
Sbjct: 344 DLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 124/383 (32%), Positives = 184/383 (48%), Gaps = 49/383 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--------CFNQSAPIFNPNASSTYK 58
Y + + GTP S + DTGS LIWTQC PC + CF QS ++NP++S+T+
Sbjct: 87 YIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFG 146
Query: 59 RIPCDD-------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN--KLV 109
+PC+ + PP C C++ Y G +A G+ S ETFTF + V
Sbjct: 147 VLPCNSPLSMCAAMAGPSPPPGCA---CMYNQTYGTGWTA-GVQSVETFTFGSSSTPPAV 202
Query: 110 CVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA 169
VP + FGCSN + + ++G+ AG++G SL+ QL + G FSYCL +++ +
Sbjct: 203 RVPNIAFGCSNASSN-DWNGS-AGLVGLGRGSMSLVSQLGA---GAFSYCLT-PFQDANS 256
Query: 170 TSILRFGKDANIQRKDMKTIRM--FV------DRSSHYYLSLQDISVADHRIGFAPGTFA 221
TS L G A K +R FV S++YYL+L ISV + + P F+
Sbjct: 257 TSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFS 316
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVV---MRHFDEHFTSFGRQRMHNASEDWEYCYRY 278
LR +GTGG +ID+G T + Y+ V +R H+ D + +
Sbjct: 317 LRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKA 376
Query: 279 DSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGA------WQQ 332
+ A SMT HF+ V P Y I G +C+A+ RN VGA +QQ
Sbjct: 377 STPPPAMPSMTLHFEGGADMVLPVENYMIL-GSGVWCLAM----RNQTVGAMSMVGNYQQ 431
Query: 333 QDTRFVYDLNTGTIQFVPENCAN 355
Q+ +YD+ T+ F P C++
Sbjct: 432 QNIHVLYDVRKETLSFAPAVCSS 454
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 172/360 (47%), Gaps = 18/360 (5%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
++ + V+ G P + + DTGS L+W QC PC +CF QS PIF+P+ SSTY +
Sbjct: 54 DRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLS 113
Query: 62 CDDLICRRPPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGC 118
D IC P + N QC++ +YA G+++SG ++TE F ++ V V V+FGC
Sbjct: 114 YDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGC 173
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ NR FDG +GILG S S++ +L S FSYC+ + + L G
Sbjct: 174 GHSNRG-RFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLGDG 228
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
++ F + YY++L+ ISV + R+ P F +G GG ++D+G A
Sbjct: 229 VKMEGSSTP----FHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 284
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHFDRAD 296
TF+ + ++ + +Q ++ W CY R + R + + FHF
Sbjct: 285 TFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGA 343
Query: 297 FKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V F+ +N+ FC+A+ + + SV+G QQ YDL + F +C
Sbjct: 344 DLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 174/367 (47%), Gaps = 31/367 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP L DTGS L WTQC PC CF Q PI++ SS++ +PC
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASAT 152
Query: 67 CRRPPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C P + N C +R Y GA ++G++ TET TF V V G+ FGC D
Sbjct: 153 C-LPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFP-GAPGVSVGGIAFGCGVD 210
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N S+ N G +G SL+ QL G FSYCL + + +L FG A +
Sbjct: 211 NGGLSY--NSTGTVGLGRGSLSLVAQL---GVGKFSYCLTDFFNTSLGSPVL-FGALAEL 264
Query: 182 QRKDMKTIRMFVDR------SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
+ YY+SL+ IS+ D R+ GTF LR +G+GG ++D+G
Sbjct: 265 AAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSG 324
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY---DSRFRAYASMTFHF 292
TF+ + VV+ +H RQ + NAS C+ + + A M HF
Sbjct: 325 TTFTFLVESAFRVVV----DHVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHF 380
Query: 293 -DRADFKV-EPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQF 348
AD ++ YM F Q E FC+ I+ S S++G +QQQ+ + ++D+ G + F
Sbjct: 381 AGGADMRLHRDNYMSF-NQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQLSF 439
Query: 349 VPENCAN 355
+P +C
Sbjct: 440 MPTDCGK 446
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 179/374 (47%), Gaps = 43/374 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y + + GTP S + DTGS LIWTQC PC CF Q AP++NP +S+T+ +PC+
Sbjct: 92 YLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNS 151
Query: 65 --------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVI 115
L + PP C C++ Y G +A G+ +ETFTF VPG+
Sbjct: 152 SLSMCAGVLAGKAPPPGCA---CMYNQTYGTGWTA-GVQGSETFTFGSAAADQARVPGIA 207
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGCSN + ++G+ AG++G SL+ QL + G FSYCL +++ +TS L
Sbjct: 208 FGCSNASSS-DWNGS-AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTNSTSTLLL 261
Query: 176 GKDANIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
G A + +++ S++YYL+L IS+ + +P F+L+ +GTGG
Sbjct: 262 GPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGL 321
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD---SRFRAYAS 287
+ID+G T + Y+ V T + S + CY S A S
Sbjct: 322 IIDSGTTITSLVNAAYQQVRAAVQSLVTLPAID--GSDSTGLDLCYALPTPTSAPPAMPS 379
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGA------WQQQDTRFVYDL 341
MT HFD AD V P Y I G +C+A+ RN GA +QQQ+ +YD+
Sbjct: 380 MTLHFDGADM-VLPADSYMI-SGSGVWCLAM----RNQTDGAMSTFGNYQQQNMHILYDV 433
Query: 342 NTGTIQFVPENCAN 355
+ F P C+
Sbjct: 434 RNEMLSFAPAKCST 447
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 120/373 (32%), Positives = 166/373 (44%), Gaps = 36/373 (9%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKR 59
H Y VD+ GTP + DTGS LIWTQC PC CF Q AP++ P S+TY
Sbjct: 86 HASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYAN 145
Query: 60 IPCDDLICR---RPPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
+ C +C+ P RC + C + +Y G S G+++TETFT V GV
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAVRGV 202
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC +N N +G++G P SL+ QL T FSYC + A S L
Sbjct: 203 AFGCGTEN--LGSTDNSSGLVGMGRGPLSLVSQLGVTR---FSYC--FTPFNATAASPLF 255
Query: 175 FGKDANIQRKDMKTIRMFVD--------RSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G A + T FV RSS+YYLSL+ I+V D + P F L G
Sbjct: 256 LGSSARLSSAAKTT--PFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMG 313
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE---DWEYCYRYDS-RF 282
GG +ID+G T ++ + + R R R+ AS C+ S
Sbjct: 314 DGGVIIDSGTTFTALEESAFVALARALAS------RVRLPLASGAHLGLSLCFAAASPEA 367
Query: 283 RAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
+ HFD AD ++ ++ G C+ + + SV+G+ QQQ+T +YDL
Sbjct: 368 VEVPRLVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLE 427
Query: 343 TGTIQFVPENCAN 355
G + F P C
Sbjct: 428 RGILSFEPAKCGE 440
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/373 (32%), Positives = 166/373 (44%), Gaps = 36/373 (9%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKR 59
H Y VD+ GTP + DTGS LIWTQC PC CF Q AP++ P S+TY
Sbjct: 86 HASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYAN 145
Query: 60 IPCDDLICR---RPPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
+ C +C+ P RC + C + +Y G S G+++TETFT V GV
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAVRGV 202
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC +N N +G++G P SL+ QL T FSYC + A S L
Sbjct: 203 AFGCGTEN--LGSTDNSSGLVGMGRGPLSLVSQLGVTR---FSYC--FTPFNATAASPLF 255
Query: 175 FGKDANIQRKDMKTIRMFVD--------RSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G A + T FV RSS+YYLSL+ I+V D + P F L G
Sbjct: 256 LGSSARLSSAAKTT--PFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMG 313
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE---DWEYCYRYDS-RF 282
GG +ID+G T ++ + + R R R+ AS C+ S
Sbjct: 314 DGGVIIDSGTTFTALEERAFVALARALAS------RVRLPLASGAHLGLSLCFAAASPEA 367
Query: 283 RAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
+ HFD AD ++ ++ G C+ + + SV+G+ QQQ+T +YDL
Sbjct: 368 VEVPRLVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLE 427
Query: 343 TGTIQFVPENCAN 355
G + F P C
Sbjct: 428 RGILSFEPAKCGE 440
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 175/371 (47%), Gaps = 34/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L+ DTGS L WTQC PCV+CF QS P FNP+ S T+ +PCD I
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRI 170
Query: 67 CRRPPFRC------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV---CVPGVIFG 117
CR + NG CV+ YA + +G + ++TF+F + + VP + FG
Sbjct: 171 CRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFG 230
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C N F N GI GFS S+ QLK FSYC A E + + G
Sbjct: 231 CGLFNNGI-FVSNETGIAGFSRGALSMPAQLKVDN---FSYCFT-AITGSEPSPVF-LGV 284
Query: 178 DANIQR----------KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
N+ + IR + YY+SL+ ++V R+ FAL+ +GT
Sbjct: 285 PPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGT 344
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYRYDSRFRA-Y 285
GG ++D+G T + Y +V + F + + +HN++ + C+ +
Sbjct: 345 GGTIVDSGTGMTMLPEAVYNLVC----DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDV 400
Query: 286 ASMTFHFDRADFKV-EPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
++ HF+ A + YM+ I + G C+AI+ + SV+G +QQQ+ +YDL
Sbjct: 401 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDLA 460
Query: 343 TGTIQFVPENC 353
+ FVP C
Sbjct: 461 NDMLSFVPARC 471
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 175/371 (47%), Gaps = 34/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L+ DTGS L WTQC PCV+CF QS P FNP+ S T+ +PCD I
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRI 170
Query: 67 CRRPPFRC------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV---CVPGVIFG 117
CR + NG CV+ YA + +G + ++TF+F + + VP + FG
Sbjct: 171 CRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFG 230
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C N F N GI GFS S+ QLK FSYC A E + + G
Sbjct: 231 CGLFNNGI-FVSNETGIAGFSRGALSMPAQLKVDN---FSYCFT-AITGSEPSPVF-LGV 284
Query: 178 DANIQR----------KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
N+ + IR + YY+SL+ ++V R+ FAL+ +GT
Sbjct: 285 PPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGT 344
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYRYDSRFRA-Y 285
GG ++D+G T + Y +V + F + + +HN++ + C+ +
Sbjct: 345 GGTIVDSGTGMTMLPEAVYNLVC----DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDV 400
Query: 286 ASMTFHFDRADFKV-EPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
++ HF+ A + YM+ I + G C+AI+ + SV+G +QQQ+ +YDL
Sbjct: 401 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDLA 460
Query: 343 TGTIQFVPENC 353
+ FVP C
Sbjct: 461 NDMLSFVPARC 471
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 175/371 (47%), Gaps = 34/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L+ DTGS L WTQC PCV+CF QS P FNP+ S T+ +PCD I
Sbjct: 85 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRI 144
Query: 67 CRRPPFRC------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV---CVPGVIFG 117
CR + NG CV+ YA + +G + ++TF+F + + VP + FG
Sbjct: 145 CRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFG 204
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C N F N GI GFS S+ QLK FSYC A E + + G
Sbjct: 205 CGLFNNGI-FVSNETGIAGFSRGALSMPAQLKVDN---FSYCFT-AITGSEPSPVF-LGV 258
Query: 178 DANIQR----------KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
N+ + IR + YY+SL+ ++V R+ FAL+ +GT
Sbjct: 259 PPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGT 318
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYRYDSRFRA-Y 285
GG ++D+G T + Y +V + F + + +HN++ + C+ +
Sbjct: 319 GGTIVDSGTGMTMLPEAVYNLVC----DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDV 374
Query: 286 ASMTFHFDRADFKV-EPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
++ HF+ A + YM+ I + G C+AI+ + SV+G +QQQ+ +YDL
Sbjct: 375 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDLA 434
Query: 343 TGTIQFVPENC 353
+ FVP C
Sbjct: 435 NDMLSFVPARC 445
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 173/347 (49%), Gaps = 31/347 (8%)
Query: 24 FDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC---RRPPFRCENGQCV 80
DTGS LIWTQC PC+ C +Q P F+ S+TY+ +PC C P C CV
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSP--SCFKKMCV 58
Query: 81 HRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDNR-DFSFDGNIAGILGFS 138
++ Y AS +G+++ ETFTF N V + FGC + N D + N +G++GF
Sbjct: 59 YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLA---NSSGMVGFG 115
Query: 139 VSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT-SILRFGKDANIQRKD------MKTIRM 191
P SL+ QL + FSYCL + AT S L FG AN+ + +++
Sbjct: 116 RGPLSLVSQLGPSR---FSYCLT---SYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPF 169
Query: 192 FVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVV 249
++ + + Y+LSL+ IS+ + P FA+ +GTGG +ID+G T++Q+ YE V
Sbjct: 170 VINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV 229
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS---MTFHFDRADFKVEPTYMYF 306
R ++ M++ + C+++ + + FHFD A+ + P
Sbjct: 230 RRGL---VSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYML 286
Query: 307 IFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
I GY C+ ++ + +++G +QQQ+ +YD+ + FVP C
Sbjct: 287 IASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 176/371 (47%), Gaps = 34/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDD- 64
Y + + GTP S + DTGS LIWTQC PC + CF Q P++NP++S+T+ +PC+
Sbjct: 86 YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSS 145
Query: 65 -------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL--VCVPGVI 115
L PP C C++ + Y G + S +ETFTF VPG+
Sbjct: 146 LSMCAAALAGTTPPPGCT---CMYNMTYGSGWT-SVYQGSETFTFGSSTPANQTGVPGIA 201
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGCSN + F+ + +G++G SL+ QL FSYCL Y++ +TS L
Sbjct: 202 FGCSNASGGFNTS-SASGLVGLGRGSLSLVSQLGVPK---FSYCLT-PYQDTNSTSTLLL 256
Query: 176 GKDANIQRKDMKTIRMFV------DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
G A++ + FV S++YYL+L IS+ + +L+ +GTGG
Sbjct: 257 GPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGG 316
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA---YA 286
+ID+G T + Y+ V R + +A+ + C+ S A
Sbjct: 317 FIIDSGTTITLLGNTAYQQV-RAAVVSLVTLPTTDGGSAATGLDLCFELPSSTSAPPTMP 375
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTG 344
SMT HFD AD V P Y + + +C+A+ S++G +QQQ+ +YD+
Sbjct: 376 SMTLHFDGADM-VLPADSYMML-DSNLWCLAMQNQTDGGVSILGNYQQQNMHILYDVGQE 433
Query: 345 TIQFVPENCAN 355
T+ F P C+
Sbjct: 434 TLTFAPAKCST 444
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 118/371 (31%), Positives = 177/371 (47%), Gaps = 35/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP + DTGS LIWTQC PC CF Q AP++NP +S+T+ +PC+
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 173
Query: 66 I--CRRPPFRCENG---QCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCS 119
+ C C++ Y G +A G+ +ETFTF VPGV FGCS
Sbjct: 174 LSMCAGALAGAAPPPGCACMYYQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAFGCS 232
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N + ++G+ AG++G SL+ QL + G FSYCL +++ +TS L G A
Sbjct: 233 NASSS-DWNGS-AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTNSTSTLLLGPSA 286
Query: 180 NIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ +++ S++YYL+L IS+ + +PG F+L+ +GTGG +ID+
Sbjct: 287 ALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDS 346
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----YASMTF 290
G T + Y+ V + + S + C+ + A SMT
Sbjct: 347 GTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTL 406
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGA------WQQQDTRFVYDLNTG 344
HFD AD V P Y I G +C+A+ RN GA +QQQ+ +YD+
Sbjct: 407 HFDGADM-VLPADSYMI-SGSGVWCLAM----RNQTDGAMSTFGNYQQQNMHILYDVREE 460
Query: 345 TIQFVPENCAN 355
T+ F P C+
Sbjct: 461 TLSFAPAKCST 471
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 178/371 (47%), Gaps = 36/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP + DTGS LIWTQC PC CF Q AP++NP +S+T+ +PC+
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 171
Query: 66 I--CRRPPFRCENG---QCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCS 119
+ C C++ Y G +A G+ +ETFTF VPGV FGCS
Sbjct: 172 LSMCAGALAGAAPPPGCACMYNQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAFGCS 230
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N + ++G+ AG++G SL+ QL + G FSYCL +++ +TS L G A
Sbjct: 231 NASSS-DWNGS-AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTNSTSTLLLGPSA 284
Query: 180 NIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ +++ S++YYL+L IS+ + +PG F+L+ +GTGG +ID+
Sbjct: 285 ALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDS 344
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----YASMTF 290
G T + Y+ V T+ + S + C+ + A SMT
Sbjct: 345 GTTITSLANAAYQQVRAAVKSLVTTLPTVDGSD-STGLDLCFALPAPTSAPPAVLPSMTL 403
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGA------WQQQDTRFVYDLNTG 344
HFD AD V P Y I G +C+A+ RN GA +QQQ+ +YD+
Sbjct: 404 HFDGADM-VLPADSYMI-SGSGVWCLAM----RNQTDGAMSTFGNYQQQNMHILYDVREE 457
Query: 345 TIQFVPENCAN 355
T+ F P C+
Sbjct: 458 TLSFAPAKCST 468
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 177/376 (47%), Gaps = 36/376 (9%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +D+ GTP+ + DTGS L+WTQC PCV CFNQ+ P+F+P ASSTY +
Sbjct: 110 HAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAAL 169
Query: 61 PCDDLICRRPPFRCENGQCV---------HRINYAGGASASGLVSTETFTFHLKNKLVCV 111
PC +C P + Y +S G+++TETFT + V
Sbjct: 170 PCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQK----V 225
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
PGV FGC + N F AG++G P SL+ QL FSYCL +
Sbjct: 226 PGVAFGCGDTNEGDGFTQG-AGLVGLGRGPLSLVSQLGIDR---FSYCLTSLDDAAGRSP 281
Query: 172 IL---RFGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNG 226
+L G A+ +T + + S S YY+SL ++V R+ FA++ +G
Sbjct: 282 LLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDG 341
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-DWEYCYR-----YDS 280
TGG ++D+G T+++ Y + + F H + +ASE + C++ D
Sbjct: 342 TGGVIVDSGTSITYLELRAYRALRKAFVAHMS----LPTVDASEIGLDLCFQGPAGAVDQ 397
Query: 281 RFRA-YASMTFHFD-RADFKVEPTYMYFIFQN-EGYFCVAISFSDRNSVVGAWQQQDTRF 337
+ + HFD AD + P Y + + G C+ + S S++G +QQQ+ +F
Sbjct: 398 DVQVQVPKLVLHFDGGADLDL-PAENYMVLDSASGALCLTVMASRGLSIIGNFQQQNFQF 456
Query: 338 VYDLNTGTIQFVPENC 353
VYD+ T+ F P C
Sbjct: 457 VYDVAGDTLSFAPAEC 472
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 164/369 (44%), Gaps = 23/369 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V G P ++ DTGS LIW QCLPC C+ Q P+++P S T++RIPC
Sbjct: 91 EYFAVIGV--GDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCA 148
Query: 64 DLICRR----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
CR P G CV+ + Y G+++SG ++T+T + V V GC
Sbjct: 149 SPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTR---VHNVTLGCG 205
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKD 178
+DN + AG+LG S QL +FSYCL R ++S L FG+
Sbjct: 206 HDNEGLL--ASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVFGRT 263
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALR-RNGTGGCMIDTGA 236
+ +R R S YY+ + SV R+ GF+ + AL G GG ++D+G
Sbjct: 264 PELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVDSGT 323
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA----SMTFHF 292
+ R Y V F H + G +R+ N ++ CY S+ HF
Sbjct: 324 AISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPSIVLHF 383
Query: 293 DRADFKVEPTYMYFIFQNEG----YFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQ 347
A P Y I G YFC+ + +D +V+G QQQ V+D+ G I
Sbjct: 384 AAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVERGRIG 443
Query: 348 FVPENCAND 356
F P C+ +
Sbjct: 444 FTPNGCSGE 452
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 102/353 (28%), Positives = 166/353 (47%), Gaps = 6/353 (1%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + ++ DTGS L W QC PC C++Q+ +F PN S+++ ++ C +
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62
Query: 67 CRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRD 124
C P+ C CV+ +Y G+ ++G +T T + + VP FGC +DN
Sbjct: 63 CNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEG 122
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
SF G GILG P S QLK+ G FSYCLV TS L FG A
Sbjct: 123 -SFAG-ADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFP 180
Query: 185 DMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+K I + + ++YY+ L ISV + + F + G G + D+G T +
Sbjct: 181 GVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLA 240
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
++ V+ + + R+ ++ D + + SMTFHF+ D ++ P+
Sbjct: 241 GEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGGDMELPPS 300
Query: 303 YMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+ ++ +C ++ S +++G+ QQQ+ + YD I FVP++C
Sbjct: 301 NYFIFLESSQSYCFSMVSSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSCVG 353
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 172/373 (46%), Gaps = 27/373 (7%)
Query: 7 YTVDVLFGTPSKSE--FLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GT E L D + W QC PC C Q P+F+P S T++ + +
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHN 160
Query: 65 LICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHL-KNKLVCVPGVIFGCSNDN 122
+ RPP+ ++G+C I Y GASA+G ++ +TF+F N +PG++FGC+N
Sbjct: 161 AVLCRPPYHPLQDGRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVFGCANRI 220
Query: 123 RDFSFDGNIAGILGFSVS----PFS-LLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
F G +AG+LG + P + + QL G FSYC + A S LRFG
Sbjct: 221 ARFDTHGALAGVLGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVP--GTTAYSFLRFGN 278
Query: 178 D------ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGC 230
D A + R+ M + S YY+ L ISV R+ G P F ++G GGC
Sbjct: 279 DIPSQPPAGVHRQSMAVLAP-TTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGC 337
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
ID G T I + Y V H R R + +R + SMT
Sbjct: 338 AIDIGTKMTAIVQTAYAHVEAAVRGHLQR-NRARFVQSPGHHLCVHRTPAIEERLPSMTL 396
Query: 291 HFDRADF-KVEPTYMYFIFQNEG----YFCVAISFSDRNSVVGAWQQQDTRFVYDL--NT 343
HF + +V+P +++ + + Y C+ + +V+GA QQ DTRF++DL N
Sbjct: 397 HFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPDAEMTVIGAMQQIDTRFIFDLHNNI 456
Query: 344 GTIQFVPENCAND 356
+ F PE+C D
Sbjct: 457 PIVSFNPEDCHLD 469
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 174/375 (46%), Gaps = 37/375 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP L DTGS L WTQC PC CF Q PI++ AS+++ +PC
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154
Query: 67 CRRPPFRCE-------NGQCVHRINYAGGASASGLVSTETFTFH-----LKNKLVCVPGV 114
C P +R C +R Y GA ++G++ TET TF V V GV
Sbjct: 155 C-LPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGV 213
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL- 173
FGC DN S+ N G +G SL+ QL G FSYCL + + +L
Sbjct: 214 AFGCGVDNGGLSY--NSTGTVGLGRGSLSLVAQL---GVGKFSYCLTDFFNTSLGSPVLF 268
Query: 174 ----RFGKDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
+ I +++ + S YY+SL+ IS+ D R+ GTF LR +G+
Sbjct: 269 GSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGS 328
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY---DSRFRA 284
GG ++D+G I T + + VV+ H Q + NAS C+ + +
Sbjct: 329 GGMIVDSGTIFTVLVESAFRVVVNHVAGVL----NQPVVNASSLDSPCFPATAGEQQLPD 384
Query: 285 YASMTFHF-DRADFKV-EPTYMYFIFQNEGYFCVAISFSDR--NSVVGAWQQQDTRFVYD 340
M HF AD ++ YM F Q FC+ I+ + S++G +QQQ+ + ++D
Sbjct: 385 MPDMLLHFAGGADMRLHRDNYMSF-NQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFD 443
Query: 341 LNTGTIQFVPENCAN 355
+ G + FVP +C+
Sbjct: 444 ITVGQLSFVPTDCSK 458
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 178/360 (49%), Gaps = 22/360 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP++ + DTGS LIWTQC PC+ C +Q P F+P S+TY+ + C
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPA 149
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + C CV++ Y AS +G+++ ETFTF V +PG+ FGC N N
Sbjct: 150 CNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGL 209
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG-----KDAN 180
+G+ G++GF SL+ QL S FSYCL + S L FG N
Sbjct: 210 LANGS--GMVGFGRGSLSLVSQLGSPR---FSYCLTSFLSPVP--SRLYFGVYATLNSTN 262
Query: 181 IQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAI 237
+ +++ V+ + + Y+L++ ISV + + P FA+ +GTGG +ID+G
Sbjct: 263 ASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTT 322
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMTFHFDR 294
T++ Y+ V F T + +AS + C+++ R + HFD
Sbjct: 323 ITYLAEPAYDAVRAAFASQIT-LPLLNVTDASV-LDTCFQWPPPPRQSVTLPQLVLHFDG 380
Query: 295 ADFKVE-PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
AD+++ YM G C+A++ S S++G++Q Q+ +YDL + FVP C
Sbjct: 381 ADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 173/369 (46%), Gaps = 28/369 (7%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H Y +++ G P L DTGS L WTQC PC CF Q P+++P+ASST+ +
Sbjct: 65 HSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPL 124
Query: 61 PCDDLICRRPPFRCEN----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
PC C P N C +R Y GA ++G++ TET T + V V GV F
Sbjct: 125 PCSSATCL--PIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAF 182
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC DN S N G +G SLL QL G FSYCL + + L G
Sbjct: 183 GCGTDNGGDSL--NSTGTVGLGRGTLSLLAQL---GVGKFSYCLTDFFNSALDSPFL-LG 236
Query: 177 KDANIQRKDMKTIRMFVDRS----SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
A + + +S S Y++SLQ IS+ D R+ GTF LR +GTGG ++
Sbjct: 237 TLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIV 296
Query: 233 DTGAIATFI-QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY-ASMTF 290
D+G T + + G EVV R Q NAS C+ + Y +
Sbjct: 297 DSGTTFTILAESGFREVVGR-----VARVLGQPPVNASSLDAPCFPAPAGEPPYMPDLVL 351
Query: 291 HF-DRADFKV-EPTYMYFIFQNEGYFCVAISFS--DRNSVVGAWQQQDTRFVYDLNTGTI 346
HF AD ++ YM + + + FC+ I+ + + SV+G +QQQ+ + ++D G +
Sbjct: 352 HFAGGADMRLYRDNYMSY-NEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQL 410
Query: 347 QFVPENCAN 355
F+P +C+
Sbjct: 411 SFLPTDCSK 419
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 178/367 (48%), Gaps = 30/367 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L+ DTGS L+WTQC PC CF+++ +P+ SST+ +PC +
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPV 474
Query: 67 CRRPPFRC------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKL--VCVPGVIFGC 118
C + N CV+ YA G+ +G + ETFTF + VP + FGC
Sbjct: 475 CDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGC 534
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
N F N GI GF SL QLK FS+C A E +S+L G
Sbjct: 535 GLFNNGI-FTSNETGIAGFGRGALSLPSQLKVDN---FSHCFT-AITGSEPSSVL-LGLP 588
Query: 179 ANI---QRKDMKTIRMFVDRSS--HYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
AN+ +++ + + SS YYLSL+ I+V R+ TFAL+++GTGG +ID
Sbjct: 589 ANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIID 648
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRA---YASM 288
+G T + + Y++V + FT+ R + NA+ C+ + RA +
Sbjct: 649 SGTGMTTLPQDAYKLV----HDAFTAQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKL 704
Query: 289 TFHFDRADFKVEPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
HF+ A + F F++ G C+AI+ D +++G +QQQ+ +YDL +
Sbjct: 705 VLHFEGATLDLPRENYMFEFEDAGGSVTCLAINAGDDLTIIGNYQQQNLHVLYDLVRNML 764
Query: 347 QFVPENC 353
FVP C
Sbjct: 765 SFVPAQC 771
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 178/367 (48%), Gaps = 31/367 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
+ + + GTP + DTGS LIWTQC PC CF Q P++NP++S+T+ +PC+
Sbjct: 85 FLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS 144
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL--VCVPGVIFGCSNDNR 123
+ P C C++ + Y G + TETFTF V VPG+ FGCSN +
Sbjct: 145 LGLCAP-ACA---CMYNMTYGSGWTYV-FQGTETFTFGSSTPADQVRVPGIAFGCSNASS 199
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
F+ + +G++G SL+ QL + FSYCL Y++ +TS L G A++
Sbjct: 200 GFNAS-SASGLVGLGRGSLSLVSQLGAPK---FSYCLT-PYQDTNSTSTLLLGPSASLND 254
Query: 184 KDMKTIRMFVDRSS--HYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + FV S +YYL+L IS+ + P F+L+ +GTGG +ID+G T +
Sbjct: 255 TGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITML 314
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA---YASMTFHFDRADFK 298
Y+ V T +A+ + C+ S A SMT HFD AD
Sbjct: 315 GNTAYQQVRAAVLSLVTLPTTD--GSAATGLDLCFELPSSTSAPPSMPSMTLHFDGADM- 371
Query: 299 VEPTYMYFI-----FQNEGYFCVAI-SFSDRNSVV----GAWQQQDTRFVYDLNTGTIQF 348
V P Y + + +C+A+ + +D + VV G +QQQ+ +YD+ T+ F
Sbjct: 372 VLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSF 431
Query: 349 VPENCAN 355
P C+
Sbjct: 432 APAKCST 438
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 178/360 (49%), Gaps = 22/360 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP++ + DTGS LIWTQC PC+ C +Q P F+P S+TY+ + C
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPA 149
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + C CV++ Y AS +G+++ ETFTF V +PG+ FGC N N
Sbjct: 150 CNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGS 209
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG-----KDAN 180
+G+ G++GF SL+ QL S FSYCL + S L FG N
Sbjct: 210 LANGS--GMVGFGRGSLSLVSQLGSPR---FSYCLTSFLSPVP--SRLYFGVYATLNSTN 262
Query: 181 IQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAI 237
+ +++ V+ + + Y+L++ ISV + + P FA+ +GTGG +ID+G
Sbjct: 263 ASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTT 322
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMTFHFDR 294
T++ Y+ V F T + +AS + C+++ R + HFD
Sbjct: 323 ITYLAEPAYDAVRAAFASQIT-LPLLNVTDASV-LDTCFQWPPPPRQSVTLPQLVLHFDG 380
Query: 295 ADFKVE-PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
AD+++ YM G C+A++ S S++G++Q Q+ +YDL + FVP C
Sbjct: 381 ADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 164/364 (45%), Gaps = 28/364 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS LIWTQC PCV+CF+Q P F+ + SST +PC+
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCESTQ 94
Query: 67 CRRPP-------FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C+ P C + +Y + GL++ + FTF L PGV FGC
Sbjct: 95 CKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAGTSL---PGVTFGCG 151
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD- 178
+N F+ N GI GF P SL QLK G FS+C + +T +L D
Sbjct: 152 LNNTGV-FNSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADL 207
Query: 179 -----ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
+Q + + YYLSL+ I+V R+ FAL NGTGG +ID
Sbjct: 208 FSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-TNGTGGTIID 266
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G T + Y+VV DE + + C+ S+ + + HF
Sbjct: 267 SGTSITSLPPQVYQVVR---DEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 323
Query: 293 DRADFKVE-PTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
+ A + Y++ + + G C+AI+ D +++G +QQQ+ +YDL + FV
Sbjct: 324 EGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFV 383
Query: 350 PENC 353
C
Sbjct: 384 AAQC 387
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 170/371 (45%), Gaps = 35/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP + DTGS LIWTQC PC + CF Q P++NP++S+T+ +PC+
Sbjct: 32 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91
Query: 66 IC----------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGV 114
+ PP C C + + Y G + S +ETFTF VPG+
Sbjct: 92 LSVCAAALAGTGTAPPPGCA---CTYNVTYGSGWT-SVFQGSETFTFGSTPAGHARVPGI 147
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGCS + F+ + +G++G SL+ QL FSYCL Y++ +TS L
Sbjct: 148 AFGCSTASSGFNAS-SASGLVGLGRGRLSLVSQLGVPK---FSYCLT-PYQDTNSTSTLL 202
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH------YYLSLQDISVADHRIGFAPGTFALRRNGTG 228
G A++ + FV S YYL+L IS+ + P F+L +GTG
Sbjct: 203 LGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTG 262
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AY 285
G +ID+G T + Y+ V T +A + C+ S A
Sbjct: 263 GLIIDSGTTITLLGNTAYQQVRAAVVSLVTL--PTTDGSADTGLDLCFMLPSSTSAPPAM 320
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSV--VGAWQQQDTRFVYDLNT 343
SMT HF+ AD V P Y + + G +C+A+ V +G +QQQ+ +YD+
Sbjct: 321 PSMTLHFNGADM-VLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQ 379
Query: 344 GTIQFVPENCA 354
T+ F P C+
Sbjct: 380 ETLSFAPAKCS 390
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 170/371 (45%), Gaps = 35/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP + DTGS LIWTQC PC + CF Q P++NP++S+T+ +PC+
Sbjct: 92 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151
Query: 66 IC----------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGV 114
+ PP C C + + Y G + S +ETFTF VPG+
Sbjct: 152 LSVCAAALAGTGTAPPPGCA---CTYNVTYGSGWT-SVFQGSETFTFGSTPAGHARVPGI 207
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGCS + F+ + +G++G SL+ QL FSYCL Y++ +TS L
Sbjct: 208 AFGCSTASSGFNAS-SASGLVGLGRGRLSLVSQLGVPK---FSYCLT-PYQDTNSTSTLL 262
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH------YYLSLQDISVADHRIGFAPGTFALRRNGTG 228
G A++ + FV S YYL+L IS+ + P F+L +GTG
Sbjct: 263 LGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTG 322
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AY 285
G +ID+G T + Y+ V T +A + C+ S A
Sbjct: 323 GLIIDSGTTITLLGNTAYQQVRAAVVSLVTL--PTTDGSADTGLDLCFMLPSSTSAPPAM 380
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSV--VGAWQQQDTRFVYDLNT 343
SMT HF+ AD V P Y + + G +C+A+ V +G +QQQ+ +YD+
Sbjct: 381 PSMTLHFNGADM-VLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQ 439
Query: 344 GTIQFVPENCA 354
T+ F P C+
Sbjct: 440 ETLSFAPAKCS 450
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/360 (29%), Positives = 171/360 (47%), Gaps = 21/360 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP F + DTGS LIW QC PC C Q+AP+F+P SST+K +PCD
Sbjct: 92 YLMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQP 151
Query: 67 CR-RPPFR----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C PP + ++GQC ++ Y SG++ E+ F KN + P + FGC+
Sbjct: 152 CTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFS 211
Query: 122 NRDFSFDGNI-AGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N D + G++G V P SL+ QL FSYC + +TS +RFG DA
Sbjct: 212 NNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYC--FPPLSSNSTSKMRFGNDAI 269
Query: 181 I-QRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
+ Q K + + + + S+YYL+L+ +S+ + ++ + G +ID+G
Sbjct: 270 VKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTS------ESQTDGNILIDSGTS 323
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
T +++ Y + E +G + + + +C+ + + + + F F A
Sbjct: 324 FTILKQSFYNKFVALVKE---VYGVEAVKIPPLVYNFCFENKGKRKRFPDVVFLFTGAKV 380
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
+V+ + ++ N VA+ SD +S+ G Q + YDL G + F P +CA D
Sbjct: 381 RVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADCAKD 440
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 170/371 (45%), Gaps = 35/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP + DTGS LIWTQC PC + CF Q P++NP++S+T+ +PC+
Sbjct: 90 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149
Query: 66 IC----------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGV 114
+ PP C C + + Y G + S +ETFTF VPG+
Sbjct: 150 LSVCAAALAGTGTAPPPGCA---CTYNVTYGSGWT-SVFQGSETFTFGSTPAGQSRVPGI 205
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGCS + F+ + +G++G SL+ QL FSYCL Y++ +TS L
Sbjct: 206 AFGCSTASSGFNAS-SASGLVGLGRGRLSLVSQLGVPK---FSYCLT-PYQDTNSTSTLL 260
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH------YYLSLQDISVADHRIGFAPGTFALRRNGTG 228
G A++ + FV S YYL+L IS+ + P F L +GTG
Sbjct: 261 LGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGTG 320
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AY 285
G +ID+G T + Y+ V T +A+ + C+ S A
Sbjct: 321 GLIIDSGTTITLLGNTAYQQVRAAVVSLVTL--PTTDGSAATGLDLCFMLPSSTSAPPAM 378
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSV--VGAWQQQDTRFVYDLNT 343
SMT HF+ AD V P Y + + G +C+A+ V +G +QQQ+ +YD+
Sbjct: 379 PSMTLHFNGADM-VLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQ 437
Query: 344 GTIQFVPENCA 354
T+ F P C+
Sbjct: 438 ETLSFAPAKCS 448
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 165/373 (44%), Gaps = 33/373 (8%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF ++V G P ++ DTGS LIW QC+PC +C+ Q P+++P +SST++RIPC
Sbjct: 87 EYFAVINV--GDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCA 144
Query: 64 DLICRR----PPFRCENGQCVHRINYAGGASASGLVSTETFTF----HLKNKLVCVPGVI 115
CR P G CV+ + Y G+++SG ++T+ F H+ N V
Sbjct: 145 SPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVHN-------VT 197
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILR 174
GC +DN + AG+LG S QL +FSYCL R +S L
Sbjct: 198 LGCGHDN--VGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLV 255
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALR-RNGTGGCMI 232
FG+ +R R S YY+ + SV R+ GF+ + AL G GG ++
Sbjct: 256 FGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVV 315
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQR-MHNASEDWEYCYRYDSRFRAYA----- 286
D+G + R Y V FD H + G R + ++ CY A
Sbjct: 316 DSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVP 375
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEG----YFCVAISFSDRN-SVVGAWQQQDTRFVYDL 341
S+ HF P Y I G YFC+ + +D +V+G QQQ V+D+
Sbjct: 376 SIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDV 435
Query: 342 NTGTIQFVPENCA 354
G I F P C+
Sbjct: 436 ERGRIGFTPNGCS 448
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 166/356 (46%), Gaps = 13/356 (3%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N Y + + G+P +S ++ DTGS L W QCLPC C+ Q P F+P+ S ++++ C
Sbjct: 36 NGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACT 95
Query: 64 DLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
D +C P C C ++ Y ++ +G ++ ET + + VP FGC
Sbjct: 96 DNLCNVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGCGT 155
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N +F G AG++G P SL QL T FSYCLV + + A S L FG A
Sbjct: 156 QNLG-TFAG-AAGLVGLGQGPLSLNSQLSHTFANKFSYCLV-SLNSLSA-SPLTFGSIAA 211
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIAT 239
+I + ++YY+ L I V + AP FA+ + G GG +ID+G T
Sbjct: 212 AANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTIT 271
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFK 298
+ Y V+R + E F ++ R+ ++ + C+ + M F F ADF+
Sbjct: 272 MLTLPAYSAVLRAY-ESFVNY--PRLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQ 328
Query: 299 VEPTYMYFIFQNEG-YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ ++ + C+A+ S S++G QQQ+ VYDL I F +C
Sbjct: 329 MRGENLFVLVDTSATTLCLAMGGSQGFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 167/359 (46%), Gaps = 26/359 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + + DTGS L W QC PC CF Q P+F P ASS+Y C D +
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSL 67
Query: 67 C---RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C RP N C + +Y G++ G + ET T + + + FGC + N+
Sbjct: 68 CDALPRPTCSMRN-TCTYSYSYGDGSNTRGDFAFETVTLNGST----LARIGFGCGH-NQ 121
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ +F G G++G P SL QL S+ +FSYCLV S + FG A R
Sbjct: 122 EGTFAG-ADGLIGLGQGPLSLPSQLNSSFTHIFSYCLV-DQSTTGTFSPITFGNAAENSR 179
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ D S+YY+ ++ ISV + R+ P F + NG GG ++D+G T+ +
Sbjct: 180 ASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRL 239
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEY----CYRYDSRFRA---YASMTFHFDRAD 296
+ ++ RQ + ++ Y CY S + SMT H D
Sbjct: 240 AAFIPILAELR-------RQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVD 292
Query: 297 FKVEPTYMYFIFQNEG-YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
F++ + ++ + N G C A+S SD+ S++G QQQ+ V D+ + F+ +C+
Sbjct: 293 FEIPVSNLWVLVDNFGETVCTAMSTSDQFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 120/376 (31%), Positives = 174/376 (46%), Gaps = 49/376 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP----- 61
Y VD+ GTP + L DTGS LIWTQC PC +C Q PIF+P ASS+Y+ +
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGEL 163
Query: 62 CDDLI---CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHL------KNKLVCVP 112
C+D++ C+RP C +R +Y G + G+ +TE FTF KL
Sbjct: 164 CNDILHHSCQRP------DTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPL 217
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
G FGC N+ +G+ GI+GF +P SL+ QL A FSYCL ++T +
Sbjct: 218 G--FGCGTMNKGSLNNGS--GIVGFGRAPLSLVSQL---AIRRFSYCLTPYASGRKSTLL 270
Query: 173 ---LRFGK-DANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
LR G DA ++T R+ R + YY+ ++V R+ FALR +G
Sbjct: 271 FGSLRGGVYDAAT--ATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDG 328
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDS-- 280
+GG ++D+G T V+R F F + G ++ D C+ +
Sbjct: 329 SGGAIVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANG-----SSGPDDGVCFAAAASR 383
Query: 281 --RFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRF 337
R M FH AD + Q +G C+ ++ S D + +G + QQD R
Sbjct: 384 VPRPAVVPRMVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRV 443
Query: 338 VYDLNTGTIQFVPENC 353
+YDL T+ F P C
Sbjct: 444 LYDLEADTLSFAPAQC 459
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 128/374 (34%), Positives = 184/374 (49%), Gaps = 38/374 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDD- 64
Y + + GTP +S + DTGS L+WTQC PC CF Q +P++NP++S T++ +PC
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 65 ---------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGV 114
L PP C C + Y G + SGL +ETFTF V VPG+
Sbjct: 152 LNLCAAEARLAGATPPPGCA---CRYNQTYGTGWT-SGLQGSETFTFGSSPADQVRVPGI 207
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGCSN + D ++G+ AG++G SL+ QL A G+FSYCL +++ ++ S L
Sbjct: 208 AFGCSNASSD-DWNGS-AGLVGLGRGGLSLVSQL---AAGMFSYCLT-PFQDTKSKSTLL 261
Query: 175 FGKDANIQRKDMKTIRM--FVDR------SSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G A + +R FV S++YYL+L ISV + PG FALR +G
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 321
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
TGG +ID+G T + Y+ V R NA+ + C+ S A
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLPVTDGSNAT-GLDLCFALPSSSAPPA 379
Query: 287 ---SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI-SFSDRN-SVVGAWQQQDTRFVYDL 341
SMT HF V P Y I + G +C+A+ S +D S +G +QQQ+ +YD+
Sbjct: 380 TLPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDV 438
Query: 342 NTGTIQFVPENCAN 355
T+ F P C+
Sbjct: 439 QKETLSFAPAKCST 452
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 161/356 (45%), Gaps = 17/356 (4%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
E + Y V V G+P ++L+ D+GS +IW QC PC+ C+ Q+ P+F+P S+T+ +P
Sbjct: 122 EGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVP 181
Query: 62 CDDLICR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C +CR R ++G C + ++Y G+ G ++ ET T V GV GC
Sbjct: 182 CGSAVCRTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTL----GGTAVEGVAIGCG 237
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ NR F G AG+LG P SL+GQL A G FSYCL A S++ +A
Sbjct: 238 HRNRGL-FVGA-AGLLGLGWGPMSLVGQLGGAAGGAFSYCLA----SRGAGSLVLGRSEA 291
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + S YY+ L I V D R+ F L +G GG ++DTG T
Sbjct: 292 VPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTG---T 348
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFK 298
+ R P E D + G + CY +++F+FD A
Sbjct: 349 AVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATL 408
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + + G +C+A + S S++G QQ+ + D G I F P C
Sbjct: 409 TLPARNLLLEVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/374 (34%), Positives = 184/374 (49%), Gaps = 38/374 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDD- 64
Y + + GTP +S + DTGS L+WTQC PC CF Q +P++NP++S T++ +PC
Sbjct: 97 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 156
Query: 65 ---------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGV 114
L PP C C + Y G + SGL +ETFTF V VPG+
Sbjct: 157 LNLCAAEARLAGATPPPGCA---CRYNQTYGTGWT-SGLQGSETFTFGSSPADQVRVPGI 212
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGCSN + D ++G+ AG++G SL+ QL A G+FSYCL +++ ++ S L
Sbjct: 213 AFGCSNASSD-DWNGS-AGLVGLGRGGLSLVSQL---AAGMFSYCLT-PFQDTKSKSTLL 266
Query: 175 FGKDANIQRKDMKTIRM--FVDR------SSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G A + +R FV S++YYL+L ISV + PG FALR +G
Sbjct: 267 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 326
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
TGG +ID+G T + Y+ V R NA+ + C+ S A
Sbjct: 327 TGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLPVTDGSNAT-GLDLCFALPSSSAPPA 384
Query: 287 ---SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI-SFSDRN-SVVGAWQQQDTRFVYDL 341
SMT HF V P Y I + G +C+A+ S +D S +G +QQQ+ +YD+
Sbjct: 385 TLPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDV 443
Query: 342 NTGTIQFVPENCAN 355
T+ F P C+
Sbjct: 444 QKETLSFAPAKCST 457
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/374 (34%), Positives = 184/374 (49%), Gaps = 38/374 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDD- 64
Y + + GTP +S + DTGS L+WTQC PC CF Q +P++NP++S T++ +PC
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 65 ---------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGV 114
L PP C C + Y G + SGL +ETFTF V VPG+
Sbjct: 152 LNLCAAEARLAGATPPPGCA---CRYNQTYGTGWT-SGLQGSETFTFGSSPADQVRVPGI 207
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGCSN + D ++G+ AG++G SL+ QL A G+FSYCL +++ ++ S L
Sbjct: 208 AFGCSNASSD-DWNGS-AGLVGLGRGGLSLVSQL---AAGMFSYCLT-PFQDTKSKSTLL 261
Query: 175 FGKDANIQRKDMKTIRM--FVDR------SSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G A + +R FV S++YYL+L ISV + PG FALR +G
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADG 321
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
TGG +ID+G T + Y+ V R NA+ + C+ S A
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLPVTDGSNAT-GLDLCFALPSSSAPPA 379
Query: 287 ---SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI-SFSDRN-SVVGAWQQQDTRFVYDL 341
SMT HF V P Y I + G +C+A+ S +D S +G +QQQ+ +YD+
Sbjct: 380 TLPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDV 438
Query: 342 NTGTIQFVPENCAN 355
T+ F P C+
Sbjct: 439 QKETLSFAPAKCST 452
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 172/367 (46%), Gaps = 32/367 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD+ GTP + L DTGS LIWTQC PC +C Q P+F P S++Y+ + C +
Sbjct: 102 YVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQL 161
Query: 67 CRRPPFR-CE-NGQCVHRINYAGGASASGLVSTETFTFHLK--NKLVCVPGVIFGCSNDN 122
C CE C +R NY G G+ +TE FTF ++L+ VP + FGC + N
Sbjct: 162 CSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP-LGFGCGSMN 220
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+G+ GI+GF +P SL+ QL FSYCL +Y +++L FG +
Sbjct: 221 VGSLNNGS--GIVGFGRNPLSLVSQLSIRR---FSYCLT-SYGSGRKSTLL-FGSLSGGV 273
Query: 183 RKD----MKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D ++T + + + YY+ L ++V R+ FALR +G+GG ++D+G
Sbjct: 274 YGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGT 333
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE-YCYRYDSRFRAYAS-------- 287
T + V+R F + R N + C+ + +R +S
Sbjct: 334 ALTLLPGAVLAEVVRAFRQQL----RLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPR 389
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYDLNTGTI 346
M FHF AD + +G C+ ++ S D S +G QQD R +YDL T+
Sbjct: 390 MVFHFQDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETL 449
Query: 347 QFVPENC 353
F P C
Sbjct: 450 SFAPAQC 456
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 172/358 (48%), Gaps = 18/358 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V++ GTP S + DTGS +IWTQC PC NC+ Q+AP+F+P+ S+TYK + C +
Sbjct: 83 YLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSPV 142
Query: 67 CRRP--PFRC-ENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDN 122
C C ++ +C++ I Y + + G ++ +T T + V P + GC +DN
Sbjct: 143 CSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDN 202
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDANI 181
+F+ N++GI+G P SL+ QL G FSYCL+ ++ L FG +AN+
Sbjct: 203 AG-TFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNANV 261
Query: 182 QRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ ++ + Y L L+ +SV D + F G A + G +ID+G T
Sbjct: 262 SGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEG--ASKLGGESNIIIDSGTTLT 319
Query: 240 FIQRGPYEVVMRHFDEHFT-SFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
++ ++ F + S + SE +YC+ + +T HF+ AD
Sbjct: 320 YLP----SALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEMPPVTMHFEGADVP 375
Query: 299 VEPTYMYFIFQNEGYFCVAI-SFSDRNSVV-GAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ + F+ ++ C+A SF D N + G Q + YD+ + F P +C
Sbjct: 376 LQRENL-FVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHCG 432
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 164/361 (45%), Gaps = 18/361 (4%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
E + Y V V G+P ++L+ D+GS +IW QC PC+ C+ Q+ P+F+P +S+T+ +
Sbjct: 120 EGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVS 179
Query: 62 CDDLICR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C ICR R ++G C + ++Y G+ G ++ ET T V GV GC
Sbjct: 180 CGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL----GGTAVEGVAIGCG 235
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV----YAYREMEATSILRF 175
+ NR F G AG+LG P SL+GQL A G FSYCL +A L
Sbjct: 236 HRNRGL-FVGA-AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVL 293
Query: 176 GKDANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
G+ + + + ++ S YY+ + I V D R+ G F L +G GG ++DT
Sbjct: 294 GRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDT 353
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFD 293
G T + R P E D + G + CY +++F+FD
Sbjct: 354 G---TAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFD 410
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
A P + + G +C+A + S S++G QQ+ + D G I F P
Sbjct: 411 GAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGFGPAT 470
Query: 353 C 353
C
Sbjct: 471 C 471
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 166/361 (45%), Gaps = 33/361 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V FGTP+K+ L+ DTGS + W QC PC +C++Q PIF P SS+YK + C
Sbjct: 138 YIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSCLSSA 197
Query: 67 CRRPPF--RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G CV+ INY G+ + G S ET T + P FGC + N
Sbjct: 198 CTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS----FPSFAFGCGHTNTG 253
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F G+ AG+LG + S Q KS G FSYCL + +T G+ +
Sbjct: 254 L-FKGS-AGLLGLGRTALSFPSQTKSKYGGQFSYCLP-DFVSSTSTGSFSVGQGSIPATA 310
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ + S Y++ L ISV R+ P G GG ++D+G + T
Sbjct: 311 TFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVL-----GRGGTIVDSGTVIT----- 360
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEY-----CYRYDSRFRA-YASMTFHF-DRADF 297
+V + +D TSF R + N + CY S + ++TFHF + AD
Sbjct: 361 --RLVPQAYDALKTSF-RSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADV 417
Query: 298 KVEPTYMYFIFQNEG-YFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V + F Q++G C+A + + ++ +++G +QQQ R +D G I F P +C
Sbjct: 418 AVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
Query: 354 A 354
A
Sbjct: 478 A 478
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/368 (30%), Positives = 172/368 (46%), Gaps = 24/368 (6%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H Y +++ GTP L DTGS L WTQC PC CF Q P+++P+ASST+ +
Sbjct: 60 HSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPV 119
Query: 61 PCDDLICRRPPFRCEN-----GQCVHRINYAGGASASGLVSTETFTF--HLKNKLVCVPG 113
PC C P +R N C + +Y+ GA + G++ TET T + + V V
Sbjct: 120 PCSSATC-LPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGS 178
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSI 172
V FGC DN S N G +G SLL QL G FSYCL + M++
Sbjct: 179 VAFGCGTDNGGDSL--NSTGTVGLGRGTLSLLAQL---GVGKFSYCLTDFFNSTMDSPFF 233
Query: 173 LRFGKDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L + +++ + S Y+++LQ IS+ D R+ GTF LR +G GG
Sbjct: 234 LGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGM 293
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
M+D+G T + + + V+ + Q NAS C+ +
Sbjct: 294 MVDSGTTFTILAKSGFREVVDRVAQLL----GQPPVNASSLDSPCFPSPDGEPFMPDLVL 349
Query: 291 HF-DRADFKV-EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQ 347
HF AD ++ YM + +++ FC+ I S S +G +QQQ+ + ++D+ G +
Sbjct: 350 HFAGGADMRLHRDNYMSY-NEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLS 408
Query: 348 FVPENCAN 355
F+P +C+
Sbjct: 409 FLPTDCSK 416
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 172/380 (45%), Gaps = 54/380 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP----- 61
Y +D+ GTP + L DTGS LIWTQC PC +C Q P+F P ASS+Y +P
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSY--VPMRCSG 160
Query: 62 --CDDLI---CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVI 115
C+D++ C+RP C +R NY G + G+ +TE FTF + + + VP +
Sbjct: 161 QLCNDILHHSCQRP------DTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVP-LG 213
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC N +G+ GI+GF P SL+ QL FSYCL ++T +
Sbjct: 214 FGCGTMNVGSLNNGS--GIVGFGRDPLSLVSQLSIRR---FSYCLTPYTSTRKSTLMFGS 268
Query: 176 GKDANIQRKD-----MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
D + D ++T R+ R + YY+ ++V R+ FALR +G+G
Sbjct: 269 LSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSG 328
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYD----- 279
G ++D+G T V+R F FTS ++S D C+
Sbjct: 329 GVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTS-------SSSPDDGVCFATPMAAGG 381
Query: 280 -----SRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQ 333
+ + M FHF AD ++ G C+ ++ S D + +G + QQ
Sbjct: 382 RRASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQ 441
Query: 334 DTRFVYDLNTGTIQFVPENC 353
D R +YDL T+ F P C
Sbjct: 442 DMRVLYDLEAETLSFAPAQC 461
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 175/364 (48%), Gaps = 32/364 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + + DTGS L WT C+PC NC+ Q P+F+P S+TY+ I CD +
Sbjct: 72 YLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKL 131
Query: 67 CRR-------PPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGC 118
C + P RC + YA A G+++ ET T K K V + G++FGC
Sbjct: 132 CHKLDTGVCSPQKRCN-----YTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGC 186
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL-FSYCLVYAYREMEATSILRFGK 177
++N F+ + GI+G P SL+ Q+ S+ G FS CLV + ++ +S + FGK
Sbjct: 187 GHNNTG-GFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGK 245
Query: 178 DANIQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
+ + K + + + + + Y+++L ISV + + F + + + G +D+G
Sbjct: 246 GSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEK---GNMFLDSGT 302
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHF 292
T + Y+ V+ M ++D + CYR + R +T HF
Sbjct: 303 PPTILPTQLYDQVVAQVRSEVA------MKPVTDDPDLGPQLCYRTKNNLRG-PVLTAHF 355
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
+ AD K+ PT FI +G FC+ + S V G + Q + +DL+ + F P+
Sbjct: 356 EGADVKLSPT-QTFISPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPK 414
Query: 352 NCAN 355
+C
Sbjct: 415 DCTK 418
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 109/369 (29%), Positives = 165/369 (44%), Gaps = 26/369 (7%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
E + Y V V G+P ++L+ D+GS ++W QC PC+ C+ Q+ P+F+P S+T+ +
Sbjct: 166 EGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVS 225
Query: 62 CDDLICR-RPPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C ICR P C +G+ C + ++YA G+ G ++ ET T V GV+ G
Sbjct: 226 CGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLG----GTAVEGVVIG 281
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY--AYREMEA---TSI 172
C + NR F G AG++G P SL+GQL G FSYCL Y A
Sbjct: 282 CGHRNRGL-FVG-AAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGW 339
Query: 173 LRFGKDANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
L G+ + + + R+ S YY+ L I V D R+ G F L +G G +
Sbjct: 340 LVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVV 399
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS---- 287
+DTG T + + Y + F + +S + CY YAS
Sbjct: 400 MDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCY----DLSGYASVRVP 455
Query: 288 -MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGT 345
++F FD + + + G +C+A + S S++G QQ + D G
Sbjct: 456 TVSFCFDGDARLILAARNVLLEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITVDSANGY 515
Query: 346 IQFVPENCA 354
I F P NC
Sbjct: 516 IGFGPANCG 524
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 169/358 (47%), Gaps = 21/358 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + DTGS +IWTQC+PC NC+ Q P+FNP+ S+TY+++ C +
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSPV 144
Query: 67 CRRPPFRCENGQ------CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCS 119
C F E+ C + I+Y + + G + +T T ++V P GC
Sbjct: 145 CS---FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCG 201
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+DN SFD N++GI+G + P SL+ Q+ S G FSYCL + ++ L FG +A
Sbjct: 202 HDNAG-SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNA 260
Query: 180 NIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
N+ + +++ S Y L L+ +SV + ++ L G +ID+G
Sbjct: 261 NVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSIL--GGKANIIIDSGTT 318
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
T + P ++ S QR + ++ EYC+ + + HF+ A+
Sbjct: 319 LTLL---PVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKVPFIAMHFEGANL 375
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+++ + I ++ C+A + + N S+ G Q + YD+ ++ F P NC
Sbjct: 376 RLQRENV-LIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 171/372 (45%), Gaps = 29/372 (7%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H Y +++ GTP L DTGS L WTQC PC CF Q P+++P+ASST+ +
Sbjct: 71 HSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPV 130
Query: 61 PCDDLICRRPPFRCEN-----GQCVHRINYAGGASASGLVSTETFTF--HLKNKLVCVPG 113
PC C P R N C + +Y+ GA ++G++ TET T + + V V
Sbjct: 131 PCSSATC-LPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSD 189
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
V FGC DN S N G +G SLL QL G FSYCL + + L
Sbjct: 190 VAFGCGTDNGGDSL--NSTGTVGLGRGTLSLLAQL---GVGKFSYCLTDFFNSTLDSPFL 244
Query: 174 RFGKDANIQRKDMKTIRMFVDRS----SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
G A + + +S S Y +SLQ I++ D R+ TF L N TGG
Sbjct: 245 -LGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGG 303
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM- 288
++D+G + + + VV+ H + Q NAS C+ + R M
Sbjct: 304 MVVDSGTTFSILPESGFRVVVDHVAQVL----GQPPVNASSLDSPCFPAPAGERQLPFMP 359
Query: 289 --TFHF-DRADFKV-EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNT 343
HF AD ++ YM + Q + FC+ I + S++G +QQQ+ + ++D+
Sbjct: 360 DLVLHFAGGADMRLHRDNYMSY-NQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTV 418
Query: 344 GTIQFVPENCAN 355
G + F+P +C+
Sbjct: 419 GQLSFLPTDCSK 430
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 160/359 (44%), Gaps = 27/359 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G+P ++L+ D+GS +IW QC PC C+ Q+ P+F+P ASS++ + C I
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 67 CRR-----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
CR + G+C + + Y G+ G ++ ET T V GV GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F G AG+LG SL+GQL A G+FSYCL A R L G+ +
Sbjct: 246 NSGL-FVG-AAGLLGLGWGAMSLIGQLGGAAGGVFSYCL--ASRGAGGAGSLVLGRTEAV 301
Query: 182 QRKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + ++ SS YY+ L I V R+ G F L +G GG ++DTG T
Sbjct: 302 PVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTAVTR 361
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS-----MTFHFDRA 295
+ R Y + FD + R A + CY YAS ++F+FD+
Sbjct: 362 LPREAYAALRGAFDGAMGALPRSP---AVSLLDTCY----DLSGYASVRVPTVSFYFDQG 414
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + FC+A + S S++G QQ+ + D G + F P C
Sbjct: 415 AVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 161/360 (44%), Gaps = 23/360 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS LIWTQC PC CF+Q+ P F+P+ SST CD +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 67 CRRPPFRC-------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C+ P N CV+ +Y + +G + + FTF VPGV FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF--VGAGASVPGVAFGCG 199
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N F N GI GF P SL QLK G FS+C +T +L D
Sbjct: 200 LFNNGV-FKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLPADL 255
Query: 180 -NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
R +++ + + + + YYLSL+ I+V R+ FAL +NGTGG +ID+G
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDSGT 314
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRA 295
T + Y +V F + + D +C R + Y + HF+ A
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVK---LPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGA 371
Query: 296 DFKVEPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ F ++ G C+AI + +G +QQQ+ +YDL + FVP C
Sbjct: 372 TMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 162/364 (44%), Gaps = 27/364 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS LIWTQC PC CF+Q+ P F+P+ SST CD +
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 94
Query: 67 CRRPPFRC-------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C+ P N CV+ +Y + +G + + FTF VPGV FGC
Sbjct: 95 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF--VGAGASVPGVAFGCG 152
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD- 178
N F N GI GF P SL QLK G FS+C + +T +L D
Sbjct: 153 LFNNGV-FKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADL 208
Query: 179 -----ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
+Q + + YYLSL+ I+V R+ FAL NGTGG +ID
Sbjct: 209 FSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-TNGTGGTIID 267
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G T + Y+VV DE + + C+ S+ + + HF
Sbjct: 268 SGTSITSLPPQVYQVVR---DEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 324
Query: 293 DRADFKVE-PTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
+ A + Y++ + + G C+AI+ D +++G +QQQ+ +YDL + FV
Sbjct: 325 EGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFV 384
Query: 350 PENC 353
C
Sbjct: 385 AAQC 388
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 167/362 (46%), Gaps = 24/362 (6%)
Query: 7 YTVDVLFGTPSKSEFLL-FDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP + L DTGS ++WTQC PC +CF Q P F+ +AS T + C D
Sbjct: 92 YLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDP 151
Query: 66 ICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLK-NKLVCVPGVIFGCSNDNR 123
ICR P C G C +++NY + G ++ ++FTF K V VP ++FGC N
Sbjct: 152 ICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNT 211
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+F N GI GF P SL QL ++ FSYC + E ++T + G A+ R
Sbjct: 212 G-NFHSNETGIAGFGRGPLSLPRQLGVSS---FSYCFTTIF-ESKSTPVFLGGAPADGLR 266
Query: 184 K----DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + +YYLSL+ I+V R+ F ++ +G+GG +ID+G T
Sbjct: 267 AHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAIT 326
Query: 240 FIQRGPYEVVMRHFDEHFTS---FGRQRMHNASEDWEYCYRYDSRFRA----YASMTFHF 292
R V R E F + ++ E C+ +S A MT H
Sbjct: 327 AFPR----AVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLHL 382
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAI-SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
+ AD+++ + + CV + + D +++G +QQQ+ V+DL + P
Sbjct: 383 EGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPA 442
Query: 352 NC 353
C
Sbjct: 443 QC 444
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 156/358 (43%), Gaps = 34/358 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G+P ++L+ D+GS +IW QC PC C+ Q+ P+F+P ASS++ + C I
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 67 CRR-----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
CR + G+C + + Y G+ G ++ ET T V GV GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F G AG+LG SL+GQL A G+FSYCL A R L G+ +
Sbjct: 246 NSGL-FVG-AAGLLGLGWGAMSLVGQLGGAAGGVFSYCL--ASRGAGGAGSLVLGRTEAV 301
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
R SS YY+ L I V R+ F L +G GG ++DTG T +
Sbjct: 302 PRGRRA--------SSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRL 353
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS-----MTFHFDRAD 296
R Y + FD + R A + CY YAS ++F+FD+
Sbjct: 354 PREAYAALRGAFDGAMGALPRS---PAVSLLDTCY----DLSGYASVRVPTVSFYFDQGA 406
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + FC+A + S S++G QQ+ + D G + F P C
Sbjct: 407 VLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/354 (30%), Positives = 162/354 (45%), Gaps = 19/354 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTPS F + DTGS +IW QC PC C+ Q+ PIF+ + S TYK +PC
Sbjct: 89 YLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPSNT 148
Query: 67 CR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNR 123
C+ + F C++ I+Y G+ + G +S ET T N V PG + GC N
Sbjct: 149 CQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCGRYNA 208
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ +GI+G P SL+ QL + G FSYCLV A+S L FG A +
Sbjct: 209 -IGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLS--TASSKLNFGNAAVVSG 265
Query: 184 KDMKTIRMFVDRS-SHYYLSLQDISVADHRIGF-APGTFALRRNGTGGCMIDTGAIATFI 241
+ + +F Y+L+L+ SV +RI F +PG+ G G +ID+G T +
Sbjct: 266 RGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGS-----GGKGNIIIDSGTTLTAL 320
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY--DSRFRAYASMTFHFDRADFKV 299
G Y + + QR+ + ++ CY+ D + +T HF AD +
Sbjct: 321 PNGVYSKLEAAVAKTVI---LQRVRDPNQVLGLCYKVTPDKLDASVPVITAHFSGADVTL 377
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
F+ + C A ++ +V G QQ+ YDL T+ F +C
Sbjct: 378 N-AINTFVQVADDVVCFAFQPTETGAVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 172/359 (47%), Gaps = 29/359 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ ++ G P + LL DTGS L W QCLPC C+ Q+ P F+P+ SSTY+ C+
Sbjct: 88 FLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTYRNASCESAP 146
Query: 67 CRRPP-FRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSNDNR 123
P FR E G C + + Y ++ G+++ E TF ++ L+ P ++FGC DN
Sbjct: 147 HAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNS 206
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
F+ +G+LG FS++ + + FSYC + L G A I+
Sbjct: 207 GFT---QYSGVLGLGPGTFSIVTRNFGSK---FSYCFGSLIDPTYPHNFLILGNGARIE- 259
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
D +++F DR YYL LQ IS+ + + PG F R GG +IDTG T + R
Sbjct: 260 GDPTPLQIFQDR---YYLDLQAISLGEKLLDIEPGIFQ-RYRSKGGTVIDTGCSPTILAR 315
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWE----YCYRYDSRFRAYA--SMTFHF-DRAD 296
YE + D F + +DWE +CY + + Y +TFHF A+
Sbjct: 316 EAYETLSEEID-----FLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAE 370
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFS--DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ ++ ++ FC+A++ + D SV+GA QQ+ Y+L T + F +C
Sbjct: 371 LALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 165/366 (45%), Gaps = 33/366 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + ++ DTGS LIWTQC PC CF Q AP F P +SST+ ++PC
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 67 CRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ P C CV+ Y G +A G ++TET LK P V FGCS +N
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTA-GYLATET----LKVGDASFPSVAFGCSTEN- 199
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ +GI G SL+ QL G FSYCL S + FG AN+
Sbjct: 200 --GVGNSTSGIAGLGRGALSLIPQL---GVGRFSYCLRSG--SAAGASPILFGSLANLTD 252
Query: 184 KDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGT-GGCMIDTGAIAT 239
++++ + + H YY++L I+V + + TF +NG GG ++D+G T
Sbjct: 253 GNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLT 312
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMTFHFDRAD 296
++ + YE+V + F + N + + C++ A S+ FD
Sbjct: 313 YLAKDGYEMVKQAF---LSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGA 369
Query: 297 FKVEPTYMYFI-FQNEGYFCVAISF------SDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
PTY + ++G VA SV+G Q D +YDL+ G F
Sbjct: 370 EYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFA 429
Query: 350 PENCAN 355
P +CA
Sbjct: 430 PADCAK 435
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 168/358 (46%), Gaps = 21/358 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + DTGS +IWTQC PC NC+ Q P+FNP+ S+TY+++ C +
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSPV 144
Query: 67 CRRPPFRCENGQ------CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCS 119
C F E+ C + I+Y + + G + +T T ++V P GC
Sbjct: 145 CS---FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCG 201
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+DN SFD N++GI+G + P SL+ Q+ S G FSYCL + ++ L FG +A
Sbjct: 202 HDNAG-SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNA 260
Query: 180 NIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
N+ + +++ S Y L L+ +SV + ++ L G +ID+G
Sbjct: 261 NVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSIL--GGKANIIIDSGTT 318
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
T + P ++ S QR + ++ EYC+ + + HF+ A+
Sbjct: 319 LTLL---PVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKVPFIAMHFEGANL 375
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+++ + I ++ C+A + + N S+ G Q + YD+ ++ F P NC
Sbjct: 376 RLQRENV-LIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 176/366 (48%), Gaps = 20/366 (5%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF ++ + GTP F + DTGS L W QC PC C+ Q++P+F+ SSTYK CD
Sbjct: 85 YFMSISI--GTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDS 142
Query: 65 LICR---RPPFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKLVCV-PGVIFGC 118
C+ C+ + C +R +Y + G V+TET + + PG +FGC
Sbjct: 143 KTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGC 202
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+N +F+ +GI+G P SL+ QL S+ FSYCL + TS++ G +
Sbjct: 203 GYNNGG-TFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVINLGTN 261
Query: 179 A--NIQRKDMKTIRMFV---DRSSHYYLSLQDISVADHRIGFAPGTFALRRNG---TGGC 230
+ + KD T+ + D ++Y+L+L+ ++V ++ + G + L TG
Sbjct: 262 SIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNI 321
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
+ID+G T + G Y+ +E T G +R+ + +C++ + ++T
Sbjct: 322 IIDSGTTLTLLDSGFYDDFGTAVEESVT--GAKRVSDPQGLLTHCFKSGDKEIGLPAITM 379
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
HF AD K+ P F+ NE C+++ + ++ G Q D YDL T T+ F
Sbjct: 380 HFTNADVKLSPINA-FVKLNEDTVCLSMIPTTEVAIYGNMVQMDFLVGYDLETKTVSFQR 438
Query: 351 ENCAND 356
+C+ +
Sbjct: 439 MDCSGN 444
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 168/361 (46%), Gaps = 18/361 (4%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
E + Y V V G+P ++L+ D+GS +IW QC PC C+ Q+ P+F+P AS+++ +P
Sbjct: 128 EGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVP 187
Query: 62 CDDLICRRPPFR----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
CD +CR P ++G C ++++Y G+ G+++ ET TF V GV G
Sbjct: 188 CDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTP---VQGVAIG 244
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C + NR F G AG+LG P SL+GQL A G FSYCL + A S++ FG+
Sbjct: 245 CGHRNRGL-FVGA-AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLV-FGR 301
Query: 178 DANIQRKDM-KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D + + + + S YY+ L + V R+ G F L +G GG ++DTG
Sbjct: 302 DDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGT 361
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFD 293
T + Y + F T G + CY Y S ++ F D
Sbjct: 362 AVTRLPPDAYAALRDAFAS--TIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRD 419
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
A + P + G +C+A + S S++G QQQ + D G + F P
Sbjct: 420 GAALTL-PARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPST 478
Query: 353 C 353
C
Sbjct: 479 C 479
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 163/380 (42%), Gaps = 39/380 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKR 59
H Y VD GTP + + DTGS LIWTQC PC CF Q AP++ P S TY
Sbjct: 94 HASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYAN 153
Query: 60 IPCDDLICRR-PPFRC-------------ENGQCVHRINYAGGASASGLVSTETFTFHLK 105
+ C +C P R E G C + +Y G+S G+++TETFTF
Sbjct: 154 VSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGAG 213
Query: 106 NKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYR 165
V + FGC DN N +G++G P SL+ QL T FSYC +
Sbjct: 214 TT---VHDLAFGCGTDN--LGGTDNSSGLVGMGRGPLSLVSQLGVTK---FSYCFT-PFN 264
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVD------RSSHYYLSLQDISVADHRIGFAPGT 219
+ +S L G A++ T FV RSS+YYLSL+ I+V D + P
Sbjct: 265 DTTTSSPLFLGSSASLSPAAKST--PFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAV 322
Query: 220 FALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD 279
F L +G GG +ID+G T ++ + V+ R A C+
Sbjct: 323 FRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVAL---PLASGAHLGLSVCFAAP 379
Query: 280 SRFRAYA----SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDT 335
A + HFD AD ++ + + G C+ I + SV+G+ QQQ+
Sbjct: 380 QGRGPEAVDVPRLVLHFDGADMELPRSSAVVEDRVAGVACLGIVSARGMSVLGSMQQQNM 439
Query: 336 RFVYDLNTGTIQFVPENCAN 355
YD+ + F P NC
Sbjct: 440 HVRYDVGRDVLSFEPANCGE 459
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/364 (30%), Positives = 165/364 (45%), Gaps = 32/364 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + ++ DTGS LIWTQC PC CF Q AP F P +SST+ ++PC
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 67 CRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ P C CV+ Y G +A G ++TET LK P V FGCS +N
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTA-GYLATET----LKVGDASFPSVAFGCSTEN- 199
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ +GI G SL+ QL G FSYCL S + FG AN+
Sbjct: 200 --GVGNSTSGIAGLGRGALSLIPQL---GVGRFSYCLRSG--SAAGASPILFGSLANLTD 252
Query: 184 KDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGT-GGCMIDTGAIAT 239
++++ + + H YY++L I+V + + TF +NG GG ++D+G T
Sbjct: 253 GNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLT 312
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA--SMTFHFDRADF 297
++ + YE+V + F + N + + C++ A S+ FD
Sbjct: 313 YLAKDGYEMVKQAF---LSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAE 369
Query: 298 KVEPTYMYFI-FQNEGYFCVAISF------SDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
PTY + ++G VA SV+G Q D +YDL+ G F P
Sbjct: 370 YAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSP 429
Query: 351 ENCA 354
+CA
Sbjct: 430 ADCA 433
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 160/360 (44%), Gaps = 23/360 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS LIWTQC PC CF+Q+ P F+P+ SST CD +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 67 CRRPPFRC-------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C+ P N CV+ +Y + +G + + FTF VPGV FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF--VGAGASVPGVAFGCG 199
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N F N GI GF P SL QLK G FS+C +T +L D
Sbjct: 200 LFNNGV-FKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLPADL 255
Query: 180 -NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
R +++ + + + + YYLSL+ I+V R+ F L +NGTGG +ID+G
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTL-KNGTGGTIIDSGT 314
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRA 295
T + Y +V F + + D +C R + Y + HF+ A
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVK---LPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGA 371
Query: 296 DFKVEPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ F ++ G C+AI + +G +QQQ+ +YDL + FVP C
Sbjct: 372 TMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 168/356 (47%), Gaps = 14/356 (3%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
+Y + GTP + + DTGS IW QC PC C NQ++PIFNP+ SSTYK I C
Sbjct: 89 YYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSSP 148
Query: 66 ICRR-PPFRCENG---QCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSN 120
IC+R RC + +C + I Y + + G +S +T T + + + P ++ GC +
Sbjct: 149 ICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCGH 208
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N + +G +GI+GF FS++ QL S+ G FSYCL + + +S L FG A
Sbjct: 209 KN-SLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYFGDMAV 267
Query: 181 IQRKDMKTIRMFVD-RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + + +Y+ +L+ SV DH I +L + G +ID+G+ T
Sbjct: 268 VSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDS--SLIPDNEGNAVIDSGSTIT 325
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ P +V + + +R+ + ++ CY+ + +T HF AD K+
Sbjct: 326 QL---PNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPIITAHFRGADVKL 382
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVV-GAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ FI N C A + S VV G QQ+ YD I F P NC
Sbjct: 383 N-AFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFKPTNCT 437
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 165/357 (46%), Gaps = 16/357 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+ + DTGS LIWTQC PC C+ Q AP+F+P +SSTY+ I C
Sbjct: 92 YLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTKQ 151
Query: 67 CR--RPPFRCE---NGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSN 120
C + C N C + +Y + SG V+ +T T + V +P I GC +
Sbjct: 152 CDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGH 211
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
+N SF +GI+G P SL+ QL ST G FSYCLV +S L FG +
Sbjct: 212 NNGG-SFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGI 270
Query: 181 IQRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ +++ + D + Y+L+L+ +SV RI F +F G +ID+G T
Sbjct: 271 VSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSE---GNIIIDSGTTLT 327
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ + + + + + S CY D+ + + S+T HFD AD K+
Sbjct: 328 LFPEDFFSELSSAVQD---AVAGTPVEDPSGILSLCYSIDADLK-FPSITAHFDGADVKL 383
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
P F+ ++ C A + + ++ G Q + YDL T+ F P +C D
Sbjct: 384 NP-LNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNFLVGYDLEGKTVSFKPTDCTQD 439
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 167/361 (46%), Gaps = 21/361 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF +V V GTP L+ DTGS ++W QC PCV+C+ Q +P+++P SSTY + PC
Sbjct: 98 EYFASVGV--GTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCS 155
Query: 64 DLICRRPPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
CR P C+ G C +RI Y +S SG ++T+ F + V V GC +D
Sbjct: 156 PPQCRN-PQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVF---SNDTSVGNVTLGCGHD 211
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N G+ AG+LG + S Q+ + F+YCL R ++S L FG+ A
Sbjct: 212 NEGLF--GSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGRTAPE 269
Query: 182 QRKDMKT-IRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGCMIDTGAIA 238
+ T +R R S YY+ + SV + GF+ + +L G GG ++D+G
Sbjct: 270 PPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDSGTSI 329
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA---SMTFHFDRA 295
T R Y + FD G +++ ++ C YD R A A + HF
Sbjct: 330 TRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDAC--YDLRGVAVADAPGVVLHFAGG 387
Query: 296 DFKVEPTYMYFIFQNEG-YFCVAISFS--DRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
P Y + + G Y C A+ + D SV+G QQ R V+D+ + F P
Sbjct: 388 ADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERVGFEPNG 447
Query: 353 C 353
C
Sbjct: 448 C 448
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 176/363 (48%), Gaps = 19/363 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
+Y Y +++ GTP + DTGS LIW QC+PC NC+ Q P+F+P +SSTY I
Sbjct: 56 HYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYG 115
Query: 64 DLICRRP-PFRCENGQ--CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCS 119
C + C Q C + +Y + G+++ ET T K V + GVIFGC
Sbjct: 116 SESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCG 175
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILRFGKD 178
++N F+ GI+G P SL+ Q+ S+ G +FS CLV + TS + FGK
Sbjct: 176 HNNNGV-FNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKG 234
Query: 179 ANIQRKDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
+ + + + + V +++H Y+++L ISV D + F G+ +L G +ID+G
Sbjct: 235 SEVLGNGVVSTPL-VSKNTHQAFYFVTLLGISVEDINLPFNDGS-SLEPITKGNMVIDSG 292
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDSRFRAYASMTFHFDR 294
T + P + R +E + + + ++ CYR + + ++T HF+
Sbjct: 293 TPTTLL---PEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTPTNLKG-TTLTAHFEG 348
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
AD + PT ++ Q +G FC A +FS+ + G Q + +DL + F +
Sbjct: 349 ADVLLTPTQIFIPVQ-DGIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATD 407
Query: 353 CAN 355
C N
Sbjct: 408 CTN 410
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 174/362 (48%), Gaps = 20/362 (5%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR 68
+ + GTP F + DTGS L W QC PC C+ ++ PIF+ SSTYK PCD C+
Sbjct: 87 MSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQ 146
Query: 69 ---RPPFRCE--NGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C+ N C +R +Y + + G V+TET + V PG +FGC +N
Sbjct: 147 ALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNN 206
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA--N 180
+FD +GI+G SL+ QL S+ FSYCL + TS++ G ++ +
Sbjct: 207 GG-TFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIPS 265
Query: 181 IQRKDMKTIRM-FVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNG-----TGGCMI 232
KD + VD+ ++YYL+L+ ISV +I + ++ +G +G +I
Sbjct: 266 SLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIII 325
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
D+G T ++ G ++ +E T G +R+ + +C++ S +T HF
Sbjct: 326 DSGTTLTLLEAGFFDKFSSAVEESVT--GAKRVSDPQGLLSHCFKSGSAEIGLPEITVHF 383
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
AD ++ P F+ +E C+++ + ++ G + Q D YDL T T+ F +
Sbjct: 384 TGADVRLSPINA-FVKLSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQHMD 442
Query: 353 CA 354
C+
Sbjct: 443 CS 444
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 161/373 (43%), Gaps = 43/373 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP----- 61
Y +D+ GTP + L DTGS LIWTQC C C Q P+F+P SS+Y+ +
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQL 157
Query: 62 CDDLI---CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C D++ C RP C +R +Y G + G +TE FTF + + FGC
Sbjct: 158 CGDILHHSCVRP------DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGC 211
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
N N +GI+GF P SL+ QL FSYCL ++T L+FG
Sbjct: 212 GTMN--VGSLNNASGIVGFGRDPLSLVSQLSIRR---FSYCLTPYASSRKST--LQFGSL 264
Query: 179 ANIQRKDMKT-------IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
A++ D T I + YY++ ++V R+ FALR +G+GG +
Sbjct: 265 ADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVI 324
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN-ASEDWEYCYRYD---------SR 281
ID+G T V+R F R N +S D C+ +R
Sbjct: 325 IDSGTALTLFPAAVLAEVVRAFRSQL----RLPFANGSSPDDGVCFAAPAVAAGGGRMAR 380
Query: 282 FRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYD 340
A M FHF AD + G+ CV + S D + +G + QQD R VYD
Sbjct: 381 QVAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYD 440
Query: 341 LNTGTIQFVPENC 353
L T+ F P C
Sbjct: 441 LERETLSFAPVEC 453
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 159/359 (44%), Gaps = 27/359 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G+P ++L+ D+GS +IW QC PC C+ Q+ P+F+P ASS++ + C I
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 67 CRR-----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
CR + G+C + + Y G+ G ++ ET T V GV GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F G AG+LG SL+GQL A G+FSYCL A R L G+ +
Sbjct: 246 NSGL-FVG-AAGLLGLGWGAMSLVGQLGGAAGGVFSYCL--ASRGAGGAGSLVLGRTEAV 301
Query: 182 QRKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + ++ SS YY+ L I V R+ F L +G GG ++DTG T
Sbjct: 302 PVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTR 361
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS-----MTFHFDRA 295
+ R Y + FD + R A + CY YAS ++F+FD+
Sbjct: 362 LPREAYAALRGAFDGAMGALPRSP---AVSLLDTCY----DLSGYASVRVPTVSFYFDQG 414
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + FC+A + S S++G QQ+ + D G + F P C
Sbjct: 415 AVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 158/355 (44%), Gaps = 26/355 (7%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFR-CENGQCV 80
L D G L W QCLPC +C Q +P+F+P S T+ IP + + RPP++ NG C
Sbjct: 113 LALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGACG 172
Query: 81 HRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSV 139
I Y ASG ++ +TF+F N V + ++FGC++ F +AGILG +
Sbjct: 173 FDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGILGLGM 232
Query: 140 SPF-----SLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD------ANIQRKDMKT 188
P + Q+ G FSYC M S LRFG D N+ R+
Sbjct: 233 GPAGKPPTAFTKQVLPAHGGRFSYCPFVP--GMSMYSYLRFGSDIPSHPPPNVHRQSTPV 290
Query: 189 IRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYE 247
+ S Y++ L +SV +R+ G P F +G GGC++D G T Y
Sbjct: 291 LAP-AHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSAYV 349
Query: 248 VVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDR-ADFKVEP--TY 303
+ +H R+ H C + + SMT HF+ A +V P +
Sbjct: 350 HIDHAVRQHLQ---RRGAHIVVVRGNTCVQQPAPHHDVLPSMTLHFENGAWLRVMPEHVF 406
Query: 304 MYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLN--TGTIQFVPENCAND 356
M F+ Y C S +V+GA QQ + RF++DL+ + F PE+C D
Sbjct: 407 MPFVVGGHHYQCFGFVSSTDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDCHLD 461
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 175/362 (48%), Gaps = 17/362 (4%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N Y + + GTP + ++DTGS L+WTQCLPC++C+ Q P+F+P+ S+++K + C+
Sbjct: 88 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 64 DLICR-RPPFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCS 119
CR C Q C Y G+ A G+++TET T + + + + ++FGC
Sbjct: 148 SQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFGCG 207
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG--LFSYCLVYAYREMEATSILRFGK 177
++N +F+ N G+ G P SL Q+ ST FS CLV + TS + FG
Sbjct: 208 HNNSG-TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGP 266
Query: 178 DANIQRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
+A + D+ + + D ++Y+++L ISV D F+ + + G ID G
Sbjct: 267 EAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATK---GNVFIDAGT 323
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
T + R Y +++ E + + + + + CYR + +T HFD AD
Sbjct: 324 PPTLLPRDFYNRLVQGVKE---AIPMEPVQDPDLQPQLCYRSATLIDG-PILTAHFDGAD 379
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNS-VVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+++P FI EG +C A+ D ++ + G + Q + +DL+ + F +C
Sbjct: 380 VQLKPLNT-FISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438
Query: 356 DH 357
Sbjct: 439 QQ 440
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 161/373 (43%), Gaps = 43/373 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP----- 61
Y +D+ GTP + L DTGS LIWTQC C C Q P+F+P SS+Y+ +
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQL 157
Query: 62 CDDLI---CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C D++ C RP C +R +Y G + G +TE FTF + + FGC
Sbjct: 158 CGDILHHSCVRP------DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGC 211
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
N N +GI+GF P SL+ QL FSYCL ++T L+FG
Sbjct: 212 GTMN--VGSLNNASGIVGFGRDPLSLVSQLSIRR---FSYCLTPYASSRKST--LQFGSL 264
Query: 179 ANIQRKDMKT-------IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
A++ D T I + YY++ ++V R+ FALR +G+GG +
Sbjct: 265 ADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVI 324
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN-ASEDWEYCYRYD---------SR 281
ID+G T V+R F R N +S D C+ +R
Sbjct: 325 IDSGTALTLFPVAVLAEVVRAFRSQL----RLPFANGSSPDDGVCFAAPAVAAGGGRMAR 380
Query: 282 FRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYD 340
A M FHF AD + G+ CV + S D + +G + QQD R VYD
Sbjct: 381 QVAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYD 440
Query: 341 LNTGTIQFVPENC 353
L T+ F P C
Sbjct: 441 LERETLSFAPVEC 453
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 164/358 (45%), Gaps = 28/358 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + DTGS ++W QC PC CFNQ++PIFNP+ SS+YK IPC
Sbjct: 89 YLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSST 148
Query: 67 CRRPP---FRCENGQ--CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSN 120
C+ C NG C + I Y G A + G +S ++ T V P ++ GC +
Sbjct: 149 CKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGH 208
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILRFGKDA 179
N + +G++G P SL+ Q+ S++ G FSYCL+ + ++S L FG+D
Sbjct: 209 INV-LQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIFGEDV 267
Query: 180 NIQRKDMKTIRMFV--DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
+ + + + M + ++Y+L+L+ SV ++RI + + A +N +ID+G
Sbjct: 268 VVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQN----ILIDSGT- 322
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQ-----RMHNASEDWEYCYRYDSRFRAYASMTFHF 292
P ++ F S+ Q R+ CY + +T HF
Sbjct: 323 -------PLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHF 375
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
+ AD K+ +F F+ +G C S+ + G Q + YDL I F P
Sbjct: 376 NGADVKLNSNGTFFPFE-DGIMCFGFISSNGLEIFGNIAQNNLLIDYDLEKEIISFKP 432
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 174/366 (47%), Gaps = 20/366 (5%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF ++ + GTP + DTGS L W QC PC C+ Q+ P+F+ SSTYK CD
Sbjct: 85 YFMSISI--GTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDS 142
Query: 65 LICRRPPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGC 118
+ C E C +R +Y + G V+TET + + V PG FGC
Sbjct: 143 ITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFGC 202
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+N +F+ +GI+G P SL+ QL S+ FSYCL + TS++ G +
Sbjct: 203 GYNNGG-TFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSATTNGTSVINLGTN 261
Query: 179 ANIQR--KD---MKTIRMFVDRSSHYYLSLQDISVADHRIGF-APGTFALRRNG--TGGC 230
+ + KD + T + D ++Y+L+L+ I+V ++ + G ++L R TG
Sbjct: 262 SMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSKKTGNI 321
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
+ID+G T + G Y+ +E T G +R+ + +C++ + ++T
Sbjct: 322 IIDSGTTLTLLDSGFYDDFGAVVEESVT--GAKRVSDPQGILTHCFKSGDKEIGLPTITM 379
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
HF AD K+ P F+ +E C+++ + ++ G Q D YDL T T+ F
Sbjct: 380 HFTGADVKLSP-INSFVKLSEDIVCLSMIPTTEVAIYGNMVQMDFLVGYDLETKTVSFQR 438
Query: 351 ENCAND 356
+C+ +
Sbjct: 439 MDCSGN 444
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 173/379 (45%), Gaps = 43/379 (11%)
Query: 7 YTVDVLFGTPSKSEFLL-FDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP +L DTGS L+WTQC C CF+Q P+F + S T+ R+PC D
Sbjct: 94 YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDP 152
Query: 66 ICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKL---VCVPGVIF 116
+C + +G C + Y + +G ++ +TFTF ++ VP + F
Sbjct: 153 LCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRF 212
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC N F N +GI GF P SL QLK FSYC A E + ++ G
Sbjct: 213 GCGMMNYGL-FTPNQSGIAGFGTGPLSLPSQLKVRR---FSYCFT-AMEESRVSPVILGG 267
Query: 177 KDANIQRKDMKTIR----------MFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
+ NI+ I+ V Y+LSL+ ++V + R+ F TFAL+ +G
Sbjct: 268 EPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDG 327
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE--YCYRYDSRFRA 284
+GG ID+G TF + V R E F + + D + C+ ++ +A
Sbjct: 328 SGGTFIDSGTAITFFP----QAVFRSLREAFVAQVPLPVAKGYTDPDNLLCFSVPAKKKA 383
Query: 285 YA--SMTFHFDRADFKVEPTYMYFIFQNEG------YFCVAI--SFSDRNSVVGAWQQQD 334
A + H + AD+++ P Y + ++ CV I + + +++G +QQQ+
Sbjct: 384 PAVPKLILHLEGADWEL-PRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQN 442
Query: 335 TRFVYDLNTGTIQFVPENC 353
VYDL + + F P C
Sbjct: 443 MHIVYDLESNKMVFAPARC 461
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 114/372 (30%), Positives = 173/372 (46%), Gaps = 39/372 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQ-SAPIFNPNASSTYKRIPCDDL 65
Y + V GTP + L DTGS L+WTQC PC++CF Q +AP+ +P ASST+ +PCD
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCDAP 149
Query: 66 ICRRPPFRCENGQ------CVHRINYAGGASASGLVSTETFTFHLKNKL--VCVPGVIFG 117
+CR PF G+ CV+ +Y + G ++T++FTF + + V FG
Sbjct: 150 LCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVTFG 209
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C + N+ F N GI GF +SL QL T+ FSYC + + +++S++ G
Sbjct: 210 CGHINKGI-FQANETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMF-DTKSSSVVTLGA 264
Query: 178 DA--------NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
A D++T R+ + S S Y++ L+ ISV R+ R+ T
Sbjct: 265 AAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRL---RSST 321
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS----RFR 283
+ID+GA T + YE V F + G S + C+ R
Sbjct: 322 ---IIDSGASITTLPEDVYEAVKAEF---VSQVGLPAAAAGSAALDLCFALPVAALWRRP 375
Query: 284 AYASMTFHFD-RADFKVEPTYMYFIFQNEGYFCVAI-SFSDRNSVVGAWQQQDTRFVYDL 341
A ++T H D AD+++ F CV + + + V+G +QQQ+T VYDL
Sbjct: 376 AVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDL 435
Query: 342 NTGTIQFVPENC 353
+ F P C
Sbjct: 436 ENDVLSFAPARC 447
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 161/370 (43%), Gaps = 36/370 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD+ GTP + L DTGS LIWTQC PC +C +Q P+F P S++Y+ + C +
Sbjct: 96 YVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTL 155
Query: 67 CRRPPFR-CENGQ-CVHRINYAGGASASGLVSTETFTF----HLKNKLVCVPGVIFGCSN 120
C CE C +R NY G G+ +TE FTF VP + FGC +
Sbjct: 156 CSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LGFGCGS 214
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDA 179
N +G+ GI+GF +P SL+ QL FSYCL YA R S L FG +
Sbjct: 215 VNVGSLNNGS--GIVGFGRNPLSLVSQLSIRR---FSYCLTSYASRRQ---STLLFGSLS 266
Query: 180 N------IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
+ R + + YY+ ++V R+ FALR +G+GG ++D
Sbjct: 267 DGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVD 326
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE-YCYRYDSRFRAYAS----- 287
+G T + V+R F + R N + C+ + +R +S
Sbjct: 327 SGTALTLLPAAVLAEVVRAFRQQL----RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMP 382
Query: 288 ---MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYDLNT 343
M HF AD + G C+ ++ S D S +G QQD R +YDL
Sbjct: 383 VPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEA 442
Query: 344 GTIQFVPENC 353
T+ P C
Sbjct: 443 ETLSIAPARC 452
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 104/356 (29%), Positives = 163/356 (45%), Gaps = 22/356 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + DTGS LIWTQC PC C+ Q P+F+P +S TY+ CD
Sbjct: 95 YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQ 154
Query: 67 CR-RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRD 124
C C C ++ +Y + G V+++T T V P + GC ++N D
Sbjct: 155 CSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCGHEN-D 213
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
+F +GI+G P SL+ Q+ S+ G FSYCLV +S L FG +A +
Sbjct: 214 GTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGP 273
Query: 185 DMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+++ + SS Y+L+L+ +SV + RI F + G G +ID+G T +
Sbjct: 274 GVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLG---TGEGNIIIDSGTTLTIVP 330
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADFK 298
F T+ G Q +ED CY S + A +T HF AD K
Sbjct: 331 D-------DFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDLKVPA-ITAHFTGADVK 382
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++P F+ ++ C+A + + S+ G Q + Y++ ++ F P +C
Sbjct: 383 LKPINT-FVQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDC 437
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/351 (31%), Positives = 158/351 (45%), Gaps = 40/351 (11%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFR 73
GTP + + DTGS ++W QC PC C+NQ+ P F P+ SSTYK IPC +C+
Sbjct: 94 GTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDLCK----- 148
Query: 74 CENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNRDFSFDGNIA 132
+GQ G +S +T T + P + GC DN SF+G +
Sbjct: 149 --SGQ-------------QGNLSVDTLTLESSTGHPISFPKTVIGCGTDNT-VSFEGASS 192
Query: 133 GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMF 192
GI+G P SL+ QL S+ FSYCL+ E TS L FG A + + + +
Sbjct: 193 GIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIV 252
Query: 193 -VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG--TGGCMIDTGAIATFIQRGPYEVV 249
D YYL+L+ SV + RI F + NG G +ID+G T I P +V
Sbjct: 253 KKDPIVFYYLTLEAFSVGNKRIEFEGSS-----NGGHEGNIIIDSGTTLTVI---PTDVY 304
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQ 309
+R+++ + + CY S + +T HF AD K+ P F+
Sbjct: 305 NNLESAVLELVKLKRVNDPTRLFNLCYSVTSDGYDFPIITTHFKGADVKLHPIST-FVDV 363
Query: 310 NEGYFCVAISF------SDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+G C+A + SD S+ G QQ+ YDL + F P +C+
Sbjct: 364 ADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 177/373 (47%), Gaps = 41/373 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCL----PCVNCFNQSAPIFNPNASSTYKRIPC 62
+++ V GTP + L+ DTGS LIWTQC V + S P+++P SST+ +PC
Sbjct: 91 HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPC 150
Query: 63 DDLICRRPPFRCEN----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
D +C+ F +N +CV+ Y G A+A G++++ETFTF + + G FGC
Sbjct: 151 SDRLCQEGQFSFKNCTSKNRCVYEDVY-GSAAAVGVLASETFTFGARRAVSLRLG--FGC 207
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGK 177
+ GILG S SL+ QLK FSYCL +A ++ TS L FG
Sbjct: 208 GALSAGSLIGAT--GILGLSPESLSLITQLKIQR---FSYCLTPFADKK---TSPLLFGA 259
Query: 178 DANIQR----KDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
A++ R + ++T + + ++ +YY+ L IS+ R+ + A+R +G GG +
Sbjct: 260 MADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTI 319
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDSRFRAYA---- 286
+D+G+ ++ +E V E R + N + ED+E C+ R A A
Sbjct: 320 VDSGSTVAYLVEAAFEAV----KEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAV 375
Query: 287 ---SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYD 340
+ HFD V P YF G C+A+ + S++G QQQ+ ++D
Sbjct: 376 QVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 435
Query: 341 LNTGTIQFVPENC 353
+ F P C
Sbjct: 436 VQHHKFSFAPTQC 448
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 171/364 (46%), Gaps = 27/364 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V V GTP + +L+ DTGS + W QC PC NC+ Q +FNP++SS++K + C
Sbjct: 16 YFAVVGV--GTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSSFKVLDCSS 73
Query: 65 LIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVCVPGVIFGCSND 121
+C C + +C+++ +Y G+ G + T+ V + + GC +D
Sbjct: 74 SLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCGHD 133
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N +F G AGILG P S L ++ + +FSYCL + S L FG DA I
Sbjct: 134 NEG-TF-GTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKSTLVFG-DAAI 190
Query: 182 QRKDMKTIRMFVDR------SSHYYLSLQDISVADHRIGFAPGT-FALRRNGTGGCMIDT 234
+++ F+ + +++YY+ + ISV + + P + F L +G GG + D+
Sbjct: 191 PHTATGSVK-FIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDS 249
Query: 235 GAIATFIQRGPYEVV---MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTF 290
G T ++ Y V R H TS ++ ++ CY + + ++TF
Sbjct: 250 GTTITRLEARAYTAVRDAFRAATMHLTSAADFKI------FDTCYDFTGMNSISVPTVTF 303
Query: 291 HFD-RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
HF D ++ P+ N FC A + S SV+G QQQ R +YD I +
Sbjct: 304 HFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGPSVIGNVQQQSFRVIYDNVHKQIGLL 363
Query: 350 PENC 353
P+ C
Sbjct: 364 PDQC 367
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 159/360 (44%), Gaps = 26/360 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + L DTGS L+WTQC PC CFNQS P ++ + SST+ CD
Sbjct: 35 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 94
Query: 67 CRRPP--FRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C+ P C N C + +Y ++ G + ET +F VPGV+FGC +
Sbjct: 95 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGAS---VPGVVFGCGLN 151
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F N GI GF P SL QLK G FS+C +T + D
Sbjct: 152 NTGI-FRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPADLYK 207
Query: 182 QRKDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + +H YYLSL+ I+V R+ FAL+ NGTGG +ID+G
Sbjct: 208 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTIIDSGTAF 266
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY-CYRYDSRFRA--YASMTFHFDRA 295
T + Y +V F H + + ++E C+ +A + HF+ A
Sbjct: 267 TSLPPRVYRLVHDEFAAHV----KLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA 322
Query: 296 DFKVEPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ F ++ G C+AI +++G +QQQ+ +YDL + FV C
Sbjct: 323 TMHLPRENYVFEAKDGGNCSICLAI-IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 103/351 (29%), Positives = 162/351 (46%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P ++++++ D+GS +IW QC PC C++QS P+FNP SS+Y + C +
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTV 193
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C C G+C + ++Y G+ G ++ ET TF + V GC + N+
Sbjct: 194 CSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTF----GRTLIRNVAIGCGHHNQGM 249
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
F G AG+LG P S +GQL A G FSYCLV R ++++ +L+FG++A
Sbjct: 250 -FVG-AAGLLGLGSGPMSFVGQLGGQAGGTFSYCLV--SRGIQSSGLLQFGREAVPVGAA 305
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L + V R+ + F L G GG ++DTG T +
Sbjct: 306 WVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAA 365
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
YE F T+ R + ++ CY +++F+F P
Sbjct: 366 YEAFRDAFIAQTTNLPRA---SGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARN 422
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I + G FC A + S S++G QQ+ D G + F P C
Sbjct: 423 FLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 159/360 (44%), Gaps = 26/360 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + L DTGS L+WTQC PC CFNQS P ++ + SST+ CD
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 67 CRRPP--FRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C+ P C N C + +Y ++ G + ET +F VPGV+FGC +
Sbjct: 151 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAG---ASVPGVVFGCGLN 207
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F N GI GF P SL QLK G FS+C +T + D
Sbjct: 208 NTGI-FRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPADLYK 263
Query: 182 QRKDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + +H YYLSL+ I+V R+ FAL+ NGTGG +ID+G
Sbjct: 264 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTIIDSGTAF 322
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY-CYRYDSRFRA--YASMTFHFDRA 295
T + Y +V F H + + ++E C+ +A + HF+ A
Sbjct: 323 TSLPPRVYRLVHDEFAAHV----KLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA 378
Query: 296 DFKVEPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ F ++ G C+AI +++G +QQQ+ +YDL + FV C
Sbjct: 379 TMHLPRENYVFEAKDGGNCSICLAI-IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 166/360 (46%), Gaps = 39/360 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
Y Y + + GTP + DTGS IWTQCLPCV+C+NQ+APIF+P+ SST+K I CD
Sbjct: 62 TYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCD 121
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDN 122
+ C + + Y G + G + TET T H + +P I GC +N
Sbjct: 122 T----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN 171
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F AG++G P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 172 S--GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFA-----GKGTSKINFGANAIVA 224
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTF-ALRRNGTGGCMIDTGAIAT 239
+ + +FV + YYL+L +SV + RI F AL+ G +ID+G+ T
Sbjct: 225 GDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALK----GNIVIDSGSTLT 280
Query: 240 FIQRGPYEVVMRHFDEHFTS--FGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RAD 296
+ +V + ++ T+ F R + CY Y + +T HF AD
Sbjct: 281 YFPESYCNLVRKAVEQVVTAVRFPRSDI--------LCY-YSKTIDIFPVITMHFSGGAD 331
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ MY G FC+AI + ++ G Q + YD ++ + F P NC+
Sbjct: 332 LVLDKYNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 171/362 (47%), Gaps = 20/362 (5%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR 68
+ + GTP F + DTGS L W QC PC C+ ++ PIF+ SSTYK PCD C
Sbjct: 87 MSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCH 146
Query: 69 ---RPPFRCENGQ--CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C+ + C +R +Y + + G V+TET + V PG +FGC +N
Sbjct: 147 ALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFGCGYNN 206
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA--N 180
+FD +GI+G SL+ QL S+ FSYCL + TS++ G ++ +
Sbjct: 207 GG-TFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIPS 265
Query: 181 IQRKDMKTIRM-FVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNG-----TGGCMI 232
KD I VD+ ++YYL+L+ ISV +I + ++ G +G +I
Sbjct: 266 SLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETSGNIII 325
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
D+G T + G ++ +E T G +R+ + +C++ S +T HF
Sbjct: 326 DSGTTLTLLDSGFFDKFGAAVEELVT--GAKRVSDPQGLLSHCFKSGSAEIGLPEITVHF 383
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
AD ++ P F+ +E C+++ + ++ G + Q D YDL T T+ F +
Sbjct: 384 TGADVRLSPINA-FVKVSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQRMD 442
Query: 353 CA 354
C+
Sbjct: 443 CS 444
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 166/360 (46%), Gaps = 39/360 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
Y Y + + GTP + DTGS IWTQCLPCV+C+NQ+APIF+P+ SST+K I CD
Sbjct: 56 TYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCD 115
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDN 122
+ C + + Y G + G + TET T H + +P I GC +N
Sbjct: 116 T----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN 165
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F AG++G P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 166 S--GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFA-----GKGTSKINFGANAIVA 218
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTF-ALRRNGTGGCMIDTGAIAT 239
+ + +FV + YYL+L +SV + RI F AL+ G +ID+G+ T
Sbjct: 219 GDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALK----GNIVIDSGSTLT 274
Query: 240 FIQRGPYEVVMRHFDEHFTS--FGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RAD 296
+ +V + ++ T+ F R + CY Y + +T HF AD
Sbjct: 275 YFPESYCNLVRKAVEQVVTAVRFPRSDI--------LCY-YSKTIDIFPVITMHFSGGAD 325
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ MY G FC+AI + ++ G Q + YD ++ + F P NC+
Sbjct: 326 LVLDKYNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 174/362 (48%), Gaps = 17/362 (4%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N Y + + GTP + ++DTGS L+WTQCLPC++C+ Q P+F+P+ S+++K + C+
Sbjct: 88 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 64 DLICR-RPPFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCS 119
CR C Q C Y G+ A G+++TET T + + + + ++FGC
Sbjct: 148 SQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGCG 207
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG--LFSYCLVYAYREMEATSILRFGK 177
++N +F+ N G+ G P SL Q+ ST FS CLV + TS + FG
Sbjct: 208 HNNSG-TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGP 266
Query: 178 DANIQRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
+A + + + + D ++Y+++L ISV D F+ + + G ID G
Sbjct: 267 EAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATK---GNVFIDAGT 323
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
T + R Y +++ E + + + + + CYR + +T HFD AD
Sbjct: 324 PPTLLPRDFYNRLVQGVKE---AIPMEPVQDPDLQPQLCYRSATLIDG-PILTAHFDGAD 379
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNS-VVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+++P FI EG +C A+ D ++ + G + Q + +DL+ + F +C
Sbjct: 380 VQLKPLNT-FISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438
Query: 356 DH 357
Sbjct: 439 QQ 440
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 162/360 (45%), Gaps = 25/360 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + G+P + DTGS LIW QC PC NCF Q P+F P SSTYK CD
Sbjct: 89 YLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQP 148
Query: 67 CR--RPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKN--KLVCVPGVIFGCSN 120
C +P R + GQC++ I Y + + G++ TET +F + V P IFGC
Sbjct: 149 CTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGV 208
Query: 121 DNRDFSFDGN-IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
DN + N + GI G P SL+ QL + FSYCL+ + +TS L+FG +A
Sbjct: 209 DNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCLL--PYDSTSTSKLKFGSEA 266
Query: 180 NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
I + + + + S ++Y+L+L+ +++ + G +ID+G
Sbjct: 267 IITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTG--------QTDGNIVIDSGTP 318
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
T+++ Y + E + G + + + + C+ + A + F F A
Sbjct: 319 LTYLENTFYNNFVASLQE---TLGVKLLQDLPSPLKTCFPNRANL-AIPDIAFQFTGASV 374
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+ P + + C+A+ S S+ G+ Q D + YDL + F P +CA
Sbjct: 375 ALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDCAK 434
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 110/363 (30%), Positives = 171/363 (47%), Gaps = 19/363 (5%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
+E +Y + V GTP + + DTGS ++W QC PC C+NQ+ P FNP+ SS+YK I
Sbjct: 83 YEGDYIMSYSV--GTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNI 140
Query: 61 PCDDLICR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFG 117
C +C+ R + C + INY + + G +S ET T + V P + G
Sbjct: 141 SCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIG 200
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME----ATSIL 173
C +N SF +G++G P SL+ QL + G FSYCLV ++ +S L
Sbjct: 201 CGTNNIG-SFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKL 259
Query: 174 RFGKDANIQRKD-MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
FG A + + + T + D S YYL+++ SV D R+ FA + + G +I
Sbjct: 260 NFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEE---GNIII 316
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFH 291
D+ I TF+ P +V + +R+ + ++ + CY S + MT H
Sbjct: 317 DSSTIVTFV---PSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSDEEYDFPYMTAH 373
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
F AD + T F+ C A + S+ ++ G++ QQD YDL T+ F
Sbjct: 374 FKGADILLYATNT-FVEVARDVLCFAFAPSNGGAIFGSFSQQDFMVGYDLQQKTVSFKSV 432
Query: 352 NCA 354
+C
Sbjct: 433 DCT 435
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 168/354 (47%), Gaps = 15/354 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS L+WTQC PC +C+ Q P+F+P ASSTYK + C
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQ 153
Query: 67 C----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSND 121
C + E+ C + +Y + G ++ +T T + V + +I GC ++
Sbjct: 154 CTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIGCGHN 213
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N +F+ +GI+G SL+ QL + G FSYCLV E + TS + FG +A +
Sbjct: 214 NAG-TFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTNAVV 272
Query: 182 QRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + + + YYL+L+ ISV + + PG+ + +G G +ID+G T
Sbjct: 273 SGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQY-PGSDS--GSGEGNIIIDSGTTLTL 329
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ P E D +S ++ + CY + A +T HFD AD ++
Sbjct: 330 L---PTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPA-ITMHFDGADVNLK 385
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
P+ F+ +E C A S S+ G Q + YD + T+ F P +CA
Sbjct: 386 PSNC-FVQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 438
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 159/359 (44%), Gaps = 22/359 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP++ +++ DTGS ++W QC PC+ C++Q+ P+F+P S ++ IPC
Sbjct: 144 EYFTRLGV--GTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCG 201
Query: 64 DLICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR P + C+++++Y G+ G STET TF V V+ GC +
Sbjct: 202 SPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTR----VGRVVLGCGH 257
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG F Q+ FSYCL +SI+ FG A
Sbjct: 258 DNEGLFVGAAGLLGLGRGRLSFP--SQIGRRFNSKFSYCLGDRSASSRPSSIV-FGDSAI 314
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YY+ L ISV R+ G + F L G GG +ID+G T
Sbjct: 315 SRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSVT 374
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRAD 296
+ R Y + F G + A E ++ C+ + ++ HF AD
Sbjct: 375 RLTRAAYVALRDAF-----LVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGAD 429
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + N G FC A + + S++G QQQ R VYDL T + F P CA
Sbjct: 430 VPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGCA 488
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 172/370 (46%), Gaps = 26/370 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DVL G+P K L+ DTGS L W QCLPC +CF Q+ ++P AS++YK I C+D
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPR 214
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C PP C +N C + Y ++ +G + ETFT +L ++L V +
Sbjct: 215 CNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENM 274
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG FS QL+S FSYCLV + +S L
Sbjct: 275 MFGCGHWNRGLFHGAAGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSDTNVSSKLI 332
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D ++ FV R + YY+ ++ I VA + T+ + +G GG
Sbjct: 333 FGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGG 392
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMT 289
+ID+G ++ YE + E + G+ ++ + C+ S +
Sbjct: 393 TIIDSGTTLSYFAEPAYEFIKNKIAEK--AKGKYPVYRDFPILDPCFNV-SGIDSIQLPE 449
Query: 290 FHFDRADFKVE--PTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGT 345
AD V PT FI+ NE C+AI + ++ S++G +QQQ+ +YD
Sbjct: 450 LGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNFHILYDTKRSR 509
Query: 346 IQFVPENCAN 355
+ + P CA+
Sbjct: 510 LGYAPTKCAD 519
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 168/359 (46%), Gaps = 29/359 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ ++ G P + LL DTGS L W CLPC C+ Q+ P F+P+ SSTY+ C
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAP 136
Query: 67 CRRPP-FRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSNDNR 123
P FR E G C + + Y ++ G+++ E TF + L+ ++FGC DN
Sbjct: 137 HAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNS 196
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
F+ +G+LG FS++ + + FSYC +IL G A I+
Sbjct: 197 GFT---KYSGVLGLGPGTFSIVTRNFGSK---FSYCFGSLTNPTYPHNILILGNGAKIE- 249
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
D +++F DR YYL LQ IS + + PGTF R GG +IDTG T + R
Sbjct: 250 GDPTPLQIFQDR---YYLDLQAISFGEKLLDIEPGTFQ-RYRSQGGTVIDTGCSPTILAR 305
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEY----CYRYDSRFRAYA--SMTFHF-DRAD 296
YE + D F + +DW+ CY + + Y +TFHF A+
Sbjct: 306 EAYETLSEEID-----FLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAE 360
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFS--DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ ++ ++ FC+A++ + D SV+GA QQ+ Y+L T + F +C
Sbjct: 361 LALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 168/372 (45%), Gaps = 31/372 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP K L+ DTGS L W QC+PC+ CF QS P ++P SS+++ I C D
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 256
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C+ PP C EN C + Y G++ +G + ETFT +L ++L V V
Sbjct: 257 CQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENV 316
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG F+ Q++S FSYCLV +S L
Sbjct: 317 MFGCGHWNRGLFHGAAGLLGLGKGPLSFA--SQMQSLYGQSFSYCLVDRNSNASVSSKLI 374
Query: 175 FGKDANIQRKDMKTIRMF-------VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
FG+D + F VD + YY+ ++ + V D + T+ L G
Sbjct: 375 FGEDKELLSHPNLNFTSFGGGKDGSVD--TFYYVQIKSVMVDDEVLKIPEETWHLSSEGA 432
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
GG +ID+G T+ YE++ F + ++ + CY S
Sbjct: 433 GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGY---QLVEGLPPLKPCYNV-SGIEKMEL 488
Query: 288 MTFHFDRADFKVE--PTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNT 343
F AD V P YFI+ + C+AI + R+ S++G +QQQ+ +YD+
Sbjct: 489 PDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILYDMKK 548
Query: 344 GTIQFVPENCAN 355
+ + P CA+
Sbjct: 549 SRLGYAPMKCAD 560
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 168/365 (46%), Gaps = 25/365 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP + + DTGS L WTQC PC CF Q P+++P SST+ ++PC
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASP 155
Query: 66 ICRRPP--FR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCV----PGVIFGC 118
+C+ P FR C CV+ YA G +A G ++ +T + GV FGC
Sbjct: 156 LCQALPSAFRACNATGCVYDYRYAVGFTA-GYLAADTLAIGDGDGDGDASSSFAGVAFGC 214
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
S N DG +GI+G S SLL Q+ G FSYCL + + A+ IL FG
Sbjct: 215 STANGG-DMDGA-SGIVGLGRSALSLLSQI---GVGRFSYCL-RSDADAGASPIL-FGAL 267
Query: 179 ANIQRKDMKTIRMFVD------RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
AN+ +++ + + R+ +YY++L I+V + TF G GG ++
Sbjct: 268 ANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIV 327
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
D+G T++ Y ++ + F R+ A D++ C+ + + F F
Sbjct: 328 DSGTTFTYLAEAGYTMLRQAFLSQTAGL-LTRVSGAQFDFDLCFEAGAADTPVPRLVFRF 386
Query: 293 DRADFKVEPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
P YF +EG C+ + + SV+G Q D +YDL+ T F P
Sbjct: 387 AGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATFSFAP 446
Query: 351 ENCAN 355
+CA+
Sbjct: 447 ADCAS 451
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/355 (30%), Positives = 168/355 (47%), Gaps = 23/355 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V V G P++ +++ DTGS + W QC PC +C+ Q+ PIF+P ASSTY + C
Sbjct: 20 YFTRVGV--GNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQS 77
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C +GQC++++NY G+ G +TE+ +F V V GC +DN
Sbjct: 78 QQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGS---VKNVALGCGHDNE 134
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
AG+LG P SL QLK+T+ FSYCLV R+ +S L F ++
Sbjct: 135 GLFV--GAAGLLGLGGGPLSLTNQLKATS---FSYCLV--NRDSAGSSTLDF--NSAQLG 185
Query: 184 KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
D T + +R + YY+ L +SV + TF L +G GG ++D G T +
Sbjct: 186 VDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRL 245
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
Q Y + F + ++ +A ++ CY + +++FHF
Sbjct: 246 QTQAYNPLRDAFVRMTQNL---KLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNL 302
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G +C A + + + S++G QQQ TR +DL + F P C
Sbjct: 303 PAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 158/354 (44%), Gaps = 25/354 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V FGTP +S + L DTGS + W C C C + +APIF+P SS+YK CD
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC-HSTAPIFDPAKSSSYKPFACDSQP 173
Query: 67 CRRPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C+ C N +C ++Y G G ++++ T + +P FGC+
Sbjct: 174 CQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQY----LPNFSFGCA---ESL 226
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQ---GLFSYCLVYAYREMEATSILRFGKDANIQ 182
S D + + L L TA+ G FSYCL + + L GK+A +
Sbjct: 227 SEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGS---LVLGKEAAVS 283
Query: 183 RKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+K + D S + Y+++L+ ISV + RI PGT GG +ID+G T
Sbjct: 284 SSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISV-PGT---NIASGGGTIIDSGTTITH 339
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ Y + F + +S ED + CY S ++T H DR V
Sbjct: 340 LVPSAYTALRDAFRQQLSSL----QPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVL 395
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
P I Q G C+A S +D S++G QQQ+ R V+D+ + F E CA
Sbjct: 396 PKENILITQESGLACLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 158/360 (43%), Gaps = 26/360 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + L DTGS L+WTQC PC CFNQS P ++ + SST+ CD
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 67 CRRPP--FRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C+ P C N C +Y ++ G + ET +F VPGV+FGC +
Sbjct: 151 CKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAG---ASVPGVVFGCGLN 207
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F N GI GF P SL QLK G FS+C +T + D
Sbjct: 208 NTGI-FRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPADLYK 263
Query: 182 QRKDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + +H YYLSL+ I+V R+ FAL+ NGTGG +ID+G
Sbjct: 264 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTIIDSGTAF 322
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY-CYRYDSRFRA--YASMTFHFDRA 295
T + Y +V F H + + ++E C+ +A + HF+ A
Sbjct: 323 TSLPPRVYRLVHDEFAAHV----KLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA 378
Query: 296 DFKVEPTYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ F ++ G C+AI +++G +QQQ+ +YDL + FV C
Sbjct: 379 TMHLPRENYVFEAKDGGNCSICLAI-IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/356 (29%), Positives = 170/356 (47%), Gaps = 20/356 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS LIWTQC PC +C+ Q++P+F+P SSTY+++ C
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145
Query: 67 CRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDN 122
CR + C + I Y + G V+ +T T + V + +I GC ++N
Sbjct: 146 CRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHEN 205
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+FD +GI+G SL+ QL+ + G FSYCLV E TS + FG + +
Sbjct: 206 TG-TFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIVS 264
Query: 183 RKDMKTIRMF-VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + M D +++Y+L+L+ ISV +I F F G G +ID+G T +
Sbjct: 265 GDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFG---TGEGNIVIDSGTTLTLL 321
Query: 242 QRGPY---EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
Y E V+ ++ +R+ + CYR S F+ +T HF D K
Sbjct: 322 PSNFYYELESVVA------STIKAERVQDPDGILSLCYRDSSSFKV-PDITVHFKGGDVK 374
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ F+ +E C A + +++ ++ G Q + YD +GT+ F +C+
Sbjct: 375 LG-NLNTFVAVSEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 31/372 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP K L+ DTGS L W QC+PC+ CF QS P ++P SS+++ I C D
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 254
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C+ PP C EN C + Y G++ +G + ETFT +L K++L V V
Sbjct: 255 CQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENV 314
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG F+ Q++S FSYCLV +S L
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGKGPLSFA--SQMQSLYGQSFSYCLVDRNSNASVSSKLI 372
Query: 175 FGKDANIQRKDMKTIRMF-------VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
FG+D + F VD + YY+ + + V D + T+ L G
Sbjct: 373 FGEDKELLSHPNLNFTSFGGGKDGSVD--TFYYVQINSVMVDDEVLKIPEETWHLSSEGA 430
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
GG +ID+G T+ YE++ F + + + CY S
Sbjct: 431 GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGY---ELVEGLPPLKPCYNV-SGIEKMEL 486
Query: 288 MTFHFDRADFKVE--PTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNT 343
F AD V P YFI + C+AI + R+ S++G +QQQ+ +YD+
Sbjct: 487 PDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNFHILYDMKK 546
Query: 344 GTIQFVPENCAN 355
+ + P CA+
Sbjct: 547 SRLGYAPMKCAD 558
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 163/355 (45%), Gaps = 14/355 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS L WTQC PC +C+ Q P+F+P SSTY+ C
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 67 CR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C + + +C R +YA G+ G +++ET T K V PG FGC + +
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
FD + +GI+G SL+ QLKST GLFSYCL+ + +S + FG +
Sbjct: 212 GGI-FDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVS 270
Query: 183 RKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + V +S + YYL+L+ ISV R+ + G G ++D+G TF
Sbjct: 271 GYGTVSTPL-VQKSPDTFYYLTLEGISVGKKRLPYK-GYSKKTEVEEGNIIVDSGTTYTF 328
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ P E + S +R+ + + + CY + A +T HF A+ +++
Sbjct: 329 L---PQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEINA-PIITAHFKDANVELQ 384
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
P + Q E C ++ + V+G Q + +DL + F +C
Sbjct: 385 PLNTFMRMQ-EDLVCFTVAPTSDIGVLGNLAQVNFLVGFDLRKKRVSFKAADCTQ 438
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/355 (30%), Positives = 168/355 (47%), Gaps = 23/355 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V V G P++ +++ DTGS + W QC PC +C+ Q+ PIF+P ASSTY + C
Sbjct: 161 YFTRVGV--GNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQS 218
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C +GQC++++NY G+ G +TE+ +F V V GC +DN
Sbjct: 219 QQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGS---VKNVALGCGHDNE 275
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
AG+LG P SL QLK+T+ FSYCLV R+ +S L F ++
Sbjct: 276 GLFV--GAAGLLGLGGGPLSLTNQLKATS---FSYCLV--NRDSAGSSTLDF--NSAQLG 326
Query: 184 KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
D T + +R + YY+ L +SV + TF L +G GG ++D G T +
Sbjct: 327 VDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRL 386
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
Q Y + F + ++ +A ++ CY + +++FHF
Sbjct: 387 QTQAYNPLRDAFVRMTQNL---KLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNL 443
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G +C A + + + S++G QQQ TR +DL + F P C
Sbjct: 444 PAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 163/354 (46%), Gaps = 22/354 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V V G P+KS +++ DTGS + W QC PC +C+ QS PIF P ASS+Y + CD
Sbjct: 159 YFTRVGV--GNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDS 216
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C NGQC +++NY G+ G TET +F V + GC +DN
Sbjct: 217 QQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGT---VNSIALGCGHDNE 273
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
AG+LG P SL QLK+T+ FSYCLV R+ A+S L F A +
Sbjct: 274 GLFV--GAAGLLGLGGGPLSLTSQLKATS---FSYCLV--NRDSAASSTLDF-NSAPVGD 325
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + + YY+ L +SV + F L +G GG ++D G T +Q
Sbjct: 326 SVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQS 385
Query: 244 GPYEVVMRHFDEHFTSFGRQ-RMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEP 301
Y + F S R R + ++ CY + +++FHFD P
Sbjct: 386 EAYN----SLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLP 441
Query: 302 TYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y I + G +C A + + + S++G QQQ TR +DL + F C
Sbjct: 442 AANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 168/370 (45%), Gaps = 31/370 (8%)
Query: 7 YTVDVLFGTPSKSEF-LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP L DTGS L+WTQC PC CF+Q P+F+P+ SST++ + C D
Sbjct: 87 YLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDP 146
Query: 66 ICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKN----KLVCVPGVI 115
ICR + +C + +Y + +G + +TFTF N V V G+
Sbjct: 147 ICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLA 206
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILR 174
FGC + N F N +GI GF P SL QL+ G FSYCL + E TS +
Sbjct: 207 FGCGDYNTGV-FASNESGIAGFGRGPLSLPSQLRV---GRFSYCLTSHDETESNKTSAVF 262
Query: 175 FGKDANIQRK----DMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
G N R ++ + S + YYLSL+ I+V R+ FAL+++G+G
Sbjct: 263 LGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSG 322
Query: 229 GCMIDTG-AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE-YCYRYDSRFR--A 284
G +ID+G + TF P V + +E R N SE C++ +
Sbjct: 323 GTVIDSGTGVTTF----PAAVFEQLKNEFVAQLPLPRYDNTSEVGNLLCFQRPKGGKQVP 378
Query: 285 YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSV-VGAWQQQDTRFVYDLNT 343
+ FH AD + + G C+ I+ ++ + V +G +QQQ+ VYD+
Sbjct: 379 VPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVEN 438
Query: 344 GTIQFVPENC 353
+ F C
Sbjct: 439 SKLLFASAQC 448
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 171/377 (45%), Gaps = 41/377 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQ--SAPIFNPNASSTYKRIPCDD 64
Y +++ GTP ++ DTGS LIW QC PC CF + AP+ P SST+ R+PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 65 LICR------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C+ RP C + Y G +A G ++TET T P V FGC
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATETLTVGDGT----FPKVAFGC 205
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
S +N D N +GI+G P SL+ QL A G FSYCL + A+ IL FG
Sbjct: 206 STEN---GVD-NSSGIVGLGRGPLSLVSQL---AVGRFSYCLRSDMADGGASPIL-FGSL 257
Query: 179 ANI-QRKDMKTIRM----FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT-GGCMI 232
A + +R +++ + ++ RS+HYY++L I+V + TF + G GG ++
Sbjct: 258 AKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIV 317
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQR-MHNASEDWEYCYRYDSRFRAYA----S 287
D+G T++ + Y +V + F + + A D + CY+ + A
Sbjct: 318 DSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPR 377
Query: 288 MTFHFDRADFKVEPTYMYFI---FQNEGYFCVA----ISFSD--RNSVVGAWQQQDTRFV 338
+ F P YF ++G VA + +D S++G Q D +
Sbjct: 378 LALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLL 437
Query: 339 YDLNTGTIQFVPENCAN 355
YD++ G F P +CA
Sbjct: 438 YDIDGGMFSFAPADCAK 454
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 102/348 (29%), Positives = 154/348 (44%), Gaps = 12/348 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +D+ +G P + + DTGS L W QCLPC +C+ + F+P+ S++YK + C
Sbjct: 90 YLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLGCGSNF 149
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFS 126
C+ PF+ C + Y G+S SG +ST+ T +P V FGC N N +
Sbjct: 150 CQDLPFQSCAASCQYDYMYGDGSSTSGALSTDDVTIGTGK----IPNVAFGCGNSNLG-T 204
Query: 127 FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDM 186
F G++G P SL+ QL TA FSYCLV TS L G
Sbjct: 205 FA-GAGGLVGLGKGPLSLVSQLGGTATKKFSYCLV--PLGSTKTSPLYIGDSTLAGGVAY 261
Query: 187 KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
+ + + YY LQ ISV + + TF + G GG ++D+G T++ +
Sbjct: 262 TPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAF 321
Query: 247 EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFKVEPTYMY 305
++ + + EYC+ Y ++ FHF+ AD + P +
Sbjct: 322 NPMVAALK---AALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFNGADVALAPDNTF 378
Query: 306 FIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
EG C+A++ S S+ G QQ + V+DL I F NC
Sbjct: 379 IALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 153/351 (43%), Gaps = 19/351 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V FGTP +S + L DTGS + W C C C + +APIF+P SS+YK CD
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC-HSTAPIFDPAKSSSYKPFACDSQP 173
Query: 67 CRRPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C+ C N +C + Y G G ++++ T + +P FGC+ +
Sbjct: 174 CQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQY----LPNFSFGCAESLSED 229
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
++ LG G FSYCL + + L GK+A +
Sbjct: 230 TYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGS---LVLGKEAAVSSSS 286
Query: 186 MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+K + D S + Y+++L+ ISV + RI A GG +ID+G T++
Sbjct: 287 LKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIA----SGGGTIIDSGTTITYLVP 342
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTY 303
Y+ + F + +S ED + CY S ++T H DR V P
Sbjct: 343 SAYKDLRDAFRQQLSSL----QPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKE 398
Query: 304 MYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
I Q G C+A S +D S++G QQQ+ R V+D+ + F E CA
Sbjct: 399 NILITQESGLSCLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 111/354 (31%), Positives = 157/354 (44%), Gaps = 37/354 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS LIWTQC PC CF+Q+ P F+P+ SST CD +
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 148
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFS 126
C+ G + L ++ FTF VPGV FGC N
Sbjct: 149 CQ-------------------GLPVASLPRSDKFTF--VGAGASVPGVAFGCGLFNNGV- 186
Query: 127 FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD--ANIQRK 184
F N GI GF P SL QLK G FS+C + +T +L D +N Q
Sbjct: 187 FKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADLFSNGQGA 243
Query: 185 DMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
++T + + + + YYLSL+ I+V R+ FAL +NGTGG +ID+G T +
Sbjct: 244 -VQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDSGTAMTSLP 301
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRADFKVEP 301
Y +V F + + D +C R + Y + HF+ A +
Sbjct: 302 TRVYRLVRDAFAAQVK---LPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPR 358
Query: 302 TYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
F ++ G C+AI + +G +QQQ+ +YDL + FVP C
Sbjct: 359 ENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 168/369 (45%), Gaps = 24/369 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DVL G+P K L+ DTGS L W QCLPC +CF Q+ ++P AS++YK I C+D
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQR 229
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C PP C +N C + Y ++ +G + ETFT +L ++L V +
Sbjct: 230 CNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENM 289
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG FS QL+S FSYCLV + +S L
Sbjct: 290 MFGCGHWNRGLFHGAAGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSDTNVSSKLI 347
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D ++ FV + YY+ ++ I VA + T+ + +G GG
Sbjct: 348 FGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGG 407
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASM 288
+ID+G ++ YE + E + G+ ++ + C+ +
Sbjct: 408 TIIDSGTTLSYFAEPAYEFIKNKIAEK--AKGKYPVYRDFPILDPCFNVSGIHNVQLPEL 465
Query: 289 TFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTI 346
F PT FI+ NE C+A+ + ++ S++G +QQQ+ +YD +
Sbjct: 466 GIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRL 525
Query: 347 QFVPENCAN 355
+ P CA+
Sbjct: 526 GYAPTKCAD 534
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 152/358 (42%), Gaps = 47/358 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G+P ++L+ D+GS +IW QC PC C+ Q+ P+F+P ASS++ + C I
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 67 CRR-----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
CR + G+C + + Y G+ G ++ ET T V GV GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F G AG+LG SL+GQL A G+FSYCL A R L
Sbjct: 246 NSGL-FVG-AAGLLGLGWGAMSLVGQLGGAAGGVFSYCL--ASRGAGGAGSL-------- 293
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
SS YY+ L I V R+ F L +G GG ++DTG T +
Sbjct: 294 -------------ASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRL 340
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS-----MTFHFDRAD 296
R Y + FD + R A + CY YAS ++F+FD+
Sbjct: 341 PREAYAALRGAFDGAMGALPRS---PAVSLLDTCY----DLSGYASVRVPTVSFYFDQGA 393
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + FC+A + S S++G QQ+ + D G + F P C
Sbjct: 394 VLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 160/359 (44%), Gaps = 22/359 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD--- 63
+ V + GTP + ++ DTGS L W Q PC CF Q+ PIF+P+ SSTY +I C
Sbjct: 25 FLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSA 84
Query: 64 --DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
DL+ + N C++ Y G+ G S ET T V FG S
Sbjct: 85 CADLLGTQTCSAAAN--CIYAYGYGDGSVTRGYFSKETIT----ATDTAGEEVKFGASVY 138
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N D GILG P S+ QL S FSYCLV TS + FG DA +
Sbjct: 139 NTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFG-DAAV 197
Query: 182 QRKDMK--TIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+++ I D ++YY+++Q ISV + + + G+GG +ID+G T
Sbjct: 198 PSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTIT 257
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRAYASMTFHFDRADFK 298
++Q + V +TS R ++ + C+ + + +MT H D +
Sbjct: 258 YLQ----QEVFNALVAAYTSQVRYPTTTSATGLDLCFNTRGTGSPVFPAMTIHLDGVHLE 313
Query: 299 VEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+ PT FI C+A + ++ G QQQ+ VYDL+ I F P +CA+
Sbjct: 314 L-PTANTFISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCAS 371
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 163/354 (46%), Gaps = 27/354 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + V FGTP K++ ++FDTGS + W QC PC V+C+ Q P+F+P SSTY+ I C
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSA 75
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C R C CV+ + Y G+S G ++TETFT N IFGC +N+
Sbjct: 76 ACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVF---NNFIFGCGQNNQG 132
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F G AG++G SP+SL QL ++ +FSYCL AT L G N R
Sbjct: 133 L-FTG-AAGLIGLGRSPYSLNSQLATSLGNIFSYCLP---STSSATGYLNIG---NPLRT 184
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T + R+ + Y++ L ISV R+ + F + G +ID+G + T +
Sbjct: 185 PGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGTVITRLPP 239
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD-SRFRAYASMTFHFDRADFKVEPT 302
Y + F T + R A+ + CY + + + ++ H+ D +
Sbjct: 240 TAYGALRTAFRAAMTQYTRAA---AASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGA 296
Query: 303 YMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++++ + C+A S S + ++G QQ+ YD I F C
Sbjct: 297 GVFYVISSS-QVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 169/358 (47%), Gaps = 34/358 (9%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N Y + + GTP + DTGS + WTQCLPCV+C+ Q+APIF+P+ SST+K
Sbjct: 62 NSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEK--- 118
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDN 122
RC+ C + ++Y G ++TET T H + +P I GC ++N
Sbjct: 119 ---------RCDGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNN 169
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + +G++G + P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 170 S--WFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCF-----SGQGTSKINFGANAIVA 222
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTF-ALRRNGTGGCMIDTGAIAT 239
+ + MF+ + YYL+L +SV + RI TF AL G +ID+G T
Sbjct: 223 GDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALE----GNIVIDSGTTLT 278
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRA-DFK 298
+ Y ++R EH + R + + + CY D+ + +T HF D
Sbjct: 279 YFPVS-YCNLVRQAVEHVVT--AVRAADPTGNDMLCYNSDT-IDIFPVITMHFSGGVDLV 334
Query: 299 VEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ MY N G FC+AI + + ++ G Q + YD ++ + F P NC+
Sbjct: 335 LDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 170/354 (48%), Gaps = 14/354 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + G+P + L DTGS L+W QC PC C+ Q +P+F P S TY IPC+
Sbjct: 82 YLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESEQ 141
Query: 67 CRRPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNRD 124
C + C C + +YA + G+++ E TF + V V +IFGC + N
Sbjct: 142 CSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGCGHSNSG 201
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKST-AQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+F+ N GI+G P SL+ Q+ + FS CLV + + + + FG+++++
Sbjct: 202 -TFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGEESDVSG 260
Query: 184 KDMKTIRMFVDRSSHYYL-SLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + T + + YL +L+ ISV D + F + G MID+G AT+I
Sbjct: 261 EGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFN----SSETLSKGNIMIDSGTPATYIP 316
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
+ YE ++ + + + + CYR ++ +T HF+ AD ++ P
Sbjct: 317 QEFYERLVEELKVQSSLLPIE--DDPDLGTQLCYRSETNLEG-PILTAHFEGADVQLLPI 373
Query: 303 YMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
FI +G FC A++ S D + + G + Q + +DL+ TI F P +C N
Sbjct: 374 -QTFIPPKDGVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCTN 426
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 162/358 (45%), Gaps = 27/358 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V V G+P++ +++ DTGS + W QC PC +C+ QS P+F+P+ S++Y + CD
Sbjct: 162 EYFSRVGV--GSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACD 219
Query: 64 DLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+ C R G C++ + Y G+ G +TET T V V GC +
Sbjct: 220 NPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGCGH 276
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG--KD 178
DN AG+L P S Q+ +T FSYCLV R+ ++S L+FG D
Sbjct: 277 DNEGLFV--GAAGLLALGGGPLSFPSQISATT---FSYCLV--DRDSPSSSTLQFGDAAD 329
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
A + +++ R S+ YY+ L ISV + P FA+ G GG ++D+G
Sbjct: 330 AEVTAPLIRSPRT----STFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTAV 385
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADF 297
T +Q Y + F S R + ++ CY R +++ F
Sbjct: 386 TRLQSSAYAALRDAFVRGTQSLPRT---SGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGE 442
Query: 298 KVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G +C+A + ++ S++G QQQ TR +D T+ F C
Sbjct: 443 LRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500
>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 416
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 113/365 (30%), Positives = 163/365 (44%), Gaps = 28/365 (7%)
Query: 5 YFYTVDVLFGTPS--KSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+ Y V V GT K + L DT + + W C PC Q+ +F+P AS T+ +
Sbjct: 65 FVYGVFVSIGTGQGFKLQVLGLDTSTSMSWVMCEPCQPSLPQAGHLFSPAASPTFHGVHS 124
Query: 63 DDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-------KLVCVPGVI 115
+D +C P +R C R +A SG +S +TF HL+N + VPG++
Sbjct: 125 NDPVCTAP-YRPTANGCSFRFPFA-----SGYLSRDTF--HLRNGGLSGGAPIESVPGIM 176
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC++ F DG + G+L S SLL QL + A G FSYCL + LR
Sbjct: 177 FGCAHSVAGFHNDGTLGGVLSLSHLRLSLLTQLSARAGGRFSYCLPKP-TQGNPHGFLRL 235
Query: 176 GKDA--NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G D + M + + + YYLSL I++A+ R+ P FA G GGC I+
Sbjct: 236 GADVLPPLPHSHMTALTVRSGSAPDYYLSLVGITLAEKRLRIDPRVFA---AGRGGCSIN 292
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTF 290
A T I Y VV R + G R+ + Y S SM F
Sbjct: 293 PAATITAIMEPAYLVVERALVAYMKELGSDRVKKGPPGGGALFFDRMYKSVQARLPSMAF 352
Query: 291 HF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
HF D A+ P ++ + +F + + R +V+GA QQ +TRF +D+ G + F
Sbjct: 353 HFKDGAELWFTPEQLFEVHGMVAWFMM-VGKGYRRTVIGAPQQVNTRFTFDVAAGRLSFA 411
Query: 350 PENCA 354
E C
Sbjct: 412 SELCG 416
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 108/366 (29%), Positives = 159/366 (43%), Gaps = 49/366 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + ++ DTGS LIWTQC PC CF Q AP F P +SST+ ++PC
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 67 CRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ P C CV+ Y G +A G ++TET LK P V FGCS +N
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTA-GYLATET----LKVGDASFPSVAFGCSTENG 200
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LGQL G FSYCL S + FG AN+
Sbjct: 201 ---------------------LGQLD-LGVGRFSYCLRSG--SAAGASPILFGSLANLTD 236
Query: 184 KDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGT-GGCMIDTGAIAT 239
++++ + + H YY++L I+V + + TF +NG GG ++D+G T
Sbjct: 237 GNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLT 296
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMTFHFDRAD 296
++ + YE+V + F + N + + C++ A S+ FD
Sbjct: 297 YLAKDGYEMVKQAF---LSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGA 353
Query: 297 FKVEPTYMYFI-FQNEGYFCVAISF------SDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
PTY + ++G VA SV+G Q D +YDL+ G F
Sbjct: 354 EYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFA 413
Query: 350 PENCAN 355
P +CA
Sbjct: 414 PADCAK 419
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 161/357 (45%), Gaps = 22/357 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V + G+P++ +++ DTGS + W QC PC +C+ QS P+F+P+ S++Y + CD
Sbjct: 165 EYFSRVGI--GSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCD 222
Query: 64 DLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
CR R G C++ + Y G+ G +TET T V V GC +
Sbjct: 223 SQRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTP---VGNVAIGCGH 279
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN AG+L P S Q+ ++ FSYCLV R+ A S L+FG A
Sbjct: 280 DNEGLFV--GAAGLLALGGGPLSFPSQISAST---FSYCLV--DRDSPAASTLQFGDGAA 332
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIAT 239
+ S+ YY++L ISV + FA+ +G+GG ++D+G T
Sbjct: 333 EAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVT 392
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFK 298
+Q Y + F + S R + ++ CY R +++ F+
Sbjct: 393 RLQSAAYAALRDAFVQGAPSLPRT---SGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGAL 449
Query: 299 VEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G +C+A + ++ S++G QQQ TR +D G + F P C
Sbjct: 450 RLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 163/361 (45%), Gaps = 30/361 (8%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V + G+P++ +++ DTGS + W QC PC +C+ QS P+F+P+ S++Y + CD
Sbjct: 168 EYFSRVGI--GSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCD 225
Query: 64 DLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
CR R G C++ + Y G+ G +TET T V V GC +
Sbjct: 226 SPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTP---VTNVAIGCGH 282
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN AG+L P S Q+ ++ FSYCLV R+ A S L+FG D
Sbjct: 283 DNEGLFV--GAAGLLALGGGPLSFPSQISAST---FSYCLV--DRDSPAASTLQFGADG- 334
Query: 181 IQRKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTG 235
+ T+ + RS YY++L ISV + FA+ +G+GG ++D+G
Sbjct: 335 ---AEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSG 391
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDR 294
T +Q Y + F S R + ++ CY R +++ F+
Sbjct: 392 TAVTRLQSSAYAALRDAFVRGTPSLPRT---SGVSLFDTCYDLSDRTSVEVPAVSLRFEG 448
Query: 295 ADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
P Y I G +C+A + ++ S++G QQQ TR +D G + F P
Sbjct: 449 GGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNK 508
Query: 353 C 353
C
Sbjct: 509 C 509
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 163/356 (45%), Gaps = 18/356 (5%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V GTP++ +++ DTGS ++W QC PC C++QS PIF+P S TY IPC
Sbjct: 142 YFTRLGV--GTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 65 LICRRPPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
CRR N + C+++++Y G+ G STET TF +N+ V GV GC +D
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-RNR---VKGVALGCGHD 255
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N LG F GQ FSYCLV + +S++ FG A
Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFP--GQTGHRFNQKFSYCLVDRSASSKPSSVV-FGNAAVS 312
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + YY+ L ISV R+ G F L + G GG +ID+G T
Sbjct: 313 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 372
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKV 299
+ R Y + F + +R N S ++ C+ + ++ HF RAD +
Sbjct: 373 LIRPAYIAMRDAFRVGAKTL--KRAPNFSL-FDTCFDLSNMNEVKVPTVVLHFRRADVSL 429
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
T G FC A + + S++G QQQ R VYDL + + F P CA
Sbjct: 430 PATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 165/359 (45%), Gaps = 30/359 (8%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V GTP+K +L+ DTGS + W QC PC +C+ QS P+FNP +SSTYK + C
Sbjct: 162 YFSRIGV--GTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 65 LICR-RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C + +C+++++Y G+ G ++T+T TF K + V GC +DN
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK---INNVALGCGHDNE 276
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF------GK 177
LG V S+ Q+K+T+ FSYCLV R+ +S L F G
Sbjct: 277 GLFTGAAGLLGLGGGV--LSITNQMKATS---FSYCLV--DRDSGKSSSLDFNSVQLGGG 329
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
DA K I F YY+ L SV ++ F + +G+GG ++D G
Sbjct: 330 DATAPLLRNKKIDTF------YYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTA 383
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRAD 296
T +Q Y + F + + ++ ++ ++ CY + S ++ FHF
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTVNL--KKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGK 441
Query: 297 FKVEPTYMYFI-FQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC A + S S++G QQQ TR YDL+ I C
Sbjct: 442 SLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 165/359 (45%), Gaps = 30/359 (8%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V GTP+K +L+ DTGS + W QC PC +C+ QS P+FNP +SSTYK + C
Sbjct: 162 YFSRIGV--GTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 65 LICR-RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C + +C+++++Y G+ G ++T+T TF K + V GC +DN
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK---INNVALGCGHDNE 276
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF------GK 177
LG V S+ Q+K+T+ FSYCLV R+ +S L F G
Sbjct: 277 GLFTGAAGLLGLGGGV--LSITNQMKATS---FSYCLV--DRDSGKSSSLDFNSVQLGGG 329
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
DA K I F YY+ L SV ++ F + +G+GG ++D G
Sbjct: 330 DATAPLLRNKKIDTF------YYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTA 383
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRAD 296
T +Q Y + F + + ++ ++ ++ CY + S ++ FHF
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTVNL--KKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGK 441
Query: 297 FKVEPTYMYFI-FQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC A + S S++G QQQ TR YDL+ I C
Sbjct: 442 SLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 169/372 (45%), Gaps = 31/372 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD-- 64
Y +D+ GTP K +L+ DTGS L W QC PC +CF Q+ P +NPN SS+Y+ I C D
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPR 229
Query: 65 --LICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
L+ P + EN C + +YA G++ +G + ETFT +L K K V V
Sbjct: 230 CQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDV 289
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + N+ F LG F QL+S FSYCL + +S L
Sbjct: 290 MFGCGHWNKGFFHGAGGLLGLGRGPLSFP--SQLQSIYGHSFSYCLTDLFSNTSVSSKLI 347
Query: 175 FGKDAN-IQRKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D + ++ ++ + YYL ++ I V + T+ G GG
Sbjct: 348 FGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGG 407
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW--EYCYRYDSRFRA-YA 286
+ID+G+ TF Y+V+ F++ + A++D+ CY +
Sbjct: 408 TIIDSGSTLTFFPDSAYDVIKEAFEKKI-----KLQQIAADDFIMSPCYNVSGAMQVELP 462
Query: 287 SMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLN 342
HF D A + ++ ++ + C+AI + +S ++G QQ+ +YD+
Sbjct: 463 DYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYDVK 522
Query: 343 TGTIQFVPENCA 354
+ + P CA
Sbjct: 523 RSRLGYSPRRCA 534
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 162/358 (45%), Gaps = 27/358 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V V G+P++ +++ DTGS + W QC PC +C+ QS P+F+P+ S++Y + CD
Sbjct: 166 EYFSRVGV--GSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACD 223
Query: 64 DLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+ C R G C++ + Y G+ G +TET T V V GC +
Sbjct: 224 NPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGCGH 280
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG--KD 178
DN AG+L P S Q+ +T FSYCLV R+ ++S L+FG D
Sbjct: 281 DNEGLFV--GAAGLLALGGGPLSFPSQISATT---FSYCLV--DRDSPSSSTLQFGDAAD 333
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
A + +++ R S+ YY+ L +SV + P FA+ G GG ++D+G
Sbjct: 334 AEVTAPLIRSPRT----STFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAV 389
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADF 297
T +Q Y + F S R + ++ CY R +++ F
Sbjct: 390 TRLQSSAYAALRDAFVRGTQSLPRT---SGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGE 446
Query: 298 KVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G +C+A + ++ S++G QQQ TR +D T+ F C
Sbjct: 447 LRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 161/357 (45%), Gaps = 32/357 (8%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP DTGS +IWTQC+PC NC++Q APIF+P+ SST++
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQ----- 474
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRD 124
RC C + I YA + G+++TET T + + GC DN +
Sbjct: 475 -------RCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTN 527
Query: 125 FSFDG---NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ G + +GI+G ++ P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 528 LQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF-----SGQGTSKINFGTNAIV 582
Query: 182 QRKDMKTIRMFVDRSSH-YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
MF+ + + YYL+L +SV D+ I F G ID+G T+
Sbjct: 583 AGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAED---GNIFIDSGTTLTY 639
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RADFKV 299
+V ++ T+ ++ + D CY Y + +T HF AD +
Sbjct: 640 FPMSYCNLVREAVEQVVTAV---KVPDMGSDNLLCY-YSDTIDIFPVITMHFSGGADLVL 695
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ MY G FC+AI +D + +V G Q + YD ++ I F P NC+
Sbjct: 696 DKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCS 752
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 158/350 (45%), Gaps = 32/350 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP DTGS LIWTQC+PC +C++Q PIF+P+ SST+
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQ----- 135
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRD 124
RC C + I Y + G+++TET T H + + GC N D
Sbjct: 136 -------RCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTD 188
Query: 125 F---SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
F + +GI+G ++ P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 189 LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCF-----SGQGTSKINFGTNAIV 243
Query: 182 QRKDMKTIRMFVDRSSH-YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
MF+ + + YYL+L +SV D+RI F G +ID+G+ T+
Sbjct: 244 AGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAED---GNIVIDSGSTVTY 300
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RADFKV 299
+V + ++ T+ R+ + S + CY + + +T HF AD +
Sbjct: 301 FPVSYCNLVRKAVEQVVTAV---RVPDPSGNDMLCY-FSETIDIFPVITMHFSGGADLVL 356
Query: 300 EPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
+ MY + G FC+AI + + ++ G Q + YD ++ +Q
Sbjct: 357 DKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLLQ 406
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 149/351 (42%), Gaps = 23/351 (6%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PP 71
GTPS +FDTGS L W QC PC C+ Q AP+F+P SSTY +PC+ C P
Sbjct: 94 LGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQPCTLFPQ 153
Query: 72 FRCENG---QCVHRINYAGGASASGLVSTETFTFH---LKNKLVCVPGVIFGCS-NDNRD 124
+ E G QC++ Y + G + +T +F + P +FGC+ N
Sbjct: 154 NQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCAFYSNFT 213
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F G +G P SL QL FSYC+V +T L+FG A
Sbjct: 214 FKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMV--PFSSTSTGKLKFGSMAPTNEV 271
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ S+Y L+L+ I+V ++ L G +ID+ I T +++G
Sbjct: 272 VSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNIIIDSVPILTHLEQG 323
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYM 304
Y + E + + +A +EYC R + + FHF AD + P M
Sbjct: 324 IYTDFISSVKE---AINVEVAEDAPTPFEYCVRNPTNLN-FPEFVFHFTGADVVLGPKNM 379
Query: 305 YFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+ N C+ + S S+ G W Q + + YDL + F P NC+
Sbjct: 380 FIALDNN-LVCMTVVPSKGISIFGNWAQVNFQVEYDLGEKKVSFAPTNCST 429
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 111/377 (29%), Positives = 166/377 (44%), Gaps = 41/377 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQ--SAPIFNPNASSTYKRIPCDD 64
Y +++ GTP ++ DTGS LIW QC PC CF + AP+ P SST+ R+PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 65 LICR------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C+ RP C + Y G +A G ++TET T P V FGC
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATETLTVGDGT----FPKVAFGC 205
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
S +N D N +GI+G P SL+ QL A G FSYCL + A+ IL FG
Sbjct: 206 STEN---GVD-NSSGIVGLGRGPLSLVSQL---AVGRFSYCLRSDMADGGASPIL-FGSL 257
Query: 179 ANIQRKDMKTI-----RMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT-GGCMI 232
A + + ++ RS+HYY++L I+V + TF + G GG ++
Sbjct: 258 AKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIV 317
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQR-MHNASEDWEYCYRYDSRFRAYA----S 287
D+G T++ + Y +V + F + + A D + CY+ + A
Sbjct: 318 DSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPR 377
Query: 288 MTFHFDRADFKVEPTYMYFI---FQNEGYFCVA----ISFSD--RNSVVGAWQQQDTRFV 338
+ F P YF ++G VA + +D S++G Q D +
Sbjct: 378 LALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLL 437
Query: 339 YDLNTGTIQFVPENCAN 355
YD++ G F P +CA
Sbjct: 438 YDIDGGMFSFAPADCAK 454
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 171/368 (46%), Gaps = 36/368 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS L+WTQC PC +CF+Q P+ +P ASSTY +PC
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143
Query: 67 CRRPPFRC-------ENGQCVHRINYAGGASASGLVSTETFTF---HLKNKLVCVPGVIF 116
CR PF + C++ +Y + G ++T+ FTF + + + F
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTF 203
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC + N+ F N GI GF +SL QL T+ FSYC + E +++ + G
Sbjct: 204 GCGHLNKGV-FQSNETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMF-ESKSSLVTLGG 258
Query: 177 KDANI----QRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
A + +++T + + S S Y+LSL+ ISV R+ F
Sbjct: 259 SPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR-------ST 311
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR--AYASM 288
+ID+GA T + YE V F + ++ D + + +R A S+
Sbjct: 312 IIDSGASITTLPEEVYEAVKAEFAAQV-GLPPSGVEGSALDLCFALPVTALWRRPAVPSL 370
Query: 289 TFHFDRADFKVEPTYMYFIFQNEG--YFCVAI-SFSDRNSVVGAWQQQDTRFVYDLNTGT 345
T H + AD+++ + ++F++ G C+ + + +V+G +QQQ+T VYDL
Sbjct: 371 TLHLEGADWELPRS--NYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDR 428
Query: 346 IQFVPENC 353
+ F P C
Sbjct: 429 LSFAPARC 436
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 170/358 (47%), Gaps = 20/358 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP ++ DTGS L W QCLPC C+ Q +P+F+P+ SS+Y+ + C
Sbjct: 94 YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRF 153
Query: 67 CR-----RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSN 120
C + C + +Y + +G ++TE FT ++ V + ++FGC
Sbjct: 154 CNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGT 213
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N +FD +GI+G SL+ QL S +G FSYCLV + TS ++FG D+
Sbjct: 214 GNGG-TFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSV 272
Query: 181 IQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGT---GGCMIDTGA 236
I + + + + ++YY++L+ ISV + R+ + G NG G +ID+G
Sbjct: 273 ISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLL----NGNVEKGNVIIDSGT 328
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
TF+ + + R +E + +R+ + + C+R + HF+ AD
Sbjct: 329 TLTFLDSEFFTELERVLEE---TVKAERVSDPRGLFSVCFRSAGDID-LPVIAVHFNDAD 384
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
K++P F+ +E C + S++ + G Q D YDL T+ F P +C
Sbjct: 385 VKLQPLNT-FVKADEDLLCFTMISSNQIGIFGNLAQMDFLVGYDLEKRTVSFKPTDCT 441
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 161/355 (45%), Gaps = 18/355 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + + DT S +IW QC C C+N ++P+F+P+ S TYK +PC
Sbjct: 88 YLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTT 147
Query: 67 CRRPP-FRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSND 121
C+ C + + C H +NY G+ + G + ET T N V P + GC
Sbjct: 148 CKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIR- 206
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N + SFD GI+G P SL+ QL S+ FSYCL + +S L+FG A +
Sbjct: 207 NTNVSFDS--IGIVGLGGGPVSLVPQLSSSISKKFSYCLAPI---SDRSSKLKFGDAAMV 261
Query: 182 QRKDMKTIRM-FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ R+ F D YYL+L+ SV ++RI F + + R +G G +ID+G T
Sbjct: 262 SGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFR--SSSSRSSGKGNIIIDSGTTFTV 319
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ P +V + +R + + + CY+ +T HF AD K+
Sbjct: 320 L---PDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDVPVITAHFSGADVKLN 376
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
FI + C+A S ++ G QQ+ YDL + F P +C
Sbjct: 377 -ALNTFIVASHRVVCLAFLSSQSGAIFGNLAQQNFLVGYDLQRKIVSFKPTDCTK 430
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 162/375 (43%), Gaps = 30/375 (8%)
Query: 4 NYFYTVDVLFGTP--SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
Y Y V V GT + FL+ DT S L W +C C+ Q +P+F+P+ SS+Y+ +
Sbjct: 71 EYTYGVAVTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLH 130
Query: 62 CDDLICRRP-PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CR P P +C + A G V T+T L N + + V FGC+
Sbjct: 131 PTSPLCRAPNPVLPAGDKCSFHLP----GEAHGYVGTDTII--LGNPTLPIHSVAFGCAQ 184
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD-- 178
F G AG LG P SL+ Q+K FSYCL+ +RFG D
Sbjct: 185 STEGFDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIP 244
Query: 179 -----ANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGC 230
+ + K + T S YY+ L IS+ I G F R +G+GGC
Sbjct: 245 DPTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGC 304
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR-YDSRFRAYASMT 289
+D G T + Y VV +G +R+ + ++ C+R + + +T
Sbjct: 305 FVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRD--PNFSLCFREHPGIWSHIPKLT 362
Query: 290 FHFDR------ADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDL 341
F+ A ++ ++ N+ C + + R S VVGA QQ DTRF++DL
Sbjct: 363 LDFEGPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDL 422
Query: 342 NTGTIQFVPENCAND 356
+ TI F E+C D
Sbjct: 423 HANTITFHRESCEAD 437
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 170/381 (44%), Gaps = 48/381 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP K L+ DTGS L W QC+PC +CF Q+ P ++P SS+++ I C D
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPR 149
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C PP C EN C + Y ++ +G +TETFT +L K++ V V
Sbjct: 150 CHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENV 209
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR + LG FS QL+S FSYCLV + +S L
Sbjct: 210 MFGCGHWNRGLFHGASGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSDTNVSSKLI 267
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D ++ V + YY+ ++ I V + T+ + +G GG
Sbjct: 268 FGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVGG 327
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW---EYCYRYDSRFRAYA 286
++D+G ++ Y+++ F + + + +D+ + CY
Sbjct: 328 TIVDSGTTLSYFTEPAYQIIKDAF------VKKVKGYPIVQDFPILDPCYN--------V 373
Query: 287 SMTFHFDRADFKVE---------PTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQD 334
S D DF + P YFI E C+AI + R+ S++G +QQQ+
Sbjct: 374 SGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQN 433
Query: 335 TRFVYDLNTGTIQFVPENCAN 355
+YD + + P NCA+
Sbjct: 434 FHVLYDTKKSRLGYAPMNCAD 454
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 172/367 (46%), Gaps = 33/367 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+T+ V GTP + L+ DTGS LIWTQC ++ P+++P SS++ PCD +
Sbjct: 89 HTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRL 148
Query: 67 CRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C F C +C++ NY G A+ G +++ETFTF +++ V V + FGC
Sbjct: 149 CETGSFNTKNCSRNKCIYTYNY-GSATTKGELASETFTFG-EHRRVSV-SLDFGCGKLTS 205
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
S G +GILG S SL+ QL+ FSYCL + + TS + FG A++ +
Sbjct: 206 G-SLPG-ASGILGISPDRLSLVSQLQIPR---FSYCLT-PFLDRNTTSHIFFGAMADLSK 259
Query: 184 ----KDMKTIRMFVDRSS---HYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
++T + + +YY+ L ISV R+ +FA+ R+G+GG +D+G
Sbjct: 260 YRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGD 319
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE---DWEYCYRYDSRFRA-------YA 286
+ VVM E + + NA++ ++E C++
Sbjct: 320 TTGMLP----SVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVP 375
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
+ +HFD + Y + + G C+ IS R +++G +QQQ+ ++D+
Sbjct: 376 PLVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGAIIGNYQQQNMHVLFDVENHEF 435
Query: 347 QFVPENC 353
F P C
Sbjct: 436 SFAPTQC 442
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 163/355 (45%), Gaps = 32/355 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP DTGS LIWTQC+PC NC++Q APIF+P+ SST+K
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEK----- 114
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRD 124
RC C ++I YA + G ++TET T H + +P GC +++
Sbjct: 115 -------RCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSS- 166
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F +G++G S P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 167 -WFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFA-----SQGTSKINFGTNAIVAGD 220
Query: 185 DMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + MF+ + YYL+L +SV D + TF G +ID+G T+
Sbjct: 221 GVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFP 277
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RADFKVEP 301
+V D + T+ R + + + CY Y + +T HF AD ++
Sbjct: 278 VSYCNLVREAVDHYVTAV---RTADPTGNDMLCY-YTDTIDIFPVITMHFSGGADLVLDK 333
Query: 302 TYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
MY G FC+AI ++ ++++ G Q + YD ++ + F P NC+
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 166/359 (46%), Gaps = 22/359 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP++ +++ DTGS ++W QC PC C++QS PIF+P S TY IPC
Sbjct: 141 EYFTRLGV--GTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198
Query: 64 DLICRRPPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
CRR N + C+++++Y G+ G STET TF +N+ V GV GC +
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-RNR---VKGVALGCGH 254
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG F GQ FSYCLV + +S++ FG A
Sbjct: 255 DNEGLFVGAAGLLGLGKGKLSFP--GQTGHRFNQKFSYCLVDRSASSKPSSVV-FGNAAV 311
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YY+ L ISV R+ G A F L + G GG +ID+G T
Sbjct: 312 SRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSVT 371
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRAD 296
+ R P + MR + F G + + A + ++ C+ + ++ HF AD
Sbjct: 372 RLIR-PAYIAMR---DAF-RVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGAD 426
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ T G FC A + + S++G QQQ R VYDL + + F P CA
Sbjct: 427 VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 113/374 (30%), Positives = 168/374 (44%), Gaps = 27/374 (7%)
Query: 6 FYTVDVLFGTPSKSEF--LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
Y+V V G+ F L D L W QC PCV Q +FN AS Y I
Sbjct: 78 IYSVRVGIGSGGTQHFYKLALDLVRPLTWMQCKPCVPEKRQDGSVFNTAASPHYHHIAST 137
Query: 64 DLICRRPPFRCENGQCVHRINYA-GGASASGLVSTETFTFH---LKNKLVCVPGVIFGCS 119
D C P R G+C + + G + A G++ ++ F F + + V G++FGC+
Sbjct: 138 DPRCMAPYTRAGQGRCTFDVKFQYGDSRARGVLGSDDFVFDGSGPGSPISSVNGLVFGCA 197
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL----FSYCLVYAYREMEATSILRF 175
++ DF AG++ + P S + QL +A+GL FSYCL + + + LRF
Sbjct: 198 HNTHDFYNHDLWAGVMSLNRHPTSFIRQL--SARGLAAPRFSYCLA-SRQHRDRRGFLRF 254
Query: 176 GKDANIQRKDMKTIRMFVDRS---SHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGC 230
G D Q T + D + YY+ + +S+ R+ P F L RR+ GGC
Sbjct: 255 GADIPDQSHARSTPLLHGDLAQGGGMYYVGVVGVSLGGRRLTAITPVMFELNRRSLRGGC 314
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR--YDSRFRAYASM 288
+ID G T + PY V++ H S G Q S ++C+R ++S R S+
Sbjct: 315 IIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQHA-IFSPGQKHCFRGKWESIHRHLPSV 373
Query: 289 TFHF----DRADFKVEPTYMYFIFQNE--GYFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
T HF + + P ++ E Y C+AI +++GA Q DTRF +DL
Sbjct: 374 TLHFQFHPESVALFIRPELLFVAMTGERTDYVCLAIVPYAERTIIGAGQMLDTRFTFDLQ 433
Query: 343 TGTIQFVPENCAND 356
+ F PE C D
Sbjct: 434 QNRLFFAPEQCHLD 447
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 166/355 (46%), Gaps = 13/355 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++V GTP + + DTGS L WT C+PC C+ Q PIF+P S++Y+ I CD +
Sbjct: 25 YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKL 84
Query: 67 CRRPPFRCENGQ--CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNR 123
C + + Q C + YA A G+++ ET T K + V + G++FGC ++N
Sbjct: 85 CHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNNT 144
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL-FSYCLVYAYREMEATSILRFGKDANIQ 182
F+ GI+G P S + Q+ S+ G FS CLV + ++ +S + GK + +
Sbjct: 145 G-GFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVS 203
Query: 183 RKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
K + + + + + Y+++L ISV + + F + G +D+G T +
Sbjct: 204 GKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGN--VFLDSGTPPTIL 261
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEP 301
Y+ ++ + ++ + CYR + R +T HF+ D K+ P
Sbjct: 262 PTQLYDRLVAQVRSEVAM--KPVTNDLDLGPQLCYRTKNNLRGPV-LTAHFEGGDVKLLP 318
Query: 302 TYMYFIFQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
T F+ +G FC+ + S V G + Q + +DL+ + F P +C
Sbjct: 319 T-QTFVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCTK 372
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 113/373 (30%), Positives = 163/373 (43%), Gaps = 32/373 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP K L+ DTGS L W QC+PC CF Q+ P ++P SS++K I C D
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPR 254
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C+ PP C E C + Y ++ +G + ETFT +L K +L V V
Sbjct: 255 CQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENV 314
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG F+ QL+S FSYCLV +S L
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGRGPLSFAT--QLQSLYGHSFSYCLVDRNSNSSVSSKLI 372
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D + FV + YY+ ++ I V + T+ L G GG
Sbjct: 373 FGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQGGGG 432
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS----RFRAY 285
+ID+G T+ YE++ F F + + CY +
Sbjct: 433 TIIDSGTTLTYFAEPAYEIIKEAFMRKIKGF---PLVETFPPLKPCYNVSGVEKMELPEF 489
Query: 286 ASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLN 342
A + DF VE YFI + E C+AI + R+ S++G +QQQ+ +YDL
Sbjct: 490 AILFADGAMWDFPVE---NYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDLK 546
Query: 343 TGTIQFVPENCAN 355
+ + P CA+
Sbjct: 547 KSRLGYAPMKCAD 559
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 155/359 (43%), Gaps = 22/359 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP++ F++ DTGS ++W QC PC C++Q+ P+FNP S ++ IPC
Sbjct: 146 EYFTRLGV--GTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCG 203
Query: 64 DLICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR P + C+++++Y G+ G STET TF V V GC +
Sbjct: 204 SPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTR----VGRVALGCGH 259
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG F Q+ FSYCLV + S + FG A
Sbjct: 260 DNEGLFIGAAGLLGLGRGRLSFP--SQIGRRFSRKFSYCLVDRSASSKP-SYMVFGDSAI 316
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YY+ L +SV R+ G F L G GG +ID+G T
Sbjct: 317 SRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVT 376
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRAD 296
+ R Y + F G + A E ++ C+ + ++ HF AD
Sbjct: 377 RLTRPAYVALRDAF-----RVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGAD 431
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + N G FC A + + S+VG QQQ R VYDL + F P CA
Sbjct: 432 VSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGCA 490
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 163/372 (43%), Gaps = 24/372 (6%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
YF V V GTPS L+ DTGS L+W QC PC C+ Q +F+P SSTY+R+
Sbjct: 82 ESGEYFALVGV--GTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRV 139
Query: 61 PCDDLICRRPPFR-CEN-----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
PC CR F C++ G C + + Y G+S++G ++T+ F V V
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDTY---VNNV 196
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC DN FD + AG+LG + S+ Q+ +F YCL +S L
Sbjct: 197 TLGCGRDNEGL-FD-SAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLV 254
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGCMI 232
FG+ + R S YY+ + SV R+ GF+ + AL G GG ++
Sbjct: 255 FGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVV 314
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM-TFH 291
D+G + R Y + FD + G +R+ ++ CY R A A + H
Sbjct: 315 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 374
Query: 292 FDRADFKVEPTYMYFIFQNEG-------YFCVAISFSDRN-SVVGAWQQQDTRFVYDLNT 343
F P YF+ + G C+ +D SV+G QQQ R V+D+
Sbjct: 375 FAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEK 434
Query: 344 GTIQFVPENCAN 355
I F P+ C +
Sbjct: 435 ERIGFAPKGCTS 446
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 168/357 (47%), Gaps = 26/357 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V + G PSK+ +++ DTGS + W QC PC +C+ Q PIF+P +SS++ R+ C
Sbjct: 159 EYFLRVGI--GRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQ 216
Query: 64 DLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
CR F C N C+++++Y G+ G +TET +F V V GC +DN
Sbjct: 217 TPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGS---VDKVAIGCGHDN 273
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
AG++G P SL Q+K+++ FSYCLV R+ +S L F + +
Sbjct: 274 EGLFV--GAAGLIGLGGGPLSLTSQIKASS---FSYCLV--NRDSVDSSTLEFN---SAK 323
Query: 183 RKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
D T +F + + YY+ + +SV ++ P F + +G GG ++D G T
Sbjct: 324 PSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTR 383
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDSRFRA-YASMTFHFDRADFK 298
+Q Y + + F + + ++ CY SR ++ F FD
Sbjct: 384 LQTQAYNALR----DTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSL 439
Query: 299 VEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC+A + + + S++G QQQ TR YDL + F C
Sbjct: 440 PLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 169/367 (46%), Gaps = 40/367 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
+ V+ G P+ + + DTGS ++W +C PC C Q+ P+ +P+ SSTY +PC +
Sbjct: 98 LFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNT 157
Query: 66 ICRRPPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKNKLV-CVPGVIFGCSNDN 122
+C P N QC + ++YA G S++G+++TE FH ++ V VP V+FGCS++N
Sbjct: 158 MCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHEN 217
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
D+ D G+ G S + ++ S FSYCL + L FG+ AN +
Sbjct: 218 GDYK-DRRFTGVFGLGKGITSFVTRMGSK----FSYCLGNIADPHYGYNQLVFGEKANFE 272
Query: 183 --RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+K + + HYY++L+ ISV + R+ F+++ N +ID+G T+
Sbjct: 273 GYSTPLKVV------NGHYYVTLEGISVGEKRLDIDSTAFSMKGN-EKSALIDSGTALTW 325
Query: 241 IQRGPYEV----VMRHFDEHFTSFGRQRMHNASEDWEYCYR--YDSRFRAYASMTFHFD- 293
+ + V + D F R CY+ + +TFHF
Sbjct: 326 LAESAFRALDNEVRQLLDGVLMPFWRGSFA--------CYKGTVSQDLIGFPVVTFHFSG 377
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN-------SVVGAWQQQDTRFVYDLNTGTI 346
AD ++ M++ + C+A+ + SV+G QQ YDLN+ +
Sbjct: 378 GADLDLDTESMFYQATPD-ILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKL 436
Query: 347 QFVPENC 353
F +C
Sbjct: 437 FFQRIDC 443
>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 410
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 158/359 (44%), Gaps = 25/359 (6%)
Query: 5 YFYTVDVLFGTP--SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+ Y V V GT ++ + L DTG+ W C PC Q +F+P AS T++ +
Sbjct: 66 FIYGVFVSIGTGEGTRRKVLALDTGASTSWLMCEPCQPPLPQVGHLFSPAASPTFQGVRG 125
Query: 63 DDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNK-----LVCVPGVIFG 117
D +C P +R + C R +A +G +S +TF HL++ + VPG++FG
Sbjct: 126 DGPVCTVP-YRHTDKGCSFRFPFA-----AGYLSRDTF--HLRSGRSGTVMESVPGIMFG 177
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C++ F DG ++G+L S SP S L L + G FSYCL S LRFG
Sbjct: 178 CAHSVTGFHNDGTLSGVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTH-NPDSFLRFGA 236
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
D T + Y+L++ IS+ + R+ FA GGC I+
Sbjct: 237 DVPSLPPHAHTTTLVHAGVPGYHLNIVGISLGNKRLHIDRHVFA----AGGGCSINPAVT 292
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRA-YASMTFHF-DR 294
T I Y V H G R+ C+ + D R M+FHF D
Sbjct: 293 ITRIMELAYLAVEHALVAHMKELGSGRV-KGMPGRSLCFDHMDRSVRVQLPGMSFHFEDG 351
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
A+ + ++ + F V + +V+GA QQ DTRF +D+ G + FVPE C
Sbjct: 352 AELRFAAEQLFDVRVMAACFLV-VGRGHHQTVIGAAQQVDTRFTFDIAAGRLAFVPETC 409
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 173/359 (48%), Gaps = 22/359 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP DTGS LIW QC+PC+ C+NQ P+F+P SSTY I CD +
Sbjct: 64 YLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPL 123
Query: 67 CRRPPF-RCE-NGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNR 123
C +P C +C + YA + G+++ ET T K + + G++FGC ++N
Sbjct: 124 CYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNT 183
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILRFGKDANIQ 182
+F+ + G++G P SL+ Q+ G FS CLV ++ +S + FGK + +
Sbjct: 184 G-NFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEVL 242
Query: 183 RKDMKTIRMFVDRS---SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + T + V R + YY++L ISV D + P + + G ++D+G
Sbjct: 243 GEGVVTTPL-VQREQDMTSYYVTLLGISVEDT---YLPMNSTIEK---GNMLVDSGTPPN 295
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ + Y+ V + S + CYR + + ++T+HF+ A+ +
Sbjct: 296 ILPQQLYDRVYVEVKNKVPL--EPITDDPSLGPQLCYRTQTNLKG-PTLTYHFEGANLLL 352
Query: 300 EP--TYMYFIFQNEGYFCVAIS--FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
P T++ + +G FC+AI+ + + G + Q + +DL+ + F P +C
Sbjct: 353 TPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDCT 411
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 157/358 (43%), Gaps = 21/358 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP K +++ DTGS ++W QC PC NC++Q+ P+FNP S ++ ++ C
Sbjct: 128 EYFTRIGV--GTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCR 185
Query: 64 DLICRR--PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+CRR P + C+++++Y G+ +G TET TF V V GC +D
Sbjct: 186 TPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK----VEQVALGCGHD 241
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N LG F Q T FSYCLV + +S++ FG A
Sbjct: 242 NEGLFVGAAGLLGLGRGGLSFP--SQAGRTFNQKFSYCLVDRSASSKPSSVV-FGNSAVS 298
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + YY+ L ISV + G F L R G GG +ID G T
Sbjct: 299 RTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTR 358
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRADF 297
+ + Y + F G + +A E ++ CY + ++ HF AD
Sbjct: 359 LNKPAYIALRDAFRA-----GASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADV 413
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + G FC A + + S++G QQQ R VYDL + + F P CA
Sbjct: 414 SLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 110/375 (29%), Positives = 164/375 (43%), Gaps = 40/375 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS L+WTQC PC +CF+Q P+ +P ASSTY +PC
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGAPR 151
Query: 67 CRRPPF-RCENG----------QCVHRINYAGGASASGLVSTETFTFHLKN----KLVCV 111
CR PF C G C + +Y + G ++T+ FTF N +
Sbjct: 152 CRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLPT 211
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA-- 169
+ FGC + N+ F N GI GF +SL QL T FSYC + +
Sbjct: 212 RRLTFGCGHFNKGV-FQSNETGIAGFGRGRWSLPSQLNVTT---FSYCFTSMFESKSSLV 267
Query: 170 ------TSILRFGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFA 221
+ L + A+I +++T + + S S Y+LSL+ ISV R+ A
Sbjct: 268 TLGGAPAAALLYSHAAHIS-GEVRTTPLLKNPSQPSLYFLSLKGISVGKTRL-------A 319
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
+ +ID+GA T + YE V F + ++ D + +
Sbjct: 320 VPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTAL 379
Query: 282 FR--AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFV 338
+R S+T H D AD+++ F CV + + + +V+G +QQQ+T V
Sbjct: 380 WRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQNTHVV 439
Query: 339 YDLNTGTIQFVPENC 353
YDL + F P C
Sbjct: 440 YDLENDWLSFAPARC 454
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 157/358 (43%), Gaps = 21/358 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP K +++ DTGS ++W QC PC NC++Q+ P+FNP S ++ ++ C
Sbjct: 41 EYFTRIGV--GTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCR 98
Query: 64 DLICRR--PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+CRR P + C+++++Y G+ +G TET TF V V GC +D
Sbjct: 99 TPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK----VEQVALGCGHD 154
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N LG F Q T FSYCLV + +S++ FG A
Sbjct: 155 NEGLFVGAAGLLGLGRGGLSFP--SQAGRTFNQKFSYCLVDRSASSKPSSVV-FGNSAVS 211
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + YY+ L ISV + G F L R G GG +ID G T
Sbjct: 212 RTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTR 271
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRADF 297
+ + Y + F G + +A E ++ CY + ++ HF AD
Sbjct: 272 LNKPAYIALRDAFRA-----GASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADV 326
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + G FC A + + S++G QQQ R VYDL + + F P CA
Sbjct: 327 SLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 384
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 163/371 (43%), Gaps = 31/371 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP K L+ DTGS L W QC+PC+ CF QS P ++P SS+++ I C D
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPR 251
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C+ PP C EN C + Y ++ +G + ETFT +L K++ V V
Sbjct: 252 CKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENV 311
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG F+ QL+S FSYCLV + +S L
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGRGPLSFA--SQLQSIYGHSFSYCLVDRNSDTSVSSKLI 369
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D + FV + YY+ ++ I V + T+ L + G GG
Sbjct: 370 FGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGG 429
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS----RFRAY 285
+ID+G T+ YE++ F + + + + CY +
Sbjct: 430 TIIDSGTTLTYFAEPAYEIIKEAFMKKIKGY---ELVEGFPPLKPCYNVSGIEKMELPDF 486
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNT 343
+ DF VE YFI C+AI + ++ S++G +QQQ+ +YD+
Sbjct: 487 GILFSDGAMWDFPVE---NYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKK 543
Query: 344 GTIQFVPENCA 354
+ + P C
Sbjct: 544 SRLGYAPMKCT 554
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 163/355 (45%), Gaps = 32/355 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP DTGS LIWTQC+PC NC++Q APIF+P+ SST+K
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEK----- 114
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRD 124
RC C ++I YA + G ++TET T H + +P GC +++
Sbjct: 115 -------RCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSS- 166
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F +G++G S P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 167 -WFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFA-----SQGTSKINFGTNAIVAGD 220
Query: 185 DMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + MF+ + YYL+L +SV D + TF G +ID+G T+
Sbjct: 221 GVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFP 277
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RADFKVEP 301
+V D + T+ R + + + CY Y + +T HF AD ++
Sbjct: 278 VSYCNLVREAVDHYVTAV---RTADPTGNDMLCY-YTDTIDIFPVITMHFSGGADLVLDK 333
Query: 302 TYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
MY G FC+AI ++ ++++ G Q + YD ++ + F P NC+
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 160/359 (44%), Gaps = 23/359 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + L GTP + + DT + IW QC PC CFN ++P+F+P+ SSTYK IPC
Sbjct: 89 YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPK 148
Query: 67 CRRPPFRCENGQCV--------HRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFG 117
C+ EN C + Y G A + G +S +T T + N + ++ G
Sbjct: 149 CK----NVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIG 204
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C + N+ +G ++G +G P S + QL S+ G FSYCLV + + L FG
Sbjct: 205 CGHRNKG-PLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLHFGD 263
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
+ + + + Y +L +SV DH I F T + + G +ID+G
Sbjct: 264 KSVVSGVGTVSTPITAGEIG-YSTTLNALSVGDHIIKFENST--SKNDNLGNTIIDSGTT 320
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGR-QRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
T + E V + TS + +R + ++ ++ CY+ + +T HF+ AD
Sbjct: 321 LTILP----ENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKNLDVPIITAHFNGAD 376
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSD-RNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ ++ +E +S + +++G QQ+ +DL I F P +C
Sbjct: 377 VHLNSLNTFYPIDHEVVCFAFVSVGNFPGTIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 163/366 (44%), Gaps = 38/366 (10%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + + G+P++ +++ DTGS + W QC PC +C+ QS P+F+P SS+Y +PCD
Sbjct: 196 YFSRIGI--GSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDS 253
Query: 65 LICRR-PPFRCE------NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
CR C N CV+ + Y G+ G +TET T + V V G
Sbjct: 254 PHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLG-GDGSAAVHDVAIG 312
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C +DN AG+L P S Q+ +T FSYCLV R+ + S L+FG
Sbjct: 313 CGHDNEGLFV--GAAGLLALGGGPLSFPSQISATE---FSYCLV--DRDSPSASTLQFGA 365
Query: 178 DANIQRKDMKTIRMFVDRSSH----YYLSLQDISVADHRIG-FAPGTFALRRNGTGGCMI 232
D T+ + RS YY++L ISV + P FA+ G+GG ++
Sbjct: 366 ------SDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIV 419
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFRA-YASMT 289
D+G T +Q Y + F G Q + AS ++ CY R +++
Sbjct: 420 DSGTAVTRLQSSAYSALRDAFVR-----GTQALPRASGVSLFDTCYDLAGRSSVQVPAVS 474
Query: 290 FHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQ 347
F+ P Y I G +C+A + + S+VG QQQ R +D T+
Sbjct: 475 LRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVG 534
Query: 348 FVPENC 353
F P C
Sbjct: 535 FSPNKC 540
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 162/372 (43%), Gaps = 24/372 (6%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
YF V V GTPS L+ DTGS L+W QC PC C+ Q +F+P SSTY+R+
Sbjct: 82 ESGEYFALVGV--GTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRV 139
Query: 61 PCDDLICRRPPFR-CEN-----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
PC CR F C++ G C + + Y G+S++G ++T+ F V V
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDTY---VNNV 196
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC DN FD + AG+LG S+ Q+ +F YCL +S L
Sbjct: 197 TLGCGRDNEGL-FD-SAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLV 254
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGCMI 232
FG+ + R S YY+ + SV R+ GF+ + AL G GG ++
Sbjct: 255 FGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVV 314
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM-TFH 291
D+G + R Y + FD + G +R+ ++ CY R A A + H
Sbjct: 315 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 374
Query: 292 FDRADFKVEPTYMYFIFQNEG-------YFCVAISFSDRN-SVVGAWQQQDTRFVYDLNT 343
F P YF+ + G C+ +D SV+G QQQ R V+D+
Sbjct: 375 FAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEK 434
Query: 344 GTIQFVPENCAN 355
I F P+ C +
Sbjct: 435 ERIGFAPKGCTS 446
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 165/358 (46%), Gaps = 22/358 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V GTP++ +++ DTGS ++W QC PC C++QS PIF+P S TY IPC
Sbjct: 142 YFTRLGV--GTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 65 LICRRPPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
CRR N + C+++++Y G+ G STET TF +N+ V GV GC +D
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-RNR---VKGVALGCGHD 255
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N LG F GQ FSYCLV + +S++ FG A
Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFP--GQTGHRFNQKFSYCLVDRSASSKPSSVV-FGNAAVS 312
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + YY+ L ISV R+ G F L + G GG +ID+G T
Sbjct: 313 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 372
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRADF 297
+ R P + MR + F G + + A + ++ C+ + ++ HF AD
Sbjct: 373 LIR-PAYIAMR---DAF-RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV 427
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ T G FC A + + S++G QQQ R VYDL + + F P CA
Sbjct: 428 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 165/356 (46%), Gaps = 27/356 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V V G PSK +++ DTGS + W QC PC +C+ QS PIF+P ASS+Y + CD
Sbjct: 157 YFSRVGV--GQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDA 214
Query: 65 LICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ C NG+C+++++Y G+ G TET +F + V V GC +DN
Sbjct: 215 QQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSFGAGS----VNRVAIGCGHDNE 270
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG + Q+K+T+ FSYCLV R+ +S L F N R
Sbjct: 271 GLFVGSAGLLGLGGGPLSLT--SQIKATS---FSYCLV--DRDSGKSSTLEF----NSPR 319
Query: 184 KDMKTIRMFVDR---SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + ++ YY+ L +SV + P TFA+ ++G GG ++D+G T
Sbjct: 320 PGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITR 379
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDRADFKV 299
++ Y V F ++ R ++ CY S + +++FHF
Sbjct: 380 LRTQAYNSVRDAFKRKTSNL---RPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWA 436
Query: 300 EPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G +C A + + + S++G QQQ TR +DL + F P C
Sbjct: 437 LPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 163/344 (47%), Gaps = 9/344 (2%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR--RPP 71
G+P + DTGS ++W QC PC +C+ Q+ PIF+P+ S TYK +PC C R
Sbjct: 98 GSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLRNT 157
Query: 72 FRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
+ C + I+Y G+ + G +S ET T V P + GC ++N +F
Sbjct: 158 ACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGG-TFQEE 216
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ-RKDMKTI 189
+GI+G P SL+ QL S+ G FSYCL + E ++S L FG A + R + T
Sbjct: 217 GSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTP 276
Query: 190 RMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVV 249
++ Y+L+L+ SV D+RI F+ + + +G G +ID+G T + P E
Sbjct: 277 LDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLL---PQEDY 333
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQ 309
+ +R + S+ CY+ S +T HF AD ++ P F+
Sbjct: 334 LNLESAVSDVIKLERARDPSKLLSLCYKTTSDELDLPVITAHFKGADVELNPIST-FVPV 392
Query: 310 NEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+G C A S ++ G QQ+ YDL T+ F P +C
Sbjct: 393 EKGVVCFAFISSKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDC 436
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 168/358 (46%), Gaps = 24/358 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + DTGS LIWTQC PC C+ Q AP+F+P +S TY+ + CD
Sbjct: 93 YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTRQ 152
Query: 67 CRR--PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDN 122
C+ C + Q C + Y + +G ++ +T T N V P + GC N
Sbjct: 153 CQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRRN 212
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDANI 181
+FD +GI+G P SL+ Q+ S+ G FSYCLV ++ +S L FG++A +
Sbjct: 213 NG-TFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNAVV 271
Query: 182 QRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+++ + + + YYL+L+ +SV D +I F + G +ID+G T
Sbjct: 272 SGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFG---GSSFGGSEGNIIIDSGTSLTL 328
Query: 241 IQRGPYEVVMRHFDEHFTS-----FGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRA 295
+ F E T+ +R +AS +CYR + +T HF+ A
Sbjct: 329 FP-------VNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLKVPV-ITAHFNGA 380
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
D ++ T FI ++ C+A + + ++ G Q + YD+ ++ F P +C
Sbjct: 381 DVVLQ-TLNTFILISDDVLCLAFNSTQSGAIFGNVAQMNFLIGYDIQGKSVSFKPTDC 437
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 165/355 (46%), Gaps = 16/355 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++V GTP + DTGS L+WTQC PC +C+ Q P+F+P SSTYK + C
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149
Query: 67 C----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSND 121
C + + C + ++Y + G ++ +T T + + + + +I GC ++
Sbjct: 150 CTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHN 209
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N +F+ +GI+G P SL+ QL + G FSYCLV + + TS + FG +A +
Sbjct: 210 NAG-TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268
Query: 182 QRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + S YYL+L+ ISV +I ++ + + G +ID+G T
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYS---GSDSESSEGNIIIDSGTTLT 325
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ P E D +S ++ + CY + +T HFD AD K+
Sbjct: 326 LL---PTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPV-ITMHFDGADVKL 381
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + F+ +E C A S S+ G Q + YD + T+ F P +CA
Sbjct: 382 DSSNA-FVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 165/355 (46%), Gaps = 16/355 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++V GTP + DTGS L+WTQC PC +C+ Q P+F+P SSTYK + C
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149
Query: 67 C----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSND 121
C + + C + ++Y + G ++ +T T + + + + +I GC ++
Sbjct: 150 CTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHN 209
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N +F+ +GI+G P SL+ QL + G FSYCLV + + TS + FG +A +
Sbjct: 210 NAG-TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268
Query: 182 QRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + S YYL+L+ ISV +I ++ + + G +ID+G T
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYS---GSDSESSEGNIIIDSGTTLT 325
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ P E D +S ++ + CY + +T HFD AD K+
Sbjct: 326 LL---PTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPV-ITMHFDGADVKL 381
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + F+ +E C A S S+ G Q + YD + T+ F P +CA
Sbjct: 382 DSSNA-FVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 159/360 (44%), Gaps = 22/360 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V GTP+ + +++ DTGS ++W QC PC C+NQS P+FNP S T+ +PC
Sbjct: 136 YFMRLGV--GTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGS 193
Query: 65 LICRRPPFRCE-----NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+CRR E + C+++++Y G+ G STET TFH V V GC
Sbjct: 194 RLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGAR----VDHVALGCG 249
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV---YAYREMEATSILRFG 176
+DN LG F Q K+ G FSYCLV + + S + FG
Sbjct: 250 HDNEGLFVGAAGLLGLGRGGLSFP--SQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFG 307
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTG 235
A + + + YYL L ISV R+ G + F L G GG +ID+G
Sbjct: 308 NGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSG 367
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDR 294
T + + Y + F T R ++ ++ C+ ++ FHF
Sbjct: 368 TSVTRLTQSAYVALRDAFRLGATRLKRAPSYSL---FDTCFDLSGMTTVKVPTVVFHFTG 424
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + N+G FC A + + + S++G QQQ R YDL + F+ C
Sbjct: 425 GEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 172/375 (45%), Gaps = 40/375 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + +L DTGS LIWTQC PC C + AP F P +SST+ ++PC +
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 67 CR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ P C CV+ Y G +A G ++TE T H+ PGV FGCS +N
Sbjct: 150 CQFLTSPYLTCNATGCVYYYPYGMGFTA-GYLATE--TLHVGGA--SFPGVAFGCSTEN- 203
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT-SILRFGKDANIQ 182
+ +GI+G SP SL+ Q+ G FSYCL + +A S + FG A +
Sbjct: 204 --GVGNSSSGIVGLGRSPLSLVSQV---GVGRFSYCL---RSDADAGDSPILFGSLAKVT 255
Query: 183 RKDMKTIRMF----VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG----TGGCMIDT 234
++++ + + SS+YY++L I+V + TF R GG ++D+
Sbjct: 256 GGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDS 315
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-DWEYCYRYDSRFRAYA----SMT 289
G T++ + Y +V R F + N + ++ C+ + ++
Sbjct: 316 GTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLV 375
Query: 290 FHF-DRADFKV-EPTYMYFIFQN-------EGYFCVAISFSDRNSVVGAWQQQDTRFVYD 340
F A++ V +Y+ + + E + S S++G Q D +YD
Sbjct: 376 LRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYD 435
Query: 341 LNTGTIQFVPENCAN 355
L+ G F P +CAN
Sbjct: 436 LDGGMFSFAPADCAN 450
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 169/358 (47%), Gaps = 41/358 (11%)
Query: 22 LLFDTGSYLIWTQCL----PCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFRCEN- 76
L+ DTGS LIWTQC + S P+++P SST+ +PC D +C+ F +N
Sbjct: 28 LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNC 87
Query: 77 ---GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAG 133
+CV+ Y G A+A G++++ETFTF + + G FGC + S G G
Sbjct: 88 TSKNRCVYEDVY-GSAAAVGVLASETFTFGARRAVSLRLG--FGCGALSAG-SLIGA-TG 142
Query: 134 ILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDANIQR----KDMKT 188
ILG S SL+ QLK FSYCL +A ++ TS L FG A++ R + ++T
Sbjct: 143 ILGLSPESLSLITQLKIQR---FSYCLTPFADKK---TSPLLFGAMADLSRHKTTRPIQT 196
Query: 189 IRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
+ + + +YY+ L IS+ R+ + A+R +G GG ++D+G+ ++ +
Sbjct: 197 TAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAF 256
Query: 247 EVVMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDSRFRAYA-------SMTFHFDRADFK 298
E V E R + N + ED+E C+ R A A + HFD
Sbjct: 257 EAV----KEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAM 312
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V P YF G C+A+ + S++G QQQ+ ++D+ F P C
Sbjct: 313 VLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 170/357 (47%), Gaps = 13/357 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + DTGS +IW QC PC +C+NQ+ PIF+P+ S TYK +PC I
Sbjct: 94 YLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNI 153
Query: 67 CR--RPPFRCE--NGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSND 121
C+ + C N +C + I Y + + G +S ET T V P + GC ++
Sbjct: 154 CQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGCGHN 213
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N+ +F +GI+G P SL+ QL S+ G FSYCL + + ++S L FG +A +
Sbjct: 214 NKG-TFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAVV 272
Query: 182 Q-RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
R + T + + Y+L+L+ SV D+RI F + G G +ID+G T
Sbjct: 273 SGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFG-SSSFESSGGEGNIIIDSGTTLTI 331
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS-MTFHFDRADFKV 299
+ Y + + + +R+ + S+ CYR S +T HF AD ++
Sbjct: 332 LPEDDYLNLESAVAD---AIELERVEDPSKFLRLCYRTTSSDELNVPVITAHFKGADVEL 388
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
P FI +EG C A S + G QQ+ YDL T+ F P +C +
Sbjct: 389 NPIST-FIEVDEGVVCFAFRSSKIGPIFGNLAQQNLLVGYDLVKQTVSFKPTDCTQE 444
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 159/357 (44%), Gaps = 24/357 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V G P + + ++ DTGS + W QC PC +C+ QS PI+NP SS+YK + C
Sbjct: 145 YFSRIGV--GAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQA 202
Query: 65 LICRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFH---LKNKLVCVPGVIFGCS 119
+C++ NG C+++++Y G+ G +TET T L+N V GC
Sbjct: 203 NLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAPLQN-------VAIGCG 255
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+DN LG F QL +FSYCLV R+ E++S L+FG+ A
Sbjct: 256 HDNEGLFVGAAGLLGLGGGSLSFP--SQLTDENGKIFSYCLV--DRDSESSSTLQFGRAA 311
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YY+SL ISV + + F + +G GG ++D+G T
Sbjct: 312 VPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVT 371
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFK 298
+Q Y+ + F + + ++ CY S+ ++ FHF
Sbjct: 372 RLQTAAYDSLRDAFRAGTKNLPS---TDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSM 428
Query: 299 VEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y + + G FC A + + + S+VG QQQ R +D + F C
Sbjct: 429 SLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 164/356 (46%), Gaps = 18/356 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS L+W QC+PC C+ Q P+F+P +SS+Y I C
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTES 119
Query: 67 CRR-PPFRCENGQ--CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C + C Q C + +YA + G+++ ET T + V G+IFGC ++N
Sbjct: 120 CNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNN 179
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKST---AQGLFSYCLVYAYREMEATSILRFGKDA 179
F+ G++G P SL+ Q+ S+ +FS CLV + TS + FGK +
Sbjct: 180 S--GFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGS 237
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + + Y+ +L ISV D + F+ G+ +L G +ID+G T
Sbjct: 238 EVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGS-SLGTITKGNILIDSGTTIT 296
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
++ P E R ++ + + +E CY+ + ++T HF+ D +
Sbjct: 297 YL---PEEFYHRLIEQVRNKVALEPFR--IDGYELCYQTPTNLNG-PTLTIHFEGGDVLL 350
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVV-GAWQQQDTRFVYDLNTGTIQFVPENCA 354
P M+ Q++ FC A+ ++ V G + Q + +DL + F +C
Sbjct: 351 TPAQMFIPVQDDN-FCFAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDCT 405
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 110/354 (31%), Positives = 164/354 (46%), Gaps = 20/354 (5%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V GTP+K +++ DTGS + W QCLPC C+ QS PIF+P +SST+K + C D
Sbjct: 164 YFSRIGV--GTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSD 221
Query: 65 LICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C + +C+++++Y G+ G +T+T TF K V V GC +DN
Sbjct: 222 PKCASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTFGESGK---VNDVALGCGHDNE 278
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
AG+LG S+ Q+K+ + FSYCLV R+ +S L F I
Sbjct: 279 GLF--TGAAGLLGLGGGALSMTNQIKAKS---FSYCLV--DRDSAKSSSLDFNS-VQIGA 330
Query: 184 KDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
D + + + YY+ L SV ++ F + +G GG ++D G T +Q
Sbjct: 331 GDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQ 390
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEP 301
Y + F + T F ++ + ++ CY + S ++TFHF P
Sbjct: 391 TQAYNSLRDAFVKLTTDF--KKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLP 448
Query: 302 TYMYFI-FQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y I + G FC A + S S++G QQQ TR YDL I C
Sbjct: 449 AKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 158/361 (43%), Gaps = 22/361 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP+ + +++ DTGS ++W QC PC C+NQS IF+P S T+ +PC
Sbjct: 137 EYFMRLGV--GTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCG 194
Query: 64 DLICRRPPFRCE-----NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+CRR E + C+++++Y G+ G STET TFH V V GC
Sbjct: 195 SRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR----VDHVPLGC 250
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV---YAYREMEATSILRF 175
+DN LG F Q KS G FSYCLV + + S + F
Sbjct: 251 GHDNEGLFVGAAGLLGLGRGGLSFP--SQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVF 308
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDT 234
G DA + + + YYL L ISV R+ G + F L G GG +ID+
Sbjct: 309 GNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDS 368
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFD 293
G T + + Y + F T R ++ ++ C+ ++ FHF
Sbjct: 369 GTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSL---FDTCFDLSGMTTVKVPTVVFHFG 425
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
+ + + EG FC A + + + S++G QQQ R YDL + F+
Sbjct: 426 GGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRA 485
Query: 353 C 353
C
Sbjct: 486 C 486
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 168/374 (44%), Gaps = 38/374 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL- 65
Y +DV GTP + ++ DTGS L W QC PC++CF Q P+F+P ASS+Y+ + C D
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQR 208
Query: 66 -----------ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC--VP 112
CRRP C + Y ++ +G ++ E+FT +L V
Sbjct: 209 CGLVAPPEAPRACRRP----AEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD 264
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
GV+FGC + NR LG F+ QL++ FSYCLV +A S
Sbjct: 265 GVVFGCGHRNRGLFHGAAGLLGLGRGPLSFA--SQLRAVYGHTFSYCLV--EHGSDAGSK 320
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTG 228
+ FG+D + F SS YY+ L+ + V + + T+ + ++G+G
Sbjct: 321 VVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSG 380
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRA- 284
G +ID+G ++ Y+V+ + F + + R++ D+ CY R
Sbjct: 381 GTIIDSGTTLSYFVEPAYQVIRQAFVDLMS-----RLYPLIPDFPVLNPCYNVSGVERPE 435
Query: 285 YASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDL 341
++ F P YF+ +G C+A+ + R S++G +QQQ+ VYDL
Sbjct: 436 VPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIGNFQQQNFHVVYDL 495
Query: 342 NTGTIQFVPENCAN 355
+ F P CA
Sbjct: 496 QNNRLGFAPRRCAE 509
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 164/359 (45%), Gaps = 29/359 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
+ V V FGTP+++ ++FDTGS + W QCLPC +C+ Q PIF+P S+TY +PC
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGHP 194
Query: 66 ICRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR- 123
C +C NG C++++ Y G+S++G++S ET + L PG FGC N
Sbjct: 195 QCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRAL---PGFAFGCGQTNLG 251
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
DF G++ G++G SL Q ++ G FSYCL + L G
Sbjct: 252 DF---GDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLP---SDNTTHGYLTIGPTTPASN 305
Query: 184 KDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
D++ M D S Y++ L I + + + P F G +D+G I T++
Sbjct: 306 DDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTILTYL 360
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY-ASMTFHFDRADFKVE 300
Y + F T + + A + ++ CY + + + +++F F
Sbjct: 361 PPEAYTALRDRFKFTMTQY---KPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDL 417
Query: 301 PTYMYFIFQNEGYFCVA-ISFSDRNS-----VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ IF ++ + + F R S +VG QQ++T +YD+ I F +C
Sbjct: 418 SFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 161/365 (44%), Gaps = 33/365 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + DT S LIW QC PC CF Q P+F P+ SST+ + CD
Sbjct: 90 YLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQP 149
Query: 67 CRRPP-FRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C + C C++ Y G+S G++ TE + H ++ V P IFGC ++N
Sbjct: 150 CTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTE--SIHFGSQTVTFPKTIFGCGSNN- 206
Query: 124 DF--SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
DF + GI+G P SL+ QL FSYCL+ +T L+FG D I
Sbjct: 207 DFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTS--TSTIKLKFGNDTTI 264
Query: 182 QRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + +D S+Y+L L I++ + + G +ID G + T
Sbjct: 265 TGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKML-----QVRTTDHTNGNIIIDLGTVLT 319
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMT-----FHFDR 294
+++ +F +F + R+ + + + Y +D F A++T F F
Sbjct: 320 YLE--------VNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITFPKIVFQFTG 371
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAIS---FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
A + P ++F F + C+A+ ++ SV G Q D + YD + F P
Sbjct: 372 AKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPA 431
Query: 352 NCAND 356
+C+ +
Sbjct: 432 DCSKN 436
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 147/365 (40%), Gaps = 49/365 (13%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKR 59
H Y VD+ GTP + DTGS LIWTQC PC CF Q AP++ P S+TY
Sbjct: 86 HASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYAN 145
Query: 60 IPCDDLIC---RRPPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
+ C +C + P RC + C + +Y G S G+++TETFT V GV
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTA---VRGV 202
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC +N N +G++G P SL+ QL T
Sbjct: 203 AFGCGTEN--LGSTDNSSGLVGMGRGPLSLVSQLGVT----------------------- 237
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+R + L+ I+V D + P F L G GG +ID+
Sbjct: 238 -----RPRRSCRARAAARGGGAPTTTSPLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDS 292
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE---DWEYCYRYDS-RFRAYASMTF 290
G T ++ + + R R R+ AS C+ S +
Sbjct: 293 GTTFTALEERAFVALARALAS------RVRLPLASGAHLGLSLCFAAASPEAVEVPRLVL 346
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
HFD AD ++ ++ G C+ + + SV+G+ QQQ+T +YDL G + F P
Sbjct: 347 HFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEP 406
Query: 351 ENCAN 355
C
Sbjct: 407 AKCGE 411
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 159/353 (45%), Gaps = 12/353 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS L WTQC PC +C+ Q P F+P SSTY+ C
Sbjct: 92 YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSF 151
Query: 67 CRR--PPFRCENG-QCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C C NG +C +YA G+ G ++ ET T K V PG FGC + +
Sbjct: 152 CLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRS 211
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
FD + +GI+G V+ S++ QLKST G FSYCL+ + + +S + FG+ +
Sbjct: 212 GGI-FDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVS 270
Query: 183 RKDMKTIRMFVDRSSHYY--LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + YY ++L+ SV R+ + G G ++D+G T+
Sbjct: 271 GAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYK-GFSKKAEVEEGNIIVDSGTTYTY 329
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ P E ++ + S +R+ + + CY +T HF A+ +++
Sbjct: 330 L---PLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQIDAPIITAHFKDANVELQ 386
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + F+ E C + + ++G Q + +DL + F +C
Sbjct: 387 P-WNTFLRMQEDLVCFTVLPTSDIGILGNLAQVNFLVGFDLRKKRVSFKAADC 438
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 161/368 (43%), Gaps = 35/368 (9%)
Query: 4 NYFYTVDVLFGTP-SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
N Y + + G P S+ L DTGS ++WTQC PC CF Q P F+ AS+T + + C
Sbjct: 89 NSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVAC 148
Query: 63 DDLICRRPPFRCENG----QCVHRINYAGGASASGLVSTETFTFH--LKNKLVCVPGVIF 116
D +C E+G C + Y G+ + G ++FTF V VP + F
Sbjct: 149 SDPLCNA---HSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGF 205
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC N F GI GF P SL QLK FSYC + + L
Sbjct: 206 GCGMYNAG-RFLQTETGIAGFGRGPLSLPSQLKVRQ---FSYCFTTRFEAKSSPVFLGGA 261
Query: 177 KDANIQRKDMKTIRMFVDR------SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
D FV +SHY LS + ++V R+ ++ +G+G
Sbjct: 262 GDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVP----EIKADGSGAT 317
Query: 231 MIDTGA-IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD-SRFRAYASM 288
ID+G I TF + V R F + ++ +++ + C+ +D + A +
Sbjct: 318 FIDSGTDITTF-----PDAVFRQLKSAFIAQAALPVNKTADEDDICFSWDGKKTAAMPKL 372
Query: 289 TFHFDRADFKVEPTYMYFIFQNE-GYFCVAISFSDR--NSVVGAWQQQDTRFVYDLNTGT 345
FH + AD+ + P Y E G CVA+S S + +++G +QQQ+T VYDL G
Sbjct: 373 VFHLEGADWDL-PRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAAGK 431
Query: 346 IQFVPENC 353
+ VP C
Sbjct: 432 LLLVPAQC 439
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 167/367 (45%), Gaps = 25/367 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD--- 63
Y +DV G P + L+ DTGS L W QC PC CF+QS P+F+P+ S+++K IPC+
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146
Query: 64 -DLI----CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN--KLVCVPGVIF 116
DL+ CR + C + Y + SG ++ E+ + L + + + ++
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL-FSYCLVYAYREMEATSILRF 175
GC + N+ LG F QL+S+ G FSYCLV + +S + F
Sbjct: 207 GCGHSNKGLFQGAGGLLGLGQGALSFP--SQLRSSPIGQSFSYCLVDRTNNLSVSSAISF 264
Query: 176 GKDANIQRK-DMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
G + R D FV ++ YYL +Q I + + FA+ NG+GG
Sbjct: 265 GAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGT 324
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMT 289
+ID+G T++ R Y V F S+ R + + CY R + +++
Sbjct: 325 IIDSGTTLTYLNRDAYRAVESAFLARI-SYPRA---DPFDILGICYNATGRAAVPFPALS 380
Query: 290 FHFDRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
F P YFI + E C+AI +D S++G +QQQ+ F+YD+ +
Sbjct: 381 IVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLG 440
Query: 348 FVPENCA 354
F +C+
Sbjct: 441 FANTDCS 447
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 111/372 (29%), Positives = 166/372 (44%), Gaps = 48/372 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP L DTGS L WTQC PC CF Q PI++ SS++ +PC
Sbjct: 83 YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142
Query: 67 CRRPPF---RCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C P RC + C +R Y GA + + + V G+ FGC D
Sbjct: 143 CL--PIWSSRCSTPSATCRYRYAYDDGA------------YSPECAGISVGGIAFGCGVD 188
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N S+ N G +G SL+ QL G FSYCL + ++ + FG A +
Sbjct: 189 NGGLSY--NSTGTVGLGRGSLSLVAQL---GVGKFSYCLTDFFNTSLSSPVF-FGSLAEL 242
Query: 182 QRKDMKTIRMFVDRS---------SHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCM 231
V + S YY+SL+ IS+ D R+ GTF L +G+GG +
Sbjct: 243 AASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMI 302
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS----RFRAYAS 287
+D+G I T + + VV+ +H Q + NAS C+ +
Sbjct: 303 VDSGTIFTILVETGFRVVV----DHVAGVLGQPVVNASSLDRPCFPAPAAGVQELPDMPD 358
Query: 288 MTFHF-DRADFKV-EPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNT 343
M HF AD ++ YM F + E FC+ I + S SV+G +QQQ+ + ++D+
Sbjct: 359 MVLHFAGGADMRLHRDNYMSF-NEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITV 417
Query: 344 GTIQFVPENCAN 355
G + F+P +C+
Sbjct: 418 GQLSFMPTDCSK 429
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 173/360 (48%), Gaps = 14/360 (3%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
N Y +++ GTP S + DTGS L+W QC PC +C+ Q PIF+P S TY+ +
Sbjct: 90 SNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILS 149
Query: 62 CDDLICRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFG 117
C+ C + ++ C++ +Y G+ SG ++ +T T + V VP V+FG
Sbjct: 150 CEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFG 209
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C ++N +F+ + +G++G P S++ QL+ G FSYCLV + +S + FG
Sbjct: 210 CGHNNGG-TFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGS 268
Query: 178 DANIQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRI---GFAPGTFALRRNGTGGCMID 233
+ + + + + YYL+L+ +SV ++ GF+ L G +ID
Sbjct: 269 RGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIID 328
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD 293
+G T + + Y + + ++ G + + + + + CY S R ++T HF
Sbjct: 329 SGTTLTLLPQDFYGTLESNV---VSAIGGKPVRDPNNVFSLCYSNLSGLR-IPTITAHFV 384
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
AD +++P + Q E FC A+ ++ G Q + YDL + T+ F P +C
Sbjct: 385 GADLELKPLNTFVQVQ-EDLFCFAMIPVSDLAIFGNLAQMNFLVGYDLKSRTVSFKPTDC 443
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 166/370 (44%), Gaps = 30/370 (8%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA--PIFNPNASSTYKRI 60
K + V+ G P +F + DTGS L+W QC PC +C + P+FNP SST+
Sbjct: 64 KTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVEC 123
Query: 61 PCDDLICRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGC 118
CDD CR P C + +CV+ Y G + G+++ E TF N V + FGC
Sbjct: 124 SCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGC 183
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
++N + + GILG P SL QL S FSYC+ + + L G+D
Sbjct: 184 GHENGE-QLESEFTGILGLGAKPTSLAVQLGSK----FSYCIGDLANKNYGYNQLVLGED 238
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
A+I D I F + YY++L+ ISV D ++ P F RR G ++DTG +
Sbjct: 239 ADI-LGDPTPIE-FETENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSRTGVILDTGTLY 295
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHF-DRA 295
T++ ++ R S ++ CY R + + +TFHF A
Sbjct: 296 TWLA----DIAYRELYNEIKSILDPKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGA 351
Query: 296 DFKVEPTYMYF-IFQNEGY---FCVAISFSDRN-------SVVGAWQQQDTRFVYDLNTG 344
+ +E T M++ + +++ Y FC+++ + + + +G QQ YDL
Sbjct: 352 ELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKER 411
Query: 345 TIQFVPENCA 354
I +C
Sbjct: 412 NIYLQRIDCV 421
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 166/363 (45%), Gaps = 30/363 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTY-----KRIP 61
Y + GTP DTGS LIW QC PC +CF QS P+F P SST+ + P
Sbjct: 90 YLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQP 149
Query: 62 CDDLICRRPPFRC-ENGQCVHRINYAGGAS-ASGLVSTETFTFHLKN--KLVCVPGVIFG 117
C L+ + C ++G+C++ Y S + GL+STET F + + V P FG
Sbjct: 150 CTLLLPEQK--GCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFG 207
Query: 118 CSNDNRDFSFDG-NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
C N F + GI+G P SL+ Q+ FSYCL+ +TS L+FG
Sbjct: 208 CGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGS--TSTSKLKFG 265
Query: 177 KDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
++ I + + + M + ++Y+L+L+ ++VA + + G +ID+
Sbjct: 266 NESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTG--------STDGNVIIDS 317
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDR 294
G + T++ Y E S + + + +C+ Y F + + F F
Sbjct: 318 GTLLTYLGESFYYNFAASLQE---SLAVELVQDVLSPLPFCFPYRDNF-VFPEIAFQFTG 373
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
A ++P ++ + ++ C+ I+ S + S+ G++ Q D + YDL + F P +
Sbjct: 374 ARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQPTD 433
Query: 353 CAN 355
C+
Sbjct: 434 CSK 436
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 162/355 (45%), Gaps = 22/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS LIWTQC PC +C+ Q P+F+P ASSTYK + C
Sbjct: 94 YLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQ 153
Query: 67 C----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSND 121
C + E+ C + ++YA G+ G + +T T N+ V + +I GC +
Sbjct: 154 CTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQN 213
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N +F +G++G SL+ QL + G FSYCLV E + TS + FG +A +
Sbjct: 214 NA-VTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLV---PENDQTSKINFGTNAVV 269
Query: 182 QRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + V R + YYL+L+ ISV + N G +ID+G T
Sbjct: 270 SGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTP------DSNIKGNMVIDSGTTLTL 323
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ P + + + + + + CY + +T HF+ AD K+
Sbjct: 324 L---PVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLN-IPVITMHFEGADVKLY 379
Query: 301 PTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
P Y F E C+A S RN + G Q++ YD + T+ F P +CA
Sbjct: 380 P-YNSFFKVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDCA 433
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 171/376 (45%), Gaps = 36/376 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DVL GTP K L+ DTGS L W QCLPC +CF+Q+ ++P S+++K I C+D
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPR 221
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C PP +C +N C + Y ++ +G + ETFT +L ++ V +
Sbjct: 222 CSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENM 281
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR + LG FS QL+S FSYCLV + +S L
Sbjct: 282 MFGCGHWNRGLFSGASGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSDTNVSSKLI 339
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D ++ FV+ + YY+ ++ I V + T+ + +G GG
Sbjct: 340 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGG 399
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEH-------FTSFG-RQRMHNASEDWEYCYRYDSR 281
+ID+G ++ YE++ F E F F N S E
Sbjct: 400 TIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPEL 459
Query: 282 FRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVY 339
A+A D A + P FI+ +E C+AI + ++ S++G +QQQ+ +Y
Sbjct: 460 GIAFA------DGAVWNF-PAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILY 512
Query: 340 DLNTGTIQFVPENCAN 355
D + F P CA+
Sbjct: 513 DTKMSRLGFTPTKCAD 528
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 167/367 (45%), Gaps = 25/367 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD--- 63
Y +DV G P + L+ DTGS L W QC PC CF+QS P+F+P+ S+++K IPC+
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230
Query: 64 -DLI----CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN--KLVCVPGVIF 116
DL+ CR + C + Y + SG ++ E+ + L + + + ++
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL-FSYCLVYAYREMEATSILRF 175
GC + N+ LG F QL+S+ G FSYCLV + +S + F
Sbjct: 291 GCGHSNKGLFQGAGGLLGLGQGALSFP--SQLRSSPIGQSFSYCLVDRTNNLSVSSAISF 348
Query: 176 GKDANIQRK-DMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
G + R D FV ++ YYL +Q I + + FA+ NG+GG
Sbjct: 349 GAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGT 408
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMT 289
+ID+G T++ R Y V F S+ R + + CY R + +++
Sbjct: 409 IIDSGTTLTYLNRDAYRAVESAFLARI-SYPRA---DPFDILGICYNATGRTAVPFPTLS 464
Query: 290 FHFDRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
F P YFI + E C+AI +D S++G +QQQ+ F+YD+ +
Sbjct: 465 IVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLG 524
Query: 348 FVPENCA 354
F +C+
Sbjct: 525 FANTDCS 531
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 154/358 (43%), Gaps = 22/358 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP++ +++ DTGS ++W QC PC C+ Q+ P+F+P S TY IPC
Sbjct: 128 EYFTRIGV--GTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCG 185
Query: 64 DLICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR P +N C ++++Y G+ G STET TF V V GC +
Sbjct: 186 APLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR----VTRVALGCGH 241
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG F + Q FSYCLV + +S++ FG A
Sbjct: 242 DNEGLFIGAAGLLGLGRGRLSFPV--QTGRRFNQKFSYCLVDRSASAKPSSVV-FGDSAV 298
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YYL L ISV + G + F L G GG +ID+G T
Sbjct: 299 SRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSVT 358
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRAD 296
+ R Y + F G + A+E ++ C+ ++ HF AD
Sbjct: 359 RLTRPAYIALRDAF-----RVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRGAD 413
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ T N G FC A + + S++G QQQ R +DL + F P C
Sbjct: 414 VSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 160/357 (44%), Gaps = 31/357 (8%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP DTGS LIWTQC+PC NC+ Q APIF+P+ SST+K
Sbjct: 60 IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEK----- 114
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRD 124
RC C + I YA + ++G+++TET T + + GC +N +
Sbjct: 115 -------RCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSN 167
Query: 125 FSFDGNIA---GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
G A GI+G ++ P SL+ Q+ GL SYC + TS + FG +A +
Sbjct: 168 LMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCF-----SSQGTSKINFGTNAVV 222
Query: 182 QRKDMKTIRMFVDRSSH-YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
MF+ + YYL+L +SV D RI F + G ID+G T+
Sbjct: 223 AGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQD---GNIFIDSGTTYTY 279
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF-DRADFKV 299
+ Y ++R Q +SE+ CY +D+ + +T HF AD +
Sbjct: 280 LPTS-YCNLVREAVAASVVAANQVPDPSSENL-LCYNWDT-MEIFPVITLHFAGGADLVL 336
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ MY G FC+AI D + ++ G + YD +T I F P NC+
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 155/360 (43%), Gaps = 20/360 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP L DT S L W QC PC C+ QS P+F+P S++Y+ + +
Sbjct: 138 YIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAAD 197
Query: 67 C----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C R + G CV+ + Y G++ G ET TF +L P + GC +DN
Sbjct: 198 CQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGGVRL---PRISIGCGHDN 254
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDANI 181
+ F AGILG S Q+ G FSYCLV + +S L FG A
Sbjct: 255 KGL-FGAPAAGILGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPGSLSSTLTFGAGAVD 311
Query: 182 QRKDMK--TIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRR-NGTGGCMIDTGAI 237
+ + ++ + YY+ L ISV R+ G L G GG ++D+G
Sbjct: 312 TSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRGGVIVDSGTA 371
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FRAYASMTFHF-DRA 295
T + R Y F G+ + S ++ CY R + +++ HF
Sbjct: 372 VTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKKVPTVSMHFAGSV 431
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSV--VGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ K++P + G C A + + +SV +G QQQ R VYD+ G + F P +C
Sbjct: 432 EVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIG-GRVGFAPNSC 490
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 171/391 (43%), Gaps = 48/391 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-PIFNPNASSTYKRIPCDDL 65
Y V + GTP + L DTGS L+WTQC PC+NCF+Q A P+ +P ASST+ + CD
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRCDAP 153
Query: 66 ICRRPPF-RCENG-------QCVHRINYAGGASASGLVSTETFTFHLKNKL----VCVPG 113
+CR PF C G CV+ +Y + G ++++ FTF + V
Sbjct: 154 VCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVSERR 213
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
+ FGC + N+ F N GI GF +SL QL T+ FSYC + + L
Sbjct: 214 LTFGCGHFNKGI-FQANETGIAGFGRGRWSLPSQLGVTS---FSYCFTSMFESTSSLVTL 269
Query: 174 RFGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
+++ + D S S Y+LSL+ I+V RI LR +
Sbjct: 270 GVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLRE---ASAI 326
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW------------EYCYRYD 279
ID+GA T + YE V F + ++ D + +R+
Sbjct: 327 IDSGASITTLPEDVYEAVKAEFVAQV-GLPVSAVEGSALDLCFALPSAAAPKSAFGWRWR 385
Query: 280 SRFRAY----ASMTFHF-DRADFKVEPTYMYFIFQNEG--YFCV----AISFSDRNSVVG 328
R RA + FH AD+++ P Y +F++ G C+ A D+ V+G
Sbjct: 386 GRGRAMPVRVPRLVFHLGGGADWEL-PRENY-VFEDYGARVMCLVLDAATGGGDQTVVIG 443
Query: 329 AWQQQDTRFVYDLNTGTIQFVPENCANDHFL 359
+QQQ+T VYDL + F P C D +
Sbjct: 444 NYQQQNTHVVYDLENDVLSFAPARCECDKLV 474
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 158/367 (43%), Gaps = 41/367 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V FGTP+K+ L+ DTGS L W QC PC +C++Q IF P SS+YK +PC
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSAT 196
Query: 67 C------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C P C G CV+ INY G+S+ G S ET T + FGC +
Sbjct: 197 CTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDS----FQNFAFGCGH 252
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N F G+ +G+LG + S Q KS G F+YCL S +
Sbjct: 253 TNTGL-FKGS-SGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGS-------FS 303
Query: 181 IQRKDMKTIRMFVDRSSH------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ + + +F S+ Y++ L ISV R+ P G G ++D+
Sbjct: 304 VGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL-----GRGSTIVDS 358
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD--SRFRAYASMTFHF 292
G + T + Y + F + + + CY S+ R ++TFHF
Sbjct: 359 GTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI---LDTCYDLSRHSQVR-IPTITFHF 414
Query: 293 -DRADFKVEPTYMYFIFQNEG-YFCVAISFS---DRNSVVGAWQQQDTRFVYDLNTGTIQ 347
+ AD V + QN G C+A + + D +++G +QQQ R +D G I
Sbjct: 415 QNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIG 474
Query: 348 FVPENCA 354
F +CA
Sbjct: 475 FASGSCA 481
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 166/359 (46%), Gaps = 30/359 (8%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF + V GTP+K +L+ DTGS + W QC PC +C+ QS P+FNP +SSTYK + C
Sbjct: 162 YFSRIGV--GTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 65 LICR-RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C + +C+++++Y G+ G ++T+T TF K + V GC +DN
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK---INDVALGCGHDNE 276
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK------ 177
AG+LG S+ Q+K+T+ FSYCLV R+ +S L F
Sbjct: 277 GLFT--GAAGLLGLGGGALSITNQMKATS---FSYCLV--DRDSGKSSSLDFNSVQLGSG 329
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
DA + I F YY+ L SV ++ F + +G+GG ++D G
Sbjct: 330 DATAPLLRNQKIDTF------YYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTA 383
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRAD 296
T +Q Y + F + T+ ++ ++ ++ CY + S ++ FHF
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTTNL--KKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGK 441
Query: 297 FKVEPTYMYFI-FQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC A + S S++G QQQ TR YDL I C
Sbjct: 442 SLDLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 155/359 (43%), Gaps = 22/359 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP K +++ DTGS ++W QC PC C++Q+ IF+P+ S ++ IPC
Sbjct: 129 EYFTRLGV--GTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCY 186
Query: 64 DLICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR P +N C ++++Y G+ G STET TF + VP V GC +
Sbjct: 187 SPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF----RRAAVPRVAIGCGH 242
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG F Q + FSYCL + +SI+ FG A
Sbjct: 243 DNEGLFVGAAGLLGLGRGGLSFPT--QTGTRFNNKFSYCLTDRTASAKPSSIV-FGDSAV 299
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YY+ L ISV + G + F L G GG +ID+G T
Sbjct: 300 SRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVT 359
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRAD 296
+ R Y + F G + A E ++ CY ++ HF AD
Sbjct: 360 RLTRPAYVSLRDAF-----RVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRGAD 414
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ N G FC A + + S++G QQQ R V+DL + F P CA
Sbjct: 415 VSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGCA 473
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/339 (28%), Positives = 151/339 (44%), Gaps = 20/339 (5%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR---RPPFRCENGQ 78
++ DTGS + W QC PC +C+ QS P+F+P+ S++Y + CD CR R G
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 79 CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFS 138
C++ + Y G+ G +TET T V V GC +DN AG+L
Sbjct: 61 CLYEVAYGDGSYTVGDFATETLTLGDSTP---VGNVAIGCGHDNEGLFV--GAAGLLALG 115
Query: 139 VSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSH 198
P S Q+ ++ FSYCLV R+ A S L+FG A + S+
Sbjct: 116 GGPLSFPSQISAST---FSYCLV--DRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTF 170
Query: 199 YYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHF 257
YY++L ISV + FA+ +G+GG ++D+G T +Q Y + F +
Sbjct: 171 YYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGA 230
Query: 258 TSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYMYFI-FQNEGYFC 315
S R + ++ CY R +++ F+ P Y I G +C
Sbjct: 231 PSLPRT---SGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYC 287
Query: 316 VAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+A + ++ S++G QQQ TR +D G + F P C
Sbjct: 288 LAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 162/358 (45%), Gaps = 27/358 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V V G P++ +++ DTGS + W QC PC +C+ QS P+++P+ S++Y + CD
Sbjct: 162 EYFSRVGV--GRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCD 219
Query: 64 DLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
CR R G C++ + Y G+ G +TET T V V GC +
Sbjct: 220 SPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTL---GDSAPVSNVAIGCGH 276
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN AG+L P S Q+ +T FSYCLV R+ ++S L+FG D+
Sbjct: 277 DNEGLFV--GAAGLLALGGGPLSFPSQISATT---FSYCLV--DRDSPSSSTLQFG-DSE 328
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
IR ++ YY++L ISV + FA+ G+GG ++D+G T
Sbjct: 329 QPAVTAPLIRS-PRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTR 387
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFRA-YASMTFHFDRADF 297
+Q G Y + F + G Q + AS ++ CY R ++ F+
Sbjct: 388 LQSGAYGALREAFVQ-----GTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGE 442
Query: 298 KVEPTYMYFI-FQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G +C+A + S S++G QQQ R +D T+ F + C
Sbjct: 443 LKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 159/355 (44%), Gaps = 41/355 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + L DTG+ IW QC PC C NQ++P+F+P+ SSTYK IPC I
Sbjct: 90 YVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPI 149
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDNRDF 125
C+ +G Y G +T T + N + ++ GC + N+
Sbjct: 150 CKN-----ADGH------YLG---------VDTLTLNSNNGTPISFKNIVIGCGHRNQG- 188
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
+G ++G +G + P S + QL S+ G FSYCLV + + +S L FG + +
Sbjct: 189 PLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVS--G 246
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ T+ + + Y++SL+ SV DH I + G +ID+G T + P
Sbjct: 247 LGTVSTPIKEENGYFVSLEAFSVGDHIIKLE------NSDNRGNSIIDSGTTMTIL---P 297
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR--FRAYASMTFHFDRADFKVEPTY 303
+V R +R+ + S+ + CY+ S +T HF ++ +
Sbjct: 298 KDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHFSGSEVHLNALN 357
Query: 304 MYFIFQNEGYFCVAI----SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ +E C A +FS ++ G QQ+ +DLN TI F P +C
Sbjct: 358 TFYPITDE-VICFAFVSGGNFSSL-AIFGNVVQQNFLVGFDLNKKTISFKPTDCT 410
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 158/351 (45%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P +S++++ D+GS ++W QC PC C+ QS P+F+P S+TY I CD +
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSV 196
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C +G+C + ++Y G+ G ++ ET TF V + + GC + NR
Sbjct: 197 CDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF----GRVLIRNIAIGCGHMNRGM 252
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
AG+LG S +GQL G FSYCLV R E+T L FG+ A
Sbjct: 253 FI--GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLV--SRGTESTGTLEFGRGAMPVGAA 308
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L + V R+ F L G GG ++DTG T +
Sbjct: 309 WVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPA 368
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
YE F + R + ++ CY + +++F+F P
Sbjct: 369 YEAFRDTFIGQTANLPR---SDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARN 425
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I EG FC A + S S++G QQ+ + D + G + F P C
Sbjct: 426 FLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 162/362 (44%), Gaps = 23/362 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP++S F++ DTGS L W QC PC +C+ Q+ PIF+P SS+++RIPC +
Sbjct: 129 YFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 188
Query: 67 CRRPPF------RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C+ R +C +++ Y G+ + G S++ FT +K + V FGC
Sbjct: 189 CKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAM---SVAFGCGF 245
Query: 121 DNRDFSFDGNIAGI-----LGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM-EATSILR 174
DN L F F+ S+ FSYCLV M ++S L
Sbjct: 246 DNEGLFAGAAGLLGLGAGKLSFPSQIFAS--STNSSTANSFSYCLVDRSNPMTRSSSSLI 303
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
FG A + + + YY ++ +SV ++ + + L ++G+GG +ID+
Sbjct: 304 FGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDS 363
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFD 293
G T Y + F T+ ++ ++ CY + + ++ HF+
Sbjct: 364 GTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSL---FDTCYNFSGKASVDVPALVLHFE 420
Query: 294 R-ADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
AD ++ PT G FC+A + + ++G QQQ R +DL + F P+
Sbjct: 421 NGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQ 480
Query: 352 NC 353
C
Sbjct: 481 QC 482
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 161/359 (44%), Gaps = 24/359 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP + +++ DTGS ++W QC PC C++QS PIFNP S ++ IPC
Sbjct: 109 EYFTRLGV--GTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCS 166
Query: 64 DLICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR C+++++Y G+ +G +TET TF NK+ V GC +
Sbjct: 167 SPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFR-GNKIA---KVALGCGH 222
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N LG F ++ + FSYCLV + +S++ FG DA
Sbjct: 223 HNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHK--FSYCLVDRSASSKPSSMV-FG-DAA 278
Query: 181 IQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIA 238
I R T + + + YY+ L ISV R+ G +P F L G GG +ID+G
Sbjct: 279 ISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSV 338
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRA 295
T + R Y + F G + + E ++ CY + ++ HF A
Sbjct: 339 TRLTRPAYTALRDAF-----RVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRGA 393
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
D + T G FC A + + S++G QQQ R VYDL I F P C
Sbjct: 394 DMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 151/351 (43%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + G+P + ++++ D+GS ++W QC PC C++Q+ P+F+P S+++ +PC +
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSV 201
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C G C + + Y G+ G ++ ET TF V V GC + NR
Sbjct: 202 CERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTF----GRTVVRNVAIGCGHRNRGM 257
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
L SL+GQL G FSYCLV R ++ L FG+ A
Sbjct: 258 FVGAAGLLGL--GGGSMSLVGQLGGQTGGAFSYCLV--SRGTDSAGSLEFGRGAMPVGAA 313
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L + V ++ + F L G GG ++DTG T + R P
Sbjct: 314 WIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTG---TAVTRIP 370
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
+ D G + ++ CY + +++F+F P
Sbjct: 371 TVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARN 430
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I + G FC A + S S++G QQ+ + +D G + F P C
Sbjct: 431 FLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 155/358 (43%), Gaps = 22/358 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP++ +++ DTGS ++W QC PC C+ Q+ +F+P S TY IPC
Sbjct: 117 EYFTRIGV--GTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCG 174
Query: 64 DLICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR P +N C ++++Y G+ G STET TF +N+ V V GC +
Sbjct: 175 APLCRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFR-RNR---VTRVALGCGH 230
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG F + Q FSYCLV + +S++ FG A
Sbjct: 231 DNEGLFTGAAGLLGLGRGRLSFPV--QTGRRFNHKFSYCLVDRSASAKPSSVI-FGDSAV 287
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YYL L ISV + G + F L G GG +ID+G T
Sbjct: 288 SRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSVT 347
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFDRAD 296
+ R Y + F G + A E ++ C+ ++ HF AD
Sbjct: 348 RLTRPAYIALRDAF-----RIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGAD 402
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ T N G FC A + + S++G QQQ R YDL + F P C
Sbjct: 403 VSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 166/377 (44%), Gaps = 38/377 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DVL GTP K L+ DTGS L W QCLPC +CF+Q+ ++P S+++K I C+D
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219
Query: 67 CR-----RPPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLK-----NKLVCVPGV 114
C PP +CE N C + Y ++ +G + ETFT +L + V +
Sbjct: 220 CSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNM 279
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR + LG FS QL+S FSYCLV +S L
Sbjct: 280 MFGCGHWNRGLFSGASGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSNTNVSSKLI 337
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D ++ FV+ + YY+ ++ I V + T+ + +G GG
Sbjct: 338 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGG 397
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMT 289
+ID+G ++ YE++ F E +M + D F
Sbjct: 398 TIIDSGTTLSYFAEPAYEIIKNKFAE--------KMKENYPIFRDFPVLDPCFNVSGIEE 449
Query: 290 FHFDRADFKVE---------PTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFV 338
+ + + P FI+ +E C+AI + ++ S++G +QQQ+ +
Sbjct: 450 NNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHIL 509
Query: 339 YDLNTGTIQFVPENCAN 355
YD + F P CA+
Sbjct: 510 YDTKRSRLGFTPTKCAD 526
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 157/355 (44%), Gaps = 24/355 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V + G+P K +++ DTGS + W QC PC +C+ Q+ PIF P+ SS+Y + C+
Sbjct: 155 YFSRVGI--GSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCET 212
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ C N C++ ++Y G+ G +TET T L V GC +DN
Sbjct: 213 HQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASL---NNVAIGCGHDNE 269
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG F Q+ +++ FSYCLV R+ ++ S L F
Sbjct: 270 GLFVGAAGLLGLGGGSLSFP--SQINASS---FSYCLV--NRDTDSASTLEFNSPIPSHS 322
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+R + YYL + I V + +F + +G GG ++D+G T +Q
Sbjct: 323 VTAPLLRNN-QLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQS 381
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
Y + F G Q + + S ++ CY SR +++FHF +
Sbjct: 382 DVYNSLRDSFVR-----GTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLAL 436
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC A + + S++G QQQ TR YDL+ + F P C
Sbjct: 437 PAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 165/381 (43%), Gaps = 48/381 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP K L+ DTGS L W QC+PC CF Q+ P ++P SS+Y+ I C D
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSR 240
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C PP C EN C + Y ++ +G + ETFT +L K +L V V
Sbjct: 241 CHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENV 300
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG FS QL+S FSYCLV + +S L
Sbjct: 301 MFGCGHWNRGLFHGAAGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSDANVSSKLI 358
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D ++ V + YY+ ++ I V + + + +G+GG
Sbjct: 359 FGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGG 418
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW---EYCYRYDSRFRAYA 286
+ID+G ++ Y+V+ F + + + +D+ E CY
Sbjct: 419 TIIDSGTTLSYFAEPAYQVIKEAF------MAKVKGYPVVKDFPVLEPCYN--------V 464
Query: 287 SMTFHFDRADFKVE---------PTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQD 334
+ D DF + P YFI + C+AI + + S++G +QQQ+
Sbjct: 465 TGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQN 524
Query: 335 TRFVYDLNTGTIQFVPENCAN 355
+YD + F P CA+
Sbjct: 525 FHILYDTKKSRLGFAPTKCAD 545
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 155/351 (44%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P ++++++ D+GS +IW QC PC C++QS P+FNP SS++ + C +
Sbjct: 136 YFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTV 195
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C C G+C + ++Y G+ G ++ ET TF + V GC + N+
Sbjct: 196 CSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITF----GRTLIRNVAIGCGHHNQGM 251
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
L P S +GQL G FSYCLV R +E++ +L FG++A
Sbjct: 252 FVGAAGLLGL--GGGPMSFVGQLGGQTGGAFSYCLV--SRGIESSGLLEFGREAMPVGAA 307
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L + V R+ + F L G GG ++DTG T +
Sbjct: 308 WVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVA 367
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
YE F T+ R + ++ CY +++F+F P
Sbjct: 368 YEAFRDGFIAQTTNLPRA---SGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARN 424
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I + G FC A + S S++G QQ+ + D G + F P C
Sbjct: 425 FLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 106/373 (28%), Positives = 165/373 (44%), Gaps = 32/373 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP + L+ DTGS L W QC+PC +CF Q+ P ++P SS++K I C D
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPR 251
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
C PP C EN C + Y ++ +G + ETFT +L K++ V V
Sbjct: 252 CHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENV 311
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + NR LG FS QL+S FSYCLV + +S L
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSDTNVSSKLI 369
Query: 175 FGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
FG+D ++ V + YY+ ++ I V + T+ L G GG
Sbjct: 370 FGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGG 429
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW---EYCYRYDSRFR-AY 285
++D+G ++ YE++ F + + + +D+ + CY +
Sbjct: 430 TIVDSGTTLSYFAEPSYEIIKDAF------VKKVKGYPVIKDFPILDPCYNVSGVEKMEL 483
Query: 286 ASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLN 342
F+ P YFI + E C+AI + R+ S++G +QQQ+ +YD
Sbjct: 484 PEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQQQNFHILYDTK 543
Query: 343 TGTIQFVPENCAN 355
+ + P CA+
Sbjct: 544 KSRLGYAPMKCAD 556
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 170/360 (47%), Gaps = 27/360 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP++ ++++ DTGS + W QC PC C++Q+ PIFNP+ S+++ + CD
Sbjct: 156 EYFTRIGV--GTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCD 213
Query: 64 DLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C + + C +G C++ +Y G+ ++G +TET TF + V V GC + N
Sbjct: 214 SAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTFGTTS----VANVAIGCGHKN 269
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
LG F Q+ + FSYCLV RE +++ L+FG +
Sbjct: 270 VGLFIGAAGLLGLGAGALSFP--NQIGTQTGHTFSYCLV--DRESDSSGPLQFGP----K 321
Query: 183 RKDMKTIRMFVDRSSH----YYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGCMIDTGA 236
+ +I ++++ H YYLS+ ISV + P F + +G GG +ID+G
Sbjct: 322 SVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGT 381
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDRA 295
+ T + Y+ V D G+ +A ++ CY +F + ++ FHF
Sbjct: 382 VVTRLVTSAYDAVR---DAFVAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVGFHFSNG 438
Query: 296 DFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y I G FC A + + + S++G QQQ R +D + F + C
Sbjct: 439 ASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 101/353 (28%), Positives = 153/353 (43%), Gaps = 21/353 (5%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V + G P +L+ DTGS + W QC PC +C+ Q+ PIF P +S+++ + C+
Sbjct: 149 YFSRVGI--GKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNT 206
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
CR C N C++ ++Y G+ G TET T V V GC ++N
Sbjct: 207 RQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAP----VDNVAIGCGHNNE 262
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG F Q+ +T+ FSYCLV R+ E+ S L F
Sbjct: 263 GLFVGAAGLLGLGGGSLSFP--SQINATS---FSYCLV--DRDSESASTLEFNSTLPPNA 315
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+R + YY+ L +SV + F + +G GG ++D+G T +Q
Sbjct: 316 VSAPLLRNH-HLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQT 374
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPT 302
Y + F + N ++ CY S+ +++FHF P
Sbjct: 375 DVYNSLRDAFVKRTRDLPST---NGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPA 431
Query: 303 YMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y + +EG FC A + + + S++G QQQ TR VYDL + FVP C
Sbjct: 432 KNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 155/355 (43%), Gaps = 31/355 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP F DTGS L+W QC PC C+ Q PIF+P+ SS+Y+ IPC
Sbjct: 88 YLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLSDT 147
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDNRDF 125
C C R G +S ET T V P + GC N
Sbjct: 148 CH----SMRTTSCDVR----------GYLSVETLTLDSTTGYSVSFPKTMIGCGYRNTG- 192
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
+F G +GI+G P SL QL ++ G FSYCL +TS L FG DA I D
Sbjct: 193 TFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCL--GPWLPNSTSKLNFG-DAAIVYGD 249
Query: 186 --MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
M T + D S YYL+L+ SV + I F T+ G +ID+G TF+
Sbjct: 250 GAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYG---GNEGNILIDSGTTFTFL-- 304
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTY 303
PY+V R + + + + ++ CY +T HF AD K+ Y
Sbjct: 305 -PYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHGFEAPLITAHFKGADIKLY--Y 361
Query: 304 M-YFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCANDH 357
+ FI ++G C+A + ++ G QQ+ Y+L T+ F P +C +
Sbjct: 362 ISTFIKVSDGIACLAF-IPSQTAIFGNVAQQNLLVGYNLVQNTVTFKPVDCTKPY 415
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 157/361 (43%), Gaps = 22/361 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP+ + +++ DTGS ++W QC PC C+NQ+ IF+P S T+ +PC
Sbjct: 134 EYFMRLGV--GTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCG 191
Query: 64 DLICRRPPFRCE-----NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+CRR E + C+++++Y G+ G STET TFH V V GC
Sbjct: 192 SRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR----VDHVPLGC 247
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV---YAYREMEATSILRF 175
+DN LG F Q K+ G FSYCLV + + S + F
Sbjct: 248 GHDNEGLFVGAAGLLGLGRGGLSFP--SQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVF 305
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDT 234
G A + + + YYL L ISV R+ G + F L G GG +ID+
Sbjct: 306 GNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDS 365
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFD 293
G T + + Y + F T R ++ ++ C+ ++ FHF
Sbjct: 366 GTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSL---FDTCFDLSGMTTVKVPTVVFHFG 422
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
+ + + EG FC A + + + S++G QQQ R YDL + F+
Sbjct: 423 GGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRA 482
Query: 353 C 353
C
Sbjct: 483 C 483
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 165/360 (45%), Gaps = 38/360 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N Y + + GTP + DTGS + WTQCLPCV+C+ Q+APIF+P+ SST+K
Sbjct: 377 NSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEK--- 433
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDN 122
RC + C + ++Y G ++T+T T H + + I GC +N
Sbjct: 434 ---------RCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNN 484
Query: 123 RDF--SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
F SF+ G +G + P SL+ Q+ GL SYC TS + FG +A
Sbjct: 485 SWFRPSFE----GFVGLNWGPLSLITQMGGEYPGLMSYCFA-----GNGTSKINFGTNAI 535
Query: 181 IQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTF-ALRRNGTGGCMIDTGAI 237
+ + + MFV R YYL+L +SV D RI F AL G +ID+G
Sbjct: 536 VGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALE----GNIVIDSGTT 591
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RAD 296
T+ Y ++R EH + + + CY Y + + +T HF AD
Sbjct: 592 LTYFPES-YCNLVRQAVEHVVP--AVPAADPTGNDLLCY-YSNTTEIFPVITMHFSGGAD 647
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ M+ + G FC+AI ++ + ++ G Q + YD ++ + F P NC+
Sbjct: 648 LVLDKYNMFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 140/317 (44%), Gaps = 48/317 (15%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y Y + + GTP + DTGS LIWTQCLPC++C++Q APIF+P+ SST+K C+
Sbjct: 63 YEYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT 122
Query: 65 LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCV-PGVIFGCSNDNR 123
+ C +++ Y + G ++TET T H + + V P I GCS +N
Sbjct: 123 ----------PDHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNS 172
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
F + +GI+G S SL+ Q+ AY S F K A
Sbjct: 173 GSGFRPSSSGIVGLSRGSLSLISQMGG------------AYPGDGVVSTTMFAKTA---- 216
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTF-ALRRNGTGGCMIDTGAIATFIQ 242
+ YYL+L +SV D RI F AL G +ID+G T+
Sbjct: 217 -----------KRGQYYLNLDAVSVGDTRIETVGTPFHALN----GNIVIDSGTPLTYFP 261
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RADFKVEP 301
+V + + T+ R+ + S + CY Y + + +T HF AD ++
Sbjct: 262 VSYCNLVRKAVERVVTA---DRVVDPSRNDMLCY-YSNTIEIFPVITVHFSGGADLVLDK 317
Query: 302 TYMYFIFQNEGYFCVAI 318
MY G FC+AI
Sbjct: 318 YNMYMELNRGGVFCLAI 334
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 158/358 (44%), Gaps = 21/358 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + G P + + DTGS +IW QC PC C+NQ+ IF+P+ S+TYK +P
Sbjct: 86 YLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTT 145
Query: 67 CRR-PPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSND 121
C+ C + C + I Y G+ + G +S ET T N V + GC +
Sbjct: 146 CQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRN 205
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL---FSYCLVYAYREMEATSILRFGKD 178
N SF+G +GI+G P SL+ QL+ + + FSYCL +S L FG D
Sbjct: 206 NT-VSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASM---SNISSKLNFG-D 260
Query: 179 ANIQRKD--MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A + D + T + D YYL+L+ SV ++RI F +F R G +ID+G
Sbjct: 261 AAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSF--RFGEKGNIIIDSGT 318
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
T + P ++ + R+ + + CYR + HF AD
Sbjct: 319 TLTLL---PNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTFDELNAPVIMAHFSGAD 375
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
K+ FI +G C+A S + G QQ+ YDL + F P +C+
Sbjct: 376 VKLNAVNT-FIEVEQGVTCLAFISSKIGPIFGNMAQQNFLVGYDLQKKIVSFKPTDCS 432
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 170/378 (44%), Gaps = 37/378 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD-- 64
Y +D+ GTP K +L+ DTGS L W QC PC +CF Q+ + P SSTY+ I C D
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPR 230
Query: 65 --LICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGV 114
L+ P + EN C + +YA G++ +G ++ETFT +L K K V V
Sbjct: 231 CQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDV 290
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC + N+ F + +G+LG P S Q++S FSYCL + +S L
Sbjct: 291 MFGCGHWNKGFFYGA--SGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLI 348
Query: 175 FGKDAN-IQRKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRN---- 225
FG+D + ++ + + YYL ++ I V + + T+
Sbjct: 349 FGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAA 408
Query: 226 -GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW--EYCYRYDSRF 282
GG +ID+G+ TF Y+++ F++ + A++D+ CY
Sbjct: 409 DAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKI-----KLQQIAADDFVMSPCYNVSGAM 463
Query: 283 RAYASMTFHFDRADFKVE--PTYMYFI-FQNEGYFCVAISFSDRNS---VVGAWQQQDTR 336
F AD V P YF ++ + C+AI + +S ++G QQ+
Sbjct: 464 MQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFH 523
Query: 337 FVYDLNTGTIQFVPENCA 354
+YD+ + + P CA
Sbjct: 524 ILYDVKRSRLGYSPRRCA 541
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 161/362 (44%), Gaps = 23/362 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP++S F++ DTGS L W QC PC +C+ Q+ PIF+P SS+++RIPC +
Sbjct: 54 YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 113
Query: 67 CRRPPF------RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C+ R +C +++ Y G+ + G S++ FT +K + V FGC
Sbjct: 114 CKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAM---SVAFGCGF 170
Query: 121 DNRDFSFDGNIAGI-----LGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM-EATSILR 174
DN L F F+ S+ FSYCLV M ++S L
Sbjct: 171 DNEGLFAGAAGLLGLGAGKLSFPSQIFAS--STNSSTANSFSYCLVDRSNPMTRSSSSLI 228
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
FG A + + + YY ++ +SV ++ + + L ++G+GG +ID+
Sbjct: 229 FGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDS 288
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFD 293
G T Y + F + ++ ++ CY + + ++ HF+
Sbjct: 289 GTSVTRFPTSVYATIRDAFRNATINLPSAPRYSL---FDTCYNFSGKASVDVPALVLHFE 345
Query: 294 R-ADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
AD ++ PT G FC+A + + ++G QQQ R +DL + F P+
Sbjct: 346 NGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQ 405
Query: 352 NC 353
C
Sbjct: 406 QC 407
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 165/371 (44%), Gaps = 31/371 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD+ GTP + ++ DTGS L W QC PC++CF Q P+F+P AS +Y+ + C D
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPR 211
Query: 67 CR--RPPF------RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC--VPGVIF 116
C PP R + C + Y ++ +G ++ E FT +L V V+F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC + NR LG F+ QL++ FSYCLV + + I+
Sbjct: 272 GCGHSNRGLFHGAAGLLGLGRGALSFA--SQLRAVYGHAFSYCLV-DHGSSVGSKIVFGD 328
Query: 177 KDANIQRKDMK----TIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
DA + + + YY+ L+ + V ++ +P T+ + ++G+GG +I
Sbjct: 329 DDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTII 388
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRAYASMT 289
D+G ++ YEV+ R F E + + D+ CY R
Sbjct: 389 DSGTTLSYFAEPAYEVIRRAFVERM-----DKAYPLVADFPVLSPCYNVSGVERVEVP-E 442
Query: 290 FHFDRADFKVE--PTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTG 344
F AD V P YF+ +G C+A+ + R+ S++G +QQQ+ +YDL
Sbjct: 443 FSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQNN 502
Query: 345 TIQFVPENCAN 355
+ F P CA
Sbjct: 503 RLGFAPRRCAE 513
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 165/369 (44%), Gaps = 47/369 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y + + GTPS + DTGS L W QC PC N CF Q+ P+++P SST+ +PCD
Sbjct: 96 YLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDS 155
Query: 65 LICRRPPFR---CEN-GQCVHRINYAGGASASGLVSTETFTFHLK----NKLVCVPGVIF 116
C + P+ C + G C++ Y + + G +S+++ L N +C F
Sbjct: 156 QPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKIC-----F 210
Query: 117 GCSNDNRDFSFD--GNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC N+ F+ D G GI+G P SL+ QL FSYCL+ + S L+
Sbjct: 211 GCGFQNK-FTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLL--PFSSNSNSKLK 267
Query: 175 FGKDANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGT-GGCMI 232
FG+ A +Q + + + + YYL+L+ I+V G ++ T G +I
Sbjct: 268 FGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITV---------GAKTVKTGQTDGNIII 318
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-----EYCYRYDSRFRAYAS 287
D+G+ T+++ F F S ++ + + + ++C+ Y
Sbjct: 319 DSGSTLTYLEES--------FYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPD 370
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFC--VAISFSDRNSVVGAWQQQDTRFVYDLNTGT 345
+ FHF D ++P + + + C V S D ++ G Q D YD+ G
Sbjct: 371 VVFHFTGGDVVLKPMNTLVLIE-DNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGK 429
Query: 346 IQFVPENCA 354
+ F P +C+
Sbjct: 430 VSFAPTDCS 438
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 144/284 (50%), Gaps = 24/284 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD+ GTP + DTGS LIWTQC PC+ C +Q P F+ S+TY+ +PC
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSR 148
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL-VCVPGVIFGCSNDNR- 123
C C CV++ Y AS +G+++ ETFTF N V + FGC + N
Sbjct: 149 CASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAG 208
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT-SILRFGKDANIQ 182
D + N +G++GF P SL+ QL + FSYCL + AT S L FG AN+
Sbjct: 209 DLA---NSSGMVGFGRGPLSLVSQLGPSR---FSYCLT---SYLSATPSRLYFGVYANLS 259
Query: 183 RKD------MKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ +++ ++ + + Y+LSL+ IS+ + P FA+ +GTGG +ID+
Sbjct: 260 STNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDS 319
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY 278
G T++Q+ YE V R ++ M++ + C+++
Sbjct: 320 GTSITWLQQDAYEAVRRGL---VSAIPLTAMNDTDIGLDTCFQW 360
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 158/356 (44%), Gaps = 18/356 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP + +++ DTGS ++W QCLPC C+ Q+ P+FNP ASSTY+++PC
Sbjct: 152 EYFTRLGV--GTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCA 209
Query: 64 DLICRRPPFR-CENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+C++ C N + C ++++Y G+ G STET TF + + V GC +D
Sbjct: 210 TPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ----VIRRVALGCGHD 265
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N LG F Q + FSYCLV A+S++ FGK A
Sbjct: 266 NEGLFIGAAGLLGLGRGSLSFP--SQTGAQFSKRFSYCLVDRSASGTASSLI-FGKAAIP 322
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGT-FALRRNGTGGCMIDTGAIATF 240
+ + + YY+ L ISV R+ P + F + G GG +ID+G T
Sbjct: 323 KSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTR 382
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDRADFKV 299
+ Y MR D G + ++ CY + ++ FHF
Sbjct: 383 LVDSAYS-TMR--DAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQGGAHIS 439
Query: 300 EPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + FC A + + S++G QQQ R V+D + F +C
Sbjct: 440 LPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 160/355 (45%), Gaps = 25/355 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V + G P +++ DTGS + W QC PC C+ Q+ PIF P +S+++ + C+
Sbjct: 151 YFSRVGI--GRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCET 208
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ C NG C++ ++Y G+ G TET T + + + GC ++N
Sbjct: 209 EQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTS----LGNIAIGCGHNNE 264
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG F QL +++ FSYCLV R+ ++TS L F
Sbjct: 265 GLFIGAAGLLGLGGGSLSFP--SQLNASS---FSYCLV--DRDSDSTSTLDFNSPIT--- 314
Query: 184 KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
D T + + + + +YL L +SV + +F + +G GG ++D+G T +
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRL 374
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
Q Y V+ F + R ++ CY S+ R +++FHF +
Sbjct: 375 QTTVYNVLRDAFVKSTHDLQTAR---GVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I +EG FC A + +D S++G QQQ TR +DL + F P C
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 167/359 (46%), Gaps = 25/359 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + + GTP++ ++++ DTGS ++W QC PC C++Q+ PIFNP++S ++ + CD
Sbjct: 7 EYFTRIGI--GTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCD 64
Query: 64 DLICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C + C G C++ ++Y G+ G +TET TF + + V GC +DN
Sbjct: 65 SAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTS----IQNVAIGCGHDN 120
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
AG+LG S QL + FSYCLV R+ E++ L FG ++
Sbjct: 121 VGLFV--GAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLV--DRDSESSGTLEFGPESVPI 176
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR---RNGTGGCMIDTGAIAT 239
+ + YYLS+ ISV + P + A R G GG +ID+G T
Sbjct: 177 GSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVP-SEAFRIDETTGRGGIIIDSGTAVT 235
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDS-RFRAYASMTFHFDRAD 296
+Q Y+ + F G Q + A ++ CY + + + ++ FHF
Sbjct: 236 RLQTSAYDALRDAFIA-----GTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGA 290
Query: 297 FKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P I + G FC A + +D N S++G QQQ R +D + F + C
Sbjct: 291 GFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 163/364 (44%), Gaps = 44/364 (12%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPF 72
G P + + DTGS L W C PC +C QS PIF+P+ SSTY + C + C
Sbjct: 99 IGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCSE--CN---- 152
Query: 73 RCE--NGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGCSNDNRDFSFDG 129
+C+ NG+C + + Y G S+ G+ + E T + ++ VP +IFGC R FS
Sbjct: 153 KCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCG---RKFSISS 209
Query: 130 N------IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
N I G+ G FSLL FSYC+ + L G AN+Q
Sbjct: 210 NGYPYQGINGVFGLGSGRFSLLPSFGKK----FSYCIGNLRNTNYKFNRLVLGDKANMQ- 264
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAIATFIQ 242
D T+ + + YY++L+ IS+ ++ P F + G +ID+GA T++
Sbjct: 265 GDSTTLNVI---NGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLT 321
Query: 243 RGPYEVV---MRHFDEHFTSFGRQRMHNASEDWEYCYR--YDSRFRAYASMTFHF-DRAD 296
+ +EV+ + + E +Q HN + CY + +TFHF + A
Sbjct: 322 KYGFEVLSFEVENLLEGVLVLAQQDKHNP---YTLCYSGVVSQDLSGFPLVTFHFAEGAV 378
Query: 297 FKVEPTYMYFIFQNEGYFCVAIS----FSDRN---SVVGAWQQQDTRFVYDLNTGTIQFV 349
++ T M FI E FC+A+ F D S +G QQ+ YDLN + F
Sbjct: 379 LDLDVTSM-FIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQ 437
Query: 350 PENC 353
+C
Sbjct: 438 RIDC 441
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 152/356 (42%), Gaps = 17/356 (4%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP K +++ DTGS ++W QC PC C++Q+ P+F+P S ++ I C
Sbjct: 146 EYFTRLGV--GTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCR 203
Query: 64 DLICRR--PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+C R P C++++ Y G+ G STET TF VP V GC +D
Sbjct: 204 SPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR----VPKVALGCGHD 259
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N LG F L+ + FSYCLV + +S++ FG+ A
Sbjct: 260 NEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRK--FSYCLVDRSASSKPSSVV-FGQSAVS 316
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + YYL L ISV R+ G F L G GG +ID+G T
Sbjct: 317 RTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTSVTR 376
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKV 299
+ R Y + F R ++ ++ C+ + ++ HF AD +
Sbjct: 377 LTRRAYVSLRDAFRAGAADLKRAPDYSL---FDTCFDLSGKTEVKVPTVVMHFRGADVSL 433
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
T G FC A + + S++G QQQ R V+D+ I F CA
Sbjct: 434 PATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGCA 489
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 175/368 (47%), Gaps = 46/368 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
+ V V FGTP+++ L+FDTGS + W QCLPC +C+ Q PIF+P S+TY +PC
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHP 179
Query: 66 ICRRPPFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR- 123
C +C NG C++++ Y G+S +G++S ET + L PG FGC N
Sbjct: 180 QCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARAL---PGFAFGCGETNLG 236
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-----YAYREMEATSILRFGKD 178
DF G++ G++G SL Q ++ FSYCL + Y + T+ G D
Sbjct: 237 DF---GDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPAS-GSD 292
Query: 179 -----ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
A IQ++D S Y++ L I V + P F R+GT ++D
Sbjct: 293 GVRYTAMIQKQDYP---------SFYFVDLVSIVVGGFVLPVPPILFT--RDGT---LLD 338
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM-TFHF 292
+G + T++ Y + F T + + A + ++ CY + + + + +F F
Sbjct: 339 SGTVLTYLPPEAYTALRDRFKFTMTQY---KPAPAYDPFDTCYDFAGQNAIFMPLVSFKF 395
Query: 293 -DRADFKVEPTYMYFIFQNEGYFCVA-ISFSDRNS-----VVGAWQQQDTRFVYDLNTGT 345
D + F + P + IF ++ ++F R S +VG QQ++T +YD+
Sbjct: 396 SDGSSFDLSP-FGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEK 454
Query: 346 IQFVPENC 353
I FV +C
Sbjct: 455 IGFVSGSC 462
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 170/363 (46%), Gaps = 31/363 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ +++ GTP L DTGS LIW QC PC+ C+ Q P+F+P SSTY I CD +
Sbjct: 68 HLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPL 127
Query: 67 CRRPPFRCENGQCV--HRINYAGG----ASASGLVSTETFTFHLKN-KLVCVPGVIFGCS 119
C + + G C R NY G + G+++ +T TF K V + +FGC
Sbjct: 128 CH----KLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCG 183
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILRFGKD 178
++N F+ + G++G P SL+ Q+ G FS CLV +++ +S + FGK
Sbjct: 184 HNNTG-GFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKG 242
Query: 179 ANIQRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRN-GTGGCMIDTGA 236
+ + + T + ++ + Y+++L ISV D F + G ++D+G
Sbjct: 243 SQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTY-------FPMNSTIGKANMLVDSGT 295
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ + Y+ V + + S + CYR + + ++TFHF A+
Sbjct: 296 PPILLPQQLYDKVFAEVRNKVAL--KPITDDPSLGTQLCYRTQTNLKG-PTLTFHFVGAN 352
Query: 297 FKVEP--TYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPE 351
+ P T++ Q +G FC+AI ++ NS V G + Q + +DL+ + F P
Sbjct: 353 VLLTPIQTFIPPTPQTKGIFCLAI-YNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPT 411
Query: 352 NCA 354
+C
Sbjct: 412 DCT 414
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 177/381 (46%), Gaps = 52/381 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC-------LPCVNCFNQSAPIFNPNASSTYKR 59
+++ V GTP + L+ DTGS LIWTQC + Q P++ P SS++
Sbjct: 84 HSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAY 143
Query: 60 IPCDDLICRRPPFR----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+PC D +C+ F N +C++ Y G A A G++++ETFTF + K V +P +
Sbjct: 144 LPCSDRLCQEGQFSYKNCARNNRCMYDELY-GSAEAGGVLASETFTFGVNAK-VSLP-LG 200
Query: 116 FGCSNDNRDFSFDGNI---AGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATS 171
FGC + G++ +G++G S SL+ QL FSYCL +A R+ TS
Sbjct: 201 FGCGALSA-----GDLVGASGLMGLSPGIMSLVSQLSVPR---FSYCLTPFAERK---TS 249
Query: 172 ILRFGKDANIQR-------KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-R 223
L FG A+++R + +R +++YY+ L +S+ R+ + + +
Sbjct: 250 PLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIK 309
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS----EDWEYCYRYD 279
+G+GG ++D+G+ ++++ + V + E R + N + +D+E C+
Sbjct: 310 PDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAV----RLPVANGTDEDYDDYELCFALP 365
Query: 280 SRFRAYA----SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQ 332
+ A + HFD P YF G C+A+ S S++G QQ
Sbjct: 366 TGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGNVQQ 425
Query: 333 QDTRFVYDLNTGTIQFVPENC 353
Q+ ++D+ F P C
Sbjct: 426 QNMHVLFDVRNQKFSFAPTKC 446
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 164/373 (43%), Gaps = 29/373 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V G ++ L D + L+W QC P F Q P F P S +++R+P ++
Sbjct: 84 YSVVTSVGTGAGRRTYVLALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFRRLPGNN 143
Query: 65 LICRRPPF---RCENGQC-VHRINYAGGASASGLVSTETFTFHLKNKLVC-VPGVIFGCS 119
C P R C H I G A A G++S ET F + V GV+ GC+
Sbjct: 144 AFCLPAPRGHRRTVQDPCKFHSIRLDGSADARGVLSNETLAFAASGQQQTEVTGVVIGCT 203
Query: 120 NDNRDFSFD--GNIAGILGFSVSPFSLLGQLKSTAQGL-----FSYCL-VYAYREMEATS 171
++++ F+F+ G +AG+LG SL+ L G FSYCL + + +
Sbjct: 204 HNSKGFNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPSHGSSSSDHHT 263
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNG 226
LRF D + + T M++D ++ Y++SL ISVA + F +G
Sbjct: 264 FLRFDDDVPNTQHMVSTKIMYMDSTTSRDFRAYFVSLTGISVAGKPLQDVKELFKRHVHG 323
Query: 227 ---TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-F 282
T GC D G + Y + H G Q + S + C+R S+ +
Sbjct: 324 QVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPLGLQIV---SGQYHLCFRATSQLW 380
Query: 283 RAYASMTFHFDRADFK-VEPTYMYFIFQNEGY-FCVAISFSDRNSVVGAWQQQDTRFVYD 340
+ ++ F + + V P F+ GY C+A+ S +++GA QQ D RFVYD
Sbjct: 381 QHLPTVMLQFAETEARLVLPPQRLFV--AVGYDICLAVVRSYDITIIGAMQQVDKRFVYD 438
Query: 341 LNTGTIQFVPENC 353
+ G I FVPEN
Sbjct: 439 VRHGRIYFVPENA 451
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 160/340 (47%), Gaps = 17/340 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP ++ ++ DTGS ++W QCLPC +C+ Q+ P+FNP+ SST++ I C +
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSL 140
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD- 124
C++ R C QC+++++Y G+ G STET +F V V GC ++N+
Sbjct: 141 CQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGSN----AVNSVAIGCGHNNQGL 196
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F+ + G+ +S S +GQL + +FSYCL RE + L FG A
Sbjct: 197 FTGAAGLLGLGKGLLSFPSQVGQLYGS---VFSYCL--PTRESTGSVPLIFGNQAVASNA 251
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIATFIQR 243
T+ + YY+ + I V + G+ +L G GG ++D+G T +
Sbjct: 252 QFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVT 311
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFKVEPT 302
Y + F S +M + ++ CY R +++F F+ P
Sbjct: 312 SAYNPMRDAFRAGMPS--DAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPA 369
Query: 303 YMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYD 340
+ N G +C+A + + N S++G QQQ R +D
Sbjct: 370 QNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFD 409
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 160/340 (47%), Gaps = 17/340 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP ++ ++ DTGS ++W QCLPC +C+ Q+ P+FNP+ SST++ I C +
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSL 140
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD- 124
C++ R C QC+++++Y G+ G STET +F V V GC ++N+
Sbjct: 141 CQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGSN----AVNSVAIGCGHNNQGL 196
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F+ + G+ +S S +GQL + +FSYCL RE + L FG A
Sbjct: 197 FTGAAGLLGLGKGLLSFPSQVGQLYGS---VFSYCL--PTRESTGSVPLIFGNQAVASNA 251
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIATFIQR 243
T+ + YY+ + I V + G+ +L G GG ++D+G T +
Sbjct: 252 QFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVT 311
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFKVEPT 302
Y + F S +M + ++ CY R +++F F+ P
Sbjct: 312 SAYNPMRDAFRAGMPS--DAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPA 369
Query: 303 YMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYD 340
+ N G +C+A + + N S++G QQQ R +D
Sbjct: 370 QNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFD 409
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 152/357 (42%), Gaps = 18/357 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP + +++ DTGS ++W QC PC C+ QS P+F+P S ++ I C
Sbjct: 125 EYFTRIGV--GTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACR 182
Query: 64 DLICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+C R P + C+++++Y G+ G STET TF V V GC +
Sbjct: 183 SPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTR----VARVALGCGH 238
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG F Q FSYCLV + +S++ FG A
Sbjct: 239 DNEGLFVGAAGLLGLGRGRLSFP--SQTGRRFNHKFSYCLVDRSASSKPSSMV-FGDSAV 295
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + + YY+ L ISV R+ G F L + G GG +ID+G T
Sbjct: 296 SRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVT 355
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFK 298
+ R Y F ++ R + ++ C+ + ++ HF AD
Sbjct: 356 RLTRPAYIAFRDAFRAGASNLKRAPQFSL---FDTCFDLSGKTEVKVPTVVLHFRGADVS 412
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + G FC+A + + S++G QQQ R VYDL + F P CA
Sbjct: 413 LPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 169/361 (46%), Gaps = 32/361 (8%)
Query: 11 VLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP 70
V G SK+ ++ DTGS L W QC PC++C+NQ PIF P+ SS+Y+ + C+ C+
Sbjct: 67 VTMGLGSKNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSL 126
Query: 71 PFRCEN---------GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
F N C + +NY G+ +G + E +F V V +FGC +
Sbjct: 127 QFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFG----GVSVSDFVFGCGRN 182
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N+ G ++G++G S SL+ Q +T G+FSYCL E ++ L G ++++
Sbjct: 183 NKGLF--GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCL--PTTEAGSSGSLVMGNESSV 238
Query: 182 --QRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
+ RM + S+ Y L+L I V + AP +F G GG +ID+G +
Sbjct: 239 FKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALK-APLSF-----GNGGILIDSGTV 292
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
T + Y+ + F + FT F + + YD S+ F A
Sbjct: 293 ITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFE-GNAQL 351
Query: 298 KVEPTYMYFIFQNEG-YFCVAI-SFSDR--NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+ T +++ + + C+A+ S SD +++G +QQ++ R +YD + F E C
Sbjct: 352 NVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPC 411
Query: 354 A 354
+
Sbjct: 412 S 412
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 167/367 (45%), Gaps = 37/367 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+++ V GTP + ++ D GS L+WTQC Q P+F+ SS++ +PCD +
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKL 166
Query: 67 CRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C F C + +C + +Y G +A+G+++TETFTF + + + FGC
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDY-GIMTATGVLATETFTFGAHHGVSA--NLTFGCGK--- 220
Query: 124 DFSFDGNIA---GILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDA 179
+G IA GILG S P S+L QL T FSYCL +A R+ TS + FG A
Sbjct: 221 --LANGTIAEASGILGLSPGPLSMLKQLAITK---FSYCLTPFADRK---TSPVMFGAMA 272
Query: 180 NIQR----KDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
++ + ++TI + + +YY+ + +SV R+ T A++ +GTGG ++D
Sbjct: 273 DLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLD 332
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----YASMT 289
+ ++ + + + E R + +D+ C+ +
Sbjct: 333 SATTLAYLVEPAFTELKKAVMEGIKLPVANR---SVDDYPVCFELPRGMSMEGVQVPPLV 389
Query: 290 FHFDRADFKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
HFD P YF + G C+A+ F +V+G QQQ+ +YD+
Sbjct: 390 LHFDGDAEMSLPRDNYFQEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKF 449
Query: 347 QFVPENC 353
+ P C
Sbjct: 450 SYAPTKC 456
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 161/347 (46%), Gaps = 13/347 (3%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPF 72
G+P +S +L DTGS + W QC PC +C++Q PI++P+ SS+Y+R+ C +C+ +
Sbjct: 51 IGSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDY 110
Query: 73 R-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD-FSFDGN 130
C+ C +R+ Y +++SG + E+F N + + FGC + N F +
Sbjct: 111 SACQGMGCSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFGCGHSNSGLFRGEAG 169
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA-TSILRFGKDANIQRKDMKTI 189
+ G+ G ++S FS Q+ ++ FSYCLV Y ++++ +S L FG+ A +
Sbjct: 170 LLGMGGGTLSFFS---QIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPL 226
Query: 190 RMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVV 249
+ YY L ISV + P FAL NGTGG ++D+G T + Y V+
Sbjct: 227 LKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVL 286
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDR-ADFKVEPTYMYFI 307
D + + + C+ + S+ HFD D + +
Sbjct: 287 R---DAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIP 343
Query: 308 FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
G FC+A + S SV+G QQQ R +DL I P C
Sbjct: 344 VDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 154/351 (43%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P ++++++ D+GS ++W QC PC C+ QS P+F+P SS++ + C +
Sbjct: 143 YFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDV 202
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C G+C + ++Y G+ G ++ ET T V + V GC + N+
Sbjct: 203 CDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTV----GQVMIRDVAIGCGHTNQGM 258
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
AG+LG S +GQL G FSYCLV R +T L FG+ A
Sbjct: 259 FI--GAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLV--SRGTGSTGALEFGRGALPVGAT 314
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
++ S YY+ L I V R+ TF L GT G ++DTG T
Sbjct: 315 WISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTAA 374
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDRADFKVEPTYM 304
Y F ++ R ++ CY + +++F+F P
Sbjct: 375 YVAFRDSFTAQTSNLPRA---PGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARN 431
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I G FC+A + S S++G QQ+ + +D G + F P C
Sbjct: 432 FLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 149/351 (42%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P +S++++ D+GS ++W QC PC C++Q+ P+F+P S+++ + C +
Sbjct: 43 YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAV 102
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C +G+C + ++Y G+ G ++ ET TF V V GC + NR
Sbjct: 103 CDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTF----GRTVVRNVAIGCGHSNRGM 158
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
L S +GQL FSYCLV R L FG +A
Sbjct: 159 FVGAAGLLGL--GGGSMSFMGQLSGQTGNAFSYCLV--SRGTNTNGFLEFGSEAMPVGAA 214
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L + V D R+ + F L G+GG ++DTG T
Sbjct: 215 WIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVA 274
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
YE F E + R + ++ CY +++F+F P
Sbjct: 275 YEAFRNAFIEQTQNLPRA---SGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANN 331
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I + G FC A + S S++G QQ+ + D + F P C
Sbjct: 332 FLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 102/347 (29%), Positives = 161/347 (46%), Gaps = 13/347 (3%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPF 72
G P +S +L DTGS + W QC PC +C++Q PI++P+ SS+Y+R+ C +C+ +
Sbjct: 18 IGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDY 77
Query: 73 R-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD-FSFDGN 130
C+ C +R+ Y +++SG + E+F N + + FGC + N F +
Sbjct: 78 SACQGMGCSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFGCGHSNSGLFRGEAG 136
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA-TSILRFGKDANIQRKDMKTI 189
+ G+ G ++S FS Q+ ++ FSYCLV Y ++++ +S L FG+ A +
Sbjct: 137 LLGMGGGTLSFFS---QIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPL 193
Query: 190 RMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVV 249
++ YY L ISV + P FAL NGTGG ++D+G T + Y V+
Sbjct: 194 LKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTRVVPPAYAVL 253
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYMYFI- 307
D + + + C+ + S+ HFD V P I
Sbjct: 254 R---DAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDMVLPGGNILIP 310
Query: 308 FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
G FC+A + S SV+G QQQ R +DL I P C
Sbjct: 311 VDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|357116104|ref|XP_003559824.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 489
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 163/417 (39%), Gaps = 70/417 (16%)
Query: 7 YTVDVLFGTPSKSEF--LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y+V V G+ F L D L W QCLP Q APIF+P S YK + DD
Sbjct: 74 YSVRVGVGSGDTQHFYRLAVDMVGNLTWMQCLPSNPKLKQDAPIFDPKTSHRYKNVGHDD 133
Query: 65 LICRRP--PFRCENGQCVHRINYAGGASASGLVSTETFTFHL------------------ 104
+C+ P P E+ +C I + A A+G + + F F
Sbjct: 134 PLCKAPFTPRPTEH-RCGFNIRFRAEAMATGYLGKDEFAFGAGSGSRTTNVDGLVFGCAH 192
Query: 105 -------KNKLVCVP-----------------------GVIFGCSNDNRDFSFDGNIAGI 134
K+ L +P G++FGC++ + +AGI
Sbjct: 193 RINGWNNKDVLAGIPSLNRRPTSFVRQLSTHGGGGAVDGLVFGCAHAINGWKNQDVLAGI 252
Query: 135 LGFSVSPFSLLGQLKSTAQGL---FSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRM 191
L + P S + QL G FSYCLV + LRFG D T +
Sbjct: 253 LSLNRRPTSFVRQLSVHGGGTTPRFSYCLVDHKKYPNKHGFLRFGADVPDHSHAQSTALL 312
Query: 192 FVDRSS---HYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGCMIDTGAIATFIQRGPY 246
+ + YY+ L +SVA ++ G P F RR+ GGC +D G T PY
Sbjct: 313 YGEPDGGFGMYYVRLVGVSVAGRKLTGITPKMFQRDRRSRLGGCYVDVGNPTTRFAEAPY 372
Query: 247 EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR--FRAYASMTFHF---DRADFKVEP 301
+++ H S G R C R S S+T HF + A +++
Sbjct: 373 DILEAGVAAHMASHGLHR--TPVPGHRLCVRGTSPEVMPKLPSITLHFAEDEAAGLEIKS 430
Query: 302 TYMYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
++ ++ G Y C + + +V+G QQ DTRF +DL + F PE+C D
Sbjct: 431 RLLFATVKHAGADYVCFIVQRAPVTTVIGGHQQVDTRFTFDLEENRLFFAPEDCHGD 487
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 156/351 (44%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P +S++++ D+GS ++W QC PC C++QS P+F+P S+++ + C +
Sbjct: 140 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSV 199
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C G+C + ++Y G+ G ++ ET TF V V GC + NR
Sbjct: 200 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRSVAIGCGHRNRGM 255
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
AG+LG S +GQL G FSYCLV R +++ L FG++A
Sbjct: 256 FV--GAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLV--SRGTDSSGSLVFGREALPAGAA 311
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L + V R+ + F L G GG ++DTG T +
Sbjct: 312 WVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLA 371
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
Y+ F + R ++ CY +++F+F P
Sbjct: 372 YQAFRDAFLAQTANLPRA---TGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARN 428
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I + G FC A + S S++G QQ+ + +D G + F P C
Sbjct: 429 FLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 164/371 (44%), Gaps = 31/371 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD+ GTP + ++ DTGS L W QC PC++CF Q P+F+P S +Y+ + C D
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPR 211
Query: 67 CR--RPPF------RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC--VPGVIF 116
C PP R + C + Y ++ +G ++ E FT +L V V+F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC + NR LG F+ QL++ FSYCLV + + I+
Sbjct: 272 GCGHSNRGLFHGAAGLLGLGRGALSFA--SQLRAVYGHAFSYCLV-DHGSSVGSKIVFGD 328
Query: 177 KDANIQRKDMK----TIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
DA + + + YY+ L+ + V ++ +P T+ + ++G+GG +I
Sbjct: 329 DDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTII 388
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRAYASMT 289
D+G ++ YEV+ R F E + + D+ CY R
Sbjct: 389 DSGTTLSYFAEPAYEVIRRAFVERM-----DKAYPLVADFPVLSPCYNVSGVERVEVP-E 442
Query: 290 FHFDRADFKVE--PTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTG 344
F AD V P YF+ +G C+A+ + R+ S++G +QQQ+ +YDL
Sbjct: 443 FSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQNN 502
Query: 345 TIQFVPENCAN 355
+ F P CA
Sbjct: 503 RLGFAPRRCAE 513
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 174/361 (48%), Gaps = 34/361 (9%)
Query: 11 VLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP 70
V G S + ++ DTGS L W QC PC++C+NQ PIF P+ SS+Y+ + C+ C+
Sbjct: 67 VTMGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSL 126
Query: 71 PFRCEN--------GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
F N C + +NY G+ +G + E +F V V +FGC +N
Sbjct: 127 QFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFG----GVSVSDFVFGCGRNN 182
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ G ++G++G S SL+ Q +T G+FSYCL E A+ L G ++++
Sbjct: 183 KGLF--GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCL--PTTESGASGSLVMGNESSVF 238
Query: 183 RK--DMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + RM + S+ Y L+L I V + +F G GG +ID+G +
Sbjct: 239 KNVTPITYTRMLPNPQLSNFYILNLTGIDV--DGVALQVPSF-----GNGGVLIDSGTVI 291
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RADF 297
T + Y+ + F + FT F + + YD + +++ HF+ A+
Sbjct: 292 TRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDE--VSIPTISMHFEGNAEL 349
Query: 298 KVEPTYMYFIFQNEG-YFCVAI-SFSDR--NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
KV+ T +++ + + C+A+ S SD +++G +QQ++ R +YD + F E+C
Sbjct: 350 KVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESC 409
Query: 354 A 354
+
Sbjct: 410 S 410
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 167/359 (46%), Gaps = 25/359 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + + GTP++ ++++ DTGS ++W QC PC C++Q+ PIFNP++S ++ + CD
Sbjct: 153 EYFTRIGI--GTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCD 210
Query: 64 DLICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C + C G C++ ++Y G+ G +TET TF + + V GC +DN
Sbjct: 211 SAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTS----IQNVAIGCGHDN 266
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
AG+LG S QL + FSYCLV R+ E++ L FG ++
Sbjct: 267 VGLFV--GAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLV--DRDSESSGTLEFGPESVPI 322
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR---RNGTGGCMIDTGAIAT 239
+ + YYLS+ ISV + P + A R G GG +ID+G T
Sbjct: 323 GSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVP-SEAFRIDETTGRGGIIIDSGTAVT 381
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDS-RFRAYASMTFHFDRAD 296
+Q Y+ + F G Q + A ++ CY + + + ++ FHF
Sbjct: 382 RLQTSAYDALRDAFIA-----GTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGA 436
Query: 297 FKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P I + G FC A + +D N S++G QQQ R +D + F + C
Sbjct: 437 GFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 157/354 (44%), Gaps = 13/354 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP + +L+ DTGS ++W QC PCV+C++Q +F+P SSTY + C+
Sbjct: 37 YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSRQ 96
Query: 67 CRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKN--KLVCVPGVIFGCSNDNR 123
C C +C+++++Y G+ ++G +T+ + + + V + + GC +DN
Sbjct: 97 CLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNE 156
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ LG F Q+ S G FSYCL + S L FG DA +
Sbjct: 157 GYFVGAAGLLGLGKGPLSFP--NQINSENGGRFSYCLTGRDTDSTERSSLIFG-DAAVPP 213
Query: 184 KDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
++ + S+ YYL + ISV + F L G GG +ID+G T +
Sbjct: 214 AGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTRL 273
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRAYASMTFHFD-RADFKV 299
Q Y + F + + ++ CY D ++T HF AD K+
Sbjct: 274 QNAAYASLREAFRAGTSDL---VLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADLKL 330
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ N FC+A + + S++G QQQ R +YD + FVP C
Sbjct: 331 PASNYLVPVDNSSTFCLAFAGTTGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQC 384
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 161/358 (44%), Gaps = 48/358 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N Y + + GTP + ++DTGS L+WTQCLPC++C+ Q P+F+P+ S+++K + C+
Sbjct: 21 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 80
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
CR L+ T T + ++FGC ++N
Sbjct: 81 SQQCR-------------------------LLDTPTSILN----------IVFGCGHNNS 105
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQG--LFSYCLVYAYREMEATSILRFGKDANI 181
+F+ N G+ G P SL Q+ ST FS CLV + TS + FG +A +
Sbjct: 106 G-TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEV 164
Query: 182 QRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
D+ + + D ++Y+++L ISV D F+ + + G ID G T
Sbjct: 165 SGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATK---GNVFIDAGTPPTL 221
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ R Y +++ E + + + + + CYR + +T HFD AD +++
Sbjct: 222 LPRDFYNRLVQGVKE---AIPMEPVQDPDLQPQLCYRSATLIDG-PILTAHFDGADVQLK 277
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNS-VVGAWQQQDTRFVYDLNTGTIQFVPENCANDH 357
P FI EG +C A+ D ++ + G + Q + +DL+ + F +C
Sbjct: 278 P-LNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTKQQ 334
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 149/363 (41%), Gaps = 39/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + + D L+WTQC C CF Q P+F+P AS+TY+ PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPL 110
Query: 67 CRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C P C C + + G + G V T+TF + FGC +
Sbjct: 111 CESIPSDVRNCSGNVCAYEASTNAGDTG-GKVGTDTFAVGTAKA-----SLAFGCVVAS- 163
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
D G +GI+G +P+SL+ Q T FSYCL A + S L G A +
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCL--APHDAGKNSALFLGSSAKLAG 218
Query: 184 KDMKTIRMFV-------DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
FV D S++Y + L+ + D I P + ++DT +
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFS 270
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+F+ G Y+ V + + G M E ++ C+ A + F F
Sbjct: 271 PISFLVDGAYQAVKKAVT---VAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGA 327
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDR------NSVVGAWQQQDTRFVYDLNTGTIQFVP 350
P Y + G C+A+ S R S++G+ QQ++ F++DL+ T+ F P
Sbjct: 328 AMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387
Query: 351 ENC 353
+C
Sbjct: 388 ADC 390
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 165/374 (44%), Gaps = 41/374 (10%)
Query: 4 NYFYTVDV---LFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
NY T+ + G+P+ + ++ DTGS L W QC PC C+ Q P+F+P S+TY +
Sbjct: 184 NYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAV 243
Query: 61 PCDDLICRRP-------PFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCV 111
C+ C P C G +C + + Y G+ + G+++T+T + +
Sbjct: 244 RCNASACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS----L 299
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
G +FGC NR G AG++G + SL+ Q G+FSYCL A +A+
Sbjct: 300 DGFVFGCGLSNRGLF--GGTAGLMGLGRTELSLVSQTALRYGGVFSYCLP-ATTSGDASG 356
Query: 172 ILRFGKDANIQRKDMKT--IRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGT 227
L G DA+ R RM D + Y+L++ G A G AL G
Sbjct: 357 SLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNV---------TGAAVGGTALAAQGL 407
Query: 228 GG--CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
G +ID+G + T + Y V F F + G S + CY
Sbjct: 408 GASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSI-LDTCYDLTGHDEVK 466
Query: 286 AS-MTFHFD-RADFKVEPTYMYFIFQNEG-YFCVA---ISFSDRNSVVGAWQQQDTRFVY 339
+T + A+ V+ M F+ + +G C+A +S+ D+ ++G +QQ++ R VY
Sbjct: 467 VPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVY 526
Query: 340 DLNTGTIQFVPENC 353
D + F E+C
Sbjct: 527 DTVGSRLGFADEDC 540
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 150/363 (41%), Gaps = 39/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + + D L+WTQC C CF Q P+F+P AS+TY+ PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110
Query: 67 CRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C P C C ++ + G + G V T+TF + FGC +
Sbjct: 111 CESIPSDSRNCSGNVCAYQASTNAGDTG-GKVGTDTFAVGTAKA-----SLAFGCVVAS- 163
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
D G +GI+G +P+SL+ Q T FSYCL A + S L G A +
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCL--APHDAGRNSALFLGSSAKLAG 218
Query: 184 KDMKTIRMFV-------DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
FV D S++Y + L+ + D I P + ++DT +
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFS 270
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+F+ G Y+ V + + G M E ++ C+ A + F F
Sbjct: 271 PISFLVDGAYQAVKKAVTA---AVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGA 327
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDR------NSVVGAWQQQDTRFVYDLNTGTIQFVP 350
P Y + G C+A+ S R S++G+ QQ++ F++DL+ T+ F P
Sbjct: 328 AMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387
Query: 351 ENC 353
+C
Sbjct: 388 ADC 390
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 159/355 (44%), Gaps = 25/355 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V + G P +++ DTGS + W QC PC C+ Q+ P F P +S+++ + C+
Sbjct: 151 YFSRVGI--GRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCET 208
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ C NG C++ ++Y G+ G TET T + + + GC ++N
Sbjct: 209 EQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTS----LGNIAIGCGHNNE 264
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG F QL +++ FSYCLV R+ ++TS L F
Sbjct: 265 GLFIGAAGLLGLGGGSLSFP--SQLNASS---FSYCLV--DRDSDSTSTLDFNSPIT--- 314
Query: 184 KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
D T + + + + +YL L +SV + +F + +G GG ++D+G T +
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRL 374
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
Q Y V+ F + R ++ CY S+ R +++FHF +
Sbjct: 375 QTTVYNVLRDAFVKSTHDLQTAR---GVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I +EG FC A + +D S++G QQQ TR +DL + F P C
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 21 FLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP---PFRCENG 77
FLL DTGS + W QC PC C+ Q +F P S+TYK +PC+ +C++ C N
Sbjct: 2 FLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNS 61
Query: 78 QCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSNDNRDFSFDGNIAGILG 136
C + ++Y ++ G + ET T + LV VP FGC + N+ F+G AG++G
Sbjct: 62 SCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGL-FNG-AAGLMG 119
Query: 137 FSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRS 196
S Q +FSYCL + + IL FG+ A + D++ + VD S
Sbjct: 120 LGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIP-SGILHFGEAAMLDY-DVRFTPL-VDSS 176
Query: 197 ---SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHF 253
S Y++S+ I+V D + + M+D+G + + ++ YE + F
Sbjct: 177 SGPSQYFVSMTGINVGDELLPI-----------SATVMVDSGTVISRFEQSAYERLRDAF 225
Query: 254 DEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM-TFHF-DRADFKVEPTYMYFIFQNE 311
+ G Q + + ++ C+R + + T HF D A+ ++ P ++ + ++
Sbjct: 226 TQILP--GLQTAVSVAP-FDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPV-DD 281
Query: 312 GYFCVAISFSDRN-SVVGAWQQQDTRFVYDL 341
G C A + S SV+G +QQQ+ RFVYD+
Sbjct: 282 GVMCFAFAPSSSGRSVLGNFQQQNLRFVYDI 312
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 160/370 (43%), Gaps = 30/370 (8%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA--PIFNPNASSTYKRI 60
K + V+ G P + + DTGS L+W QC PC +C + P+FNP SST+
Sbjct: 92 KTSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVEC 151
Query: 61 PCDDLICRRPP-FRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFG 117
CDD CR P C + +CV+ Y G + G+++ E TF N V + FG
Sbjct: 152 SCDDRFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFG 211
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C +N + + + GILG P SL QL S FSYC+ + + L G+
Sbjct: 212 CGYENGE-QLESHFTGILGLGAKPTSLAVQLGSK----FSYCIGDLANKNYGYNQLVLGE 266
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
DA+I D I F +S YY++L+ ISV D ++ P F RR G ++D+G +
Sbjct: 267 DADI-LGDPTPIE-FETENSIYYMNLEGISVGDTQLNIEPVVFK-RRGPRTGVILDSGTL 323
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHF-DR 294
T++ ++ R S ++ CY R + +TFHF
Sbjct: 324 YTWLA----DIAYRELYNEIKSILDPKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGG 379
Query: 295 ADFKVEPTYMYFIFQNE---GYFCVAISFSDRN-------SVVGAWQQQDTRFVYDLNTG 344
A+ +E T M++ FC+++ + + + +G QQ YDL
Sbjct: 380 AELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEK 439
Query: 345 TIQFVPENCA 354
I +C
Sbjct: 440 NIYLQRIDCV 449
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 152/351 (43%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P + ++++ D+GS ++W QC PC C+ QS P+F+P S +Y + C +
Sbjct: 132 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSV 191
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C +G C + + Y G+ G ++ ET TF V V GC + NR
Sbjct: 192 CDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF----AKTVVRNVAMGCGHRNRGM 247
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
F G + S S +GQL G F YCLV R ++T L FG++A
Sbjct: 248 -FIGAAGLLGIGGGS-MSFVGQLSGQTGGAFGYCLV--SRGTDSTGSLVFGREALPVGAS 303
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L+ + V RI G F L G GG ++DTG T + G
Sbjct: 304 WVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGA 363
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
Y F + R + ++ CY +++F+F P
Sbjct: 364 YAAFRDGFKSQTANLPRA---SGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARN 420
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + G +C A + S S++G QQ+ + +D G + F P C
Sbjct: 421 FLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 171/378 (45%), Gaps = 33/378 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP+ L+ DTGS + W QC+PC +C P FNP SS++ ++PC
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 197
Query: 67 CRR-----PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKN----KLVCVPGVIF 116
C PF +G+ C+ I Y G+ +SGL++ ET + N + V + +
Sbjct: 198 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 257
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC++ +R+ +G+LG P S QL S FS+C + ++ ++ FG
Sbjct: 258 GCADIDRE-GLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFG 316
Query: 177 KDANIQRKDMKTIRMFVDRS------SHYYLSLQDISVADHRIGFAPGTFALRR-NGTGG 229
+++I ++ + + + +YY+ L ISV + R+ + F + + G+GG
Sbjct: 317 -ESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGSGG 375
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----- 284
+ID+G T++++ ++ + R F + + + + + CY S A
Sbjct: 376 TIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKV---DDNSGFTPCYNITSGTAALESTI 432
Query: 285 YASMTFHFDRADFKVEPTYMYFI----FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFV 338
S+T HF V P I + + C+A S +++G +QQQ+
Sbjct: 433 LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGNYQQQNLWVE 492
Query: 339 YDLNTGTIQFVPENCAND 356
YDL + P CA D
Sbjct: 493 YDLEKLRLGIAPAQCATD 510
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 166/365 (45%), Gaps = 35/365 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V++ G+P ++ ++ DTGS L+W QCLPC+NCF QS F+P S ++K + C
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCG--- 160
Query: 67 CRRPPFRCENG-------QCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFGC 118
P + NG Q +++ Y GG S+ G+++ E+ F L + + FGC
Sbjct: 161 --FPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGC 218
Query: 119 SNDNRDFSFDGNIAGILGFSVSP-FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
+ N + D G+ G P ++ QL + FSYC+ + + L G+
Sbjct: 219 GHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNK----FSYCIGDINNPLYTHNHLVLGQ 274
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
+ I+ F HYY++LQ ISV + P F + +G+GG +ID+G
Sbjct: 275 GSYIEGDSTPLQIHF----GHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMT 330
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR--YDSRFRAYASMTFHF-DR 294
T + G +E++ + +R+ + C++ + ++TFHF
Sbjct: 331 YTKLANGGFELLYDEIVDLMKGL-LERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGG 389
Query: 295 ADFKVEPTYMYFIFQNEG--YFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQF 348
AD +E + F+ G FC+AI S+ SV+G QQ+ +DL + F
Sbjct: 390 ADLVLESGSL---FRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFF 446
Query: 349 VPENC 353
+C
Sbjct: 447 RRIDC 451
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 153/360 (42%), Gaps = 25/360 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + +L+ DTGS ++W QC PCVNC++QS IF+P SSTY + C
Sbjct: 58 YFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQ 117
Query: 67 CRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL--VCVPGVIFGCSNDNR 123
C C+ +C+++++Y G+ +G T+ + + + + V + + GC +DN
Sbjct: 118 CLNLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNE 177
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA---- 179
+ LG F Q+ G FSYCL + S L FG+ A
Sbjct: 178 GYFVGAAGLLGLGKGPLSFP--NQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPA 235
Query: 180 ----NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
Q +M+ + YYL + ISV + F L G GG +ID+G
Sbjct: 236 GARFTPQDSNMRV-------PTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSG 288
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDR 294
T +Q Y + F + + ++ CY ++T HF
Sbjct: 289 TSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSL---FDTCYDLSGLASVDVPTVTLHFQG 345
Query: 295 A-DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
D K+ + N FC+A + + S++G QQQ R +YD + FVP C
Sbjct: 346 GTDLKLPASNYLIPVDNSNTFCLAFAGTTGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 163/357 (45%), Gaps = 27/357 (7%)
Query: 11 VLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP 70
V G S++ ++ DTGS L W QC PC +C+NQ+ P+F P+ S +Y+ I C+ C+
Sbjct: 124 VTMGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSL 183
Query: 71 PFRC------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
+ C + +NY G+ SG + E F + V +FGC +N+
Sbjct: 184 ELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFG----GISVSNFVFGCGRNNKG 239
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G +G++G S S++ Q +T G+FSYCL + + A+ L G + + +
Sbjct: 240 LF--GGASGLMGLGRSELSMISQTNATFGGVFSYCLP-STDQAGASGSLVMGNQSGVFKN 296
Query: 185 --DMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ RM + S+ Y L+L I V + +F G GG ++D+G + +
Sbjct: 297 VTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF-----GNGGVILDSGTVISR 351
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ Y+ + F E F+ F + + YD SM F + A+ V+
Sbjct: 352 LAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGN-AELNVD 410
Query: 301 PTYM-YFIFQNEGYFCVAI-SFSD--RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
T + Y + ++ C+A+ S SD ++G +QQ++ R +YD + F E C
Sbjct: 411 ATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEPC 467
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 101/353 (28%), Positives = 165/353 (46%), Gaps = 44/353 (12%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYK---RIPCDDL 65
++ G P + ++ DTGS ++W C PC NC N +F+P+ SST+ + PCD
Sbjct: 103 ANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCKTPCDFK 162
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC-VPGVIFGCSNDNRD 124
C R C+ + YA ++ASG+ +T F ++ +P V+FGC + N
Sbjct: 163 GCSR----CD--PIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGH-NIG 215
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL------VYAYREMEATSILRFGKD 178
D GILG + P SL ++ FSYC+ Y Y + L G+
Sbjct: 216 QDTDPGHNGILGLNNGPDSLATKIGQK----FSYCIGDLADPYYNYHQ------LILGEG 265
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
A+++ F + YY++++ ISV + R+ AP TF +++N TGG +IDTG+
Sbjct: 266 ADLEGYSTP----FEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTI 321
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMTFHF-DR 294
TF+ + ++ + RQ S W C+ Y S R + +TFHF D
Sbjct: 322 TFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSP-WMQCF-YGSISRDLVGFPVVTFHFADG 379
Query: 295 ADFKVEPTYMYFIFQNEGYFCV------AISFSDRNSVVGAWQQQDTRFVYDL 341
AD ++ + +F N+ FC+ +++ + S++G QQ YDL
Sbjct: 380 ADLALD-SGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDL 431
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 161/368 (43%), Gaps = 30/368 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VDV GTP + ++ DTGS L W QC PC++CF QS PIF+P AS +Y+ + C D
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDR 208
Query: 67 CR--RPPF--------RCENGQCVHRINYAGGASASGLVSTETFTFHL-KNKLVCVPGVI 115
CR PP R + C + Y ++ +G ++ E FT +L ++ V GV
Sbjct: 209 CRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVA 268
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILR 174
FGC + NR LG F+ QL+ G FSYCLV A S +
Sbjct: 269 FGCGHRNRGLFHGAAGLLGLGRGPLSFA--SQLRGVYGGHAFSYCLV--EHGSAAGSKII 324
Query: 175 FGKDANIQRKDMKTIRMFV---DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
FG D + F D + YYL L+ I V + + T + GG +
Sbjct: 325 FGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLS-----AGGTI 379
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTF 290
ID+G ++ Y+ + + F + + + CY + ++
Sbjct: 380 IDSGTTLSYFPEPAYQAIRQAFIDRMSP--SYPLILGFPVLSPCYNVSGAEKVEVPELSL 437
Query: 291 HFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQ 347
F P YFI + EG C+A+ + R+ S++G +QQQ+ +YDL +
Sbjct: 438 VFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLG 497
Query: 348 FVPENCAN 355
F P CA+
Sbjct: 498 FAPRRCAD 505
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 160/355 (45%), Gaps = 25/355 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V + G P++ +++ DTGS + W QC PC +C++Q+ PIF P++SS+Y+ + CD
Sbjct: 151 YFTRVGI--GNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDT 208
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C N C++ ++Y G+ G +TET T V V GC + N
Sbjct: 209 PQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI----GSTLVQNVAVGCGHSNE 264
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG + QL +T+ FSYCLV R+ ++ S + FG
Sbjct: 265 GLFVGAAGLLGLGGGLLALP--SQLNTTS---FSYCLV--DRDSDSASTVEFGTSLPPDA 317
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+R + YYL L ISV + +F + +G+GG +ID+G T +Q
Sbjct: 318 VVAPLLRNH-QLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQT 376
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFR-AYASMTFHFDRADFKVE 300
G Y + F + G + A+ ++ CY ++ ++ FHF
Sbjct: 377 GIYNSLRDSFLK-----GTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLAL 431
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC+A + + + +++G QQQ TR +DL I F C
Sbjct: 432 PAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 171/378 (45%), Gaps = 33/378 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP+ L+ DTGS + W QC+PC +C P FNP SS++ ++PC
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 198
Query: 67 CRR-----PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKN----KLVCVPGVIF 116
C PF +G+ C+ I Y G+ +SGL++ ET + N + V + +
Sbjct: 199 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 258
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC++ +R+ +G+LG P S QL S FS+C + ++ ++ FG
Sbjct: 259 GCADIDRE-GLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFG 317
Query: 177 KDANIQRKDMKTIRMFVDRS------SHYYLSLQDISVADHRIGFAPGTFALRR-NGTGG 229
+++I ++ + + + +YY+ L ISV + R+ + F + + G+GG
Sbjct: 318 -ESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGSGG 376
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----- 284
+ID+G T++++ ++ + R F + + + + + CY S A
Sbjct: 377 TIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKV---DDNSGFTPCYNITSGTAALESTI 433
Query: 285 YASMTFHFDRADFKVEPTYMYFI----FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFV 338
S+T HF V P I + + C+A S +++G +QQQ+
Sbjct: 434 LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQNLWVE 493
Query: 339 YDLNTGTIQFVPENCAND 356
YDL + P CA D
Sbjct: 494 YDLEKLRLGIAPAQCATD 511
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 165/364 (45%), Gaps = 26/364 (7%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+N Y + + GTP + DTGS LIW QC PC NCF Q P+F P SST+K C
Sbjct: 88 ENGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATC 147
Query: 63 DDLICRR-PPF--RC-ENGQCVHRINYAGGASASGLVSTETFTFHLKN--KLVCVPGVIF 116
D C PP +C + GQC++ +Y + G+V TET +F + V P IF
Sbjct: 148 DSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIF 207
Query: 117 GCSN-DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
GC +N F + G++G P SL+ QL FSYCL+ +TS L+F
Sbjct: 208 GCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPF--SSNSTSKLKF 265
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G +A + + + + + S Y+L+L+ +++ + G +ID
Sbjct: 266 GSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTG--------RTDGNIIID 317
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD 293
+G + T++++ Y + E + Q + +++C+ Y R + F F
Sbjct: 318 SGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFP---FKFCFPY--RDMTIPVIAFQFT 372
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
A ++P + Q+ C+A+ S + S+ G Q D + VYDL + F P
Sbjct: 373 GASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAPT 432
Query: 352 NCAN 355
+C
Sbjct: 433 DCTK 436
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 136/305 (44%), Gaps = 21/305 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS LIWTQC PC CF+Q+ P F+P+ SST CD +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 67 CRRPPFRC-------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C+ P N CV+ +Y + +G + + FTF VPGV FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF--VGAGASVPGVAFGCG 199
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N F N GI GF P SL QLK G FS+C +T +L D
Sbjct: 200 LFNNGV-FKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLPADL 255
Query: 180 -NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
R +++ + + + + YYLSL+ I+V R+ FAL +NGTGG +ID+G
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDSGT 314
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRA 295
T + Y +V F + + D +C R + Y + HF+ A
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQVK---LPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGA 371
Query: 296 DFKVE 300
+
Sbjct: 372 TMDLP 376
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 171/358 (47%), Gaps = 25/358 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G+P+K ++L+ DTGS + W QC PC +C+ Q+ +F+P ASS+++R+ C
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 67 CRRPPFRC---ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ + + +C+++++Y G+ G +++++F L ++ P V+FGC +DN
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSF---LVSRGRTSP-VVFGCGHDNE 129
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
AG+LG S QL S FSYCLV + A+S L FG A
Sbjct: 130 GLFV--GAAGLLGLGAGKLSFPSQLSSRK---FSYCLVSRDNGVRASSALLFGDSALPTS 184
Query: 184 KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRN-GTGGCMIDTGAIATF 240
++ + + YY L IS+ + F L + G GG +ID+G T
Sbjct: 185 ASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTR 244
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFD-RAD 296
+ Y V+ F Q++ A++ ++ CY + + +++FHF+ A
Sbjct: 245 LPTYAYTVMRDAFRS-----ATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGAS 299
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ P+ G FC A S + + S++G QQQ R DL++ + F P C
Sbjct: 300 VQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 169/381 (44%), Gaps = 49/381 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV G+P K L+ DTGS L W QC+PC +CF Q+ P ++P S +++ I C+D
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL------KNKLVCVPG 113
C+ PP C E C + Y ++ +G + ETFT +L K++ V
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
V+FGC + NR LG FS QL+S FSYCLV + +S L
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRDSDTSVSSKL 373
Query: 174 RFGKDAN-IQRKDMKTIRMFVDRS----SHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
FG+D + + ++ + + + YYL ++ I V ++ + L +G G
Sbjct: 374 IFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAG 433
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRAY 285
G +ID+G ++ Y ++ F + + + ED+ CY
Sbjct: 434 GTIIDSGTTLSYFSDPAYRIIKEAF------LRKVKGYKLVEDFPILHPCYN-------- 479
Query: 286 ASMTFHFDRADFKVE---------PTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQ 333
S T + +F ++ P YFI Q C+A+ + ++ S++G +QQQ
Sbjct: 480 VSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQ 539
Query: 334 DTRFVYDLNTGTIQFVPENCA 354
+ +YD + + P CA
Sbjct: 540 NFHILYDTKNSRLGYAPMRCA 560
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 168/359 (46%), Gaps = 23/359 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP + L DTGS L+W QC PC C+ Q +P+F P S+TY IPCD
Sbjct: 50 YLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSEE 109
Query: 67 CRRP-PFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNR 123
C C + C + YA + G+++ ET TF + + V V ++FGC + N
Sbjct: 110 CNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFGCGHSNS 169
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKST-AQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+F+ N GI+G P SL+ Q + FS CLV + + + FG +++
Sbjct: 170 G-TFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTISFGDASDVS 228
Query: 183 RKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + + + + Y ++L+ ISV D + F + G MID+G AT++
Sbjct: 229 GEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSK----GNIMIDSGTPATYL 284
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADF 297
+ Y+ +++ + M +D + CYR ++ + HF+ AD
Sbjct: 285 PQEFYDRLVKELKV------QSNMLPIDDDPDLGTQLCYRSETNLEG-PILIAHFEGADV 337
Query: 298 KVEPTYMYFIFQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
++ P FI +G FC A++ +D + G + Q + +DL+ T+ F +C+N
Sbjct: 338 QLMP-IQTFIPPKDGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCSN 395
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 165/379 (43%), Gaps = 43/379 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD-- 64
Y +DV GTP + ++ DTGS L W QC PC++CF Q P+F+P ASS+Y+ + C D
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPR 205
Query: 65 ------------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC-- 110
CRRP C + Y ++++G ++ E+FT +L
Sbjct: 206 CGHVAPPEAPAPRACRRP----GEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSR 261
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEA 169
V GV+FGC + NR LG F+ QL++ G FSYCLV +
Sbjct: 262 VDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFA--SQLRAVYGGHTFSYCLV--DHGSDV 317
Query: 170 TSILRFGKDANIQRKDMKTIR--MFVDRSSH----YYLSLQDISVADHRIGFAPGTFALR 223
S + FG+D + ++ F SS YY+ L + V + + T+
Sbjct: 318 ASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDAS 377
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDS 280
G+GG +ID+G ++ Y+V+ R F + + + D+ CY
Sbjct: 378 EGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSG-----SYPPVPDFPVLSPCYNVSG 432
Query: 281 RFRA-YASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTR 336
R ++ F P YFI +G C+A+ + R S++G +QQQ+
Sbjct: 433 VERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFH 492
Query: 337 FVYDLNTGTIQFVPENCAN 355
YDL+ + F P CA
Sbjct: 493 VAYDLHNNRLGFAPRRCAE 511
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 168/358 (46%), Gaps = 25/358 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G+P+K ++L+ DTGS + W QC PC +C+ Q+ +F+P ASS+++R+ C
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 67 CRRPPFRC---ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ + + +C+++++Y G+ G +++++F+ V+FGC +DN
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGR----TSPVVFGCGHDNE 129
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
AG+LG S QL S FSYCLV + A+S L FG A
Sbjct: 130 GLFV--GAAGLLGLGAGKLSFPSQLSSRK---FSYCLVSRDNGVRASSALLFGDSALPTS 184
Query: 184 KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRN-GTGGCMIDTGAIATF 240
++ + + YY L IS+ + F L + G GG +ID+G T
Sbjct: 185 ASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTR 244
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRA-YASMTFHFD-RAD 296
+ Y V+ F Q++ A++ ++ CY + + +++FHF+ A
Sbjct: 245 LPTYAYTVMRDAFRS-----ATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGAS 299
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ P+ G FC A S + + S++G QQQ R DL++ + F P C
Sbjct: 300 VQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 162/355 (45%), Gaps = 22/355 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V V G P+K +++ DTGS + W QC PC +C+ Q+ PIF+P +SS++ +PC+
Sbjct: 154 EYFSRVGV--GQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCE 211
Query: 64 DLICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C+ C +C+++++Y G+ G TET TF + V GC +DN
Sbjct: 212 SQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSGM---INDVAVGCGHDN 268
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
LG + Q+K+++ FSYCLV R+ ++S L F A
Sbjct: 269 EGLFVGSAGLLGLGGGPLSLT--SQMKASS---FSYCLV--DRDSSSSSDLEFNSAAPSD 321
Query: 183 RKDMKTIRM-FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ ++ VD + YY+ L +SV + P F + +G GG ++D+G T +
Sbjct: 322 SVNAPLLKSGKVD--TFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRL 379
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
Q Y + F + N ++ CY S+ R +++F F
Sbjct: 380 QTQAYNTLRDAFVSRTPYL---KKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQL 436
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC A + + + S++G QQQ TR YDL + F P C
Sbjct: 437 PPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 169/381 (44%), Gaps = 49/381 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV G+P K L+ DTGS L W QC+PC +CF Q+ P ++P S +++ I C+D
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 67 CR-----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL------KNKLVCVPG 113
C+ PP C E C + Y ++ +G + ETFT +L K++ V
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
V+FGC + NR LG FS QL+S FSYCLV + +S L
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRDSDTSVSSKL 373
Query: 174 RFGKDAN-IQRKDMKTIRMFVDRS----SHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
FG+D + + ++ + + + YYL ++ I V ++ + L +G G
Sbjct: 374 IFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAG 433
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRAY 285
G +ID+G ++ Y ++ F + + + ED+ CY
Sbjct: 434 GTIIDSGTTLSYFSDPAYRIIKEAF------LRKVKGYKLVEDFPILHPCYN-------- 479
Query: 286 ASMTFHFDRADFKVE---------PTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQ 333
S T + +F ++ P YFI Q C+A+ + ++ S++G +QQQ
Sbjct: 480 VSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQ 539
Query: 334 DTRFVYDLNTGTIQFVPENCA 354
+ +YD + + P CA
Sbjct: 540 NFHILYDTKNSRLGYAPMRCA 560
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 160/356 (44%), Gaps = 25/356 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V + G PS +++ DTGS + W QC PC +C++Q+ PIF P +S++Y + CD
Sbjct: 143 EYFSRVGI--GKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCD 200
Query: 64 DLICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C+ C N C++ ++Y G+ G TET T + V V GC ++N
Sbjct: 201 TKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSAS----VDNVAIGCGHNN 256
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
AG+LG S Q+ +++ FSYCLV R+ ++ S L F A +
Sbjct: 257 EGLFI--GAAGLLGLGGGKLSFPSQINASS---FSYCLV--DRDSDSASTLEFNS-ALLP 308
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + + YY+ + +SV + F + +G GG +ID+G T +Q
Sbjct: 309 HAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQ 368
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRA-YASMTFHFDRADFKV 299
Y + F + G + + SE ++ CY + ++TFH
Sbjct: 369 TAAYNALRDAFVK-----GTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLP 423
Query: 300 EPTYMYFI-FQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I ++G FC A + S S++G QQQ TR +DL + F P C
Sbjct: 424 LPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 144/352 (40%), Gaps = 20/352 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + ++FDTGS L W QC PC NC+ Q P+F+P+ S+TY +PC
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQE 247
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFS 126
C C +G+C + + Y + G ++ +T T + + G +FGC +D D
Sbjct: 248 CLD-SGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQ--LQGFVFGCGDD--DTG 302
Query: 127 FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDM 186
G G+ G SL Q + FSYCL ++R A L G A
Sbjct: 303 LFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWR---AEGYLSLGSAAAPPHAQF 359
Query: 187 KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
+ D S YYL L I VA + AP F G +ID+G + T + Y
Sbjct: 360 TAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFK-----APGTVIDSGTVITRLPSRAY 414
Query: 247 EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYMY 305
+ F + R A + CY + R + S+ FD
Sbjct: 415 SALRSSFAGFMRRYKRA---PALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGV 471
Query: 306 FIFQNEGYFCVA-ISFSDRNSV--VGAWQQQDTRFVYDLNTGTIQFVPENCA 354
N C+A S D SV +G QQ+ VYDL I F + C+
Sbjct: 472 LYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 165/379 (43%), Gaps = 46/379 (12%)
Query: 4 NYFYTVDV--LFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
NY T+ + G+P+ + ++ DTGS L W QC PC C+ Q P+F+P S+TY +
Sbjct: 143 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVR 202
Query: 62 CDDLICRRPPFRCENG-------------QCVHRINYAGGASASGLVSTETFTFHLKNKL 108
C+ C R G +C + + Y G+ + G+++T+T +
Sbjct: 203 CNASACAD-SLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS-- 259
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ G +FGC NR G AG++G + SL+ Q S G+FSYCL A
Sbjct: 260 --LGGFVFGCGLSNRGLF--GGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDA 315
Query: 169 ATSI-LRFGKDANIQRKDMKTI---RMFVDRSSH--YYLSLQDISVADHRIGFAPGTFAL 222
+ S+ L G DA ++ + RM D + Y+L++ G A G AL
Sbjct: 316 SGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNV---------TGAAVGGTAL 366
Query: 223 RRNGTGG--CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS 280
G G +ID+G + T + Y V F F + G S + CY
Sbjct: 367 AAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSI-LDTCYDLTG 425
Query: 281 RFRAYAS-MTFHFD-RADFKVEPTYMYFIFQNEG-YFCVA---ISFSDRNSVVGAWQQQD 334
+T + AD V+ M F+ + +G C+A +S+ D ++G +QQ++
Sbjct: 426 HDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKN 485
Query: 335 TRFVYDLNTGTIQFVPENC 353
R VYD + F E+C
Sbjct: 486 KRVVYDTLGSRLGFADEDC 504
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 149/363 (41%), Gaps = 39/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + + D L+WTQC C CF Q P+F+P AS+TY+ PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110
Query: 67 CRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C P C C ++ + G + G V T+TF + FGC +
Sbjct: 111 CESIPSDSRNCSGNVCAYQASTNAGDTG-GKVGTDTFAVGTAKA-----SLAFGCVVAS- 163
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
D G +GI+G +P+SL+ Q T FSYCL A + S L G A +
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCL--APHDAGKNSALFLGSSAKLAG 218
Query: 184 KDMKTIRMFV-------DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
FV D S++Y + L+ + D I P + ++DT +
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFS 270
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+F+ G Y+ V + + G M E ++ C+ A + F F
Sbjct: 271 PISFLVDGAYQAVKKAVT---VAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGA 327
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDR------NSVVGAWQQQDTRFVYDLNTGTIQFVP 350
Y + G C+A+ S R S++G+ QQ++ F++DL+ T+ F P
Sbjct: 328 AMTVAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387
Query: 351 ENC 353
+C
Sbjct: 388 ADC 390
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 164/370 (44%), Gaps = 36/370 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + +L GTP DTGS +IW C+ C +CFNQS+ IFNP ASSTY+ PCD
Sbjct: 98 YLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQ 157
Query: 67 CRRPPFRCENGQ-CV------HRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C C++ C+ H++N G A V T T T + +P F C
Sbjct: 158 CETTSSSCQSDNVCLYSCDEKHQLNCPNGRIA---VDTMTLT-SSDGRPFPLPYSDFVCG 213
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N +F G G++G SL +L + G FSYCL Y + S + FG +
Sbjct: 214 NSIYK-TFAG--VGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQP--SKINFGLQS 268
Query: 180 NIQRKDMKTIRMFVDRSSH---YYLSLQDISVADHR--IGFAPGTFALRRNGTGGCMIDT 234
I D++ + + H YY++L+ ISV + R + + FA G +ID+
Sbjct: 269 FISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQDLYYVDDPFA---PPVGNMLIDS 325
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA--------SEDWEYCYRYDSRFRAYA 286
G + T + + Y+ + + HN+ + C+ Y + +
Sbjct: 326 GTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFWYYPELK-FP 384
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTG 344
+T HF AD ++ FI E C A + + +++V G+WQQ + YDL G
Sbjct: 385 KITIHFTDADVELSDDNS-FIRVAEDVVCFAFAATQPGQSTVYGSWQQMNFILGYDLKRG 443
Query: 345 TIQFVPENCA 354
T+ F +C+
Sbjct: 444 TVSFKRTDCS 453
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 159/356 (44%), Gaps = 14/356 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS LIW QCLPC NC+ Q P+F+P S TYK + CD+
Sbjct: 94 YLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEF 153
Query: 67 CRR--PPFRC-ENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C+ C ++ C + +Y + G +S++T T + PG+ FGC +DN
Sbjct: 154 CQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGCGHDN 213
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+F+ G++G P SL+ QL S G FSYCLV + +S + FGK +
Sbjct: 214 GG-TFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGVVS 272
Query: 183 RKDMKTIRMFVDR-SSHYYLSLQDISVADHRI---GFAPGTFALRRNGTGGCMIDTGAIA 238
+ + + YYL+L+ +SV + GF+ + G +ID+G
Sbjct: 273 GSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGTTL 332
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + + Y V + G Q + + + CY + ++T HF AD +
Sbjct: 333 TLLPQDFYTDVESALTN---AIGGQTTTDPNGIFSLCYSSVNNLE-IPTITAHFTGADVQ 388
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ P + Q E C ++ S ++ G Q + YDL + F +C
Sbjct: 389 LPPLNTFVQVQ-EDLVCFSMIPSSNLAIFGNLAQINFLVGYDLKNNKVSFKQTDCT 443
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 170/379 (44%), Gaps = 56/379 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV++ + ++ DTGS L W QC PC C+NQ P+FNP+ S +Y+ + C
Sbjct: 134 NYIVTVEL----GGRKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCS 189
Query: 64 DLICR--------------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLK-NKL 108
C+ PP C + +NY G+ G + TE HL
Sbjct: 190 SPTCQSLQSATGNLGVCGSNPP------SCNYVVNYGDGSYTRGELGTE----HLDLGNS 239
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
V IFGC +N+ G +G++G S SL+ Q + G+FSYCL E E
Sbjct: 240 TAVNNFIFGCGRNNQGLF--GGASGLVGLGRSSLSLISQTSAMFGGVFSYCL--PITETE 295
Query: 169 ATSILRFGKDANIQRKD--MKTIRMFVD-RSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
A+ L G ++++ + + RM + + Y+L+L I+V G+ A++
Sbjct: 296 ASGSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITV---------GSVAVQAP 346
Query: 226 --GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RF 282
G G MID+G + T + Y+ + F + F+ F A + C+ +
Sbjct: 347 SFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAP---AFMILDTCFNLSGYQE 403
Query: 283 RAYASMTFHFD-RADFKVEPTYM-YFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRF 337
++ HF+ A+ V+ T + YF+ + C+AI S+ + ++G +QQ++ R
Sbjct: 404 VEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRV 463
Query: 338 VYDLNTGTIQFVPENCAND 356
+YD + F E C D
Sbjct: 464 IYDTKGSMLGFAAEACTFD 482
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 158/356 (44%), Gaps = 30/356 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + V FGTP++++ ++FDTGS + W QC PC V C+ Q P+F+P+ SSTY+ + C +
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSCTEP 75
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C R C + C++ + Y G+S G ++ +TF K IFGC +N
Sbjct: 76 ACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKF---KNFIFGCGQNNTG 132
Query: 125 FSFDGNIAGILGFS-VSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
F G AG++G S +SL Q+ + +FSYCL AT L G N
Sbjct: 133 L-FQGT-AGLVGLGRSSTYSLNSQVAPSLGNVFSYCLP---STSSATGYLNIGNPQNTPG 187
Query: 184 KDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
M D + Y++ L ISV R+ + F + G +ID+G + T +
Sbjct: 188 YT----AMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTVITRL 238
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD-SRFRAYASMTFHFDRADFKVE 300
Y + T + + A + CY + + Y + HF D ++
Sbjct: 239 PPTAYSALKTAVRAAMTQY---TLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIP 295
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
T ++F+F N C+A + + ++ ++G QQ YD I F C
Sbjct: 296 ATGVFFVF-NSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 163/364 (44%), Gaps = 31/364 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP ++ DTGS LIW QC PC C+ Q +PIFNP SSTY+R+ C+
Sbjct: 94 YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRY 153
Query: 67 CR--RPPFRCENGQ-----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C R + C + +Y + G ++TE F N + + FGC
Sbjct: 154 CNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNN--SIQELAFGCG 211
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME-ATSILRFGKD 178
N N +FD +GI+G SL+ QL + FSYCLV + + + FG +
Sbjct: 212 NSNGG-NFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDN 270
Query: 179 ANIQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGT---GGCMID 233
+ I D V + YYL+L+ ISV + R+ + R +G G +ID
Sbjct: 271 SFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENS----RNDGNVEKGNIIID 326
Query: 234 TGAIATFIQRGPY---EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
+G TF+ Y E+V+ E +R+ + + + C+R D +T
Sbjct: 327 SGTTLTFLDSKLYNKLELVLEKAVEG------ERVSDPNGIFSICFR-DKIGIELPIITV 379
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
HF AD +++P F E C + S+ ++ G Q + YDL+ + F+P
Sbjct: 380 HFTDADVELKPINT-FAKAEEDLLCFTMIPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMP 438
Query: 351 ENCA 354
+C+
Sbjct: 439 TDCS 442
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 165/362 (45%), Gaps = 27/362 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V++ G+P ++ L DT S L+W QC PC+NC+ QS PIF+P+ S T++ C
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144
Query: 67 CRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHL---KNKLVCVPGVIFGCSND 121
P R + C + + Y G + G+++ E F+ ++ + V+FGC +D
Sbjct: 145 YSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHD 204
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N G GILG FSL+ + + FSYC ++L G D
Sbjct: 205 NYGEPLVG--TGILGLGYGEFSLVHRFGTK----FSYCFGSLDDPSYPHNVLVLGDDGAN 258
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIATF 240
D + ++ + YY++++ ISV + P F + G GG +IDTG T
Sbjct: 259 ILGDTTPLEIY---NGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTS 315
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CY----RYDSRFRAYASMTFHF- 292
+ Y+ + +++F GR + ++D + CY D + +TFHF
Sbjct: 316 LVEEAYKPLKNKIEDYFE--GRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFS 373
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
D A+ ++ + F+ + FC+A++ + NS +GA QQ YDL I F +
Sbjct: 374 DGAELSLDVKSV-FMKLSPNVFCLAVTPGNMNS-IGATAQQSYNIGYDLEAKKISFERID 431
Query: 353 CA 354
C
Sbjct: 432 CG 433
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 151/351 (43%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P + ++++ D+GS ++W QC PC C+ QS P+F+P S +Y + C +
Sbjct: 131 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSV 190
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C +G C + + Y G+ G ++ ET TF V V GC + NR
Sbjct: 191 CDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF----AKTVVRNVAMGCGHRNRGM 246
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
F G + S S +GQL G F YCLV R ++T L FG++A
Sbjct: 247 -FIGAAGLLGIGGGS-MSFVGQLSGQTGGAFGYCLV--SRGTDSTGSLVFGREALPVGAS 302
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S YY+ L+ + V RI G F L G GG ++DTG T +
Sbjct: 303 WVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAA 362
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
Y F + R + ++ CY +++F+F P
Sbjct: 363 YVAFRDGFKSQTANLPRA---SGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARN 419
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + G +C A + S S++G QQ+ + +D G + F P C
Sbjct: 420 FLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 160/351 (45%), Gaps = 24/351 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V+V GTP K L+FDTGS LIWTQC PC C+ + P+F+P S+++K +PC +
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPK-VPVFDPTKSASFKGLPCSSKL 190
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRDF 125
C+ C + +C + Y +S++G ++TET +F HLK ++ GCS+
Sbjct: 191 CQSIRQGCSSPKCTYLTAYVDNSSSTGTLATETISFSHLKYDF---KNILIGCSDQVSGE 247
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
S +GI+G + SP SL Q + LFSYC+ +T L FG +
Sbjct: 248 SL--GESGIMGLNRSPISLASQTANIYDKLFSYCIP---STPGSTGHLTFGGKVPNDVRF 302
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ SS Y + + ISV ++ F + ID+GA+ T +
Sbjct: 303 SPVSK--TAPSSDYDIKMTGISVGGRKLLIDASAFKIAST------IDSGAVLTRLPPKA 354
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDRA-DFKVEPTY 303
Y + F E + + + + + CY + + A S++ F+ + ++ +
Sbjct: 355 YSALRSVFREMMKGY---PLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSG 411
Query: 304 MYFIFQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + +C+A + D S+ G +QQ+ V+D I F P C
Sbjct: 412 IMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 157/347 (45%), Gaps = 34/347 (9%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYK---RIPCDDL 65
V++ G PS + ++ DTGS ++W C PC NC N +F+P+ SST+ + PC
Sbjct: 103 VNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCKTPCGFK 162
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC-VPGVIFGCSNDNRD 124
C+ P I+Y +SASG + F ++ + VI GC + N
Sbjct: 163 GCKCDPIP-------FTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGH-NIG 214
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F+ D GILG + P SL Q+ FSYC+ + LR G+ A+++
Sbjct: 215 FNSDPGYNGILGLNNGPNSLATQIGRK----FSYCIGNLADPYYNYNQLRLGEGADLEGY 270
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
F YY++++ ISV + R+ A TF ++RNGTGG ++D+G T++
Sbjct: 271 STP----FEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDS 326
Query: 245 PYEVVMRHFDEHFT-SFGRQRMHNASEDWEYCYR--YDSRFRAYASMTFHF-DRADFKVE 300
++++ SF + NA W+ CY + +TFHF D AD ++
Sbjct: 327 AHKLLYNEVRNLLKWSFRQVIFENAP--WKLCYYGIISRDLVGFPVVTFHFVDGADLALD 384
Query: 301 PTYMYFIFQNEGYFCVAISFSD------RNSVVGAWQQQDTRFVYDL 341
F Q + FC+ +S + SV+G QQ YDL
Sbjct: 385 TG--SFFSQRDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDL 429
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 161/354 (45%), Gaps = 22/354 (6%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V V G P+K +++ DTGS + W QC PC +C+ Q+ PIF+P +SS++ +PC+
Sbjct: 155 YFSRVGV--GQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 65 LICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ C +C+++++Y G+ G ET TF + V GC +DN
Sbjct: 213 QQCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSGM---INNVAVGCGHDNE 269
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG + Q+K+++ FSYCLV R+ ++S L F A
Sbjct: 270 GLFVGSAGLLGLGGGSLSLT--SQMKASS---FSYCLV--DRDSSSSSDLEFNSAAPSDS 322
Query: 184 KDMKTIRM-FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ ++ VD + YY+ L +SV + P F + +G GG ++D+G T +Q
Sbjct: 323 VNAPLLKSGKVD--TFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQ 380
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEP 301
Y + F + N ++ CY S+ R +++F F P
Sbjct: 381 TQAYNTLRDAFVSRTPYL---KKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLP 437
Query: 302 TYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y I + G FC A + + + S++G QQQ TR YDL + F P C
Sbjct: 438 PKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 161/361 (44%), Gaps = 37/361 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP K L FDTGS L WTQC PC+ CF Q+ P F+P S++YK + C
Sbjct: 140 YVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSE 199
Query: 66 ICRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C+ P C + C++ I Y G + G ++TET + +FGCS
Sbjct: 200 FCKLIAEGNYPAQDCISNTCLYGIQYGSGYTI-GFLATETLAIASSDVF---KNFLFGCS 255
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
++R +F+G G+LG SP +L Q + + LFSYCL + +T L FG +
Sbjct: 256 EESRG-TFNGT-TGLLGLGRSPIALPSQTTNKYKNLFSYCLPAS---PSSTGHLSFGVEV 310
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + K+ + Y L+ ISV + P ++ R +ID+G T
Sbjct: 311 S---QAAKSTPISPKLKQLYGLNTVGISVRGREL---PINGSISRT-----IIDSGTTFT 359
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM--TFHFDRADF 297
F+ Y + F E ++ + N + ++ CY + + ++ F
Sbjct: 360 FLPSPTYSALGSAFREMMANY---TLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGV 416
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNS-----VVGAWQQQDTRFVYDLNTGTIQFVPEN 352
+VE + G V ++F+D S + G +QQ+ +YD+ G + F P+
Sbjct: 417 EVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKG 476
Query: 353 C 353
C
Sbjct: 477 C 477
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 160/355 (45%), Gaps = 25/355 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
YF V + G P++ +++ DTGS + W QC PC +C++Q+ PIF P++SS+Y+ + CD
Sbjct: 148 YFTRVGI--GKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDT 205
Query: 65 LICRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C C N C++ ++Y G+ G +TET T V V GC + N
Sbjct: 206 PQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI----GSTLVQNVAVGCGHSNE 261
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
LG + QL +T+ FSYCLV R+ ++ S + FG +
Sbjct: 262 GLFVGAAGLLGLGGGLLALP--SQLNTTS---FSYCLV--DRDSDSASTVDFGTSLSPDA 314
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+R + YYL L ISV + +F + +G+GG +ID+G T +Q
Sbjct: 315 VVAPLLRNH-QLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQT 373
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
Y + F + G + A+ ++ CY ++ ++ FHF
Sbjct: 374 EIYNSLRDSFVK-----GTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLAL 428
Query: 301 PTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I + G FC+A + + + +++G QQQ TR +DL I F C
Sbjct: 429 PAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 101/353 (28%), Positives = 160/353 (45%), Gaps = 45/353 (12%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYK---RIPCDDL 65
++ G P + ++ DTGS ++W C PC NC N +F+P+ SST+ + PCD
Sbjct: 103 ANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCDFE 162
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC-VPGVIFGCSNDNRD 124
CR P + YA ++ASG +T F ++ + V+FGC + N
Sbjct: 163 GCRCDPIP-------FTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGH-NIG 214
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL------VYAYREMEATSILRFGKD 178
D GILG + P SL+ +L FSYC+ Y Y ++ IL G D
Sbjct: 215 HDTDPGHNGILGLNNGPDSLVTKLGQK----FSYCIGNLADPYYNYHQL----ILGEGAD 266
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ F YY++++ ISV + R+ AP TF ++ N GG +IDTG+
Sbjct: 267 LEGYSTPFEVYNGF------YYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTI 320
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR---AYASMTFHF-DR 294
TF+ ++++ + RQ S W C+ Y S R + +TFHF D
Sbjct: 321 TFLVDSVHKLLSKEVRNLLGWSFRQATIEKSP-WMQCF-YGSISRDLVGFPVVTFHFSDG 378
Query: 295 ADFKVEPTYMYFIFQNEGYFCV------AISFSDRNSVVGAWQQQDTRFVYDL 341
AD ++ + +F N+ FC+ +++ + S++G QQ YDL
Sbjct: 379 ADLALD-SGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDL 430
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 160/355 (45%), Gaps = 14/355 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP S + DTGS LIW QCLPC +C+ Q P+F+P S TYK + C++
Sbjct: 94 YLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDF 153
Query: 67 CRRPPFRCENGQ---CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C+ + G C +Y + +S+ETFT + PG+ FGC + N
Sbjct: 154 CQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSN 213
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+F+ +G++G P SL+ QL S G FSYCLV + A+S + FGK A +
Sbjct: 214 GG-TFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVS 272
Query: 183 RKDMKTIRMFVDR-SSHYYLSLQDISVADHRI---GFAPGTFALRRNGTGGCMIDTGAIA 238
+ + + YYL+L+ +S+ ++ GF+ + +ID+G
Sbjct: 273 GSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIIDSGTTL 332
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + R Y + + G Q + + CY + ++T HF AD +
Sbjct: 333 TLLPRDFYTDMESALTK---VIGGQTTTDPRGTFSLCYSGVKKLE-IPTITAHFIGADVQ 388
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P F+ E C ++ S ++ G Q + YDL + F P +C
Sbjct: 389 LPPLNT-FVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 442
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 103/352 (29%), Positives = 149/352 (42%), Gaps = 27/352 (7%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKR-IPCDDLICRRPP 71
GTP L + G+ LIW P CF Q+ P F P T+ R +P C P
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEP---LTFSRGLPFAS--CGSPK 55
Query: 72 FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNI 131
F N CV+ +Y + +G + + FTF VPGV FGC N F N
Sbjct: 56 FW-PNQTCVYTYSYGDKSVTTGFLEVDKFTF--VGAGASVPGVAFGCGLFNNGV-FKSNE 111
Query: 132 AGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD------ANIQRKD 185
GI GF P SL QLK G FS+C + +T +L D +Q
Sbjct: 112 TGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTP 168
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ + YYLSL+ I+V R+ FAL NGTGG +ID+G T +
Sbjct: 169 LIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQV 227
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE-PTY 303
Y+VV DE + + C+ S+ + + HF+ A + Y
Sbjct: 228 YQVVR---DEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENY 284
Query: 304 MYFIFQNEG--YFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ + + G C+AI+ D +++G +QQQ+ +YDL + FV C
Sbjct: 285 VFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 165/370 (44%), Gaps = 31/370 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V+V GTP + ++ DTGS L W QC PC++CF+Q P+F+P AS++Y+ + C D
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTR 209
Query: 67 C-----RRPPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHL-KNKLVCVPGVIFG 117
C P C + + C + Y ++ +G ++ E FT +L + V GV+ G
Sbjct: 210 CGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLG 269
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C + NR LG F+ QL++ FSYCLV S + FG
Sbjct: 270 CGHRNRGLFHGAAGLLGLGRGPLSFA--SQLRAVYGHAFSYCLV--DHGSAVGSKIVFGD 325
Query: 178 DANIQRKDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMID 233
D + F ++ YY+ L+ I V + T+ + + +G+GG +ID
Sbjct: 326 DNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIID 385
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRAYASMTF 290
+G ++ Y+ + + F + + + D+ CY R F
Sbjct: 386 SGTTLSYFPEPAYKAIRQAFVDRM-----DKAYPLIADFPVLSPCYNVSGVERVEVP-EF 439
Query: 291 HFDRADFKVE--PTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGT 345
AD V P YFI EG C+A+ + R+ S++G +QQQ+ +YDL+
Sbjct: 440 SLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNR 499
Query: 346 IQFVPENCAN 355
+ F P CA
Sbjct: 500 LGFAPRRCAE 509
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 157/371 (42%), Gaps = 31/371 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTY-KRIPCDDL 65
YT+++ G+P K + DTGS L+W QC PC C++QS PI++P+ASST+ K
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63
Query: 66 ICRRPPFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLK-NKLVCVPGVIFGCSNDN 122
P C + C++ Y +S G + ET T P FGC N
Sbjct: 64 CQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGRLN 123
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
SF G AGI+G SL QL S FSYCLV + TS L FG A+
Sbjct: 124 SG-SF-GGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSASTG 181
Query: 183 RKDMKT-IRMFVDRSSHYYLSLQDISVADHRIGFAPGTF--------------ALRRNGT 227
+ T I RS++Y++ L+ ISV ++ A AL N +
Sbjct: 182 SGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVN-S 240
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD-SRFRAYA 286
GG + D+G T + Y V F +S + +S ++ CY S+ +
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAFA---SSVSLPTVDASSSGFDLCYDVSKSKNFKFP 297
Query: 287 SMTFHFDRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGA--WQQQDTRFVYDLN 342
++T F F P YF+ + E C+A+ S + QQ+ VYD
Sbjct: 298 ALTLAFKGTKFS-PPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRG 356
Query: 343 TGTIQFVPENC 353
T TI P C
Sbjct: 357 TSTISMSPAQC 367
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 159/364 (43%), Gaps = 33/364 (9%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H Y +D+ GTP K + DTGS L+W Q PC C + IF+P SST++ +
Sbjct: 49 HPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREM 106
Query: 61 PCDDLICRRPPFRCENGQ--CVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIFG 117
C +C P CE G C + Y G + G + +T + + P G
Sbjct: 107 DCSSQLCAELPGSCEPGSSTCSYSYEYGSGET-EGEFARDTISLGTTSDGSQKFPSFAVG 165
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C N FDG + G++G P SL QL + FSYCLV + E++ +L FG
Sbjct: 166 CGMVNS--GFDG-VDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLL-FGP 221
Query: 178 DANIQRKDMKTIRMFVDR---SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
A + +++ ++ ++Y L++ I+VA +G +PGT +ID+
Sbjct: 222 SAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG-SPGT----------TIIDS 270
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-RYDSRFRAYASMTFHFD 293
G T++ G Y V+ + T R+ +S + CY R +R + ++T
Sbjct: 271 GTTLTYVPSGVYGRVLSRMESMVT---LPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLA 327
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFV 349
A P+ YF+ ++ V ++ + S++G QQ +YD + + FV
Sbjct: 328 GATM-TPPSSNYFLVVDDSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFV 386
Query: 350 PENC 353
C
Sbjct: 387 QAKC 390
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 158/361 (43%), Gaps = 32/361 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V G+P + +FDTGS L WTQC PCV C+ Q IF+P+ S +Y + CD
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 206
Query: 66 ICRRPPFR------CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C + C + C++ I Y G+ + G + E + + FGC
Sbjct: 207 SCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQFGCG 263
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK-D 178
+NR G AG+LG + +P SL+ Q +FSYCL + +T L FG D
Sbjct: 264 QNNRGLF--GGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSS---SSSTGYLSFGSGD 318
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + + D S Y+L + ISV + ++ F+ T G +ID+G +
Sbjct: 319 GDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFS-----TAGTIIDSGTVI 373
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA--YASMTFHFD-RA 295
+ + Y V + F E + + R + + CY S+++ + +F A
Sbjct: 374 SRLPPTVYSSVQKVFRELMSDYPRVK---GVSILDTCYDL-SKYKTVKVPKIILYFSGGA 429
Query: 296 DFKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
+ + P + ++ + C+A S D +++G QQ+ VYD G + F P
Sbjct: 430 EMDLAPEGIIYVLKVS-QVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSG 488
Query: 353 C 353
C
Sbjct: 489 C 489
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 159/373 (42%), Gaps = 48/373 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP + + D L+WTQC PC CF Q P+F+P SST++ +PC
Sbjct: 56 LYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSH 115
Query: 66 ICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC--SN 120
+C P C + C++ G + G+ T+TF + + FGC
Sbjct: 116 LCESIPESSRNCTSDVCIYEAPTKAGDTG-GMAGTDTFAIGAAKETLG-----FGCVVMT 169
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
D R G +GI+G +P+SL+ Q+ TA FSYCL +++ L G A
Sbjct: 170 DKR-LKTIGGPSGIVGLGRTPWSLVTQMNVTA---FSYCLAG-----KSSGALFLGATAK 220
Query: 181 IQRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
+ FV ++S YY+ + +A + G AP A T +
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYM----VKLAGIKAGGAPLQAASSSGST--VL 274
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH 291
+DT + A+++ G Y+ + + + G Q + + + ++ C+ A + F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTA---AVGVQPVASPPKPYDLCFSKAVAGDA-PELVFT 330
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFS---------DRNSVVGAWQQQDTRFVYDLN 342
FD P Y + G C+ I S + S++G+ QQ++ ++DL
Sbjct: 331 FDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLK 390
Query: 343 TGTIQFVPENCAN 355
T+ F P +C++
Sbjct: 391 EETLSFKPADCSS 403
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 158/364 (43%), Gaps = 33/364 (9%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H Y +D+ GTP K + DTGS L+W Q PC C + IF+P SST++ +
Sbjct: 49 HPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREM 106
Query: 61 PCDDLICRRPPFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKLVCV-PGVIFG 117
C +C P CE G C + Y G + G + +T + + P G
Sbjct: 107 DCSSQLCTELPGSCEPGSSACSYSYEYGSGET-EGEFARDTISLGTTSGGSQKFPSFAVG 165
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C N FDG + G++G P SL QL + FSYCLV + E++ +L FG
Sbjct: 166 CGMVNS--GFDG-VDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLL-FGP 221
Query: 178 DANIQRKDMKTIRMFVDR---SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
A + +++ ++ ++Y L++ I+VA +G +PGT +ID+
Sbjct: 222 SAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG-SPGT----------TIIDS 270
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-RYDSRFRAYASMTFHFD 293
G T++ G Y V+ + T R+ +S + CY R +R + ++T
Sbjct: 271 GTTLTYVPSGVYGRVLSRMESMVT---LPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLA 327
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFV 349
A P+ YF+ ++ V ++ S++G QQ +YD + + FV
Sbjct: 328 GATM-TPPSSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFV 386
Query: 350 PENC 353
C
Sbjct: 387 QAKC 390
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 163/370 (44%), Gaps = 50/370 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP + +FDTGS L WTQC PC C++Q PIFNP+ S++Y I C
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 66 ICRR-------PPFRCENGQCVHRINYAGGASASG--------LVSTETFTFHLKNKLVC 110
C P C CV+ I Y + + G L ST+ F L
Sbjct: 198 TCDELKSGTGNSP-SCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDVFNNFL------ 250
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
FGC +NR F G +AG++G + SL+ Q LFSYCL +T
Sbjct: 251 -----FGCGQNNRGL-FVG-VAGLIGLGRNALSLVSQTAQKYGKLFSYCLP---STSSST 300
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
L FG + T + + S Y+L+L ISV ++ + F+ T G
Sbjct: 301 GYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFS-----TAG 355
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYA 286
+ID+G + + + Y + F + + + + + + CY +YD+
Sbjct: 356 TIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAA---PASILDTCYDFSQYDTVDVPKI 412
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISF-SDRN--SVVGAWQQQDTRFVYDLNT 343
++ F D A+ ++P+ +++I N C+A + SD +++G QQ+ VYD+
Sbjct: 413 NLYFS-DGAEMDLDPSGIFYIL-NISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAG 470
Query: 344 GTIQFVPENC 353
G I F P C
Sbjct: 471 GRIGFAPGGC 480
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 154/360 (42%), Gaps = 29/360 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP L+FDTGS L WTQC PCV C++Q PIFNP+ S++Y + C
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 163
Query: 66 ICRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C C C++ I Y + + G ++ E FT L N V GV FGC
Sbjct: 164 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFT--LTNSDV-FDGVYFGCG 220
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+N+ F G +AG+LG S Q + +FSYCL T L FG
Sbjct: 221 ENNQGL-FTG-VAGLLGLGRDKLSFPSQTATAYNKIFSYCLP---SSASYTGHLTFGSAG 275
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ I D +S Y L++ I+V ++ F+ T G +ID+G + T
Sbjct: 276 ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVIT 330
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD-RADF 297
+ Y + F + + + + C+ + + F F A
Sbjct: 331 RLPPKAYAALRSSFKAKMSKY---PTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 387
Query: 298 KVEPTYMYFIFQNEGYFCVAISFS--DRNSVV-GAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ ++++F+ C+A + + D N+ + G QQQ VYD G + F P C+
Sbjct: 388 ELGSKGIFYVFKIS-QVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 446
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 154/360 (42%), Gaps = 29/360 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP L+FDTGS L WTQC PCV C++Q PIFNP+ S++Y + C
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 191
Query: 66 ICRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C C C++ I Y + + G ++ E FT L N V GV FGC
Sbjct: 192 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFT--LTNSDV-FDGVYFGCG 248
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+N+ F G +AG+LG S Q + +FSYCL T L FG
Sbjct: 249 ENNQGL-FTG-VAGLLGLGRDKLSFPSQTATAYNKIFSYCLP---SSASYTGHLTFGSAG 303
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ I D +S Y L++ I+V ++ F+ T G +ID+G + T
Sbjct: 304 ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVIT 358
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD-RADF 297
+ Y + F + + + + C+ + + F F A
Sbjct: 359 RLPPKAYAALRSSFKAKMSKY---PTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 415
Query: 298 KVEPTYMYFIFQNEGYFCVAISFS--DRNSVV-GAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ ++++F+ C+A + + D N+ + G QQQ VYD G + F P C+
Sbjct: 416 ELGSKGIFYVFKIS-QVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 156/378 (41%), Gaps = 45/378 (11%)
Query: 1 HEKNYFYTV-DVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKR 59
H + Y V + GTP + + D L+WTQC C CF Q P+F PNASST++
Sbjct: 60 HWSRHLYNVANFTIGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRP 119
Query: 60 IPCDDLICRR-PPFRCENGQCVHR--INYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
PC C+ P C + C + IN G G+V+T+TF + F
Sbjct: 120 EPCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATA-----SLGF 174
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC + G +G++G +P SL+ Q+ T FSYCL + S L G
Sbjct: 175 GCVVAS-GIDTMGGPSGLIGLGRAPSSLVSQMNITK---FSYCLT--PHDSGKNSRLLLG 228
Query: 177 KDANIQRKDMKTIRMFV------DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
A + T FV D S +Y + L I D I P +
Sbjct: 229 SSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTV-------- 280
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS--- 287
++ T A +F+ Y+ + + + + G + ++ C+ A A
Sbjct: 281 LVQTLAPMSFLVDSAYQALKKEVTK---AVGAAPTATPLQPFDLCFPKAGLSNASAPDLV 337
Query: 288 MTFHFDRADFKVEPT-YMYFIFQNEGYFCVAI---------SFSDRNSVVGAWQQQDTRF 337
TF A V P Y+ + + +G C+AI + + +++G+ QQ++T F
Sbjct: 338 FTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHF 397
Query: 338 VYDLNTGTIQFVPENCAN 355
+ DL T+ F P +C++
Sbjct: 398 LLDLEKKTLSFEPADCSS 415
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 157/357 (43%), Gaps = 37/357 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + L DTGS LIW +C C C Q +P + PN SS++ ++PC +
Sbjct: 82 YDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSL 141
Query: 67 CRR-PPFRCENG--QCVHRINYAGGAS-----ASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P +C G +C ++ +Y G AS G + +ETFT VPG+ FGC
Sbjct: 142 CSDLPSSQCSAGGAECDYKYSY-GLASDPHHYTQGYLGSETFTLGSD----AVPGIGFGC 196
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ G+ +G++G P SL+ QL G FSYCL + TS L FG
Sbjct: 197 T--TMSEGGYGSGSGLVGLGRGPLSLVSQLN---VGAFSYCLT---SDAAKTSPLLFGSG 248
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
A + +++ + + +Y ++L+ IS+ G G+ G + D+G
Sbjct: 249 A-LTGAGVQSTPLLRTSTYYYTVNLESISI---------GAATTAGTGSSGIIFDSGTTV 298
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
F+ Y + T+ M + + +E C++ + SM HFD D
Sbjct: 299 AFLAEPAYTLAKEAVLSQTTNL---TMASGRDGYEVCFQTSGAV--FPSMVLHFDGGDMD 353
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+ PT YF ++ C + S S+VG Q + YD+ + F P NC N
Sbjct: 354 L-PTENYFGAVDDSVSCWIVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANCDN 409
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 165/359 (45%), Gaps = 14/359 (3%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + DTGS + W QC C +C+ Q+ PIF+P+ S TYK +PC +
Sbjct: 97 YLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSSNM 156
Query: 67 CRR----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSND 121
C+ P + C + I Y G+ + G +S ET T N V P + GC ++
Sbjct: 157 CQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVIGCGHN 216
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N+ +F G +G++G P SL+ QL S+ G FSYCL + + ++S L FG A +
Sbjct: 217 NKG-TFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNFGDAAVV 275
Query: 182 QRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFA-PGTFALRRNGTGGCMIDTGAIA 238
+ + S YYL+L+ SV D RI F + + NG G +ID+G
Sbjct: 276 SGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGTTL 335
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADF 297
T + + Y + + + R+ + S CY+ + +T HF AD
Sbjct: 336 TLLPQEDYSNLESAVADAIQA---NRVSDPSNFLSLCYQTTPSGQLDVPVITAHFKGADV 392
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
++ P F+ EG C A S+ S+ G Q + YDL T+ F P +C +
Sbjct: 393 ELNPIST-FVQVAEGVVCFAFHSSEVVSIFGNLAQLNLLVGYDLMEQTVSFKPTDCTQE 450
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 170/373 (45%), Gaps = 47/373 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV++ K+ L+ DTGS L W QC PC +C+NQ P+++P+ SS+YK + C+
Sbjct: 137 NYIVTVEL----GGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCN 192
Query: 64 DLICR--------RPPFRCENG----QCVHRINYAGGASASGLVSTETFTF---HLKNKL 108
C+ P NG C + ++Y G+ G +++E+ L+N
Sbjct: 193 SSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLEN-- 250
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
++FGC +N+ G +G++G S SL+ Q T G+FSYCL E
Sbjct: 251 -----LVFGCGRNNKGLF--GGASGLMGLGRSSVSLVSQTLKTFNGVFSYCL--PSLEDG 301
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRS----SHYYLSLQDISVADHRIGFAPGTFALRR 224
A+ L FG D ++ + + ++ S Y L+L S+ G T + R
Sbjct: 302 ASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIG----GVELKTLSFGR 357
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA 284
G +ID+G + T + Y+ V F + F+ F ++ + Y+
Sbjct: 358 ----GILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIP 413
Query: 285 YASMTFHFDRADFKVEPTYM-YFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVYD 340
M F + A+ +V+ T + YF+ + C+A +S+ + ++G +QQ++ R +YD
Sbjct: 414 TIKMIFEGN-AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYD 472
Query: 341 LNTGTIQFVPENC 353
+ ENC
Sbjct: 473 TTQERLGIAGENC 485
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 154/351 (43%), Gaps = 15/351 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P +S++++ D+GS ++W QC PC C++Q+ P+F+P S+++ + C +
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAV 102
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + C +G+C + ++Y G+S G ++ ET T V V GC + N+
Sbjct: 103 CDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTL----GRTVVQNVAIGCGHMNQGM 158
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
AG+LG S +GQL FSYCLV R + L FG +A
Sbjct: 159 FV--GAAGLLGLGGGSMSFVGQLSRERGNAFSYCLV--SRVTNSNGFLEFGSEAMPVGAA 214
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ S+YY+ L + V D ++ + F L G GG ++DTG T
Sbjct: 215 WIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVA 274
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
YE F + + R + ++ CY +++F+F P
Sbjct: 275 YEAFRDAFIDQTGNLPRA---SGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANN 331
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I + G FC A + S S++G QQ+ + D + F P C
Sbjct: 332 FLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 158/358 (44%), Gaps = 23/358 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP + ++++ DTGS ++W QC PC C++Q PIFNP+ S+++ + C+
Sbjct: 196 EYFTRIGV--GTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCN 253
Query: 64 DLICR-RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C + C G C+++++Y G+ G +TE TF + V V GC +DN
Sbjct: 254 SAVCSYLDAYNCHGGGCLYKVSYGDGSYTIGSFATEMLTFGTTS----VRNVAIGCGHDN 309
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
LG + F QL + FSYCLV + E T L FG ++
Sbjct: 310 AGLFVGAAGLLGLGAGLLSFP--SQLGTQTGRAFSYCLVDRFSESSGT--LEFGPESVPL 365
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGCMIDTGAIATF 240
+ + + YY+ L ISV + P F + +G GG ++D+G T
Sbjct: 366 GSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAVTR 425
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDS-RFRAYASMTFHFDRADF 297
+Q Y+ V F G +++ A ++ CY ++ FHF
Sbjct: 426 LQTPVYDAVRDAF-----VAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHFSNGAS 480
Query: 298 KVEPTYMYFIFQN-EGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y I + G FC A + + + S++G QQQ R +D + F C
Sbjct: 481 LILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGFALRQC 538
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 161/377 (42%), Gaps = 41/377 (10%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC---RR 69
GTP + LL DT S L W Q C NC P FNP SS++ PC +C +
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64
Query: 70 PPFRC----ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG-VIFGCSNDNRD 124
F+ G C ++ Y G+ A G+++ E F+ + G VIFGC++ +
Sbjct: 65 LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQ-GL---FSYCLVYAYREMEATSILRFGKDA- 179
D + +G LG + FS Q+ S ++ GL FSYC + ++ ++ FG
Sbjct: 125 RPVDFS-SGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDSGI 183
Query: 180 ---NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
+ Q ++ YY+ LQ ISV + F + R G GG D+G
Sbjct: 184 PAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFDSGT 243
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMH---NASEDW--EYCYRY---DSRFRAYASM 288
+F+ + ++ +FGR+ +H + D+ E CY D+R +
Sbjct: 244 TVSFLVEPAHTALVE-------AFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLV 296
Query: 289 TFHF-DRADFKVEPTYMYF-IFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRFVY 339
T HF + D ++ ++ + + + ++F + + V+G +QQQD +
Sbjct: 297 TLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEH 356
Query: 340 DLNTGTIQFVPENCAND 356
DL I F P NC D
Sbjct: 357 DLERSRIGFAPANCVMD 373
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 156/367 (42%), Gaps = 26/367 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP+ ++ DTGS ++W QC PC C+ QS P+F+P SS+Y + C
Sbjct: 128 EYFTKIGV--GTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCG 185
Query: 64 DLICRR-PPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR C+ G C++++ Y G+ +G TET TF + V V GC +
Sbjct: 186 AALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGAR---VARVALGCGH 242
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-------YAYREMEATSIL 173
DN LG F Q+ FSYCLV A +S +
Sbjct: 243 DNEGLFVAAAGLLGLGRGGLSFPT--QISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTV 300
Query: 174 RFGKDANIQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGC 230
FG + T + R + YY+ L ISV R+ G A L G GG
Sbjct: 301 SFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGV 360
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDS-RFRAYASM 288
++D+G T + R Y + F + G R+ ++ CY R ++
Sbjct: 361 IVDSGTSVTRLARASYSALRDAF--RAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTV 418
Query: 289 TFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTI 346
+ HF P Y I + G FC A + +D S++G QQQ R V+D + +
Sbjct: 419 SMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRV 478
Query: 347 QFVPENC 353
F P+ C
Sbjct: 479 GFAPKGC 485
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 164/382 (42%), Gaps = 50/382 (13%)
Query: 1 HEKNYFYTV-DVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKR 59
H + Y V + GTP + + D L+WTQC C CF Q P+F PNASST++
Sbjct: 36 HWSRHLYNVANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRP 95
Query: 60 IPCDDLICRRPPF-RCENGQCVH------RINYAGGASASGLVSTETFTFHLKNKLVCVP 112
PC C+ P C C + R++ + G+V TETF
Sbjct: 96 EPCGTDACKSTPTSNCSGDVCTYESTTNIRLDR---HTTLGIVGTETFAIGTATA----- 147
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
+ FGC + + DG +G +G +P SL+ Q+K T FSYCL + R +S
Sbjct: 148 SLAFGCVVASDIDTMDGT-SGFIGLGRTPRSLVAQMKLTK---FSYCL--SPRGTGKSSR 201
Query: 173 LRFGKDANIQRKDMKTIRMFV-----DRSSHYY-LSLQDISVADHRIGFAPGTFALRRNG 226
L G A + + + F+ D S HYY LSL I + I A
Sbjct: 202 LFLGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA---------Q 252
Query: 227 TGGCMI-DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
+GG ++ T + + + Y + E Q M + ++ C++ + F RA
Sbjct: 253 SGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRA 312
Query: 285 YAS-MTFHFDRADFKVEPTYMYFI--FQNEGYFCVAI---SFSDRN-----SVVGAWQQQ 333
A + F F A P Y I + + C AI ++ +R SV+G+ QQ+
Sbjct: 313 TAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQE 372
Query: 334 DTRFVYDLNTGTIQFVPENCAN 355
D F+YDL T+ F P +C++
Sbjct: 373 DVHFLYDLKKETLSFEPADCSS 394
>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 155/365 (42%), Gaps = 27/365 (7%)
Query: 3 KNYFYTVDVLFGTP--SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
+Y + V V GT + + L DT + W C PC +Q +F+P S T++ +
Sbjct: 62 SDYVHGVFVSIGTGQGGRRKILALDTAASTSWVMCEPCRPPLHQLGRLFSPAESPTFRGV 121
Query: 61 PCDDLICRRPPFRCENGQCVHRINYAGG-----ASASGLVSTETFTFHLKNKLVC--VPG 113
DD +C PP+ HR++ G SA G ++ +TF + V + G
Sbjct: 122 RRDDPVCV-PPY--------HRLHSTNGCSFAFPSAIGYLARDTFHLRHSERSVVKSISG 172
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
V FGC++ F + + G+L S SP S L Q S A G FSYCL + +
Sbjct: 173 VAFGCAHTTTGFYNEDILGGVLSLSPSPLSFLTQFGSRAGGRFSYCLPDPTTSHNPSGFI 232
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
+FG + + T + V +S Y+LSL IS+ + R+ + GC I+
Sbjct: 233 QFGIEVPSLPRHAHTTTLTVS-ASGYHLSLIGISLGNKRLDIDRHILT-----SHGCSIN 286
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
T I Y +V R G +++ + R RA +M FHF
Sbjct: 287 PAETITKIAEPAYIIVARELMAQMNELGSKQVKGPPSSPLVFNKISRRVRARLPNMVFHF 346
Query: 293 -DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
D D ++ + F V S R +V+GA QQ + RF++++ G + F E
Sbjct: 347 ADGGDMWFTAGKLFQVIGTTARFLVEGHGSHR-TVIGAAQQVNARFIFNVAAGRLTFAEE 405
Query: 352 NCAND 356
C+ +
Sbjct: 406 LCSRE 410
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 163/373 (43%), Gaps = 31/373 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS---APIFNPNASSTYKRIPCD 63
Y V++ GTP+K L+ DTGS L W QC P N S AP ++ ++SS+Y+ IPC
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 118
Query: 64 DLICRRPPFRCEN-------GQCVHRINYAGGASASGLVSTETFTF-----------HLK 105
D C+ P + C + Y+ + +G+++ ET + + K
Sbjct: 119 DDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHK 178
Query: 106 NKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA-QGLFSYCLVYAY 164
+ + + V GCS ++ SF G +G+LG P SL Q + TA G+FSYCLV
Sbjct: 179 TRRIRIKNVALGCSRESVGASFLG-ASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYL 237
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALR 223
R A+S L G+ + +R + S YY+++ ++V + G A + +
Sbjct: 238 RGSNASSFLVMGRTHWRKLAHTPIVRNPAAQ-SFYYVNVTGVAVDGKPVDGIASSDWGID 296
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
+G G + D+G ++++ Y V+ + S R E +E CY +
Sbjct: 297 GDGNKGTIFDSGTTLSYLREPAYSKVLGALN---ASIYLPRAQEIPEGFELCYNVTRMEK 353
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVYD 340
+ F P Y + E CVA ++ ++ ++++G QQD YD
Sbjct: 354 GMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDHHIEYD 413
Query: 341 LNTGTIQFVPENC 353
L I F C
Sbjct: 414 LAKARIGFKWSPC 426
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 155/355 (43%), Gaps = 19/355 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
+YF + V GTP++S +++ DTGS + W QC PC C+ Q PIFNP+ SS++K + C
Sbjct: 80 DYFARIGV--GTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACA 137
Query: 64 DLICRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
IC + + +C+++++Y G+ G STET +F V V GC +
Sbjct: 138 SSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEH----AVRSVAMGCGRN 193
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N+ LG F Q ++ +FSYCL RE + L FG A
Sbjct: 194 NQGLFHGAAGLLGLGRGPLSFP--SQTGTSYASVFSYCL--PRRESAIAASLVFGPSAVP 249
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
++ + ++YY+ L I VA + P FA+ GTGG ++D+G + +
Sbjct: 250 EKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRL 309
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD-RADFKV 299
Y + F T + ++ CY S + ++ FD A +
Sbjct: 310 TTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSSMKTATLPAVVLDFDGGASMPL 365
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ +EG +C+A + + S++G QQQ R D + P+ C
Sbjct: 366 PADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 162/373 (43%), Gaps = 31/373 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS---APIFNPNASSTYKRIPCD 63
Y V++ GTP+K L+ DTGS L W QC P N S AP ++ ++SS+Y+ IPC
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 86
Query: 64 DLICRRPPFRCEN-------GQCVHRINYAGGASASGLVSTETFTF-----------HLK 105
D C P + C + Y+ + +G+++ ET + + K
Sbjct: 87 DDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHK 146
Query: 106 NKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA-QGLFSYCLVYAY 164
+ + + V GCS ++ SF G +G+LG P SL Q + TA G+FSYCLV
Sbjct: 147 TRTIRIKNVALGCSRESVGASFLG-ASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYL 205
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALR 223
R A+S L G+ + +R + S YY+++ ++V + G A + +
Sbjct: 206 RGSNASSFLVMGRTRWRKLAHTPIVRNPAAQ-SFYYVNVTGVAVDGKPVDGIASSDWGID 264
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
+G G + D+G ++++ Y V+ + S R E +E CY +
Sbjct: 265 GDGNKGTIFDSGTTLSYLREPAYSKVLGALN---ASIYLPRAQEIPEGFELCYNVTRMEK 321
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVYD 340
+ F P Y + E CVA ++ ++ ++++G QQD YD
Sbjct: 322 GMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDHHIEYD 381
Query: 341 LNTGTIQFVPENC 353
L I F C
Sbjct: 382 LAKARIGFKWSPC 394
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 174/371 (46%), Gaps = 44/371 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY T+ G +++ ++ DTGS L W QC PC++C++Q P+FNP+ SS+Y + C+
Sbjct: 132 NYIVTI----GLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCN 187
Query: 64 DLICRRPPFRCENGQ---------CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
C+ F N + C H ++Y G+ G + E HL + V
Sbjct: 188 SSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVE----HLSFGGISVSNF 243
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC +N+ G ++GI+G S S++ Q +T G+FSYCL + A+ L
Sbjct: 244 VFGCGRNNKGLF--GGVSGIMGLGRSNLSMISQTNTTFGGVFSYCL--PTTDSGASGSLV 299
Query: 175 FGKDA----NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN--GTG 228
G ++ N+ ++ S+ Y L+L I V G A++ G G
Sbjct: 300 IGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDV---------GGVAIQDTSFGNG 350
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYAS 287
G +ID+G + T + Y + F + F+ + + A + C+ + +
Sbjct: 351 GILIDSGTVITRLAPSLYNALKAEFLKQFSGY---PIAPALSILDTCFNLTGIEEVSIPT 407
Query: 288 MTFHFD-RADFKVEPTYMYFIFQNEGYFCVAI-SFSDRN--SVVGAWQQQDTRFVYDLNT 343
++ HF+ D V+ + ++ ++ C+A+ S SD N +++G +QQ++ R +YD
Sbjct: 408 LSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQ 467
Query: 344 GTIQFVPENCA 354
I F E+C+
Sbjct: 468 SKIGFAREDCS 478
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 155/355 (43%), Gaps = 19/355 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
+YF + V GTP++S +++ DTGS + W QC PC C+ Q PIFNP+ SS++K + C
Sbjct: 13 DYFARIGV--GTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACA 70
Query: 64 DLICRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
IC + + +C+++++Y G+ G STET +F V V GC +
Sbjct: 71 SSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEH----AVRSVAMGCGRN 126
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N+ LG F Q ++ +FSYCL RE + L FG A
Sbjct: 127 NQGLFHGAAGLLGLGRGPLSFP--SQTGTSYASVFSYCL--PRRESAIAASLVFGPSAVP 182
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
++ + ++YY+ L I VA + P FA+ GTGG ++D+G + +
Sbjct: 183 EKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRL 242
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD-RADFKV 299
Y + F T + ++ CY S + ++ FD A +
Sbjct: 243 TTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSSMKTATLPAVVLDFDGGASMPL 298
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ +EG +C+A + + S++G QQQ R D + P+ C
Sbjct: 299 PADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 158/367 (43%), Gaps = 37/367 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V+V GTP K L+FDTGS L WTQC PCV +C+ Q PIF+P+AS TY I C
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTST 213
Query: 66 ICR-------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P C + CV+ I Y + G + +T T + G +FGC
Sbjct: 214 ACSGLKSATGNSP-GCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQND---VFDGFMFGC 269
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+NR G AG++G P S++ Q FSYCL + L FG
Sbjct: 270 GQNNRGLF--GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLP---TSRGSNGHLTFGNG 324
Query: 179 ANIQ-RKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
++ K +K F +S Y++ + ISV + +P F G +I
Sbjct: 325 NGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAGTII 379
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFH 291
D+G + T + Y + F + + + A + CY + + ++F+
Sbjct: 380 DSGTVITRLPSTVYGSLKSTFKQFMSKY---PTAPALSLLDTCYDLSNYTSISIPKISFN 436
Query: 292 FD-RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAW---QQQDTRFVYDLNTGTIQ 347
F+ A+ +EP + I C+A + + + +G + QQQ VYD+ G +
Sbjct: 437 FNGNANVDLEPNGI-LITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLG 495
Query: 348 FVPENCA 354
F + C+
Sbjct: 496 FGYKGCS 502
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 167/378 (44%), Gaps = 48/378 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V++ G+P ++ ++ DTGS L+W QCLPC+NCF QS F+P S ++K + C
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCG--- 160
Query: 67 CRRPPFRCENG-------QCVHRINYAGGASASGLVSTETFTFHLKNK------------ 107
P + NG Q +++ Y GG S+ G+++ E+ F ++
Sbjct: 161 --FPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQ 218
Query: 108 --LVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSP-FSLLGQLKSTAQGLFSYCLVYAY 164
+ + FGC + N + D G+ G P ++ QL + FSYC+
Sbjct: 219 ISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNK----FSYCIGDIN 274
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
+ + L G+ + I+ F HYY++LQ ISV + P F +
Sbjct: 275 NPLYTHNHLVLGQGSYIEGDSTPLQIHF----GHYYVTLQSISVGSKTLKIDPNAFKISS 330
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR--YDSRF 282
+G+GG +ID+G T + G +E++ + +R+ + C++
Sbjct: 331 DGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL-LERIPTQRKFEGLCFKGVVSRDL 389
Query: 283 RAYASMTFHF-DRADFKVEPTYMYFIFQNEG--YFCVAISFSDRN----SVVGAWQQQDT 335
+ ++TFHF AD +E + F+ G FC+AI S+ SV+G QQ+
Sbjct: 390 VGFPAVTFHFAGGADLVLESGSL---FRQHGGDRFCLAILPSNSELLNLSVIGILAQQNY 446
Query: 336 RFVYDLNTGTIQFVPENC 353
+DL + F +C
Sbjct: 447 NVGFDLEQMKVFFRRIDC 464
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 157/359 (43%), Gaps = 24/359 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + FGTP +S + + DTGS + W C PC C ++ P F P+ SSTY + C
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-FEPSKSSTYNYLTCASQQ 182
Query: 67 CR--RPPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ R + +N C Y + ++S+ET + + V +FGCSN R
Sbjct: 183 CQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQQ----VENFVFGCSNAAR 238
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
++GF +P S + Q + FSYCL + S+L GK+A +
Sbjct: 239 --GLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLL-LGKEA-LSA 294
Query: 184 KDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ +K + + S YY+ L ISV + + GT +L + G +ID+G + T +
Sbjct: 295 QGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRL 354
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF-DRADFKVE 300
Y + F ++ M + ++ ++ CY S + +T HF D D +
Sbjct: 355 VEPAYNAMRDSFRSQLSNL---TMASPTDLFDTCYNRPSGDVEFPLITLHFDDNLDLTLP 411
Query: 301 PTYMYFIFQNEG-YFCVAISF-----SDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + ++G C+A D S G +QQQ R V+D+ + ENC
Sbjct: 412 LDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 158/369 (42%), Gaps = 51/369 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP+ S+ LL DTGS L W QC PC C+ Q P+F+P+ SSTY IPC+
Sbjct: 120 YVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNT 179
Query: 65 LICR---RPPF--RCENG-----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
CR R + C +G QC + I Y G+ +G+ S ET T PGV
Sbjct: 180 DACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTM--------APGV 231
Query: 115 I-----FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA 169
FGC +D +D D G+LG +P SL+ Q S G FSYCL A +
Sbjct: 232 TVKDFHFGCGHD-QDGPND-KYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQ--- 286
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
L G N + T M ++ + Y +++ I+V I P F +GG
Sbjct: 287 AGFLALGAPVNDASGFVFT-PMVREQQTFYVVNMTGITVGGEPIDVPPSAF------SGG 339
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASM 288
+ID+G + T +Q Y + F + ++ + N D Y + S ++
Sbjct: 340 MIIDSGTVVTELQHTAYAALQAAFRKAMAAY--PLLPNGELDTCYNFTGHSNVTVPRVAL 397
Query: 289 TFHFD-RADFKVEPTYMY---FIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTG 344
TF D V + FQ G ++ ++G Q+ +YD+ G
Sbjct: 398 TFSGGATVDLDVPDGILLDNCLAFQEAGP-------DNQPGILGNVNQRTLEVLYDVGHG 450
Query: 345 TIQFVPENC 353
+ F + C
Sbjct: 451 RVGFGADAC 459
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 146/339 (43%), Gaps = 31/339 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS L WTQC PC +C+ Q P+F+P SSTY+ C
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 67 CR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDN 122
C + + +C R +YA G+ G +++ET T K V PG FGC + +
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
FD + +GI+G SL+ QLKST GLFSYCL+ + +S + FG +
Sbjct: 212 GGI-FDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVS 270
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
T+ + Y ++ G ++D+G TF+
Sbjct: 271 --GYGTVSTPLRLPYKGYSKKTEVE-------------------EGNIIVDSGTTYTFL- 308
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
P E + S +R+ + + + CY + A +T HF A+ +++P
Sbjct: 309 --PQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEINA-PIITAHFKDANVELQPL 365
Query: 303 YMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDL 341
+ Q E C ++ + V+G Q + +DL
Sbjct: 366 NTFMRMQ-EDLVCFTVAPTSDIGVLGNLAQVNFLVGFDL 403
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 158/373 (42%), Gaps = 48/373 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP + + D L+WTQC PC CF Q P+F+P SST++ +PC
Sbjct: 56 LYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSH 115
Query: 66 ICRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC--SN 120
+C P C + C++ G + G T+TF + + FGC
Sbjct: 116 LCESIPESSRNCTSDVCIYEAPTKAGDTG-GKAGTDTFAIGAAKETLG-----FGCVVMT 169
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
D R G +GI+G +P+SL+ Q+ TA FSYCL +++ L G A
Sbjct: 170 DKR-LKTIGGPSGIVGLGRTPWSLVTQMNVTA---FSYCLAG-----KSSGALFLGATAK 220
Query: 181 IQRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
+ FV ++S YY+ + +A + G AP A T +
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYM----VKLAGIKTGGAPLQAASSSGST--VL 274
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH 291
+DT + A+++ G Y+ + + + G Q + + + ++ C+ A + F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTA---AVGVQPVASPPKPYDLCFPKAVAGDA-PELVFT 330
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFS---------DRNSVVGAWQQQDTRFVYDLN 342
FD P Y + G C+ I S + S++G+ QQ++ ++DL
Sbjct: 331 FDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLK 390
Query: 343 TGTIQFVPENCAN 355
T+ F P +C++
Sbjct: 391 EETLSFKPADCSS 403
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 160/368 (43%), Gaps = 38/368 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV++ ++ ++ DTGS L W QC PC C+NQ P+FNP+ S +Y+ I C+
Sbjct: 66 NYIVTVEI----GGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILCN 121
Query: 64 DLICRRPPFRCEN--------GQCVHRINYAGGASASGLVSTETF---TFHLKNKLVCVP 112
C+ + N C + +NY G+ G + E T H+ N
Sbjct: 122 SSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSN------ 175
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
IFGC +N+ G +G++G S SL+ Q + +G+FSYCL + + I
Sbjct: 176 -FIFGCGRNNKGLF--GGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLI 232
Query: 173 LRFGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L + RM + + Y+L+L IS+ G A R++G
Sbjct: 233 LGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIG----GVALQAPNYRQSGI--- 285
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
+ID+G + T + Y + F + F+ F + + YD M F
Sbjct: 286 LIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQF 345
Query: 291 HFDRADFKVEPTYM-YFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
A+ V+ T + YF+ + C+A +SF D ++G +QQ++ R +Y+ +
Sbjct: 346 E-GNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKL 404
Query: 347 QFVPENCA 354
F E C+
Sbjct: 405 GFAAEACS 412
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 163/375 (43%), Gaps = 49/375 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
+Y + GTP + + D L+WTQC C CF Q P+F PNASST+K PC
Sbjct: 44 YYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 103
Query: 66 ICRRPPFR-CENGQCVHRINYAGGAS-----ASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C P R C C +Y G + SG +T+TF + FGC
Sbjct: 104 VCESIPTRSCSGDVC----SYKGPPTQLRGNTSGFAATDTFAIGTATVRLA-----FGCV 154
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ + DG +G +G +P+SL+ Q+K T FSYCL + R +S L G A
Sbjct: 155 VASDIDTMDGP-SGFIGLGRTPWSLVAQMKLT---RFSYCL--SPRNTGKSSRLFLGSSA 208
Query: 180 NIQRKDMKTIRMFV------DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI- 232
+ + + F+ D S++Y LSL I + I A +GG ++
Sbjct: 209 KLAGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIATA---------QSGGILVM 259
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYAS-MTF 290
T + + + Y+ + E M + ++ C++ + F RA A + F
Sbjct: 260 HTVSPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVF 319
Query: 291 HFDRADFKVEPTYMYFI--FQNEGYFCVAI---SFSDRN-----SVVGAWQQQDTRFVYD 340
F A P Y I + + C AI ++ +R SV+G+ QQ+D F+YD
Sbjct: 320 TFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYD 379
Query: 341 LNTGTIQFVPENCAN 355
L T+ F P +C++
Sbjct: 380 LKKETLSFEPADCSS 394
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 153/357 (42%), Gaps = 32/357 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP ++ L DTGS L W QC PC C++Q P+F+P SS+Y +PC
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGG 199
Query: 65 LICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+C C QC + ++Y G+ +G+ S++T T + V G FGC +
Sbjct: 200 PVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDA---VRGFFFGCGHA 256
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
F+ GN G+LG SL+ Q T G+FSYCL T L G +
Sbjct: 257 QSGFT--GN-DGLLGLGREEASLVEQTAGTYGGVFSYCLP---TRPSTTGYLTLGGPSGA 310
Query: 182 QRKDMKTIRMFV--DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
T ++ + +++Y + L ISV ++ FA GG ++DTG + T
Sbjct: 311 APPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFA------GGTVVDTGTVIT 364
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ Y + F S+G A+ + CY F Y ++T F
Sbjct: 365 RLPPTAYAALRSAFRSGMASYGYPSAP-ATGILDTCY----NFSGYGTVTLPNVALTFSG 419
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
T + C+A + S + +++G QQ+ F ++ ++ F P +C
Sbjct: 420 GATVTLGADGILSFGCLAFAPSGSDGGMAILGNVQQRS--FEVRIDGTSVGFKPSSC 474
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 162/380 (42%), Gaps = 38/380 (10%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRI 60
++ Y V + GTP ++ +LFDTGS L W QCLPC + C+ Q P+F+P+ SSTY +
Sbjct: 118 QSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDV 177
Query: 61 PCDDLICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV-CVPGVIF 116
PC C RC C + + Y + G ++ ETFT + L GV+F
Sbjct: 178 PCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237
Query: 117 GCSNDNRDFSFDG--NIAGILGFSVSPFSLLGQLK---STAQGLFSYCLVYAYREMEATS 171
GCS++ D +AG+LG S+L Q + ++ G+FSYCL +T
Sbjct: 238 GCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLP---PRGSSTG 294
Query: 172 ILRFGKDANIQRKD------MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
L G A ++ I S Y ++L +SV + F+L
Sbjct: 295 YLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL--- 351
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
G +ID+G + T + Y + F H S+ + + + + CY +
Sbjct: 352 ---GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSY-KMLPEGSMKLLDTCYDVTGQDVVT 407
Query: 286 AS-MTFHF-DRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNS----VVGAWQQQD 334
A + F A V+ + + + E ++F NS +VG QQ+
Sbjct: 408 APRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRA 467
Query: 335 TRFVYDLNTGTIQFVPENCA 354
V+D++ G I F P C+
Sbjct: 468 YNVVFDVDGGRIGFGPNGCS 487
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 162/369 (43%), Gaps = 41/369 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV++ + ++ DTGS L W QC PC C+NQ P+FNP+ S +Y+ + C+
Sbjct: 65 NYIVTVEL----GGRKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCN 120
Query: 64 DLICRRPPFRCENG--------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
L CR N C + +NY G+ SG V E HL V I
Sbjct: 121 SLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGME----HLNLGNTTVNNFI 176
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC N+ G +G++G + SL+ Q+ G+FSYCL E EA+ L
Sbjct: 177 FGCGRKNQGLF--GGASGLVGLGRTDLSLISQISPMFGGVFSYCL--PTTEAEASGSLVM 232
Query: 176 GKDANIQRKDMKTIRMFVDRSSH------YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
G ++++ + T + R H Y+L+L I+V + AP +F R
Sbjct: 233 GGNSSVYK---NTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQ-AP-SFGKDR----- 282
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMT 289
+ID+G + + + Y+ + F + F+ + + Y M
Sbjct: 283 MIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMY 342
Query: 290 FHFDRADFKVEPTYMYFIFQNEG-YFCVAIS---FSDRNSVVGAWQQQDTRFVYDLNTGT 345
F A+ V+ T +++ + + C+AI+ + D ++G +QQ++ R +YD
Sbjct: 343 FE-GSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSM 401
Query: 346 IQFVPENCA 354
+ F E C+
Sbjct: 402 LGFAEEACS 410
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 161/364 (44%), Gaps = 39/364 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP + L+FDTGSYL WTQC PC +C+ Q PIF+P+ SS+Y I C
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSS 199
Query: 66 ICRRPPFRCE------NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C + FR + C++ + Y + + G +S E T + V +FGC
Sbjct: 200 LCTQ--FRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD---IVHDFLFGCG 254
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK-- 177
DN F G AG++G S P S + Q S +FSYCL + L FG
Sbjct: 255 QDNEGL-FRGT-AGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPSSLGH---LTFGASA 309
Query: 178 --DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDT 234
+AN++ TI +S Y L + ISV ++ + TF+ GG +ID+
Sbjct: 310 ATNANLKYTPFSTIS---GENSFYGLDIVGISVGGTKLPAVSSSTFS-----AGGSIIDS 361
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDR 294
G + T + Y + F + + + + + CY + S ++ + F+
Sbjct: 362 GTVITRLPPTAYAALRSAFRQFMMKY---PVAYGTRLLDTCYDF-SGYKEISVPRIDFEF 417
Query: 295 A-DFKVE-PTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFV 349
A KVE P ++ C+A + + ++ G QQ+ VYD+ G I F
Sbjct: 418 AGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFG 477
Query: 350 PENC 353
C
Sbjct: 478 AAGC 481
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 104/352 (29%), Positives = 163/352 (46%), Gaps = 21/352 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G P +++ DTGS + W QC PC C+ QS PIF+P +S++Y I CD
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAPQ 208
Query: 67 CRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C+ C NG C++ ++Y G+ G +TET T V V GC ++N
Sbjct: 209 CKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL----GTAAVENVAIGCGHNNEGL 264
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
AG+LG S Q+ +T+ FSYCLV R+ +A S L F ++ + R
Sbjct: 265 FV--GAAGLLGLGGGKLSFPAQVNATS---FSYCLV--NRDSDAVSTLEF--NSPLPRNV 315
Query: 186 MKT-IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ +R + + YYL L+ ISV + F + G GG +ID+G T ++
Sbjct: 316 VTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSE 375
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTY 303
Y+ + F + + N ++ CY SR +++FHF P
Sbjct: 376 VYDALRDAFVKGAKGIPKA---NGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPAR 432
Query: 304 MYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y I + G FC A + + + S++G QQQ TR +D+ + F ++C
Sbjct: 433 NYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 162/357 (45%), Gaps = 32/357 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
+ V V FG+P+++ L DTGS + W QCLPC +C+ Q P+F+P S+TY +PC
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGHP 220
Query: 66 ICRRPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR- 123
C +C N G C++++ Y G+S +G++S ET + L PG FGC N
Sbjct: 221 QCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDL---PGFAFGCGQTNLG 277
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL-----VYAYREMEATSILRFGKD 178
+F + G++G SL Q +T FSYCL + Y M +T+ D
Sbjct: 278 EFG---GVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDTTHGYLTMGSTTPAASNDD 334
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
++Q M D S Y++ + I + + + P F R+GT + D+G I
Sbjct: 335 DDVQYTAMIQKE---DYPSLYFVEVVSIDIGGYILPVPPTVFT--RDGT---LFDSGTIL 386
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY-ASMTFHF-DRAD 296
T++ Y + F T + + A + ++ CY + + ++ F F D A
Sbjct: 387 TYLPPEAYASLRDRFKFTMTQY---KPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDGAV 443
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNS-----VVGAWQQQDTRFVYDLNTGTIQF 348
F + P + + ++F R S ++G QQ+ T +YD+ I F
Sbjct: 444 FDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGF 500
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 162/375 (43%), Gaps = 49/375 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
+Y + GTP + + D L+WTQC C CF Q P+F PNASST+K PC
Sbjct: 61 YYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 120
Query: 66 ICRRPPFR-CENGQCVHRINYAGGAS-----ASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C P R C C +Y G + SG +T+TF + FGC
Sbjct: 121 VCESIPTRSCSGDVC----SYKGPPTQLRGNTSGFAATDTFAIGTATVRLA-----FGCV 171
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ + DG +G +G +P+SL+ Q+K T FSYCL + R +S L G A
Sbjct: 172 VASDIDTMDGP-SGFIGLGRTPWSLVAQMKLT---RFSYCL--SPRNTGKSSRLFLGSSA 225
Query: 180 NIQRKDMKTIRMFV-----DRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMI- 232
+ + + F+ D S HYY LSL I + I A +GG ++
Sbjct: 226 KLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA---------QSGGILVM 276
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYAS-MTF 290
T + + + Y + E M + ++ C++ + F RA A + F
Sbjct: 277 HTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVF 336
Query: 291 HFDRADFKVEPTYMYFI--FQNEGYFCVAI---SFSDRN-----SVVGAWQQQDTRFVYD 340
F A P Y I + + C AI ++ +R SV+G+ QQ+D F+YD
Sbjct: 337 TFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYD 396
Query: 341 LNTGTIQFVPENCAN 355
L T+ F P +C++
Sbjct: 397 LKKETLSFEPADCSS 411
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 168/380 (44%), Gaps = 52/380 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV + G + ++ DT S L W QC PC +C +Q P+F+P++S +Y +PCD
Sbjct: 142 NYVATVGLGGGEAT----VIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCD 197
Query: 64 DLICR-------------RPPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNK 107
C PP C+ G+ C + ++Y G+ + G+++ + + +
Sbjct: 198 SPSCDALQQQLATGAGAGAPP--CDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE-- 253
Query: 108 LVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM 167
+ G +FGC N+ F G +G++G S SL+ Q G+FSYCL + RE
Sbjct: 254 --VIDGFVFGCGTSNQGPPF-GGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLS-RES 309
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSH-------YYLSLQDISVADHRIGFAPGTF 220
+A+ L G D + R + + +S Y ++L I+V + F
Sbjct: 310 DASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVEST--GF 367
Query: 221 ALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS 280
+ R ++D+G + T + Y V F + + + + C+
Sbjct: 368 SAR------AIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSI---LDTCFNMTG 418
Query: 281 -RFRAYASMTFHFD-RADFKVEP-TYMYFIFQNEGYFCVAIS---FSDRNSVVGAWQQQD 334
+ S+T FD A+ +V+ +YF+ + C+A++ D S++G +QQ++
Sbjct: 419 LKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKN 478
Query: 335 TRFVYDLNTGTIQFVPENCA 354
R V+D + + F E C
Sbjct: 479 LRVVFDTSASQVGFAQETCG 498
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 160/386 (41%), Gaps = 42/386 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYK 58
H Y + L G P + + DTGS LIWTQC C CF+Q+ ++P+ S T +
Sbjct: 65 HWAESQYIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTAR 124
Query: 59 RIPCDDLICR-RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+ C+D C RC +N C Y G G++ TE FTF +++ V +
Sbjct: 125 PVACNDTACALGSETRCARDNKACAVLTAYGAGV-IGGVLGTEAFTFQPQSENV---SLA 180
Query: 116 FGCSNDNR--DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
FGC R S DG +GI+G SL+ QL FSYCL + + TS L
Sbjct: 181 FGCIAATRLTPGSLDG-ASGIIGLGRGNLSLVSQLGDNK---FSYCLTPYFSQSTNTSRL 236
Query: 174 RFGKDANIQRKDMKTIRMF------VDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G A + + VD S+ YYL L I+V D ++ F LR+
Sbjct: 237 FVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVA 296
Query: 227 TG---GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRM--HNASEDWEYC--YRYD 279
TG G +ID+G+ T + Y+ + DE G + +E + C +
Sbjct: 297 TGLWAGTLIDSGSPFTSLVDVAYQALR---DELVQQLGASIVPPPAGAEGLDLCAAVAHG 353
Query: 280 SRFRAYASMTFHFDR--ADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--------VVGA 329
+ + HF D V P + + V S NS ++G
Sbjct: 354 DVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGN 413
Query: 330 WQQQDTRFVYDLNTGTIQFVPENCAN 355
+ QQD +YDL G + F P +C++
Sbjct: 414 YMQQDMHLLYDLEKGMLSFQPADCSS 439
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 151/356 (42%), Gaps = 28/356 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q P+F+P SSTY + C D
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDS 222
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ G + +T T + G FGC N
Sbjct: 223 ACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD----AIKGFRFGCGEKNNG 278
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG++G SL Q + G F+YCL T L FG +
Sbjct: 279 LF--GKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLP---ALTTGTGYLDFGPGS--AGN 331
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + M D+ + YY+ + I V ++ A F+ T G ++D+G + T +
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPA 386
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA---YASMTFHFDRADFKVE 300
Y + FD+ + G ++ S + CY + S+ F A V+
Sbjct: 387 TAYTALSSAFDKVMLARGYKKAPGYSI-LDTCYDFTGLSDVELPTVSLVFQ-GGACLDVD 444
Query: 301 PTYMYFIFQNEGYFCVA-ISFSDRNSV--VGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + +E C+A S D SV VG QQ+ +YDL T+ F P +C
Sbjct: 445 VSGIVYAI-SEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 159/390 (40%), Gaps = 57/390 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP DT S LIWTQC PCV C+ Q P+FNP AS++Y +PC+
Sbjct: 88 YLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDT 147
Query: 67 CRR-PPFRC-------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C RC + C + +Y G A+ G+++ + GV+FGC
Sbjct: 148 CDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDD----VFRGVVFGC 203
Query: 119 SNDNRDFSFDG---NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
S+ S G ++G++G SL+ QL F YCL +L
Sbjct: 204 SSS----SVGGPPPQVSGVVGLGRGALSLVSQLSVRR---FMYCLPPPVSRSAGRLVLGA 256
Query: 176 GKDANIQRKDMKTIRMFVDRS---SHYYLSLQDISVADHRIGF---------APGTFA-- 221
A ++ + + S S+YYL+L IS+ D + F PGT A
Sbjct: 257 DAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGA 316
Query: 222 --------------LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN 267
G +ID + TF++ YE ++ +E R
Sbjct: 317 PASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIR---LPRGSG 373
Query: 268 ASEDWEYCYRYDSRF---RAYA-SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR 323
+ + C+ R YA ++ F+ +++ M+ + G C+ + +D
Sbjct: 374 SDLGLDLCFILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCLMVGKTDG 433
Query: 324 NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
S++G +QQQ+ + +Y+L G I F+ C
Sbjct: 434 VSILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 164/376 (43%), Gaps = 48/376 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIP 61
NY T+ L G +K+ ++ DTGS L W QC PC +C+ Q P+F+P AS T+ +P
Sbjct: 179 NYVTTI-ALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVP 237
Query: 62 CDDLICRR-------PPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKL 108
C C P C +C + ++Y G+ + G+++ +T KL
Sbjct: 238 CGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTTKL 297
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
G +FGC NR G AG++G + SL+ Q + G+FSYCL
Sbjct: 298 ---DGFVFGCGLSNRGLF--GGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPA---TTT 349
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNG 226
+T L G + +M RM D + Y++++ +V APG G
Sbjct: 350 STGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGF------G 403
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW---EYCYRYDSRFR 283
G ++D+G + T + Y+ V F F + A+ + + CY R
Sbjct: 404 AGNVLVDSGTVITRLAPSVYKAVRAEFARRFE-------YPAAPGFSILDACYDLTGRDE 456
Query: 284 AYAS-MTFHFD-RADFKVEPTYMYFIFQNEG-YFCVAIS---FSDRNSVVGAWQQQDTRF 337
+T + A V+ M F+ + +G C+A++ + D+ ++G +QQ++ R
Sbjct: 457 VNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRV 516
Query: 338 VYDLNTGTIQFVPENC 353
VYD + F E+C
Sbjct: 517 VYDTVGSRLGFADEDC 532
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 169/370 (45%), Gaps = 44/370 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPC 62
NYF V + GTP + L+FDTGS L WTQC PC +C+ Q IF+P+ SS+Y I C
Sbjct: 135 NYFVVVGL--GTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITC 192
Query: 63 DDLICRR-----PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+C + RC + C++ I Y +++ G +S E T + V +
Sbjct: 193 TSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD---IVDDFL 249
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC DN F G+ AG++G P S + Q S +FSYCL + L F
Sbjct: 250 FGCGQDNEGL-FSGS-AGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTSSSLGH---LTF 304
Query: 176 GK----DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGC 230
G +AN++ + TI ++ Y L + ISV ++ + TF+ GG
Sbjct: 305 GASAATNANLKYTPLSTIS---GDNTFYGLDIVGISVGGTKLPAVSSSTFS-----AGGS 356
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASM 288
+ID+G + T + Y + F + G ++ A+ED ++ CY + S ++ +
Sbjct: 357 IIDSGTVITRLAPTAYAALRSAFRQ-----GMEKYPVANEDGLFDTCYDF-SGYKEISVP 410
Query: 289 TFHFDRA-DFKVE-PTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNT 343
F+ A VE P I ++ C+A + + + ++ G QQ+ VYD+
Sbjct: 411 KIDFEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEG 470
Query: 344 GTIQFVPENC 353
G I F C
Sbjct: 471 GRIGFGAAGC 480
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 151/360 (41%), Gaps = 29/360 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP L+FDTGS L WTQC PCV C++Q PIFNP+ S++Y + C
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 192
Query: 66 ICRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C C C++ I Y + + G ++ + FT + GV FGC
Sbjct: 193 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDVF---DGVYFGCG 249
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+N+ F G +AG+LG S Q + +FSYCL T L FG
Sbjct: 250 ENNQGL-FTG-VAGLLGLGRDKLSFPSQTATAYNKIFSYCLP---SSASYTGHLTFGSAG 304
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ I D +S Y L++ I+V ++ F+ T G +ID+G + T
Sbjct: 305 ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVIT 359
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD-RADF 297
+ Y + F + + + + C+ + + F F A
Sbjct: 360 RLPPKAYAALRSSFKAKMSKY---PTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 416
Query: 298 KVEPTYMYFIFQNEGYFCVAISFS--DRNSVV-GAWQQQDTRFVYDLNTGTIQFVPENCA 354
++ +++ F+ C+A + + D N+ + G QQQ VYD G + F P C+
Sbjct: 417 ELGSKGIFYAFKIS-QVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 168/387 (43%), Gaps = 51/387 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD-- 64
Y +DV GTP + ++ DTGS L W QC PC++CF Q P+F+P ASS+Y+ + C D
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHR 210
Query: 65 ---------------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV 109
CRRP C + Y ++ +G ++ E+FT +L
Sbjct: 211 CGHVAPPPEPEASSPRTCRRP----GEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 266
Query: 110 C--VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM 167
V GV+FGC + NR LG F+ QL++ FSYCLV ++
Sbjct: 267 SRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFA--SQLRAVYGHTFSYCLVDHGSDV 324
Query: 168 EATSILRFGKD-------ANIQRKDMK---TIRMFVDRSSHYYLSLQDISVADHRIGFAP 217
+ + FG+D A+ Q K + YY+ L+ + V + +
Sbjct: 325 GSKVV--FGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISS 382
Query: 218 GTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY--- 274
T+ + ++G+GG +ID+G ++ Y+V+ F + + R + ++
Sbjct: 383 DTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMS-----RSYPLVPEFPVLSP 437
Query: 275 CYRYDSRFRA-YASMTFHF-DRA--DFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVG 328
CY R ++ F D A DF E ++ C+A+ + R S++G
Sbjct: 438 CYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSIIG 497
Query: 329 AWQQQDTRFVYDLNTGTIQFVPENCAN 355
+QQQ+ VYDL + F P CA
Sbjct: 498 NFQQQNFHVVYDLQNNRLGFAPRRCAE 524
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 155/359 (43%), Gaps = 31/359 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP K L+FDTGS L WTQC PC C+ Q P +P S++YK I C
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSA 192
Query: 66 ICR----RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C+ C + C++++ Y G+ + G +TET T N +FGC
Sbjct: 193 FCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCGQQ 249
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N F G AG+LG + SL Q + LFSYCL + + L FG +
Sbjct: 250 NSGL-FRG-AAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPAS---SSSKGYLSFGGQVS- 303
Query: 182 QRKDMKTIRMFVD-RSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
K +K + D +S+ +Y L + ++SV +++ F+ T G +ID+G + T
Sbjct: 304 --KTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFS-----TSGTVIDSGTVIT 356
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ Y + F + T + ++ ++ CY + + ++
Sbjct: 357 RLPSTAYSALSSAFQKLMTDYPSTDGYSI---FDTCYDFSKNETIKIPKVGVSFKGGVEM 413
Query: 300 EPTYMYFIFQNEGYFCVAISFSD-----RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ ++ G V ++F+ + ++ G QQ+ + VYD G + F P C
Sbjct: 414 DIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 103/365 (28%), Positives = 154/365 (42%), Gaps = 22/365 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+ L DT S L W QC PC C+ QS P+F+P S++Y + D
Sbjct: 134 YMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPD 193
Query: 67 C----RRPPFRCENGQCVHRINYAGG----ASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C R + G C++ + Y G +++ G + ET TF + + GC
Sbjct: 194 CQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLS---IGC 250
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA-QGLFSYCLV-YAYREMEATSILRFG 176
+DN+ F AGILG S+ Q+ FSYCLV + +S L FG
Sbjct: 251 GHDNKGL-FGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFG 309
Query: 177 KDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRI-GFAPGTFALRR-NGTGGCMI 232
A ++++ + YY+ L +SV R+ G L G GG ++
Sbjct: 310 AGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVIL 369
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFH 291
D+G T + R Y F TS G+ S ++ CY R +++ H
Sbjct: 370 DSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSMH 429
Query: 292 F-DRADFKVEPTYMYFIFQNEGYFCVAISFS-DRN-SVVGAWQQQDTRFVYDLNTGTIQF 348
F + ++P + G C A + + DR+ SV+G QQ R VYDL + F
Sbjct: 430 FAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQRVGF 489
Query: 349 VPENC 353
P NC
Sbjct: 490 APNNC 494
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 151/356 (42%), Gaps = 28/356 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q P+F+P SSTY + C D
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDS 222
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ G + +T T + G FGC N
Sbjct: 223 ACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD----AIKGFRFGCGEKNNG 278
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG++G SL Q + G F+YCL T L FG +
Sbjct: 279 LF--GKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLP---ALTTGTGYLDFGPGS--AGN 331
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + M D+ + YY+ + I V ++ A F+ T G ++D+G + T +
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPA 386
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA---YASMTFHFDRADFKVE 300
Y + FD+ + G ++ S + CY + S+ F A V+
Sbjct: 387 TAYTALSSAFDKVMLARGYKKAPGYSI-LDTCYDFTGLSDVELPTVSLVFQ-GGACLDVD 444
Query: 301 PTYMYFIFQNEGYFCVA-ISFSDRNSV--VGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + +E C+A S D SV VG QQ+ +YDL T+ F P +C
Sbjct: 445 VSGIVYAI-SEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 158/362 (43%), Gaps = 23/362 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + G P + DTGS LIW QC PC C+ Q++PIF+P SS+Y+ + C +
Sbjct: 93 YLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEF 152
Query: 67 CRRPPFRCEN-------GQCVHRINYAGGASASGLVSTETFTFHLKNK-----LVCVPGV 114
C + + C + +Y + + G ++ E F N + V
Sbjct: 153 CNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQEV 212
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC N +FD +GI+G SL+ QL G FSYCLV + TS +
Sbjct: 213 AFGCGTKNGG-TFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSKIN 271
Query: 175 FGKDANIQRKD---MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
FG D NI + + T + ++YYL+L+ ISV + R+ + G +
Sbjct: 272 FGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPYT--NLWNGEVEKGNII 329
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH 291
ID+G TF+ + + +E +R+ + + C++ D + +T H
Sbjct: 330 IDSGTTLTFLDSEFFNNLDSAVEEAVKG---ERVSDPHGLFNICFK-DEKAIELPIITAH 385
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
F AD +++P + + E C + S+ ++ G Q + YDL + F+P
Sbjct: 386 FTGADVELQPVNTFAKVE-EDLLCFTMIPSNDIAIFGNLAQMNFLVGYDLEKKAVSFLPT 444
Query: 352 NC 353
+C
Sbjct: 445 DC 446
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 174/370 (47%), Gaps = 47/370 (12%)
Query: 9 VDVLFGTP-SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP---IFNPNASSTYKRIPCDD 64
+++ GTP +++ L D SY +W QC PC P F PN S+T+ +PC
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149
Query: 65 LIC----------------RRPPFRCENGQCVHRINYAGGAS-ASGLVSTETFTFHLKNK 107
+C RC++ + + Y G A+ SG ++T+TFTF
Sbjct: 150 DMCLPVLRETCGRAGAAANATAGARCDS----YSLTYGGSAANTSGYLATDTFTFGA--- 202
Query: 108 LVCVPGVIFGCSNDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY--AY 164
VPGV+FGCS+ + DF+ +G++G SL+ QL+ G FSY L+ A
Sbjct: 203 -TAVPGVVFGCSDASYGDFA---GASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEAT 255
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAP-GTFA 221
+ A S++RFG DA + K ++ + YY++L + V +R+ P GTF
Sbjct: 256 DDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFD 315
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDS 280
LR NGTGG ++ + T++++ Y+VV G ++ +A+ + + CY S
Sbjct: 316 LRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASR---IGLPAVNGSAALELDLCYNASS 372
Query: 281 RFRA-YASMTFHFD-RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFV 338
+ +T FD AD + ++I + G C+ + S SV+G Q T +
Sbjct: 373 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMI 432
Query: 339 YDLNTGTIQF 348
YD++ G + F
Sbjct: 433 YDVDAGRLTF 442
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 174/370 (47%), Gaps = 47/370 (12%)
Query: 9 VDVLFGTP-SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP---IFNPNASSTYKRIPCDD 64
+++ GTP +++ L D SY +W QC PC P F PN S+T+ +PC
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149
Query: 65 LIC----------------RRPPFRCENGQCVHRINYAGGAS-ASGLVSTETFTFHLKNK 107
+C RC++ + + Y G A+ SG ++T+TFTF
Sbjct: 150 DMCLPVLRETCGRAGAAANATAGARCDS----YSLTYGGSAANTSGYLATDTFTFGA--- 202
Query: 108 LVCVPGVIFGCSNDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY--AY 164
VPGV+FGCS+ + DF+ +G++G SL+ QL+ G FSY L+ A
Sbjct: 203 -TAVPGVVFGCSDASYGDFA---GASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEAT 255
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAP-GTFA 221
+ A S++RFG DA + K ++ + YY++L + V +R+ P GTF
Sbjct: 256 DDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFD 315
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDS 280
LR NGTGG ++ + T++++ Y+VV G ++ +A+ + + CY S
Sbjct: 316 LRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASR---IGLPAVNGSAALELDLCYNASS 372
Query: 281 RFRA-YASMTFHFD-RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFV 338
+ +T FD AD + ++I + G C+ + S SV+G Q T +
Sbjct: 373 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMI 432
Query: 339 YDLNTGTIQF 348
YD++ G + F
Sbjct: 433 YDVDAGRLTF 442
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/259 (33%), Positives = 123/259 (47%), Gaps = 23/259 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP + L DTGS L+WTQC PC +CF+Q P+ +P ASSTY +PC
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPCGAPR 145
Query: 67 CRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTF---HLKNKLVCVPG---VIFGCS 119
CR PF C CV+ +Y + G ++T+ FTF +N +P + FGC
Sbjct: 146 CRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTFGCG 205
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ N+ F N GI GF +SL QL +T+ FSYC + + L A
Sbjct: 206 HFNKGV-FQSNETGIAGFGRGRWSLPSQLNATS---FSYCFTSMFDSKSSIVTLGGAPAA 261
Query: 180 ---NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ +++T +F + S S Y+LSL+ ISV R+ F +ID+
Sbjct: 262 LYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR-------STIIDS 314
Query: 235 GAIATFIQRGPYEVVMRHF 253
GA T + YE V F
Sbjct: 315 GASITTLPEEVYEAVKAEF 333
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/369 (28%), Positives = 154/369 (41%), Gaps = 50/369 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + FGTPS + LL DTGS + W QC PC C+ Q P+F+P+ SSTY I C
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGA 184
Query: 65 LICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI--- 115
C + NG QC +R+ Y G+S G+ S ET TF PG+
Sbjct: 185 DACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITF--------APGITVKD 236
Query: 116 --FGCSNDNRDFS--FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
FGC +D R S FD G+LG +P SL+ Q S G FSYCL E +
Sbjct: 237 FHFGCGHDQRGPSDKFD----GLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLA 292
Query: 172 I-LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
+ +R N + ++ Y +++ ISV + F GG
Sbjct: 293 LGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAF------RGGM 346
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
+ID+G I T + Y + + F ++ ASED++ CY F Y+++T
Sbjct: 347 LIDSGTIVTELPETAYNALNAALRKAFAAYPMV----ASEDFDTCY----NFTGYSNVTV 398
Query: 291 HFDRADFKVEPTYMYFIFQNEGYF---CVAISFSDRN---SVVGAWQQQDTRFVYDLNTG 344
F T + G C+A S + ++G Q+ +YD G
Sbjct: 399 PRVALTFSGGATIDLDV--PNGILVKDCLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHG 456
Query: 345 TIQFVPENC 353
+ F C
Sbjct: 457 KVGFRAGAC 465
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 158/371 (42%), Gaps = 45/371 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V+V GTP K L+FDTGS L WTQC PCV +C+ Q PIF+P+ S TY I C
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSA 213
Query: 66 ICRRPPFR------CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVP-----GV 114
C C + CV+ I Y S+ T F K+KL G
Sbjct: 214 ACSSLKSATGNSPGCSSSNCVYGIQYGD--------SSFTIGFFAKDKLTLTQNDVFDGF 265
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC +N+ G AG++G P S++ Q FSYCL + L
Sbjct: 266 MFGCGQNNKGLF--GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLP---TSRGSNGHLT 320
Query: 175 FGKDANIQR----KDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
FG ++ K+ T F +++Y++ + ISV + +P F
Sbjct: 321 FGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ-----NA 375
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYAS 287
G +ID+G + T + Y + F + + + A + CY + +
Sbjct: 376 GTIIDSGTVITRLPSTAYGSLKSAFKQFMSKY---PTAPALSLLDTCYDLSNYTSISIPK 432
Query: 288 MTFHFD-RADFKVEPTYMYFIFQNEGYFCVAISFS---DRNSVVGAWQQQDTRFVYDLNT 343
++F+F+ A+ +++P + I C+A + + D + G QQQ VYD+
Sbjct: 433 ISFNFNGNANVELDPNGI-LITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAG 491
Query: 344 GTIQFVPENCA 354
G + F + C+
Sbjct: 492 GQLGFGYKGCS 502
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 150/351 (42%), Gaps = 34/351 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P +S++++ D+GS ++W QC PC C++QS P+F+P S+++ + C +
Sbjct: 201 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSV 260
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C R C G+C + ++Y G+ G ++ ET TF V V GC + NR
Sbjct: 261 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRSVAIGCGHRNRGM 316
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
AG+LG S +GQL G FSYCLV A ++R +
Sbjct: 317 FV--GAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSA----AWVPLVRNPR-------- 362
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
S YY+ L + V R+ + F L G GG ++DTG T +
Sbjct: 363 ---------APSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLA 413
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
Y+ F + R ++ CY +++F+F P
Sbjct: 414 YQAFRDAFLAQTANLPRA---TGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARN 470
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ I + G FC A + S S++G QQ+ + +D G + F P C
Sbjct: 471 FLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 152/355 (42%), Gaps = 25/355 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V++ G+P ++ L DT S L+W QCLPC+NC+ QS PIF+P+ S T++ C
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144
Query: 67 CRRP--PFRCENGQCVHRINYAGGASASGLVSTETFTFHL---KNKLVCVPGVIFGCSND 121
P F C + + Y + G+++ E F+ ++ + V+FGC +D
Sbjct: 145 YSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHD 204
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
N G GILG FSL+ + FSYC ++L G D
Sbjct: 205 NYGEPLVG--TGILGLGYGEFSLVHRFGKK----FSYCFGSLDDPSYPHNVLVLGDDGAN 258
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIATF 240
D + + + YY++++ ISV + P F + G GG +IDTG T
Sbjct: 259 ILGDTTPLEI---HNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTS 315
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASED-------WEYCYRYDSRFRAYASMTFHFD 293
+ Y+ + ++ F GR + S+D + + D + +TFHF
Sbjct: 316 LVEEAYKPLKNRIEDIFE--GRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFS 373
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
F+ + FC+A++ + NS +GA QQ YDL + F
Sbjct: 374 EGAELSLDVKSLFMKLSPNVFCLAVTPGNLNS-IGATAQQSYNIGYDLEAMEVSF 427
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 159/351 (45%), Gaps = 19/351 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G P +++ DTGS + W QC PC C+ QS PIF+P +S++Y I CD+
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQ 208
Query: 67 CRRPPF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C+ C NG C++ ++Y G+ G +TET T V V GC ++N
Sbjct: 209 CKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL----GSAAVENVAIGCGHNNEGL 264
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
AG+LG S Q+ +T+ FSYCLV R+ +A S L F
Sbjct: 265 FV--GAAGLLGLGGGKLSFPAQVNATS---FSYCLV--NRDSDAVSTLEFNSPLPRNAAT 317
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+R + + YYL L+ ISV + +F + G GG +ID+G T ++
Sbjct: 318 APLMRN-PELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEV 376
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYM 304
Y+ + F + + N ++ CY SR +++F F P
Sbjct: 377 YDALRDAFVKGAKGIPKA---NGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARN 433
Query: 305 YFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y I + G FC A + + + S++G QQQ TR +D+ + F ++C
Sbjct: 434 YLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 166/355 (46%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN---CFNQSAPIFNPNASSTYKRIPCD 63
Y + G P K +L+ DTGS + W QC PC + C+ Q PIF+P +SS+Y + C+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 64 DLICR-RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C+ C + C+++++Y G+ +G ++TET +F N +P + GC +DN
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS---IPNLPIGCGHDN 264
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
G LG SL QLK+++ FSYCLV + +++S L F ++N+
Sbjct: 265 EGLFAGGAGLIGLGGGAI--SLSSQLKASS---FSYCLVNL--DSDSSSTLEF--NSNMP 315
Query: 183 RKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + + DR S+ Y+ + ISV + +P F + +G GG ++D+G I + +
Sbjct: 316 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRL 375
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
YE + F + +S + ++ CY + + ++ F
Sbjct: 376 PSDVYESLREAFVKLTSSLSPAPGISV---FDTCYNFSGQSNVEVPTIAFVLSEGTSLRL 432
Query: 301 PTYMYFI-FQNEGYFCVA-ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G +C+A I S++G++QQQ R YDL + F C
Sbjct: 433 PARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 104/362 (28%), Positives = 169/362 (46%), Gaps = 36/362 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
+ V V FGTP+++ ++ DTGS L W QC PC +C+ Q P F+P SS+Y +PC
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTP 196
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN-RD 124
+C C C++ + Y G+S +G++S +T TF+ +K G FGC N D
Sbjct: 197 VCAAAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFT---GFTFGCGEKNIGD 253
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL-----VYAYREMEATSILRFGKDA 179
F G + G+LG SL Q + G+FSYCL Y + AT +
Sbjct: 254 F---GEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGAT---KPTSTV 307
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+Q M + S Y++ L I++ + + P F + GT ++D+G I T
Sbjct: 308 PVQYTAMIKKPQY---PSFYFIELVSINIGGYILPVPPSVFT--KTGT---LLDSGTILT 359
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHF-DRADF 297
++ Y + F FT G + E + CY + + +++F+F D A F
Sbjct: 360 YLPPPAYTSLRDRF--KFTMQG-NKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVF 416
Query: 298 KVEPTYMYFIFQNEGYFCVA-ISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
++ Y IF ++ + ++F R S+VG QQ+ +YD+ + I F+P
Sbjct: 417 DLD-FYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPI 475
Query: 352 NC 353
+C
Sbjct: 476 SC 477
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 158/362 (43%), Gaps = 36/362 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCD-- 63
Y V V G+P++ ++ DTGS L W QC PCV C Q+ P+F+P+AS TYK + C
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 72
Query: 64 ------DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
D P + CV+ +Y + + G +S + T L PG ++G
Sbjct: 73 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTL---PGFVYG 129
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C D+ G AGILG + S+LGQ+ S FSYCL L GK
Sbjct: 130 CGQDSEGLF--GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL----PTRGGGGFLSIGK 183
Query: 178 DANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
A++ K M D S Y+L L I+V +G A + + +ID+G
Sbjct: 184 -ASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------IIDSG 236
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSR-FRAYASMTFHF 292
T I R P V F + F + A + C++ + + ++ + F
Sbjct: 237 ---TVITRLPMS-VYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIF 292
Query: 293 D-RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
AD + P + + +EG C+A + ++ +++G QQQ + +D++T I F
Sbjct: 293 QGGADLNLRPVNV-LLQVDEGLTCLAFAGNNGVAIIGNHQQQTFKVAHDISTARIGFATG 351
Query: 352 NC 353
C
Sbjct: 352 GC 353
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 149/372 (40%), Gaps = 26/372 (6%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYK 58
H Y + + G P + L DTGS LIWTQC C+ C Q P FN ++S ++
Sbjct: 80 HWATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFA 139
Query: 59 RIPCDDLICRRP--PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+PC D C F +G C R+ Y G G + T+ FTF + F
Sbjct: 140 PVPCQDKACAGNYLHFCALDGTCTFRVTYGAGG-IIGFLGTDAFTFQSGGATLA-----F 193
Query: 117 GCSNDNRDFSFD--GNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC + R + D +G++G SL Q T FSYCL + A+S L
Sbjct: 194 GCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQ---TGAKRFSYCLTPYFHNNGASSHLF 250
Query: 175 FGKDANIQRKDMKTIRM-FVDR------SSHYYLSLQDISVADHRIGFAPGTFALR--RN 225
G A++ + M FV+ S+ YYL L I+V + ++ F L+
Sbjct: 251 VGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEE 310
Query: 226 G--TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
G GG +ID+G+ T + YE +M C R
Sbjct: 311 GFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDR 370
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNT 343
++ HF P Y+ + C+AI S++G +QQQ+ ++D+
Sbjct: 371 VVPTLVLHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQSIIGNFQQQNMHILFDVGG 430
Query: 344 GTIQFVPENCAN 355
G + F +C+
Sbjct: 431 GRLSFQNADCST 442
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 156/374 (41%), Gaps = 43/374 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTPS ++ DTGS ++W QC PC C++QS P+F+P SS+Y + C
Sbjct: 139 EYFTKIGV--GTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCA 196
Query: 64 DLICRRPPFRCENG-------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+CR R ++G C++++ Y G+ +G +TET TF + V V
Sbjct: 197 APLCR----RLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGAR---VARVAL 249
Query: 117 GCSNDNRDF-----SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-------YAY 164
GC +DN G G L F Q+ FSYCLV
Sbjct: 250 GCGHDNEGLFVAAAGLLGLGRGSLSFPT-------QISRRYGKSFSYCLVDRTSSSSSGA 302
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL- 222
+S + FG + +R + YY+ L ISV R+ G A L
Sbjct: 303 ASRSRSSTVTFGPPSASAASFTPMVRN-PRMETFYYVQLVGISVGGARVPGVAESDLRLD 361
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR- 281
G GG ++D+G T + R Y + F + G + ++ CY R
Sbjct: 362 PSTGRGGVIVDSGTSVTRLARPSYSALRDAF--RAAAAGLRLSPGGFSLFDTCYDLGGRK 419
Query: 282 FRAYASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVY 339
+++ HF P Y I + G FC A + +D S++G QQQ R V+
Sbjct: 420 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 479
Query: 340 DLNTGTIQFVPENC 353
D + + F P+ C
Sbjct: 480 DGDGQRVGFAPKGC 493
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 154/367 (41%), Gaps = 39/367 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y ++ GTP + + +WTQC PC CF Q P+FN +ASSTY+ PC
Sbjct: 27 LYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTA 86
Query: 66 ICRR-PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
+C P C +G C + + G SG+ T+TF + FGC+ D+
Sbjct: 87 LCESVPASTCSGDGVCSYEVETMFG-DTSGIGGTDTFAIGTATA-----SLAFGCAMDSN 140
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
G +G++G +P+SL+GQ+ +TA FSYCL + S L G A +
Sbjct: 141 IKQLLGA-SGVVGLGRTPWSLVGQMNATA---FSYCLA-PHGAAGKKSALLLGASAKLAG 195
Query: 184 KDMKTIRMFV---DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
V D SS Y + L+ I D I P NG+ ++DT +F
Sbjct: 196 GKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPP-------NGS-VVLVDTIFGVSF 247
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS------MTFHFDR 294
+ ++ + + + G M ++ ++ C+ + S + F
Sbjct: 248 LVDAAFQAIKKAVT---VAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQG 304
Query: 295 ADFKVEPTYMYFIFQNEGYFCVA------ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
A P Y G C+A ++ + S++G Q++ F++DL+ T+ F
Sbjct: 305 AAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSF 364
Query: 349 VPENCAN 355
P +C++
Sbjct: 365 EPADCSS 371
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 162/371 (43%), Gaps = 38/371 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP+ ++ DTGS ++W QC PC C++QS +F+P S +Y + C
Sbjct: 141 EYFTKIGV--GTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCS 198
Query: 64 DLICRRPPFRCENG-------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+CR R ++G C++++ Y G+ +G +TET TF + V +
Sbjct: 199 APLCR----RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGAR---VARIAL 251
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV---YAYREMEATSIL 173
GC +DN AG+LG S Q+ FSYCLV + +S +
Sbjct: 252 GCGHDNEGLFV--AAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTV 309
Query: 174 RFGKDANIQ------RKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL-RRN 225
FG A +K RM + YY+ L ISV R+ G A L +
Sbjct: 310 TFGSGAVGSTVAASFTPMVKNPRM----ETFYYVQLVGISVGGARVSGVADSDLRLDPSS 365
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FRA 284
G GG ++D+G T + R Y + F + G + ++ CY R
Sbjct: 366 GRGGVIVDSGTSVTRLARPAYSALRDAF--RAAAAGLRLSPGGFSLFDTCYDLSGRKVVK 423
Query: 285 YASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLN 342
+++ HF P Y I ++G FC A + +D S++G QQQ R V+D +
Sbjct: 424 VPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGD 483
Query: 343 TGTIQFVPENC 353
+ FVP+ C
Sbjct: 484 GQRVGFVPKGC 494
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 152/358 (42%), Gaps = 29/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + G+P K ++ DTGS L W QC PCV C +Q P+F P+AS+TY+ + C
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSS 179
Query: 66 ICR-------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P +G CV+ +Y + + G +S + T L P +GC
Sbjct: 180 ECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTL---PSFTYGC 236
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AGI+G + S+L QL FSYCL + L GK
Sbjct: 237 GQDNEGLF--GKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTS--TSSGGGFLSIGKI 292
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ K IR S Y+L L I+VA +G A + + +ID+G +
Sbjct: 293 SPSSYKFTPMIRN-SQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------IIDSGTVV 345
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS---MTFHFDRA 295
T + Y + F + + R A + C++ + + A M F A
Sbjct: 346 TRLPISIYAALREAFVKIMSR--RYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQ-GGA 402
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
D + + I ++G C+A + S++ +++G QQQ YD++ I F P C
Sbjct: 403 DLSLRAPNI-LIEADKGIACLAFASSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 148/356 (41%), Gaps = 20/356 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + ++FDTGS L W QC PC C+ Q P+F+P+ S+TY +PC
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQE 197
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV---IFGCSNDN 122
CRR C +G+C + + Y + G ++ +T T + + +FGC +D
Sbjct: 198 CRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDD- 256
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
D G G+ G SL Q + FSYCL + A L G A
Sbjct: 257 -DTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSS---STAEGYLSLGSAAPPN 312
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + D S YYL+L I VA + +P F T G +ID+G + T +
Sbjct: 313 AR-FTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFR-----TPGTVIDSGTVITRLP 366
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEP 301
Y + F + +R A + CY + R + S+ FD
Sbjct: 367 SRAYAALRSSFAGLMRRYSYKRAP-ALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLG 425
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
N+ C+A + + + +++G QQ+ VYD+ I F + C+
Sbjct: 426 FGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 156/374 (41%), Gaps = 33/374 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-APIFNPNASSTYKRIPCDDL 65
Y VD+ G P +S L+ DTGS L+W +C C NC + S A +F P SST+ C D
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 142
Query: 66 ICRRPP-----FRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVI 115
+CR P RC + + C + YA G+ SGL + ET + + K + V
Sbjct: 143 VCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVA 202
Query: 116 FGC-----SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
FGC SF+G G++G P S QL FSYCL+ T
Sbjct: 203 FGCGFRISGQSVSGTSFNG-ANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPT 261
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
S L G + K T + S + YY+ L+ + V ++ P + + +G GG
Sbjct: 262 SYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGG 321
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE---DWEYCYRYDSRFRA-- 284
++D+G F+ Y +V+ + R ++ NA E ++ C +
Sbjct: 322 TVMDSGTTLAFLADPAYRLVIAAVKQ------RIKLPNADELTPGFDLCVNVSGVTKPEK 375
Query: 285 -YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYD 340
+ F F V P YFI E C+AI D SV+G QQ F +D
Sbjct: 376 ILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFD 435
Query: 341 LNTGTIQFVPENCA 354
+ + F CA
Sbjct: 436 RDRSRLGFSRRGCA 449
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 152/362 (41%), Gaps = 37/362 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP K LLFDTGS L WTQC PC CF Q+ F+P S++YK + C
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSE 191
Query: 66 ICRRPPFRCENG-----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C+ G C++ + Y G + G ++TET T + + GC
Sbjct: 192 PCKSIGKESAQGCSSSNSCLYGVKYGTGYTV-GFLATETLTITPSDVF---ENFVIGCGE 247
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N F G AG+LG SP +L Q ST + LFSYCL + +T L FG +
Sbjct: 248 RNGG-RFSGT-AGLLGLGRSPVALPSQTSSTYKNLFSYCLPAS---SSSTGHLSFGGGVS 302
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K + Y L + ISV ++ P F T G +ID+G T+
Sbjct: 303 QAAKFTPITSKIPEL---YGLDVSGISVGGRKLPIDPSVFR-----TAGTIIDSGTTLTY 354
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM----TFHFDRAD 296
+ + + F E T++ + + + CY + ++ F +
Sbjct: 355 LPSTAHSALSSAFQEMMTNY---TLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVE 411
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
++ + + FI N G V ++F D ++ G QQ+ VYD+ G + F P
Sbjct: 412 VDIDDSGI-FIAAN-GLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPG 469
Query: 352 NC 353
C
Sbjct: 470 GC 471
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 165/366 (45%), Gaps = 47/366 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA----PIFNPNASSTYKRIPC 62
YTV + GTP + L+ DT S L WTQC N FN +A P+F+P SS++ + C
Sbjct: 91 YTVTIGIGTPPQLHTLIADTASDLTWTQC----NLFNDTAKQVEPLFDPAKSSSFAFVTC 146
Query: 63 DDLICRRP---PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C RC N C + Y A+G+++ E+FT N+ +C+ FGC
Sbjct: 147 SSKLCTEDNPGTKRCSNKTCRYVYPYV-SVEAAGVLAYESFTLSDNNQHICM-SFGFGCG 204
Query: 120 NDNRDFSFDGNI---AGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRF 175
DGN+ +GILG S + S++ QL A FSYCL Y R+ +S L F
Sbjct: 205 ALT-----DGNLLGASGILGMSPAILSMVSQL---AIPKFSYCLTPYTDRK---SSPLFF 253
Query: 176 GKDANIQR-KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG---GCM 231
G A++ R K I+ + + +YY+ L +S+ R+ TFAL++ GT GC
Sbjct: 254 GAWADLGRYKTTGPIQKSL--TFYYYVPLVGLSLGTRRLDVPAATFALKQGGTVVDLGCT 311
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMT-- 289
+ A F E V+ + T + +D++ C+ S A T
Sbjct: 312 VGQLAEPAFTAL--KEAVLHTLNLPLT-------NRTVKDYKVCFALPSGVAMGAVQTPP 362
Query: 290 --FHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
+FD V P YF G C+A+ S++G QQQ+ ++D++
Sbjct: 363 LVLYFDGGADMVLPRDNYFQEPTAGLMCLALVPGGGMSIIGNVQQQNFHLLFDVHDSKFL 422
Query: 348 FVPENC 353
F P C
Sbjct: 423 FAPTIC 428
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 161/364 (44%), Gaps = 24/364 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V G+P + L+ DTGS +IW QC PC +C+ Q P+F+P S+++ +PC+ +
Sbjct: 123 YLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVPCNSGV 182
Query: 67 CRRPP------FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
CR G+C ++++Y + +G+++ ET T + V GV GC +
Sbjct: 183 CRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTE---VQGVAMGCGH 239
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGK-D 178
+NR + AG+LG P SL+GQL A G FSYCL Y E + L G+ D
Sbjct: 240 ENRGLFAEA--AGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVLGRED 297
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
A + D S YY+ + + VA R+ G F L +G GG ++DTG
Sbjct: 298 AAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVMDTGTAV 357
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--------RYDSRFRAYASMTF 290
T + Y + F F G R S ++ CY R + +
Sbjct: 358 TRLPAEAYAALRGAFAGAFEE-GAPRAPGVSL-FDTCYDLSGYASVRVPTVALYFGGGGQ 415
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVA-ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
+ A + + + G +C+A + + S++G QQQ D +G + F
Sbjct: 416 GQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSASGYVGFG 475
Query: 350 PENC 353
P C
Sbjct: 476 PATC 479
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 156/360 (43%), Gaps = 29/360 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP+ + ++ D+GS L W QC PC V+C Q+ P+++P ASSTY +PC
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAP 167
Query: 66 ICRR------PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P C +G C ++ +Y G+ + G +S +T + PG +GC
Sbjct: 168 QCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSF---PGFYYGC 224
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + F+YCL + L FG +
Sbjct: 225 GQDN--VGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCL--PTSAAASAGYLSFGSN 280
Query: 179 ANIQRKDMKTIRMFVDRS---SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
++ + + V S S Y++SL +SVA + + G+ +ID+G
Sbjct: 281 SDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEY-----GSLPTIIDSG 335
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF-DR 294
T I R P V + + A + C++ ++ F
Sbjct: 336 ---TVITRLPTP-VYTALSKAVGAALAAPSAPAYSILQTCFKGQVAKLPVPAVNMAFAGG 391
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
A ++ P + + NE C+A + +D +++G QQQ VYD+ I F C+
Sbjct: 392 ATLRLTPGNV-LVDVNETTTCLAFAPTDSTAIIGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 156/362 (43%), Gaps = 28/362 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NYF ++ GTP+ + DTGS W QC PC +C+ Q +F+P+ SSTY I C
Sbjct: 133 NYFTSLR--LGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCS 190
Query: 64 DLICRR----PPFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C+ C + +C + I YA + G ++ +T T + VPG +FGC
Sbjct: 191 SRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDA---VPGFVFGC 247
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
++N SF G I G+LG SL Q+ + FSYCL AT L F
Sbjct: 248 GHNNAG-SF-GEIDGLLGLGRGKASLSSQVAARYGAGFSYCLP---SSPSATGYLSFSGA 302
Query: 179 ANIQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
A + + M + S YYL+L I+VA I P FA G +ID+G
Sbjct: 303 AAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFAT----AAGTIIDSGTA 358
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR--FRAYASMTFHFDRA 295
+ + Y + ++ GR + +S ++ CY R + D A
Sbjct: 359 FSCLPPSAYAALRSSVR---SAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGA 415
Query: 296 DFKVEPTYMYFIFQNEGYFCVA-ISFSDRNS--VVGAWQQQDTRFVYDLNTGTIQFVPEN 352
+ P+ + + + N C+A + D S V+G QQ+ +YD++ + F
Sbjct: 416 TVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANG 475
Query: 353 CA 354
CA
Sbjct: 476 CA 477
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP+ ++ L DTGS + W QC PC + C++Q P+F+P SS+Y +PC
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 201
Query: 65 LICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C + C GQC + ++Y G++ +G+ S++T T N L G +FGC +
Sbjct: 202 ASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALK---GFLFGCGHA 258
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ F G + G+LG SL+ Q ST G+FSYCL + + G ++
Sbjct: 259 QQGL-FAG-VDGLLGLGRQGQSLVSQASSTYGGVFSYCLP---PTQNSVGYISLGGPSST 313
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + ++Y + L ISV + FA G ++DTG + T +
Sbjct: 314 AGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------SGAVVDTGTVVTRL 367
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRA-DF 297
Y + F +G A+ + CY RY + S+ F A D
Sbjct: 368 PPTAYSALRSAFRAAMAPYGYPSAP-ATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDL 426
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSD-RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
G A + D + S++G QQ+ +D + T+ F+P +C
Sbjct: 427 GTS------GILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFDGS--TVGFMPASC 475
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 151/368 (41%), Gaps = 50/368 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + FGTPS + LL DTGS + W QC PC C+ Q P+F+P+ SSTY I C+
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNT 190
Query: 65 LICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI--- 115
CR+ NG QC + + YA G+ + G+ S ET T PG+
Sbjct: 191 DACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTL--------APGITVED 242
Query: 116 --FGCSNDNRDFS--FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
FGC D R S +D G+LG +P SL+ Q S G FSYCL A
Sbjct: 243 FHFGCGRDQRGPSDKYD----GLLGLGGAPVSLVVQTSSVYGGAFSYCLP-ALNSEAGFL 297
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
+L N +R ++ Y +++ ISV + F GG +
Sbjct: 298 VLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAF------RGGMI 351
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH 291
ID+G + T + Y + + ++ S+D++ CY F Y+++T
Sbjct: 352 IDSGTVDTELPETAYNALEAALRKALKAYPLV----PSDDFDTCY----NFTGYSNIT-- 401
Query: 292 FDRADFKVEPTYMYFIFQNEGYF---CVAISFS---DRNSVVGAWQQQDTRFVYDLNTGT 345
R F + G C+A S D ++G Q+ +YD G
Sbjct: 402 VPRVAFTFSGGATIDLDVPNGILVNDCLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRGN 461
Query: 346 IQFVPENC 353
+ F C
Sbjct: 462 VGFRAGAC 469
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 158/364 (43%), Gaps = 47/364 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQ-SAPIFNPNASSTYKRIPCDD 64
+ V+ G P + + DTGS L+W QC PC +C Q P+F+P+ SSTY + C +
Sbjct: 101 LFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKN 160
Query: 65 LICR-RPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSND 121
+ICR P C+ + QCV+ Y G + G+++TE F ++ V V+FGCS+
Sbjct: 161 IICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHR 220
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV------YAYREMEATSILRF 175
N ++ D G+ G S++ Q+ S FSYC+ Y+Y ++ +L
Sbjct: 221 NGNYK-DRRFTGVFGLGSGITSVVNQMGSK----FSYCIGNIADPDYSYNQL----VLSE 271
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
G + + T VD HY + L+ ISV + R+ P F R +ID+G
Sbjct: 272 G----VNMEGYSTPLDVVD--GHYQVILEGISVGETRLVIDPSAFK-RTEKQRRVIIDSG 324
Query: 236 AIATFIQRGPYEVVMRH----FDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMT 289
T++ Y + R D T F R+ CY + + ++T
Sbjct: 325 TAPTWLAENEYRALEREVRNLLDRFLTPFMRESF--------LCYKGKVGQDLVGFPAVT 376
Query: 290 FHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
FHF V T M Q Y F D SV+G QQ YDLN + F
Sbjct: 377 FHFAEGADLVVDTEMR---QASVY---GKDFKDF-SVIGLMAQQYYNVAYDLNKHKLFFQ 429
Query: 350 PENC 353
+C
Sbjct: 430 RIDC 433
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP+ ++ L DTGS + W QC PC + C++Q P+F+P SS+Y +PC
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 190
Query: 65 LICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C + C GQC + ++Y G++ +G+ S++T T N L G +FGC +
Sbjct: 191 ASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALK---GFLFGCGHA 247
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ F G + G+LG SL+ Q ST G+FSYCL + + G ++
Sbjct: 248 QQGL-FAG-VDGLLGLGRQGQSLVSQASSTYGGVFSYCLP---PTQNSVGYISLGGPSST 302
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + ++Y + L ISV + FA G ++DTG + T +
Sbjct: 303 AGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------SGAVVDTGTVVTRL 356
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRA-DF 297
Y + F +G A+ + CY RY + S+ F A D
Sbjct: 357 PPTAYSALRSAFRAAMAPYGYPSAP-ATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDL 415
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSD-RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
G A + D + S++G QQ+ +D + T+ F+P +C
Sbjct: 416 GTS------GILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFDGS--TVGFMPASC 464
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 159/368 (43%), Gaps = 31/368 (8%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP ++ DTGS ++W QC PC C++QS +F+P AS +Y + C
Sbjct: 146 EYFTKIGV--GTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCA 203
Query: 64 DLICRRPPFRCENG-------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+CR R ++G C++++ Y G+ +G +TET TF + VP V
Sbjct: 204 APLCR----RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGAR---VPRVAL 256
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV----YAYREMEATSI 172
GC +DN AG+LG S Q+ FSYCLV + +S
Sbjct: 257 GCGHDNEGLFV--AAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSST 314
Query: 173 LRFGKDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTG 228
+ FG A M + + YY+ L ISV R+ G A L G G
Sbjct: 315 VTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRG 374
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYAS 287
G ++D+G T + R Y + F + G + ++ CY + +
Sbjct: 375 GVIVDSGTSVTRLARPAYAALRDAF--RAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPT 432
Query: 288 MTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGT 345
++ HF P Y I + G FC A + +D S++G QQQ R V+D +
Sbjct: 433 VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQR 492
Query: 346 IQFVPENC 353
+ FVP+ C
Sbjct: 493 LGFVPKGC 500
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 153/375 (40%), Gaps = 34/375 (9%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN---CFNQSAPIFNPNASSTY 57
H Y + L G P + L DTGS LIWTQC C Q P +N + SST+
Sbjct: 78 HLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTF 137
Query: 58 KRIPCDD---LICRRPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG 113
+PC D L C +G C +Y G S G + TE FTF +
Sbjct: 138 AAVPCADSAKLCAANGVHLCGLDGSCTFAASY-GAGSVFGSLGTEAFTFQSGAAKLG--- 193
Query: 114 VIFGCSNDNRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
FGC + R N A G++G SL+ Q +T FSYCL R A+S
Sbjct: 194 --FGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATK---FSYCLTPYLRNHGASSH 248
Query: 173 LRFGKDANIQRKDMKTIRM-FVDR------SSHYYLSLQDISVADHRIGFAPGTFALRRN 225
L G A++ + FV S+ YYL L ISV + ++ F LRR
Sbjct: 249 LFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRV 308
Query: 226 G----TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDS 280
+GG +IDTG+ T + Y + DE R + A + C
Sbjct: 309 AAGYWSGGVIIDTGSPVTSLAEAAYSALS---DEVARQLNRSLVQPPADTGLDLCVARQD 365
Query: 281 RFRAYASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVY 339
+ + FHF AD V Y+ ++ C+ I +V+G +QQQD +Y
Sbjct: 366 VDKVVPVLVFHFGGGADMAVSAG-SYWGPVDKSTACMLIEEGGYETVIGNFQQQDVHLLY 424
Query: 340 DLNTGTIQFVPENCA 354
D+ G + F +C+
Sbjct: 425 DIGKGELSFQTADCS 439
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 108/204 (52%), Gaps = 12/204 (5%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC---RR 69
G PS + + DTGS LIW QCLPC +C+NQ+ PIF+P S TY+ + D IC RR
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 70 PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG-VIFGCSNDNRDFSFD 128
R + C ++ Y G + G +ST+ F F + + G + FGCS+D +
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTK-ARLK 181
Query: 129 GNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKT 188
G+ AG++G + P SL+ QLK FSYC+V + + S + FG A I
Sbjct: 182 GHQAGVVGLNRHPNSLVSQLKVKK---FSYCMVIP-DDHGSGSRMYFGSRAVILGGKTPL 237
Query: 189 IRMFVDRSSHYYLSLQDISVADHR 212
++ SHY+++L+ ISV + +
Sbjct: 238 LK---GDYSHYFVTLKGISVGEEK 258
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 60/112 (53%), Gaps = 6/112 (5%)
Query: 39 VNCFNQSAPIFNPNASSTYKRIPCDDLICRRP-PFRC--ENGQCVHRINYAGGA-SASGL 94
CFNQ+ PIF+P+ SSTY +P D C + + C + C +RI+Y G+ S G
Sbjct: 332 AQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGT 391
Query: 95 VSTETFTFH-LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLL 145
+S + F F + +V V ++FGCS D +F G GI+G + SL+
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGCS-DYTTGTFKGYEVGIVGLNQDSLSLV 442
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 165/366 (45%), Gaps = 36/366 (9%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIP 61
K + V V FG+P+++ +FDTGS L W QC PC +C+ Q P+F+P SS+Y +P
Sbjct: 108 KTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVP 167
Query: 62 CDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C C C CV+ + Y G+S +G+++ ET TF ++ G IFGC
Sbjct: 168 CGTTECAAAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSEFT---GFIFGCGET 224
Query: 122 NR-DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL-----VYAYREMEATSILRF 175
N DF G + G+LG SL Q G+FSYCL Y + AT +
Sbjct: 225 NLGDF---GEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPGYLSIGATPVT-- 279
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
+Q M D S Y++ L I++ + + P F + GT ++D+G
Sbjct: 280 -GQIPVQYTAMVNKP---DYPSFYFIELVSINIGGYVLPVPPSEFT--KTGT---LLDSG 330
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY-ASMTFHF-D 293
I T++ Y + F FT G + E + CY + + ++F+F D
Sbjct: 331 TILTYLPPPAYTALRDRF--KFTMQGSKPAPPYDE-LDTCYDFTGQSGILIPGVSFNFSD 387
Query: 294 RADFKVEPTYMYFIFQNEGYFCVA-ISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQ 347
A F + + F ++ V ++F R SVVG+ Q+ +YD+ I
Sbjct: 388 GAVFNLN-FFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIG 446
Query: 348 FVPENC 353
F+P +C
Sbjct: 447 FIPASC 452
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 149/364 (40%), Gaps = 41/364 (11%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
E + Y V + G+P+ ++++ D+GS ++W QC PC C+NQ+ PIFNP S+++ +
Sbjct: 123 EEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGV 182
Query: 61 PCDDLICRR--PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C +C + C G+C +++ Y G+ G ++ ET T + GC
Sbjct: 183 ACSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITI----GRTVIQDTAIGC 238
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ N L P S +GQL + G F YCLV
Sbjct: 239 GHWNEGMFVGAAGLLGL--GGGPMSFVGQLGAQTGGAFGYCLV----------------- 279
Query: 179 ANIQRKDMKTIRMFVDR------SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
+ M M+V S YY+SL ++V R+ + F L GTGG ++
Sbjct: 280 ----SRAMPVGAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVM 335
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFH 291
DTG T + Y F T+ R ++ CY + +++F+
Sbjct: 336 DTGTAITRLPTVAYNAFRDAFIAQTTNLPRAP---GVSIFDTCYDLNGFVTVRVPTVSFY 392
Query: 292 FDRADFKVEPTYMYFIFQNE-GYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFV 349
F P + I ++ G FC A + S S++G QQ+ + D G + F
Sbjct: 393 FSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFG 452
Query: 350 PENC 353
P C
Sbjct: 453 PNVC 456
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 140/311 (45%), Gaps = 36/311 (11%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
+K Y + G P + DTGS L+W +C PC C +P+++P S + ++P
Sbjct: 82 QKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLP 141
Query: 62 CDDLICRR-PPFRCENGQCV---------HRINYAGGASASGLVSTETFTF---HLKNKL 108
C +C+ R + QC + ++G S G++ TETFTF ++ N
Sbjct: 142 CSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANN- 200
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
V FG S D D S G AG++G SL+ QL + G F+YCL +
Sbjct: 201 -----VSFGRS-DTIDGSQFGGTAGLVGLGRGHLSLVSQLGA---GRFAYCLAA---DPN 248
Query: 169 ATSILRFGKDANIQRK--DMKTIRMFV----DRSSHYYLSLQDISVADHRIGFAPGTFAL 222
S + FG A + D+ + + DR +HYY++LQ ISV R+ GTFA+
Sbjct: 249 VYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAI 308
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
+G+GG D+GAI T ++ Y+VV + G ++A +D +
Sbjct: 309 NSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLG----YDAGDDTCFVAANQQAV 364
Query: 283 RAYASMTFHFD 293
+ HFD
Sbjct: 365 AQMPPLVLHFD 375
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 146/319 (45%), Gaps = 27/319 (8%)
Query: 54 SSTYKRIPCDDLICRRP------PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNK 107
SST+K + C D ICR EN QC + +Y + +G + +TFTF N
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61
Query: 108 L-VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYRE 166
+ V V + FGC + N F N +GI GF P SL QLK G FSYCL
Sbjct: 62 VPVAVSELAFGCGDYNTGL-FVSNESGIAGFGRGPQSLPSQLKV---GRFSYCLTLVTES 117
Query: 167 MEATSILRFGKDANIQRK----DMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTF 220
+ IL D + R ++ + + + YYLSL+ I+V R+ F F
Sbjct: 118 KSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVF 177
Query: 221 ALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE-YCYRYD 279
AL+++G+GG +ID+G T + +E++ +E F R N E + C+R
Sbjct: 178 ALKKDGSGGTVIDSGTSLTTLPEAVFELLQ---EELVAQFPLPRYDNTPEVGDRLCFRRP 234
Query: 280 SRFR--AYASMTFHFDRADFKVEPTYMYFIFQ-NEGYFCVAISFSDRNSVV--GAWQQQD 334
+ + H AD + P YF+ + + G C+ I+ ++ ++V G +QQQ+
Sbjct: 235 KGGKQVPVPKLILHLAGADMDL-PRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQN 293
Query: 335 TRFVYDLNTGTIQFVPENC 353
VYD+ + F P C
Sbjct: 294 MHVVYDVENNKLLFAPAQC 312
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 23/348 (6%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCV---NCFNQSAPIFNPNASSTYKRIPCDDLICR-R 69
G P + F + DTGS + W QCLPC C+ Q PIF+P SS+Y + CD C+
Sbjct: 4 GQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLL 63
Query: 70 PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDG 129
C C++++ Y G+ G ++TET TF N +P + GC +DN
Sbjct: 64 DEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNS---IPNISIGCGHDNEGLFVGA 120
Query: 130 NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTI 189
+ LG S+ QLK+++ FSYCLV ++++ S + + + +
Sbjct: 121 DGLIGLGGGAI--SISSQLKASS---FSYCLV----DIDSPSFSTLDFNTDPPSDSLISP 171
Query: 190 RMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEV 248
+ DR S Y+ + +SV + + F + +G GG ++D+G T + YEV
Sbjct: 172 LVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEV 231
Query: 249 VMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYMYFI 307
+ F T+ + ++ CY S+ ++ F + P I
Sbjct: 232 LREAFLGLTTNLPPAPEISP---FDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLI 288
Query: 308 -FQNEGYFCVA-ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ G FC+A +S + S++G +QQQ R YDL + F C
Sbjct: 289 QVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 153/364 (42%), Gaps = 26/364 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD GTP + L+ D+GS L+W QC PC C+ Q +P++ P+ SST+ +PC
Sbjct: 64 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSSD 123
Query: 67 CRRPP----FRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P F C+ G C + YA +S+ G+ + E+ T V + V FGC
Sbjct: 124 CLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV----DGVRIDKVAFGCG 179
Query: 120 NDNR-DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+DN+ F+ G G+LG P S Q+ F+YCLV +S L FG +
Sbjct: 180 SDNQGSFAAAG---GVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSSLIFGDE 236
Query: 179 ANIQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
DM+ + + S YY+ ++ ++V + + + + G GG + D+G
Sbjct: 237 LISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFDSGT 296
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRA 295
T+ Y ++ FD + + + C + ++ S T FD
Sbjct: 297 TLTYWFPSAYSHILAAFDSGV----HYPRAESVQGLDLCVELTGVDQPSFPSFTIEFDDG 352
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
YF+ C+A++ + +G QQ+ YD I F P
Sbjct: 353 AVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLIGFAPA 412
Query: 352 NCAN 355
C++
Sbjct: 413 KCSS 416
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 157/385 (40%), Gaps = 49/385 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP DT S LIWTQC PC C++Q P+FNP SSTY +PC
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 67 CRRPPF-RC---ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C RC ++ C + Y+G A+ G ++ + GV FGCS +
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGED----AFRGVAFGCSTSS 204
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ +G++G P SL+ QL F+YCL + +L G DA+
Sbjct: 205 TGGAPPPQASGVVGLGRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVL--GADADAA 259
Query: 183 RKDMKTIRMFVDR----SSHYYLSLQDISVADHRIGF---------------------AP 217
R I + + R S+YYL+L + + D + +P
Sbjct: 260 RNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSP 319
Query: 218 GTFALRRNGTG--GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYC 275
A+ G +ID + TF++ Y+ ++ + R +S + C
Sbjct: 320 NATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIR---LPRGTGSSLGLDLC 376
Query: 276 YRYDSRF---RAYA-SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSV--VGA 329
+ R Y ++ FD +++ ++ + G C+ + ++ SV +G
Sbjct: 377 FILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGN 436
Query: 330 WQQQDTRFVYDLNTGTIQFVPENCA 354
+QQQ+ + +Y+L G + FV C
Sbjct: 437 FQQQNMQVLYNLRRGRVTFVQSPCG 461
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 157/385 (40%), Gaps = 49/385 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP DT S LIWTQC PC C++Q P+FNP SSTY +PC
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 67 CRRPPF-RC---ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C RC ++ C + Y+G A+ G ++ + GV FGCS +
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGED----AFRGVAFGCSTSS 204
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ +G++G P SL+ QL F+YCL + +L G DA+
Sbjct: 205 TGGAPPPQASGVVGLGRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVL--GADADAA 259
Query: 183 RKDMKTIRMFVDR----SSHYYLSLQDISVADHRIGF---------------------AP 217
R I + + R S+YYL+L + + D + +P
Sbjct: 260 RNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSP 319
Query: 218 GTFALRRNGTG--GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYC 275
A+ G +ID + TF++ Y+ ++ + R +S + C
Sbjct: 320 NATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIR---LPRGTGSSLGLDLC 376
Query: 276 YRYDSRF---RAYA-SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSV--VGA 329
+ R Y ++ FD +++ ++ + G C+ + ++ SV +G
Sbjct: 377 FILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGN 436
Query: 330 WQQQDTRFVYDLNTGTIQFVPENCA 354
+QQQ+ + +Y+L G + FV C
Sbjct: 437 FQQQNMQVLYNLRRGRVTFVQSPCG 461
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 154/372 (41%), Gaps = 52/372 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTPS S+ LL DTGS L W QC PC C+ Q P+F+P+ SSTY IPC+
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNT 183
Query: 65 LICRRPP-----FRCENG----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
CR C +G QC I Y G+ G+ S ET L PGV
Sbjct: 184 DACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNET--------LALAPGVA 235
Query: 116 -----FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
FGC +D +D + D G+LG +P SL+ Q S G FSYCL ++
Sbjct: 236 VKDFRFGCGHD-QDGAND-KYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFL 293
Query: 171 SILRFGKDAN--IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
++ G + + M + + Y +++ I+V I P F +G
Sbjct: 294 ALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAF------SG 347
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR---YDSRFRAY 285
G +ID+G + T +Q Y + F + ++ R + + + CY Y +
Sbjct: 348 GMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVR----NGELDTCYDFSGYSNVTLPK 403
Query: 286 ASMTFHFDRA-DFKVEPTYMY---FIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDL 341
++TF D V + FQ G D+ ++G Q+ +YD
Sbjct: 404 VALTFSGGATIDLDVPNGILLDDCLAFQESGP-------DDQPGILGNVNQRTLEVLYDA 456
Query: 342 NTGTIQFVPENC 353
G + F C
Sbjct: 457 GRGRVGFRAAVC 468
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 152/371 (40%), Gaps = 28/371 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+ L DT S L W QC PC C+ QS P+F+P S++Y + D
Sbjct: 141 YIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPD 200
Query: 67 C----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI------F 116
C R + G C++ + Y G G ST T L + + G +
Sbjct: 201 CQALGRSGGGDAKRGTCIYTVLYGDG---DGHGSTSTSVGDLVEETLTFAGGVRQAYLSI 257
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA-QGLFSYCLV-YAYREMEATSILR 174
GC +DN+ F AGILG S S+ Q+ FSYCLV + +S L
Sbjct: 258 GCGHDNKGL-FGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLT 316
Query: 175 FGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRI-GFAPGTFALRR-NGTGGC 230
FG A ++++ + YY+ L +SV R+ G L G GG
Sbjct: 317 FGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGGV 376
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-----AY 285
++D+G T + R Y F T G+ S ++ CY R
Sbjct: 377 ILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKV 436
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFS-DRN-SVVGAWQQQDTRFVYDLN 342
+++ HF + ++P + G C A + + DR+ SV+G QQ R VYD+
Sbjct: 437 PAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDIG 496
Query: 343 TGTIQFVPENC 353
+ F P +C
Sbjct: 497 GQRVGFAPNSC 507
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 165/355 (46%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN---CFNQSAPIFNPNASSTYKRIPCD 63
Y + G P K +L+ DTGS + W QC PC + C+ Q PIF+P +SS+Y + C+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 64 DLICR-RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C+ C + C+++++Y G+ +G ++TET +F N +P + GC +DN
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS---IPNLPIGCGHDN 264
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
G LG SL QLK+++ FSYCLV + +++S L F ++ +
Sbjct: 265 EGLFAGGAGLIGLGGGAI--SLSSQLKASS---FSYCLVNL--DSDSSSTLEF--NSYMP 315
Query: 183 RKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + + DR S+ Y+ + ISV + +P F + +G GG ++D+G I + +
Sbjct: 316 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRL 375
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE 300
YE + F + +S + ++ CY + + ++ F
Sbjct: 376 PSDVYESLREAFVKLTSSLSPAPGISV---FDTCYNFSGQSNVEVPTIAFVLSEGTSLRL 432
Query: 301 PTYMYFI-FQNEGYFCVA-ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G +C+A I S++G++QQQ R YDL + F C
Sbjct: 433 PARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 161/361 (44%), Gaps = 32/361 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN---CFNQSAPIFNPNASSTYKRIPCD 63
+ V V GTP++ L+FDTGS L W QC PC + C Q P+F+P+ SSTY + C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+ C C +N C++ ++Y G+S +G++S +T L P FGC
Sbjct: 209 EPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRALAGFP---FGCGTR 265
Query: 122 NR-DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N DF G + G+LG SL Q ++ +FSYCL + T L G
Sbjct: 266 NLGDF---GRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSN---STTGYLTIGATPA 319
Query: 181 IQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ M S Y++ L I + + + P F GG ++D+G +
Sbjct: 320 TDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGGTLLDSGTVL 374
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHF-DRAD 296
T++ YE++ F + R ++ + CY + +++F F D A
Sbjct: 375 TYLPAQAYELLRDRFR---LTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAV 431
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
F+++ + IF +E C+A + D S++G QQ+ +YD+ I FVP +
Sbjct: 432 FELD-FFGVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPAS 490
Query: 353 C 353
C
Sbjct: 491 C 491
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 157/372 (42%), Gaps = 40/372 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP+ ++ DTGS ++W QC PC C+ QS +F+P S +Y + C
Sbjct: 139 EYFTKIGV--GTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCA 196
Query: 64 DLICRR-PPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+CRR C+ C++++ Y G+ +G +TET TF + V V GC +
Sbjct: 197 APLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGAR---VARVALGCGH 253
Query: 121 DNRDF-----SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV---YAYREMEATSI 172
DN G G L F Q+ FSYCLV + +S
Sbjct: 254 DNEGLFVAAAGLLGLGRGSLSFPT-------QISRRYGRSFSYCLVDRTSSANTASRSST 306
Query: 173 LRFGKDANIQ------RKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL-RR 224
+ FG A +K RM + YY+ L ISV R+ G A L
Sbjct: 307 VTFGSGAVGSTVASSFTPMVKNPRM----ETFYYVQLIGISVGGARVPGVANSDLRLDPS 362
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FR 283
+G GG ++D+G T + R Y + F + G + ++ CY R
Sbjct: 363 SGRGGVIVDSGTSVTRLARPAYSALRDAF--RGAAAGLRLSPGGFSLFDTCYDLSGRKVV 420
Query: 284 AYASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDL 341
+++ HF P Y I ++G FC A + +D S++G QQQ R V+D
Sbjct: 421 KVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDG 480
Query: 342 NTGTIQFVPENC 353
+ + F P+ C
Sbjct: 481 DGQRVAFTPKGC 492
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 154/371 (41%), Gaps = 39/371 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y + L G P + L DTGS L+WTQC C+ C Q+ P +N +ASST+ +PC
Sbjct: 90 YVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAA 149
Query: 65 LICRRPPFRCENGQCVHRINYAGGAS---------ASGLVSTETFTFHLKNKLVCVPGVI 115
IC N +H + A G S +G + TE F F +
Sbjct: 150 RIC------AANDDIIHFCDLAAGCSVIAGYGAGVVAGTLGTEAFAFQSGTAELA----- 198
Query: 116 FGCSNDNR--DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
FGC R + G +G++G SL+ Q +T FSYCL + AT L
Sbjct: 199 FGCVTFTRIVQGALHGA-SGLIGLGRGRLSLVSQTGATK---FSYCLTPYFHNNGATGHL 254
Query: 174 RFGKDANIQRKDMKTIRMFV---DRSSHYYLSLQDISVADHRIGFAPGTFALRRNG---- 226
G A++ FV S YYL L ++V + R+ F LR
Sbjct: 255 FVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLF 314
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
+GG +ID+G+ T + Y+ + +A +D C R
Sbjct: 315 SGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDA-DDGALCVARRDVGRVVP 373
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSD---RNSVVGAWQQQDTRFVYDLNT 343
++ FHF P Y+ ++ C+AI+ + R SV+G +QQQ+ R +YDL
Sbjct: 374 AVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLAN 433
Query: 344 GTIQFVPENCA 354
G F P +C+
Sbjct: 434 GDFSFQPADCS 444
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 166/366 (45%), Gaps = 39/366 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ + + GTP + + DTGS L WTQCLPC CFNQS PIFNP SS+Y+++ C
Sbjct: 90 FLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDT 149
Query: 67 CRR-------PPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
CR P + C G ++ G AS ++ +F +P + GC
Sbjct: 150 CRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFK---------LPKTVIGC 200
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
+ N +F G +GI+G SL+ Q+++ A + FSYCL + T + FG
Sbjct: 201 GHQNGG-TFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFG 259
Query: 177 KDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ A + + + + + V RS + Y+L+L+ ISV R A G A+ +G +ID+
Sbjct: 260 RKAVVSGRQVVSTPL-VPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGN--IIIDS 316
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGR----QRMHNASEDWEYCYRYDSRFRA-YASMT 289
G T + R Y V F++ R +R+ + S E CY +T
Sbjct: 317 GTTLTLLPRSLYYGV-------FSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIIT 369
Query: 290 FHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
HF AD K+ P F + C+ + + + ++ G Q + YDL + F
Sbjct: 370 AHFAGGADVKLLPVNT-FAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSF 428
Query: 349 VPENCA 354
P+ CA
Sbjct: 429 EPKLCA 434
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 154/359 (42%), Gaps = 36/359 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+++ L DT + W C CV C S+ +FN S+T+K + C+
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVKSTTFKTVGCEAPQ 152
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P +C C + Y G +S + +S + T + +P FGC +
Sbjct: 153 CKQVPNSKCGGSACAFNMTY-GSSSIAANLSQDVVTLATDS----IPSYTFGCLTEATGS 207
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
S G+LG P SLL Q ++ Q FSYCL ++R + + LR G Q K
Sbjct: 208 SIPPQ--GLLGLGRGPMSLLSQTQNLYQSTFSYCL-PSFRSLNFSGSLRLGPVG--QPKR 262
Query: 186 MKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+KT + + RSS YY++L I V + P A G + D+G + T +
Sbjct: 263 IKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVA 322
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADFKV 299
Y V F R+R+ NA+ ++ CY + ++TF F + +
Sbjct: 323 PAYTAVRDAF--------RKRVGNATVTSLGGFDTCY---TSPIVAPTITFMFSGMNVTL 371
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + C+A++ + N +V+ QQQ+ R ++D+ + E C
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPC 430
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 163/383 (42%), Gaps = 51/383 (13%)
Query: 1 HEKNYFYTV-DVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKR 59
H + Y V + GTP + + D L+WTQC C CF Q P+F PNASST++
Sbjct: 36 HWSRHLYNVANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRP 95
Query: 60 IPCDDLICRRPPF-RCENGQCVH------RINYAGGASASGLVSTETFTFHLKNKLVCVP 112
PC C+ P C C + R++ + G+V TETF
Sbjct: 96 EPCGTDACKSTPTSNCSGDVCTYESTTNIRLDR---HTTLGIVGTETFAIG-----TATA 147
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
+ FGC + + DG +G +G +P SL+ Q+K T FSYCL + R +S
Sbjct: 148 SLAFGCVVASDIDTMDGT-SGFIGLGRTPRSLVAQMKLTK---FSYCL--SPRGTGKSSR 201
Query: 173 LRFGKDANIQRKDMKTIRMFV-----DRSSHYY-LSLQDISVADHRIGFAPGTFALRRNG 226
L G A + + + F+ D S HYY LSL I + I A
Sbjct: 202 LFLGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQ--------- 252
Query: 227 TGGCMI-DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
+GG ++ T + + + Y + E M + ++ C++ + F RA
Sbjct: 253 SGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRA 312
Query: 285 YAS---MTFHFDRADFKVEPT-YMYFIFQNEGYFCVAI-SFSDRN-------SVVGAWQQ 332
A TF A V P Y+ + + + C AI S + N SV+G+ QQ
Sbjct: 313 TAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQ 372
Query: 333 QDTRFVYDLNTGTIQFVPENCAN 355
++ F+YDL T+ F P +C++
Sbjct: 373 ENVHFLYDLKKETLSFEPADCSS 395
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 161/370 (43%), Gaps = 45/370 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV + G + ++ DT S L W QC PC +C +Q P+F+P +S +Y +PC+
Sbjct: 125 NYVATVGLGGGEAT----VIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCN 180
Query: 64 DLIC----------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG 113
C E C + ++Y G+ + G+++ + + + + G
Sbjct: 181 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE----VIDG 236
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
+FGC N+ F G +G++G S SL+ Q G+FSYCL +E E++ L
Sbjct: 237 FVFGCGTSNQG-PF-GGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL--PLKESESSGSL 292
Query: 174 RFGKDANIQRKDMKTI--RMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
G D ++ R + M D + Y+++L I++ + + G
Sbjct: 293 VLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV----------ESSAGK 342
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASM 288
++D+G I T + Y V F F + + + + C+ R S+
Sbjct: 343 VIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI---LDTCFNLTGFREVQIPSL 399
Query: 289 TFHFD-RADFKVEPT-YMYFIFQNEGYFCVAISFSD---RNSVVGAWQQQDTRFVYDLNT 343
F F+ + +V+ + +YF+ + C+A++ S++G +QQ++ R ++D
Sbjct: 400 KFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLG 459
Query: 344 GTIQFVPENC 353
I F E C
Sbjct: 460 SQIGFAQETC 469
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 161/370 (43%), Gaps = 45/370 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV + G + ++ DT S L W QC PC +C +Q P+F+P +S +Y +PC+
Sbjct: 126 NYVATVGLGGGEAT----VIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCN 181
Query: 64 DLIC----------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG 113
C E C + ++Y G+ + G+++ + + + + G
Sbjct: 182 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE----VIDG 237
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
+FGC N+ F G +G++G S SL+ Q G+FSYCL +E E++ L
Sbjct: 238 FVFGCGTSNQG-PF-GGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL--PLKESESSGSL 293
Query: 174 RFGKDANIQRKDMKTI--RMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
G D ++ R + M D + Y+++L I++ + + G
Sbjct: 294 VLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV----------ESSAGK 343
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASM 288
++D+G I T + Y V F F + + + + C+ R S+
Sbjct: 344 VIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI---LDTCFNLTGFREVQIPSL 400
Query: 289 TFHFD-RADFKVEPT-YMYFIFQNEGYFCVAISFSD---RNSVVGAWQQQDTRFVYDLNT 343
F F+ + +V+ + +YF+ + C+A++ S++G +QQ++ R ++D
Sbjct: 401 KFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLG 460
Query: 344 GTIQFVPENC 353
I F E C
Sbjct: 461 SQIGFAQETC 470
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 110/377 (29%), Positives = 157/377 (41%), Gaps = 41/377 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC----LPCVNCFNQSAPIFNPNASST 56
H Y L G+P + L DTGS LIWTQC LP +C Q P +N + SST
Sbjct: 80 HRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLP-KSCAKQGLPYYNLSQSST 138
Query: 57 YKRIPCDDLICRRPPFRCENG--------QCVHRINYAGGASASGLVSTETFTFHLKNKL 108
+ +PC D + F NG C +Y G G + TE+F F
Sbjct: 139 FVPVPCAD----KAGFCAANGVHLCGLDGSCTFIASYGAG-RVIGSLGTESFAFESGTT- 192
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM 167
+ FGC + R S N A G++G SL+ Q+ +T FSYCL +
Sbjct: 193 ----SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATR---FSYCLTPYFHSS 245
Query: 168 EATSIL--RFGKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRI-GFAPGTFAL 222
A+S L M ++ D S+ YYL L+ I+V R+ TF L
Sbjct: 246 GASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQL 305
Query: 223 RR--NG--TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCY 276
R+ G GG +IDTG+ T + YE + +E G + A ED E C
Sbjct: 306 RQLFKGYWAGGVIIDTGSPLTQLASHAYEALK---EEVAAQLGNGSLVPAPEDSGLELCV 362
Query: 277 RYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTR 336
+ + ++ FHF P Y+ ++ C+ I +S++G +QQQD
Sbjct: 363 AREGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYDSIIGNFQQQDMH 422
Query: 337 FVYDLNTGTIQFVPENC 353
+YDL G F +C
Sbjct: 423 LLYDLRRGRFSFQTADC 439
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 151/376 (40%), Gaps = 34/376 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-FNQSAPIFNPNASSTYKRIPCDDL 65
Y VD+ GTP +S L+ DTGS L+W +C C NC + + F P SS++ C D
Sbjct: 88 YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 147
Query: 66 ICRRPPFRCE--------NGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPGVIF 116
CR P + C +YA G+ +SG S ET T L + + G+ F
Sbjct: 148 HCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSF 207
Query: 117 GCSNDNRDFSFDG----NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
GC S G G++G S QL FSYCL+ TS
Sbjct: 208 GCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSF 267
Query: 173 LRFGKD------ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
L G N + +++ + YY+++ I++ ++ P + + G
Sbjct: 268 LMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQG 327
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE---DWEYCYRY--DSR 281
GG ++D+G T++ + YE V++ R ++ NA+E ++ C +SR
Sbjct: 328 NGGTVVDSGTTLTYLTKTAYEEVLKSVRR------RVKLPNAAELTPGFDLCVNASGESR 381
Query: 282 FRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFV 338
+ + F P YF+ EG C+AI + SV+G QQ
Sbjct: 382 RPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLE 441
Query: 339 YDLNTGTIQFVPENCA 354
+D + F C
Sbjct: 442 FDKEESRLGFTRRGCG 457
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 148/349 (42%), Gaps = 24/349 (6%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPFRCE--NGQ 78
++ DTGS ++W QC PC C+ QS P+F+P SS+Y + C +CRR C+ G
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 79 CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFS 138
C++++ Y G+ +G TET TF + V V GC +DN LG
Sbjct: 61 CMYQVAYGDGSVTAGDFVTETLTFAGGAR---VARVALGCGHDNEGLFVAAAGLLGLGRG 117
Query: 139 VSPFSLLGQLKSTAQGLFSYCLV-------YAYREMEATSILRFGKDANIQRKDMKTIRM 191
F Q+ FSYCLV A +S + FG + T +
Sbjct: 118 GLSFPT--QISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMV 175
Query: 192 FVDR-SSHYYLSLQDISVADHRI-GFAPGTFAL-RRNGTGGCMIDTGAIATFIQRGPYEV 248
R + YY+ L ISV R+ G A L G GG ++D+G T + R Y
Sbjct: 176 RNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSA 235
Query: 249 VMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDS-RFRAYASMTFHFDRADFKVEPTYMYF 306
+ F + G R+ ++ CY R +++ HF P Y
Sbjct: 236 LRDAF--RAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYL 293
Query: 307 I-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
I + G FC A + +D S++G QQQ R V+D + + F P+ C
Sbjct: 294 IPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 159/363 (43%), Gaps = 33/363 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP + L+FDTGS L WTQC PC +C+ Q IF+P+ SS+Y I C
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSS 105
Query: 66 ICRR---PPFRCE-----NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
+C + + E + C++ Y +++ G +S E T + V +FG
Sbjct: 106 LCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATD---IVDDFLFG 162
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C DN F+G+ AG++G P S++ Q S +FSYCL + L FG
Sbjct: 163 CGQDNEGL-FNGS-AGLMGLGRHPISIVQQTSSNYNKIFSYCLPATSSSLGH---LTFGA 217
Query: 178 DANIQRKDMKT-IRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTGGCMIDTG 235
A + T + +S Y L + ISV ++ + TF+ GG +ID+G
Sbjct: 218 SAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFS-----AGGSIIDSG 272
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-R 294
+ T + Y + F + + N + + CY S ++ + F+
Sbjct: 273 TVITRLAPTVYAALRSAFRRXMEKY---PVANEAGLLDTCYDL-SGYKEISVPRIDFEFS 328
Query: 295 ADFKVEPTYMYFI-FQNEGYFCVAISF--SDRN-SVVGAWQQQDTRFVYDLNTGTIQFVP 350
VE + + ++E C+A + SD + +V G QQ+ VYD+ G I F
Sbjct: 329 GGVTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGA 388
Query: 351 ENC 353
C
Sbjct: 389 AGC 391
>gi|326518194|dbj|BAK07349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 435
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 148/369 (40%), Gaps = 22/369 (5%)
Query: 6 FYTVDVLFGTPSKSEF--LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
Y V V G+ F L D L W QC PCV Q +F S YK
Sbjct: 66 LYGVLVGVGSGQTRHFYKLGLDLVGNLTWIQCQPCVPEVRQEGAVFKSAVSPRYKDTKAT 125
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH----LKNKLVCVPGVIFGCS 119
D C PP+ G + +A G + ++ F F V + FGC+
Sbjct: 126 DPKCT-PPYTPSVGNRCSFYTTSWNVAAHGYLGSDMFGFAGSPGTGGHGTDVDKLTFGCA 184
Query: 120 NDNRDFSF--DGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEAT-SILR 174
+ F G +AG L S P S L QL + A FSYCL A LR
Sbjct: 185 HTTDGFERLNHGVLAGALSLSRHPTSFLSQLTARRLADSRFSYCLFPGQSHPNARHGFLR 244
Query: 175 FGKDANIQRKDMKTIRMFVDRSS--HYYLSLQDISVADHRI-GFAPGTFALRRNGT---G 228
FG+D T +F R S YY+ + IS+ RI G P F RRN G
Sbjct: 245 FGRDIPRHDHAHSTSLLFTGRGSGSMYYIGVTSISLNGKRIIGLQPAFF--RRNPQTRRG 302
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-RYDSRFRAYAS 287
G ++D G T + R Y +V + + G +R + C+ + +
Sbjct: 303 GSVVDPGTPLTRLVREAYNIVEAELVAYMQTQGSRRAPAPVQGHRLCFVSWGHAHLPSMT 362
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
+ + DRA ++P ++ +E + C + + +V+GA QQ DTRF +DL+ +
Sbjct: 363 INMNEDRAKLFIKPELLFLKVTHE-HLCFLVVPDEEMTVLGAAQQVDTRFTFDLHANRLY 421
Query: 348 FVPENCAND 356
F E+C D
Sbjct: 422 FAQEHCTAD 430
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 157/355 (44%), Gaps = 39/355 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPC 62
NYF V V GTP + L+FDTGS L WTQC PC +C+ Q IF+P+ S++Y I C
Sbjct: 145 NYF--VVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITC 202
Query: 63 DDLICRR--------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
+C + P C++ I Y + + G S E T + V
Sbjct: 203 TSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD---VVDNF 259
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC +N+ F G+ AG++G P S + Q + + +FSYCL +T L
Sbjct: 260 LFGCGQNNQGL-FGGS-AGLIGLGRHPISFVQQTAAKYRKIFSYCLP---STSSSTGHLS 314
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
FG A + SS Y L + I+V ++ + TF+ TGG +ID+
Sbjct: 315 FGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFS-----TGGAIIDS 369
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRAYASMTFHF 292
G + T + Y + F + G + +A E + CY S ++ ++ T F
Sbjct: 370 GTVITRLPPTAYGALRSAFRQ-----GMSKYPSAGELSILDTCYDL-SGYKVFSIPTIEF 423
Query: 293 DRA---DFKVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDL 341
A K+ P + F+ + C+A + + +S + G QQ+ VYD+
Sbjct: 424 SFAGGVTVKLPPQGILFVASTK-QVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 163/371 (43%), Gaps = 39/371 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV G ++ L+ DTGS L W QCLPC C+NQ P+FNP+ SS++ +PC+
Sbjct: 65 NYIVTV----GIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCN 120
Query: 64 DLIC-RRPPFRCENGQCVHR--------INYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
C P +G C ++ I+Y G+ + G + E T +
Sbjct: 121 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE----IDNF 176
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
IFGC +N+ G +G++G + S SL+ Q S +FSYCL + ++ L
Sbjct: 177 IFGCGRNNKGLF--GGASGLMGLARSELSLVSQTSSLFGSVFSYCL--PTTGVGSSGSLT 232
Query: 175 FGKDANIQRKDMKTI---RMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
G K++ I RM + S+ Y+L+L IS+ + L N
Sbjct: 233 LGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP----RLSSNEGVL 288
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASM 288
++D+G + T + Y+ F++ F+ + R C+ ++
Sbjct: 289 SLLDSGTVITRLSPSIYKAFKAEFEKQFSGY---RTTPGFSILNTCFNLTGYEEVNIPTV 345
Query: 289 TFHFD-RADFKVE-PTYMYFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVYDLNT 343
F F+ A+ V+ YF+ + C+A + + D+ ++G +QQ++ R +Y+
Sbjct: 346 KFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKE 405
Query: 344 GTIQFVPENCA 354
+ F E C+
Sbjct: 406 SKVGFAGEPCS 416
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 163/371 (43%), Gaps = 39/371 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV G ++ L+ DTGS L W QCLPC C+NQ P+FNP+ SS++ +PC+
Sbjct: 144 NYIVTV----GIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCN 199
Query: 64 DLIC-RRPPFRCENGQCVHR--------INYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
C P +G C ++ I+Y G+ + G + E T +
Sbjct: 200 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE----IDNF 255
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
IFGC +N+ G +G++G + S SL+ Q S +FSYCL + ++ L
Sbjct: 256 IFGCGRNNKGLF--GGASGLMGLARSELSLVSQTSSLFGSVFSYCL--PTTGVGSSGSLT 311
Query: 175 FGKDANIQRKDMKTI---RMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
G K++ I RM + S+ Y+L+L IS+ + L N
Sbjct: 312 LGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP----RLSSNEGVL 367
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASM 288
++D+G + T + Y+ F++ F+ + R C+ ++
Sbjct: 368 SLLDSGTVITRLSPSIYKAFKAEFEKQFSGY---RTTPGFSILNTCFNLTGYEEVNIPTV 424
Query: 289 TFHFD-RADFKVE-PTYMYFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVYDLNT 343
F F+ A+ V+ YF+ + C+A + + D+ ++G +QQ++ R +Y+
Sbjct: 425 KFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKE 484
Query: 344 GTIQFVPENCA 354
+ F E C+
Sbjct: 485 SKVGFAGEPCS 495
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 158/374 (42%), Gaps = 32/374 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-APIFNPNASSTYKRIPCDDL 65
Y V + GTP ++ L+ DTGS LIW +C PC NC ++S F S+TY I C
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSP 145
Query: 66 ICRRPPFRCEN--------GQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIF 116
C+ P N C ++ YA ++ +G S E T + K+ + G+ F
Sbjct: 146 QCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSF 205
Query: 117 GCSNDNRDFSFDG----NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
GC S G G++G +P S QL FSYCL+ TS
Sbjct: 206 GCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPTSF 265
Query: 173 LRFGKDANI---QRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGT 227
L G N+ ++ M + ++ S YY++++ + V ++ P +++ G
Sbjct: 266 LTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGN 325
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED---WEYCYRYDSRFR- 283
GG +ID+G TFI Y +++ F + R ++ + +E ++ C R
Sbjct: 326 GGTIIDSGTTLTFITEPAYTEILKAFKK------RVKLPSPAEPTPGFDLCMNVSGVTRP 379
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYD 340
A M+F+ P YFI + C+A+ ++ SV+G QQ +D
Sbjct: 380 ALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFD 439
Query: 341 LNTGTIQFVPENCA 354
+ + F CA
Sbjct: 440 RDKSRLGFTRRGCA 453
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 146/363 (40%), Gaps = 39/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP++ ++FDTGS L W QC PC +C+ Q P+F+P SSTY +PC
Sbjct: 146 YVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPE 205
Query: 67 CRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ R + +C + + Y + G ++ +T T + L PG +FGC +D
Sbjct: 206 CQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSDVL---PGFVFGCG--EQD 260
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG--KDANIQ 182
G G++G SL Q S FSYCL A L G AN +
Sbjct: 261 TGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCL---PSSPSAAGYLSLGGPAPANAR 317
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
M+T D S YY+ L + VA + +P F+ G +ID+G + T +
Sbjct: 318 FTAMETRH---DSPSFYYVRLVGVKVAGRTVRVSPIVFS-----AAGTVIDSGTVITRLP 369
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-----------FRAYASMTFH 291
Y + F +G +R A + CY + F A++
Sbjct: 370 PRVYAALRSAFARSMGRYGYKRAP-ALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLD 428
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
F + + + F G A ++G QQ+ VYD+ I F
Sbjct: 429 FSGVLYVAKVSQACLAFAPNGDGADA-------GIIGNTQQKTLAVVYDVARQKIGFGAN 481
Query: 352 NCA 354
C+
Sbjct: 482 GCS 484
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 155/361 (42%), Gaps = 32/361 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP K L+FDTGS + WTQC PCV C+ Q P NP+ S++YK I C
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 178
Query: 66 IC------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C ++ C + C++++ Y G+ + G +TET T N +FGC
Sbjct: 179 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG 235
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N AG+LG + +L Q T + LFSYCL + + L G
Sbjct: 236 QQNNGLFG--GAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS---SSSKGYLSLGGQV 290
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ K + D + Y L + +SV ++ F+ G +ID+G + T
Sbjct: 291 SKSVK-FTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS------AGTVIDSGTVIT 343
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRAD 296
+ Y + F T + ++ ++ CY +YD+ +TF +
Sbjct: 344 RLSPTAYSELSSAFQNLMTDYPSTSGYSI---FDTCYDFSKYDTVRIPKVGVTFK-GGVE 399
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ + + + C+A + +D + S+ G QQ+ + VYD G + F P C
Sbjct: 400 MDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 459
Query: 354 A 354
+
Sbjct: 460 S 460
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 155/361 (42%), Gaps = 32/361 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP K L+FDTGS + WTQC PCV C+ Q P NP+ S++YK I C
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 190
Query: 66 IC------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C ++ C + C++++ Y G+ + G +TET T N +FGC
Sbjct: 191 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVF---KNFLFGCG 247
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N AG+LG + +L Q T + LFSYCL + + L G
Sbjct: 248 QQNNGLFG--GAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS---SSSKGYLSLGGQV 302
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ K + D + Y L + +SV ++ F+ G +ID+G + T
Sbjct: 303 SKSVK-FTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS------AGTVIDSGTVIT 355
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRAD 296
+ Y + F T + ++ ++ CY +YD+ +TF +
Sbjct: 356 RLSPTAYSELSSAFQNLMTDYPSTSGYSI---FDTCYDFSKYDTVRIPKVGVTFK-GGVE 411
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ + + + C+A + +D + S+ G QQ+ + VYD G + F P C
Sbjct: 412 MDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471
Query: 354 A 354
+
Sbjct: 472 S 472
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 158/364 (43%), Gaps = 23/364 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V V GTP+ + ++ DTGS ++W QC PC +C+ QS +F+P S +Y + C
Sbjct: 121 EYFAQVGV--GTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 178
Query: 64 DLICRR-PPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
ICRR C+ C++++ Y G+ +G ++ET TF + V V GC +
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR---VQRVAIGCGH 235
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA----TSILRFG 176
DN + LG F Q+ + FSYCLV + +S + FG
Sbjct: 236 DNEGLFIAASGLLGLGRGRLSFPT--QIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFG 293
Query: 177 KDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRI-GFAPGTFALR-RNGTGGCMI 232
A M + ++ YY+ L SV R+ G + L G GG ++
Sbjct: 294 AGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVIL 353
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFH 291
D+G T + R YE V F + G + ++ CY R +++ H
Sbjct: 354 DSGTSVTRLARPVYEAVRDAFRA--AAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 411
Query: 292 FDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFV 349
P Y I G FC A++ +D S++G QQQ R V+D + + FV
Sbjct: 412 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 471
Query: 350 PENC 353
P++C
Sbjct: 472 PKSC 475
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 158/364 (43%), Gaps = 23/364 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V V GTP+ + ++ DTGS ++W QC PC +C+ QS +F+P S +Y + C
Sbjct: 121 EYFAQVGV--GTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 178
Query: 64 DLICRR-PPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
ICRR C+ C++++ Y G+ +G ++ET TF + V V GC +
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR---VQRVAIGCGH 235
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA----TSILRFG 176
DN + LG F Q+ + FSYCLV + +S + FG
Sbjct: 236 DNEGLFIAASGLLGLGRGRLSFP--SQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFG 293
Query: 177 KDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRI-GFAPGTFALR-RNGTGGCMI 232
A M + ++ YY+ L SV R+ G + L G GG ++
Sbjct: 294 AGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVIL 353
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFH 291
D+G T + R YE V F + G + ++ CY R +++ H
Sbjct: 354 DSGTSVTRLARPVYEAVRDAFRA--AAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 411
Query: 292 FDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFV 349
P Y I G FC A++ +D S++G QQQ R V+D + + FV
Sbjct: 412 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 471
Query: 350 PENC 353
P++C
Sbjct: 472 PKSC 475
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 154/363 (42%), Gaps = 39/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + F++ DT W +PC +C S+P F+PN SSTY + C
Sbjct: 99 YVVRVKLGTPGQLMFMVLDTSRDAAW---VPCADCAGCSSPTFSPNTSSTYASLQCSVPQ 155
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + C C Y G +S S ++S ++ + +P FGC N
Sbjct: 156 CTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDT----LPSYSFGCVNAV 211
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ G+LG P SLL Q S G+FSYC +++ + LR G Q
Sbjct: 212 SGSTLPPQ--GLLGLGRGPMSLLSQSGSLYSGVFSYCF-PSFKSYYFSGSLRLGPLG--Q 266
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K+++T + + R + YY++L +SV + AP A N G +ID+G + T
Sbjct: 267 PKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITR 326
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
Y + F + F + G ++ C+ + A +TFHF D
Sbjct: 327 FVEPVYAAIRDEFRKQVKGPFATIGA---------FDTCFAATNEDIA-PPVTFHFTGMD 376
Query: 297 FKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVP 350
K+ P I + G C+A++ + N +V+ QQQ+ R ++D+ +
Sbjct: 377 LKL-PLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIAR 435
Query: 351 ENC 353
E C
Sbjct: 436 ELC 438
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 154/365 (42%), Gaps = 34/365 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + G+P+K ++ DTGS W QC PC + C Q P+FNP+AS TYK +PC
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 66 --------ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
P ++ CV++ +Y + + G +S + T L ++G
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLS---SFVYG 219
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS--ILRF 175
C DN+ G GI+G + + S+L QL FSYCL ++ + L
Sbjct: 220 CGQDNQGLF--GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSI 277
Query: 176 GKDANIQRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G + K + + S Y++ L+ I+VA +G A ++ + +ID
Sbjct: 278 GTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT------IID 331
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED-WEYCYRYD----SRFRAYASM 288
+G T I R P V + + T ++ + C++ S +
Sbjct: 332 SG---TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRI 388
Query: 289 TFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
F AD +++ + + G C+A++ S +++G +QQQ + YD+ + F
Sbjct: 389 IFK-GGADLQLK-GHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKVAYDVGNSRVGF 446
Query: 349 VPENC 353
P C
Sbjct: 447 APGGC 451
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 158/345 (45%), Gaps = 28/345 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V + G P + +++FD + W QC PC+ C++Q IF+P+ SS+Y + C+
Sbjct: 187 FLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLSCETKH 246
Query: 67 CRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C P ++G C + I Y G + G++ ET +F V V GCSN N+
Sbjct: 247 CNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG---WVDRVSLGCSNKNQG 303
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK---DANI 181
F G+ G G S ++ +++ SYCLV + ++ ++S L F ++
Sbjct: 304 -PFVGS-DGTFGLGRGSLSFPSRINASS---MSYCLVES-KDGYSSSTLEFNSPPCSGSV 357
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ K ++ + + YY+ L+ I V +I TF + G GG ++ + ++ T +
Sbjct: 358 KAKLLQNPKA----ENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITML 413
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV-- 299
+ Y VV F R + A ++ CY S + F+ D K
Sbjct: 414 ENDTYNVVRDAFVAKTQHLERLK---AFLQFDTCYNLSSNNTVELPI-LEFEVNDGKSWL 469
Query: 300 --EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDL 341
+ +Y+Y + +N G FC A + S + S++G QQ TR +DL
Sbjct: 470 LPKESYLYAVDKN-GTFCFAFAPSKGSFSILGTLQQYGTRVTFDL 513
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 155/350 (44%), Gaps = 20/350 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP L+FDTGS L WTQC PC+ +C++Q P FNP++SSTY+ + C
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
+C C CV+ I Y + G ++ E FT L N V + V FGC +N+
Sbjct: 192 MCEDAE-SCSASNCVYSIGYGDKSFTQGFLAKEKFT--LTNSDV-LEDVYFGCGENNQGL 247
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
FD +AG+LG SL Q +T +FSYCL +T L FG +
Sbjct: 248 -FD-GVAGLLGLGPGKLSLPAQTTTTYNNIFSYCL--PSFTSNSTGHLTFGSAGISESVK 303
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
I F + +Y + + ISV D + P +F+ T G +ID+G + T +
Sbjct: 304 FTPISSF-PSAFNYGIDIIGISVGDKELAITPNSFS-----TEGAIIDSGTVFTRLPTKV 357
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDRADFKVEPTYM 304
Y + F E +S+ + ++ CY + Y ++ F F
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGL---FDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSG 414
Query: 305 YFIFQNEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ C+A + +D ++ G QQ VYD+ G + F P C
Sbjct: 415 ISLPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 145/357 (40%), Gaps = 28/357 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP++ ++FDTGS W QC PCV C+ Q P+F+P S+TY I C
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 155
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ I Y G+ G + +T T + FGC NR
Sbjct: 156 YCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT----IKNFRFGCGEKNRG 211
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F+YCL T L G A
Sbjct: 212 LF--GRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLP---ATSAGTGFLDLGPGA--PAA 264
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + M VDR + YY+ + I V H + F+ T G ++D+G + T +
Sbjct: 265 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFS-----TAGTLVDSGTVITRLPP 319
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-----RYDSRFRAYASMTFHFDRADFK 298
Y + F + G A + CY + S S+ F A
Sbjct: 320 SAYAPLRSAFSKAMQGLGYSAA-PAFSILDTCYDLTGHKGGSIALPAVSLVFQ-GGACLD 377
Query: 299 VEPTYMYFIFQ-NEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+ + + ++ ++ A + D + ++VG QQ+ +YD+ + F P C
Sbjct: 378 VDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 158/364 (43%), Gaps = 23/364 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF V V GTP+ + ++ DTGS ++W QC PC +C+ QS +F+P S +Y + C
Sbjct: 127 EYFAQVGV--GTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 184
Query: 64 DLICRR-PPFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
ICRR C+ C++++ Y G+ +G ++ET TF + V V GC +
Sbjct: 185 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR---VQRVAIGCGH 241
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA----TSILRFG 176
DN + LG F Q+ + FSYCLV + +S + FG
Sbjct: 242 DNEGLFIAASGLLGLGRGRLSFP--SQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFG 299
Query: 177 KDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRI-GFAPGTFALR-RNGTGGCMI 232
A M + ++ YY+ L SV R+ G + L G GG ++
Sbjct: 300 AGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVIL 359
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFH 291
D+G T + R YE V F + G + ++ CY R +++ H
Sbjct: 360 DSGTSVTRLARPVYEAVRDAFRA--AAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 417
Query: 292 FDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFV 349
P Y I G FC A++ +D S++G QQQ R V+D + + FV
Sbjct: 418 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 477
Query: 350 PENC 353
P++C
Sbjct: 478 PKSC 481
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 155/361 (42%), Gaps = 32/361 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP K L+FDTGS + WTQC PCV C+ Q P NP+ S++YK I C
Sbjct: 71 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 130
Query: 66 IC------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C ++ C + C++++ Y G+ + G +TET T N +FGC
Sbjct: 131 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVF---KNFLFGCG 187
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
N AG+LG + +L Q T + LFSYCL + + L G
Sbjct: 188 QQNNGLFG--GAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS---SSSKGYLSLGGQV 242
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ K + D + Y L + +SV ++ F+ G +ID+G + T
Sbjct: 243 SKSVK-FTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFS------AGTVIDSGTVIT 295
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRAD 296
+ Y + F T + ++ ++ CY +YD+ +TF +
Sbjct: 296 RLSPTAYSELSSAFQNLMTDYPSTSGYSI---FDTCYDFSKYDTVRIPKVGVTFK-GGVE 351
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ + + + C+A + +D + S+ G QQ+ + VYD G + F P C
Sbjct: 352 MDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411
Query: 354 A 354
+
Sbjct: 412 S 412
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 154/365 (42%), Gaps = 34/365 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + G+P+K ++ DTGS W QC PC + C Q P+FNP+AS TYK +PC
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 66 --------ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
P ++ CV++ +Y + + G +S + T L ++G
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLS---SFVYG 219
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS--ILRF 175
C DN+ G GI+G + + S+L QL FSYCL ++ + L
Sbjct: 220 CGQDNQGLF--GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSI 277
Query: 176 GKDANIQRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G + K + + S Y++ L+ I+VA +G A ++ + +ID
Sbjct: 278 GTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT------IID 331
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED-WEYCYRYD----SRFRAYASM 288
+G T I R P V + + T ++ + C++ S +
Sbjct: 332 SG---TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRI 388
Query: 289 TFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
F AD +++ + + G C+A++ S +++G +QQQ + YD+ + F
Sbjct: 389 IFK-GGADLQLK-GHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKVAYDVGNSRVGF 446
Query: 349 VPENC 353
P C
Sbjct: 447 APGGC 451
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 152/360 (42%), Gaps = 42/360 (11%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP-------PFRC 74
++ DTGS L W QC PC C+ Q P+F+P+ S++Y +PC+ C P C
Sbjct: 178 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237
Query: 75 ----------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
++ +C + + Y G+ + G+++T+T + V G +FGC NR
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS----VDGFVFGCGLSNRG 293
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG++G + SL+ Q G+FSYCL A +A L G D + R
Sbjct: 294 LF--GGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLP-AATSGDAAGSLSLGGDTSSYRN 350
Query: 185 DMKT--IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
RM D + + +++ A N ++D+G + T +
Sbjct: 351 ATPVSYTRMIADPAQPPFY-FMNVTGASVGGAAVAAAGLGAAN----VLLDSGTVITRLA 405
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFRAYAS-MTFHFD-RADFK 298
Y V F FG +R A + CY +T + AD
Sbjct: 406 PSVYRAVRAEFARQ---FGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMT 462
Query: 299 VEPTYMYFIFQNEG-YFCVA---ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
V+ M F+ + +G C+A +SF D+ ++G +QQ++ R VYD + F E+C+
Sbjct: 463 VDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 142/355 (40%), Gaps = 24/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP+ ++FDTGS W QC PCV C+ Q P+F P S+TY I C
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSS 224
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C R C G C++ + Y G+ G + +T T V FGC NR
Sbjct: 225 YCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYDT----VKDFRFGCGEKNRG 280
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG++G S+ Q G+F+YC+ T L FG A
Sbjct: 281 LF--GKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIP---ATSSGTGFLDFGPGAPAAAN 335
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
T + + + YY+ + I V H + F+ G ++D+G + T +
Sbjct: 336 ARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFS-----DAGALVDSGTVITRLPPS 390
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR---YDSRFRAYASMTFHFDRADFKVEP 301
YE + F + G + A + CY Y A A V+
Sbjct: 391 AYEPLRSAFAKGMEGLG-YKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDA 449
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + ++ + C+A + +D + ++VG QQ+ +YDL + F P C
Sbjct: 450 SGILYV-ADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/348 (28%), Positives = 157/348 (45%), Gaps = 34/348 (9%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PP 71
GTP + D S L+WT C +AP FNP S+T +PC D C++ P
Sbjct: 106 IGTPPQQVSGALDISSDLVWTAC-------GATAP-FNPVRSTTVADVPCTDDACQQFAP 157
Query: 72 FRCENG--QCVHRINYAGGAS-ASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN-RDFSF 127
C G +C + Y GGA+ +GL+ TE FTF + GV+FGC N DFS
Sbjct: 158 QTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTR----IDGVVFGCGLKNVGDFS- 212
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMK 187
++G++G SL+ QL+ FSY ++ S + FG DA Q
Sbjct: 213 --GVSGVIGLGRGNLSLVSQLQVDR---FSYHFA-PDDSVDTQSFILFGDDATPQTSHTL 266
Query: 188 TIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAIATFIQRG 244
+ R+ + S YY+ L I V + GTF LR ++G+GG + + T ++
Sbjct: 267 STRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEA 326
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF-DRADFKVEPT 302
Y+ + + + G ++ ++ + CY +S +A SM F A ++E
Sbjct: 327 AYKPLRQAVA---SKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELG 383
Query: 303 YMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+++ G C+ I S + SV+G+ Q T +YD+N + F
Sbjct: 384 NYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 152/370 (41%), Gaps = 33/370 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+ L DTGS + W QC PC C+ QS P+F+P S++Y+ + D
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREMGYDAPD 193
Query: 67 C----RRPPFRCENGQCVHRINYA-GGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C R + CV+ + Y G++ G ET TF V VP + GC +D
Sbjct: 194 CQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFA---GGVQVPHMSIGCGHD 250
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLV-----YAYREMEATSILR 174
N+ F AGILG S Q+ + + FSYCL R + +T +
Sbjct: 251 NKGL-FAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSVSSTLTIG 309
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYY--------LSLQDISVADHRIGFAPGTFALRRNG 226
G A ++ ++ YY ++ V + + P T G
Sbjct: 310 DGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLDPYT------G 363
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
GG ++D+G T + R Y F G+ + S ++ CY R
Sbjct: 364 RGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGRAMKVP 423
Query: 287 SMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFS-DRN-SVVGAWQQQDTRFVYDLNT 343
+++ HF P Y I + G C A + + DR+ S++G QQQ R VY++
Sbjct: 424 TVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVVYNIGG 483
Query: 344 GTIQFVPENC 353
G + F P +C
Sbjct: 484 GRVGFAPNSC 493
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 112/222 (50%), Gaps = 8/222 (3%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y Y +++ GTP + DTGS LIW QC+PC NC+ Q P+F+ +SST+ I C
Sbjct: 57 YDYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGS 116
Query: 65 LICRRP-PFRCENGQ--CVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSN 120
C + C Q C + +Y G+ G+++ ET T + V GVIFGC +
Sbjct: 117 ESCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGH 176
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILRFGKDA 179
+N +F+ GI+G P SL+ Q+ S+ G +FS CLV +S + FGK +
Sbjct: 177 NNNG-AFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGS 235
Query: 180 NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGT 219
+ + + + + S Y+++L ISV D + F G+
Sbjct: 236 EVLGNGVVSTPLVSKTTYQSFYFVTLLGISVEDINLPFNAGS 277
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 152/360 (42%), Gaps = 42/360 (11%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP-------PFRC 74
++ DTGS L W QC PC C+ Q P+F+P+ S++Y +PC+ C P C
Sbjct: 179 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 238
Query: 75 ----------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
++ +C + + Y G+ + G+++T+T + V G +FGC NR
Sbjct: 239 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS----VDGFVFGCGLSNRG 294
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG++G + SL+ Q G+FSYCL A +A L G D + R
Sbjct: 295 LF--GGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLP-AATSGDAAGSLSLGGDTSSYRN 351
Query: 185 DMKT--IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
RM D + + +++ A N ++D+G + T +
Sbjct: 352 ATPVSYTRMIADPAQPPFY-FMNVTGASVGGAAVAAAGLGAAN----VLLDSGTVITRLA 406
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFRAYAS-MTFHFD-RADFK 298
Y V F FG +R A + CY +T + AD
Sbjct: 407 PSVYRAVRAEFARQ---FGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMT 463
Query: 299 VEPTYMYFIFQNEG-YFCVA---ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
V+ M F+ + +G C+A +SF D+ ++G +QQ++ R VYD + F E+C+
Sbjct: 464 VDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 158/361 (43%), Gaps = 32/361 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN---CFNQSAPIFNPNASSTYKRIPCD 63
+ V V GTP++ L+FDTGS L W QC PC + C Q P+F+P+ SSTY + C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+ C C +N C++ + Y G+S +G++S +T L P FGC
Sbjct: 204 EPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALTGFP---FGCGTR 260
Query: 122 NR-DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N DF G + G+LG SL Q ++ +FSYCL T L G
Sbjct: 261 NLGDF---GRVDGLLGLGRGELSLPSQAAASFGAVFSYCLP---SSNSTTGYLTIGATPA 314
Query: 181 IQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ M S Y++ L I + + + P F GG ++D+G +
Sbjct: 315 TDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGGTLLDSGTVL 369
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHF-DRAD 296
T++ Y ++ F + R ++ + CY + +++F F D A
Sbjct: 370 TYLPAQAYALLRDRFR---LTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAV 426
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
F+++ + IF +E C+A + D S++G QQ+ +YD+ I FVP +
Sbjct: 427 FELD-FFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPAS 485
Query: 353 C 353
C
Sbjct: 486 C 486
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 156/350 (44%), Gaps = 20/350 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP L+FDTGS L WTQC PC+ +C++Q P FNP++SSTY+ + C
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
+C C CV+ I Y + G ++ E FT L N V + V FGC +N+
Sbjct: 192 MCEDAE-SCSASNCVYSIVYGDKSFTQGFLAKEKFT--LTNSDV-LEDVYFGCGENNQGL 247
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
FD +AG+LG SL Q +T +FSYCL +T L FG +
Sbjct: 248 -FD-GVAGLLGLGPGKLSLPAQTTTTYNNIFSYCL--PSFTSNSTGHLTFGSAGISESVK 303
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
I F + +Y + + ISV D + P +F+ T G +ID+G + T +
Sbjct: 304 FTPISSF-PSAFNYGIDIIGISVGDKELAITPNSFS-----TEGAIIDSGTVFTRLPTKV 357
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDRADFKVEPTYM 304
Y + F E +S+ + ++ CY + Y ++ F F +
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGL---FDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSG 414
Query: 305 YFIFQNEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ C+A + +D ++ G QQ VYD+ G + F P C
Sbjct: 415 ISLPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 145/357 (40%), Gaps = 28/357 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP++ ++FDTGS W QC PCV C+ Q P+F+P S+TY I C
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 220
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ I Y G+ G + +T T + FGC NR
Sbjct: 221 YCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT----IKNFRFGCGEKNRG 276
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F+YCL T L G A
Sbjct: 277 LF--GRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLP---ATSAGTGFLDLGPGA--PAA 329
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + M VDR + YY+ + I V H + F+ T G ++D+G + T +
Sbjct: 330 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFS-----TAGTLVDSGTVITRLPP 384
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-----RYDSRFRAYASMTFHFDRADFK 298
Y + F + G A + CY + S S+ F A
Sbjct: 385 SAYAPLRSAFSKAMQGLGYS-AAPAFSILDTCYDLTGHKGGSIALPAVSLVFQ-GGACLD 442
Query: 299 VEPTYMYFIFQ-NEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+ + + ++ ++ A + D + ++VG QQ+ +YD+ + F P C
Sbjct: 443 VDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 154/363 (42%), Gaps = 38/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP + L+FDTGS + WTQC PC+ +C+ Q F+P S++Y + C
Sbjct: 135 YVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSA 194
Query: 66 ICRRPPF-----RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C P N C+++I Y + + G +TET T + +FGC
Sbjct: 195 SCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSDVFT---NFLFGCGQ 251
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
N G AG+LG S S SL Q Q FSYCL +T L FG +
Sbjct: 252 SNNGLF--GQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLP---STPSSTGYLNFGGKVS 306
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
F SS Y + + ISVA ++ P F T G +ID+G + T
Sbjct: 307 QTAGFTPISPAF---SSFYGIDIVGISVAGSQLPIDPSIFT-----TSGAIIDSGTVITR 358
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK-- 298
+ Y+ + FDE +++ + N E + CY F Y +++F FK
Sbjct: 359 LPPTAYKALKEAFDEKMSNYPKT---NGDELLDTCY----DFSNYTTVSFPKVSVSFKGG 411
Query: 299 ----VEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPE 351
++ + + ++ C+A + + +S + G QQ+ VYD G I F
Sbjct: 412 VEVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAG 471
Query: 352 NCA 354
C+
Sbjct: 472 ACS 474
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 159/363 (43%), Gaps = 33/363 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP K ++ DTGS L W QC PC V C Q+ P+++P+ S TYK++ C +
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASV 184
Query: 66 ICRR--------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C R P ++ C++ +Y + + G +S + T L P +G
Sbjct: 185 ECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTL---PQFTYG 241
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C DN+ G AGI+G + S+L QL + FSYCL A L
Sbjct: 242 CGQDNQGLF--GRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSI-- 297
Query: 178 DANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
+I K M D S Y+L L I+V+ + A A+ R T +ID+G
Sbjct: 298 -GSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLA---AAMYRVPT---LIDSG 350
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FRAYASMTFHFD- 293
+ T + Y + + F + ++ + A + C++ + A + F
Sbjct: 351 TVITRLPMSMYAALRQAFVKIMST--KYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQG 408
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
AD + + I ++G C+A S +++ +++G QQQ YD++T I F P
Sbjct: 409 GADLTLRAPSI-LIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAP 467
Query: 351 ENC 353
+C
Sbjct: 468 GSC 470
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 153/364 (42%), Gaps = 26/364 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y VD GTP + L+ D+GS L+W QC PC+ C+ Q P++ P+ SST+ +PC
Sbjct: 65 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPE 124
Query: 67 CRRPP----FRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P F C+ G C + YA + + G+ + E+ T V + V FGC
Sbjct: 125 CLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV----DDVRIDKVAFGCG 180
Query: 120 NDNR-DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN+ F+ G G+LG P S Q+ F+YCLV +S L FG +
Sbjct: 181 RDNQGSFAAAG---GVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGDE 237
Query: 179 ANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D++ + + + YY+ ++ + V + + ++L G GG + D+G
Sbjct: 238 LISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGT 297
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRA 295
T+ Y ++ FD++ R + + + C + ++ S T
Sbjct: 298 TVTYWLPPAYRNILAAFDKNV----RYPRAASVQGLDLCVDVTGVDQPSFPSFTIVLGGG 353
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
YF+ C+A++ + + +G QQ+ YD I F P
Sbjct: 354 AVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPA 413
Query: 352 NCAN 355
C++
Sbjct: 414 KCSS 417
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 161/365 (44%), Gaps = 35/365 (9%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+ Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C
Sbjct: 78 QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSC 135
Query: 63 DDLICR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+C P C++ + C R++Y G+++ G++ +T TF K +PG
Sbjct: 136 GTSMCLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFS 191
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATS 171
FGC+ D+ + GN+ G+LG P S+L Q T FSYCL E + T
Sbjct: 192 FGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTG 250
Query: 172 ILRFGKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
GK A R D++ +M + + +++ L ISV R+G +P F+ + G
Sbjct: 251 YFSLGKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK-----G 303
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASM 288
+ D+G+ ++I V+ + E +R E CY S ++
Sbjct: 304 VVFDSGSELSYIPDRALSVLSQRIRELLL----KRGAAEEESERNCYDMRSVDEGDMPAI 359
Query: 289 TFHFD---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGT 345
+ HFD R D ++ Q + +C+A + ++ S++G+ Q VYDL
Sbjct: 360 SLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQL 419
Query: 346 IQFVP 350
I P
Sbjct: 420 IGIGP 424
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 145/358 (40%), Gaps = 35/358 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++ GTP + L DTGS LIWT+C + ++PNASST+ R+PC D +
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRL 159
Query: 67 CRR----PPFRCENG--QCVHRINYAGGAS---ASGLVSTETFTFHLKNKLVCVPGVIFG 117
C RC G +C ++ Y G G + +ETFT VPGV FG
Sbjct: 160 CAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGD----AVPGVGFG 215
Query: 118 CSND-NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
C+ D+ G AG++G P SL+ QL + G F YCL + S L FG
Sbjct: 216 CTTALEGDY---GEGAGLVGLGRGPLSLVSQLDA---GTFMYCLT---ADASKASPLLFG 266
Query: 177 KDANIQRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
A + + S+ +Y ++L+ I++ G + + D+G
Sbjct: 267 ALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVGGPGGV--------VFDSG 318
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRA 295
T++ Y F TS +E CY R +M HFD
Sbjct: 319 TTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYG---FEACYEKPDSARLIPAMVLHFDGG 375
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y + ++G C + S S++G Q + ++D+ + F P NC
Sbjct: 376 ADMALPVANYVVEVDDGVVCWVVQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANC 433
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 157/352 (44%), Gaps = 38/352 (10%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PP 71
GTP + D S L+WT C +AP FNP S+T +PC D C++ P
Sbjct: 106 IGTPPQQVSGALDISSDLVWTAC-------GATAP-FNPVRSTTVADVPCTDDACQQFAP 157
Query: 72 FRCENG------QCVHRINYAGGAS-ASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN-R 123
C G +C + Y GGA+ +GL+ TE FTF + GV+FGC N
Sbjct: 158 QTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTR----IDGVVFGCGLQNVG 213
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
DFS ++G++G SL+ QL+ FSY ++ S + FG DA Q
Sbjct: 214 DFS---GVSGVIGLGRGNLSLVSQLQVDR---FSYHFA-PDDSVDTQSFILFGDDATPQT 266
Query: 184 KDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMIDTGAIATF 240
+ R+ + S YY+ L I V + GTF LR ++G+GG + + T
Sbjct: 267 SHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTV 326
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF-DRADFK 298
++ Y+ + + + G ++ ++ + CY +S +A SM F A +
Sbjct: 327 LEEAAYKPLRQAV---ASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVME 383
Query: 299 VEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+E +++ G C+ I S + SV+G+ Q T +YD+N + F
Sbjct: 384 LELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 435
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 151/362 (41%), Gaps = 46/362 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DVL G+P K L+ DTGS L W QCLPC +CF Q+
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQN--------------------- 208
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCVPGVIFGCSND 121
+N C + Y ++ +G + ETFT +L ++L V ++FGC +
Sbjct: 209 --------DNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHW 260
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
NR LG FS QL+S FSYCLV + +S L FG+D ++
Sbjct: 261 NRGLFHGAAGLLGLGRGPLSFS--SQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDL 318
Query: 182 QRKDMKTIRMFVDRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
FV + YY+ ++ I VA + T+ + +G GG +ID+G
Sbjct: 319 LSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGT 378
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRA 295
++ YE + E + G+ ++ + C+ + F
Sbjct: 379 TLSYFAEPAYEFIKNKIAEK--AKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADG 436
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
PT FI+ NE C+A+ + ++ S++G +QQQ+ +YD + + P C
Sbjct: 437 AVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 496
Query: 354 AN 355
A+
Sbjct: 497 AD 498
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 147/370 (39%), Gaps = 65/370 (17%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC--LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + GTP + L DTGS + WTQC P CFNQ+ P+F+P+ASS++ +PC
Sbjct: 88 YLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSS 147
Query: 65 LICRRPPFRCENGQ------CVHRINYAGGASASGLVSTETFTFHL---KNKLVCVPGVI 115
C P C G C + I+Y G+ + G + E FTF + VPG++
Sbjct: 148 PACETTP-PCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLV 206
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC + NR F N GI GF SL QLK G FS+C + +L
Sbjct: 207 FGCGHANRGV-FTSNETGIAGFGRGSLSLPSQLKV---GNFSHCFTTITGSKTSAVLLGL 262
Query: 176 GKDANIQRKDMKTIR-----MFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
A + R RSS+ S+ + +R A+R
Sbjct: 263 PGVAPPSASPLGRRRGSYRCRSTPRSSNSGTSITSLPPRTYR--------AVREE----- 309
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
F + VV + + FT F + D +M
Sbjct: 310 ---------FAAQVKLPVVPGNATDPFTCFSAP-LRGPKPD-------------VPTMAL 346
Query: 291 HFDRADFKV-EPTYMYFIFQNEG------YFCVAISFSDRNSVVGAWQQQDTRFVYDLNT 343
HF+ A ++ + Y++ + ++ C+A+ ++G QQQ+ +YDL
Sbjct: 347 HFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAV-IEGGEIILGNIQQQNMHVLYDLQN 405
Query: 344 GTIQFVPENC 353
+ FVP C
Sbjct: 406 SKLSFVPAQC 415
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 160/365 (43%), Gaps = 35/365 (9%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+ Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C
Sbjct: 78 QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSC 135
Query: 63 DDLICR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+C P C++ + C R++Y G+++ G++ +T TF K +P
Sbjct: 136 GTSMCLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFT 191
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATS 171
FGC+ D+ + GN+ G+LG P S+L Q G FSYCL E + T
Sbjct: 192 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTG 250
Query: 172 ILRFGKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
GK A R D++ +M R + +++ L ISV R+G +P F+ + G
Sbjct: 251 YFSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----G 303
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASM 288
+ D+G+ ++I V+ + E +R E CY S ++
Sbjct: 304 VVFDSGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAI 359
Query: 289 TFHFD---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGT 345
+ HFD R D ++ Q + +C+A + ++ S++G+ Q VYDL
Sbjct: 360 SLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQL 419
Query: 346 IQFVP 350
I P
Sbjct: 420 IGIGP 424
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 152/358 (42%), Gaps = 29/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP+K ++ DTGS L W QC PC V+C QS P+F+P SS+Y + C
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSSP 176
Query: 66 IC-------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P + C+++ +Y + + G +S +T +F + VP +GC
Sbjct: 177 QCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFGANS----VPNFYYGC 232
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL T FSYCL ++ L G
Sbjct: 233 GQDNEG--LFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL----PSTSSSGYLSIGSY 286
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ +D S Y++SL ++VA + + + +ID+G +
Sbjct: 287 NPGGYSYTPMVSNTLDD-SLYFISLSGMTVAGKPLAVSSSEYTSLPT-----IIDSGTVI 340
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR-YDSRFRAYASMTFHFD-RAD 296
T + Y + + G + A + C+ S+ RA +++ F A
Sbjct: 341 TRLPTSVYTALSKAVAAAMK--GSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGAT 398
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
K+ + + + C+A + + +++G QQQ VYD+ + I F C+
Sbjct: 399 LKLSAGNL-LVDVDGATTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 154/372 (41%), Gaps = 30/372 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-----IFNPNASSTYKRIP 61
Y V + GTP++ L+ DTGS L W +C + + A +F P S ++ +P
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLP 163
Query: 62 CDDLICRR-PPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLK----NKLVCV 111
CD C+ PF N C + Y +SA G+V ++ T L + +
Sbjct: 164 CDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAKL 223
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
V+ GC+ SF + G+L S S + S G FSYCLV ATS
Sbjct: 224 QEVVLGCTTSYDGQSFKSS-DGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATS 282
Query: 172 ILRFGKDANIQRKDMKTIR----MFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRN 225
L FG + D + R + D + Y++S+ ++VA R+ P + R+N
Sbjct: 283 FLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWDFRKN 342
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
GG ++D+G T + Y+ V++ + F R M + +EYCY +
Sbjct: 343 --GGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM----DPFEYCYNWTGVSAEI 396
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNT 343
M F A P Y I G C+ + SV+G QQ+ + +DL
Sbjct: 397 PRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFDLAN 456
Query: 344 GTIQFVPENCAN 355
++F CA+
Sbjct: 457 RWLRFKQSRCAH 468
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 170/375 (45%), Gaps = 51/375 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV++ K+ L+ DTGS L W QC PC +C+NQ P+++P+ SS+YK + C+
Sbjct: 86 NYIVTVEL----GGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCN 141
Query: 64 DLICR--------RPPFRCENG----QCVHRINYAGGASASGLVSTETFTF---HLKNKL 108
C+ P NG C + ++Y G+ G +++E+ L+N
Sbjct: 142 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLEN-- 199
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+FGC +N+ + +G S SL+ Q T G+FSYCL E
Sbjct: 200 -----FVFGCGRNNKGLFGGSSGL--MGLGRSSVSLVSQTLKTFNGVFSYCL--PSLEDG 250
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSL-QDISVADHRI----GFAPGTFALR 223
A+ L FG D+++ + + +S Y L Q+ + I G + G L+
Sbjct: 251 ASGSLSFGNDSSV----------YTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK 300
Query: 224 RNGTG-GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
+ G G +ID+G + T + Y+ V F + F+ F ++ + Y+
Sbjct: 301 SSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDIS 360
Query: 283 RAYASMTFHFDRADFKVEPTYM-YFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFV 338
M F + A+ +V+ T + YF+ + C+A +S+ + ++G +QQ++ R +
Sbjct: 361 IPIIKMIFQGN-AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVI 419
Query: 339 YDLNTGTIQFVPENC 353
YD + V ENC
Sbjct: 420 YDTTQERLGIVGENC 434
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 152/376 (40%), Gaps = 37/376 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-APIFNPNASSTYKRIPCDDL 65
Y VD+ G P +S L+ DTGS L+W +C C NC + S A +F P SST+ C D
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 143
Query: 66 ICRRPPFRCENGQCVH-RIN--------YAGGASASGLVSTETFTFHLKN-KLVCVPGVI 115
+CR P C H RI+ YA G+ SGL + ET + + K + V
Sbjct: 144 VCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVA 203
Query: 116 FGC-----SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
FGC SF+G G++G P S QL FSYCL+ T
Sbjct: 204 FGCGFRISGQSVSGTSFNG-ANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPT 262
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
S L G + K T + S + YY+ L+ + V ++ P + + +G GG
Sbjct: 263 SYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGG 322
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED-----WEYCYRYDSRFRA 284
++D+G F+ Y V+ R+R+ D ++ C +
Sbjct: 323 TVVDSGTTLAFLAEPAYRSVIAAV--------RRRVKLPIADALTPGFDLCVNVSGVTKP 374
Query: 285 ---YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFV 338
+ F F V P YFI E C+AI D SV+G QQ F
Sbjct: 375 EKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFE 434
Query: 339 YDLNTGTIQFVPENCA 354
+D + + F CA
Sbjct: 435 FDRDRSRLGFSRRGCA 450
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 144/356 (40%), Gaps = 31/356 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDD 64
Y V GTP ++ L DTGS L W QC PC +C+ Q P+F+P SS+Y +PC
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGR 196
Query: 65 LICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C C QC + ++Y G++ +G+ S++T T V G +FGC +
Sbjct: 197 SACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAANAT---VQGFLFGCGHA 253
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
F G I G+LGF SL+ Q G+FSYCL + T L G + +
Sbjct: 254 QSGGLFTG-IDGLLGFGREQPSLVQQTAGAYGGVFSYCLP---TKSSTTGYLTLGGPSGV 309
Query: 182 QRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
T + + YY + L ISV + FA G ++DTG + T
Sbjct: 310 APGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFA------AGTVVDTGTVITR 363
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVE 300
+ Y + F S+ + CY F Y ++ F
Sbjct: 364 LPPAAYAALRSAFRSGMASYPSAPPIGI---LDTCY----SFAGYGTVNLTSVALTFSSG 416
Query: 301 PTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
T + C+A + S + +++G QQ+ F ++ ++ F P +C
Sbjct: 417 ATMTLGADGIMSFGCLAFASSGSDGSMAILGNVQQRS--FEVRIDGSSVGFRPSSC 470
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 166/374 (44%), Gaps = 49/374 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV++ K+ L+ DTGS L W QC PC +C+NQ P+++P+ SS+YK + C+
Sbjct: 134 NYIVTVEL----GGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCN 189
Query: 64 DLICR--------RPPFRCENG----QCVHRINYAGGASASGLVSTETFTF---HLKNKL 108
C+ P NG C + ++Y G+ G +++E+ L+N
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLEN-- 247
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+FGC +N+ + +G S SL+ Q T G+FSYCL E
Sbjct: 248 -----FVFGCGRNNKGLFGGSSGL--MGLGRSSVSLVSQTLKTFNGVFSYCL--PSLEDG 298
Query: 169 ATSILRFGKDANIQRKDMKT----IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
A+ L FG D+++ + S Y L+L S+ G L+
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASI---------GGVELKS 349
Query: 225 NGTG-GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
+ G G +ID+G + T + Y+ V F + F+ F ++ + Y+
Sbjct: 350 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISI 409
Query: 284 AYASMTFHFDRADFKVEPTYM-YFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVY 339
M F + A+ +V+ T + YF+ + C+A +S+ + ++G +QQ++ R +Y
Sbjct: 410 PIIKMIFQGN-AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIY 468
Query: 340 DLNTGTIQFVPENC 353
D + V ENC
Sbjct: 469 DTTQERLGIVGENC 482
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 163/374 (43%), Gaps = 45/374 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV G + ++ DT S L W QC PC +C +Q P+F+P++S +Y +PC+
Sbjct: 119 NYVATV----GLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCN 174
Query: 64 DLICR---------RPPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCV 111
C P +N Q C + ++Y G+ + G+++ + ++ +
Sbjct: 175 SSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQD----I 230
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
G +FGC N+ F G +G++G S SL+ Q G+FSYCL RE ++
Sbjct: 231 EGFVFGCGTSNQGAPF-GGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCL--PMRESGSSG 287
Query: 172 ILRFGKDANIQRKDMKTI--RMFVD----RSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
L G D++ R + M D + Y+L+L I+V + +P A
Sbjct: 288 SLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEV-ESPWFSA---- 342
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRA 284
G +ID+G I T + Y V F + + + + C+ +
Sbjct: 343 --GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSI---LDTCFNLTGLKEVQ 397
Query: 285 YASMTFHFD-RADFKVEPT-YMYFIFQNEGYFCVAISFSDR---NSVVGAWQQQDTRFVY 339
S+ F F+ + +V+ +YF+ + C+A++ S++G +QQ++ R ++
Sbjct: 398 VPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIF 457
Query: 340 DLNTGTIQFVPENC 353
D I F E C
Sbjct: 458 DTLGSQIGFAQETC 471
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 166/374 (44%), Gaps = 49/374 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV++ K+ L+ DTGS L W QC PC +C+NQ P+++P+ SS+YK + C+
Sbjct: 134 NYIVTVEL----GGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCN 189
Query: 64 DLICR--------RPPFRCENG----QCVHRINYAGGASASGLVSTETFTF---HLKNKL 108
C+ P NG C + ++Y G+ G +++E+ L+N
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLEN-- 247
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+FGC +N+ + +G S SL+ Q T G+FSYCL E
Sbjct: 248 -----FVFGCGRNNKGLFGGSSGL--MGLGRSSVSLVSQTLKTFNGVFSYCL--PSLEDG 298
Query: 169 ATSILRFGKDANIQRKDMKT----IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
A+ L FG D+++ + S Y L+L S+ G L+
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASI---------GGVELKS 349
Query: 225 NGTG-GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
+ G G +ID+G + T + Y+ V F + F+ F ++ + Y+
Sbjct: 350 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISI 409
Query: 284 AYASMTFHFDRADFKVEPTYM-YFIFQNEGYFCVA---ISFSDRNSVVGAWQQQDTRFVY 339
M F + A+ +V+ T + YF+ + C+A +S+ + ++G +QQ++ R +Y
Sbjct: 410 PIIKMIFQGN-AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIY 468
Query: 340 DLNTGTIQFVPENC 353
D + V ENC
Sbjct: 469 DSTQERLGIVGENC 482
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 164/373 (43%), Gaps = 33/373 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC---LPCVNCFNQSA------PIFNPNASSTY 57
Y+V GTPS+ L+ DTGS L W C NC N+ A +F+ N SS++
Sbjct: 12 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 71
Query: 58 KRIPCDDLICR---RPPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKN-KL 108
K IPC +C+ F N C + Y+ G++A G + ET T LK +
Sbjct: 72 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRK 131
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ + V+ GCS + SF G++G S +S + G FSYCLV
Sbjct: 132 MKLHNVLIGCSESFQGQSFQA-ADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKN 190
Query: 169 ATSILRFGKDANIQR--KDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRN 225
++ L FG + + +M + + +S Y +++ IS+ + + ++
Sbjct: 191 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVK-- 248
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
G GG ++D+G+ TF+ Y+ VM F + M EYC+ +
Sbjct: 249 GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIG--PLEYCFNSTGFEESL 306
Query: 286 AS-MTFHF-DRADFKVEPTYMYFIFQNEGYFC---VAISFSDRNSVVGAWQQQDTRFVYD 340
+ FHF D A+F+ P Y I +G C V++++ SVVG QQ+ + +D
Sbjct: 307 VPRLVFHFADGAEFE-PPVKSYVISAADGVRCLGFVSVAWPG-TSVVGNIMQQNHLWEFD 364
Query: 341 LNTGTIQFVPENC 353
L + F P +C
Sbjct: 365 LGLKKLGFAPSSC 377
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 147/362 (40%), Gaps = 46/362 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + G+P K L+FDTGS L W +C +A F+P S++Y + C +
Sbjct: 134 YIVSIGLGSPKKDLMLIFDTGSDLTWARC--------SAAETFDPTKSTSYANVSCSTPL 185
Query: 67 CRRP------PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C P RC CV+ I Y G+ + G + E T + FGC
Sbjct: 186 CSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTD---IFNNFYFGCGQ 242
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
D D F G AG+LG S++ Q LFSYCL +T L FG
Sbjct: 243 D-VDGLF-GKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL----PSSSSTGFLSFGSS-- 294
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
Q K K + SS Y L L I+V ++ F+ T G +ID+G + T
Sbjct: 295 -QSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFS-----TAGTIIDSGTVVTR 348
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA--YASMTFHFDRA-DF 297
+ Y + F + S+ M + CY + S+++ + F D
Sbjct: 349 LPPAAYSALRSAFRKAMASY---PMGKPLSILDTCYDF-SKYKTIKVPKIVISFSGGVDV 404
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDR-----NSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
V+ IF G V ++F+ ++ G QQ++ VYD++ G + F P +
Sbjct: 405 DVD---QAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPAS 461
Query: 353 CA 354
C+
Sbjct: 462 CS 463
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/374 (24%), Positives = 161/374 (43%), Gaps = 48/374 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV + G + ++ DT S L W QC PC C +Q P+F+P++S +Y +PC+
Sbjct: 112 NYVATVGIGGGEAT----VIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCN 167
Query: 64 DLICRRPPFRCENGQ-----------CVHRINYAGGASASGLVSTETFTFHLKNKLVCVP 112
C R G C + ++Y G+ + G+++ + + ++ +
Sbjct: 168 SSSCDA--LRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED----IQ 221
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
G +FGC N+ F G +G++G S SL+ Q G+FSYCL +E ++
Sbjct: 222 GFVFGCGTSNQG-PF-GGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL--PPKESGSSGS 277
Query: 173 LRFGKDANIQRKDMKTI--RMFVD--RSSHYYLSLQDISVADHRI---GFAPGTFALRRN 225
L G DA++ R + M D + Y +L I+V + GF+ G
Sbjct: 278 LVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAG------- 330
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRA 284
G G ++D+G I T + Y V F + + + + C+ R
Sbjct: 331 GGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSI---LDTCFDLTGLREVQ 387
Query: 285 YASMTFHFD-RADFKVEPT-YMYFIFQNEGYFCVAISFSDR---NSVVGAWQQQDTRFVY 339
S+ FD A+ +V+ +Y + + C+A++ ++G +QQ++ R ++
Sbjct: 388 VPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIF 447
Query: 340 DLNTGTIQFVPENC 353
D I F E C
Sbjct: 448 DTVGSQIGFAQETC 461
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 163/375 (43%), Gaps = 35/375 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +DV GTP + ++ DTGS L W QC PC++CF+Q P+F+P ASS+Y+ + C D
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQR 210
Query: 67 C-----RRPPFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC--VPGVIF 116
C PP C C + Y ++ +G ++ E+FT +L V V+F
Sbjct: 211 CGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVF 270
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC + NR LG F+ QL++ FSYCLV ++ + + FG
Sbjct: 271 GCGHWNRGLFHGAAGLLGLGRGPLSFA--SQLRAVYGHTFSYCLVDHGSDVASKVV--FG 326
Query: 177 KDANIQRKDMK---TIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTF--ALRRNGT 227
+D + F SS YY+ L+ + V + + T+ G+
Sbjct: 327 EDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGS 386
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRA 284
GG +ID+G ++ Y+V+ + F + R + D+ CY R
Sbjct: 387 GGTIIDSGTTLSYFVEPAYQVIRQAFIDRMG-----RSYPLIPDFPVLSPCYNVSGVDRP 441
Query: 285 -YASMTFHFDRADFKVEPTYMYFI-FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYD 340
++ F P YFI +G C+A+ + R S++G +QQQ+ VYD
Sbjct: 442 EVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVVYD 501
Query: 341 LNTGTIQFVPENCAN 355
L + F P CA
Sbjct: 502 LKNNRLGFAPRRCAE 516
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 152/357 (42%), Gaps = 28/357 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTPS S ++ DTGS L W QC PC V+C Q P+F+P ASSTY + C
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSAS 193
Query: 66 ICRR------PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P C C+++ +Y + + G +ST+T +F + P +GC
Sbjct: 194 QCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS----YPSFYYGC 249
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL A +T L G
Sbjct: 250 GQDNEGLF--GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTA----ASTGYLSIGPY 303
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ +S Y+++L +SV + +P ++ +ID+G +
Sbjct: 304 NTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----IIDSGTVI 358
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR-YDSRFRAYASMTFHFDRADF 297
T + + + + + G QR A + C+ S+ R + A
Sbjct: 359 TRLPTAVHTALSKAVAQAMA--GAQRA-PAFSILDTCFEGQASQLRVPTVVMAFAGGASM 415
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
K+ T I ++ C+A + +D +++G QQQ +YD+ I F C+
Sbjct: 416 KLT-TRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 148/355 (41%), Gaps = 25/355 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P SSTY + C
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 238
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C R C G C++ + Y G+ + G + +T T + V G FGC N
Sbjct: 239 ACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA---VKGFRFGCGERNEG 295
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++CL T L FG + R
Sbjct: 296 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP---ARSTGTGYLDFGAGSPAAR- 349
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ T M VD + YY+ L I V + FA T G ++D+G + T +
Sbjct: 350 -LTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFA-----TAGTIVDSGTVITRLPP 403
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEP 301
Y + F ++ G ++ S + CY + + A +++ F A V+
Sbjct: 404 AAYSSLRSAFAAAMSARGYKKAPAVSL-LDTCYDFAGMSQVAIPTVSLLFQGGARLDVDA 462
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + + C+A + ++ +VG Q + YD+ + F P C
Sbjct: 463 SGIMYA-ASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 153/357 (42%), Gaps = 29/357 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y ++ GTP+ S ++ DTGS L W QC PC V+C Q P+++P ASSTY +PC
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSAS 193
Query: 66 ICRR------PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P C C+++ +Y + + G +S +T +F + P +GC
Sbjct: 194 QCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS----YPNFYYGC 249
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL +T L G
Sbjct: 250 GQDNEG--LFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL----PTPASTGYLSIGPY 303
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ +D +S Y+++L +SV + +P ++ +ID+G +
Sbjct: 304 TSGHYSYTPMASSSLD-ASLYFVTLSGMSVGGSPLAVSPAEYSSLPT-----IIDSGTVI 357
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR-YDSRFRAYASMTFHFDRADF 297
T + Y + + G Q A + C++ S+ R A A
Sbjct: 358 TRLPTAVYTALSKAVAAAM--VGVQSAP-AFSILDTCFQGQASQLRVPAVAMAFAGGATL 414
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
K+ T I ++ C+A + +D +++G QQQ VYD+ I F C+
Sbjct: 415 KLA-TQNVLIDVDDSTTCLAFAPTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 164/373 (43%), Gaps = 33/373 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC---LPCVNCFNQSA------PIFNPNASSTY 57
Y+V GTPS+ L+ DTGS L W C NC N+ A +F+ N SS++
Sbjct: 83 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 58 KRIPCDDLICR---RPPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKN-KL 108
K IPC +C+ F N C + Y+ G++A G + ET T LK +
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRK 202
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ + V+ GCS + SF G++G S +S + G FSYCLV
Sbjct: 203 MKLHNVLIGCSESFQGQSFQA-ADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKN 261
Query: 169 ATSILRFGKDANIQR--KDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRN 225
++ L FG + + +M + + +S Y +++ IS+ + + ++
Sbjct: 262 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVK-- 319
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
G GG ++D+G+ TF+ Y+ VM F + M EYC+ +
Sbjct: 320 GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIG--PLEYCFNSTGFEESL 377
Query: 286 AS-MTFHF-DRADFKVEPTYMYFIFQNEGYFC---VAISFSDRNSVVGAWQQQDTRFVYD 340
+ FHF D A+F+ P Y I +G C V++++ SVVG QQ+ + +D
Sbjct: 378 VPRLVFHFADGAEFE-PPVKSYVISAADGVRCLGFVSVAWPG-TSVVGNIMQQNHLWEFD 435
Query: 341 LNTGTIQFVPENC 353
L + F P +C
Sbjct: 436 LGLKKLGFAPSSC 448
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 111/217 (51%), Gaps = 20/217 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + +L DTGS LIWTQC PC C + AP F P +SST+ ++PC +
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 67 CR--RPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C+ P+R C CV+ Y G +A G ++TE T H+ PGV FGCS +N
Sbjct: 150 CQFLTSPYRTCNATGCVYYYPYGMGFTA-GYLATE--TLHVGGA--SFPGVTFGCSTEN- 203
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ +GI+G SP SL+ Q+ FSYCL S + FG A +
Sbjct: 204 --GVGNSSSGIVGLGRSPLSLVSQV---GVARFSYCL--RSNADAGDSPILFGSLAKVTG 256
Query: 184 KDMKTIRMF----VDRSSHYYLSLQDISVADHRIGFA 216
++++ + + SS+YY++L I+V + A
Sbjct: 257 GNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMA 293
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 156/371 (42%), Gaps = 45/371 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPC--- 62
Y V + GTP+K ++ DTGS L W QC PCV C Q PIF P+ S TYK +PC
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSS 172
Query: 63 -----DDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
P G CV++ +Y + + G +S + T L G ++G
Sbjct: 173 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLT--LTPSEAPSSGFVYG 230
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI---LR 174
C DN+ G +GI+G + S+LGQL FSYCL ++ ++S+ L
Sbjct: 231 CGQDNQGLF--GRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLS 288
Query: 175 FGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
G +++ K + ++ S Y+L L I+VA +G + ++ + +I
Sbjct: 289 IGA-SSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPT------II 341
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQR---------MHNASEDWEYCYRYDSRFR 283
D+G + T + Y + + F + Q + ++ FR
Sbjct: 342 DSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFR 401
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLN 342
A + + ++E +G C+AI+ S S++G +QQQ + YD+
Sbjct: 402 GGAGLELKAHNSLVEIE----------KGTTCLAIAASSNPISIIGNYQQQTFKVAYDVA 451
Query: 343 TGTIQFVPENC 353
I F P C
Sbjct: 452 NFKIGFAPGGC 462
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 144/364 (39%), Gaps = 26/364 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD-- 64
Y VD GTP + L+ DTGS L + QC PC C+ Q P++ P+ SST+ +PCD
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSAE 93
Query: 65 -LICRRP---------PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
L+ P P G C + Y +S G+ + ET T + V V
Sbjct: 94 CLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV----GGIRVNHV 149
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC N N+ SF + G+LG S Q + F+YCL S L
Sbjct: 150 AFGCGNRNQG-SFV-SAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSVFSSLI 207
Query: 175 FGKDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
FG D D++ + + S YY+ + I + + + G GG +
Sbjct: 208 FGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGNGGTIF 267
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFH 291
D+G T+ Y ++ F++ S R + + C Y S T
Sbjct: 268 DSGTTVTYWSPQAYARIIAAFEK---SVPYPRAPPSPQGLPLCVNVSGIDHPIYPSFTIE 324
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
FD+ YFI + C+A+ S SD +V+G QQ+ YD I F
Sbjct: 325 FDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEHRIGFA 384
Query: 350 PENC 353
NC
Sbjct: 385 HANC 388
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 90/286 (31%), Positives = 122/286 (42%), Gaps = 27/286 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + FGTP+ + LL DTGS L W QC PC C+ Q P+F+P+ASSTY +PC
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGS 181
Query: 65 LICRR-PPFRCENG---------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
CR P NG C + I Y G + G+ STET T + V V
Sbjct: 182 EACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATV-VNNF 240
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC + G+LG +P SL+ Q T G FSYCL L
Sbjct: 241 SFGCGLVQKGVFD--LFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGN---STAGFLA 295
Query: 175 FGKDANI--QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
G A + + V ++ Y + L ISV ++ P FA GG +I
Sbjct: 296 LGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFA------GGMII 349
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY 278
D+G I T + Y + F +++ N ED + CY +
Sbjct: 350 DSGTIVTGLPETAYSALRTAFRSAMSAY-PLLPPNDDEDLDTCYDF 394
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 165/365 (45%), Gaps = 37/365 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LP-CVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y + G+P+ + + D+GS L+W QC P C NC+ Q P+FNP+ S TY + C+
Sbjct: 101 YVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNT 160
Query: 65 LICRRPP----FRCE--NGQCVHRINYAGGASASGLVSTETFTF--HLKNKLVCVPGVIF 116
CR +RC+ N C + +Y + G++ST+ FTF H+ +IF
Sbjct: 161 AECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIF 220
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL-VYAYREMEATSILRF 175
GC +N D G++G + + SL+GQ+ FSYC+ + + ++ + +RF
Sbjct: 221 GCGYNNSDPQ-HFYPPGLVGLTNNKASLVGQMDVDQ---FSYCVSIDTEQNLKGSMEIRF 276
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYL--SLQDISVADHRI-GFAPGTFALRRNGTGGCMI 232
G A+I + V S +Y+ ++ I V + + G+ F G GG +
Sbjct: 277 GLAASISGHSTQ----LVPNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQGGLTM 332
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFH 291
DTG T + + +++ +EH T + N+ +E CY D A +
Sbjct: 333 DTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSNSG--FELCYFSDDFLGATLPDIELR 390
Query: 292 F-DRADFKVEPTYMYFIFQN------EGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTG 344
F D D TY F +N C+A+ ++ S++G Q +D + YDL+
Sbjct: 391 FTDNKD-----TYFSFNTRNAWTPNGRSQMCLAMFRTNGMSIIGMHQLRDIKIGYDLHHN 445
Query: 345 TIQFV 349
+ F
Sbjct: 446 IVSFT 450
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 155/370 (41%), Gaps = 43/370 (11%)
Query: 1 HEKNYF-----YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASS 55
H N F + VDV FGTP L+ DTGS + WTQC CVNC S F+ +ASS
Sbjct: 117 HNNNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASS 176
Query: 56 TYKRIPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
TY C P EN + + Y +++ G +T T +
Sbjct: 177 TYSFGSC-------IPSTVENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVF---QKFQ 223
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC +N+ F + G+LG S + Q S +FSYCL E ++ L F
Sbjct: 224 FGCGRNNKG-DFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCL----PEEDSIGSLLF 278
Query: 176 GKDANIQRKDMKTIRMF-----VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
G+ A Q +K + + S +Y+++L DISV + R+ FA + G
Sbjct: 279 GEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFA-----SPGT 333
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSF----GRQRMHNASEDWEYCYRYDSRFRA-Y 285
+ID+ + T + + Y + F + + GR++ + + CY R
Sbjct: 334 IIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDI---LDTCYNLSGRKDVLL 390
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTG 344
+ HF AD ++ T + + + C+A + + +++G QQ +YD+
Sbjct: 391 PEIVLHFGGGADVRLNGTNIVW-GSDASRLCLAFAGTSELTIIGNRQQLSLTVLYDIQGR 449
Query: 345 TIQFVPENCA 354
I F C+
Sbjct: 450 RIGFGGNGCS 459
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 152/380 (40%), Gaps = 42/380 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + L G P + + DTGS LIWTQC C CF Q+ P ++P+ S + + C+D
Sbjct: 71 YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDA 130
Query: 66 ICRR-PPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C +C +N C Y G + +G ++TE TF + ++FGC
Sbjct: 131 ACALGSETQCLSDNKTCAVVTGY-GAGNIAGTLATENLTFQSETV-----SLVFGCIVVT 184
Query: 123 R--DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
+ S +G +GI+G SL QL T FSYCL + + S + G A
Sbjct: 185 KLSPGSLNGA-SGIIGLGRGKLSLPSQLGDTR---FSYCLTPYFEDTIEPSHMVVGASAG 240
Query: 181 IQRKDMKT--------IRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTG-- 228
+ + +R D S+ YYL L I+ ++ F LR+ G
Sbjct: 241 LINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMW 300
Query: 229 -GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
G ID+GA T + Y+ + + Q + + ++ C R
Sbjct: 301 TGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAG-TTGFDLCVALKDAERLVPP 359
Query: 288 MTFHFDRA-----DFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDT 335
+ HF D V P + + V S DR S V+G + QQ+
Sbjct: 360 LVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNM 419
Query: 336 RFVYDLNTGTIQFVPENCAN 355
+YDL G + F P +C++
Sbjct: 420 HVLYDLAGGVLSFQPADCSS 439
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 149/362 (41%), Gaps = 30/362 (8%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPC 62
NYF TV + GTP K L+FDTGS L WTQC PCV +C+NQ IFNP+ S++Y I C
Sbjct: 152 NYFVTVGL--GTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISC 209
Query: 63 DDLICRRPP------FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C F C + CV+ I Y + + G E + + F
Sbjct: 210 GSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATDVF---NDFYF 266
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC +N+ AG+LG SL+ Q +FSYCL +T L FG
Sbjct: 267 GCGQNNKGLFG--GAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLP---SSSSSTGFLTFG 321
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
+ + + SS Y L L ISV ++ +P F+ T G +ID+G
Sbjct: 322 GSTS-KSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFS-----TAGTIIDSGT 375
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ T + Y + F + + + A + C+ + + F
Sbjct: 376 VITRLPPAAYSALSSTFRKLMSQY---PAAPALSILDTCFDFSNHDTISVPKIGLFFSGG 432
Query: 297 FKVEPTYMYFIFQNE-GYFCVAISF-SDRNSVV--GAWQQQDTRFVYDLNTGTIQFVPEN 352
V+ + N+ C+A + SD + V G QQ+ VYD G + F P
Sbjct: 433 VVVDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAG 492
Query: 353 CA 354
C+
Sbjct: 493 CS 494
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 149/356 (41%), Gaps = 26/356 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTPS S ++ DTGS L W QC PC V+C Q P+F+P ASSTY + C
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSAS 193
Query: 66 ICRR------PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P C C+++ +Y + + G +ST+T +F P +GC
Sbjct: 194 QCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTR----YPSFYYGC 249
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN F G AG++G + + SLL QL + FSYCL A +T L G
Sbjct: 250 GQDNEGL-F-GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTA----ASTGYLSIGPY 303
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ +S Y+++L +SV + +P ++ +ID+G +
Sbjct: 304 NTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----IIDSGTVI 358
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + + + + + G QR A + C+ + ++ F
Sbjct: 359 TRLPTAVHTALSKAVAQAMA--GAQRA-PAFSILDTCFEGQASQLRVPTVAMAFAGGASM 415
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
T I ++ C+A + +D +++G QQQ +YD+ I F C+
Sbjct: 416 KLTTRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 148/346 (42%), Gaps = 36/346 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ VDV FGTP + L+ DTGS + WTQC PCV C S F+P+AS TY C
Sbjct: 162 FLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCI--- 218
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFS 126
P N + + Y +++ G +T T + P FGC +N +
Sbjct: 219 ----PSTVGN---TYNMTYGDKSTSVGNYGCDTMTLEHSD---VFPKFQFGCGRNN-EGD 267
Query: 127 FDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDM 186
F G+LG S + Q S + +FSYCL E ++ L FG+ A Q +
Sbjct: 268 FGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCL----PEEDSIGSLLFGEKATSQSSSL 323
Query: 187 KTIRMF-------VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
K + ++ S +Y++ L DISV + R+ FA + G +ID+G + T
Sbjct: 324 KFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFA-----SPGTIIDSGTVIT 378
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASED-WEYCYRYDSRFRA-YASMTFHF-DRAD 296
+ + Y + F + + D + CY R + HF + AD
Sbjct: 379 RLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGAD 438
Query: 297 FKVEPTYMYFIFQNEGY-FCVAISFSDRNSVVGAWQQQDTRFVYDL 341
++ + I+ N+ C+A + + +++G QQ +YD+
Sbjct: 439 VRLNGKRV--IWGNDASRLCLAFAGNSELTIIGNRQQVSLTVLYDI 482
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 152/364 (41%), Gaps = 30/364 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+ + DTGS W QC PC +C+ Q P+F+P ASSTY +PC
Sbjct: 139 YVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARE 198
Query: 67 CRR--------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV---CVPGVI 115
C+ N C + ++Y + G ++ +T T VPG +
Sbjct: 199 CQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFV 258
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC + N +F G + G+LG + SL Q+ + FSYCL A L F
Sbjct: 259 FGCGHSNAG-TF-GEVDGLLGLGLGKASLPSQVAARYGAAFSYCLP---SSPSAAGYLSF 313
Query: 176 GKDANIQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
G A R + + M + + YYL+L I VA I FA G +ID+
Sbjct: 314 GGAA--ARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFAT----AAGTIIDS 367
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA--SEDWEYCYRYDSR--FRAYASMTF 290
G + + Y + F ++ GR R A S ++ CY + R A
Sbjct: 368 GTAFSRLPPSAYAALRSSFR---SAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELV 424
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
D A + P+ + + + + C+A + ++G QQ+ +YD+ + I F
Sbjct: 425 FADGATVHLHPSGVLYTWNDVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRIGFGR 484
Query: 351 ENCA 354
+ CA
Sbjct: 485 KGCA 488
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 148/364 (40%), Gaps = 32/364 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP--IFNPNASSTYKRIPCDD 64
Y V + GTP + L+ DTGS L W V C S P +F P S ++ IPC
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTW------VKCAGASPPGRVFRPKTSRSWAPIPCSS 169
Query: 65 LICRRP-PFRCEN-----GQCVHRINYA-GGASASGLVSTETFTFHLK-NKLVCVPGVIF 116
C+ PF N C + Y G A A G+V TE+ T L K+ + V+
Sbjct: 170 DTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVL 229
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GCS+ + SF + G+L + S Q + G FSYCLV AT L FG
Sbjct: 230 GCSSSHDGQSFR-SADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFG 288
Query: 177 KDANIQRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
+ R ++F+D +Y + + I VA + + + +GG ++D+G
Sbjct: 289 P-GQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAK---SGGVILDSG 344
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----YASMTFH 291
T + Y+ V+ +H + +E+CY + +R +
Sbjct: 345 NTLTVLAAPAYKAVVAALSKHLDGVPKVSF----PPFEHCYNWTARRPGAPEIIPKLAVQ 400
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFV 349
F + P Y I G C+ + + SV+G QQ+ + +DL ++F
Sbjct: 401 FAGSARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFK 460
Query: 350 PENC 353
NC
Sbjct: 461 QSNC 464
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 150/365 (41%), Gaps = 40/365 (10%)
Query: 7 YTVDVLFGTPSKSE-----FLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
Y + GTP +++ L D GS + W QC+PC C++Q P++N SS+ +
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVG 184
Query: 62 CDDLICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
C CR G +C +++ Y G+S++G ET TF V VPGV
Sbjct: 185 CYAPACRA--LGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPG---VRVPGVA 239
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
GC +DN+ F AGILG S Q+ FSYCL +S L F
Sbjct: 240 IGCGSDNQGL-FPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLA-GQGTGGRSSTLTF 297
Query: 176 GKDANIQRKD---------MKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFAL-RR 224
G A+ + RM+ + YY+ L ISV R+ G L
Sbjct: 298 GSGASATTTTTTPPSFTPMLTNSRMY----TFYYVGLVGISVGGVRVRGVTESDLRLDPS 353
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFD-EHFTSFGRQRMHNASEDWEYCYRY--DSR 281
G GG ++D+G T + Y F G ++ CY
Sbjct: 354 TGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRV 413
Query: 282 FRAYASMTFHF-DRADFKVEP-TYMYFIFQNEGYFCVAISFS-DRN-SVVGAWQQQDTRF 337
+ +++ HF + K+ P Y+ + N+G C A + S DR S++G Q Q R
Sbjct: 414 MKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFRV 473
Query: 338 VYDLN 342
VYD++
Sbjct: 474 VYDVD 478
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 159/395 (40%), Gaps = 67/395 (16%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-------------------CFNQ 44
++ Y V GTP + DTGS L+W +C N +
Sbjct: 79 DFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPE 138
Query: 45 SAPIFNPNASSTYKRIPCDDLICRRPPFRCE-NGQ---CVHRINYAGGASASGLVSTETF 100
+ FNP SS+Y R+ CD C NG C R +Y GASA+GL++ +TF
Sbjct: 139 AVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAADTF 198
Query: 101 TF--HLKNKLVCVPGVIFGCSNDN--RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLF 156
TF ++ N + FGC+ R+F D G++G P SL QL F
Sbjct: 199 TFGGNINNDTTSTASIDFGCATGTAGREFQAD----GMVGLGAGPLSLASQLGRK----F 250
Query: 157 SYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRS---SHYYLSLQDISVADHRI 213
S+CL AY +A+SIL FG A + T + S ++Y +S+ + VA +
Sbjct: 251 SFCLT-AYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPV 309
Query: 214 GFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY-----EVVMRHFDEHFTSFGRQRMHNA 268
PGT ++ + ++DTG + TF+ R E + R D G R
Sbjct: 310 ---PGTTSVSK-----VIVDTGTVLTFLDRAALLAPLTESLARVMD----GAGLPRAPPP 357
Query: 269 SEDWEYCYRYDSRFR----AYASMTFHF-DRADFKVEPT-YMYFIFQNEGYFCVAISFSD 322
E E CY SR + +T +V T F+ EG C+A+ +
Sbjct: 358 DETLELCYDV-SRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTTS 416
Query: 323 RN----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
SV+G QD DL+ T F NC
Sbjct: 417 PELQPLSVLGNVALQDLHVGIDLDARTATFATANC 451
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 163/367 (44%), Gaps = 36/367 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-------IFNPNASSTYKR 59
Y + G PS DT + LIW Q C NC +Q P F + S TY+
Sbjct: 75 YLMSFNIGNPSSQVMGFLDTSNGLIWVQ---CSNCNSQCEPEKRGLTTKFLSSKSFTYEM 131
Query: 60 IPCDDLICRR-PPFRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
PC C F+ N C +R+ Y + SG++S+++F F + ++ G +
Sbjct: 132 EPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFL 191
Query: 116 -FGCSN---DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
FGCS + S+ GN+ G + +P SL+ QL FSYCLV + + +TS
Sbjct: 192 NFGCSEAPLTGDEQSYTGNV----GLNQTPLSLISQLGIKK---FSYCLV-PFNNLGSTS 243
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
+ FG ++ + S YY+ + IS+ + F G F + G +
Sbjct: 244 KMYFG---SLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFD-GVFDVYE-VRDGWI 298
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR--FRAYASMT 289
IDTG + ++ ++ ++ F F QR + E +E C+ + ++ +T
Sbjct: 299 IDTGITYSSLETDAFDSLLAKF-LTLKDF-PQRKDDPKERFELCFELQNANDLESFPDVT 356
Query: 290 FHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQF 348
HFD AD + + +++G FC+A+ S S++G +Q Q+ YDL I F
Sbjct: 357 VHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISF 416
Query: 349 VPENCAN 355
P +CA+
Sbjct: 417 APVDCAD 423
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 157/357 (43%), Gaps = 44/357 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPC 62
NYF V V GTP + L+FDTGS L WTQC PC +C+ Q IF+P+ S++Y I C
Sbjct: 144 NYF--VVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITC 201
Query: 63 DDLICRR--------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
+C + P C++ I Y + + G S E + + V
Sbjct: 202 TSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATD---IVDNF 258
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+FGC +N+ F G+ AG++G P S + Q + + +FSYCL +T L
Sbjct: 259 LFGCGQNNQGL-FGGS-AGLIGLGRHPISFVQQTAAVYRKIFSYCLP---ATSSSTGRLS 313
Query: 175 FGKDAN--IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
FG ++ TI SS Y L + ISV ++ + TF+ TGG +I
Sbjct: 314 FGTTTTSYVKYTPFSTISR---GSSFYGLDITGISVGGAKLPVSSSTFS-----TGGAII 365
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRAYASMTF 290
D+G + T + Y + F + G + +A E + CY S + ++
Sbjct: 366 DSGTVITRLPPTAYTALRSAFRQ-----GMSKYPSAGELSILDTCYDL-SGYEVFSIPKI 419
Query: 291 HFDRA---DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVV---GAWQQQDTRFVYDL 341
F A ++ P + ++ + C+A + + +S V G QQ+ VYD+
Sbjct: 420 DFSFAGGVTVQLPPQGILYVASAK-QVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 160/392 (40%), Gaps = 75/392 (19%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP DT S L+W QC PCV+C+ Q PIFNP SS+Y +PC
Sbjct: 88 YLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDT 147
Query: 67 CRR-PPFRC---ENGQCVHRINYAGGASASGLVSTETF-----TFHLKNKLVCVPGVIFG 117
C + RC ++ C + Y+G A +G ++ + FH V+ G
Sbjct: 148 CSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVGGNVFH---------AVVLG 198
Query: 118 CSNDNRDFSFDG---NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
CS D S G +G++G + P SLL QL F YCL +L
Sbjct: 199 CS----DSSVGGPPPQASGLVGLARGPLSLLSQLSVRR---FMYCLPPPMSRTPGKLVLG 251
Query: 175 FGKDANIQR--KDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTG-- 228
G A+ R D T+ M S+YYL+ ++V D PGT +RR +
Sbjct: 252 AGAGADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQ----TPGT--IRRPTSPPA 305
Query: 229 -------------------GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS 269
G ++D + +F++ Y+ + +E R+ A+
Sbjct: 306 TGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEI------RLPRAT 359
Query: 270 ED----WEYCYRYDSRF---RAYA-SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS 321
+ C+ R Y +++ FD ++E ++ ++ C+ I +
Sbjct: 360 PSTRLGLDLCFILPEGVGIDRVYVPTVSMSFDGRWLELERDRLF--LEDGRMMCLMIGRT 417
Query: 322 DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
S++G +QQQ+ +Y+L G I F +C
Sbjct: 418 SGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 163/373 (43%), Gaps = 33/373 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC---LPCVNCFNQSA------PIFNPNASSTY 57
Y V GTPS+ L+ DTGS L W C NC N+ A +F+ N SS++
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 58 KRIPCDDLICR---RPPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKN-KL 108
K IPC +C+ F N C + Y+ G++A G + ET T LK +
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRK 202
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ + V+ GCS + SF G++G S +S + G FSYCLV
Sbjct: 203 MKLHNVLIGCSESFQGQSFQA-ADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKN 261
Query: 169 ATSILRFGKDANIQR--KDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRN 225
++ L FG + + +M + + +S Y +++ IS+ + + ++
Sbjct: 262 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVK-- 319
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
G GG ++D+G+ TF+ Y+ VM F + M EYC+ +
Sbjct: 320 GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIG--PLEYCFNSTGFEESL 377
Query: 286 AS-MTFHF-DRADFKVEPTYMYFIFQNEGYFC---VAISFSDRNSVVGAWQQQDTRFVYD 340
+ FHF D A+F+ P Y I +G C V++++ SVVG QQ+ + +D
Sbjct: 378 VPRLVFHFADGAEFE-PPVKSYVISAADGVRCLGFVSVAWPG-TSVVGNIMQQNHLWEFD 435
Query: 341 LNTGTIQFVPENC 353
L + F P +C
Sbjct: 436 LGLKKLGFAPSSC 448
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/358 (28%), Positives = 158/358 (44%), Gaps = 27/358 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC---VNCFNQSAPIFNPNASSTYKRIP 61
YF + V G P +S F + DTGS + W QC PC C+ Q PIF+P +SS+Y +
Sbjct: 184 YFARIGV--GQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLS 241
Query: 62 CDDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
CD C C+ C++ + Y G+ G ++TETF+F N +P + GC +
Sbjct: 242 CDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS---IPNLPIGCGH 298
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN LG SL QL++T+ FSYCLV + E++S L F D
Sbjct: 299 DNEGLFVGAAGLIGLGGGAI--SLSSQLEATS---FSYCLVDL--DSESSSTLDFNAD-- 349
Query: 181 IQRKDMKTIRMFV-DR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
Q D T + DR + Y+ + +SV + + +F + +G+GG ++D+G
Sbjct: 350 -QPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTI 408
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T I Y+V+ F + ++ CY S+ +
Sbjct: 409 TEIPSDVYDVLRDAFVGLTKNLPPAP---GVSPFDTCYDLSSQSNVEVPTIAFILPGENS 465
Query: 299 VEPTYMYFIFQ--NEGYFCVAISFSD-RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ +FQ + G FC+A S S++G QQQ R YDL + F + C
Sbjct: 466 LQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 163/377 (43%), Gaps = 44/377 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY TV + G + ++ DT S L W QC PC +C +Q P+F+P++S +Y +PC+
Sbjct: 152 NYVATVGLGGGEAT----VIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCN 207
Query: 64 DLICRRPPF----------RCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKNKL 108
C C+ C + ++Y G+ + G+++ + + +
Sbjct: 208 SSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGE--- 264
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ G +FGC N+ F G +G++G S SL+ Q G+FSYCL +E +
Sbjct: 265 -VIDGFVFGCGTSNQGPPF-GGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL--PLKESD 320
Query: 169 ATSILRFGKDANIQRKDMKTI--RMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRR 224
++ L G D+++ R + M D + Y+++L I+V + +
Sbjct: 321 SSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEV---ESSGFSSG 377
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFR 283
G G +ID+G + T + Y V F F + + + + C+ R
Sbjct: 378 GGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSI---LDTCFNMTGLREV 434
Query: 284 AYASMTFHFDRADFKVE---PTYMYFIFQNEGYFCVAISFSD---RNSVVGAWQQQDTRF 337
S+ FD +VE +YF+ + C+A++ +++G +QQ++ R
Sbjct: 435 QVPSLKLVFD-GGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRV 493
Query: 338 VYDLNTGTIQFVPENCA 354
++D + + F E C
Sbjct: 494 IFDTSGSQVGFAQETCG 510
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 158/370 (42%), Gaps = 35/370 (9%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS--APIFNPNASSTYKRI 60
+++ Y + V GTP + DTGS L+W C S A +F+P+ S+TY +
Sbjct: 96 RSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLL 155
Query: 61 PCDDLICRR-PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKL----VCVPGV 114
C C+ C+ + +C ++ Y G+ G++STETF+F V VP V
Sbjct: 156 SCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRV 215
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSI 172
FGCS + SF + G++G SL+ QL + A+ FSYCLV Y ++S
Sbjct: 216 SFGCSTGSAG-SFRSD--GLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSSST 272
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
L FG A + + + YY ++L+ ++VA + A + +
Sbjct: 273 LSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASA---------NSSRII 323
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----YAS 287
+D+G TF+ ++ + R + + CY + +A
Sbjct: 324 VDSGTTLTFLDPALLRPLVAELERRIR---LPRAQPPEQLLQLCYDVQGKSQAEDFGIPD 380
Query: 288 MTFHF-DRADFKVEPTYMYFIFQNEGYFC---VAISFSDRNSVVGAWQQQDTRFVYDLNT 343
+T F A + P + + + EG C V +S S S++G QQ+ YDL+
Sbjct: 381 VTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDA 439
Query: 344 GTIQFVPENC 353
T+ F +C
Sbjct: 440 RTVTFAAVDC 449
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 149/378 (39%), Gaps = 40/378 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LP-----CVNCF-----NQSAPIFNPNASS 55
Y+V GTP + L+ DTGS L+WT C +P C NC PI+ N SS
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 56 TYKRIPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLK-NKLVCVPGV 114
T + +PC C N R Y G G + + + L +KL +P
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVSDVLGLSKLNRIPDF 193
Query: 115 IFGCS-NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV---YAYREMEAT 170
+FGCS NR GI GF S+ QL T FSYCLV +
Sbjct: 194 LFGCSLVSNR------QPEGIAGFGRGLASIPAQLGLTK---FSYCLVSHRFDDTPQSGD 244
Query: 171 SILRFGK---DA---NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
+L G+ DA + S +YY+SL I V + P +
Sbjct: 245 LVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSK 304
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA 284
G GG ++D+G+ TF++R ++ V R ++H T + R + S CY +
Sbjct: 305 EGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEV 364
Query: 285 -YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI--------SFSDRNSVVGAWQQQDT 335
+TF F P YF +G C+ + S + ++G +QQQ+
Sbjct: 365 DVPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNF 424
Query: 336 RFVYDLNTGTIQFVPENC 353
YDL F P+ C
Sbjct: 425 YIEYDLKKQRFGFKPQQC 442
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/363 (25%), Positives = 151/363 (41%), Gaps = 30/363 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP--IFNPNASSTYKRIPCDD 64
Y V VL GTP++ L+ DTGS L W +C S P +F P AS ++ +PC
Sbjct: 91 YFVKVLVGTPAQEFTLVADTGSELTWVKC-----AGGASPPGLVFRPEASKSWAPVPCSS 145
Query: 65 LICRRP-PFRCEN-----GQCVHRINYA-GGASASGLVSTETFTFHLK-NKLVCVPGVIF 116
C+ PF N C + Y G A A G+V T++ T L K+ + V+
Sbjct: 146 DTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVL 205
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GCS+ + SF ++ G+L + S + + G FSYCLV AT L FG
Sbjct: 206 GCSSTHDGQSFK-SVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFG 264
Query: 177 KDANIQRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
+ R ++F+D + +Y + + + VA + + + +GG ++D+G
Sbjct: 265 P-GQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPK---SGGVILDSG 320
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA---YASMTFHF 292
T + Y+ V+ + + +E+CY + + + F
Sbjct: 321 TTLTVLATPAYKAVVAALTKLLAGVPKVDF----PPFEHCYNWTAPRPGAPEIPKLAVQF 376
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVP 350
P Y I G C+ + + SV+G QQ+ + +DL ++F+P
Sbjct: 377 TGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMP 436
Query: 351 ENC 353
C
Sbjct: 437 STC 439
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 155/351 (44%), Gaps = 69/351 (19%)
Query: 9 VDVLFGTP-SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC 67
+++ GTP +++ L D SY +W QC P
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAP------------------------------ 119
Query: 68 RRPPFRCENGQCVHRINYAGGAS-ASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN-RDF 125
+ Y G A+ SG ++T+TFTF VPGV+FGCS+ + DF
Sbjct: 120 ---------------LTYGGSAANTSGYLATDTFTFGA----TAVPGVVFGCSDASYGDF 160
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY--AYREMEATSILRFGKDANIQR 183
+ +G++G SL+ QL+ G FSY L+ A + A S++RFG DA +
Sbjct: 161 A---GASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKT 214
Query: 184 KDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAP-GTFALRRNGTGGCMIDTGAIATF 240
K ++ + YY++L + V +R+ P GTF LR NGTGG ++ + T+
Sbjct: 215 KRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTY 274
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDSRFRA-YASMTFHFD-RADF 297
+++ Y+VV G ++ +A+ + + CY S + +T FD AD
Sbjct: 275 LEQAAYDVVRAAVASR---IGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADM 331
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+ ++I + G C+ + S SV+G Q T +YD++ G + F
Sbjct: 332 DLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRLTF 382
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 158/371 (42%), Gaps = 37/371 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP+++ +LFDTGS L W QC PC + C+ Q P+F+P+ SSTY +PC
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTP 185
Query: 66 ICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND- 121
C+ C C + + Y + G ++ E FT L GV+FGCS++
Sbjct: 186 QCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFT--LSPSAPPAAGVVFGCSHEY 243
Query: 122 ---NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILRFGK 177
+ + ++AG+LG S+L Q + G +FSYCL + L G
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLP---PRGSSAGYLTIGA 300
Query: 178 DANIQRKDMKTIRMFVDR---SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
A Q ++ + D SS Y ++L ISV+ + F + G +ID+
Sbjct: 301 AAPPQ-SNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI------GTVIDS 353
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR---YDSRFRAYASMTFH 291
G + T + Y V+ F H + E + CY +D ++ F
Sbjct: 354 GTVITHMPAAAYYVLRDEFRRHMGGY-TMLPEGHVESLDTCYDVTGHDVVTAPPVALEFG 412
Query: 292 FDRADFKVEPTYMYFIF------QNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDLNT 343
A V+ + + +F Q+ C+A ++ ++G QQ+ V+D+
Sbjct: 413 -GGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEG 471
Query: 344 GTIQFVPENCA 354
I F C+
Sbjct: 472 RRIGFGANGCS 482
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 158/359 (44%), Gaps = 32/359 (8%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + GTP + + D S +W QC C C SAP F SST + +
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREV 155
Query: 61 PCDDLICRR-PPFRC--ENGQCVHRINYAGGAS--ASGLVSTETFTFHLKNKLVCVPGVI 115
C + C+R P C ++ C + Y GGA+ +GL++ + F F V GVI
Sbjct: 156 RCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF----ATVRADGVI 211
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC+ + +G+I G++G SL+ QL+ G FSY L ++ S + F
Sbjct: 212 FGCA-----VATEGDIGGVIGLGRGELSLVSQLQI---GRFSYYLA-PDDAVDVGSFILF 262
Query: 176 GKDANIQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
DA + + + +R+S YY+ L I V + GTF L+ +G+GG ++
Sbjct: 263 LDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLS 322
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
TF+ G Y+VV + + G + + + CY +S A SM F
Sbjct: 323 ITIPVTFLDAGAYKVVRQAM---ASKIGLRAADGSELGLDLCYTSESLATAKVPSMALVF 379
Query: 293 -DRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
A ++E +++ G C+ I S + S++G+ Q T +YD++ + F
Sbjct: 380 AGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVF 438
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 156/381 (40%), Gaps = 58/381 (15%)
Query: 1 HEKNYFYTV-DVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKR 59
H Y V + GTP ++ D L+WTQC C++CF Q P+F PNASST+K
Sbjct: 17 HWSPELYNVANFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKP 76
Query: 60 IPCDDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI-FG 117
PC +C+ P +C + C G G+V+T+TF P + FG
Sbjct: 77 EPCGTDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIG-----TAAPASLGFG 131
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C + D G +G +G +P+SL+ Q+K T FSYCL A + S L G
Sbjct: 132 CVVAS-DIDTMGGPSGFIGLGRTPWSLVAQMKLTR---FSYCL--APHDTGKNSRLFLGA 185
Query: 178 DANIQRKDMKTIRMFV-----DRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
A + T FV D S YY + L++I D I T RN
Sbjct: 186 SAKLAGGGAWT--PFVKTSPNDGMSQYYPIELEEIKAGDATI-----TMPRGRN------ 232
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN---------ASEDWEYCYRYDSRF 282
T + T + R V D + F + M + E +E C+ +
Sbjct: 233 --TVLVQTAVVR-----VSLLVDSVYQEFKKAVMASVGAAPTATPVGEPFEVCFP-KAGV 284
Query: 283 RAYASMTFHFDR-ADFKVEPTYMYFIFQNEGYFCVAISFS-------DRNSVVGAWQQQD 334
+ F F A V P F N+ +S + D +++G++QQ++
Sbjct: 285 SGAPDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQEN 344
Query: 335 TRFVYDLNTGTIQFVPENCAN 355
++DL+ + F P +C++
Sbjct: 345 VHLLFDLDKDMLSFEPADCSS 365
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 156/350 (44%), Gaps = 50/350 (14%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFRCENGQCVH 81
L+ DTGS LIWTQC + +A A++ + P + R P R G
Sbjct: 55 LIVDTGSDLIWTQC----KLSSSTA------AAARHGSPP----LSRTAPAR--TGAFTR 98
Query: 82 RINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSP 141
+ A+A G++++ETFTF + + G FGC + S G GILG S
Sbjct: 99 TCTAS--AAAVGVLASETFTFGARRAVSLRLG--FGCGALSAG-SLIGA-TGILGLSPES 152
Query: 142 FSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDANIQR----KDMKTIRMFVD-- 194
SL+ QLK FSYCL +A ++ TS L FG A++ R + ++T + +
Sbjct: 153 LSLITQLKIQR---FSYCLTPFADKK---TSPLLFGAMADLSRHKTTRPIQTTAIVSNPV 206
Query: 195 RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFD 254
+ +YY+ L IS+ R+ + A+R +G GG ++D+G+ ++ +E V
Sbjct: 207 ETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV----K 262
Query: 255 EHFTSFGRQRMHNAS-EDWEYCYRYDSRFRAYA-------SMTFHFDRADFKVEPTYMYF 306
E R + N + ED+E C+ R A A + HFD V P YF
Sbjct: 263 EAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYF 322
Query: 307 IFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
G C+A+ + S++G QQQ+ ++D+ F P C
Sbjct: 323 QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 151/339 (44%), Gaps = 33/339 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTPSK++ L DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFSFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK-----GVVFD 228
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 284
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 323
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 157/395 (39%), Gaps = 79/395 (20%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP DT S L+W QC PCV+C+ Q P+FNP SS+Y +PC
Sbjct: 92 YLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDT 151
Query: 67 CRR-PPFRC---ENGQCVHRINYAGGASASGLVSTETF-----TFHLKNKLVCVPGVIFG 117
C + RC ++G C + Y+G G ++ + FH V+FG
Sbjct: 152 CAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFH---------AVVFG 202
Query: 118 CSNDNRDFSFDGNIA---GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
CS D S G A G++G P SL+ QL F YCL +L
Sbjct: 203 CS----DSSVGGPAAQASGLVGLGRGPLSLVSQLSVHR---FMYCLPPPMSRTSGKLVLG 255
Query: 175 FGKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTG---- 228
G DA D T+ M S+YYL+L ++V D PGT RN T
Sbjct: 256 AGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQ----TPGT---TRNATSPPSG 308
Query: 229 ----------------------GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMH 266
G ++D + +F++ Y+ + +E R+
Sbjct: 309 GAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEI------RLP 362
Query: 267 NASED----WEYCYRYDSRF---RAYA-SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI 318
A+ + C+ R Y +++ FD +++ ++ + C+ I
Sbjct: 363 RATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFVT--DGRMMCLMI 420
Query: 319 SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ S++G +Q Q+ R +++L G I F +C
Sbjct: 421 GRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 455
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 152/374 (40%), Gaps = 32/374 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-APIFNPNASSTYKRIPCDDL 65
Y VD+ GTP + L+ DTGS L+W +C C NC + F S+T+ C D
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDS 148
Query: 66 ICRRPPF----RCENGQ----CVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIF 116
C+ P RC + + C + +Y G+ SG S ET T + + + + G+ F
Sbjct: 149 ACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAF 208
Query: 117 GCSNDNRDFSFDG----NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
GC+ S G G++G P SL QL FSYCL+ TS
Sbjct: 209 GCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSY 268
Query: 173 LRFGK---DANIQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGT 227
L G D ++ M+ + ++ S YY+ ++ +SV ++ P +AL G
Sbjct: 269 LLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGN 328
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED---WEYCYRYDS-RFR 283
GG ++D+G TF+ Y ++ R R+ + +E ++ C
Sbjct: 329 GGTIVDSGTTLTFLPEPAYLQILTVIKR------RVRLPSPAEPTPGFDLCVNVSEIEHP 382
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISF---SDRNSVVGAWQQQDTRFVYD 340
++F P YF+ +E C+A+ SV+G QQ +D
Sbjct: 383 RLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFD 442
Query: 341 LNTGTIQFVPENCA 354
+ + F CA
Sbjct: 443 KDRTRLGFSRHGCA 456
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 139/354 (39%), Gaps = 21/354 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP+ ++FDTGS W QC PC V C+ Q +F+P SSTY + C
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C R C G C++ + Y G+ + G + +T T + V G FGC N
Sbjct: 242 ACSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYDA---VKGFRFGCGERNEG 298
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++CL T L FG +
Sbjct: 299 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP---ARSSGTGYLDFGPGSPAAVG 353
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+T M D + YY+ + I V + F+ T G ++D+G + T +
Sbjct: 354 ARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFS-----TAGTIVDSGTVITRLPP 408
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFKVEPT 302
Y + F + G ++ S + CY + A ++ F +
Sbjct: 409 AAYSSLRSAFASAMAARGYKKAPALSL-LDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNA 467
Query: 303 YMYFIFQNEGYFCVAISFS---DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ C+ + + D +VG Q + VYD+ T+ F P C
Sbjct: 468 SGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/292 (28%), Positives = 121/292 (41%), Gaps = 38/292 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + GTP+ + +L DTGS L W QC PC +C+ Q P+F+P+ SST+ IPC
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCAS 184
Query: 65 LICRRPPFR-----CENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG 113
C++ P C N QC + I Y GA G+ STET V
Sbjct: 185 DACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL---GSSAVVKS 241
Query: 114 VIFGCSNDNRDF--SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
FGC +D FD G+LG +P SL+ Q S G FSYCL
Sbjct: 242 FRFGCGSDQHGPYDKFD----GLLGLGGAPESLVSQTASVYGGAFSYCLP---PLNSGAG 294
Query: 172 ILRFGKDANIQRKD----MKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
L G + + + F + ++ Y ++L ISV + P FA
Sbjct: 295 FLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAK---- 350
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY 278
G ++D+G + T I Y+ + F + + A + CY +
Sbjct: 351 --GNIVDSGTVITGIPTTAYKALRTAFRSAMAEY--PLLPPADSALDTCYNF 398
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 147/355 (41%), Gaps = 22/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP ++FDTGS W QC PC V+C+ Q +F+P SSTY + C D
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 66 ICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ I Y G+ G + +T + G FGC NR
Sbjct: 223 ACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQD----AIKGFKFGCGEKNRG 278
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF-GKDANIQR 183
G AG+LG P S+ Q G FSYCL + AT L F +
Sbjct: 279 LF--GQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPAS---SAATGYLEFGPLSPSSSG 333
Query: 184 KDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ KT M D+ + YY+ L I V ++G P + +GT ++D+G + T +
Sbjct: 334 SNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESV-FSNSGT---LVDSGTVITRLP 389
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA---YASMTFHFDRADFKV 299
Y + F + G ++ A + CY + + S+ F
Sbjct: 390 DTAYAALSSAFAAAMAASGYKKAA-AYSILDTCYDFTGLSQVSLPTVSLVFQGGACLDLD 448
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+Y I Q++ A + D + +VG QQ+ +YD++ + F P C
Sbjct: 449 ASGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 156/381 (40%), Gaps = 58/381 (15%)
Query: 1 HEKNYFYTV-DVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKR 59
H Y V + GTP ++ D L+WTQC C++CF Q P+F PNASST+K
Sbjct: 47 HWSPELYNVANFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKP 106
Query: 60 IPCDDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI-FG 117
PC +C+ P +C + C + G G+V+T+TF P + FG
Sbjct: 107 EPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIG-----TAAPASLGFG 161
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C + D G +G +G +P+SL+ Q+K T FSYCL A + S L G
Sbjct: 162 CVVAS-DIDTMGGPSGFIGLGRTPWSLVAQMKLTR---FSYCL--APHDTGKNSRLFLGA 215
Query: 178 DANIQRKDMKTIRMFV-----DRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
A + T FV D S YY + L++I D I T RN
Sbjct: 216 SAKLAGGGAWT--PFVKTSPNDGMSQYYPIELEEIKAGDATI-----TMPRGRN------ 262
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN---------ASEDWEYCYRYDSRF 282
T + T + R V D + F + M + +E C+ +
Sbjct: 263 --TVLVQTAVVR-----VSLLVDSVYQEFKKAVMASVGAAPTATPVGAPFEVCFP-KAGV 314
Query: 283 RAYASMTFHFDR-ADFKVEPTYMYFIFQNEGYFCVAISFS-------DRNSVVGAWQQQD 334
+ F F A V P F N+ +S + D +++G++QQ++
Sbjct: 315 SGAPDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQEN 374
Query: 335 TRFVYDLNTGTIQFVPENCAN 355
++DL+ + F P +C++
Sbjct: 375 VHLLFDLDKDMLSFEPADCSS 395
>gi|326532334|dbj|BAK05096.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 437
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 147/368 (39%), Gaps = 21/368 (5%)
Query: 6 FYTVDVLFGTPSKSEF--LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
Y V V G+ F L D L W QC PCV Q +F+ S YK +
Sbjct: 67 LYGVLVGVGSGQTRHFYKLGLDLVGNLTWMQCQPCVPEVRQEGAVFDSAESPRYKHMKAT 126
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH---LKNKLVCVPGVIFGCSN 120
D +C PP+ G +A G + ++ F F V +IFGC++
Sbjct: 127 DPMCT-PPYTPSVGNRCSFYTTTWNVAAHGYLGSDMFAFAGTGAGGHSTDVDQLIFGCAH 185
Query: 121 --DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL----FSYCLVYAYREMEAT-SIL 173
D + G +AG L S P S L QL TA+GL FSYCL A L
Sbjct: 186 TTDGLERLSHGVLAGALSLSRHPMSFLSQL--TARGLADSRFSYCLFPEQSHPIAKHGFL 243
Query: 174 RFGKDANIQRKDMKTIRMFVDRSS--HYYLSLQDISVADHRI-GFAPGTFALR-RNGTGG 229
RFG+D T +F S Y++ + IS+ RI P F + GG
Sbjct: 244 RFGRDIPRHDHAHSTSLLFTGPGSGGMYHIRVVGISLNGRRIMRLQPAMFTRNLQTRRGG 303
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-RYDSRFRAYASM 288
++D G T + R Y++V + G +R + C+ + ++
Sbjct: 304 SVVDPGTPLTRLVRQAYDIVEAEVVANMQKQGARRAKAQVQGHRLCFVSWGHVHLPSLTI 363
Query: 289 TFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+ D A ++P ++ C + + +V+GA QQ DTRF +DL+ + F
Sbjct: 364 NMYEDTAKLFIKPELLFRKVTAR-LLCFTVMPDEEMTVLGAAQQMDTRFTFDLHANRLYF 422
Query: 349 VPENCAND 356
ENC D
Sbjct: 423 AQENCNAD 430
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 160/379 (42%), Gaps = 41/379 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYK 58
H Y + GTP ++ + D L+WTQC C CF Q P+F+P+AS+TY+
Sbjct: 56 HWSGAHYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYR 115
Query: 59 RIPCDDLICRRPPFR-CE-NGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVI 115
C +C+ P R C +G+C + G G+ ST+ + + +L
Sbjct: 116 AEQCGSPLCKSIPTRNCSGDGECGYEAPSMFG-DTFGIASTDAIAIGNAEGRLA------ 168
Query: 116 FGC---SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
FGC S+ + D + DG +G +G +P+SL+GQ TA FSYCL A S
Sbjct: 169 FGCVVASDGSIDGAMDGP-SGFVGLGRTPWSLVGQSNVTA---FSYCL--ALHGPGKKSA 222
Query: 173 LRFGKDANI--QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L G A + K + +S+ D G G A+ +GG
Sbjct: 223 LFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGG 282
Query: 231 MIDTGAIATF-----IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
I + TF + Y+ + + + G M N E ++ C++ ++
Sbjct: 283 AITVLQLETFRPLSYLPDAAYQALEKVVT---AALGSPSMANPPEPFDLCFQ-NAAVSGV 338
Query: 286 ASMTFHFD-RADFKVEPT-YMYFIFQNEGYFCVAI-------SFSDRNSVVGAWQQQDTR 336
+ F F A +P+ Y+ G C++I S D S++G+ Q++
Sbjct: 339 PDLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVH 398
Query: 337 FVYDLNTGTIQFVPENCAN 355
F++DL T+ F P +C++
Sbjct: 399 FLFDLEKETLSFEPADCSS 417
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 159/358 (44%), Gaps = 27/358 (7%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC---VNCFNQSAPIFNPNASSTYKRIP 61
YF + V G P +S F + DTGS + W QC PC C+ Q PIF+P +SS+Y +
Sbjct: 184 YFARIGV--GQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLS 241
Query: 62 CDDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
CD C C+ C++ + Y G+ G ++TETF+F N +P + GC +
Sbjct: 242 CDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS---IPNLPIGCGH 298
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
DN + LG SL QL++T+ FSYCLV + E++S L F D
Sbjct: 299 DNEGLFVGADGLIGLGGGAI--SLSSQLEATS---FSYCLVDL--DSESSSTLDFNAD-- 349
Query: 181 IQRKDMKTIRMFV-DR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
Q D T + DR + Y+ + +SV + + +F + +G+GG ++D+G
Sbjct: 350 -QPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTI 408
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADF 297
T I Y+V+ F + ++ CY S+ ++ F +
Sbjct: 409 TEIPSDVYDVLRDAFVGLTKNLPPAP---GVSPFDTCYDLSSQSNVEVPTIAFILPGENS 465
Query: 298 KVEPTYMYFI-FQNEGYFCVAISFSD-RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P I + G FC+A S S++G QQQ R YDL + F + C
Sbjct: 466 LQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 140/340 (41%), Gaps = 39/340 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP + + D L+WTQC PC CF Q P+F+P SST++ +PC
Sbjct: 56 LYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSH 115
Query: 66 ICRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC--SN 120
+C P C + C++ G + G T+TF + + FGC
Sbjct: 116 LCESIPESSRNCTSDVCIYEAPTKAGDTG-GKAGTDTFAIGAAKETLG-----FGCVVMT 169
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
D R G +GI+G +P+SL+ Q+ TA FSYCL +++ L G A
Sbjct: 170 DKR-LKTIGGPSGIVGLGRTPWSLVTQMNVTA---FSYCLAG-----KSSGALFLGATAK 220
Query: 181 IQRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
+ FV ++S YY+ + +A + G AP A T +
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYM----VKLAGIKTGGAPLQAASSSGST--VL 274
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH 291
+DT + A+++ G Y+ + + + G Q + + + ++ C+ A + F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTA---AVGVQPVASPPKPYDLCFPKAVAGDA-PELVFT 330
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQ 331
FD P Y + G C+ I S ++ G +
Sbjct: 331 FDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELE 370
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 152/361 (42%), Gaps = 40/361 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP ++ L DTGS LIW +C C C + + + P SS++ ++PC +
Sbjct: 81 YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140
Query: 67 CRR---------PPFRCENGQCVHRINYAGGAS----ASGLVSTETFTFHLKNKLVCVPG 113
CR R C +R +Y ++ G + +ETFT V G
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSD----AVQG 196
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
+ FGC+ G+ +G++G SL+ QLK G FSYCL + +S L
Sbjct: 197 IGFGCT--TMSEGGYGSGSGLVGLGRGKLSLVRQLK---VGAFSYCLT---SDPSTSSPL 248
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
FG A T + + S+ Y ++L IS+ + PGT G G + D
Sbjct: 249 LFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKT---PGT------GRHGIIFD 299
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD 293
+G TF+ Y + T+ R ++ +E C++ S + SM HFD
Sbjct: 300 SGTTLTFLAEPAYTLAEAGLLSQTTNLTRV---PGTDGYEVCFQ-TSGGAVFPSMVLHFD 355
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
D ++ T YF N+ C + S S+VG Q D YDL+ + F P N
Sbjct: 356 GGDMALK-TENYFGAVNDSVSCWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTN 414
Query: 353 C 353
C
Sbjct: 415 C 415
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 152/358 (42%), Gaps = 32/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP++ + DT + W C CV C S+ +F+P+ SS+ + + CD
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGC--ASSVLFDPSKSSSSRNLQCDAPQ 148
Query: 67 CRRPP-FRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C++ P C G+ C + Y GG++ ++ +T T L N + + FGC +
Sbjct: 149 CKQAPNPTCTAGKSCGFNMTY-GGSTIEASLTQDTLT--LANDV--IKSYTFGCISKATG 203
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
S G++G P SL+ Q ++ FSYCL + + + LR G R
Sbjct: 204 TSLPAQ--GLMGLGRGPLSLISQTQNLYMSTFSYCLPNS-KSSNFSGSLRLGPKYQPVRI 260
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ RSS YY++L I V + + A + G + D+G + T +
Sbjct: 261 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTRLVEP 320
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADFKVE 300
Y V F R+R+ NA+ ++ CY S Y S+TF F + +
Sbjct: 321 AYVAVRNEF--------RRRIKNANATSLGGFDTCY---SGSVVYPSVTFMFAGMNVTLP 369
Query: 301 PTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + + C+A++ + N +V+ + QQQ+ R + DL + E C
Sbjct: 370 PDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 154/391 (39%), Gaps = 49/391 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-----------IFNPNASS 55
Y V GTP++ L+ DTGS L W +C P + F P S
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 56 TYKRIPCDDLICRRP-PFR-----CENGQCVHRINYAGGASASGLVSTETFTFHL----- 104
T+ IPC C + PF C + Y G++A G V TE+ T L
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSS 214
Query: 105 --KNKL--VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL 160
KNK+ + G++ GC+ SF+ + G+L S S S G FSYCL
Sbjct: 215 SSKNKVKKAKLQGLVLGCTGSYTGPSFEAS-DGVLSLGYSNVSFASHAASRFGGRFSYCL 273
Query: 161 VYAYREMEATSILRFGKDANIQRK--------DMKTIRMFVDRSSHYY-LSLQDISVADH 211
V ATS L FG ++ + +T + R +Y +S++ ISV
Sbjct: 274 VDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDGE 333
Query: 212 RIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED 271
+ + + +G GG ++D+G T + + Y V+ + F R M +
Sbjct: 334 LLKIPRDVWEV--DGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAM----DP 387
Query: 272 WEYCYRYDSRFRA-----YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-- 324
+EYCY + S R + HF + P+ Y I G C+ +
Sbjct: 388 FEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPWPGI 447
Query: 325 SVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
SV+G QQ+ + +DL ++F C +
Sbjct: 448 SVIGNILQQEHLWEFDLKNRRLRFKRSRCTH 478
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 157/379 (41%), Gaps = 41/379 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYK 58
H Y + GTP ++ + D L+WTQC C CF Q P+F+P+AS+TY+
Sbjct: 56 HWSGACYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYR 115
Query: 59 RIPCDDLICRRPPFR-CE-NGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVI 115
C +C+ P R C +G+C + G G+ ST+ + + +L
Sbjct: 116 AEQCGSPLCKSIPTRNCSGDGECGYEAPSMFG-DTFGIASTDAIAIGNAEGRLA------ 168
Query: 116 FGC---SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
FGC S+ + D + DG +G +G +P+SL+GQ TA FSYCL A S
Sbjct: 169 FGCVVASDGSIDGAMDGP-SGFVGLGRTPWSLVGQSNVTA---FSYCL--APHGPGKKSA 222
Query: 173 LRFGKDANI--QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L G A + K + +S+ D G G A+ +GG
Sbjct: 223 LFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGG 282
Query: 231 MIDTGAIATF-----IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
I + TF + Y+ + + + G M N E ++ C++ ++
Sbjct: 283 AITILQLETFRPLSYLPDAAYQALEKVVT---AALGSPSMANPPEPFDLCFQ-NAAVSGV 338
Query: 286 ASMTFHFDRADFKVEPTYMYFI--FQNEGYFCVAI-------SFSDRNSVVGAWQQQDTR 336
+ F F P Y + G C++I S D S++G+ Q++
Sbjct: 339 PDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVH 398
Query: 337 FVYDLNTGTIQFVPENCAN 355
F++DL T+ F P +C++
Sbjct: 399 FLFDLEKETLSFEPADCSS 417
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 163/373 (43%), Gaps = 35/373 (9%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPI--FNPNASSTYK 58
+++ Y + V G+P +S + DTGS L+W +C N + +AP F+P+ SSTY
Sbjct: 96 SRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYG 155
Query: 59 RIPCDDLICRR-PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFH-----LKNKLVCV 111
R+ C C C++G C + Y G++ +G++STETFTF + V V
Sbjct: 156 RVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQVRV 215
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
GV FGCS + G+ G +VS + LG S + FSYCLV + A+S
Sbjct: 216 GGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGR-RFSYCLV--PHSVNASS 272
Query: 172 ILRFGKDANIQRKDMKTIRMFV-DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L FG A++ + + D ++Y + L + V + + A A R
Sbjct: 273 ALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASA----ASSR-----I 323
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR----FRAYA 286
++D+G TF+ ++ T + + + CY R +
Sbjct: 324 IVDSGTTLTFLDPSLLGPIVDELSRRIT---LPPVQSPDGLLQLCYNVAGREVEAGESIP 380
Query: 287 SMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLN 342
+T F A ++P + Q EG C+AI + S++G QQ+ YDL+
Sbjct: 381 DLTLEFGGGAAVALKPENAFVAVQ-EGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLD 439
Query: 343 TGTIQFVPENCAN 355
GT+ F +CA
Sbjct: 440 AGTVTFAGADCAG 452
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 152/348 (43%), Gaps = 33/348 (9%)
Query: 22 LLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDLICRR--------PPF 72
++ DTGS L W QC PC V C Q+ P+++P+ S TYK++ C + C R P
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 73 RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIA 132
++ C++ +Y + + G +S + T L P +GC DN+ G A
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTL---PQFTYGCGQDNQGLF--GRAA 115
Query: 133 GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMF 192
GI+G + S+L QL + FSYCL A L +I K M
Sbjct: 116 GIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSI---GSISPTSYKFTPML 172
Query: 193 VDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVM 250
D S Y+L L I+V+ + A A+ R T +ID+G + T + Y +
Sbjct: 173 TDSKNPSLYFLRLTAITVSGRPLDLAA---AMYRVPT---LIDSGTVITRLPMSMYAALR 226
Query: 251 RHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FRAYASMTFHFD-RADFKVEPTYMYFIF 308
+ F + ++ + A + C++ + A + F AD + + I
Sbjct: 227 QAFVKIMST--KYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSI-LIE 283
Query: 309 QNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++G C+A S +++ +++G QQQ YD++T I F P +C
Sbjct: 284 ADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 159/372 (42%), Gaps = 39/372 (10%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP----IFNPNASSTYK 58
+++ Y + V GTP + DTGS L+W C + +F P SSTY
Sbjct: 99 RSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYS 158
Query: 59 RIPCDDLICRR-PPFRCE-NGQCVHRINYAGGASASGLVSTETFTF--HLKNKLVCVPGV 114
++ C C+ C+ + +C ++ +Y G+ G++STETF+F V VP V
Sbjct: 159 QLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRV 218
Query: 115 IFGCSNDNRD-FSFDGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREMEATS 171
FGCS + F D G++G FSL+ QL +T SYCL+ +Y + ++S
Sbjct: 219 NFGCSTASAGTFRSD----GLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSY-DANSSS 273
Query: 172 ILRFGKDANIQRKDMKTIRMF-VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L FG A + + + D S+Y ++L+ ++V + T R
Sbjct: 274 TLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEV----ATHDSR------I 323
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR----AYA 286
++D+G TF+ ++ + QR+ + + CY +
Sbjct: 324 IVDSGTTLTFLDPALLGPLVTELERRIK---LQRVQPPEQLLQLCYDVQGKSETDNFGIP 380
Query: 287 SMTFHF-DRADFKVEPTYMYFIFQNEGYFC---VAISFSDRNSVVGAWQQQDTRFVYDLN 342
+T F A + P + + Q EG C V +S S S++G QQ+ YDL+
Sbjct: 381 DVTLRFGGGAAVTLRPENTFSLLQ-EGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLD 439
Query: 343 TGTIQFVPENCA 354
T+ F +CA
Sbjct: 440 ARTVTFAAADCA 451
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 162/374 (43%), Gaps = 43/374 (11%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQ---SAPIFNPNASSTYK 58
KN F+ + + GTP+ + DTGS + W QC C V+C+ Q + P FN ++SSTY+
Sbjct: 20 KNQFF-MGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYR 78
Query: 59 RIPCDDLIC------RRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC 110
R+ C +C + P C E C++ + YA G ++G +S + T L N
Sbjct: 79 RVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLT--LANSY-S 135
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKS-TAQGLFSYCLVYAYREMEA 169
+ IFGC +DNR ++G+ AGI+GF +S Q+ T FSYC
Sbjct: 136 IQKFIFGCGSDNR---YNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQENEGF 192
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSH---YYLSLQDISVADHRIGFAPGTFALRRNG 226
SI + +D+N + ++F D +H Y L D+ V R+ P + R
Sbjct: 193 LSIGPYVRDSN----KLILTQLF-DYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMT- 246
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
++D+G + TF+ + + R + + G R S+ E C+ + ++
Sbjct: 247 ----VVDSGTVETFVLSPVFRALDRALTKAMVAEGYVR---GSDSKEICFHSNGDSVDWS 299
Query: 287 SM---TFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFVY 339
+ F R+ K+ +++ ++G C D ++G + R V+
Sbjct: 300 KLPVVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATRSFRVVF 359
Query: 340 DLNTGTIQFVPENC 353
D+ F C
Sbjct: 360 DIQQRNFGFEAGAC 373
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 153/375 (40%), Gaps = 49/375 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKR--- 59
NY+ + V GTP+K ++ DTGS L W QC PCV C Q PIF P+ S TYK
Sbjct: 106 NYYVKIGV--GTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSC 163
Query: 60 -----IPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
P G CV++ +Y + + G +S + T L G
Sbjct: 164 SSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLT--LTPSAAPSSGF 221
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI-- 172
++GC DN+ G AGI+G + S+LGQL + FSYCL ++ +S+
Sbjct: 222 VYGCGQDNQG--LFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSG 279
Query: 173 -LRFGKDANIQRKDMKTIRMFVDR-SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L G + T + + S Y+L L I+VA +G + ++ +
Sbjct: 280 FLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPT------ 333
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR--------- 281
+ID+G + T + Y + + F + + + C++ +
Sbjct: 334 IIDSGTVITRLPVAIYNALKKSFVMIMSK--KYAQAPGFSILDTCFKGSVKEMSTVPEIR 391
Query: 282 --FRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFV 338
FR A + + ++E +G C+AI+ S S++G +QQQ
Sbjct: 392 IIFRGGAGLELKVHNSLVEIE----------KGTTCLAIAASSNPISIIGNYQQQTFTVA 441
Query: 339 YDLNTGTIQFVPENC 353
YD+ I F P C
Sbjct: 442 YDVANSKIGFAPGGC 456
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 142/355 (40%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P SSTY I C
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAP 239
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C R C G C++ + Y G+ + G + +T T + V G FGC N
Sbjct: 240 ACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGFRFGCGERNEG 296
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++CL T L FG +
Sbjct: 297 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPA---RSSGTGYLDFGPGSPAAAG 351
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T M D + YY+ + I V + F T G ++D+G + T +
Sbjct: 352 ARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFT-----TAGTIVDSGTVITRLPP 406
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEP 301
Y + F + G ++ S + CY + + A +++ F A V+
Sbjct: 407 AAYSSLRSAFASAMAARGYKKAPAVSL-LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + + C+ + ++ +VG Q + YD+ + F P C
Sbjct: 466 SGIMYA-ASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 155/379 (40%), Gaps = 47/379 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC--LPCVNCFNQSAPIFNPNASSTYKRIP 61
N TV + G+P ++ ++ DTGS L W C LP +N FNP SS+Y P
Sbjct: 57 NVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNS------TFNPLLSSSYTPTP 110
Query: 62 CDDLICRRP------PFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG 113
C+ IC P C+ N C ++YA +SA G ++ ETF+ + PG
Sbjct: 111 CNSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ----PG 166
Query: 114 VIFGCSND---NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
+FGC + D + D G++G + SL+ Q+ FSYC+ +A
Sbjct: 167 TLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLPK---FSYCI----SGEDAL 219
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSHYY------LSLQDISVADHRIGFAPGTFALRR 224
+L G + T + SS Y+ + L+ I V++ + F
Sbjct: 220 GVLLLGDGTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDH 279
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE----YCYRYDS 280
G G M+D+G TF+ Y + F E T R+ + + +E CY +
Sbjct: 280 TGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQ-TKGVLTRIEDPNFVFEGAMDLCYHAPA 338
Query: 281 RFRAYASMTFHFDRADFKVE-PTYMYFIFQNEGY-FCVAISFSD----RNSVVGAWQQQD 334
F A ++T F A+ +V +Y + + + +C SD V+G QQ+
Sbjct: 339 SFAAVPAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQN 398
Query: 335 TRFVYDLNTGTIQFVPENC 353
+DL + F C
Sbjct: 399 VWMEFDLLKSRVGFTQTTC 417
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 154/363 (42%), Gaps = 39/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + F++ DT + W C C C S+ F PNAS+T + C
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSGAQ 154
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + F C + C+ +Y G +S + + + T L N + +PG FGC N
Sbjct: 155 CSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDAIT--LANDV--IPGFTFGCINAV 210
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G+LG P SL+ Q + G+FSYCL +++ + L+ G Q
Sbjct: 211 SGGSIPPQ--GLLGLGRGPISLISQAGAMYSGVFSYCL-PSFKSYYFSGSLKLGPVG--Q 265
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K ++T + + R S YY++L +SV ++ N G +ID+G + T
Sbjct: 266 PKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITR 325
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ Y + F + +S G ++ C+ + A A +T HF+ +
Sbjct: 326 FVQPVYFAIRDEFRKQVNGPISSLGA---------FDTCFAATNEAEAPA-ITLHFEGLN 375
Query: 297 FKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVP 350
V P I + G C++++ + N +V+ QQQ+ R ++D +
Sbjct: 376 L-VLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIAR 434
Query: 351 ENC 353
E C
Sbjct: 435 ELC 437
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 160/368 (43%), Gaps = 52/368 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC--LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y ++ GTP + L DTGS LIW +C +C Q +P + PNASST+ ++PC D
Sbjct: 91 YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150
Query: 65 LICRRPPFRCEN--------GQCVHRINYAGGAS----ASGLVSTETFTFHLKNKLVCVP 112
+C R ++ +C +R +Y G G ++ ETFT VP
Sbjct: 151 RLCSL--LRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGAD----AVP 204
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
V FGC+ + + +G++G P SL+ QL ++ F YCL + S
Sbjct: 205 SVRFGCTTASEGGYG--SGSGLVGLGRGPLSLVSQLNAST---FMYCLT---SDASKASP 256
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG A++ +++ + ++ Y ++L+ IS+ PG G +
Sbjct: 257 LLFGSLASLTGAQVQSTGLLAS-TTFYAVNLRSISIGS---ATTPGV-----GEPEGVVF 307
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED---WEYCYRYDSRFR----AY 285
D+G T++ Y E +F Q + ED +E C++ + R A
Sbjct: 308 DSGTTLTYLAEPAYS-------EAKAAFLSQTSLDQVEDTDGFEACFQKPANGRLSNAAV 360
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGT 345
+M HFD AD + P Y + +G C + S S++G Q + ++D++
Sbjct: 361 PTMVLHFDGADMAL-PVANYVVEVEDGVVCWIVQRSPSLSIIGNIMQVNYLVLHDVHRSV 419
Query: 346 IQFVPENC 353
+ F P NC
Sbjct: 420 LSFQPANC 427
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 151/378 (39%), Gaps = 50/378 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V G+PS+ L DT + W C PC C + S +F P SS+Y +PC
Sbjct: 81 YVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSSW 138
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH----------------LKNKLVC 110
C P F+ GQ GG +A + T F L+
Sbjct: 139 C--PLFQ---GQACPAPQ-GGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDA 192
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
+P FGC + + + G+LG P +LL Q S G+FSYCL +YR +
Sbjct: 193 IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLP-SYRSYYFS 251
Query: 171 SILRFGKDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
LR G Q + ++ M + RSS YY+++ +SV + G+FA
Sbjct: 252 GSLRLGAGGG-QPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGA 310
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEH------FTSFGRQRMHNASEDWEYCYRYDSRF 282
G ++D+G + T Y + F +TS G ++ C+ D
Sbjct: 311 GTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGA---------FDTCFNTDEVA 361
Query: 283 RAYA-SMTFHFDRADFKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDT 335
A ++T H D P I + C+A++ + +N +V+ QQQ+
Sbjct: 362 AGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNI 421
Query: 336 RFVYDLNTGTIQFVPENC 353
R V+D+ I F E+C
Sbjct: 422 RVVFDVANSRIGFAKESC 439
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 109/219 (49%), Gaps = 26/219 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
NY T+++ + ++ DTGS L W QC PC++C+NQ P+F P+ SS+Y+ IPC+
Sbjct: 144 NYIVTMEL----GGQDMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCN 199
Query: 64 DLICRRPPFRCENG--------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
C+ N C + +NY G+ +G + E HL + V +
Sbjct: 200 SSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAE----HLSFGGISVSNFV 255
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC +N+ G ++G++G S SL+ Q ST G+FSYCL + A+ L
Sbjct: 256 FGCGKNNKGLF--GGVSGLMGLGRSNLSLISQTNSTFGGVFSYCL--PPTDAGASGSLAM 311
Query: 176 GKDANIQRKDMKTI---RMFVD--RSSHYYLSLQDISVA 209
G ++++ K++ I RM + S+ Y L+L I V
Sbjct: 312 GNESSV-FKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 156/360 (43%), Gaps = 34/360 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + GTP + + D S +W QC C C SAP F SST + +
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREV 155
Query: 61 PCDDLICRR-PPFRC--ENGQCVHRINYAGGAS--ASGLVSTETFTFHLKNKLVCVPGVI 115
C + C+R P C ++ C + Y GGA+ +GL++ + F F V GVI
Sbjct: 156 RCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF----ATVRADGVI 211
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC+ + +G+I G++G S + QL+ G FSY L ++ S + F
Sbjct: 212 FGCA-----VATEGDIGGVIGLGRGELSPVSQLQ---IGRFSYYLA-PDDAVDVGSFILF 262
Query: 176 GKDANIQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
DA + + + R+S YY+ L I V + GTF L+ +G+GG ++
Sbjct: 263 LDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLS 322
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-DWEYCYRYDSRFRA-YASMTFH 291
TF+ G Y+VV + R + SE + CY +S A SM
Sbjct: 323 ITIPVTFLDAGAYKVVRQAMASKI----ELRAADGSELGLDLCYTSESLATAKVPSMALV 378
Query: 292 F-DRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
F A ++E +++ G C+ I S + S++G+ Q T +YD++ + F
Sbjct: 379 FAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVF 438
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 151/378 (39%), Gaps = 50/378 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V G+PS+ L DT + W C PC C + S +F P SS+Y +PC
Sbjct: 79 YVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSSW 136
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH----------------LKNKLVC 110
C P F+ GQ GG +A + T F L+
Sbjct: 137 C--PLFQ---GQACPAPQ-GGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDA 190
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
+P FGC + + + G+LG P +LL Q S G+FSYCL +YR +
Sbjct: 191 IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLP-SYRSYYFS 249
Query: 171 SILRFGKDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
LR G Q + ++ M + RSS YY+++ +SV + G+FA
Sbjct: 250 GSLRLGAGGG-QPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGA 308
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEH------FTSFGRQRMHNASEDWEYCYRYDSRF 282
G ++D+G + T Y + F +TS G ++ C+ D
Sbjct: 309 GTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGA---------FDTCFNTDEVA 359
Query: 283 RAYA-SMTFHFDRADFKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDT 335
A ++T H D P I + C+A++ + +N +V+ QQQ+
Sbjct: 360 AGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNI 419
Query: 336 RFVYDLNTGTIQFVPENC 353
R V+D+ + F E+C
Sbjct: 420 RVVFDVANSRVGFAKESC 437
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 145/355 (40%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP+ ++FDTGS W QC PC V C+ Q +F+P SST I C
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAP 245
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C + C G C++ + Y G+ + G + +T T + + G FGC N
Sbjct: 246 ACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AIKGFRFGCGERNEG 302
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++C T L FG ++
Sbjct: 303 LF--GEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFP---ARSSGTGYLDFGPGSSPAVS 357
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T M VD + YY+ L I V + P F T G ++D+G + T +
Sbjct: 358 TKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFT-----TAGTIVDSGTVITRLPP 412
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEP 301
Y + F + G ++ S + CY + + A +++ F A V+
Sbjct: 413 AAYSSLRSAFASAIAARGYKKAPALSL-LDTCYDFTGMSQVAIPTVSLLFQGGASLDVDA 471
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + + C+ + ++ + +VG Q + VYD+ + F P C
Sbjct: 472 SGIIYA-ASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 165/375 (44%), Gaps = 34/375 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP + DTGS L W Q PC C+ Q PIF+P+ S+T+ ++PC
Sbjct: 80 YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAP 139
Query: 67 CR---RPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C C + C + +Y + +G ++++T T + N V + V FGC N
Sbjct: 140 CNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVT--VGNASVQIRNVAFGCGTRN 197
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME-------ATSILRF 175
+FD +GI+G S + QL T FSYCL+ E+ ATS + F
Sbjct: 198 GG-NFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVF 256
Query: 176 GKDANIQRKDMK------TIRMFVDRSSHYYLSLQDISVADHRIGFAPGTF--ALRRNGT 227
G + T + + S++YYL+++ I+V ++ ++ + A +G+
Sbjct: 257 GDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGS 316
Query: 228 ------GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED-WEYCYRYDS 280
G +ID+G TF++ Y + E +R+++ + C++
Sbjct: 317 KSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIK---MERVNDVKNSMFSLCFKSGK 373
Query: 281 RFRAYASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVY 339
M HF AD +++P F+ EG C + ++ + G Q + Y
Sbjct: 374 EEVELPLMKVHFRGGADVELKPVNT-FVRAEEGLVCFTMLPTNDVGIYGNLAQMNFVVGY 432
Query: 340 DLNTGTIQFVPENCA 354
DL T+ F+P +C+
Sbjct: 433 DLGKRTVSFLPADCS 447
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 157/354 (44%), Gaps = 24/354 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ ++ G P + +++ DTGS L W QC PC C+ Q PI+N S +Y + C++
Sbjct: 93 FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPP 152
Query: 67 C---RRPPFRCENGQCVHRINYAGGASASGLVSTE--TFTFHLKNKLVCVPGVIFGCSND 121
C R ++G C+++ YA GA SGL+S E FT H ++ V FGC
Sbjct: 153 CVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTA-QVGFGCGLQ 211
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME---ATSILRFGKD 178
N +F G+LG SL+ QL +A G S Y + + A L FG D
Sbjct: 212 NLNFITSNRDGGVLGLGPGLVSLVSQL--SAIGKVSKSFAYCFGNISNPNAGGFLVFG-D 268
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDI--SVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A DM + + + YY++L I V + R+ +F + +G+GG +ID+G+
Sbjct: 269 ATYLNGDMTPMVI----AEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGS 324
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHFDR 294
+ YEVV + +S D C+ + + + ++ + +
Sbjct: 325 TLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD---CFEGKIERDLPLFPTLVLYLES 381
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+ + F+ + + FC+ + + S++G QQ +F Y+L T+
Sbjct: 382 TGI-LNDRWSIFLQRYDELFCLGFTSGEGLSIIGTLAQQSYKFGYNLELSTLSI 434
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 151/357 (42%), Gaps = 28/357 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC---VNCFNQSAPIFNPNASSTYKRIPCD 63
Y V GTP ++ + DTGS L W QC PC +C++Q P+F+P SS+Y +PC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 64 DLICRR----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C C QC + ++Y G++ +G+ S++T T + V G FGC
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFGCG 256
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ F+G + G+LG SL+ Q T G+FSYCL ++ G
Sbjct: 257 HAQSGL-FNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSG 314
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+ + ++Y + L ISV ++ FA GG ++DTG + T
Sbjct: 315 AAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA------GGTVVDTGTVIT 368
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ Y + F S+G + + + CY F Y ++T F
Sbjct: 369 RLPPTAYAALRSAFRSGMASYGYPTAPS-NGILDTCY----NFAGYGTVTLPNVALTFGS 423
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
T M + C+A + S + +++G QQ+ F ++ ++ F P +C
Sbjct: 424 GATVMLGADGILSFGCLAFAPSGSDGGMAILGNVQQRS--FEVRIDGTSVGFKPSSC 478
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 159/388 (40%), Gaps = 47/388 (12%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI---FNPNASSTYKR 59
+ + Y + + GTP + DTGS L+W +C N N +AP F P+ASSTY R
Sbjct: 106 RQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGR 165
Query: 60 IPCDDLICR--RPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHL------------ 104
+ CD CR C +G C + +Y G+ ASG +STETFTF
Sbjct: 166 VGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGN 225
Query: 105 ------KNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSY 158
+ V + + FGCS + G+ G VS S LG S + FSY
Sbjct: 226 NNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQLGATTSLGRK-FSY 284
Query: 159 CLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAP 217
CL Y A+S L FG A + + + YY ++L I+VA + P
Sbjct: 285 CLA-PYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAGTK---RP 340
Query: 218 GTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY- 276
T A ++D+G T++ +++ R + + + CY
Sbjct: 341 TTAAQAH-----IIVDSGTTLTYLDSALLTPLVKDLTRRIK---LPRAESPEKILDLCYD 392
Query: 277 ----RYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVA-ISFSDRNSV--VGA 329
R + + ++P + + Q EG C+A ++ S+R SV +G
Sbjct: 393 ISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQ-EGVLCLALVATSERQSVSILGN 451
Query: 330 WQQQDTRFVYDLNTGTIQFVPENCANDH 357
QQ+ YDL GT+ F +CA H
Sbjct: 452 IAQQNLHVGYDLEKGTVTFAAADCAKSH 479
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/374 (24%), Positives = 150/374 (40%), Gaps = 46/374 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V G+P++ L DT + W C PC C S +F P S++Y +PC +
Sbjct: 77 YVVRAGLGSPAQPILLALDTSADATWAHCSPCGTC-PSSGSLFAPANSTSYAPLPCSSTM 135
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFT------------FHLKNKLVCVPGV 114
C + C + Y A T+ F HL +P
Sbjct: 136 CTV----LQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWLHLGKD--AIPNY 189
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC + + + G+LG P +LL Q+ + G+FSYCL +Y+ + LR
Sbjct: 190 AFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLP-SYKSYYFSGSLR 248
Query: 175 FGKDANIQRKDMKTIRMF--VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
G A Q + ++ M +RSS YY+++ +SV + G+FA G ++
Sbjct: 249 LG--AAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVV 306
Query: 233 DTGAIATFIQRGPYEVVMRHFDEH------FTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
D+G + T Y + F H +TS G ++ C+ D A
Sbjct: 307 DSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGA---------FDTCFNTDEVAAGVA 357
Query: 287 -SMTFHFDRADFKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVY 339
++T H D P I + C+A++ + +N +V+ QQQ+ R V+
Sbjct: 358 PAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVF 417
Query: 340 DLNTGTIQFVPENC 353
D+ + F E+C
Sbjct: 418 DVANSRVGFARESC 431
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 155/363 (42%), Gaps = 39/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + F++ DT + W +PC C S+ F PNAS+T + C
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAW---VPCSGCTGFSSTTFLPNASTTLGSLDCSGAQ 154
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + F C + C+ +Y G +S + + + T L N ++ PG FGC N
Sbjct: 155 CSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDAIT--LANDVI--PGFTFGCINAV 210
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G+LG P SL+ Q + G+FSYCL +++ + L+ G Q
Sbjct: 211 SGGSIPPQ--GLLGLGRGPISLISQAGAMYSGVFSYCL-PSFKSYYFSGSLKLGPVG--Q 265
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K ++T + + R S YY++L +SV ++ N G +ID+G + T
Sbjct: 266 PKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITR 325
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ Y + F + +S G ++ C+ + A A +T HF+ +
Sbjct: 326 FVQPVYFAIRDEFRKQVNGPISSLGA---------FDTCFAATNEAEAPA-ITLHFEGLN 375
Query: 297 FKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVP 350
V P I + G C++++ + N +V+ QQQ+ R ++D +
Sbjct: 376 L-VLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIAR 434
Query: 351 ENC 353
E C
Sbjct: 435 ELC 437
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 149/358 (41%), Gaps = 32/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+++ + DT + W C CV C S+ +F+P+ SS+ + + C+
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAPQ 145
Query: 67 CRRPP-FRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C++ P C + C + Y GG++ ++ +T T +P FGC N
Sbjct: 146 CKQAPNPSCTVSKSCGFNMTY-GGSAIEAYLTQDTLTLATD----VIPNYTFGCINKASG 200
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
S G++G P SL+ Q ++ Q FSYCL + + + LR G R
Sbjct: 201 TSLPAQ--GLMGLGRGPLSLISQSQNLYQSTFSYCLPNS-KSSNFSGSLRLGPKNQPIRI 257
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ RSS YY++L I V + + A G + D+G + T +
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADFKVE 300
Y + F R+R+ NA+ ++ CY S + S+TF F + +
Sbjct: 318 AYVAMRNEF--------RRRVKNANATSLGGFDTCY---SGSVVFPSVTFMFAGMNVTLP 366
Query: 301 PTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + C+A++ + N +V+ + QQQ+ R + D+ + E C
Sbjct: 367 PDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 151/357 (42%), Gaps = 27/357 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP+ S ++ DTGS L W QC PC V+C Q+ P+F+P AS TY + C
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSS 190
Query: 66 ICRR------PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P C + C+++ +Y + + G +S +T +F + PG +GC
Sbjct: 191 ECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFGSGS----FPGFYYGC 246
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL A L G
Sbjct: 247 GQDNEGLF--GRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLP---TSSAAAGYLSIGSY 301
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
Q +D +S Y+++L ISVA + P + R T +ID+G +
Sbjct: 302 NPGQYSYTPMASSSLD-ASLYFVTLSGISVAGAPLAVPPSEY--RSLPT---IIDSGTVI 355
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF-DRADF 297
T + Y + R S + + D C+R + + F A
Sbjct: 356 TRLPPNVYTALSRAVAAAMASAAPRAPTYSILDT--CFRGSAAGLRVPRVDMAFAGGATL 413
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ P + I ++ C+A + + +++G QQQ VYD+ I F C+
Sbjct: 414 ALSPGNV-LIDVDDSTTCLAFAPTGGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 147/355 (41%), Gaps = 26/355 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P +SSTY + C
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 242
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ + G + +T T + V G FGC N D
Sbjct: 243 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA---VKGFRFGCGERN-D 298
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F G AG+LG SL Q G+F++CL T L FG A
Sbjct: 299 GLF-GEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLP---ARSTGTGYLDFG--AGSPPA 352
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
T + + + YY+ + I V + AP FA G ++D+G + T +
Sbjct: 353 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFA-----AAGTIVDSGTVITRLPPA 407
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEPT 302
Y +R + R A + CY + + A +++ F A V+ +
Sbjct: 408 AYS-SLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 466
Query: 303 -YMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
MY + ++ C+A + ++ +VG Q + YD+ + F P C
Sbjct: 467 GIMYTVSASQ--VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 147/355 (41%), Gaps = 26/355 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P +SSTY + C
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 238
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ + G + +T T + V G FGC N D
Sbjct: 239 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA---VKGFRFGCGERN-D 294
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F G AG+LG SL Q G+F++CL T L FG A
Sbjct: 295 GLF-GEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLP---ARSTGTGYLDFG--AGSPPA 348
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
T + + + YY+ + I V + AP FA G ++D+G + T +
Sbjct: 349 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFA-----AAGTIVDSGTVITRLPPA 403
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEPT 302
Y +R + R A + CY + + A +++ F A V+ +
Sbjct: 404 AYS-SLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 462
Query: 303 -YMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
MY + ++ C+A + ++ +VG Q + YD+ + F P C
Sbjct: 463 GIMYTVSASQ--VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 157/354 (44%), Gaps = 24/354 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ ++ G P + +++ DTGS L W QC PC C+ Q PI+N S +Y + C++
Sbjct: 106 FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPP 165
Query: 67 CR---RPPFRCENGQCVHRINYAGGASASGLVSTE--TFTFHLKNKLVCVPGVIFGCSND 121
C R ++G C+++ +YA G+ SGL+S E FT H ++ V FGC
Sbjct: 166 CLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTA-QVGFGCGLQ 224
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME---ATSILRFGKD 178
N +F G+LG SL+ QL +A G S Y + + A L FG D
Sbjct: 225 NLNFVTSSRDGGVLGLGPGLVSLVSQL--SAIGKVSKSFAYCFGNLSNPNAGGFLVFG-D 281
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDIS--VADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A DM + + + YY++L I V + R+ +F + +G+GG +ID+G+
Sbjct: 282 ATYLNGDMTPMVI----AEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGS 337
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHFDR 294
+ YEVV + +S D C+ + + ++ + +
Sbjct: 338 TLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD---CFEGKIGRDLPLFPTLVLYLES 394
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+ + F+ + + FC+ + + S++G QQ +F Y+L T+
Sbjct: 395 TGI-LNDRWSIFLQRYDELFCLGFTSGEGLSIIGTLAQQSYKFGYNLELSTLSI 447
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 148/358 (41%), Gaps = 32/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP++ + DT + W C CV C S+ +F+P+ SS+ + + C+
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAPQ 145
Query: 67 CRRPP-FRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C++ P C + C + Y GG++ ++ +T T +P FGC N
Sbjct: 146 CKQAPNPSCTVSKSCGFNMTY-GGSTIEAYLTQDTLTLASD----VIPNYTFGCINKASG 200
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
S G++G P SL+ Q ++ Q FSYCL + + + LR G R
Sbjct: 201 TSLPAQ--GLMGLGRGPLSLISQSQNLYQSTFSYCLPNS-KSSNFSGSLRLGPKNQPIRI 257
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ RSS YY++L I V + + A G + D+G + T +
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADFKVE 300
Y V F R+R+ NA+ ++ CY S + S+TF F + +
Sbjct: 318 AYVAVRNEF--------RRRVKNANATSLGGFDTCY---SGSVVFPSVTFMFAGMNVTLP 366
Query: 301 PTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + C+A++ + N +V+ + QQQ+ R + D+ + E C
Sbjct: 367 PDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 148/358 (41%), Gaps = 32/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP++ + DT + W C CV C S+ +F+P+ SS+ + + C+
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAPQ 145
Query: 67 CRRPP-FRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C++ P C + C + Y GG++ ++ +T T +P FGC N
Sbjct: 146 CKQAPNPSCTVSKSCGFNMTY-GGSTIEAYLTQDTLTLASD----VIPNYTFGCINKASG 200
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
S G++G P SL+ Q ++ Q FSYCL + + + LR G R
Sbjct: 201 TSLPAQ--GLMGLGRGPLSLISQSQNLYQSTFSYCLPNS-KSSNFSGSLRLGPKNQPIRI 257
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ RSS YY++L I V + + A G + D+G + T +
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADFKVE 300
Y V F R+R+ NA+ ++ CY S + S+TF F + +
Sbjct: 318 AYVAVRNEF--------RRRVKNANATSLGGFDTCY---SGSVVFPSVTFMFAGMNVTLP 366
Query: 301 PTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P + C+A++ + N +V+ + QQQ+ R + D+ + E C
Sbjct: 367 PDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M + + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLRQRIRELLL----KRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + + S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTKSVSIIG 321
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 143/355 (40%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P SSTY + C
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAP 238
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C R C G C++ + Y G+ + G + +T T + V G FGC N
Sbjct: 239 ACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA---VKGFRFGCGERNEG 295
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++CL T L FG +
Sbjct: 296 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP---ARSSGTGYLDFGPGSPAAAG 350
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T M D + YY+ + I V + FA T G ++D+G + T +
Sbjct: 351 ARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSGTVITRLPP 405
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEP 301
Y + F + G ++ S + CY + + A +++ F A V+
Sbjct: 406 PAYSSLRSAFVSAMAARGYKKAPAVSL-LDTCYDFTGMSQVAIPTVSLLFQGGAILDVDA 464
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + + C+ + ++ +VG Q + YD+ + F P C
Sbjct: 465 SGIMYA-ASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 147/355 (41%), Gaps = 26/355 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P +SSTY + C
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 239
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ + G + +T T + V G FGC N D
Sbjct: 240 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA---VKGFRFGCGERN-D 295
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
F G AG+LG SL Q G+F++CL T L FG A
Sbjct: 296 GLF-GEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLP---PRSTGTGYLDFG--AGSPPA 349
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
T + + + YY+ + I V + AP FA G ++D+G + T +
Sbjct: 350 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFA-----AAGTIVDSGTVITRLPPA 404
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEPT 302
Y +R + R A + CY + + A +++ F A V+ +
Sbjct: 405 AYS-SLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 463
Query: 303 -YMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
MY + ++ C+A + ++ +VG Q + YD+ + F P C
Sbjct: 464 GIMYTVSASQ--VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 152/372 (40%), Gaps = 55/372 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V VL GTP++ L DT S + W C CV C + +A F+P S+++K + C
Sbjct: 99 YIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA--FSPAKSTSFKNVSCSAPQ 156
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C + Y G +S + +S +T + FGC N
Sbjct: 157 CKQVPNPACGARACSFNLTY-GSSSIAANLSQDTIRLAADP----IKAFTFGCVNK---- 207
Query: 126 SFDGNIAGILGFSVSP-----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+AG G ++ P SL+ Q +S + FSYCL ++R + + LR
Sbjct: 208 -----VAG--GGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCL-PSFRSLTFSGSLR 259
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
G + QR + RSS YY++L I V + P A + G + D+
Sbjct: 260 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 319
Query: 235 GAIATFIQRGPYEVVMRHFDEH-------FTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
G + T + + YE V F + TS G ++ CY + +
Sbjct: 320 GTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLG---------GFDTCYSGQVKV---PT 367
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLN 342
+TF F + + + C+A++ + N +V+ + QQQ+ R + D+
Sbjct: 368 ITFMFKGVNMTMPADNLMLHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVP 427
Query: 343 TGTIQFVPENCA 354
G + E C+
Sbjct: 428 NGRLGLARERCS 439
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 156/372 (41%), Gaps = 47/372 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC--FNQSAPIFNPNASSTYKRIPCDD 64
Y +++ GTP + + DTGS L+W +C C +C + IF +ASS+YK++PC+
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 65 LICRRPPF-----RCENGQCVHRINYAGGASASGLVSTETFTFHL----KNKLVCVPGVI 115
C RCE C ++ Y G+ SG V ++ +F ++ G +
Sbjct: 65 THCSGMSSAGIGPRCEE-TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFL 123
Query: 116 FGCSNDNRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC R D N G++G SL+ QL FSYCLV A S L
Sbjct: 124 FGC---GRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLF 180
Query: 175 FGKDANIQRKDMKTIRMF----VDRSSHYYLSLQDISVA-------DHRIGFAPGTFALR 223
G A ++ D+ + + +D++ YY+ LQ I+V D G
Sbjct: 181 LGSSAALRGHDVVSTPILHGDHLDQTL-YYVDLQSITVGGVPVVVYDKESGHNTSVGPFL 239
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHF------TSFGRQRMHNASEDWEYCYR 277
N T +ID+G T + YE + + +E S G N+S D Y
Sbjct: 240 ANKT---VIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLDLCFNSSGDTSY--- 293
Query: 278 YDSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTR 336
+ S+TF+F V P F + C+++ S + S++G QQQ+
Sbjct: 294 ------GFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFH 347
Query: 337 FVYDLNTGTIQF 348
+YDL I F
Sbjct: 348 ILYDLVASQISF 359
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 149/371 (40%), Gaps = 29/371 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-----IFNPNASSTYKRIP 61
Y V GTP++ L+ DTGS L W +C ++P +F P S ++ IP
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIP 169
Query: 62 CDDLICRR-PPFRCEN--------GQCVHRINYAGGASASGLVSTETFTFHLK----NKL 108
C C+ PF N C + Y +SA G+V T+ T L ++
Sbjct: 170 CSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDRK 229
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ V+ GC+ SF + G+L S S + + G FSYCLV
Sbjct: 230 AKLQEVVLGCTTSYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 288
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
ATS L FG + + + Y +++ +SVA + + +++N G
Sbjct: 289 ATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKKN--G 346
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR--AYA 286
G ++D+G T + Y+ V+ + R M + +EYCY + + R A
Sbjct: 347 GAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM----DPFEYCYNWTATRRPPAVP 402
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS--DRNSVVGAWQQQDTRFVYDLNTG 344
+ F + PT Y I G C+ + SV+G QQ+ + +DL
Sbjct: 403 RLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDLANR 462
Query: 345 TIQFVPENCAN 355
++F CA+
Sbjct: 463 WLRFQESRCAH 473
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTPSK++ L DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFSFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M + + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 150/388 (38%), Gaps = 46/388 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI---------FNPNASSTY 57
Y V GTP++ L+ DTGS L W +C + + +P F P S T+
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 58 KRIPCDDLICRRP-PFR-----CENGQCVHRINYAGGASASGLVSTETFTFHL---KNKL 108
I C C + PF C + Y G++A G V TE+ T L + +
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERK 216
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ G++ GCS+ SF+ + G+L S S S G FSYCLV
Sbjct: 217 AKLKGLVLGCSSSYTGPSFEAS-DGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRN 275
Query: 169 ATSILRFGKDANIQRK------------DMKTIRMFVDRSSH--YYLSLQDISVADHRIG 214
ATS L FG + + + + +DR Y +SL+ ISVA +
Sbjct: 276 ATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLK 335
Query: 215 FAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY 274
+ + GG ++D+G T + + Y V+ + R M + +EY
Sbjct: 336 IPRAVWDV--EAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTM----DPFEY 389
Query: 275 CYRYDSRFR-----AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVV 327
CY + S A M HF A P Y I G C+ + SV+
Sbjct: 390 CYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVI 449
Query: 328 GAWQQQDTRFVYDLNTGTIQFVPENCAN 355
G QQ+ + +D+ ++F C +
Sbjct: 450 GNILQQEHLWEFDIKNRRLKFQRSRCTH 477
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 149/357 (41%), Gaps = 23/357 (6%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+N Y V GTP+++ + DT S + W +PC C S+ +FN AS+TYK + C
Sbjct: 32 QNPTYIVRAKIGTPAQTMLMAMDTSSDVAW---IPCNGCLGCSSTLFNSPASTTYKSLGC 88
Query: 63 DDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C++ P C G C + Y GG+S + +S +T T VPG FGC
Sbjct: 89 QAAQCKQVPKPTCGGGVCSFNLTY-GGSSLAANLSQDTITLATD----AVPGYSFGCIQK 143
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
S LG SLL Q ++ Q FSYCL +++ + + LR G
Sbjct: 144 ATGGSLPAQGLLGLGRGPL--SLLSQTQNLYQSTFSYCL-PSFKSLNFSGSLRLGPVGQP 200
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+R + R S Y+++L + V + PG+F + G + D+G + T +
Sbjct: 201 KRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRL 260
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEP 301
Y V F GR + ++ CY A ++TF F + + P
Sbjct: 261 VTPAYIAVRDAFRNR---VGRNLTVTSLGGFDTCYTVP---IAAPTITFMFTGMNVTLPP 314
Query: 302 TYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ C+A++ + N +V+ QQQ+ R +YD+ + E C
Sbjct: 315 DNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 371
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 134/361 (37%), Gaps = 35/361 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP+ ++FDTGS W QC PC V C+ Q +F+P SSTY I C
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAP 220
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C + C G C++ + Y G+ + G + +T T + + G FGC N
Sbjct: 221 ACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA---IKGFRFGCGERNEG 277
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++C T L FG +
Sbjct: 278 LY--GEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFP---ARSSGTGYLDFGPGSLPAVS 332
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T M VD + YY+ L I V + F T G ++D+G + T +
Sbjct: 333 AKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFT-----TSGTIVDSGTVITRLPP 387
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-----------FRAYASMTFHF 292
Y + F G ++ S + CY + F+ AS+ H
Sbjct: 388 AAYSSLRSAFASAMAERGYKKAPALSL-LDTCYDFTGMSEVAIPTVSLLFQGGASLDVHA 446
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
+ + F D +VG Q + VYD+ + F P
Sbjct: 447 SGIIYAASVSQACLGFAGN-------KEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGA 499
Query: 353 C 353
C
Sbjct: 500 C 500
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTPSK++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 147/375 (39%), Gaps = 52/375 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + GTP+ + +L DTGS L W QC PC C+ Q P+F+P++SS+Y +PCD
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 150
Query: 65 LICRRPPFRC-----------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG 113
CR+ C + I Y A+ +G+ STET T PG
Sbjct: 151 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLK--------PG 202
Query: 114 VI-----FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
V+ FGC D++ ++ G+LG +P SL+ Q S G FSYCL
Sbjct: 203 VVVADFGFGC-GDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCLP---PTSG 257
Query: 169 ATSILRFGKDANIQRKDMKT------IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL 222
L G N + +R + Y ++L ISV + P F
Sbjct: 258 GAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF-- 315
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
+ G +ID+G + T + Y + F + + R + + CY +
Sbjct: 316 ----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNGGVLDTCYDFTGHA 370
Query: 283 RAYA---SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFV 338
S+TF P + +G A + +D ++G Q+ +
Sbjct: 371 NVTVPTISLTFSGGATIDLAAPAGVLV----DGCLAFAGAGTDNAIGIIGNVNQRTFEVL 426
Query: 339 YDLNTGTIQFVPENC 353
YD GT+ F C
Sbjct: 427 YDSGKGTVGFRAGAC 441
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 149/357 (41%), Gaps = 23/357 (6%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+N Y V GTP+++ + DT S + W +PC C S+ +FN AS+TYK + C
Sbjct: 97 QNPTYIVRAKIGTPAQTMLMAMDTSSDVAW---IPCNGCLGCSSTLFNSPASTTYKSLGC 153
Query: 63 DDLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C++ P C G C + Y GG+S + +S +T T VPG FGC
Sbjct: 154 QAAQCKQVPKPTCGGGVCSFNLTY-GGSSLAANLSQDTITLATD----AVPGYSFGCIQK 208
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
S LG SLL Q ++ Q FSYCL +++ + + LR G
Sbjct: 209 ATGGSLPAQGLLGLGRGPL--SLLSQTQNLYQSTFSYCLP-SFKSLNFSGSLRLGPVGQP 265
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+R + R S Y+++L + V + PG+F + G + D+G + T +
Sbjct: 266 KRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRL 325
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEP 301
Y V F GR + ++ CY A ++TF F + + P
Sbjct: 326 VTPAYIAVRDAFRNR---VGRNLTVTSLGGFDTCYTVP---IAAPTITFMFTGMNVTLPP 379
Query: 302 TYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ C+A++ + N +V+ QQQ+ R +YD+ + E C
Sbjct: 380 DNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 436
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 154/381 (40%), Gaps = 59/381 (15%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
+Y+ T+++ G P+K FL DTGS L W QC PC +C P++ P + K +PC
Sbjct: 56 HYYVTMNI--GDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KLVPC 110
Query: 63 DDLIC------RRPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+ IC P +C QC ++I Y AS+ G++ T++F+ L+NK P +
Sbjct: 111 ANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSNVRPSLS 170
Query: 116 FGCSNDN---RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEAT 170
FGC D ++ + G+LG SLL QLK + + +CL
Sbjct: 171 FGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL-----STSGG 225
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL---RRNGT 227
L FG D + + + M S +YY +PG+ L RR+ +
Sbjct: 226 GFLFFGDDM-VPTSRVTWVPMVRSTSGNYY---------------SPGSATLYFDRRSLS 269
Query: 228 GGCM---IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA 284
M D+G+ T+ PY+ + S + + C++ F++
Sbjct: 270 TKPMEVVFDSGSTYTYFSAQPYQATISAIKG---SLSKSLKQVSDPSLPLCWKGQKAFKS 326
Query: 285 -------YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQ 333
+ S+ F F + P Y I G C+ I S++G Q
Sbjct: 327 VSDVKKDFKSLQFIFGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQ 386
Query: 334 DTRFVYDLNTGTIQFVPENCA 354
D +YD + ++ +C+
Sbjct: 387 DQMVIYDNEKAQLGWIRGSCS 407
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 150/339 (44%), Gaps = 33/339 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTPSK++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG S+L Q T G FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK-----GVVFD 228
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 284
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 285 DDGARFDLGRHGVFVERSVQEQDVWCLAFAPTESVSIIG 323
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 157/372 (42%), Gaps = 47/372 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC--FNQSAPIFNPNASSTYKRIPCDD 64
Y +++ GTP + + DTGS L+W +C C +C + IF +ASS+YK++PC+
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 65 LICRRPPF-----RCENGQCVHRINYAGGASASGLVSTETFTFHL----KNKLVCVPGVI 115
C RCE C ++ Y G+ SG V ++ +F ++ G +
Sbjct: 65 THCSGMSSAGIGPRCEE-TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFL 123
Query: 116 FGCSNDNRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC+ R D N G++G SL+ QL FSYCLV A S L
Sbjct: 124 FGCA---RKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLF 180
Query: 175 FGKDANIQRKDMKTIRMF----VDRSSHYYLSLQDISVA-------DHRIGFAPGTFALR 223
G A ++ D+ + + +D++ YY+ LQ I++ D G
Sbjct: 181 LGSSAALRGHDVVSTPILHGDHLDQTL-YYVDLQSITIGGVPVVVYDKESGHNTSVGPFL 239
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHF------TSFGRQRMHNASEDWEYCYR 277
N T +ID+G T + YE + + +E S G N+S D Y
Sbjct: 240 ANKT---VIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLDLCFNSSGDTSY--- 293
Query: 278 YDSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTR 336
+ S+TF+F V P F + C+++ S + S++G QQQ+
Sbjct: 294 ------GFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFH 347
Query: 337 FVYDLNTGTIQF 348
+YDL I F
Sbjct: 348 ILYDLVASQISF 359
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 150/376 (39%), Gaps = 30/376 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNC-FNQSAPIFNPNASSTYKRIPCDD 64
Y V + G+P ++ L+ DTGS L W +C C NC + F S+T+ C
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFS 142
Query: 65 LICRRPPFRCEN--------GQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVI 115
+C+ P N C + Y+ G+ SG S ET T + + + + + +
Sbjct: 143 SLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSIA 202
Query: 116 FGCSNDNRDFSFDGN----IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
FGC S G+ +G++G P S QL FSYCL+ TS
Sbjct: 203 FGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPTS 262
Query: 172 ILRFGKDANIQRKDMKTIRMFV------DRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
L G D +KD K++ F + + YY+S++ + V ++ P ++L
Sbjct: 263 YLMIG-DVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDEL 321
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFR 283
G GG +ID+G TF+ Y ++ F AS ++ C R
Sbjct: 322 GNGGTVIDSGTTLTFLTEPAYREILSAFKRE-VKLPSPTPGGASTRSGFDLCVNVTGVSR 380
Query: 284 A-YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI----SFSDRNSVVGAWQQQDTRFV 338
+ ++ P YFI +EG C+AI + S R SV+G QQ
Sbjct: 381 PRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFLLE 440
Query: 339 YDLNTGTIQFVPENCA 354
+D + F CA
Sbjct: 441 FDRGKSRLGFSRRGCA 456
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSRGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 163/393 (41%), Gaps = 59/393 (15%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNC-----FNQSAPIFNPNASSTYK 58
Y++ + FGTP ++ + DTGS L+W C C C P F P SS+
Sbjct: 92 YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSN 151
Query: 59 RIPCDDLICR---RPPFRCENGQC------------VHRINYAGGASASGLVSTETFTFH 103
I C + C P + + +C + I Y G++A GL+ +ET F
Sbjct: 152 LIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTA-GLLLSETLDFP 210
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLV 161
K +PG + GCS FS GI GF SP SL QL GL FSYCLV
Sbjct: 211 HKKT---IPGFLVGCS----LFSIR-QPEGIAGFGRSPESLPSQL-----GLKKFSYCLV 257
Query: 162 -YAYREMEATS--ILRFGKDANIQRKDMKTIRMFVDRSS-----HYYLSLQDISVADHRI 213
+A+ + A+S +L G ++ + + F + +YY+ L++I + D +
Sbjct: 258 SHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHV 317
Query: 214 GFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE 273
+G GG ++D+G TF+++ YE+V + F++ +
Sbjct: 318 KVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLR 377
Query: 274 YCYRYD-SRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS------- 325
C+ + + FHF P YF F + G C+ I SD S
Sbjct: 378 PCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTI-VSDNMSGSGIGGG 436
Query: 326 ---VVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
++G +QQ++ +DL F +NC +
Sbjct: 437 PAIILGNYQQRNFHVEFDLKNERFGFKQQNCVS 469
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 144/358 (40%), Gaps = 26/358 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP K L+FDTGS L WTQC PC C+NQ P+F P+ S+TY I C
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSP 190
Query: 66 ICRR-------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C + P C++ I Y + + G + ET T + + +FGC
Sbjct: 191 DCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTD---VIENFLFGC 247
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+NR G+ AG++G S++ Q +FSYCL + +T L FG
Sbjct: 248 GQNNRGLF--GSAAGLIGLGQDKISIVKQTAQKYGQVFSYCLP---KTSSSTGYLTFGGG 302
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
I ++ Y + + + V +I + F+ T G +ID+G +
Sbjct: 303 GGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFS-----TSGAIIDSGTVI 357
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + Y + F++ + + + + +Y + F
Sbjct: 358 TRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEELDL 417
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
MY ++ C+A + + S ++G QQ+ + VYD+ G I F C
Sbjct: 418 DGIGIMYGASTSQ--VCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D + ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGIHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 149/372 (40%), Gaps = 49/372 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + GTP+ + +L DTGS L W QC PC C+ Q P+F+P++SS+Y +PCD
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 177
Query: 65 LICRRPP-----FRCENGQ---CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI- 115
CR+ C +G C + I Y A+ +G+ STET T PGV+
Sbjct: 178 DACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLK--------PGVVV 229
Query: 116 ----FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
FGC D++ ++ G+LG +P SL+ Q S G FSYCL
Sbjct: 230 ADFGFGC-GDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCLP---PTSGGAG 284
Query: 172 ILRFGKDANIQRKDMKT------IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
L G + +R + Y ++L ISV + P F
Sbjct: 285 FLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAF----- 339
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
+ G +ID+G + T + Y + F + + R + + CY +
Sbjct: 340 -SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNGAVLDTCYDFTGHTNVT 397
Query: 286 A---SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDL 341
++TF P + +G A + +D ++G Q+ +YD
Sbjct: 398 VPTIALTFSGGATIDLATPAGVLV----DGCLAFAGAGTDDTIGIIGNVNQRTFEVLYDS 453
Query: 342 NTGTIQFVPENC 353
GT+ F C
Sbjct: 454 GKGTVGFRAGAC 465
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 147/375 (39%), Gaps = 52/375 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + GTP+ + +L DTGS L W QC PC C+ Q P+F+P++SS+Y +PCD
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 230
Query: 65 LICRRPPFRC-----------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG 113
CR+ C + I Y A+ +G+ STET T PG
Sbjct: 231 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLK--------PG 282
Query: 114 VI-----FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
V+ FGC D++ ++ G+LG +P SL+ Q S G FSYCL
Sbjct: 283 VVVADFGFGC-GDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCLP---PTSG 337
Query: 169 ATSILRFGKDANIQRKDMKT------IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL 222
L G N + +R + Y ++L ISV + P F
Sbjct: 338 GAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF-- 395
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
+ G +ID+G + T + Y + F + + R + + CY +
Sbjct: 396 ----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNGGVLDTCYDFTGHA 450
Query: 283 RAYA---SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFV 338
S+TF P + +G A + +D ++G Q+ +
Sbjct: 451 NVTVPTISLTFSGGATIDLAAPAGVLV----DGCLAFAGAGTDNAIGIIGNVNQRTFEVL 506
Query: 339 YDLNTGTIQFVPENC 353
YD GT+ F C
Sbjct: 507 YDSGKGTVGFRAGAC 521
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 153/363 (42%), Gaps = 38/363 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP ++ +++ DT + W C C+ C S F+ SST+ + C
Sbjct: 95 YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGC--SSTTTFSAQNSSTFATLDCSKPE 152
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + C N C+ Y G ++ S + + + HL + +P FGC +
Sbjct: 153 CTQARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQD--SLHLGPNV--IPNFSFGCISSA 208
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G++G P SL+ Q S GLFSYCL +++ + L+ G Q
Sbjct: 209 SGSSIPPQ--GLMGLGRGPLSLISQSGSLYSGLFSYCLP-SFKSYYFSGSLKLGPVG--Q 263
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K ++T + + R S YY++L ISV + +P A N G +ID+G + T
Sbjct: 264 PKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITR 323
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
Y V F + F+ G ++ C+ ++ A A +T H D
Sbjct: 324 FVPAIYTAVRDEFRKQVGGSFSPLGA---------FDTCFATNNEVSAPA-ITLHLSGLD 373
Query: 297 FKVEPTYMYFIFQNEGYF-CVAISFSD-----RNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
K+ P I + G C+A++ + +V+ QQQ+ R ++D+N +
Sbjct: 374 LKL-PMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIAR 432
Query: 351 ENC 353
E C
Sbjct: 433 ELC 435
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 153/404 (37%), Gaps = 70/404 (17%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPC----------VNCFNQSAPIFNPNASSTYKRIPC 62
G P + + DTGS L+WTQC C CF Q+ P +N + S T + +PC
Sbjct: 84 IGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRTARAVPC 143
Query: 63 DD---LICRRPP--FRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCV 111
DD +C P C G CV +Y G A G++ T+ FTF + +
Sbjct: 144 DDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGV-ALGVLGTDAFTFPSSSSVT-- 200
Query: 112 PGVIFGCSNDNRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
+ FGC + R N A GI+G SL+ QL +T FSYCL +R+ +
Sbjct: 201 --LAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATE---FSYCLTPYFRDTVSP 255
Query: 171 SILRFGKDANIQRKDMK----------TIRMFVDR------SSHYYLSLQDISVADHRIG 214
S L G T F S+ YYL L ++ + +
Sbjct: 256 SHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVA 315
Query: 215 FAPGTFALRRNG----TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA-- 268
G F LR GG +ID+G+ T + + + + G A
Sbjct: 316 LPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKL 375
Query: 269 SEDWEYCYRYDSRFRAYAS-----MTFHFDRADFK----VEPTYMYFIFQNEGYFCVAIS 319
E C + A+ + FD V P Y+ +C+A+
Sbjct: 376 GGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVV 435
Query: 320 FS---------DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
S + +++G + QQD R +YDL G + F P NC+
Sbjct: 436 SSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 150/339 (44%), Gaps = 33/339 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG S+L Q T G FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK-----GVVFD 228
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 284
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 323
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 83/253 (32%), Positives = 115/253 (45%), Gaps = 30/253 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCD-- 63
Y V V FG+P++ ++ DTGS L W QC PCV C Q+ P+F+P+AS TYK + C
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 64 ------DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
D P + CV+ +Y + + G +S + T L PG ++G
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTL---PGFVYG 234
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C D+ D F G AGILG + S+LGQ+ S FSYCL L GK
Sbjct: 235 CGQDS-DGLF-GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL----PTRGGGGFLSIGK 288
Query: 178 DANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
A++ K M D S Y+L L I+V +G A + + +ID+G
Sbjct: 289 -ASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP------TIIDSG 341
Query: 236 AIATFIQRGPYEV 248
T I R P V
Sbjct: 342 ---TVITRLPMSV 351
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+K++ + DTGS + W C C C + + F + S+T ++ C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSSGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 149/389 (38%), Gaps = 41/389 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYK 58
H Y + L G P + + DTGS LIWTQC C CF Q ++P+ S T K
Sbjct: 78 HWNETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAK 137
Query: 59 RIPCDDLIC-RRPPFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+ C+D C RC +G+ + G + G + TE FTF + F
Sbjct: 138 PVACNDTACLLGSETRCARDGKACAVLTAYGAGAIGGFLGTEVFTFGHGQSSENNVSLAF 197
Query: 117 GCSNDNR--DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC +R S DG +GI+G SL QL FSYCL + + TS L
Sbjct: 198 GCITASRLTPGSLDG-ASGIIGLGRGKLSLPSQLGDNK---FSYCLTPYFSDAANTSTLF 253
Query: 175 FGKDANIQRKDMKTIRMFVDRS-------SHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
G A + + ++ S YYL L I+V ++ F LR
Sbjct: 254 VGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAP 313
Query: 228 ---GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRM--HNASEDWEYC---YRYD 279
GG +ID+G+ T + Y+ + DE G + +E + C
Sbjct: 314 AKWGGTLIDSGSPFTSLIDVAYQALR---DELVRQLGASVVPPPAGAEGLDLCVGGVAPG 370
Query: 280 SRFRAYASMTFHF-----DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--------V 326
+ + HF D V P + + V S NS +
Sbjct: 371 DAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTI 430
Query: 327 VGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+G + QQD +YDL G + F P +C++
Sbjct: 431 IGNYMQQDMHLLYDLGQGVLSFQPADCSS 459
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 154/361 (42%), Gaps = 38/361 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y + V GTP+ ++ + DTGS + W QC PC N C+ Q+ +F+P SSTY+ + C
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAA 186
Query: 65 LICRRPPFR-----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C + + N +C + + Y G++ +G S +T T L V G FGCS
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLT--LSGASDAVKGFQFGCS 244
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ FS + G++G SL+ Q + FSYCL + S
Sbjct: 245 HVESGFSDQTD--GLMGLGGGAQSLVSQTAAAYGNSFSYCL-----PPTSGSSGFLTLGG 297
Query: 180 NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
T RM R + Y LQDI+V ++G +P FA G ++D+G I
Sbjct: 298 GGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVFA------AGSVVDSGTI 351
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RA 295
T + Y + F + R A + C+ + + + + ++ F A
Sbjct: 352 ITRLPPTAYSALSSAFKAGMKQY---RSAPARSILDTCFDFAGQTQISIPTVALVFSGGA 408
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
++P + ++ N C+A + + + ++G QQ+ +YD+ + T+ F
Sbjct: 409 AIDLDPNGI--MYGN----CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462
Query: 353 C 353
C
Sbjct: 463 C 463
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 151/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTPSK++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSRGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 110/211 (52%), Gaps = 19/211 (9%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP++ ++++ DTGS + W QC PC C++Q+ PIFNP+ S+++ + CD
Sbjct: 156 EYFTRIGV--GTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCD 213
Query: 64 DLICRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C + + C +G C++ +Y G+ ++G +TET TF + V V GC + N
Sbjct: 214 SAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTFGTTS----VANVAIGCGHKN 269
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
LG F Q+ + FSYCLV RE +++ L+FG +
Sbjct: 270 VGLFIGAAGLLGLGAGALSFP--NQIGTQTGHTFSYCLV--DRESDSSGPLQFGP----K 321
Query: 183 RKDMKTIRMFVDRSSH----YYLSLQDISVA 209
+ +I ++++ H YYLS+ IS++
Sbjct: 322 SVPVGSIFTPLEKNPHLPTFYYLSVTAISIS 352
>gi|326531368|dbj|BAK05035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 148/367 (40%), Gaps = 41/367 (11%)
Query: 5 YFYTVDVLFGTP--SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
Y Y V V GT ++ + L DT + W C PC Q +F+P AS T+ +
Sbjct: 68 YAYGVFVSLGTGEGTRLKVLALDTEASTSWVMCKPCHPSPPQVGNLFSPGASPTFHGVHS 127
Query: 63 DDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV-----CVPGVIFG 117
+D +C P + NG H +S +G +S +TF HL+ +P V+FG
Sbjct: 128 NDPVCTVPYRKTANGCSFHF------SSITGYLSRDTF--HLRTGRAGAVRESIPRVVFG 179
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C++ + F D + G+L S P SLL QL + A G FSYCL + S+ G
Sbjct: 180 CAHSSTGFHNDNTLGGVLSLSHLPLSLLTQLGAHASGRFSYCLPKSTGHNPHGSLF-LGA 238
Query: 178 DANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D T + + S Y+L+L I+ R+ + C I+
Sbjct: 239 DVPSPPPHSHTTNLVIHPGVSGYHLNLIGITRGYKRLKIDKRVLV-----SHSCSINPAE 293
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR-YDSRFRAYASMTFHFDRA 295
T I Y VV + G R+ + R Y S +M FHF+
Sbjct: 294 TITHIAEPIYLVVEKALVARMKELGSDRVKGPPGGPLWFDRMYQSVKEQLPNMAFHFEGG 353
Query: 296 D---------FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
F+V F+ GY R +V+GA QQ +TRF +D+ G +
Sbjct: 354 AELWFTSDRLFEVHGMNARFMVAGRGY---------RRTVIGAAQQVNTRFTFDVARGKL 404
Query: 347 QFVPENC 353
FV E C
Sbjct: 405 SFVSEVC 411
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 153/366 (41%), Gaps = 40/366 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + F++ DT + +W C C C N S FN N+SSTY + C
Sbjct: 30 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNAST-SFNTNSSSTYSTVSCSTAQ 88
Query: 67 CRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C + P + C +Y G +S S + +T T +P FGC N
Sbjct: 89 CTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD----VIPNFSFGCIN 144
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
S G++G P SL+ Q S G+FSYCL ++R + L+ G
Sbjct: 145 SASGNSLPPQ--GLMGLGRGPMSLVSQTTSLYSGVFSYCL-PSFRSFYFSGSLKLGLLG- 200
Query: 181 IQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
Q K ++ + + R S YY++L +SV ++ P N G +ID+G +
Sbjct: 201 -QPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVI 259
Query: 239 TFIQRGPYEVVMRHFDEH-----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD 293
T + YE + F + F++ G ++ C+ D+ A +T H
Sbjct: 260 TRFAQPVYEAIRDEFRKQVNVSSFSTLGA---------FDTCFSADNENVA-PKITLHMT 309
Query: 294 RADFKVEPTYMYFIFQNEGYF-CVAISFSDRNS-----VVGAWQQQDTRFVYDLNTGTIQ 347
D K+ P I + G C++++ +N+ V+ QQQ+ R ++D+ I
Sbjct: 310 SLDLKL-PMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIG 368
Query: 348 FVPENC 353
PE C
Sbjct: 369 IAPEPC 374
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 153/366 (41%), Gaps = 40/366 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + F++ DT + +W C C C N S FN N+SSTY + C
Sbjct: 104 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNAST-SFNTNSSSTYSTVSCSTAQ 162
Query: 67 CRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C + P + C +Y G +S S + +T T +P FGC N
Sbjct: 163 CTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD----VIPNFSFGCIN 218
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
S G++G P SL+ Q S G+FSYCL ++R + L+ G
Sbjct: 219 SASGNSLPPQ--GLMGLGRGPMSLVSQTTSLYSGVFSYCL-PSFRSFYFSGSLKLGLLG- 274
Query: 181 IQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
Q K ++ + + R S YY++L +SV ++ P N G +ID+G +
Sbjct: 275 -QPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVI 333
Query: 239 TFIQRGPYEVVMRHFDEH-----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD 293
T + YE + F + F++ G ++ C+ D+ A +T H
Sbjct: 334 TRFAQPVYEAIRDEFRKQVNVSSFSTLGA---------FDTCFSADNENVA-PKITLHMT 383
Query: 294 RADFKVEPTYMYFIFQNEGYF-CVAISFSDRNS-----VVGAWQQQDTRFVYDLNTGTIQ 347
D K+ P I + G C++++ +N+ V+ QQQ+ R ++D+ I
Sbjct: 384 SLDLKL-PMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIG 442
Query: 348 FVPENC 353
PE C
Sbjct: 443 IAPEPC 448
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 151/372 (40%), Gaps = 55/372 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V L GTP++ L DT S + W C CV C + +A F+P S+++K + C
Sbjct: 115 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA--FSPAKSTSFKNVSCSAPQ 172
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C + Y G +S + +S +T + FGC N
Sbjct: 173 CKQVPNPTCGARACSFNLTY-GSSSIAANLSQDTIRLAADP----IKAFTFGCVNK---- 223
Query: 126 SFDGNIAGILGFSVSP-----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+AG G ++ P SL+ Q +S + FSYCL ++R + + LR
Sbjct: 224 -----VAG--GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCL-PSFRSLTFSGSLR 275
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
G + QR + RSS YY++L I V + P A + G + D+
Sbjct: 276 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 335
Query: 235 GAIATFIQRGPYEVVMRHFDEH-------FTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
G + T + + YE V F + TS G ++ CY + +
Sbjct: 336 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLG---------GFDTCYSGQVKV---PT 383
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLN 342
+TF F + + + C+A++ + N +V+ + QQQ+ R + D+
Sbjct: 384 ITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVP 443
Query: 343 TGTIQFVPENCA 354
G + E C+
Sbjct: 444 NGRLGLARERCS 455
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 143/372 (38%), Gaps = 56/372 (15%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V GTP+ + L+ DTGS L W QC PC C+ Q P+F+PN SS+Y +PCD
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDS 188
Query: 65 LICRRPPFRCE--------NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI- 115
CR + + C + I+Y GA+ +G ST+ T PG I
Sbjct: 189 QECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLG--------PGAIV 240
Query: 116 ----FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKS-TAQGLFSYCLVYAYREMEAT 170
FGC + + FD G+LG P SL Q + G+FS+CL +T
Sbjct: 241 KRFHFGCGHHQQRGKFD-MADGVLGLGRLPQSLAWQASARRGGGVFSHCLP---PTGVST 296
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
L G + + D+ Y L ISVA + P F R G
Sbjct: 297 GFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVF---REGV--- 350
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR---YDSRFRAYAS 287
+ D+G + + +Q Y + F + + + C+ YD+ S
Sbjct: 351 ITDSGTVLSALQETAYTALRTAFRSAMAEY---PLAPPVGHLDTCFNFTGYDNVTVPTVS 407
Query: 288 MTF------HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDL 341
+TF H D + + + F + Y ++G+ Q+ +YD+
Sbjct: 408 LTFRGGATVHLDASSGVLMDGCLAFWSSGDEY----------TGLIGSVSQRTIEVLYDM 457
Query: 342 NTGTIQFVPENC 353
+ F C
Sbjct: 458 PGRKVGFRTGAC 469
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 151/372 (40%), Gaps = 55/372 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V L GTP++ L DT S + W C CV C + +A F+P S+++K + C
Sbjct: 99 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA--FSPAKSTSFKNVSCSAPQ 156
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C + Y G +S + +S +T + FGC N
Sbjct: 157 CKQVPNPTCGARACSFNLTY-GSSSIAANLSQDTIRLAADP----IKAFTFGCVNK---- 207
Query: 126 SFDGNIAGILGFSVSP-----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
+AG G ++ P SL+ Q +S + FSYCL ++R + + LR
Sbjct: 208 -----VAG--GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCL-PSFRSLTFSGSLR 259
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
G + QR + RSS YY++L I V + P A + G + D+
Sbjct: 260 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 319
Query: 235 GAIATFIQRGPYEVVMRHFDEH-------FTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
G + T + + YE V F + TS G ++ CY + +
Sbjct: 320 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLG---------GFDTCYSGQVKV---PT 367
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLN 342
+TF F + + + C+A++ + N +V+ + QQQ+ R + D+
Sbjct: 368 ITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVP 427
Query: 343 TGTIQFVPENCA 354
G + E C+
Sbjct: 428 NGRLGLARERCS 439
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 166/386 (43%), Gaps = 49/386 (12%)
Query: 7 YTVDVLFGTPSKSEFLL-FDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + + GTP L DTGS L+WTQC C CF Q P F+ AS T +PC D
Sbjct: 100 YLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSDP 158
Query: 66 IC---RRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFH--------LKNKLVCVP 112
IC + P C + C + +YA + SG + +TFTF + V VP
Sbjct: 159 ICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVP 218
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
V FGC N+ F N +GI GFS P SL QLK FS+C A + + +
Sbjct: 219 NVRFGCGQYNKGI-FKSNESGIAGFSRGPMSLPSQLK---VARFSHCFT-AIADARTSPV 273
Query: 173 LRFGKDA--NI---QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA--LRRN 225
G N+ +++ S YYL+L+ I+V R+ FA +
Sbjct: 274 FLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTGS 333
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDS---- 280
G+GG +ID+G T I+ P + R F + + + N S D E +++
Sbjct: 334 GSGGTIIDSG---TGIRTLPGP-MYRSLRAAFVARVKLPVANESAADAESTLCFEAARSA 389
Query: 281 ------RFRAYASMTFHFDRADFKV-EPTYMYFIFQNE-----GYFCVAISFSDRN-SVV 327
A + H AD+ + +Y+ + ++E G V S D + +++
Sbjct: 390 SLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTII 449
Query: 328 GAWQQQDTRFVYDLNTGTIQFVPENC 353
G +QQQ+ YDL + FVP C
Sbjct: 450 GNFQQQNMHVAYDLEKNKLVFVPARC 475
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 155/354 (43%), Gaps = 32/354 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD--- 63
Y + + GTP + L DT S L+W QC PC C+ Q P+F+P C+
Sbjct: 31 YLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKE-------CNSFF 83
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
D C C + YA ++ G+++ E TF + V +IFGC ++N
Sbjct: 84 DHSCS------PEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIFGCGHNNT 137
Query: 124 DF--SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
D + G+ G +S S +G L + + FS CLV + + + + G+ +++
Sbjct: 138 GVFNENDMGLIGLGGGPLSLVSQMGNLYGSKR--FSQCLVPFHADPHTSGTISLGEASDV 195
Query: 182 QRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + T + + + Y ++L+ ISV D + F + G MID+G T+
Sbjct: 196 SGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSK----GNIMIDSGTPETY 251
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDSRFRAYASMTFHFDRADFKV 299
+ P E R +E +H + + CY+ ++ +T HF+ AD K+
Sbjct: 252 L---PQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLEG-PILTAHFEGADVKL 307
Query: 300 EPTYMYFIFQNEGYFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
P FI +G FC A++ +D + G + Q + +DL+ + F P +
Sbjct: 308 LP-LQTFIPPKDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKPTD 360
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 153/371 (41%), Gaps = 43/371 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P SS+YK + C+
Sbjct: 77 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN 136
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
P C E CV+ YA +S+SG++S + +F +++L V FGC N
Sbjct: 137 ------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAV-FGCENV 189
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
F GI+G S++ QL K + +FS C Y ME G A
Sbjct: 190 ETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC----YGGMEV------GGGA 239
Query: 180 NIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ K M RS +Y + L+ + VA + P F NG G ++D+
Sbjct: 240 MVLGKISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDS 295
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYRYDSR-------FRAYA 286
G + + + + + S +R+H ++ + C+ R F
Sbjct: 296 GTTYAYFPKEAFIAIKDAIIKEIPSL--KRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 353
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDLNTG 344
M F + Y++ + G +C+ I F DR+S ++G ++T YD
Sbjct: 354 DMEFGNGQKLILSPENYLFRHTKVRGAYCLGI-FPDRDSTTLLGGIVVRNTLVTYDREND 412
Query: 345 TIQFVPENCAN 355
+ F+ NC++
Sbjct: 413 KLGFLKTNCSD 423
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 152/365 (41%), Gaps = 39/365 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + F++ DT + +W C C C N S FN N+SSTY + C
Sbjct: 105 YVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNAST-SFNTNSSSTYSTVSCSTTQ 163
Query: 67 CRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C + P + C +Y G +S S + +T T +P FGC N
Sbjct: 164 CTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPD----VIPNFSFGCIN 219
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
S G++G P SL+ Q S G+FSYCL ++R + L+ G
Sbjct: 220 SASGNSLPPQ--GLMGLGRGPMSLVSQTTSLYSGVFSYCL-PSFRSFYFSGSLKLGLLG- 275
Query: 181 IQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
Q K ++ + + R S YY++L +SV ++ P N G +ID+G +
Sbjct: 276 -QPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTVI 334
Query: 239 TFIQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDR 294
T + YE + F + F++ G ++ C+ D+ +T H
Sbjct: 335 TRFAQPVYEAIRDEFRKQVNGSFSTLGA---------FDTCFSADNE-NVTPKITLHMTS 384
Query: 295 ADFKVEPTYMYFIFQNEGYF-CVAISFSDRNS-----VVGAWQQQDTRFVYDLNTGTIQF 348
D K+ P I + G C++++ +N+ V+ QQQ+ R ++D+ I
Sbjct: 385 LDLKL-PMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGI 443
Query: 349 VPENC 353
PE C
Sbjct: 444 APEPC 448
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 147/367 (40%), Gaps = 33/367 (8%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC- 67
VD+ GTP + + ++ DTGS L W QC F+P+ SST+ +PC +C
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158
Query: 68 -RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
R P P C +N C + YA G A G + E FTF ++ + P +I GC+ +
Sbjct: 159 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF---SRSLFTPPLILGCATE 215
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ D GILG + S Q K T FSYC+ T F N
Sbjct: 216 STD------PRGILGMNRGRLSFASQSKITK---FSYCVPTRVTRPGYTPTGSFYLGHNP 266
Query: 182 QRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
+ I M S Y ++LQ I + ++ +P F G+G M+
Sbjct: 267 NSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTML 326
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHF-TSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH 291
D+G+ T++ Y+ V + ++ D + R M F
Sbjct: 327 DSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFE 386
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDTRFVYDLNTGTIQ 347
F++ V P G C+ I+ SD+ ++++G + QQ+ +DL +
Sbjct: 387 FEKGVQIVVPKERVLATVEGGVHCIGIANSDKLGAASNIIGNFHQQNLWVEFDLVNRRMG 446
Query: 348 FVPENCA 354
F +C+
Sbjct: 447 FGTADCS 453
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 151/366 (41%), Gaps = 24/366 (6%)
Query: 7 YTVDVLFGTPSKSEFLLF-DTGSYLIWTQC----LPCVNCFNQSAPIFNPNASSTYKRIP 61
Y V + GTP +F+L DTGS L W C C +F N SS+++ IP
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIP 178
Query: 62 CDDLICRRPP------FRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVP 112
C C+ C N C+ Y G A G+ + ET T L + K + +
Sbjct: 179 CSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLF 238
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
V+ GC+ + +G G++G SL +L FSYCLV +
Sbjct: 239 DVLIGCTESFNET--NGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSNHKNF 296
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
L FG ++ M+ + + + +Y +++ ISV + + + + G GG +
Sbjct: 297 LSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNV--TGVGGMI 354
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS-MTF 290
+D+G T + Y+ V+ F + E +C+ RA +
Sbjct: 355 VDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFEDKGFDRAAVPRLLI 414
Query: 291 HF-DRADFKVEPTYMYFIFQNEGYFCVAISFSD--RNSVVGAWQQQDTRFVYDLNTGTIQ 347
HF D A FK P Y I EG C+ I +D +S++G QQ+ + YDL G +
Sbjct: 415 HFADGAIFK-PPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLWEYDLGRGKLG 473
Query: 348 FVPENC 353
F P +C
Sbjct: 474 FGPSSC 479
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 96/217 (44%), Gaps = 17/217 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP DT S LIWTQC PC C++Q P+FNP SSTY +PC
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 67 CRRPPF-RC---ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C RC ++ C + Y+G A+ G ++ + GV FGCS +
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGED----AFRGVAFGCSTSS 204
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ +G++G P SL+ QL F+YCL + +L G DA+
Sbjct: 205 TGGAPPPQASGVVGLGRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVL--GADADAA 259
Query: 183 RKDMKTIRMFVDR----SSHYYLSLQDISVADHRIGF 215
R I + + R S+YYL+L + + D +
Sbjct: 260 RNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSL 296
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFSFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M + + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----KRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 155/381 (40%), Gaps = 47/381 (12%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC--LPCVNCFNQSAPIFNPNASSTYKR 59
+ N T+ + G+P ++ ++ DTGS L W C LP +N FNP SS+Y
Sbjct: 54 QHNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNS------TFNPLLSSSYTP 107
Query: 60 IPCDDLICRRP------PFRCE--NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCV 111
PC+ +C P C+ N C ++YA +SA G ++ ETF+ +
Sbjct: 108 TPCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ---- 163
Query: 112 PGVIFGCSND---NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
PG +FGC + D + D G++G + SL+ Q+ FSYC+ +
Sbjct: 164 PGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCI----SGED 216
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYY------LSLQDISVADHRIGFAPGTFAL 222
A +L G + T + SS Y+ + L+ I V++ + F
Sbjct: 217 AFGVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVP 276
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE----YCYRY 278
G G M+D+G TF+ Y + F E T R+ + + +E CY
Sbjct: 277 DHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQ-TKGVLTRIEDPNFVFEGAMDLCYHA 335
Query: 279 DSRFRAYASMTFHFDRADFKVE-PTYMYFIFQNEGY-FCVAISFSD----RNSVVGAWQQ 332
+ A ++T F A+ +V +Y + + + +C SD V+G Q
Sbjct: 336 PASLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQ 395
Query: 333 QDTRFVYDLNTGTIQFVPENC 353
Q+ +DL + F C
Sbjct: 396 QNVWMEFDLVKSRVGFTETTC 416
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 153/361 (42%), Gaps = 38/361 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y + V GTP+ ++ + DTGS + W QC PC N C Q+ +F+P SSTY+ + C
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAA 186
Query: 65 LICRRPPFR-----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C + + N +C + + Y G++ +G S +T T L V G FGCS
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLT--LSGASDAVKGFQFGCS 244
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ FS + G++G SL+ Q + FSYCL + S
Sbjct: 245 HLESGFSDQTD--GLMGLGGGAQSLVSQTAAAYGNSFSYCL-----PPTSGSSGFLTLGG 297
Query: 180 NIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
T RM + + Y LQDI+V ++G +P FA G ++D+G I
Sbjct: 298 GGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVFA------AGSVVDSGTI 351
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RA 295
T + Y + F + R A + C+ + + + + ++ F A
Sbjct: 352 ITRLPPTAYSALSSAFKAGMKQY---RSAPARSILDTCFDFAGQTQISIPTVALVFSGGA 408
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
++P + ++ N C+A + + + ++G QQ+ +YD+ + T+ F
Sbjct: 409 AIDLDPNGI--MYGN----CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462
Query: 353 C 353
C
Sbjct: 463 C 463
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 147/359 (40%), Gaps = 43/359 (11%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
GT + S+ ++ D+GS + W QC PC + C Q P+F+P S+TY +PC C R
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 71 PFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSF 127
P+R N QC I YA GA+A+G S++ T + V G +FGC++ ++ +F
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD---VVRGFLFGCAHADQGSTF 191
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYC----------LVYAYREMEATSILRFGK 177
++AG L S + Q S +FSYC +++ A + F
Sbjct: 192 SYDVAGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFVS 251
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
+ M + Y + L+ I VA + P F + +ID+ +
Sbjct: 252 TPLLSSSTMS--------PTFYRVLLRSIIVAGRPLPVPPTVF------SASSVIDSATV 297
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD-RA 295
+ I Y+ + F T + R + CY + R S+ FD A
Sbjct: 298 ISRIPPTAYQALRAAFRSAMTMY---RPAPPVSILDTCYDFSGVRSITLPSIALVFDGGA 354
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++ + +G A + SDR +G QQ+ VYD+ I+F C
Sbjct: 355 TVNLDAAGILL----QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 151/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTPSK++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 151/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q T G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGRHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 153/381 (40%), Gaps = 59/381 (15%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
+Y+ T+++ G P+K FL DTGS L W QC PC +C P++ P + K +PC
Sbjct: 56 HYYVTMNI--GDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KLVPC 110
Query: 63 DDLIC------RRPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+ IC P +C QC ++I Y AS+ G++ ++F+ L+NK P +
Sbjct: 111 ANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKSNVRPSLS 170
Query: 116 FGCSNDN---RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEAT 170
FGC D ++ + G+LG SLL QLK + + +CL
Sbjct: 171 FGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL-----STSGG 225
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL---RRNGT 227
L FG D + + + M S +YY +PG+ L RR+ +
Sbjct: 226 GFLFFGDDM-VPTSRVTWVSMVRSTSGNYY---------------SPGSATLYFDRRSLS 269
Query: 228 GGCM---IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA 284
M D+G+ T+ PY+ + S + + C++ F++
Sbjct: 270 TKPMEVVFDSGSTYTYFSAQPYQATISAIKG---SLSKSLKQVSDPSLPLCWKGQKAFKS 326
Query: 285 -------YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQ 333
+ S+ F F + P Y I G C+ I S++G Q
Sbjct: 327 VSDVKKDFKSLQFIFGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDITMQ 386
Query: 334 DTRFVYDLNTGTIQFVPENCA 354
D +YD + ++ +C+
Sbjct: 387 DQMVIYDNEKAQLGWIRGSCS 407
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 120/273 (43%), Gaps = 18/273 (6%)
Query: 76 NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGIL 135
N CV+ Y + +GL+ + FTF VPGV FGC N F N GI
Sbjct: 59 NQTCVYTYYYNDKSVTTGLIEVDKFTFGAG---ASVPGVAFGCGLFNNGV-FKSNETGIA 114
Query: 136 GFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDR 195
GF P SL QLK G FS+C ++T +L D + +
Sbjct: 115 GFGRGPLSLPSQLKV---GNFSHCFTAVNGLKQSTVLLDLPADLYKNGRGAVQSTPLIQN 171
Query: 196 SSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRH 252
S++ YYLSL+ I+V R+ FAL NGTGG +ID+G T + Y+VV
Sbjct: 172 SANPTFYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVYQVVR-- 228
Query: 253 FDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKV-EPTYMYFIFQN 310
DE + + C+ S+ + + HF+ A + Y++ + +
Sbjct: 229 -DEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDD 287
Query: 311 EG--YFCVAISFSDRNSVVGAWQQQDTRFVYDL 341
G C+AI+ D +++G +QQQ+ +YDL
Sbjct: 288 AGNSIICLAINKGDETTIIGNFQQQNMHVLYDL 320
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/374 (22%), Positives = 154/374 (41%), Gaps = 63/374 (16%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP+ ++ DTGS L W QC PC V+C QS P+FNP +SSTY + C
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQ 181
Query: 66 ICRRPPFRCENGQ-------CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P N C+++ +Y + + G +S +T +F + +P +GC
Sbjct: 182 QCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----LPNFYYGC 237
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV----------------- 161
DN G AG++G + + SLL QL + F+YCL
Sbjct: 238 GQDNEGLF--GRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNPGQ 295
Query: 162 YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
Y+Y M ++S+ S Y++ L ++VA + + + ++
Sbjct: 296 YSYTPMVSSSL----------------------DDSLYFIKLSGMTVAGNPLSVSSSAYS 333
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
+ID+G + T + Y + + R ++ + C++ +
Sbjct: 334 SLPT-----IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQAS 385
Query: 282 FRAYASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYD 340
+ ++T F A K+ + + ++ C+A + + +++G QQQ VYD
Sbjct: 386 RVSAPAVTMSFAGGAALKLSAQNL-LVDVDDSTTCLAFAPARSAAIIGNTQQQTFSVVYD 444
Query: 341 LNTGTIQFVPENCA 354
+ + I F C+
Sbjct: 445 VKSSRIGFAAGGCS 458
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 151/376 (40%), Gaps = 55/376 (14%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI--FNPNASSTYKRIPCDDLI 66
+++ GTP +++ ++ DTGS L W QC + P F+P+ SST+ +PC +
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWIQC------HKKQPPTASFDPSLSSTFSILPCTHPL 130
Query: 67 C--RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C R P P C +N C + YA G A G + E FTF ++ V P +I GC+
Sbjct: 131 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF---SRSVSTPPLILGCA 187
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
++ D GILG ++ S Q K T FSYC+ T F
Sbjct: 188 TESTD------PRGILGMNLGRLSFAKQSKITK---FSYCVPPRQTRPGFTPTGSFYLGN 238
Query: 180 NIQRKDMKTIRMFVDRSSH--------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
N K K + M Y + + I +A ++ +P F G+G M
Sbjct: 239 NPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTM 298
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF--------- 282
ID+G+ T++ V +D+ R + + Y D F
Sbjct: 299 IDSGSEFTYL-------VSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAVEIG 351
Query: 283 RAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDTRFV 338
R M F F+R V P G CV I SD+ ++++G + QQ+
Sbjct: 352 RLIGEMVFEFERGVEVVIPKERVLADVGGGVHCVGIGSSDKLGAASNIIGNFHQQNLWVE 411
Query: 339 YDLNTGTIQFVPENCA 354
+DL + F +C+
Sbjct: 412 FDLVRRRVGFGKADCS 427
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 157/379 (41%), Gaps = 51/379 (13%)
Query: 1 HEKNYF-----YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASS 55
H N F + VDV FGTP + L+ DTGS + WTQC CV+C S F+ ASS
Sbjct: 116 HNNNLFDEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASS 175
Query: 56 TYKRIPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
TY C P N + + Y +++ G +T T +
Sbjct: 176 TYSFGSC-------IPSTVGN---TYNMTYGDKSTSVGNYGCDTMTLEPSD---VFQKFQ 222
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC +N + F G+LG S + Q S + +FSYCL E + L F
Sbjct: 223 FGCGRNN-EGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCL----PEENSIGSLLF 277
Query: 176 GKDANIQRKDMKTIRMF-------VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
G+ A Q +K + ++ S +Y++ L DISV + R+ FA +
Sbjct: 278 GEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFA-----SP 332
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSF----GRQRMHNASEDWEYCYRYDSRFRA 284
G +ID+G + T + + Y + F + + GR++ ++ + CY R
Sbjct: 333 GTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDM---LDTCYNLSGRKDV 389
Query: 285 -YASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRN------SVVGAWQQQDTR 336
HF D AD ++ + + + C+A + + ++ +++G QQ
Sbjct: 390 LLPEXVLHFGDGADVRLNGKRVVW-GNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLT 448
Query: 337 FVYDLNTGTIQFVPENCAN 355
+YD+ I F C+N
Sbjct: 449 VLYDIRGRRIGFGGNGCSN 467
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 152/365 (41%), Gaps = 28/365 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++ GTP+K ++ DTGS L W C + +F + S ++K + C
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKSFKTVGCLTQT 164
Query: 67 CRRPPFR--------CENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFG 117
C+ + C + YA G++A G+ + ET T L N ++ +PG + G
Sbjct: 165 CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIG 224
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
CS+ SF G G+LG + S FS S FSYCLV ++ L FG
Sbjct: 225 CSSSFTGQSFQG-ADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 283
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGF----APGTFALRRNGTGGCMID 233
+ + +T + + R +Y I+V +G+ P +G GG ++D
Sbjct: 284 SRSTKTAFRRTTPLDLTRIPPFYA----INVIGISLGYDMLDIPSQVWDATSG-GGTILD 338
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA--YASMTFH 291
+G T + Y+ V+ + +R+ EYC+ + S F +TFH
Sbjct: 339 SGTSLTLLADAAYKQVVTGLARYLVEL--KRVKPEGVPIEYCFSFTSGFNVSKLPQLTFH 396
Query: 292 FDRADFKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+ + EP Y + G C+ + + +V+G QQ+ + +DL T+ F
Sbjct: 397 L-KGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSF 455
Query: 349 VPENC 353
P C
Sbjct: 456 APSAC 460
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D + ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSKGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 152/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFSFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYRE----MEATSILRF 175
D+ + GN+ G+LG P S+L Q T FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M + + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----KRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 DRA---DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D A D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDAARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 152/365 (41%), Gaps = 28/365 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y ++ GTP+K ++ DTGS L W C + +F + S ++K + C
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKSFKTVGCLTQT 142
Query: 67 CRRPPFR--------CENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFG 117
C+ + C + YA G++A G+ + ET T L N ++ +PG + G
Sbjct: 143 CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIG 202
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
CS+ SF G G+LG + S FS S FSYCLV ++ L FG
Sbjct: 203 CSSSFTGQSFQG-ADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 261
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGF----APGTFALRRNGTGGCMID 233
+ + +T + + R +Y I+V +G+ P +G GG ++D
Sbjct: 262 SRSTKTAFRRTTPLDLTRIPPFYA----INVIGISLGYDMLDIPSQVWDATSG-GGTILD 316
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA--YASMTFH 291
+G T + Y+ V+ + +R+ EYC+ + S F +TFH
Sbjct: 317 SGTSLTLLADAAYKQVVTGLARYLVEL--KRVKPEGVPIEYCFSFTSGFNVSKLPQLTFH 374
Query: 292 FDRADFKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
+ + EP Y + G C+ + + +V+G QQ+ + +DL T+ F
Sbjct: 375 L-KGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSF 433
Query: 349 VPENC 353
P C
Sbjct: 434 APSAC 438
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 123/286 (43%), Gaps = 19/286 (6%)
Query: 76 NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGIL 135
N CV+ Y + +GL+ + FTF VPGV FGC N F N GI
Sbjct: 211 NQTCVYTYYYNDKSVTTGLLEVDKFTFGAG---ASVPGVAFGCGLFNNGV-FKSNETGIA 266
Query: 136 GFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDR 195
GF P SL QLK G FS+C ++T +L D + +
Sbjct: 267 GFGRGPLSLPSQLKV---GNFSHCFTAVNGLKQSTVLLDLLADLYKNGRGAVQSTPLIQN 323
Query: 196 SSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRH 252
S++ YYLSL+ I+V R+ FAL NGTGG +ID+G T + Y+VV
Sbjct: 324 SANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVYQVVR-- 380
Query: 253 FDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE-PTYMYFIFQN 310
DE + + C+ S+ + + HF+ A + Y++ + +
Sbjct: 381 -DEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDD 439
Query: 311 EG--YFCVAIS-FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
G C+AI+ D + +G +QQQ+ +YDL + FV C
Sbjct: 440 AGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNMLSFVAAQC 485
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 60/133 (45%), Gaps = 8/133 (6%)
Query: 206 ISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRM 265
I+V R+ FAL NGTGG +ID+G T + Y+VV DE +
Sbjct: 42 ITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVYQVVR---DEFAAQIKLPVV 97
Query: 266 HNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKV-EPTYMYFIFQNEG--YFCVAISFS 321
+ C+ S+ + + HF+ A + Y++ + + G C+AI+
Sbjct: 98 PGNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG 157
Query: 322 DRNSVVGAWQQQD 334
D +++G +QQQ+
Sbjct: 158 DETTIIGNFQQQN 170
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 159/384 (41%), Gaps = 56/384 (14%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V G+P+K ++ DTGS ++W C C C +S +++PN S T +
Sbjct: 71 LYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAV 130
Query: 61 PCDDLICRR----PPFRC-ENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP-- 112
PC D C P C ++ C + I Y G++ SG ++ TF + L P
Sbjct: 131 PCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDN 190
Query: 113 -GVIFGCS-------NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVY 162
VIFGC + N D + D GI+GF + S+L QL ++ + +FS+CL
Sbjct: 191 SSVIFGCGAKQSGSLSSNSDEALD----GIIGFGQANSSVLSQLAASGKVKRIFSHCL-- 244
Query: 163 AYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL 222
I G+ + T + V R +HY + L+D+ V I P L
Sbjct: 245 --DSHHGGGIFSIGQ---VMEPKFNTTPL-VPRMAHYNVILKDMDVDGEPI-LLP--LYL 295
Query: 223 RRNGTG-GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN--ASEDWEYCYRYD 279
+G+G G +ID+G ++ Y ++ GRQ ED C+ Y
Sbjct: 296 FDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKV------LGRQPGLKLMIVEDQFTCFHYS 349
Query: 280 SRF-RAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQ 331
+ + + FHF+ V P F+++ E +C+ S + ++G
Sbjct: 350 DKLDEGFPVVKFHFEGLSLTVHPHDYLFLYK-EDIYCIGWQKSSTQTKEGRDLILIGDLV 408
Query: 332 QQDTRFVYDLNTGTIQFVPENCAN 355
+ VYDL I + NC++
Sbjct: 409 LSNKLVVYDLENMVIGWTNFNCSS 432
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 124/283 (43%), Gaps = 40/283 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y V V FGTP+ + ++ DTGS + W QC PC + CF Q P+++P+ SSTY +PC
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172
Query: 65 LICRRPPF-----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI--- 115
+C++ C +G QC I+YA G S G S ++KL PG I
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYS--------QDKLTLAPGAIVQN 224
Query: 116 --FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
FGC + + G G+LG L L + G+FSYCL + L
Sbjct: 225 FYFGCGHGKH--AVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSVSSK---PGFL 275
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G N + + + ++L I+V ++ P F +GG ++D
Sbjct: 276 ALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------SGGMIVD 329
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY 276
+G + T +Q Y + F + ++ + + N D + CY
Sbjct: 330 SGTVITGLQSTAYRALRSAFRKAMEAY--RLLPNG--DLDTCY 368
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 150/351 (42%), Gaps = 29/351 (8%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDLICRRPP 71
GTP+ ++ DTGS L W QC PC V+C QS P+FNP +SSTY + C C P
Sbjct: 3 LGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLP 62
Query: 72 FRCENGQ-------CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
N C+++ +Y + + G +S +T +F + +P +GC DN
Sbjct: 63 SATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----LPNFYYGCGQDNEG 118
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG++G + + SLL QL + F+YCL ++S N +
Sbjct: 119 --LFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL-----PSSSSSGYLSLGSYNPGQY 171
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
+ S Y++ L ++VA + + + ++ +ID+G + T +
Sbjct: 172 SYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT-----IIDSGTVITRLPTS 226
Query: 245 PYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF-DRADFKVEPTY 303
Y + + R ++ + C++ + + ++T F A K+
Sbjct: 227 VYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASRVSAPAVTMSFAGGAALKLSAQN 283
Query: 304 MYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + ++ C+A + + +++G QQQ VYD+ + I F C+
Sbjct: 284 L-LVDVDDSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 155/371 (41%), Gaps = 43/371 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P S++Y+ + C+
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
P C E CV+ YA +S+SG++S + +F +++L +FGC N+
Sbjct: 133 ------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQL-SPQRAVFGCENE 185
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
F GI+G S++ QL K + +FS C Y ME G A
Sbjct: 186 ETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC----YGGMEV------GGGA 235
Query: 180 NIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ K M RS +Y + L+ + VA + P F NG G ++D+
Sbjct: 236 MVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDS 291
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYRYDSR-------FRAYA 286
G + + + + + S +R+H ++ + C+ R F
Sbjct: 292 GTTYAYFPKEAFIAIKDAVIKEIPSL--KRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 349
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDLNTG 344
+M F + Y++ + G +C+ I F DR+S ++G ++T YD
Sbjct: 350 AMEFGNGQKLILSPENYLFRHTKVRGAYCLGI-FPDRDSTTLLGGIVVRNTLVTYDREND 408
Query: 345 TIQFVPENCAN 355
+ F+ NC++
Sbjct: 409 KLGFLKTNCSD 419
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 151/357 (42%), Gaps = 31/357 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + GTP+K ++ DTGS L W QC PC V+C QS P+F+P SS+Y + C
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196
Query: 66 ICRR------PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C P C + C+++ +Y + + G +S +T +F + VP +GC
Sbjct: 197 QCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFGSNS----VPNFYYGC 252
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL T FSYCL + +
Sbjct: 253 GQDNEGLF--GRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSY---- 306
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
N + + S Y++ L ++VA + + ++ +ID+G +
Sbjct: 307 -NPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPT-----IIDSGTVI 360
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-RYDSRFRAYA-SMTFHFDRAD 296
T + Y+ + + G +R +A + C+ S R A SM F A
Sbjct: 361 TRLPTTVYDALSKAVAGAMK--GTKRA-DAYSILDTCFVGQASSLRVPAVSMAFSGGAA- 416
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
K+ + + + C+A + + +++G QQQ VYD+ + I F C
Sbjct: 417 LKLSAQNL-LVDVDSSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAGGC 472
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 148/362 (40%), Gaps = 37/362 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + L DT + W C C C SAP F+P AS++Y+ +PC +
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPL 169
Query: 67 CRRPP-FRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C + P C G C + YA +S +S ++ V FGC
Sbjct: 170 CAQAPNAACPPGGKACGFSLTYA-DSSLQAALSQDSLAVAGD----AVKTYTFGCLQKAT 224
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ LG S L Q + QG FSYCL +++ + + LR G+ N Q
Sbjct: 225 GTAAPPQGLLGLGRGPL--SFLSQTRDMYQGTFSYCL-PSFKSLNFSGTLRLGR--NGQP 279
Query: 184 KDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+KT + + RSS YY+++ I V + P A G ++D+G + T +
Sbjct: 280 PRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRL 339
Query: 242 QRGPY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
Y + V R +S G ++ C +++ A+ +T FD
Sbjct: 340 VAPAYVAVRDEVRRRVGAPVSSLG---------GFDTC--FNTTAVAWPPVTLLFDGMQV 388
Query: 298 KVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
+ P I G C+A++ + +V+ + QQQ+ R ++D+ G + F E
Sbjct: 389 TL-PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARE 447
Query: 352 NC 353
C
Sbjct: 448 RC 449
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 143/355 (40%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P SSTY + C
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ + G + +T T + V G FGC N
Sbjct: 240 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 296
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++CL T L FG + +
Sbjct: 297 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP---ARSTGTGYLDFGAGSLAAAR 351
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T M + + YY+ + I V + FA T G ++D+G + T +
Sbjct: 352 ARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSGTVITRLPP 406
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEP 301
Y +R+ + + A + CY + + A +++ F A V+
Sbjct: 407 AAYS-SLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + + C+A + ++ +VG Q + YD+ + F P C
Sbjct: 466 SGIMYA-ASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 164/397 (41%), Gaps = 69/397 (17%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNC-----FNQSAPIFNPNASSTYK 58
Y++ + FGTP ++ + DTGS L+W C C C P F P SS+ K
Sbjct: 83 YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSK 142
Query: 59 RIPCDDLICRR---PPFRCENGQC------------VHRINYAGGASASGLVSTETFTFH 103
I C + C P + + +C + I Y G++A GL+ +ET F
Sbjct: 143 LIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGSTA-GLLLSETLDFP 201
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLV 161
K +P + GCS FS GI GF SP SL QL GL FSYCLV
Sbjct: 202 NKKT---IPDFLVGCS----IFSIK-QPEGIAGFGRSPESLPSQL-----GLKKFSYCLV 248
Query: 162 -YAYREMEATS--ILRFGKDANIQRKDMKTIRMFVDRSS-----HYYLSLQDISVADHRI 213
+A+ + +S +L G + + + + F+ + +YY+ L++I + D +
Sbjct: 249 SHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHV 308
Query: 214 G-----FAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA 268
PGT +G GG ++D+G TF++ YE+V + F++ +
Sbjct: 309 KVPYKFLVPGT-----DGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQN 363
Query: 269 SEDWEYCYRYD-SRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-- 325
CY + + + F F P YF + G C+ I SD +
Sbjct: 364 LTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTI-VSDNVAGP 422
Query: 326 --------VVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++G +QQ++ +DL F ++CA
Sbjct: 423 GLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSCA 459
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 124/283 (43%), Gaps = 40/283 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y V V FGTP+ + ++ DTGS + W QC PC + CF Q P+++P+ SSTY +PC
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 138
Query: 65 LICRRPPF-----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI--- 115
+C++ C +G QC I+YA G S G S ++KL PG I
Sbjct: 139 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYS--------QDKLTLAPGAIVQN 190
Query: 116 --FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
FGC + + G G+LG L L + G+FSYCL + L
Sbjct: 191 FYFGCGHGKH--AVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSVSSK---PGFL 241
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G N + + + ++L I+V ++ P F +GG ++D
Sbjct: 242 ALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------SGGMIVD 295
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY 276
+G + T +Q Y + F + ++ + + N D + CY
Sbjct: 296 SGTVITGLQSTAYRALRSAFRKAMEAY--RLLPNG--DLDTCY 334
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 151/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 155/371 (41%), Gaps = 43/371 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P S++Y+ + C+
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
P C E CV+ YA +S+SG++S + +F +++L +FGC N+
Sbjct: 133 ------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQL-SPQRAVFGCENE 185
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
F GI+G S++ QL K + +FS C Y ME G A
Sbjct: 186 ETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC----YGGMEV------GGGA 235
Query: 180 NIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ K M RS +Y + L+ + VA + P F NG G ++D+
Sbjct: 236 MVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDS 291
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYRYDSR-------FRAYA 286
G + + + + + S +R+H ++ + C+ R F
Sbjct: 292 GTTYAYFPKEAFIAIKDAVIKEIPSL--KRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 349
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDLNTG 344
+M F + Y++ + G +C+ I F DR+S ++G ++T YD
Sbjct: 350 AMEFGNGQKLILSPENYLFRHTKVRGAYCLGI-FPDRDSTTLLGGIVVRNTLVTYDREND 408
Query: 345 TIQFVPENCAN 355
+ F+ NC++
Sbjct: 409 KLGFLKTNCSD 419
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 151/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+K++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +P FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPSFTFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG P S+L Q G FSYCL E + T
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M R + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGRRGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 146/375 (38%), Gaps = 37/375 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC----LPCVNCFNQSAPIFNPNASSTYKRIPC 62
Y V GTP++ L+ DTGS L W +C A +F AS ++ I C
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIAC 160
Query: 63 DDLICRR-PPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVC------ 110
C PF N C + Y G++A G+V T++ T L +
Sbjct: 161 SSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSS 220
Query: 111 ------VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAY 164
+ GV+ GC+ SF + G+L S S + + G FSYCLV
Sbjct: 221 GGRRAKLQGVVLGCAATYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHL 279
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFAL 222
ATS L FG A + +DR + Y +++ + VA + + +
Sbjct: 280 APRNATSYLTFGPGATAPAAQTP---LLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDV 336
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSR 281
RN GG ++D+G T + Y V+ +H R M + +EYCY + D+
Sbjct: 337 DRN--GGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM----DPFEYCYNWTDAG 390
Query: 282 FRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVY 339
M HF + P Y I G C+ + SV+G QQ+ + +
Sbjct: 391 ALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWEF 450
Query: 340 DLNTGTIQFVPENCA 354
DL ++F CA
Sbjct: 451 DLRDRWLRFKHTRCA 465
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 150/358 (41%), Gaps = 25/358 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCL---PCVNCFNQSAPIFNPNASSTYKRIPCD 63
+ + + G P + TGS L+W CL PC + N F+P SSTYK +PCD
Sbjct: 98 FLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTH--NCDLRFFDPMESSTYKNVPCD 155
Query: 64 DLICR-RPPFRCENGQCVHRINYAGGAS-ASGLVSTETFTFH-LKNKLVCVPGVIFGCSN 120
C+ C+ C + + S G ++ +T T + K +P F C N
Sbjct: 156 SYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFMLPNTGFICGN 215
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDA 179
D GILG SLL ++ G FS+C+V Y+ + TS L FG A
Sbjct: 216 R---IGGDYPGVGILGLGHGSLSLLNRISHLIDGKFSHCIVPYSSNQ---TSKLSFGDKA 269
Query: 180 NIQRKDMKTIRMFVDRSSH-YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ M + R+ + + Y LS ISV + I NG G +D+G +
Sbjct: 270 VVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLG---MDSGTMF 326
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T+ Y +D + + + CYRY F ++T HF+ +
Sbjct: 327 TYFPE--YFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRYSPDFSP-PTITMHFEGGSVE 383
Query: 299 VEPTYMYFIFQNEGYFCVA--ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + FI E C+A S S++++V G WQQ + YDL+ G + F+ +C
Sbjct: 384 LSSSNS-FIRMTEDIVCLAFATSSSEQDAVFGYWQQTNLLIGYDLDAGFLSFLKTDCT 440
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 166/377 (44%), Gaps = 48/377 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + G+P K + DTGS ++W C PC C N +F+ NASST K++
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKV 132
Query: 61 PCDDLICR--RPPFRCENG-QCVHRINYAGGASASG-----LVSTETFTFHLKNKLVCVP 112
CDD C C+ C + I YA +++ G +++ E T LK +
Sbjct: 133 GCDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQE 192
Query: 113 GVIFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYREME 168
V+FGC +D +G+ A G++GF S S+L QL +T A+ +FS+CL ++
Sbjct: 193 -VVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL----DNVK 247
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
I G + +KT M V HY + L + V + ++ RN G
Sbjct: 248 GGGIFAVGV---VDSPKVKTTPM-VPNQMHYNVMLMGMDVDGTSLDLPR---SIVRN--G 298
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQ--RMHNASEDWEYCYRYDSRF-RAY 285
G ++D+G + + Y+ ++ T RQ ++H E ++ C+ + + A+
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIE------TILARQPVKLHIVEETFQ-CFSFSTNVDEAF 351
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRNSVV--GAWQQQDTRFV 338
++F F D V P F + E G+ ++ +R+ V+ G + V
Sbjct: 352 PPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVV 411
Query: 339 YDLNTGTIQFVPENCAN 355
YDL+ I + NC++
Sbjct: 412 YDLDNEVIGWADHNCSS 428
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 142/356 (39%), Gaps = 31/356 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP S+ + DTGS + W QC PC C +Q +F+P SSTY +PC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 65 LIC---RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C R C QC + ++Y G++ +G+ ++T N V +FGC +
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNT---VGTFLFGCGHA 259
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
F G I G+L SL Q G+FSYCL + A L G +
Sbjct: 260 QAGM-FAG-IDGLLALGRQSMSLKSQAAGAYGGVFSYCLP---SKQSAAGYLTLGGPTSA 314
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + Y + L ISV ++ FA GG ++DTG + T +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRL 368
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRADFK 298
Y + F +G A+ + CY RY ++TF A
Sbjct: 369 PPTAYAALRSAFRGAIAPYGYPSAP-ANGILDTCYDFSRYGVVTLPTVALTFS-GGATLA 426
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+E + + G A + D + +++G QQ+ F + T+ F+P C
Sbjct: 427 LEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRS--FAVRFDGSTVGFMPGAC 476
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 158/383 (41%), Gaps = 53/383 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTPSK ++ DTGS ++W C+ C +C +S +++P AS++ K +
Sbjct: 88 LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTV 147
Query: 61 PCDDLICRR------PPFRCENGQCVHRINYAGGASASGLVSTETFTF---------HLK 105
C C PP N C + I Y G+S +G + + +L
Sbjct: 148 TCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLA 207
Query: 106 NKLVCVPGVIFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTAQ--GLFSYCLV 161
N V FGC N+A GILGF + S+L QL S + +FS+CL
Sbjct: 208 NA-----SVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCL- 261
Query: 162 YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
+ I G N+ + +KT + V HY + L+ I V + F
Sbjct: 262 ---DTVNGGGIFAIG---NVVQPKVKTTPL-VPGMPHYNVVLKTIDVGGSTLQLPTNIFD 314
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
+ G+ G +ID+G ++ Y+ V+ F++ + N +D+ C++Y
Sbjct: 315 I-GGGSRGTIIDSGTTLAYLPEVVYKAVLSAV---FSNHPDVTLKNV-QDF-LCFQYSGS 368
Query: 282 F-RAYASMTFHFDRADFKVEPTYMYFIFQN-EGYFCVAISFSDRNS-------VVGAWQQ 332
+ +TFHFD D + ++FQN E +CV S ++G
Sbjct: 369 VDNGFPEVTFHFD-GDLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLAL 427
Query: 333 QDTRFVYDLNTGTIQFVPENCAN 355
+ VYDL I + NC++
Sbjct: 428 SNKLVVYDLENQVIGWTNYNCSS 450
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 151/341 (44%), Gaps = 39/341 (11%)
Query: 40 NCFNQSAPIFNPNASSTYKRIPCDDLICR---RPPFRCENGQCVHRINYAGGASASGLVS 96
C + AP F P +SST+ ++PC +C+ P C CV+ Y G +A G ++
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMGFTA-GYLA 145
Query: 97 TETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLF 156
TE T H+ PGV FGCS +N + +GI+G SP SL+ Q+ G F
Sbjct: 146 TE--TLHVGGA--SFPGVAFGCSTEN---GVGNSSSGIVGLGRSPLSLVSQV---GVGRF 195
Query: 157 SYCLVYAYREMEAT-SILRFGKDANIQRKDMKTIRM---FVDRSSHYYLSLQDISVADHR 212
SYCL + +A S + FG A + + + SS+YY++L I+V
Sbjct: 196 SYCL---RSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATD 252
Query: 213 IGFAPGTFALRRNG----TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA 268
+ TF R GG ++D+G T++ + Y +V R F + N
Sbjct: 253 LPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNG 312
Query: 269 SE-DWEYCYRYDSRFRAYA----SMTFHF-DRADFKV-EPTYMYFI-FQNEGYFCV---- 316
+ ++ C+ ++ ++ F A++ V +Y+ + ++G V
Sbjct: 313 TRFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLL 372
Query: 317 AISFSDR--NSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+ S++ S++G Q D +YDL+ G F P +CAN
Sbjct: 373 VLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 413
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 160/375 (42%), Gaps = 40/375 (10%)
Query: 7 YTVDVLFGTPS---KSEFLLFDTGSYLIWTQCLPCVNCFNQSA-PIFNPNASSTYKRIPC 62
Y V + GTP+ ++LFDTGS L WTQC PC NC + + P +P+ S T++R+ C
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 182
Query: 63 DDLICRRPPFRCENG----QCVHRINYAGGASASGLVSTETFTFHLKNK---LVCVPGVI 115
D +C + G C+ R Y G + SG + ++ F F V
Sbjct: 183 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 242
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA--------YREM 167
FGC++ + G GIL + S + QL FSYC+ + E
Sbjct: 243 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR---FSYCIPASEITDDDDDDDEE 299
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADH-----RIGFAPGTFAL 222
+ S LRFG A + K F S Y + L+ + V H + P A
Sbjct: 300 RSASFLRFGSHARMTGKRAP----FKQDGSGYAVRLKSV-VYQHGGRLNQQQPVPVYVAG 354
Query: 223 RRNGTGGCM-IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
M +D+G ++ + + R +E + R ++ + YCY +
Sbjct: 355 EEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS---LTRRYDLTHPSLYCYLGNMT 411
Query: 282 FRAYASMTFHF-DRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGAWQQQDTRFV 338
S+T F AD ++ T ++F +N E + C+A++ +R +++G + Q++
Sbjct: 412 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYPQRNINVG 470
Query: 339 YDLNTGTIQFVPENC 353
YDL+T I F + C
Sbjct: 471 YDLSTMEIAFDRDQC 485
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 151/377 (40%), Gaps = 52/377 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
N +Y V + G PSK FL DTGS L W QC PCV C P + P + +PC
Sbjct: 31 NGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNN----LVPC 86
Query: 63 DDLICR----RPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
D IC+ RCEN GQC + + YA G S+ G++ T+TF + ++ P + G
Sbjct: 87 MDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPLLALG 146
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRF 175
C D I G+LG S++ QL S + + +CL +
Sbjct: 147 CGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDLY 206
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
+ M D + HY L +++ GF +N D+G
Sbjct: 207 ------DSSRVAWTPMSPD-AKHYSPGLAELTFDGKTTGF--------KNLL--TTFDSG 249
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHFD 293
A T++ Y+ ++ + + + + A +D C++ F++ + +F
Sbjct: 250 ASYTYLNSQAYQGLISLLKKELSG---KPLREALDDQTLPLCWKGRKPFKSIRDVKKYFK 306
Query: 294 ----------RADFKVE-PTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDTR 336
++ ++E P Y I ++G C+ I +D N V+G QD
Sbjct: 307 TFALSFTNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLN-VIGDISMQDRV 365
Query: 337 FVYDLNTGTIQFVPENC 353
+YD I + P NC
Sbjct: 366 VIYDNEKERIGWAPGNC 382
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 101/394 (25%), Positives = 151/394 (38%), Gaps = 70/394 (17%)
Query: 23 LFDTGSYLIWTQCLPC----------VNCFNQSAPIFNPNASSTYKRIPCDD---LICRR 69
+ DTGS L+WTQC C CF Q+ P +N + S T + +PCDD +C
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 70 PP--FRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
P C G CV +Y G A G++ T+ FTF + + + FGC +
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAGV-ALGVLGTDAFTFPSSSSVT----LAFGCVSQ 191
Query: 122 NRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
R N A GI+G SL+ QL +T FSYCL +R+ + S L G
Sbjct: 192 TRISPGALNGASGIIGLGRGALSLVSQLNATE---FSYCLTPYFRDTVSPSHLFVGDGEL 248
Query: 181 IQRKDMK----------TIRMFVDR------SSHYYLSLQDISVADHRIGFAPGTFALRR 224
+ T F S+ YYL L ++ + + G F LR
Sbjct: 249 AGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLRE 308
Query: 225 NG----TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA--SEDWEYCYRY 278
GG +ID+G+ T + + + + G A E C
Sbjct: 309 AAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEA 368
Query: 279 DSRFRAYAS-----MTFHFDRADFK----VEPTYMYFIFQNEGYFCVAISFS-------- 321
+ A+ + FD V P Y+ +C+A+ S
Sbjct: 369 GDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATLP 428
Query: 322 -DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ +++G + QQD R +YDL G + F P NC+
Sbjct: 429 TNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 462
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 147/370 (39%), Gaps = 37/370 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP-CVNCFNQSAPIFNPNASSTYKR 59
H FY V++ GTP + + D G L+WTQC C CF Q P+F+ NASST++
Sbjct: 45 HFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRP 104
Query: 60 IPCDDLICRRPPFRCENGQCVHRINYAGGAS---ASGLVSTETFTFHLKNKLVCVPGVIF 116
PC +C P R G Y S G + T+ + F
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATAR----LAF 160
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC+ + + G+ +G +G + SL Q+ +TA FSYCL A + +S L G
Sbjct: 161 GCAVASEMDTMWGS-SGSVGLGRTNLSLAAQMNATA---FSYCL--APPDTGKSSALFLG 214
Query: 177 KDANIQRKDMKT-IRMFVDRSS--------HYYLSLQDISVADHRIGFAPGTFALRRNGT 227
A + FV S+ Y L L+ I R G A T A+ ++G
Sbjct: 215 ASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAI-----RAGNA--TIAMPQSGN 267
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
M+ T T + Y + + + + G + ++++ C+ S
Sbjct: 268 -TIMVSTATPVTALVDSVYRDLRKAVAD---AVGAAPVPPPVQNYDLCFPKASASGGAPD 323
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR---NSVVGAWQQQDTRFVYDLNTG 344
+ F P Y CVAI S S++G+ QQ + ++DL+
Sbjct: 324 LVLAFQGGAEMTVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKE 383
Query: 345 TIQFVPENCA 354
T+ F P +C+
Sbjct: 384 TLSFEPADCS 393
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 158/376 (42%), Gaps = 46/376 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + G+P K + DTGS ++W C PC C N +F+ NASST K++
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKV 132
Query: 61 PCDDLICR--RPPFRCENG-QCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP---G 113
CDD C C+ C + I YA +++ G + T + L P
Sbjct: 133 GCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQE 192
Query: 114 VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYREMEA 169
V+FGC +D D + G++GF S S+L QL +T A+ +FS+CL ++
Sbjct: 193 VVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL----DNVKG 248
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
I G + +KT M V HY + L + V + P ++ RN GG
Sbjct: 249 GGIFAVGV---VDSPKVKTTPM-VPNQMHYNVMLMGMDVDGTALDLPP---SIMRN--GG 299
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQ--RMHNASEDWEYCYRYDSRFR-AYA 286
++D+G + + Y+ ++ T RQ ++H ED C+ + A+
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIE------TILARQPVKLH-IVEDTFQCFSFSENVDVAFP 352
Query: 287 SMTFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRNSVV--GAWQQQDTRFVY 339
++F F D V P F + E G+ ++ +R V+ G + VY
Sbjct: 353 PVSFEFEDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVY 412
Query: 340 DLNTGTIQFVPENCAN 355
DL I + NC++
Sbjct: 413 DLENEVIGWADHNCSS 428
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 145/360 (40%), Gaps = 48/360 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF +V V GTP L+ DTGS ++W QC PC C+ QS +F+P S +Y + C
Sbjct: 141 EYFASVGV--GTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCG 198
Query: 64 DLICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
CR G C++++ Y G+ +G ++TET F + VP V G
Sbjct: 199 APPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARGAR---VPRVAVG 255
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C +DN LG SL Q FSYC G
Sbjct: 256 CGHDNEGLFVAAAGLLGLGRGRL--SLPTQTARRYGRRFSYCFQ--------------GS 299
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDI-SVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D D +TI R+ H ++ + V + + P T G GG ++D+G
Sbjct: 300 DL-----DHRTI----IRTVHQHVGGARVRGVGERSLRLDPST------GRGGVILDSGT 344
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FRAYASMTFHF-DR 294
T + R Y V F + G + ++ CY R +++ H
Sbjct: 345 SVTRLARPVYVAVREAF--RAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGG 402
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
A+ + P G FC+A++ +D S+VG QQQ R V+D + + VP++C
Sbjct: 403 AEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 160/375 (42%), Gaps = 40/375 (10%)
Query: 7 YTVDVLFGTPS---KSEFLLFDTGSYLIWTQCLPCVNCFNQSA-PIFNPNASSTYKRIPC 62
Y V + GTP+ ++LFDTGS L WTQC PC NC + + P +P+ S T++R+ C
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 161
Query: 63 DDLICRRPPFRCENG----QCVHRINYAGGASASGLVSTETFTFHLKNK---LVCVPGVI 115
D +C + G C+ R Y G + SG + ++ F F V
Sbjct: 162 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 221
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA--------YREM 167
FGC++ + G GIL + S + QL FSYC+ + E
Sbjct: 222 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR---FSYCIPASEITDDDDDDDEE 278
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADH-----RIGFAPGTFAL 222
+ S LRFG A + K F S Y + L+ + V H + P A
Sbjct: 279 RSASFLRFGSHARMTGKRAP----FKQDGSGYAVRLKSV-VYQHGGRLNQQQPVPVYVAG 333
Query: 223 RRNGTGGCM-IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
M +D+G ++ + + R +E + R ++ + YCY +
Sbjct: 334 EEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS---LTRRYDLTHPSLYCYLGNMT 390
Query: 282 FRAYASMTFHF-DRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGAWQQQDTRFV 338
S+T F AD ++ T ++F +N E + C+A++ +R +++G + Q++
Sbjct: 391 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYPQRNINVG 449
Query: 339 YDLNTGTIQFVPENC 353
YDL+T I F + C
Sbjct: 450 YDLSTMEIAFDRDQC 464
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 171/379 (45%), Gaps = 46/379 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP+KS ++ DTGS ++W C+ C C +S ++N + S + K +
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 61 PCDDLICRR----PPFRCE-NGQCVHRINYAGGASASG-----LVSTETFTFHLKNKLVC 110
CDD C + P C+ N C + Y G+S +G +V ++ LK +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQ-TA 197
Query: 111 VPGVIFGCS---NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYR 165
VIFGC + + D S + + GILGF + S++ QL S+ + +F++CL
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----D 253
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
I G+ +Q K + V HY +++ + V + F +
Sbjct: 254 GRNGGGIFAIGR--VVQPK--VNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLF--QPG 307
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
G +ID+G ++ YE +++ + ++H +D++ C++Y R
Sbjct: 308 DRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPAL---KVHIVDKDYK-CFQYSGRVDEG 363
Query: 285 YASMTFHFDRADF-KVEPTYMYFIFQNEGYFCV-----AISFSDRN--SVVGAWQQQDTR 336
+ ++TFHF+ + F +V P ++F +EG +C+ A+ DR +++G +
Sbjct: 364 FPNVTFHFENSVFLRVYP--HDYLFPHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKL 421
Query: 337 FVYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 422 VLYDLENQLIGWTEYNCSS 440
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 151/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTPSK++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFSFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG S+L Q T FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M + + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 150/361 (41%), Gaps = 24/361 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTPS+ L+ D+GS + + C C C N P F P+ SSTY + C+
Sbjct: 88 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCN 147
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C E QC + YA +S+SG++ + +F +++L V FGC N
Sbjct: 148 VDCTCDN-----ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAV-FGCENTE 201
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + GI+G S++ QL S+ L Y ++ +++ G A
Sbjct: 202 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAP-- 259
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM RS +Y + L++I VA + P F N G ++D+G ++
Sbjct: 260 -PDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF----NSKHGTVLDSGTTYAYLP 314
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH------FDRAD 296
+ S + R + + + C+ R + S F +
Sbjct: 315 EQAFVAFKDAVTNKVNSLKKIRGPDPNYK-DICFAGAGRNVSQLSEVFPDVDMVFGNGQK 373
Query: 297 FKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y++ + EG +C+ + + D +++G ++T YD + I F NC
Sbjct: 374 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 433
Query: 354 A 354
+
Sbjct: 434 S 434
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 160/377 (42%), Gaps = 42/377 (11%)
Query: 7 YTVDVLFGTPS---KSEFLLFDTGSYLIWTQCLPCVNCFNQSA-PIFNPNASSTYKRIPC 62
Y V + GTP+ ++LFDTGS L WTQC PC NC + + P +P+ S T++R+ C
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 181
Query: 63 DDLICRRPPFRCENG----QCVHRINYAGGASASGLVSTETFTFHLKNK---LVCVPGVI 115
D +C + G C+ R Y G + SG + ++ F F V
Sbjct: 182 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 241
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA----------YR 165
FGC++ + G GIL + S + QL FSYC+ +
Sbjct: 242 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR---FSYCIPASEITDDDDDDDDD 298
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADH-----RIGFAPGTF 220
E + S LRFG A + K F S Y + L+ + V H + P
Sbjct: 299 EERSASFLRFGSHARMTGKRAP----FKQDGSGYAVRLKSV-VYQHGGRLNQQQPVPVYV 353
Query: 221 ALRRNGTGGCM-IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD 279
A M +D+G ++ + + R +E + R ++ + YCY +
Sbjct: 354 AGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS---LTRRYDLTHPSLYCYLGN 410
Query: 280 SRFRAYASMTFHF-DRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGAWQQQDTR 336
S+T F AD ++ T ++F +N E + C+A++ +R +++G + Q++
Sbjct: 411 MTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYPQRNIN 469
Query: 337 FVYDLNTGTIQFVPENC 353
YDL+T I F + C
Sbjct: 470 VGYDLSTMEIAFDRDQC 486
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 141/351 (40%), Gaps = 27/351 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P+ ++ ++ DTGS + W +C +F+P+ S+TY C
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRC-----NSTDGLTLFDPSKSTTYAPFSCSSAA 183
Query: 67 CRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C + C N C +R+ Y G++ +G S++T + V FGCS+
Sbjct: 184 CAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASDT---VTDFHFGCSHHEE 240
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
DF + I G++G SL+ Q +T FSYCL R + L FG
Sbjct: 241 DFDGE-KIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNR---TSGFLTFGAPNGTSG 296
Query: 184 KDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ T + ++ Y + LQDISV +G P + G ++D+G + T++
Sbjct: 297 GFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVL------SNGSVMDSGTVITWLP 350
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
R Y + F T QR + CY + V+
Sbjct: 351 RRAYSALSSAFRSSMTRLRHQRAAPLGI-LDTCYDFTGLVNVSIPAVSLVLDGGAVVDLD 409
Query: 303 YMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ Q+ C+A + + +S++G QQ+ ++D+ G F C
Sbjct: 410 GNGIMIQD----CLAFAATSGDSIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/372 (25%), Positives = 151/372 (40%), Gaps = 37/372 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP++ ++FDTGS L W QC PC + C++Q P+F P++SST+ + C +
Sbjct: 85 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGE 144
Query: 65 LICRRPPFRCENG----QCVHRINYAGGASASGLVSTETFTFHL-------KNKLVCVPG 113
C R C + +C + + Y + G + +T T +N +PG
Sbjct: 145 PECPRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPG 204
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
+FGC +N G G+ G SL Q FSYCL + A L
Sbjct: 205 FVFGCGENNTGLF--GKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSS--SSNAHGYL 260
Query: 174 RFGKDANIQRKDMKTIRMFVDRS---SHYYLSLQDISVADHRIGFA--PGTFALRRNGTG 228
G A T ++RS S YY+ L I VA I + P +
Sbjct: 261 SLGTPAPAPAHARFT--PMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALW------PA 312
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM 288
G ++D+G + T + Y + F +G +R S + CY + + A S+
Sbjct: 313 GLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSI-LDTCYDFTAHANATVSI 371
Query: 289 T----FHFDRADFKVE-PTYMYFIFQNEGYFCVAISFSDRNS-VVGAWQQQDTRFVYDLN 342
A V+ +Y + A + + R++ ++G QQ+ VYD+
Sbjct: 372 PAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVG 431
Query: 343 TGTIQFVPENCA 354
I F + C+
Sbjct: 432 RQKIGFAAKGCS 443
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 142/356 (39%), Gaps = 31/356 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP S+ + DTGS + W QC PC C +Q +F+P SSTY +PC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 65 LIC---RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C R C QC + ++Y G++ +G+ ++T N V +FGC +
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNT---VGTFLFGCGHA 259
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
F G I G+L SL Q G+FSYCL + A L G ++
Sbjct: 260 QAGM-FAG-IDGLLALGRQSMSLKSQAAGAYGGVFSYCLP---SKQSAAGYLTLGGPSSA 314
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ + Y + L ISV ++ FA GG ++DTG + T +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRL 368
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRADFK 298
Y + F G A+ + CY RY ++TF A
Sbjct: 369 PPTAYAALRSAFRGAIAPCGYPSAP-ANGILDTCYDFSRYGVVTLPTVALTFS-GGATLA 426
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+E + + G A + D + +++G QQ+ F + T+ F+P C
Sbjct: 427 LEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRS--FAVRFDGSTVGFMPGAC 476
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 84/154 (54%), Gaps = 7/154 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V + GTP L+FDTGS L WTQC PC+ +C++Q P FNP++SS+Y + C
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSSP 193
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
+C P C C++ I Y G+ G ++ E FT L N V + + FGC +N+
Sbjct: 194 MCGNPE-SCSASNCLYGIGYGDGSVTVGFLAKEKFT--LTNSDV-LDDIYFGCGENNKGV 249
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYC 159
F G+ AGILG FS Q +T +FSYC
Sbjct: 250 -FIGS-AGILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 145/356 (40%), Gaps = 25/356 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIP---- 61
Y + GTP+KS ++ DTGS L W QC PC V+C QS P+FNP ASS+Y +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 62 -CDDLICRR-PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C DL P C C+++ +Y + + G +S +T +F + VP +GC
Sbjct: 189 QCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPNFYYGC 244
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL + +
Sbjct: 245 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 302
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M + + S Y++ + I VA P + + + +ID+G +
Sbjct: 303 GQYSYTPMASSSL---DDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVI 354
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + G Y + + R +A + C++ + +T F
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTPRA---SAFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + C+A + + +++G QQQ VYD+ I F C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 146/348 (41%), Gaps = 31/348 (8%)
Query: 16 PSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPF 72
P + ++ D+ S + W QC+PC C Q ++P+ S T C C P+
Sbjct: 25 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPY 84
Query: 73 R--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
C N QC + + Y G+S SG + T N V G FGCS+ + SFD
Sbjct: 85 ANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA---VSGFKFGCSHAEQG-SFDAR 140
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIR 190
AGI+ P SLL Q S FSYC + A L + A+ + +R
Sbjct: 141 AAGIMALGGGPESLLSQTASRYGNAFSYC-IPATASDSGFFTLGVPRRASSRYVVTPMVR 199
Query: 191 MFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVM 250
F ++ Y + L+ I+V R+G AP FA G ++D+ T + Y+ +
Sbjct: 200 -FRQAATFYGVLLRTITVGGQRLGVAPAVFA------AGSVLDSRTAITRLPPTAYQALR 252
Query: 251 RHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDR-ADFKVEPTYMYFIF 308
F T + R + CY + ++ FDR A ++P+ + F
Sbjct: 253 AAFRSSMTMY---RSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF-- 307
Query: 309 QNEGYFCVAISFS--DR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
N+ C+A + + DR V+G+ QQQ +YD+ G + F C
Sbjct: 308 -ND---CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 160/377 (42%), Gaps = 42/377 (11%)
Query: 7 YTVDVLFGTPS---KSEFLLFDTGSYLIWTQCLPCVNCFNQSA-PIFNPNASSTYKRIPC 62
Y V + GTP+ ++LFDTGS L WTQC PC NC + + P +P+ S T++R+ C
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 163
Query: 63 DDLICRRPPFRCENG----QCVHRINYAGGASASGLVSTETFTFHLKNK---LVCVPGVI 115
D +C + G C+ R Y G + SG + ++ F F V
Sbjct: 164 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 223
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA----------YR 165
FGC++ + G GIL + S + QL FSYC+ +
Sbjct: 224 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR---FSYCIPASEITDDDDDDDDD 280
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADH-----RIGFAPGTF 220
E + S LRFG A + K F S Y + L+ + V H + P
Sbjct: 281 EERSASFLRFGSHARMTGKRAP----FKQDGSGYAVRLKSV-VYQHGGRLNQQQPVPVYV 335
Query: 221 ALRRNGTGGCM-IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD 279
A M +D+G ++ + + R +E + R ++ + YCY +
Sbjct: 336 AGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS---LTRRYDLTHPSLYCYLGN 392
Query: 280 SRFRAYASMTFHF-DRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGAWQQQDTR 336
S+T F AD ++ T ++F +N E + C+A++ +R +++G + Q++
Sbjct: 393 MTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYPQRNIN 451
Query: 337 FVYDLNTGTIQFVPENC 353
YDL+T I F + C
Sbjct: 452 VGYDLSTMEIAFDRDQC 468
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 160/377 (42%), Gaps = 42/377 (11%)
Query: 7 YTVDVLFGTPS---KSEFLLFDTGSYLIWTQCLPCVNCFNQSA-PIFNPNASSTYKRIPC 62
Y V + GTP+ ++LFDTGS L WTQC PC NC + + P +P+ S T++R+ C
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 160
Query: 63 DDLICRRPPFRCENG----QCVHRINYAGGASASGLVSTETFTFHLKNK---LVCVPGVI 115
D +C + G C+ R Y G + SG + ++ F F V
Sbjct: 161 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 220
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA----------YR 165
FGC++ + G GIL + S + QL FSYC+ +
Sbjct: 221 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR---FSYCIPASEITDDDDDDDDD 277
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADH-----RIGFAPGTF 220
E + S LRFG A + K F S Y + L+ + V H + P
Sbjct: 278 EERSASFLRFGSHARMTGKRAP----FKQDGSGYAVRLKSV-VYQHGGRLNQQQPVPVYV 332
Query: 221 ALRRNGTGGCM-IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD 279
A M +D+G ++ + + R +E + R ++ + YCY +
Sbjct: 333 AGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS---LTRRYDLTHPSLYCYLGN 389
Query: 280 SRFRAYASMTFHF-DRADFKVEPTYMYFIFQN--EGYFCVAISFSDRNSVVGAWQQQDTR 336
S+T F AD ++ T ++F +N E + C+A++ +R +++G + Q++
Sbjct: 390 MTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYPQRNIN 448
Query: 337 FVYDLNTGTIQFVPENC 353
YDL+T I F + C
Sbjct: 449 VGYDLSTMEIAFDRDQC 465
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 149/371 (40%), Gaps = 37/371 (9%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+N Y V GTP+++ + DT S + W +PC C S+ +FN AS+TYK + C
Sbjct: 97 QNPTYIVRAKIGTPAQTMLMAMDTSSDVAW---IPCNGCLGCSSTLFNSPASTTYKSLGC 153
Query: 63 DDLICRR---------------PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNK 107
C++ P C G C + Y GG+S + +S +T T
Sbjct: 154 QAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTY-GGSSLAANLSQDTITLATD-- 210
Query: 108 LVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM 167
VPG FGC S LG SLL Q ++ Q FSYCL +++ +
Sbjct: 211 --AVPGYSFGCIQKATGGSLPAQGLLGLGRGPL--SLLSQTQNLYQSTFSYCL-PSFKSL 265
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
+ LR G +R + R S Y+++L + V + PG+F +
Sbjct: 266 NFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTG 325
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
G + D+G + T + Y V F GR + ++ CY A +
Sbjct: 326 AGTIFDSGTVFTRLVTPAYIAVRDAFRNR---VGRNLTVTSLGGFDTCYTVP---IAAPT 379
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLN 342
+TF F + + P + C+A++ + N +V+ QQQ+ R +YD+
Sbjct: 380 ITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVP 439
Query: 343 TGTIQFVPENC 353
+ E C
Sbjct: 440 NSRLGVARELC 450
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/343 (25%), Positives = 140/343 (40%), Gaps = 34/343 (9%)
Query: 19 SEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFRCEN 76
S+ ++ DT S + W QCLPC C Q P+++P SST+ IPC C+ N
Sbjct: 168 SQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGN 227
Query: 77 G------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
G +C + +NY G + +G T+T T + + V FGCS+ R SF
Sbjct: 228 GCSPTTDECKYIVNYGDGKATTGTYVTDTLTM---SPTIVVKDFRFGCSHAVRG-SFSNQ 283
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIR 190
AGIL SLL Q FSYC+ + + L G K T
Sbjct: 284 NAGILALGGGRGSLLEQTADAYGNAFSYCI----PKPSSAGFLSLGGPVEASLKFSYTPL 339
Query: 191 MFVDRSSHYYL-SLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVV 249
+ + +Y+ L+ I VA ++ P FA G ++D+GA+ T + Y +
Sbjct: 340 IKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAAL 393
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTFHFDRADFKVEPTYMYF 306
F ++G + + + CY R+ S+ F A +EP +
Sbjct: 394 RAAFRSAMAAYG--PLAAPVRNLDTCYDFTRFPDVKVPKVSLVFA-GGATLDLEPASIIL 450
Query: 307 IFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQF 348
+G A + + + +G QQQ +YD+ G + F
Sbjct: 451 ----DGCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGF 489
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 135/348 (38%), Gaps = 72/348 (20%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y +++ GTP S + DTGS LIW QCLPC +C+ Q P+F+P S TYK +
Sbjct: 29 YLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTL------ 82
Query: 67 CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPGVIFGCSNDNRDF 125
G +S+ETFT + PG+ FGC + N
Sbjct: 83 --------------------------GYLSSETFTIGSTEGDPASFPGLAFGCGHSNGG- 115
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
+F+ +G++G P SL+ QL S G FSYCLV + A+S + FGK A +
Sbjct: 116 TFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVS--- 172
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
GT + +ID+G T + R
Sbjct: 173 ------------------------------GSGTSSPAAAEESNIIIDSGTTLTLLPRDF 202
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMY 305
Y + + G Q + + CY + ++T HF AD ++ P
Sbjct: 203 YTDMESALTK---VIGGQTTTDPRGTFSLCYSGVKKLE-IPTITAHFIGADVQLPP-LNT 257
Query: 306 FIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
F+ E C ++ S ++ G Q + YDL + F P +C
Sbjct: 258 FVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 305
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 151/339 (44%), Gaps = 35/339 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTPSK++ + DTGS W C C C + + F + S+T ++ C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGC-HTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 67 CR---RPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C P C++ + C R++Y G+++ G++ +T TF K +PG FGC+
Sbjct: 59 CLLGGSDP-HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFSFGCN 114
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRF 175
D+ + GN+ G+LG S+L Q T FSYCL E + T
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSL 173
Query: 176 GKDANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
GK A R D++ +M + + +++ L ISV R+G +P F+ + G + D
Sbjct: 174 GKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK-----GVVFD 226
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHF 292
+G+ ++I V+ + E +R E CY S +++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLL----RRGAAEEESERNCYDMRSVDEGDMPAISLHF 282
Query: 293 D---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVG 328
D R D ++ Q + +C+A + ++ S++G
Sbjct: 283 DDGARFDLGRGGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 145/364 (39%), Gaps = 25/364 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCF-NQSAPIFNPNASSTYKRIPCDDL 65
Y GTP ++ + D + W C C+ C S+P F+P SSTY+ + C
Sbjct: 100 YVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAP 159
Query: 66 ICRR-PPF--RCENG---QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG--VIFG 117
C + PP C G C ++YA ++ ++ + + N VP FG
Sbjct: 160 QCAQVPPATPSCPAGPGASCAFNLSYAS-STLHAVLGQDALSLSDSNG-AAVPDDHYTFG 217
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C G++GF P S L Q K+T +FSYCL +Y+ + LR G
Sbjct: 218 CLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCL-PSYKSSNFSGTLRLGP 276
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNGTGGCMIDTGA 236
+R + R S YY+++ + V + AL G GG ++D G
Sbjct: 277 AGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGT 336
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ T + Y + F ++ + ++ CY Y + ++ ++ F F
Sbjct: 337 MFTRLSPPAYAALRNAFRRGVSAPAAPALGG----FDTCY-YVNGTKSVPAVAFVFAGGA 391
Query: 297 FKVEPTYMYFIFQNE-GYFCVAISFSDRN------SVVGAWQQQDTRFVYDLNTGTIQFV 349
P I G C+A++ + +V+ + QQQ+ R V+D+ G + F
Sbjct: 392 RVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFS 451
Query: 350 PENC 353
E C
Sbjct: 452 RELC 455
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 156/358 (43%), Gaps = 30/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIP---- 61
Y + GTP+KS ++ DTGS L W QC PC V+C QS P+FNP +SS+Y +
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAP 180
Query: 62 -CDDLICRR-PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
CD L P C C+++ +Y + + G +S +T +F + VP +GC
Sbjct: 181 QCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPNFYYGC 236
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL + SI +
Sbjct: 237 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSIGSYNPG 294
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
Q + +D S Y++ + I+VA + + ++ +ID+G +
Sbjct: 295 ---QYSYTPMAKSSLD-DSLYFIKMTGITVAGKPLSVSASAYSSLPT-----IIDSGTVI 345
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR-YDSRFRA-YASMTFHFDRAD 296
T + Y + + R +A + C++ SR R SM F A
Sbjct: 346 TRLPTDVYSALSKAVAGAMKGTPRA---SAFSILDTCFQGQASRLRVPQVSMAFA-GGAA 401
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
K++ T + + + C+A + + +++G QQQ VYD+ I F C+
Sbjct: 402 LKLKATNL-LVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 170/379 (44%), Gaps = 46/379 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP+KS ++ DTGS ++W C+ C C +S ++N + S + K +
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 61 PCDDLICRR----PPFRCE-NGQCVHRINYAGGASASG-----LVSTETFTFHLKNKLVC 110
CDD C + P C+ N C + Y G+S +G +V ++ LK +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQ-TA 197
Query: 111 VPGVIFGCS---NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYR 165
VIFGC + + D S + + GILGF + S++ QL S+ + +F++CL
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----D 253
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
I G+ +Q K + V HY +++ + V + F +
Sbjct: 254 GRNGGGIFAIGR--VVQPK--VNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLF--QPG 307
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
G +ID+G ++ YE +++ + ++H +D++ C++Y R
Sbjct: 308 DRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPAL---KVHIVDKDYK-CFQYSGRVDEG 363
Query: 285 YASMTFHFDRADF-KVEPTYMYFIFQNEGYFCV-----AISFSDRN--SVVGAWQQQDTR 336
+ ++TFHF+ + F +V P ++F EG +C+ A+ DR +++G +
Sbjct: 364 FPNVTFHFENSVFLRVYP--HDYLFPYEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKL 421
Query: 337 FVYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 422 VLYDLENQLIGWTEYNCSS 440
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 144/362 (39%), Gaps = 35/362 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y GTP+++ + D + W C C C S+P F+P SSTY+ +PC
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-ASSPSFSPTQSSTYRTVPCGSPQ 160
Query: 67 CRR-PPFRCENG---QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + P C G C + YA + L + L+N +V FGC
Sbjct: 161 CAQVPSPSCPAGVGSSCGFNLTYAASTFQAVL---GQDSLALENNVVV--SYTFGCLRVV 215
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G++GF P S L Q K T +FSYCL YR + L+ G Q
Sbjct: 216 SGNSVPPQ--GLIGFGRGPLSFLSQTKDTYGSVFSYCLPN-YRSSNFSGTLKLGPIG--Q 270
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K +KT + + R S YY+++ I V + A G +ID G + T
Sbjct: 271 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 330
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRAYASMTFHFDRADFK 298
+ Y V F GR R A ++ CY + ++TF F A
Sbjct: 331 LAAPVYAAVRDAFR------GRVRTPVAPPLGGFDTCYNVT---VSVPTVTFMFAGAVAV 381
Query: 299 VEPTYMYFIFQNE-GYFCVAISFSDRN------SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
P I + G C+A++ + +V+ + QQQ+ R ++D+ G + F E
Sbjct: 382 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 441
Query: 352 NC 353
C
Sbjct: 442 LC 443
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 157/364 (43%), Gaps = 30/364 (8%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P+ SSTY+ + C
Sbjct: 78 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC- 136
Query: 64 DLICRRPPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
L C C+N QCV+ YA +++SG++ + +F +++L V FGC N
Sbjct: 137 TLDC-----NCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAV-FGCENV 190
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ + GI+G S++ QL K+ FS C Y ++ +++ G
Sbjct: 191 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLC--YGGMDVGGGAMVLGGISP 248
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
DM + RS +Y + L++I VA R+ P F +G G ++D+G
Sbjct: 249 ---PSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVF----DGKHGSVLDSGTTYA 301
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF------D 293
++ + + SF + + + + + C+ + S TF +
Sbjct: 302 YLPEEAFLAFKEAIVKELQSFSQISGPDPNYN-DLCFSGAGIDVSQLSKTFPVVDMIFGN 360
Query: 294 RADFKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
+ + P YM+ + G +C+ I + D +++G ++T +YD I F
Sbjct: 361 GHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFWK 420
Query: 351 ENCA 354
NCA
Sbjct: 421 TNCA 424
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 144/362 (39%), Gaps = 35/362 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y GTP+++ + D + W C C C S+P F+P SSTY+ +PC
Sbjct: 83 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-ASSPSFSPTQSSTYRTVPCGSPQ 141
Query: 67 CRR-PPFRCENG---QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + P C G C + YA + L + L+N +V FGC
Sbjct: 142 CAQVPSPSCPAGVGSSCGFNLTYAASTFQAVL---GQDSLALENNVVV--SYTFGCLRVV 196
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G++GF P S L Q K T +FSYCL YR + L+ G Q
Sbjct: 197 SGNSVPPQ--GLIGFGRGPLSFLSQTKDTYGSVFSYCLPN-YRSSNFSGTLKLGPIG--Q 251
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K +KT + + R S YY+++ I V + A G +ID G + T
Sbjct: 252 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 311
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRYDSRFRAYASMTFHFDRADFK 298
+ Y V F GR R A ++ CY + ++TF F A
Sbjct: 312 LAAPVYAAVRDAFR------GRVRTPVAPPLGGFDTCYNVTV---SVPTVTFMFAGAVAV 362
Query: 299 VEPTYMYFIFQNE-GYFCVAISFSDRN------SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
P I + G C+A++ + +V+ + QQQ+ R ++D+ G + F E
Sbjct: 363 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 422
Query: 352 NC 353
C
Sbjct: 423 LC 424
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 146/365 (40%), Gaps = 37/365 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCD- 63
Y V + GTP+ + +L DTGS L W QC PC +C+ Q P+++P ASSTY +PCD
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDS 186
Query: 64 ----DLICRRPPFRCENGQ----CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
DL+ C N C + I Y + G+ STET T + V V
Sbjct: 187 KACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQ---VSVKDFG 243
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC + G+LG +P SL+ Q T G FSYCL T L
Sbjct: 244 FGCGLVQQGTFD--LFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGN---STTGFLAL 298
Query: 176 GKDANIQRKD---MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
G N + ++++ Y ++L +SV + P +GG +I
Sbjct: 299 GAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVL------SGGMII 352
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFH 291
D+G I T + Y + F +++ +N + + CY + ++
Sbjct: 353 DSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNN-DDVLDTCYNFTGIANVTVPTVALT 411
Query: 292 FDRA---DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
FD D V + Q+ F S D ++G Q+ +YD G + F
Sbjct: 412 FDGGATIDLDVP---SGVLIQDCLAFAGGASDGDVG-IIGNVNQRTFEVLYDSGRGHVGF 467
Query: 349 VPENC 353
P C
Sbjct: 468 RPGAC 472
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 154/384 (40%), Gaps = 52/384 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + GTP ++ ++ DTGS L W C N FN IFNP AS TY +IPC
Sbjct: 64 NVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEPN-FNS---IFNPLASKTYTKIPCS 119
Query: 64 DLICRRP------PFRCENGQCVH-RINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
C P C+ + H I+YA +S G ++ ETF + V P +F
Sbjct: 120 SPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETF----RVGSVTGPATVF 175
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC + + + D G++G + S + Q+ FSYC+ + +++ +L
Sbjct: 176 GCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRK---FSYCI----SDRDSSGVLL 228
Query: 175 FGKDANIQRK--------DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G+ + K +M T + DR + Y + L+ I V+D + F G
Sbjct: 229 LGEASFSWLKPLNYTPLVEMSTPLPYFDRVA-YSVQLEGIRVSDKVLSLPKSVFVPDHTG 287
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-----DWEYCYRYDSR 281
G M+D+G TF+ Y + + F + G R+ N + CY +
Sbjct: 288 AGQTMVDSGTQFTFLLGPVYSALKQEF--LLQTKGVLRVLNEPRYVFQGAMDLCYLIEPT 345
Query: 282 FRAYASM---TFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSD----RNSVVGA 329
A ++ F A+ V + + E +C SD + V+G
Sbjct: 346 RAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGH 405
Query: 330 WQQQDTRFVYDLNTGTIQFVPENC 353
QQQ+ YDL I F C
Sbjct: 406 HQQQNVWMEYDLEKSRIGFAEVRC 429
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 157/373 (42%), Gaps = 51/373 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA----PIFNPNASSTYKR 59
N T+ + GTP ++ ++ DTGS L W C N +A P FNPN SS+Y
Sbjct: 63 NVSLTISITVGTPPQNMSMVIDTGSELSWLHC-----NTNTTATIPYPFFNPNISSSYTP 117
Query: 60 IPCDDLICRR------PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVP 112
I C C P C+ N C ++YA +S+ G ++++TF F P
Sbjct: 118 ISCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFN----P 173
Query: 113 GVIFGCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV---YAYREM 167
G++FGC N + + D N G++G ++ SL+ QLK FSYC+ ++ +
Sbjct: 174 GIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPK---FSYCISGSDFSGILL 230
Query: 168 EATSILRFGKDAN-IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
S +G N + T + DRS+ Y + L+ I ++D + + F G
Sbjct: 231 LGESNFSWGGSLNYTPLVQISTPLPYFDRSA-YTVRLEGIKISDKLLNISGNLFVPDHTG 289
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFT-SFGRQRMHNASE-----DWEYCYRY-- 278
G M D G +++ GP +R DE + G R + + CYR
Sbjct: 290 AGQTMFDLGTQFSYL-LGPVYNALR--DEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPV 346
Query: 279 -DSRFRAYASMTFHFDRADFKVEPTYMY-----FIFQNEGYFCVAISFSDRNSV----VG 328
S S++ F+ A+ +V + F++ N+ +C SD V +G
Sbjct: 347 NQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIG 406
Query: 329 AWQQQDTRFVYDL 341
QQ +DL
Sbjct: 407 HHHQQSMWMEFDL 419
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 128/316 (40%), Gaps = 43/316 (13%)
Query: 62 CDDLI---CRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF----HLKNKLVCVPGV 114
C D++ C RP C +R NY G G+ +TE FTF VP +
Sbjct: 8 CSDILHHSCERP------DTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-L 60
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSIL 173
FGC + N +G +GI+GF +P SL+ QL FSYCL YA R S L
Sbjct: 61 GFGCGSVNVGSLNNG--SGIVGFGRNPLSLVSQLSIRR---FSYCLTSYASRRQ---STL 112
Query: 174 RFGKDAN------IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
FG ++ R + + YY+ ++V R+ FALR +G+
Sbjct: 113 LFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGS 172
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE-YCYRYDSRFRAYA 286
GG ++D+G T + V+R F + R N + C+ + +R +
Sbjct: 173 GGVIVDSGTALTLLPAAVLAEVVRAFRQQL----RLPFANGGNPEDGVCFLVPAAWRRSS 228
Query: 287 S--------MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRF 337
S M HF AD + G C+ ++ S D S +G QQD R
Sbjct: 229 STSQMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRV 288
Query: 338 VYDLNTGTIQFVPENC 353
+YDL T+ P C
Sbjct: 289 LYDLEAETLSIAPARC 304
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 159/378 (42%), Gaps = 45/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y ++ GTP K ++ DTGS ++W C+ C C +S ++P ASS+ +
Sbjct: 83 LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTV 142
Query: 61 PCDDLICR-----RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C + P N C + + Y G+S +G T+ F + PG
Sbjct: 143 SCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGN 202
Query: 114 --VIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
V FGC + D S + + GILGF + S+L QL + + +F++CL +
Sbjct: 203 ATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCL----DTI 258
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRNG 226
+ I G N+ + +KT + D HY ++L+ I V + F R G
Sbjct: 259 KGGGIFAIG---NVVQPKVKTTPLVADM-PHYNVNLKSIDVGGTTLQLPAHVFETGERKG 314
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
T +ID+G T++ ++ VM F HN +D+ C++Y +
Sbjct: 315 T---IIDSGTTLTYLPELVFKEVMAAI---FNKHQDIVFHNV-QDF-MCFQYPGSVDDGF 366
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNEGYFCV-----AISFSDRNSVV--GAWQQQDTRF 337
++TFHF D V P + YF +CV A+ D +V G +
Sbjct: 367 PTITFHFEDDLALHVYP-HEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLV 425
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 426 IYDLENQVIGWTDYNCSS 443
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 139/358 (38%), Gaps = 32/358 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP ++ + D W C CV C S+ +FN S+T+K + C
Sbjct: 35 YIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVKSTTFKTLGCGAPQ 91
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C Y S L + +T + VP FGC
Sbjct: 92 CKQVPNPICGGSTCTWNTTYGSSTILSNL-TRDTIALSMDP----VPYYAFGCIQKATGS 146
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
S G+LGF P S L Q ++ + FSYCL ++R + + LR G R
Sbjct: 147 SVPPQ--GLLGFGRGPLSFLSQTQNLYKSTFSYCL-PSFRTLNFSGSLRLGPVGQPPRIK 203
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ RSS YY+ L I V + A G + D+G + T +
Sbjct: 204 TTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPA 263
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADFKVEP 301
Y V F R+R+ NA+ ++ CY S ++TF F + + P
Sbjct: 264 YIAVRNEF--------RKRVGNATVSSLGGFDTCY---SVPIVPPTITFMFSGMNVTMPP 312
Query: 302 TYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ C+A++ + N +V+ + QQQ+ R ++D+ + E C+
Sbjct: 313 ENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 145/356 (40%), Gaps = 25/356 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIP---- 61
Y + GTP+KS ++ DTGS L W QC PC V+C QS P+FNP ASS+Y +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 62 -CDDLICRR-PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C DL P C C+++ +Y + + G +S +T +F + VP +GC
Sbjct: 189 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPNFYYGC 244
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL + +
Sbjct: 245 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 302
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M + + S Y++ + I VA P + + + +ID+G +
Sbjct: 303 GQYSYTPMASSSL---DDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVI 354
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + G Y + + R +A + C++ + +T F
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTPRA---SAFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + C+A + + +++G QQQ VYD+ I F C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 162/385 (42%), Gaps = 54/385 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G+P ++ ++ DTGS L W C N + +F+P SS+Y IPC
Sbjct: 53 NVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCT 108
Query: 64 DLICR------RPPFRCENGQCVHR-INYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
CR P C+ + H I+YA +S G ++++ TFH+ N +P IF
Sbjct: 109 SPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASD--TFHIGNS--AIPATIF 164
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSI 172
GC + + + D G++G + S + Q+ GL FSYC+ +++ I
Sbjct: 165 GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM-----GLQKFSYCI----SGQDSSGI 215
Query: 173 LRFGKDANIQRKDMK--------TIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
L FG+ + K +K T + DR + Y + L+ I VA+ + +A
Sbjct: 216 LLFGESSFSWLKALKYTPLVQISTPLPYFDRVA-YTVQLEGIKVANSMLQLPKSVYAPDH 274
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS----EDWEYCYRYDS 280
G G M+D+G TF+ GP +++ T + + + + + CYR
Sbjct: 275 TGAGQTMVDSGTQFTFL-LGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPL 333
Query: 281 RFRA---YASMTFHFDRADFKVEPTYMYF-----IFQNEGYFCVAISFSD----RNSVVG 328
R ++T F A+ V + + I ++ +C S+ + ++G
Sbjct: 334 TRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIG 393
Query: 329 AWQQQDTRFVYDLNTGTIQFVPENC 353
QQ+ +DL + F C
Sbjct: 394 HHHQQNVWMEFDLAKSRVGFAEVRC 418
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 162/385 (42%), Gaps = 54/385 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G+P ++ ++ DTGS L W C N + +F+P SS+Y IPC
Sbjct: 60 NVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCT 115
Query: 64 DLICR------RPPFRCENGQCVHR-INYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
CR P C+ + H I+YA +S G ++++ TFH+ N +P IF
Sbjct: 116 SPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASD--TFHIGNS--AIPATIF 171
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSI 172
GC + + + D G++G + S + Q+ GL FSYC+ +++ I
Sbjct: 172 GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM-----GLQKFSYCI----SGQDSSGI 222
Query: 173 LRFGKDANIQRKDMK--------TIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
L FG+ + K +K T + DR + Y + L+ I VA+ + +A
Sbjct: 223 LLFGESSFSWLKALKYTPLVQISTPLPYFDRVA-YTVQLEGIKVANSMLQLPKSVYAPDH 281
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS----EDWEYCYRYDS 280
G G M+D+G TF+ GP +++ T + + + + + CYR
Sbjct: 282 TGAGQTMVDSGTQFTFL-LGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPL 340
Query: 281 RFRA---YASMTFHFDRADFKVEPTYMYF-----IFQNEGYFCVAISFSD----RNSVVG 328
R ++T F A+ V + + I ++ +C S+ + ++G
Sbjct: 341 TRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIG 400
Query: 329 AWQQQDTRFVYDLNTGTIQFVPENC 353
QQ+ +DL + F C
Sbjct: 401 HHHQQNVWMEFDLAKSRVGFAEVRC 425
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 156/381 (40%), Gaps = 41/381 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC----LPCVNCFNQSA---PIFNPNASSTYKR 59
Y V + FGTP + L+ DTGS LIW QC P C ++ P F + S+T
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 113
Query: 60 IPCDDLICRRPPFRCENG---------QCVHRINYAGGASASGLVSTETFTF-HLKNKLV 109
+PC C P +G C + +YA G+S +G ++ +T T + +
Sbjct: 114 VPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGA 173
Query: 110 CVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL--VYAYREM 167
V GV FGC N+ SF G G++G S Q S FSYCL + R
Sbjct: 174 AVRGVAFGCGTRNQGGSFSGT-GGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 232
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDR---SSHYYLSLQDISVADHRIGFAPGTFALRR 224
++S L G+ +R+ V + YY+ + I V + + +A+
Sbjct: 233 RSSSFLFLGRP---ERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDV 289
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS---EDWEYCYRYDSR 281
G GG +ID+G+ T+++ G Y ++ F S R+ +++ + E CY S
Sbjct: 290 LGNGGTVIDSGSTLTYLRLGAYLHLVSAFA---ASVHLPRIPSSATFFQGLELCYNVSSS 346
Query: 282 FR------AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQ 332
+ +T F + PT Y + + C+AI + +V+G Q
Sbjct: 347 SSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQ 406
Query: 333 QDTRFVYDLNTGTIQFVPENC 353
Q +D + I F C
Sbjct: 407 QGYHVEFDRASARIGFARTEC 427
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 157/364 (43%), Gaps = 41/364 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP + DTGS L W QCLPC+ C+ Q PIFNP S+++ +PC+
Sbjct: 92 YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQT 151
Query: 67 CRR-PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN-DNR 123
C C G C + Y + G + E T + V VI GC + +
Sbjct: 152 CHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSS----VKSVI-GCGHASSG 206
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL---FSYCLVYAYREMEATSILRFGKDAN 180
F F +G++G SL+ Q+ T+ G+ FSYCL A + FG++A
Sbjct: 207 GFGF---ASGVIGLGGGQLSLVSQMSQTS-GISRRFSYCLPTLLS--HANGKINFGENAV 260
Query: 181 IQRKDMKTIRMFVDRS-SHYYLSLQDISVADHR-IGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + + + + ++YY++L+ IS+ + R + FA G +ID+G
Sbjct: 261 VSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAK---------QGNVIIDSGTTL 311
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM-----TFHFD 293
T + + Y+ V+ + + +R+ + + C +D A AS+ T HF
Sbjct: 312 TILPKELYDGVVSSLLKVVKA---KRVKDPHGSLDLC--FDDGINAAASLGIPVITAHFS 366
Query: 294 -RADFKVEPTYMY-FIFQNEGYFCV-AISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
A+ + P + + N + A S + ++G Q + YDL + F P
Sbjct: 367 GGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKP 426
Query: 351 ENCA 354
CA
Sbjct: 427 TVCA 430
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 152/376 (40%), Gaps = 51/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V + G P + +L DTGS L W QC PCV+C P++ P + IPC+D
Sbjct: 56 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHCLEAPHPLYQP----SNDLIPCND 111
Query: 65 LICRRPPF----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C+ F RCE QC + + YA G S+ G++ + F+ + L P + GC
Sbjct: 112 PLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSLNYTKGLRLTPRLALGCG 171
Query: 120 NDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
D S + G+LG S+L QL S + + +CL IL FG
Sbjct: 172 YDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCL-----SSLGGGILFFG 226
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D + + M + S HY ++ + F T L+ T + D+G+
Sbjct: 227 NDLYDSSR-VSWTPMARENSKHYSPAMGG------ELLFGGRTTGLKNLLT---VFDSGS 276
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF-- 292
T+ Y+ V + + + A +D C++ F + + +F
Sbjct: 277 SYTYFNSKAYQAVTYLLKRELSG---KPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKP 333
Query: 293 ----------DRADFKVEPTYMYFIFQNEGYFCVAI----SFSDRN-SVVGAWQQQDTRF 337
+ F++ P Y I +G C+ I +N +++G QD
Sbjct: 334 LALSFKTGWRSKTLFEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMI 392
Query: 338 VYDLNTGTIQFVPENC 353
+YD +I ++P +C
Sbjct: 393 IYDNEKQSIGWIPADC 408
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 156/382 (40%), Gaps = 47/382 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNC-FNQSAPIFNPNASSTYKRIPC 62
Y++ + FGTP ++ + DTGS +W C C NC F F P SS+ K I C
Sbjct: 77 YSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGC 136
Query: 63 DDLIC---RRPPFRCENGQ---------CVHRINYAGGASASGLVSTETFTFHLKNKLVC 110
+ C + RC + C + G + G+ +ET H +
Sbjct: 137 KNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSETLHLHG----LI 192
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEA 169
VP + GCS FS AGI GF P SL QL T FSYCL+ + + + +
Sbjct: 193 VPNFLVGCS----VFS-SRQPAGIAGFGRGPSSLPSQLGLTK---FSYCLLSHKFDDTQE 244
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDR----------SSHYYLSLQDISVADHRIGFAPGT 219
+S L ++ +K + + + S +YY+SL+ IS+ +
Sbjct: 245 SSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKY 304
Query: 220 FALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD 279
+ ++G GG +ID+G T++ +E++ F ++ R M A + C+
Sbjct: 305 LSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVS 364
Query: 280 -SRFRAYASMTFHFDRADFKVEPTYMYFIF---QNEGYFCVAISFSDRNS----VVGAWQ 331
++ + HF P YF F + F V +++ S ++G +Q
Sbjct: 365 GAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNFQ 424
Query: 332 QQDTRFVYDLNTGTIQFVPENC 353
Q+ YDL + F E+C
Sbjct: 425 MQNFYVEYDLQNERLGFKKESC 446
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 141/355 (39%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP ++FDTGS W QC PC V C+ Q +F+P SSTY + C
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ + G + +T T + V G FGC N
Sbjct: 240 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 296
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++CL T L FG +
Sbjct: 297 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP---ARSTGTGYLDFGAGSLAAAS 351
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T M D + YY+ + I V + FA T G ++D+G + T +
Sbjct: 352 ARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSGTVITRLPP 406
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEP 301
Y +R+ + + A + CY + + A +++ F A V+
Sbjct: 407 AAYS-SLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + + C+A + ++ +VG Q + YD+ + F P C
Sbjct: 466 SGIMYA-ASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 156/372 (41%), Gaps = 46/372 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F+P +SSTYK I C+
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D IC + QCV+ YA +++SG++ + +F +++L+ V FGC N
Sbjct: 140 IDCIC-----DSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAV-FGCENME 193
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F GI+G SL+ QL S+ L Y ++ +++ G
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISP--- 250
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM RS +Y + L++I VA ++ + G F +G G ++D+G ++
Sbjct: 251 PSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLP 306
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
E F++F + + + + D F+ D A+ +
Sbjct: 307 A-----------EAFSAF-KDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFP 354
Query: 303 YMYFIFQN------------------EGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLN 342
+ +F+N G +C+ I + +D+ +++G ++T +YD
Sbjct: 355 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRA 414
Query: 343 TGTIQFVPENCA 354
I F NC+
Sbjct: 415 NSKIGFWKTNCS 426
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 156/372 (41%), Gaps = 46/372 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F+P +SSTYK I C+
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D IC + QCV+ YA +++SG++ + +F +++L+ V FGC N
Sbjct: 140 IDCIC-----DSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAV-FGCENME 193
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F GI+G SL+ QL S+ L Y ++ +++ G
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISP--- 250
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM RS +Y + L++I VA ++ + G F +G G ++D+G ++
Sbjct: 251 PSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLP 306
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPT 302
E F++F + + + + D F+ D A+ +
Sbjct: 307 A-----------EAFSAF-KDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFP 354
Query: 303 YMYFIFQN------------------EGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLN 342
+ +F+N G +C+ I + +D+ +++G ++T +YD
Sbjct: 355 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRA 414
Query: 343 TGTIQFVPENCA 354
I F NC+
Sbjct: 415 NSKIGFWKTNCS 426
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 159/376 (42%), Gaps = 42/376 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTPSK ++ DTGS ++W C C C +S +++ AS+T +
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213
Query: 61 PCDDLICRR---PPFRCENG-QCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP--- 112
CDD C P C+ G QC++ + Y G+S +G + ++ + P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 273
Query: 113 GVIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREME 168
V+FGC N S + GILGF + S+L QL S+ + +FS+CL ++
Sbjct: 274 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL----DNVD 329
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
I G+ + I V +HY + +++I V + F
Sbjct: 330 GGGIFAIGEVVEPKV----NITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAF--ESGDRK 383
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYAS 287
G +ID+G + P EV + ++ + R+H + + C+ Y + +
Sbjct: 384 GTIIDSGTTLAYF---PQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPT 439
Query: 288 MTFHFDRA-DFKVEPTYMYFIFQNEGYFCVAISFSDRN-------SVVGAWQQQDTRFVY 339
+T HFD++ V P ++FQ+E +C+ S +++G + VY
Sbjct: 440 VTLHFDKSISLTVYP--HEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVY 497
Query: 340 DLNTGTIQFVPENCAN 355
DL I +V NC++
Sbjct: 498 DLEKQGIGWVEYNCSS 513
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 149/384 (38%), Gaps = 52/384 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N T + GTP ++ ++ DTGS L W +C N IFNP AS TY +IPC
Sbjct: 64 NVTLTASLTIGTPPQNITMVLDTGSELSWLRCKKEPNF----TSIFNPLASKTYTKIPCS 119
Query: 64 DLICRRP------PFRCENGQCVH-RINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
C+ P C+ + H I+YA +S G ++ ETF F + P +F
Sbjct: 120 SQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTR----PATVF 175
Query: 117 GC--SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC S + + D G++G + S + Q+ FSYC+ +++T L
Sbjct: 176 GCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRK---FSYCI----SGLDSTGFLL 228
Query: 175 FGKDANIQRKDM--------KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G+ K + T + DR + Y + L+ I V + + F G
Sbjct: 229 LGEARYSWLKPLNYTPLVQISTPLPYFDRVA-YSVQLEGIKVNNKVLPLPKSVFVPDHTG 287
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-----DWEYCYRYDSR 281
G M+D+G TF+ Y + + F + G R+ N + + CY DS
Sbjct: 288 AGQTMVDSGTQFTFLLGPVYSALRKEF--LLQTAGVLRVLNEPQYVFQGAMDLCYLIDST 345
Query: 282 FRAYASM---TFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNSV----VGA 329
++ F A+ V + + E +C SD + +G
Sbjct: 346 SSTLPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGH 405
Query: 330 WQQQDTRFVYDLNTGTIQFVPENC 353
QQQ+ YDL I F C
Sbjct: 406 HQQQNVWMEYDLENSRIGFAELRC 429
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 142/355 (40%), Gaps = 23/355 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V GTP+ ++FDTGS W QC PC V C+ Q +F+P SSTY + C
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAP 237
Query: 66 ICRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C C G C++ + Y G+ + G + +T T + V G FGC N
Sbjct: 238 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 294
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G AG+LG SL Q G+F++CL T L FG +
Sbjct: 295 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP---ARSTGTGYLDFGAGSPAAAS 349
Query: 185 DMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
T M D + YY+ + I V + FA T G ++D+G + T +
Sbjct: 350 ARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSGTVITRLPP 404
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-RADFKVEP 301
Y +R+ + + A + CY + + A +++ F A V+
Sbjct: 405 PAYS-SLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 463
Query: 302 TYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + + + C+A + ++ +VG Q + YD+ + F P C
Sbjct: 464 SGIMYA-ASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 113/267 (42%), Gaps = 20/267 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN---CFNQSAPIFNPNASSTYKRIPCD 63
Y + V G+P+ ++ ++ DTGS + W QC PC C + +F+P ASSTY C
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194
Query: 64 DLICRRPPFRCE-NG-----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C + E NG +C + + Y G++ +G S++ T + V G FG
Sbjct: 195 AAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSD---VVRGFQFG 251
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
CS+ D G++G SL+ Q + FSYCL ++
Sbjct: 252 CSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTLGAPAS 311
Query: 178 DANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
T M + ++Y+ +L+DI+V ++G +P FA G ++D+G
Sbjct: 312 GGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA------GSLVDSG 365
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGR 262
+ T + Y + F T + R
Sbjct: 366 TVITRLPPAAYAALSSAFRAGMTRYAR 392
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 155/374 (41%), Gaps = 39/374 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + G+P K ++ DTGS ++W C PC C +++ ASST K +
Sbjct: 76 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNV 135
Query: 61 PCDDLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP---G 113
C+D C + C + + Y G+++ G + T + L P
Sbjct: 136 GCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQE 195
Query: 114 VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYREMEA 169
V+FGC + + + GI+GF S S++ QL + + + +FS+CL M
Sbjct: 196 VVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCL----DNMNG 251
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
I G+ ++ +KT + V HY + L+ + V I P + NG GG
Sbjct: 252 GGIFAIGE---VESPVVKTTPL-VPNQVHYNVILKGMDVDGEPIDLPPSLAS--TNGDGG 305
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYASM 288
+ID+G ++ + Y ++ E T+ + ++H E + C+ + S +A+ +
Sbjct: 306 TIIDSGTTLAYLPQNLYNSLI----EKITAKQQVKLHMVQETFA-CFSFTSNTDKAFPVV 360
Query: 289 TFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRNSVV--GAWQQQDTRFVYDL 341
HF D V P F + + G+ ++ D V+ G + VYDL
Sbjct: 361 NLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDL 420
Query: 342 NTGTIQFVPENCAN 355
I + NC++
Sbjct: 421 ENEVIGWADHNCSS 434
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 148/358 (41%), Gaps = 24/358 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V+ G P ++ + DTGS L W QC PC+NC Q P++NP++SSTY D
Sbjct: 110 FLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVSC--SDFD 167
Query: 67 CRRPPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNK-LVCVPGVIFGCSNDNRD 124
F +G C + YA + G + E F + + + VIFGC ++N
Sbjct: 168 RTDTTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGCGHNNTQ 227
Query: 125 F-SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
G +G+ G S S++ +L FSYC+ + L G I+
Sbjct: 228 LPGPTGYASGVFGLGDSGSSIISKLGFG----FSYCIGNIGDPLYGFHRLTLGNKLKIEG 283
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA-LRRNGTGG-CMIDTGAIATFI 241
+ + YY++L IS+ R+ P F + NG +ID+GA ++I
Sbjct: 284 YSTPLVPRGL-----YYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATLSYI 338
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY--RYDSRFRAYASMTFHF-DRADFK 298
R Y VV + F R + CY + + + + TFH D AD
Sbjct: 339 PRQAYNVVRDKVSSILSGF-LSRYRYIARHLSLCYIGKLNQDLQGFPDATFHLADGADLV 397
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ ++F + + C+A+ ++ + ++G QQ YDL + F C
Sbjct: 398 FQVEGLFFQY-TDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQKLYFQRIEC 454
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 166/380 (43%), Gaps = 46/380 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTPSK +L DTG+ ++W C+ C C +S ++N SS+ K +
Sbjct: 72 LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLV 131
Query: 61 PCDDLICRRPPFRCENGQCVHRIN--------YAGGASASGLVSTETFTFHLKN---KLV 109
PCD +C+ G C + N Y G+S +G + F + K
Sbjct: 132 PCDQELCKEINGGLLTG-CTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTA 190
Query: 110 CVPG-VIFGC-SNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTA--QGLFSYCLVYA 163
G VIFGC + + D S+ A GILGF + +S++ QL S+ + +F++CL
Sbjct: 191 SANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL--- 247
Query: 164 YREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR 223
+ I G ++ + + T + D+ HY +++ I V + + T A
Sbjct: 248 -NGVNGGGIFAIG---HVVQPTVNTTPLLPDQ-PHYSVNMTAIQVGHTFLNLS--TDASE 300
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
+ + G +ID+G ++ G Y+ ++ + Q +H+ ++Y D F
Sbjct: 301 QRDSKGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQYSGSVDDGF- 359
Query: 284 AYASMTFHFDRA-DFKVEPTYMYFIFQNEGYFCV------AISFSDRN-SVVGAWQQQDT 335
++TF+F+ KV P ++F +E +C+ A S +N +++G +
Sbjct: 360 --PNVTFYFENGLSLKVYP--HDYLFLSENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNK 415
Query: 336 RFVYDLNTGTIQFVPENCAN 355
YDL I + NC++
Sbjct: 416 LVFYDLENQVIGWTEYNCSS 435
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 155/367 (42%), Gaps = 36/367 (9%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P +SSTYK + C+
Sbjct: 85 NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCN 144
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
P C E QC + YA +S+SGL++ + +F +++L IFGC
Sbjct: 145 ------PSCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQ-RAIFGCETV 197
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
F GI+G P S++ QL K FS C Y ++ +++
Sbjct: 198 ETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLC--YGGMDVVGGAMVL----G 251
Query: 180 NI-QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
NI DM RS++Y + L+++ VA R+ P F +G G ++D+G
Sbjct: 252 NIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVF----DGKHGTVLDSGTTY 307
Query: 239 TFIQRGPYEVVMRHFDEHFTSFG-RQRMHNASEDW-EYCYRYDSRFRAYASMTFHF---- 292
++ P E + D +++H + + C+ R + S F
Sbjct: 308 AYL---PEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMV 364
Query: 293 --DRADFKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
+ + P Y++ + G +C+ I + D +++G ++T YD + I
Sbjct: 365 FGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIG 424
Query: 348 FVPENCA 354
F NC+
Sbjct: 425 FWKTNCS 431
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 149/365 (40%), Gaps = 27/365 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
+YFYT + GTP ++ ++ DTGS + + C C +C +A F+P+ S+T K++ C
Sbjct: 11 SYFYTT-LKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACG 69
Query: 64 DLICR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
D +C P C N +C + YA +S+ G + +TF F + V ++FGC N
Sbjct: 70 DPLCNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---RLVFGCENG 126
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ GI+G + + QL + + +FS C Y IL G
Sbjct: 127 ETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGY-----PKDGILLLGDVT 181
Query: 180 NIQRKDMKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + + HYY + + I+V + F F R GT ++D+G
Sbjct: 182 LPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFD-RGYGT---VLDSGTTF 237
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYR-----YDSRFRAYASMTFHF 292
T++ ++ + + ++ G Q A + + C++ + + + F F
Sbjct: 238 TYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFVF 297
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDLNTGTIQFVP 350
P Y +C+ I F + NS +VG +D YD + F
Sbjct: 298 GGGAKLTLPPLRYLFLSKPAEYCLGI-FDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTT 356
Query: 351 ENCAN 355
CA+
Sbjct: 357 MACAD 361
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 149/365 (40%), Gaps = 30/365 (8%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C N P F P+ SSTY + C
Sbjct: 82 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCS 141
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QC + YA +S+SG++ + +F +++L V FGC N
Sbjct: 142 ADCTC-----DSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAV-FGCENSE 195
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + GI+G S++ QL S+ + Y ++ +++ A
Sbjct: 196 TGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAP-- 253
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM R RS +Y + L++I VA + P F + G ++D+G ++
Sbjct: 254 -PDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIF----DSKHGTVLDSGTTYAYLP 308
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY---CYRYDSRFRAYASMTFH------FD 293
E F + TS R D Y C+ R + S F D
Sbjct: 309 ----EQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGD 364
Query: 294 RADFKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
+ P Y++ + EG +C+ + + D +++G ++T YD + I F
Sbjct: 365 GQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWK 424
Query: 351 ENCAN 355
NC+
Sbjct: 425 TNCSE 429
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 146/370 (39%), Gaps = 37/370 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP-CVNCFNQSAPIFNPNASSTYKR 59
H FY V++ GTP + + D G L+WTQC C CF Q P+F+ NASST++
Sbjct: 45 HFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRP 104
Query: 60 IPCDDLICRRPPFRCENGQCVHRINYAGGAS---ASGLVSTETFTFHLKNKLVCVPGVIF 116
PC +C P R G Y S G + T+ + F
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATAR----LAF 160
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC+ + + G+ +G +G + SL Q+ +TA FSYCL A + +S L G
Sbjct: 161 GCAVASEMDTMWGS-SGSVGLGRTNLSLAAQMNATA---FSYCL--APPDTGKSSALFLG 214
Query: 177 KDANIQRKDMKT-IRMFVDRSS--------HYYLSLQDISVADHRIGFAPGTFALRRNGT 227
A + FV S+ Y L L+ I R G A T A+ ++G
Sbjct: 215 ASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAI-----RAGNA--TIAMPQSGN 267
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
+ T T + Y + + + + G + ++++ C+ S
Sbjct: 268 -TITVSTATPVTALVDSVYRDLRKAVAD---AVGAAPVPPPVQNYDLCFPKASASGGAPD 323
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR---NSVVGAWQQQDTRFVYDLNTG 344
+ F P Y CVAI S S++G+ QQ + ++DL+
Sbjct: 324 LVLAFQGGAEMTVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKE 383
Query: 345 TIQFVPENCA 354
T+ F P +C+
Sbjct: 384 TLSFEPADCS 393
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 139/377 (36%), Gaps = 51/377 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P K FL D+GS L W QC PC +C P++ P S K +PC
Sbjct: 63 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKS---KLVPCVH 119
Query: 65 LICRR-------PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+C RCE+ QC + I YA S++G++ ++F L N V P V
Sbjct: 120 RLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLTNGSVARPSVA 179
Query: 116 FGCSNDN--RDFSFDGNIAGILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATS 171
FGC D R G+LG SLL QLK + + +CL +
Sbjct: 180 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL-----SLRGGG 234
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
L FG D ++ T ++Y + D +G +
Sbjct: 235 FLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVR----------LAKVV 284
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA------- 284
D+G+ T+ PY+ ++ + + R C++ F++
Sbjct: 285 FDSGSSFTYFAAKPYQALVTALKDGLS---RTLEEEPDTSLPLCWKGQEPFKSVLDVRKE 341
Query: 285 YASMTFHF--DRADFKVEPTYMYFIFQNEGYFCVA------ISFSDRNSVVGAWQQQDTR 336
+ S+ +F + P Y I G C+ I D S++G QD
Sbjct: 342 FKSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDL-SIIGDITMQDHM 400
Query: 337 FVYDLNTGTIQFVPENC 353
+YD G I ++ C
Sbjct: 401 VIYDNEKGKIGWIRAPC 417
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 145/343 (42%), Gaps = 31/343 (9%)
Query: 16 PSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPF 72
P + ++ D+ S + W QC+PC C Q ++P+ S + C C P+
Sbjct: 155 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPY 214
Query: 73 R--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
C N QC + + Y G+S SG + T N V G FGCS+ + SFD
Sbjct: 215 ANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA---VSGFKFGCSHAEQG-SFDAR 270
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIR 190
AGI+ P SLL Q S FSYC + A L + A+ + +R
Sbjct: 271 AAGIMALGGGPESLLSQTASRYGNAFSYC-IPATASDSGFFTLGVPRRASSRYVVTPMVR 329
Query: 191 MFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVM 250
F ++ Y + L+ I+V R+G AP FA G ++D+ T + Y+ +
Sbjct: 330 -FRQAATFYGVLLRTITVGGQRLGVAPAVFA------AGSVLDSRTAITRLPPTAYQALR 382
Query: 251 RHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDR-ADFKVEPTYMYFIF 308
F T + R + CY + ++ FDR A ++P+ + F
Sbjct: 383 SAFRSSMTMY---RSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF-- 437
Query: 309 QNEGYFCVAISFS--DR-NSVVGAWQQQDTRFVYDLNTGTIQF 348
N+ C+A + + DR V+G+ QQQ +YD+ G + F
Sbjct: 438 -ND---CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGF 476
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 145/356 (40%), Gaps = 25/356 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIP---- 61
Y + GTP+KS ++ DTGS L W QC PC V+C QS P+FNP ASS+Y +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 62 -CDDLICRR-PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C DL P C C+++ +Y + + G +S +T +F + VP +GC
Sbjct: 187 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPNFYYGC 242
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL + +
Sbjct: 243 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 300
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M + + S Y++ + I VA P + + + +ID+G +
Sbjct: 301 GQYSYTPMASSSL---DDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVI 352
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + G Y + + R +A + C++ + +T F
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRA---SAFSILDTCFQGQAARLRVPEVTMAFAGGAAL 409
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + C+A + + +++G QQQ VYD+ I F C+
Sbjct: 410 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 142/364 (39%), Gaps = 42/364 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+++ L DT + W C CV C + F P S+T+K++ C
Sbjct: 98 YIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTP--FAPAKSTTFKKVGCGASQ 155
Query: 67 CR--RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ R P C+ C Y + A+ LV +T T VP FGC
Sbjct: 156 CKQVRNP-TCDGSACAFNFTYGTSSVAASLVQ-DTVTLATDP----VPAYAFGC------ 203
Query: 125 FSFDGNIAGILGFSVSP----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
I + G SV P SLL Q + Q FSYCL +++ + + LR
Sbjct: 204 ------IQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCL-PSFKTLNFSGSLR 256
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
G A +R + RSS YY++L I V + P A N G + D+
Sbjct: 257 LGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDS 316
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDR 294
G + T + Y V F + ++ + ++ CY ++TF F
Sbjct: 317 GTVFTRLVEPAYNAVRNEFRRRI-AVHKKLTVTSLGGFDTCYTAPI---VAPTITFMFSG 372
Query: 295 ADFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFV 349
+ + P + C+A++ + N +V+ QQQ+ R ++D+ +
Sbjct: 373 MNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVA 432
Query: 350 PENC 353
E C
Sbjct: 433 RELC 436
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 151/370 (40%), Gaps = 52/370 (14%)
Query: 5 YFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCD 63
Y Y V GTP+K+ +L DT S L W C PC+N C P FNPNASSTYK + C
Sbjct: 124 YSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCINACL---IPTFNPNASSTYKVVGCG 180
Query: 64 DLICRRPPF--------RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+C P C +R +Y + + G+VS++T T+ L ++ I
Sbjct: 181 SALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDTLTYGLGSQ-----KFI 235
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQ-GLFSYCLVYAYREMEATSILR 174
FGC N R G +GILG SV+ FSL Q+ + SYC + + L+
Sbjct: 236 FGCCNLFR--GVGGRYSGILGMSVNKFSLFSQMTVGHRYRAMSYCFPHPRNQ----GFLQ 289
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
FG+ + + ++ +++D ++Y++ + ++ V + N T C DT
Sbjct: 290 FGR-YDEHKSLLRFTPLYID-GNNYFVHVSNVMVETMSLDVQSSG-----NQTMRCFFDT 342
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGR------QRMHNASEDW----EYCYRYDSRFRA 284
G T + + + + + R Q A +W Y F+
Sbjct: 343 GTPYTMLPQSLFVSLSDTVGNLVEGYYRVGASTGQTCFQADGNWIEGDLYMPTVKIEFQN 402
Query: 285 YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVV-GAWQQQDTRFVYDLNT 343
A +T + + F EP FC+A +D +V G+ V DL
Sbjct: 403 GARITLNSEDLMFMEEPN----------VFCLAFKMNDGGDIVLGSRHLMGVHTVVDLEM 452
Query: 344 GTIQFVPENC 353
T+ + C
Sbjct: 453 MTMGLRGQGC 462
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 163/374 (43%), Gaps = 48/374 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + G+P K + DTGS ++W C PC C N +F+ NASST K++
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKV 132
Query: 61 PCDDLICR--RPPFRCENG-QCVHRINYAGGASASG-----LVSTETFTFHLKNKLVCVP 112
CDD C C+ C + I YA +++ G +++ E T LK +
Sbjct: 133 GCDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQE 192
Query: 113 GVIFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYREME 168
V+FGC +D +G+ A G++GF S S+L QL +T A+ +FS+CL ++
Sbjct: 193 -VVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL----DNVK 247
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
I G + +KT M V HY + L + V + ++ RN G
Sbjct: 248 GGGIFAVGV---VDSPKVKTTPM-VPNQMHYNVMLMGMDVDGTSLDLPR---SIVRN--G 298
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQ--RMHNASEDWEYCYRYDSRF-RAY 285
G ++D+G + + Y+ ++ T RQ ++H E ++ C+ + + A+
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIE------TILARQPVKLHIVEETFQ-CFSFSTNVDEAF 351
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRNSVV--GAWQQQDTRFV 338
++F F D V P F + E G+ ++ +R+ V+ G + V
Sbjct: 352 PPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVV 411
Query: 339 YDLNTGTIQFVPEN 352
YDL+ I + N
Sbjct: 412 YDLDNEVIGWADHN 425
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 161/375 (42%), Gaps = 37/375 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD-L 65
Y + G+P + L+ DTGS L W QCLPC C I++ S++Y+ + C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQ 159
Query: 66 ICRRPP----FRCENG-QCVHRINYAGGASASGLVSTETFTFH--LKNKLVCVPGVIFGC 118
+C C G QC Y G+ + G +ST+T + K V V FGC
Sbjct: 160 LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGC 219
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ + + G +GILG + +L QL FS+C + +T ++ FG +
Sbjct: 220 AQGDLELVPTG-ASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFG-N 277
Query: 179 ANIQRKDMKTIRMFVDRS----SHYYLSLQDISVADHRIGFAP-GTFALRRNGTGGCMID 233
A + + ++ + + S Y+++L+ +S+ H + F P G+ + +G+
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVVILDSGS------ 331
Query: 234 TGAIATFIQRGPYEVVMRH-FDEHFTSFGRQRMHNASEDWEYCYRY-----DSRFRAYAS 287
+ ++F++ P+ +R F +H + ++ D C++ D R S
Sbjct: 332 --SFSSFVR--PFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPS 387
Query: 288 MTFHFDRADFKVEPTYMYFI----FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDL 341
++ F+ P+ + FQN C A N +V+G +QQQ+ YD+
Sbjct: 388 LSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDI 447
Query: 342 NTGTIQFVPENCAND 356
+ F +C D
Sbjct: 448 QRSRVGFARASCVID 462
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 156/367 (42%), Gaps = 47/367 (12%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-PIFNPNASSTYKRI 60
++ + ++ FG+P K +FL DTGS L WTQC PC +C+ Q P + P AS TY+
Sbjct: 53 QRGLAFMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDA 112
Query: 61 PCDDLICRRPP---FRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIF 116
C+D + P F C ++ +Y + G ++ E T + V GV F
Sbjct: 113 MCEDSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYF 172
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC+ + F G GILG V +S++G+ S FS+CL E +A+ L G
Sbjct: 173 GCNTLSDGSYFTG--TGILGLGVGKYSIIGEFGSK----FSFCL-GEISEPKASHNLILG 225
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
AN+Q T+ + H L+ I V + + + +DTG+
Sbjct: 226 DGANVQGH--PTVINITE--GHTIFQLESIIVGEE----------ITLDDPVQVFVDTGS 271
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFD-R 294
+ + Y + FD+ S S + CY+ D+ R + F FD
Sbjct: 272 TLSHLSTNLYYKFVDAFDDLIGS------RPLSYEPTLCYKADTIERLEKMDVGFKFDVG 325
Query: 295 ADFKVEPTYMYFIFQNEG---YFCVAI-----SFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
A+ V ++ IF +G C+AI SFS + ++G Q YDL+ T
Sbjct: 326 AELSVN---IHNIFIQQGPPEIRCLAIQNNKESFS--HVIIGVIAMQGYNVGYDLSAKTA 380
Query: 347 QFVPENC 353
++C
Sbjct: 381 YINKQDC 387
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 140/368 (38%), Gaps = 52/368 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+++ + DT + W C CV C S+ +FN S+T+K + CD
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCDAPQ 146
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C Y G S L T L + VPG FGC
Sbjct: 147 CKQVPNPTCGGSTCTWNTTYGGSTILSNLTRD---TIALSTDI--VPGYTFGC------- 194
Query: 126 SFDGNIAGILGFSVSP----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
I G SV P S L Q + + FSYCL ++R + + LR
Sbjct: 195 -----IQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCL-PSFRTLNFSGTLRL 248
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
G R + RSS YY++L I V + A G + D+G
Sbjct: 249 GPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSG 308
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFH 291
+ T + Y V F R+R+ NA ++ CY + +MTF
Sbjct: 309 TVFTRLVAPVYTAVRDEF--------RKRVGNAIVSSLGGFDTCY---TGPIVAPTMTFM 357
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTI 346
F + + P + C+A++ + N +V+ QQQ+ R ++D+ I
Sbjct: 358 FSGMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRI 417
Query: 347 QFVPENCA 354
E C+
Sbjct: 418 GVAREPCS 425
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 156/381 (40%), Gaps = 41/381 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC----LPCVNCFNQSA---PIFNPNASSTYKR 59
Y V + FGTP + L+ DTGS LIW QC P C ++ P F + S+T
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 112
Query: 60 IPCDDLICRRPPFRCENG---------QCVHRINYAGGASASGLVSTETFTF-HLKNKLV 109
+PC C P +G C + +YA G+S +G ++ +T T + +
Sbjct: 113 VPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGA 172
Query: 110 CVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL--VYAYREM 167
V GV FGC N+ SF G G++G S Q S FSYCL + R
Sbjct: 173 AVRGVAFGCGTRNQGGSFSGT-GGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 231
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDR---SSHYYLSLQDISVADHRIGFAPGTFALRR 224
++S L G+ +R+ V + YY+ + I V + + +A+
Sbjct: 232 RSSSFLFLGRP---ERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDV 288
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS---EDWEYCYRYDSR 281
G GG +ID+G+ T+++ G Y ++ F S R+ +++ + E CY S
Sbjct: 289 LGNGGTVIDSGSTLTYLRLGAYLHLVSAFA---ASVHLPRIPSSATFFQGLELCYNVSSS 345
Query: 282 FR------AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQ 332
+ +T F + PT Y + + C+AI + +V+G Q
Sbjct: 346 SSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQ 405
Query: 333 QDTRFVYDLNTGTIQFVPENC 353
Q +D + I F C
Sbjct: 406 QGYHVEFDRASARIGFARTEC 426
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/394 (24%), Positives = 156/394 (39%), Gaps = 62/394 (15%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI--------FNPNASSTYK 58
Y+V + FGTP ++ +FDTGS L+W C C S P F P SS+ K
Sbjct: 132 YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVK 191
Query: 59 RIPCDDLICR---RPPF--RCENGQCVHR----------INYAGGASASGLVSTETFTFH 103
+ C + C P RC N R + Y GA+A L+S T
Sbjct: 192 VVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLSE---TLD 248
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-- 161
L+NK VP + GCS + AGI GF P SL Q++ FS+CLV
Sbjct: 249 LENKR--VPDFLVGCSVMSVH-----QPAGIAGFGRGPESLPSQMRLKR---FSHCLVSR 298
Query: 162 -YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSS--------HYYLSLQDISVADHR 212
+ + + +L G +++ + F + S +YYLSL+ I +
Sbjct: 299 GFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKP 358
Query: 213 IGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW 272
+ F G GG +ID+G+ TF+ + +E + ++ + R + A
Sbjct: 359 VKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGL 418
Query: 273 EYCYRYDSRFRA--YASMTFHFD---RADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-- 325
C+ + + + F + E Y+ + +EG C+ + +
Sbjct: 419 RPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAE-NYLAMV-TDEGVVCLTMMTDEAVVGG 476
Query: 326 ------VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++GA+QQQ+ YDL I F + C
Sbjct: 477 GGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/376 (23%), Positives = 145/376 (38%), Gaps = 49/376 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
+Y+ T+++ G P+K FL DTGS L W QC PC +C P++ P + K +PC
Sbjct: 51 HYYVTMNI--GDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPTKN---KLVPC 105
Query: 63 DDLIC------RRPPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
IC + P +C QC ++I Y AS+ G++ T+ FT L+N P
Sbjct: 106 AASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSSSVRPSFT 165
Query: 116 FGCSNDN---RDFSFDGNIAGILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEAT 170
FGC D ++ G+LG SL+ QLK + + +CL
Sbjct: 166 FGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCL-----STNGG 220
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHR-IGFAPGTFALRRNGTGG 229
L FG D + + M S +YY D R +G P
Sbjct: 221 GFLFFG-DNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEV--------- 270
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----- 284
+ D+G+ T+ PY+ + + + + C++ F++
Sbjct: 271 -VFDSGSTYTYFAAQPYQATVSALKAGLS---KSLQQVSDPSLPLCWKGQKVFKSVSDVK 326
Query: 285 --YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN----SVVGAWQQQDTRFV 338
+ S+ F + P Y I G C+ I +++G QD +
Sbjct: 327 NDFKSLFLSFVKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNIIGDITMQDQLII 386
Query: 339 YDLNTGTIQFVPENCA 354
YD G + ++ +C+
Sbjct: 387 YDNERGQLGWIRGSCS 402
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 160/378 (42%), Gaps = 45/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y +V GTP K ++ DTGS ++W C+ C C ++S +++P ASST +
Sbjct: 87 LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTV 146
Query: 61 PCDDLICR-----RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C R P N C + + Y G+S G + F + P
Sbjct: 147 MCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPAN 206
Query: 114 --VIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
VIFGC + D S + GILGF + S+L QL + + +F++CL +
Sbjct: 207 ASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCL----DTI 262
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
+ I G ++ + +KT + D+ HY ++L+ I V + F +
Sbjct: 263 KGGGIFAIG---DVVQPKVKTTPLVADK-PHYNVNLKTIDVGGTTLELPADIF--KPGEK 316
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRH-FDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +ID+G T++ ++ VM F++H + + +D+ C+ Y +
Sbjct: 317 RGTIIDSGTTLTYLPELVFKKVMLAVFNKH-----QDITFHDVQDF-LCFEYSGSVDDGF 370
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNEGYFCV-----AISFSDRNSVV--GAWQQQDTRF 337
++TFHF D V P + YF +CV A+ D +V G +
Sbjct: 371 PTLTFHFEDDLALHVYP-HEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLV 429
Query: 338 VYDLNTGTIQFVPENCAN 355
VYDL I + NC++
Sbjct: 430 VYDLENRVIGWTDYNCSS 447
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 145/356 (40%), Gaps = 25/356 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAPIFNPNASSTYKRIP---- 61
Y + GTP+KS ++ DTGS L W QC PC V+C QS P+FNP ASS+Y +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 62 -CDDLICRR-PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C DL P C C+++ +Y + + G +S +T +F + VP +GC
Sbjct: 187 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPNFYYGC 242
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
DN G AG++G + + SLL QL + FSYCL + +
Sbjct: 243 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 300
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M + + S Y++ + I VA P + + + +ID+G +
Sbjct: 301 GQYSYTPMASSSL---DDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVI 352
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T + G Y + + R +A + C++ + +T F
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRA---SAFSILDTCFQGQAARLRVPEVTMAFAGGAAL 409
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ + C+A + + +++G QQQ VYD+ I F C+
Sbjct: 410 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 156/385 (40%), Gaps = 51/385 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + GTP ++ ++ DTGS L W C N + S+ FNP SS+Y IPC
Sbjct: 70 NISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS-TFNPVWSSSYSPIPCS 128
Query: 64 DLIC----RRPPFR--CENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
C R P R C++ Q C ++YA +S+ G ++T+TF +P V+F
Sbjct: 129 SSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG----IPNVVF 184
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC + + + D G++G + S + Q+ FSYC+ E + + +L
Sbjct: 185 GCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPK---FSYCI----SEYDFSGLLL 237
Query: 175 FGKDANIQ---------RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
G DAN +M T + DR + Y + L+ I VA + F
Sbjct: 238 LG-DANFSWLAPLNYTPLIEMSTPLPYFDRVA-YTVQLEGIKVAHKLLPIPESVFEPDHT 295
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-----DWEYCYRY-- 278
G G M+D+G TF+ Y + HF + G R++ S + CYR
Sbjct: 296 GAGQTMVDSGTQFTFLLGPAYTALRDHFLNK--TAGSLRVYEDSNFVFQGAMDLCYRVPT 353
Query: 279 -DSRFRAYASMTFHFDRADFKVEPTYMYFIFQ-----NEGYFCVAISFSD----RNSVVG 328
+R S+T F A+ V + + N+ C SD V+G
Sbjct: 354 NQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIG 413
Query: 329 AWQQQDTRFVYDLNTGTIQFVPENC 353
QQ+ +DL I C
Sbjct: 414 HLHQQNVWMEFDLKKSRIGLAEIRC 438
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 158/378 (41%), Gaps = 45/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y ++ GTP K ++ DTGS ++W C+ C C ++S +++P ASST +
Sbjct: 85 LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMV 144
Query: 61 PCDDLICR-----RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C + P N C + + Y G+S G T+ F + P
Sbjct: 145 MCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPAN 204
Query: 114 --VIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
VIFGC + D S + + GILGF + S+L QL + + +F++CL +
Sbjct: 205 ASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCL----DTI 260
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
+ I G ++ + +KT + D+ HY ++L+ I V + F
Sbjct: 261 KGGGIFSIG---DVVQPKVKTTPLVADK-PHYNVNLKTIDVGGTTLQLPAHIFEPGEK-- 314
Query: 228 GGCMIDTGAIATFIQRGPY-EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +ID+G T++ + EV++ F++H Q + C++Y +
Sbjct: 315 KGTIIDSGTTLTYLPELVFKEVMLAVFNKH------QDITFHDVQGFLCFQYPGSVDDGF 368
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRF 337
++TFHF D V P + YF +CV S ++G +
Sbjct: 369 PTITFHFEDDLALHVYP-HEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLV 427
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 428 IYDLENRVIGWTDYNCSS 445
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 152/390 (38%), Gaps = 50/390 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP---------------IFNP 51
Y V GTP++ L+ DTGS L W +C + + +A +F P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169
Query: 52 NASSTYKRIPCDDLICRRP-PFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLK 105
S T+ IPC C+ PF N C + Y ++A G+V T++ T L
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVALS 229
Query: 106 ---------NKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLF 156
++ + GV+ GC+ + F+ + G+L S S + S G F
Sbjct: 230 GGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEAS-DGVLSLGYSNISFASRAASRFGGRF 288
Query: 157 SYCLVYAYREMEATSILRFGKDANIQRKDM-----KTIRMFVDRSSHYY-LSLQDISVAD 210
SYCLV ATS L FG + +T + R +Y +++ +SV
Sbjct: 289 SYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDG 348
Query: 211 HRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE 270
+ + + N GG +ID+G T + Y+ V+ E R M +
Sbjct: 349 VALDIPAEVWDVGSN--GGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAM----D 402
Query: 271 DWEYCYRYDSRFR-----AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN- 324
++YCY + +R A + F + P Y I G C+ +
Sbjct: 403 PFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPG 462
Query: 325 -SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
SV+G QQ+ + +DLN ++F +C
Sbjct: 463 VSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/376 (23%), Positives = 139/376 (36%), Gaps = 50/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P K FL D+GS L W QC PC +C P++ P S K +PC
Sbjct: 56 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKS---KLVPCVH 112
Query: 65 LICRR------PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C RC++ QC + I YA S++G++ ++F L N V P V F
Sbjct: 113 RLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAF 172
Query: 117 GCSNDN--RDFSFDGNIAGILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATSI 172
GC D R G+LG SLL QLK + + +CL +
Sbjct: 173 GCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL-----SLRGGGF 227
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG D ++ T ++Y + D +G +
Sbjct: 228 LFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVR----------LAKVVF 277
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-------Y 285
D+G+ T+ PY+ ++ + + R C++ F++ +
Sbjct: 278 DSGSSFTYFAAKPYQALVTALKDGLS---RTLEEEPDTSLPLCWKGQEPFKSVLDVRKEF 334
Query: 286 ASMTFHF--DRADFKVEPTYMYFIFQNEGYFCVA------ISFSDRNSVVGAWQQQDTRF 337
S+ +F + P Y I G C+ I D S++G QD
Sbjct: 335 KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDL-SIIGDITMQDHMV 393
Query: 338 VYDLNTGTIQFVPENC 353
+YD G I ++ C
Sbjct: 394 IYDNEKGKIGWIRAPC 409
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 154/374 (41%), Gaps = 39/374 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + G+P K ++ DTGS ++W C PC C +++ SST K +
Sbjct: 73 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNV 132
Query: 61 PCDDLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP---G 113
C+D C + C + + Y G+++ G + T + L P
Sbjct: 133 GCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQE 192
Query: 114 VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYREMEA 169
V+FGC + D + GI+GF S S++ QL + + + +FS+CL M
Sbjct: 193 VVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCL----DNMNG 248
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
I G+ ++ +KT + V HY + L+ + V I P + NG GG
Sbjct: 249 GGIFAVGE---VESPVVKTTPI-VPNQVHYNVILKGMDVDGDPIDLPPSLAS--TNGDGG 302
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYASM 288
+ID+G ++ + Y ++ E T+ + ++H E + C+ + S +A+ +
Sbjct: 303 TIIDSGTTLAYLPQNLYNSLI----EKITAKQQVKLHMVQETFA-CFSFTSNTDKAFPVV 357
Query: 289 TFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRNSVV--GAWQQQDTRFVYDL 341
HF D V P F + + G+ ++ D V+ G + VYDL
Sbjct: 358 NLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDL 417
Query: 342 NTGTIQFVPENCAN 355
I + NC++
Sbjct: 418 ENEVIGWADHNCSS 431
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/376 (23%), Positives = 139/376 (36%), Gaps = 50/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P K FL D+GS L W QC PC +C P++ P S K +PC
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKS---KLVPCVH 121
Query: 65 LICRR------PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C RC++ QC + I YA S++G++ ++F L N V P V F
Sbjct: 122 RLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAF 181
Query: 117 GCSNDN--RDFSFDGNIAGILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATSI 172
GC D R G+LG SLL QLK + + +CL +
Sbjct: 182 GCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL-----SLRGGGF 236
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG D ++ T ++Y + D +G +
Sbjct: 237 LFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVR----------LAKVVF 286
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-------Y 285
D+G+ T+ PY+ ++ + + R C++ F++ +
Sbjct: 287 DSGSSFTYFAAKPYQALVTALKDGLS---RTLEEEPDTSLPLCWKGQEPFKSVLDVRKEF 343
Query: 286 ASMTFHF--DRADFKVEPTYMYFIFQNEGYFCVA------ISFSDRNSVVGAWQQQDTRF 337
S+ +F + P Y I G C+ I D S++G QD
Sbjct: 344 KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDL-SIIGDITMQDHMV 402
Query: 338 VYDLNTGTIQFVPENC 353
+YD G I ++ C
Sbjct: 403 IYDNEKGKIGWIRAPC 418
>gi|222617032|gb|EEE53164.1| hypothetical protein OsJ_35998 [Oryza sativa Japonica Group]
Length = 384
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 145/348 (41%), Gaps = 88/348 (25%)
Query: 9 VDVLFGTP-SKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC 67
+++ GTP +++ L D SY +W QC P
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAP------------------------------ 119
Query: 68 RRPPFRCENGQCVHRINYAGGAS-ASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN-RDF 125
+ + Y G A+ SG ++T+TFTF VPGV+FGCS+ + DF
Sbjct: 120 -------------YSLTYGGSAANTSGYLATDTFTFGA----TAVPGVVFGCSDASYGDF 162
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY--AYREMEATSILRFGKDANIQR 183
+ +G++G SL+ QL+ G FSY L+ A + A S++RFG DA
Sbjct: 163 A---GASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAV--- 213
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
KT R +D GTF LR NGTGG ++ + T++++
Sbjct: 214 --PKTKRGRLD-------------------AIPAGTFDLRANGTGGVILSSTTPVTYLEQ 252
Query: 244 GPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYDSRFRA-YASMTFHFD-RADFKVE 300
Y+VV G ++ +A+ + + CY S + +T FD AD +
Sbjct: 253 AAYDVVRAAVASR---IGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLS 309
Query: 301 PTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
++I + G C+ + S SV+G Q T +YD++ G + F
Sbjct: 310 AANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRLTF 357
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 154/374 (41%), Gaps = 39/374 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + G+P K ++ DTGS ++W C PC C +++ SST K +
Sbjct: 77 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNV 136
Query: 61 PCDDLICR---RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP---G 113
C+D C + C + + Y G+++ G + T + L P
Sbjct: 137 GCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQE 196
Query: 114 VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYREMEA 169
V+FGC + D + GI+GF S S++ QL + + + +FS+CL M
Sbjct: 197 VVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCL----DNMNG 252
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
I G+ ++ +KT + V HY + L+ + V I P + NG GG
Sbjct: 253 GGIFAVGE---VESPVVKTTPI-VPNQVHYNVILKGMDVDGDPIDLPPSLAS--TNGDGG 306
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYASM 288
+ID+G ++ + Y ++ E T+ + ++H E + C+ + S +A+ +
Sbjct: 307 TIIDSGTTLAYLPQNLYNSLI----EKITAKQQVKLHMVQETFA-CFSFTSNTDKAFPVV 361
Query: 289 TFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRNSVV--GAWQQQDTRFVYDL 341
HF D V P F + + G+ ++ D V+ G + VYDL
Sbjct: 362 NLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDL 421
Query: 342 NTGTIQFVPENCAN 355
I + NC++
Sbjct: 422 ENEVIGWADHNCSS 435
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 155/365 (42%), Gaps = 45/365 (12%)
Query: 21 FLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRIPCDDLICRR----PP 71
++ DTGS +W C+ C C +S +++PN+S T K +PCDD C P
Sbjct: 89 YVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDGPI 148
Query: 72 FRC-ENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP---GVIFGCSNDNR--- 123
C ++ C + I Y G++ SG + TF + L VP VIFGC +
Sbjct: 149 SGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTL 208
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFGKDANI 181
+ D ++ GI+GF + S+L QL + + +FS+CL + I G+ +
Sbjct: 209 SSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCL----DTVNGGGIFAIGE---V 261
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ +KT + V R +HY + L+DI VA I F G +ID+G ++
Sbjct: 262 VQPKVKTTPL-VPRMAHYNVVLKDIEVAGDPIQLPTDIF--DSTSGRGTIIDSGTTLAYL 318
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMH-NASEDWEYCYRYD---SRFRAYASMTFHFDRADF 297
Y+ ++ T R M ED C+ Y S A+ ++ F F+
Sbjct: 319 PVSIYDQLLEK-----TLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKFTFEEGLT 373
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRFVYDLNTGTIQFVP 350
+ Y E +C+ S + ++G + F+YDL+ +I +
Sbjct: 374 LTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIYDLDNMSIGWTD 433
Query: 351 ENCAN 355
NC++
Sbjct: 434 YNCSS 438
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 144/363 (39%), Gaps = 40/363 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y V G P+K F+ DTGS ++W C PC C N FNP++SST RI
Sbjct: 88 LYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRI 147
Query: 61 PCDDLICRRPPFRCE---------NGQCVHRINYAGGASASGLVSTETFTFH--LKNKLV 109
PC D C E + C + Y G+ SG ++T F + N+
Sbjct: 148 PCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQT 207
Query: 110 C--VPGVIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYA 163
V+FGCSN D + GI GF S++ QL S + FS+CL
Sbjct: 208 ANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL--- 264
Query: 164 YREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR 223
IL G+ I + V HY L+L+ I+V+ ++ FA
Sbjct: 265 KGSDNGGGILVLGE---IVEPGL-VFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFA-- 318
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
+ T G ++D+G ++ G Y+ + + R + + + DS F
Sbjct: 319 TSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSF- 377
Query: 284 AYASMTFHFDRA-DFKVEPTYMYFIFQ----NEGYFCVAISFSDRNSVVGAWQQQDTRFV 338
+ T +F V+P Y + Q N +C+ S +++G +D FV
Sbjct: 378 --PTATLYFKGGVSMTVKPEN-YLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFV 434
Query: 339 YDL 341
YDL
Sbjct: 435 YDL 437
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/354 (23%), Positives = 146/354 (41%), Gaps = 27/354 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P KS+ +L DTGS + W QC PC C +Q+ P+F+P++SSTY C
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAA 192
Query: 67 CRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C + C + QC + + Y G+S +G S++T V FGCS N
Sbjct: 193 CAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLALGSN----AVRKFQFGCS--NV 246
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ F+ G++G SL+ Q T FSYCL A L G ++
Sbjct: 247 ESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLP-ATSSSSGFLTLGAGTSGFVKT 305
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+++ ++ + Y + +Q I V ++ F + G ++D+G + T +
Sbjct: 306 PMLRSSQV----PTFYGVRIQAIRVGGRQLSIPTSVF------SAGTIMDSGTVLTRLPP 355
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFKVEPT 302
Y + F + S + C+ + + + ++ F +
Sbjct: 356 TAYSALSSAFKAGMKQY---PSAPPSGILDTCFDFSGQSSVSIPTVALVFSGGAVVDIAS 412
Query: 303 YMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + C+A + + +S ++G QQ+ +YD+ G + F C
Sbjct: 413 DGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
Length = 426
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 29/288 (10%)
Query: 79 CVHRINYAGGASASGLVSTETFTF---HLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGIL 135
C + Y G S +G +S E T H+ + +FGCS + DG +G+L
Sbjct: 138 CPYAYQYGPGISTTGYISAEEVTAVGTHITGR------ALFGCSLAST-VPLDGE-SGVL 189
Query: 136 GFSVSPFSLLGQLKSTAQGLFSY-CLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVD 194
GFS P+SLL QLK + FSY L + ++ S+L G DA Q ++ + +
Sbjct: 190 GFSRGPYSLLSQLKISR---FSYFMLPDDADKPDSESVLLLGDDAVPQTNSSRSTPLLRN 246
Query: 195 RS--SHYYLSLQDISVADHRI-GFAPGTFALRRNG-TGGCMIDTGAIATFIQRGPYEVVM 250
+ YY+ L I V D + G GTF L NG +GG ++ T + T++Q Y +
Sbjct: 247 EAYPDLYYVKLTGIKVDDKSLSGIPAGTFDLAANGCSGGVVMSTLSPITYLQPAAYNALT 306
Query: 251 RHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADFKVEP----TYMY 305
R S + + D CY S + +T F D + P T Y
Sbjct: 307 RALASKIKSQPVRPKADDVADLRLCYNIQSVANLTFPKITLVFHGVDGRPAPMELTTAHY 366
Query: 306 FIFQNE-GYFCVAI----SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
FI +N G C+ + + S +SV+G+ Q T +YDL G++ F
Sbjct: 367 FIRENSTGLQCLTMLPTPAGSPVSSVLGSLLQTGTHMIYDLRGGSLTF 414
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 151/363 (41%), Gaps = 40/363 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+ DTGS LIWT+C C C + +P + P +SS+ + C D
Sbjct: 92 YAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRT 151
Query: 67 CRRPPFR-CEN--------GQCVHRINYAGGAS----ASGLVSTETFTFHLKNKLVCVPG 113
C P C N G C + Y G++ TETFTF + PG
Sbjct: 152 CGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF--GDDAAAFPG 209
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
+ FGC+ R G +G++G SL+ QL A F Y L ++ A S +
Sbjct: 210 IAFGCT--LRSEGGFGTGSGLVGLGRGKLSLVTQLNVEA---FGYRL---SSDLSAPSPI 261
Query: 174 RFGKDANIQRKD----MKTIRM---FVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRN 225
FG A++ + M T + V YY+ L ISV + GTF+ R
Sbjct: 262 SFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRST 321
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY-CYRYDSRFRA 284
G GG + D+G T + Y +V DE + G Q+ A+ D + C+ S
Sbjct: 322 GAGGVIFDSGTTLTMLPDPAYTLVR---DELLSQMGFQKPPPAANDDDLICFTGGSSTTT 378
Query: 285 YASMTFHFDRA---DFKVEPTYMYFIFQN-EGYFCVAISFSDRN-SVVGAWQQQDTRFVY 339
+ SM HFD D E QN E C ++ S + +++G Q D V+
Sbjct: 379 FPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVF 438
Query: 340 DLN 342
DL+
Sbjct: 439 DLS 441
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 149/375 (39%), Gaps = 39/375 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI------FNPNASSTYKR 59
Y V GTP K + DTGS ++W C C NC QS+ + F+ SST
Sbjct: 77 LYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNC-PQSSQLGIELNFFDTVGSSTAAL 135
Query: 60 IPCDDLIC----RRPPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHL----KNKLV 109
IPC D IC + C QC + Y G+ SG ++ F L +
Sbjct: 136 IPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVN 195
Query: 110 CVPGVIFGCS-NDNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYR 165
++FGCS + + D + D + GI GF P S++ QL S +FS+CL
Sbjct: 196 SSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGD 255
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
+ + + + V HY L+LQ I+V + P F++ N
Sbjct: 256 GGGVLVLGEILEPSIVYSP-------LVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNN 308
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
GG ++D G ++ + Y+ ++ + + RQ ++ CY +
Sbjct: 309 -RGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ----CYLVSTSIGDI 363
Query: 285 YASMTFHFDRADFKVEPTYMYFI----FQNEGYFCVAI-SFSDRNSVVGAWQQQDTRFVY 339
+ S++ +F+ V Y + +C+ F + S++G +D VY
Sbjct: 364 FPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVY 423
Query: 340 DLNTGTIQFVPENCA 354
D+ I + +C+
Sbjct: 424 DIAQQRIGWANYDCS 438
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 151/363 (41%), Gaps = 40/363 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP+ DTGS LIWT+C C C + +P + P +SS+ + C D
Sbjct: 92 YAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRT 151
Query: 67 CRRPPFR-CEN--------GQCVHRINYAGGAS----ASGLVSTETFTFHLKNKLVCVPG 113
C P C N G C + Y G++ TETFTF + PG
Sbjct: 152 CGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF--GDDAAAFPG 209
Query: 114 VIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
+ FGC+ R G +G++G SL+ QL A F Y L ++ A S +
Sbjct: 210 IAFGCT--LRSEGGFGTGSGLVGLGRGKLSLVTQLNVEA---FGYRL---SSDLSAPSPI 261
Query: 174 RFGKDANIQRKD----MKTIRM---FVDRSSHYYLSLQDISVADHRIGFAPGTFAL-RRN 225
FG A++ + M T + V YY+ L ISV + GTF+ R
Sbjct: 262 SFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRST 321
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY-CYRYDSRFRA 284
G GG + D+G T + Y +V DE + G Q+ A+ D + C+ S
Sbjct: 322 GAGGVIFDSGTTLTMLPDPAYTLVR---DELLSQMGFQKPPPAANDDDLICFTGGSSTTT 378
Query: 285 YASMTFHFDRA---DFKVEPTYMYFIFQN-EGYFCVAISFSDRN-SVVGAWQQQDTRFVY 339
+ SM HFD D E QN E C ++ S + +++G Q D V+
Sbjct: 379 FPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVF 438
Query: 340 DLN 342
DL+
Sbjct: 439 DLS 441
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 149/384 (38%), Gaps = 49/384 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP--CVNCFNQSAPIFNPNASSTYKRIP 61
N TV + GTP ++ ++ DTGS L W C P +SA F P AS T+ +P
Sbjct: 63 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVP 122
Query: 62 CDDLICRR----PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
CD CR P C+ QC ++YA G+S+ G ++TE FT L
Sbjct: 123 CDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAA---- 178
Query: 116 FGCSNDNRDFSFDG-NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC D S DG AG+LG + S + Q + FSYC+ + + +L
Sbjct: 179 FGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRR---FSYCI----SDRDDAGVLL 231
Query: 175 FGKD---------ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
G + + M + DR + Y + L I V + A
Sbjct: 232 LGHSDLPFLPLNYTPLYQPAMPLP--YFDRVA-YSVQLLGIRVGGKPLPIPASVLAPDHT 288
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS----EDWEYCYRYDSR 281
G G M+D+G TF+ Y + F T +++ + E ++ C+R
Sbjct: 289 GAGQTMVDSGTQFTFLLGDAYSALKAEFSRQ-TKPWLPALNDPNFAFQEAFDTCFRVPQG 347
Query: 282 FRAYA---SMTFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNS----VVGA 329
A ++T F+ A V + + E G +C+ +D V+G
Sbjct: 348 RAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGH 407
Query: 330 WQQQDTRFVYDLNTGTIQFVPENC 353
Q + YDL G + P C
Sbjct: 408 HHQMNVWVEYDLERGRVGLAPIRC 431
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 149/376 (39%), Gaps = 44/376 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V GTP + + DTGS ++W C C NC S F+ +SST + +
Sbjct: 80 LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLV 139
Query: 61 PCDDLICRR---------PPFRCENGQCVHRINYAGGASASGLVSTETFTFH--LKNKLV 109
PC IC PP ++ QC + Y G+ SG ++TF F L L+
Sbjct: 140 PCSHPICTSQIQTTATQCPP---QSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLI 196
Query: 110 C--VPGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYA 163
++FGCS + D + D + GI GF S++ QL S +FS+CL
Sbjct: 197 ANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL--- 253
Query: 164 YREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR 223
E IL G+ I + V HY L LQ I+V+ + P FA
Sbjct: 254 KGEDSGGGILVLGE---ILEPGI-VYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATS 309
Query: 224 RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRF 282
N G +IDTG ++ Y+ + + ++ ++ CY +S
Sbjct: 310 SN--RGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKGNQ----CYLVSNSVS 363
Query: 283 RAYASMTFHFDRAD---FKVEPTYMYFI-FQNEGYFCVAI-SFSDRNSVVGAWQQQDTRF 337
+ ++F+F K E MY + +C+ +++G +D F
Sbjct: 364 EVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIF 423
Query: 338 VYDLNTGTIQFVPENC 353
VYDL I + +C
Sbjct: 424 VYDLAHQRIGWANYDC 439
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 153/355 (43%), Gaps = 30/355 (8%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD-DLICRRPP 71
GTP + L+ DTGS + + C C C N P F P+ S TY + C+ D C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPDCTC---- 57
Query: 72 FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNI 131
EN QC + YA +S+SG++ + +F ++L V FGC N F +
Sbjct: 58 -DTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAV-FGCENAETGDLFSQHA 115
Query: 132 AGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRM 191
GI+G S++ QL S+ L Y E+ +++ G+ + DM
Sbjct: 116 DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMV-LGQIS--PPSDMVFSHS 172
Query: 192 FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMR 251
DRS +Y + L+ + VA ++ P F +G G ++D+G ++ E
Sbjct: 173 DPDRSPYYNIELRGLHVAGKKLDINPQVF----DGKHGTILDSGTTYAYLP----EAAFL 224
Query: 252 HFDEHFTS--FGRQRMHNASEDW-EYCY-----RYDSRFRAYASMTFHFDRAD-FKVEP- 301
F + TS G +++ ++ + C+ ++ + S+ FD + + + P
Sbjct: 225 PFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPE 284
Query: 302 TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
Y++ + G +C+ + + D +++G ++T YD + F NC+
Sbjct: 285 NYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 151/376 (40%), Gaps = 51/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V + G P + +L DTGS L W QC PCV C P++ P++ IPC+D
Sbjct: 59 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSD----LIPCND 114
Query: 65 LICRRPPF----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C+ RCE QC + + YA G S+ G++ + F+ + L P + GC
Sbjct: 115 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTKGLRLTPRLALGCG 174
Query: 120 NDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
D S + G+LG S+L QL S + + +CL IL FG
Sbjct: 175 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL-----SSLGGGILFFG 229
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D + + M + S HY ++ + F T L+ T + D+G+
Sbjct: 230 DDLYDSSR-VSWTPMSREYSKHYSPAM------GGELLFGGRTTGLKNLLT---VFDSGS 279
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF-- 292
T+ Y+ V + + + A +D C++ F + + +F
Sbjct: 280 SYTYFNSKAYQAVTYLLKRELSG---KPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKP 336
Query: 293 ----------DRADFKVEPTYMYFIFQNEGYFCVAI----SFSDRN-SVVGAWQQQDTRF 337
+ F++ P Y I +G C+ I +N +++G QD
Sbjct: 337 LALSFKTGWRSKTLFEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMI 395
Query: 338 VYDLNTGTIQFVPENC 353
+YD +I ++P +C
Sbjct: 396 IYDNEKQSIGWMPADC 411
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 153/355 (43%), Gaps = 30/355 (8%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD-DLICRRPP 71
GTP + L+ DTGS + + C C C N P F P+ S TY + C+ D C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPDCTC---- 57
Query: 72 FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNI 131
EN QC + YA +S+SG++ + +F ++L V FGC N F +
Sbjct: 58 -DTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAV-FGCENAETGDLFSQHA 115
Query: 132 AGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRM 191
GI+G S++ QL S+ L Y E+ +++ G+ + DM
Sbjct: 116 DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMV-LGQIS--PPSDMVFSHS 172
Query: 192 FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMR 251
DRS +Y + L+ + VA ++ P F +G G ++D+G ++ E
Sbjct: 173 DPDRSPYYNIELRGLHVAGKKLDINPQVF----DGKHGTILDSGTTYAYLP----EAAFL 224
Query: 252 HFDEHFTS--FGRQRMHNASEDW-EYCY-----RYDSRFRAYASMTFHFDRAD-FKVEP- 301
F + TS G +++ ++ + C+ ++ + S+ FD + + + P
Sbjct: 225 PFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPE 284
Query: 302 TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
Y++ + G +C+ + + D +++G ++T YD + F NC+
Sbjct: 285 NYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 33/316 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + F++ DT + W C C C S+ F PNAS+T + C +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSEAQ 101
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + F C + C+ +Y G +S + + + T L N ++ PG FGC N
Sbjct: 102 CSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAIT--LANDVI--PGFTFGCINAV 157
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G+LG P SL+ Q + G+FSYCL +++ + L+ G Q
Sbjct: 158 SGGSIPPQ--GLLGLGRGPISLISQAGAMYSGVFSYCL-PSFKSYYFSGSLKLGPVG--Q 212
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K ++T + + R S YY++L +SV ++ N G +ID+G + T
Sbjct: 213 PKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITR 272
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ Y + F + +S G ++ C+ + A A +T HF+ +
Sbjct: 273 FVQPVYFAIRDEFRKQVNGPISSLGA---------FDTCFAATNEAEAPA-VTLHFEGLN 322
Query: 297 FKVEPTYMYFIFQNEG 312
V P I + G
Sbjct: 323 L-VLPMENSLIHSSSG 337
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 142/372 (38%), Gaps = 46/372 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V + G P K FL DTGS L W QC PCV C P++ PN + + C D
Sbjct: 66 YYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNN----LVICKD 121
Query: 65 LIC---RRPPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+C P ++CE+ QC + + YA G S+ G++ + F + N L P + GC
Sbjct: 122 PMCASLHPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGY 181
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFGKD 178
D + G+LG S++ QL S + + +C+ L FG D
Sbjct: 182 DQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCV-----SSRGGGFLFFGDD 236
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + T M D+ +HY ++ + G + +N D+G+
Sbjct: 237 LYDSSRVVWT-PMLRDQHTHYSSGYAELILG--------GKTTVFKNLL--VTFDSGSSY 285
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
T++ Y+ ++ H S R + C+R F++ + F
Sbjct: 286 TYLNSLAYQALV-HLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALS 344
Query: 299 VE-----------PTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDTRFVYDL 341
P Y I +G C+ I D N ++G QD VYD
Sbjct: 345 FPGGGRTKTQYDIPLESYLIISLKGNVCLGILNGTEAGLQDFN-LIGDISMQDKMVVYDN 403
Query: 342 NTGTIQFVPENC 353
I + P NC
Sbjct: 404 EKNQIGWAPTNC 415
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 42/365 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V GTP+++ L DT + W C C+ C S +F+ + SS+++ +PC
Sbjct: 26 FVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQSPQ 83
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + P C C + Y A+ LV + T + VP FGC
Sbjct: 84 CNQVPNPSCSGSACGFNLTYGSSTVAADLVQ-DNLTLATDS----VPSYTFGC------- 131
Query: 126 SFDGNIAGILGFSVSP----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
I G SV P SLLGQ +S Q FSYCL +++ + + LR
Sbjct: 132 -----IRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCL-PSFKSVNFSGSLRL 185
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
G A R + RSS YY++L I V + P A G +ID+G
Sbjct: 186 GPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSG 245
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRA 295
T + Y V DE GR ++ ++ CY ++TF F
Sbjct: 246 TTFTRLVAPAYTAVR---DEFRRRVGRNVTVSSLGGFDTCYTVP---IISPTITFMFAGM 299
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVP 350
+ + P + C+A++ + N +V+ + QQQ+ R ++D+ +
Sbjct: 300 NVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVAR 359
Query: 351 ENCAN 355
E+C++
Sbjct: 360 ESCSS 364
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 81/163 (49%), Gaps = 13/163 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP+ + +++ DTGS ++W QC PC C+NQ+ IF+P S T+ +PC
Sbjct: 134 EYFMRLGV--GTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCG 191
Query: 64 DLICRRPPFRCE-----NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+CRR E + C+++++Y G+ G STET TFH V V GC
Sbjct: 192 SRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR----VDHVPLGC 247
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV 161
+DN LG F Q K+ G FSYCLV
Sbjct: 248 GHDNEGLFVGAAGLLGLGRGGLSFP--SQTKNRYNGKFSYCLV 288
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 138/376 (36%), Gaps = 50/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P + FL DTGS L W QC PCV+C P++ P + K +PC D
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKN---KLVPCVD 113
Query: 65 LICR--------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C R QC + I YA S+ G++ T++F L N + PG+ F
Sbjct: 114 QMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAF 173
Query: 117 GCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATSI 172
GC D + S A G+LG SLL QLK + + +CL
Sbjct: 174 GCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL-----STRGGGF 228
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG D + M S +YY S + F +R +
Sbjct: 229 LFFGDDI-VPYSRATWAPMARSTSRNYY------SPGSANLYFGGRPLGVRPMEV---VF 278
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY------- 285
D+G+ T+ PY+ ++ D + C++ F++
Sbjct: 279 DSGSSFTYFSAQPYQALV---DAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335
Query: 286 --ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDTRF 337
++F + P Y I G C+ I D N +VG QD
Sbjct: 336 RTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLN-IVGDITMQDQMV 394
Query: 338 VYDLNTGTIQFVPENC 353
+YD G I ++ C
Sbjct: 395 IYDNERGQIGWIRAPC 410
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 138/376 (36%), Gaps = 50/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P + FL DTGS L W QC PCV+C P++ P + K +PC D
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKN---KLVPCVD 113
Query: 65 LICR--------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C R QC + I YA S+ G++ T++F L N + PG+ F
Sbjct: 114 QMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAF 173
Query: 117 GCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATSI 172
GC D + S A G+LG SLL QLK + + +CL
Sbjct: 174 GCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL-----STRGGGF 228
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG D + M S +YY S + F +R +
Sbjct: 229 LFFGDDI-VPYSRATWAPMARSTSRNYY------SPGSANLYFGGRPLGVRPMEV---VF 278
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY------- 285
D+G+ T+ PY+ ++ D + C++ F++
Sbjct: 279 DSGSSFTYFSAQPYQALV---DAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335
Query: 286 --ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDTRF 337
++F + P Y I G C+ I D N +VG QD
Sbjct: 336 RTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLN-IVGDITMQDQMV 394
Query: 338 VYDLNTGTIQFVPENC 353
+YD G I ++ C
Sbjct: 395 IYDNERGQIGWIRAPC 410
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 163/375 (43%), Gaps = 40/375 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP++ ++ DTGS ++W C+ C C +S+ +++ S T K +
Sbjct: 97 LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156
Query: 61 PCDDLICRR----PPFRC-ENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C PP C N C + YA G+S+ G + + + L
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSAN 216
Query: 114 --VIFGCS-NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREME 168
VIFGCS + D S + + GILGF S S++ QL S+ + +F++CL +
Sbjct: 217 GSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL----DGLN 272
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
I G +I + + T + V +HY ++++ + V + + F +
Sbjct: 273 GGGIFAIG---HIVQPKVNTTPL-VPNQTHYNVNMKAVEVGGYFLNLPTDVFDV--GDKK 326
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM 288
G +ID+G ++ Y+ ++ + +H+ ++Y D F A +
Sbjct: 327 GTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPA---V 383
Query: 289 TFHFDRADF-KVEPTYMYFIFQNEGYFCVAISFS-----DRNSV--VGAWQQQDTRFVYD 340
TFHF+ + + KV P ++F +G +C+ S DR ++ +G + +YD
Sbjct: 384 TFHFENSLYLKVHP--HEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYD 441
Query: 341 LNTGTIQFVPENCAN 355
L I + NC++
Sbjct: 442 LENQVIGWTEYNCSS 456
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 138/376 (36%), Gaps = 50/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P + FL DTGS L W QC PCV+C P++ P + K +PC D
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKN---KLVPCVD 113
Query: 65 LICR--------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C R QC + I YA S+ G++ T++F L N + PG+ F
Sbjct: 114 QMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAF 173
Query: 117 GCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATSI 172
GC D + S A G+LG SLL QLK + + +CL
Sbjct: 174 GCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL-----STRGGGF 228
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG D + M S +YY S + F +R +
Sbjct: 229 LFFGDDI-VPYSRATWAPMARSTSRNYY------SPGSANLYFGGRPLGVRPMEV---VF 278
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY------- 285
D+G+ T+ PY+ ++ D + C++ F++
Sbjct: 279 DSGSSFTYFSAQPYQALV---DAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335
Query: 286 --ASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDTRF 337
++F + P Y I G C+ I D N +VG QD
Sbjct: 336 KTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLN-IVGDITMQDQMV 394
Query: 338 VYDLNTGTIQFVPENC 353
+YD G I ++ C
Sbjct: 395 IYDNERGQIGWIRAPC 410
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 144/380 (37%), Gaps = 61/380 (16%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
FY+V + G P K L D+GS L W QC PCV+C P + PN I C+D
Sbjct: 67 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGP----ITCND 122
Query: 65 LICR------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+C +PP + + QC + ++YA S+ G++ + F+ L N + P + FGC
Sbjct: 123 PMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGC 182
Query: 119 SNDNRDFSFDGN-----IAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATS 171
D S+ G + G+LG S++ QL+S + + +CL
Sbjct: 183 GYDQ---SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLG 239
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
I + S Y L D+ F + +G G
Sbjct: 240 DGLSTTPGIIWTPMSRK-----SGESAYALGPADL------------LFNGQNSGVKGLR 282
Query: 232 I--DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMT 289
+ D+G+ T+ Y+ + ++ + A E C+R F++ +
Sbjct: 283 LVFDSGSSYTYFNAQAYKTTLSLVRKYLNG---KLKETADESLPVCWRGAKPFKSIFEVK 339
Query: 290 FHF----------DRADFKVEPTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQ 333
+F A ++ P Y I G C+ I D N V+G Q
Sbjct: 340 NYFKPFALSFTKAKSAQLQLPPE-SYLIISKHGNACLGILNGSEVGLGDSN-VIGDIAFQ 397
Query: 334 DTRFVYDLNTGTIQFVPENC 353
D +YD I +VP++C
Sbjct: 398 DKMVIYDNERQQIGWVPKDC 417
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 33/316 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + F++ DT + W C C C S+ F PNAS+T + C +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSEAQ 101
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + F C + C+ +Y G +S + + + T L N ++ PG FGC N
Sbjct: 102 CSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAIT--LANDVI--PGFTFGCINAV 157
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G+LG P SL+ Q + G+FSYCL +++ + L+ G Q
Sbjct: 158 SGGSIPPQ--GLLGLGRGPISLISQAGAMYSGVFSYCL-PSFKSYYFSGSLKLGPVG--Q 212
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K ++T + + R S YY++L +SV ++ N G +ID+G + T
Sbjct: 213 PKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITR 272
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ Y + F + +S G ++ C+ + A A +T HF+ +
Sbjct: 273 FVQPVYFAIRDEFRKQVNGPISSLGA---------FDTCFAETNEAEAPA-VTLHFEGLN 322
Query: 297 FKVEPTYMYFIFQNEG 312
V P I + G
Sbjct: 323 L-VLPMENSLIHSSSG 337
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/380 (23%), Positives = 144/380 (37%), Gaps = 61/380 (16%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
FY+V + G P K L D+GS L W QC PCV+C P + PN I C+D
Sbjct: 34 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGP----ITCND 89
Query: 65 LICR------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+C +PP + + QC + ++YA S+ G++ + F+ L N + P + FGC
Sbjct: 90 PMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGC 149
Query: 119 SNDNRDFSFDGN-----IAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATS 171
D S+ G + G+LG S++ QL+S + + +CL
Sbjct: 150 GYDQ---SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLG 206
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
I T S Y L D+ F + +G G
Sbjct: 207 DGLSTTPGIIW-----TPMSRKSGESAYALGPADL------------LFNGQNSGVKGLR 249
Query: 232 I--DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMT 289
+ D+G+ T+ Y+ + ++ + A E C+R F++ +
Sbjct: 250 LVFDSGSSYTYFNAQAYKTTLSLVRKYLNG---KLKETADESLPVCWRGAKPFKSIFEVK 306
Query: 290 FHF----------DRADFKVEPTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQ 333
+F A ++ P Y I G C+ I D N V+G Q
Sbjct: 307 NYFKPFALSFTKAKSAQLQLPPE-SYLIISKHGNACLGILNGSEVGLGDSN-VIGDIAFQ 364
Query: 334 DTRFVYDLNTGTIQFVPENC 353
D +YD I +VP++C
Sbjct: 365 DKMVIYDNERQQIGWVPKDC 384
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 161/373 (43%), Gaps = 40/373 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP++ ++ DTGS ++W C+ C C +S+ +++ S T K +
Sbjct: 97 LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156
Query: 61 PCDDLICRR----PPFRC-ENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C PP C N C + YA G+S+ G + + + L
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSAN 216
Query: 114 --VIFGCS-NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREME 168
VIFGCS + D S + + GILGF S S++ QL S+ + +F++CL +
Sbjct: 217 GSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL----DGLN 272
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
I G +I + + T + V +HY ++++ + V + + F +
Sbjct: 273 GGGIFAIG---HIVQPKVNTTPL-VPNQTHYNVNMKAVEVGGYFLNLPTDVFDV--GDKK 326
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM 288
G +ID+G ++ Y+ ++ + +H+ ++Y D F A +
Sbjct: 327 GTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPA---V 383
Query: 289 TFHFDRADF-KVEPTYMYFIFQNEGYFCVAISFS-----DRNSV--VGAWQQQDTRFVYD 340
TFHF+ + + KV P ++F +G +C+ S DR ++ +G + +YD
Sbjct: 384 TFHFENSLYLKVHP--HEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYD 441
Query: 341 LNTGTIQFVPENC 353
L I + NC
Sbjct: 442 LENQVIGWTEYNC 454
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 95/388 (24%), Positives = 148/388 (38%), Gaps = 48/388 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCL-PCVNCFNQSAP---IFNPNASSTYKRIPC 62
Y V GTP++ L+ DTGS L W +C P N + F P S T+ I C
Sbjct: 94 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISC 153
Query: 63 DDLICRRP-PFR-----CENGQCVHRINYAGGASASGLVSTETFTFHL-----KNKLVCV 111
C + PF C + Y G++A G V TE+ T L + + +
Sbjct: 154 ASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKL 213
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
G++ GC++ SF+ + G+L S S S G FSYCLV ATS
Sbjct: 214 KGLVLGCTSSYTGPSFEVS-DGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATS 272
Query: 172 ILRFGKDANIQRKDMKT--------------------IRMFVDRSSH--YYLSLQDISVA 209
L FG + + + + +DR Y ++++ +SVA
Sbjct: 273 YLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVA 332
Query: 210 DHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS 269
+ + + + GG ++D+G T + + Y V+ E R M
Sbjct: 333 GQFLKIPRAVWDV--DAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTM---- 386
Query: 270 EDWEYCYRYDSRFR--AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--S 325
+ +EYCY + S M HF A P Y I G C+ + S
Sbjct: 387 DPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGIS 446
Query: 326 VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+G QQ+ + +D+ ++F C
Sbjct: 447 VIGNILQQEHLWEFDIKNRRLKFQRSRC 474
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 149/367 (40%), Gaps = 33/367 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP--CVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y + G+P + + DTGS ++W QC C NC+ Q P+FNP SSTY C
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167
Query: 65 LICRRPPF--------RCENGQCVHRINYAGGASASGLVSTETFTF--HLKNKLVCVPGV 114
C++ + + C + I+Y + + G +ST+ TF H+ +
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227
Query: 115 IFGCS-NDNRDFSFDGN---IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA-YREMEA 169
FGC N++ D N G++G SL+GQL G FSYC+ ++
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKPNG 284
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALRRNGTG 228
T +RFG A+I + + ++ I V D ++ G+ F G G
Sbjct: 285 TIEIRFGLAASISGHSTALANNL--EGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIG 342
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM 288
G ++D+G T + + ++ E + H+ S ++ CY + Y
Sbjct: 343 GLIMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSNS-NYSLCYNAANFLLTYVP- 400
Query: 289 TFHFDRADFKVEPTYMYFIFQN------EGYFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
D K Y F +N +C+A+ + S++G +Q +D + YDL
Sbjct: 401 AIELKFTDNK--EAYFPFTLRNAWIDNGNDQYCLAMFGTSGISIIGIYQHRDIKIGYDLK 458
Query: 343 TGTIQFV 349
+ F
Sbjct: 459 YNLVSFT 465
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 147/365 (40%), Gaps = 50/365 (13%)
Query: 23 LFDTGSYLIWTQCLPCVNCFNQS--APIFNPNASSTYKRIPCDDLICRR-PPFRCENGQC 79
L++ ++ I T P + + AP PNASST++ PC C+ P C + C
Sbjct: 65 LYNVANFTIGTPPQPASAIIDVAGPAPCSFPNASSTFRPEPCGTDACKSIPTSNCSSNMC 124
Query: 80 VHR--INYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGF 137
+ IN G G+V+T+TF + FGC + G +G++G
Sbjct: 125 TYEGTINSKLGGHTLGIVATDTFAIGTATA-----SLGFGCVVAS-GIDTMGGPSGLIGL 178
Query: 138 SVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFV---- 193
+P SL+ Q+ T FSYCL + S L G A + T FV
Sbjct: 179 GRAPSSLVSQMNITK---FSYCLTP--HDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSP 233
Query: 194 --DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMR 251
D S +Y + L I D I P + ++ T A +F+ Y+ + +
Sbjct: 234 GDDMSQYYPIQLDGIKAGDAAIALPPSGNTV--------LVQTLAPMSFLVDSAYQALKK 285
Query: 252 HFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS---MTFHFDRADFKVEPT-YMYFI 307
+ + G + ++ C+ A A TF A V P Y+ +
Sbjct: 286 EVTK---AVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDV 342
Query: 308 FQNEGYFCVAI---------SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN--- 355
+ +G C+AI + + +++G+ QQ++T F+ DL T+ F P +CA+
Sbjct: 343 GEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCAHLSL 402
Query: 356 -DHFL 359
D FL
Sbjct: 403 IDGFL 407
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 151/376 (40%), Gaps = 51/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V + G P + +L DTGS L W QC PCV C P++ P++ IPC+D
Sbjct: 59 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSD----LIPCND 114
Query: 65 LICRRPPF----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C+ RCE QC + + YA G S+ G++ + F+ + L P + GC
Sbjct: 115 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 174
Query: 120 NDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
D S + G+LG S+L QL S + + +CL IL FG
Sbjct: 175 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL-----SSLGGGILFFG 229
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D + + M + S HY ++ + F T L+ T + D+G+
Sbjct: 230 DDLYDSSR-VSWTPMSREYSKHYSPAM------GGELLFGGRTTGLKNLLT---VFDSGS 279
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF-- 292
T+ Y+ V + + + A +D C++ F + + +F
Sbjct: 280 SYTYFNSKAYQAVTYLLKRELSG---KPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKP 336
Query: 293 ----------DRADFKVEPTYMYFIFQNEGYFCVAI----SFSDRN-SVVGAWQQQDTRF 337
+ F++ P Y I +G C+ I +N +++G QD
Sbjct: 337 LALSFKTGWRSKTLFEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMI 395
Query: 338 VYDLNTGTIQFVPENC 353
+YD +I ++P +C
Sbjct: 396 IYDNEKQSIGWMPVDC 411
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 105/407 (25%), Positives = 163/407 (40%), Gaps = 87/407 (21%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----------PIFNPNASS 55
Y+V + FGTPS++ +FDTGS L+W LPC + + S P F P SS
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVW---LPCTSRYLCSGCDFSGLDPTLIPRFIPKNSS 146
Query: 56 TYKRIPCDDLICR---RPPFRCENGQ---------CVHRINYAGGASASGLVSTETFTFH 103
+ K I C C+ P +C C I G S +G++ TE F
Sbjct: 147 SSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDF- 205
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA 163
+ VP + GCS + AGI GF P SL Q+ FS+CLV
Sbjct: 206 ---PDLTVPDFVVGCSIISTR-----QPAGIAGFGRGPVSLPSQMNLKR---FSHCLVS- 253
Query: 164 YREMEATSILR-----------------------FGKDANIQRKDMKTIRMFVDRSSHYY 200
R + T++ F K+ N+ K F++ +YY
Sbjct: 254 -RRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNK------AFLE---YYY 303
Query: 201 LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSF 260
L+L+ I V + A NG GG ++D+G+ TF++R +E+V F +++
Sbjct: 304 LNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNY 363
Query: 261 GRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE-PTYMYFIF-QNEGYFCVA 317
R++ C+ + + F F + K+E P YF F N C+
Sbjct: 364 TREKDLEKETGLGPCFNISGKGDVTVPELIFEF-KGGAKLELPLSNYFTFVGNTDTVCLT 422
Query: 318 ISFSDRNS----------VVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ SD+ ++G++QQQ+ YDL F + C+
Sbjct: 423 V-VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 149/372 (40%), Gaps = 38/372 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI----FNPNASSTYKRIP 61
Y + GTPS+ + DTGS ++W C C+ C +S + ++ +ASST K +
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVS 143
Query: 62 CDDLICRRPPFR--CENGQ-CVHRINYAGGASASGLVSTETFTFHL----KNKLVCVPGV 114
C D C R C +G C + I Y G+S +G + + L + +
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTI 203
Query: 115 IFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
IFGC + + A GI+GF S S + QL S QG + I
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLAS--QGKVKRSFAHCLDNNNGGGI 261
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
G+ + +KT M +S+HY ++L I V + + + F G +I
Sbjct: 262 FAIGE---VVSPKVKTTPML-SKSAHYSVNLNAIEVGNSVLELSSNAF--DSGDDKGVII 315
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
D+G ++ Y ++ +E S +H E + C+ Y + + ++TF F
Sbjct: 316 DSGTTLVYLPDAVYNPLL---NEILASHPELTLHTVQESFT-CFHYTDKLDRFPTVTFQF 371
Query: 293 DRAD----------FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
D++ F+V F +QN G + +++G + VYD+
Sbjct: 372 DKSVSLAVYPREYLFQVREDTWCFGWQNGG---LQTKGGASLTILGDMALSNKLVVYDIE 428
Query: 343 TGTIQFVPENCA 354
I + NC+
Sbjct: 429 NQVIGWTNHNCS 440
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 151/376 (40%), Gaps = 51/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V + G P + +L DTGS L W QC PCV C P++ P++ IPC+D
Sbjct: 47 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSD----LIPCND 102
Query: 65 LICRRPPF----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C+ RCE QC + + YA G S+ G++ + F+ + L P + GC
Sbjct: 103 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 162
Query: 120 NDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
D S + G+LG S+L QL S + + +CL IL FG
Sbjct: 163 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL-----SSLGGGILFFG 217
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D + + M + S HY ++ + F T L+ T + D+G+
Sbjct: 218 DDLYDSSR-VSWTPMSREYSKHYSPAM------GGELLFGGRTTGLKNLLT---VFDSGS 267
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF-- 292
T+ Y+ V + + + A +D C++ F + + +F
Sbjct: 268 SYTYFNSKAYQAVTYLLKRELSG---KPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKP 324
Query: 293 ----------DRADFKVEPTYMYFIFQNEGYFCVAI----SFSDRN-SVVGAWQQQDTRF 337
+ F++ P Y I +G C+ I +N +++G QD
Sbjct: 325 LALSFKTGWRSKTLFEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMI 383
Query: 338 VYDLNTGTIQFVPENC 353
+YD +I ++P +C
Sbjct: 384 IYDNEKQSIGWMPVDC 399
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 142/369 (38%), Gaps = 54/369 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP+++ + DT + W C CV C S+ +FN S+T+K + CD
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCDAPQ 146
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C Y G S L T L + VPG FGC
Sbjct: 147 CKQVPNPTCGGSTCTWNTTYGGSTILSNLTRD---TIALSTDI--VPGYTFGC------- 194
Query: 126 SFDGNIAGILGFSVSP----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
I G SV P S L Q + + FSYCL ++R + + LR
Sbjct: 195 -----IQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCL-PSFRTLNFSGTLRL 248
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
G R + RSS YY++L I V + A G + D+G
Sbjct: 249 GPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSG 308
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFH 291
+ T + Y V F R+R+ NA ++ CY + +MTF
Sbjct: 309 TVFTRLVAPVYTAVRDEF--------RKRVGNAIVSSLGGFDTCY---TGPIVAPTMTFM 357
Query: 292 FDRADFKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGT 345
F + + PT I G C+A++ + N +V+ QQQ+ R ++D+
Sbjct: 358 FSGMNVTL-PTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSR 416
Query: 346 IQFVPENCA 354
I E C+
Sbjct: 417 IGVAREPCS 425
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 83/163 (50%), Gaps = 14/163 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V FG+P++ ++ DTGS L W QC PCV C Q+ P+F+P+AS TYK + C
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 66 IC--------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C P + CV+ +Y + + G +S + T L PG ++G
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTL---PGFVYG 234
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL 160
C D+ D F G AGILG + S+LGQ+ S FSYCL
Sbjct: 235 CGQDS-DGLF-GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL 275
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 157/376 (41%), Gaps = 41/376 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTPSK ++ DTGS ++W C C C +S +++ AS+T +
Sbjct: 73 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 132
Query: 61 PCDDLICRR---PPFRCENG-QCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP--- 112
CDD C P C+ G QC++ + Y G+S +G + ++ + P
Sbjct: 133 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 192
Query: 113 GVIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREME 168
V+FGC N S + GILGF + S+L QL S+ + +FS+CL ++
Sbjct: 193 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL----DNVD 248
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTF-ALRRNGT 227
I G+ + I V +HY + +++I V + F + R GT
Sbjct: 249 GGGIFAIGEVVEPKVN----ITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT 304
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYA 286
+ID+G + P EV + ++ + R+H + + C+ Y +
Sbjct: 305 ---IIDSGTTLAYF---PQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFP 357
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-------SVVGAWQQQDTRFVY 339
++T HFD++ + Y E +C+ S +++G + VY
Sbjct: 358 TVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVY 417
Query: 340 DLNTGTIQFVPENCAN 355
DL I +V NC++
Sbjct: 418 DLEKQGIGWVEYNCSS 433
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 156/388 (40%), Gaps = 60/388 (15%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + GTP ++ ++ DTGS L W +C N F+PN SS+Y +PC
Sbjct: 82 NVSLTVSLTVGTPPQNVSMVLDTGSELSWLRC----NKTQTFQTTFDPNRSSSYSPVPCS 137
Query: 64 DLICR------RPPFRCENGQCVHRI-NYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
L C P C++ Q H I +YA +S+ G ++++ TF++ N +PG IF
Sbjct: 138 SLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASD--TFYIGNS--DMPGTIF 193
Query: 117 GC--SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC S+ + + D G++G + S + Q+ FSYC+ + + + +L
Sbjct: 194 GCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPK---FSYCI----SDSDFSGVLL 246
Query: 175 FGKDANIQ---------RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
G DAN + T + DR + Y + L+ I V+ + F
Sbjct: 247 LG-DANFSWLMPLNYTPLIQISTPLPYFDRVA-YTVQLEGIKVSSKLLPLPKSVFVPDHT 304
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY--------CYR 277
G G M+D+G TF+ Y + F + R ED Y CYR
Sbjct: 305 GAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILR-----VLEDPNYVFQGGMDLCYR 359
Query: 278 Y---DSRFRAYASMTFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNS---- 325
+ +++ F A+ KV + + E +C SD +
Sbjct: 360 VPLSQTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAY 419
Query: 326 VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+G QQ+ +DL I F C
Sbjct: 420 VIGHHHQQNVWMEFDLEKSRIGFAQVQC 447
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 156/384 (40%), Gaps = 54/384 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC--------LPCVNCFNQSAP--IFNPNASST 56
Y + V GTP + DTGS LIW C L + P F+P+ S+T
Sbjct: 100 YLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKSTT 159
Query: 57 YKRIPCDDLICRR-PPFRC-ENGQCVHRINYAGGASASGLVSTETFTF------HLKNKL 108
++ + CD + C P C + +C + +Y G+ SG++STETFTF
Sbjct: 160 FRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDGTT 219
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
V V FGCS S + G+ G +S S LG S + FSYCLV ++
Sbjct: 220 TRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGR-RFSYCLV--PYSVK 276
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYL-SLQDISVADHRIGFAPGTFALRRNGT 227
A+S L FG A + T + + YY+ L+ + V + AP L
Sbjct: 277 ASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTF-EAPDRSPL----- 330
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED---WEYCYRYDS-RFR 283
++D+G TF+ + +++ GR ++ A C+ R
Sbjct: 331 ---IVDSGTTLTFLPEALVDPLVKELT------GRIKLPPAQSPERLLPLCFDVSGVREG 381
Query: 284 AYASMTFHFD-------RADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQ 333
A+M K E T F+ EG C+A+S S++G QQ
Sbjct: 382 QVAAMIPDVTVGLGGGAAVTLKAENT---FVEVQEGTLCLAVSAMSEQFPASIIGNIAQQ 438
Query: 334 DTRFVYDLNTGTIQFVPENCANDH 357
+ YDL+ GT+ F P CA+ +
Sbjct: 439 NMHVGYDLDKGTVTFAPAACASSY 462
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 147/371 (39%), Gaps = 41/371 (11%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC- 67
V + GTP +S+ ++ DTGS L W QC V + +F+P+ SS++ +PC+ +C
Sbjct: 79 VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCK 138
Query: 68 -RRP----PFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
R P P C+ N C + YA G A G + E TF P +I GC+ D
Sbjct: 139 PRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQS---TPPLILGCAED 195
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
D GILG ++ S Q K T FSYC+ T F N
Sbjct: 196 ASD------DKGILGMNLGRLSFASQAKITK---FSYCVPTRQVRPGFTPTGSFYLGENP 246
Query: 182 QRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
+ I + S + ++LQ I + + ++ F +G G MI
Sbjct: 247 NSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMI 306
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA---SEDWEYCYRYDSR--FRAYAS 287
D+G+ T++ +V E R+ S + C+ ++ R +
Sbjct: 307 DSGSEFTYL----VDVAYNKVREEVVRLAGPRLKKGYVYSGVSDMCFDGNAMEIGRLIGN 362
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDTRFVYDLNT 343
M F FD+ V G CV I S+ ++++G + QQ+ +D+
Sbjct: 363 MVFEFDKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDIAN 422
Query: 344 GTIQFVPENCA 354
+ F +C+
Sbjct: 423 RRVGFGKADCS 433
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 145/365 (39%), Gaps = 42/365 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ V GTP+++ L DT + W C C+ C S +F+ + SS+++ +PC
Sbjct: 103 FVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQSPQ 160
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + P C C + Y A+ LV + T + VP FGC
Sbjct: 161 CNQVPNPSCSGSACGFNLTYGSSTVAADLVQ-DNLTLATDS----VPSYTFGC------- 208
Query: 126 SFDGNIAGILGFSVSP----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
I G SV P SLLGQ +S Q FSYCL +++ + + LR
Sbjct: 209 -----IRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCL-PSFKSVNFSGSLRL 262
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
G A R + RSS YY++L I V + P A G +ID+G
Sbjct: 263 GPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSG 322
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRA 295
T + Y V DE GR ++ ++ CY ++TF F
Sbjct: 323 TTFTRLVAPAYTAVR---DEFRRRVGRNVTVSSLGGFDTCYTVP---IISPTITFMFAGM 376
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVP 350
+ + P C+A++ + N +V+ + QQQ+ R ++D+ +
Sbjct: 377 NVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVAR 436
Query: 351 ENCAN 355
E+C++
Sbjct: 437 ESCSS 441
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 146/350 (41%), Gaps = 26/350 (7%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
GT + ++ ++ D+GS + W QC PC C Q P+F+P S+TY +PC C +
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 71 PFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSF 127
P+R N QC INY G++A+G S + T + + G FGC++ +R +F
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD---VIRGFRFGCAHADRGSAF 278
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMK 187
D ++AG L SL+ Q + +FSYCL + + + A + +
Sbjct: 279 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS 338
Query: 188 TIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
T + + +Y + L+ I VA + P F + +ID+ I + + Y
Sbjct: 339 TPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVF------SASSVIDSSTIISRLPPTAY 392
Query: 247 EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD-RADFKVEPTYM 304
+ + F T + R + CY + R S+ FD A ++ +
Sbjct: 393 QALRAAFRSAMTMY---RAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGI 449
Query: 305 YFIFQNEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
A + SDR +G QQ+ VYD+ ++F C
Sbjct: 450 LL----GSCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 80/150 (53%), Gaps = 10/150 (6%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICR-RPP 71
G P +++ DTGS + W QC PC +C+ Q+ PIF P AS++Y + C+ CR
Sbjct: 138 IGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASASYAPLSCEAAQCRYLDQ 197
Query: 72 FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNI 131
+C NG C+++++Y G+ G TET T + NK V V GC ++N
Sbjct: 198 SQCRNGNCLYQVSYGDGSYTVGDFVTETVTIGV-NK---VKNVALGCGHNNEGLFV--GA 251
Query: 132 AGILGFSVSPFSLLGQLKSTAQGLFSYCLV 161
AG++G P S QL ST+ FSYCLV
Sbjct: 252 AGLIGLGGGPLSFPAQLNSTS---FSYCLV 278
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 157/376 (41%), Gaps = 40/376 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
YT V GTP + + DTGS ++W C C NC S F+ SST +
Sbjct: 83 LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALV 142
Query: 61 PCDDLIC----RRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHL------KNKL 108
PC D +C + +C + QC + Y G+ SG+ ++ F + +
Sbjct: 143 PCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANV 202
Query: 109 VCVPGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAY 164
++FGCS + D + D + GILGF S++ QL S +FS+CL
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL---K 259
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
+ IL G+ I + V HY L+LQ I+V + P FA
Sbjct: 260 GDGNGGGILVLGE---ILEPSI-VYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSD 315
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA 284
G +ID+G +++ + Y+ ++ D + F + S+ + D F
Sbjct: 316 K--RGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQCYLVLTSIDDSF-- 371
Query: 285 YASMTFHFD-RADFKVEPTYMYFI---FQNEG-YFCVAI-SFSDRNSVVGAWQQQDTRFV 338
+++F+F+ A ++P+ Y + FQ+ +C+ + +++G +D V
Sbjct: 372 -PTVSFNFEGGASMDLKPS-QYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVV 429
Query: 339 YDLNTGTIQFVPENCA 354
YDL I + +C+
Sbjct: 430 YDLARQQIGWTNYDCS 445
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 154/375 (41%), Gaps = 39/375 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTPSK ++ DTGS ++W C C C +S +++ AS+T +
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213
Query: 61 PCDDLICRR---PPFRCENG-QCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP--- 112
CDD C P C+ G QC++ + Y G+S +G + ++ + P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 273
Query: 113 GVIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREME 168
V+FGC N S + GILGF + S+L QL S+ + +FS+CL ++
Sbjct: 274 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL----DNVD 329
Query: 169 ATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
I G+ + I V +HY + +++I V + F
Sbjct: 330 GGGIFAIGEVVEPKV----NITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAF--ESGDRK 383
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYAS 287
G +ID+G + P EV + ++ + R+H + + C+ Y + +
Sbjct: 384 GTIIDSGTTLAYF---PQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPT 439
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-------SVVGAWQQQDTRFVYD 340
+T HFD++ + Y E +C+ S +++G + VYD
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYD 499
Query: 341 LNTGTIQFVPENCAN 355
L I +V NC++
Sbjct: 500 LEKQGIGWVEYNCSS 514
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 155/358 (43%), Gaps = 41/358 (11%)
Query: 12 LFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
+ GTP + DTGS L W QCLPC+ C+ Q PIFNP S+++ +PC+ C
Sbjct: 85 IIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD 144
Query: 71 PFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN-DNRDFSFD 128
C G C + Y + G + E T + V VI GC + + F F
Sbjct: 145 DGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSS----VKSVI-GCGHASSGGFGF- 198
Query: 129 GNIAGILGFSVSPFSLLGQLKSTAQGL---FSYCLVYAYREMEATSILRFGKDANIQRKD 185
+G++G SL+ Q+ T+ G+ FSYCL A + FG++A +
Sbjct: 199 --ASGVIGLGGGQLSLVSQMSQTS-GISRRFSYCLPTLLS--HANGKINFGQNAVVSGPG 253
Query: 186 MKTIRMFVDRS-SHYYLSLQDISVADHR-IGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+ + + + ++YY++L+ IS+ + R + FA G +ID+G +F+ +
Sbjct: 254 VVSTPLISKNTVTYYYITLEAISIGNERHMAFAK---------QGNVIIDSGTTLSFLPK 304
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS----MTFHFD-RADFK 298
Y+ V+ + + +R+ + W+ C+ D A +S +T F A+
Sbjct: 305 ELYDGVVSSLLKVVKA---KRVKDPGNFWDLCFD-DGINVATSSGIPIITAQFSGGANVN 360
Query: 299 VEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P + N C+ + S +D ++G + YDL + F P C
Sbjct: 361 LLPVNTFQKVANN-VNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 147/367 (40%), Gaps = 33/367 (8%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC- 67
V + GTP +++ ++ DTGS L W QC V + +F+P+ SS++ +PC+ +C
Sbjct: 84 VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCK 143
Query: 68 -RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
R P P C +N C + YA G A G + E TF ++ P +I GC+ +
Sbjct: 144 PRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITF---SRSQSTPPLILGCAEE 200
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ D GILG ++ S Q K T FSYC+ T F N
Sbjct: 201 SSD------AKGILGMNLGRLSFASQAKLTK---FSYCVPTRQVRPGFTPTGSFYLGENP 251
Query: 182 QRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
+ I + S Y +++Q I + + ++ F +G G MI
Sbjct: 252 NSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMI 311
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTS-FGRQRMHNASEDWEYCYRYDSRFRAYASMTFH 291
D+G+ T++ Y V + + ++ D + R +M F
Sbjct: 312 DSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFE 371
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDTRFVYDLNTGTIQ 347
FD+ V G CV I S+ ++++G + QQ+ +DL +
Sbjct: 372 FDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVG 431
Query: 348 FVPENCA 354
F +C+
Sbjct: 432 FGKADCS 438
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 148/371 (39%), Gaps = 42/371 (11%)
Query: 8 TVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI--FNPNASSTYKRIPCDDL 65
V + GTP + + ++ DTGS L W QC N++ P F+P+ SS++ +PC
Sbjct: 89 VVTLPIGTPPQPQQMVLDTGSQLSWIQC------HNKTPPTASFDPSLSSSFYVLPCTHP 142
Query: 66 IC--RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+C R P P C +N C + YA G A G + E F P +I GC
Sbjct: 143 LCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQT---TPPLILGC 199
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY---AYREMEATSILRF 175
S+++RD GILG ++ S Q K T FSYC+ A T
Sbjct: 200 SSESRD------ARGILGMNLGRLSFPFQAKVTK---FSYCVPTRQPANNNNFPTGSFYL 250
Query: 176 GKDANIQR---KDMKTI----RMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
G + N R M T RM Y + +Q I + ++ P F G+G
Sbjct: 251 GNNPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSG 310
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHF-TSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
M+D+G+ TF+ Y+ V + ++ D + R
Sbjct: 311 QTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGD 370
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDTRFVYDLNT 343
+ F F++ V P G CV I S+R ++++G + QQ+ +DL
Sbjct: 371 VAFEFEKGVEIVVPKERVLADVGGGVHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLAN 430
Query: 344 GTIQFVPENCA 354
I F +C+
Sbjct: 431 RRIGFGVADCS 441
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 93/362 (25%), Positives = 145/362 (40%), Gaps = 39/362 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + L DT + W C C C + FNP AS +Y+ +PC
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPA 165
Query: 67 CRRPP---FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C R P C + YA + + L + + N +V FGC
Sbjct: 166 CSRAPNPSCSLNTKSCGFSLTYADSSLEAAL---SQDSLAVANDVV--KSYTFGCLQKAT 220
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
+ LG F L Q K +G FSYCL +++ + + LR G+ R
Sbjct: 221 GTATPPQGLLGLGRGPLSF--LSQTKDMYEGTFSYCLP-SFKSLNFSGTLRLGRKGQPLR 277
Query: 184 KDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+KT + V+ RSS YY+S+ I V + P A G ++D+G + T +
Sbjct: 278 --IKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRL 335
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRFRAYASMTFHFDRADF 297
P V +R DE R+R+ A ++ CY + + +TF F
Sbjct: 336 V-APAYVAVR--DEV-----RRRIRGAPLSSLGGFDTCYNTTVK---WPPVTFMFTGMQV 384
Query: 298 KVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPE 351
+ P I G C+A++ + +V+ + QQQ+ R ++D+ G + F E
Sbjct: 385 TL-PADNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFARE 443
Query: 352 NC 353
C
Sbjct: 444 QC 445
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 157/368 (42%), Gaps = 38/368 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + G+P + L+ DTGS + + C CV C N P F P SSTY+ + C+
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145
Query: 64 -DLICRRPPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
D C ENG QC + YA +++SG+++ + +F +++LV V FGC
Sbjct: 146 ADCNCD------ENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAV-FGCETM 198
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ GI+G S++ QL S+ L Y ++ +++ G +
Sbjct: 199 ESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISS-- 256
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
M RS +Y + L++I VA + P TF +G G ++D+G +
Sbjct: 257 -PPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYF 311
Query: 242 QRGPY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
Y + +M+ SF +Q + C + R + F D
Sbjct: 312 PEKAYYAFKDAIMKKI-----SFLKQISGPDPNFKDIC--FSGAGRDVTELPKVFPEVDM 364
Query: 298 ------KVEPTYMYFIFQN---EGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
K+ + ++F++ G +C+ I + +D+ +++G ++T Y+ TI
Sbjct: 365 VFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTI 424
Query: 347 QFVPENCA 354
F NC+
Sbjct: 425 GFWKTNCS 432
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 159/366 (43%), Gaps = 35/366 (9%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C +C P F P+ SSTY + C+
Sbjct: 85 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN 144
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG-VIFGCSND 121
D C + CV+ YA +S+SG++ + +F N+ VP +FGC N
Sbjct: 145 MDCNCDH-----DGVNCVYERRYAEMSSSSGVLGEDIISF--GNQSEVVPQRAVFGCENV 197
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ GI+G S++ QL K+ FS C Y + +++ G
Sbjct: 198 ETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLC--YGGMHVGGGAMVLGGIPP 255
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
DM R RS +Y + L++I VA + +P TF R++GT ++D+G
Sbjct: 256 ---PPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFD-RKHGT---VLDSGTTYA 308
Query: 240 FIQRGPYEVVMRHFDEHF-TSFGRQRMHNASEDW-EYCYRYDSR-----FRAYASMTFHF 292
++ P E + D S +++H ++ + C+ R +A+ + F
Sbjct: 309 YL---PEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVF 365
Query: 293 DRADFKVEPTYMYFIFQN---EGYFCVAI-SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
K+ T ++FQ+ G +C+ I D +++G ++T YD I F
Sbjct: 366 SNGQ-KLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGF 424
Query: 349 VPENCA 354
NC+
Sbjct: 425 WKTNCS 430
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 145/368 (39%), Gaps = 79/368 (21%)
Query: 34 QCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPFRC---ENGQCVHRINYAGGA 89
QC PCV+C+ Q P+FNP SS+Y +PC C + RC ++G C + Y+G
Sbjct: 2 QCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHG 61
Query: 90 SASGLVSTETF-----TFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIA---GILGFSVSP 141
G ++ + FH V+FGCS D S G A G++G P
Sbjct: 62 VTKGTLAIDKLAIGGDVFH---------AVVFGCS----DSSVGGPAAQASGLVGLGRGP 108
Query: 142 FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDR--SSHY 199
SL+ QL F YCL +L G DA D T+ M S+Y
Sbjct: 109 LSLVSQLSVHR---FMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYY 165
Query: 200 YLSLQDISVADHRIGFAPGTFALRRNGTG--------------------------GCMID 233
YL+L ++V D PGT RN T G ++D
Sbjct: 166 YLNLDGLAVGDQ----TPGT---TRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVD 218
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED----WEYCYRYDSRF---RAYA 286
+ +F++ Y+ + +E R+ A+ + C+ R Y
Sbjct: 219 VASTISFLETSLYDELADDLEEEI------RLPRATPSLRLGLDLCFILPEGVGMDRVYV 272
Query: 287 -SMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGT 345
+++ FD +++ ++ + C+ I + S++G +Q Q+ R +++L G
Sbjct: 273 PTVSLSFDGRWLELDRDRLF--VTDGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGK 330
Query: 346 IQFVPENC 353
I F +C
Sbjct: 331 ITFAKASC 338
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 144/362 (39%), Gaps = 35/362 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYK------- 58
Y V + GTP S L DTGS + WTQC PCV +C+ Q+ F+P SS+YK
Sbjct: 45 YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSS 104
Query: 59 --RIPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
RI D R C + C++++ Y G+ + G +TE T + + +F
Sbjct: 105 SCRIITDSGGAR----GCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSD---VISNFLF 157
Query: 117 GCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
GC N G IAG+LG SL Q LF+YCL +T L G
Sbjct: 158 GCGQQNAGRF--GRIAGLLGLGRGKLSLALQTSEKYNNLFTYCL--PSFSSSSTGHLTLG 213
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
K F + Y + ++ +SV H + F+ G +ID+G
Sbjct: 214 GQVPKSVKFTPLSPAF-KNTPFYGIDIKGLSVGGHVLPIDASVFS-----NAGAIIDSGT 267
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ T +Q Y + F + + + + + CY + F +
Sbjct: 268 VITRLQPTVYSALSSKFQQLMKDYPKT---DGFSILDTCYDFSGNESISVPRISFFFKGG 324
Query: 297 FKVEPTY--MYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPE 351
+V+ + + + C+A + +D + V G QQQ V+DL G I F P
Sbjct: 325 VEVDIKFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPS 384
Query: 352 NC 353
C
Sbjct: 385 GC 386
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 149/367 (40%), Gaps = 43/367 (11%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+ Y V GTP + L DT + W C C C SA F+P AS++Y+ +PC
Sbjct: 108 QTLTYVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPC 167
Query: 63 DDLICRRPP-FRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C + P C G C + YA +S +S ++ V FGC
Sbjct: 168 GSPLCAQAPNAACPPGGKACGFSLTYA-DSSLQAALSQDSLAVAGN----AVKAYTFGCL 222
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
R G+LG P S L Q K + FSYCL +++ + + LR G++
Sbjct: 223 --QRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCL-PSFKSLNFSGTLRLGRNG 279
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI---GFAPGTFALRRNGTGGCMIDTGA 236
QR + RSS YY+++ + V + F P T A G ++D+G
Sbjct: 280 QPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGA-------GTVLDSGT 332
Query: 237 IATFIQRGPY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
+ T + Y + V R +S G ++ C +++ A+ MT F
Sbjct: 333 MFTRLVAPAYVAVRDEVRRRVGAPVSSLG---------GFDTC--FNTTAVAWPPMTLLF 381
Query: 293 DRADFKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTI 346
D + P I G C+A++ + +V+ + QQQ+ R ++D+ G +
Sbjct: 382 DGMQVTL-PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRV 440
Query: 347 QFVPENC 353
F E C
Sbjct: 441 GFARERC 447
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 156/382 (40%), Gaps = 53/382 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y ++ GTP+K ++ DTGS ++W C+ C C +S +++P SST ++
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62
Query: 61 PCDDLICRRP-----PFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C P + C + + Y G+S +G ++ F + P
Sbjct: 63 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122
Query: 114 --VIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
V FGC S D S + + GI+GF S S+L QL + + +F++CL +
Sbjct: 123 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL----DTI 178
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G N+ + +KT + V HY ++L+ I V + F
Sbjct: 179 NGGGIFAIG---NVVQPKVKTTPL-VPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK-- 232
Query: 228 GGCMIDTGAIATFIQRGPYEVVM-----RHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
G +ID+G T++ Y+ +M +H D F HN E C++Y R
Sbjct: 233 KGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITF--------HNVQE--FLCFQYVGRV 282
Query: 283 -RAYASMTFHFDR-ADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQ 333
+ +TFHF+ V P + YF + +CV S ++G
Sbjct: 283 DDDFPKITFHFENDLPLNVYP-HDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLS 341
Query: 334 DTRFVYDLNTGTIQFVPENCAN 355
+ VYDL I + NC++
Sbjct: 342 NKLVVYDLENQVIGWTEYNCSS 363
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 150/378 (39%), Gaps = 53/378 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
N +Y V + G PSK FL DTGS L W QC PCV C P + P + +PC
Sbjct: 17 NGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNN----LVPC 72
Query: 63 DDLICR----RPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
D IC+ RCEN GQC + + YA G S+ G++ +TF + ++ P + G
Sbjct: 73 MDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNLNFTSEKRHSPLLALG 132
Query: 118 -CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILR 174
C D I G+LG S++ QL S + + +CL
Sbjct: 133 LCGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDL 192
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDT 234
+ + M D + HY L +++ GF +N D+
Sbjct: 193 Y------DSSRVAWTPMSPD-AKHYSPGLAELTFDGKTTGF--------KNLL--TTFDS 235
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF 292
GA T++ Y+ ++ + + + + A +D C++ F++ + +F
Sbjct: 236 GASYTYLNSQAYQGLISLLKKELSG---KPLREALDDQTLPLCWKGRKPFKSIRDVKKYF 292
Query: 293 D----------RADFKVE-PTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDT 335
++ ++E P Y I ++G C+ I +D N V+G QD
Sbjct: 293 KTFALSFTNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLN-VIGDISMQDR 351
Query: 336 RFVYDLNTGTIQFVPENC 353
+YD I + P NC
Sbjct: 352 VVIYDNEKERIGWAPGNC 369
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 157/368 (42%), Gaps = 38/368 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + G+P + L+ DTGS + + C CV C N P F P SSTY+ + C+
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145
Query: 64 -DLICRRPPFRCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
D C ENG QC + YA +++SG+++ + +F +++LV V FGC
Sbjct: 146 ADCNCD------ENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAV-FGCETM 198
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ GI+G S++ QL S+ L Y ++ +++ G +
Sbjct: 199 ESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISS-- 256
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
M RS +Y + L++I VA + P TF +G G ++D+G +
Sbjct: 257 -PPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYF 311
Query: 242 QRGPY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF 297
Y + +M+ SF +Q + C + R + F D
Sbjct: 312 PEKAYYAFKDAIMKKI-----SFLKQISGPDPNFKDIC--FSGAGRDVTELPKVFPEVDM 364
Query: 298 ------KVEPTYMYFIFQN---EGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTI 346
K+ + ++F++ G +C+ I + +D+ +++G ++T Y+ TI
Sbjct: 365 VFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTI 424
Query: 347 QFVPENCA 354
F NC+
Sbjct: 425 GFWKTNCS 432
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 147/372 (39%), Gaps = 38/372 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI----FNPNASSTYKRIP 61
Y + GTPS+ + DTGS ++W C C+ C +S + ++ +ASST K +
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSVS 143
Query: 62 CDDLICRRPPFR--CENGQ-CVHRINYAGGASASGLVSTETFTFHL----KNKLVCVPGV 114
C D C R C +G C + I Y G+S +G + + L + +
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTI 203
Query: 115 IFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
IFGC + + A GI+GF S S + QL S QG + I
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLAS--QGKVKRSFAHCLDNNNGGGI 261
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
G+ + +KT M +S+HY ++L I V + + + F G +I
Sbjct: 262 FAIGE---VVSPKVKTTPML-SKSAHYSVNLNAIEVGNSVLQLSSDAF--DSGDDKGVII 315
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
D+G ++ Y +M S +H + + C+ Y R + ++TF F
Sbjct: 316 DSGTTLVYLPDAVYNPLMNQI---LASHQELNLHTVQDSFT-CFHYIDRLDRFPTVTFQF 371
Query: 293 DRAD----------FKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLN 342
D++ F+V F +QN G + +++G + VYD+
Sbjct: 372 DKSVSLAVYPQEYLFQVREDTWCFGWQNGG---LQTKGGASLTILGDMALSNKLVVYDIE 428
Query: 343 TGTIQFVPENCA 354
I + NC+
Sbjct: 429 NQVIGWTNHNCS 440
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 95/386 (24%), Positives = 148/386 (38%), Gaps = 53/386 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + GTP ++ ++ DTGS L W C P SA F P ASST+ +PC
Sbjct: 82 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCA 141
Query: 64 DLICRR----PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
CR P C+ +C ++YA G+S+ G ++T+ F L FG
Sbjct: 142 SAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAA----FG 197
Query: 118 CSNDNRDFSFDG-NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
C + D S DG AG+LG + S + Q + FSYC+ + + +L G
Sbjct: 198 CMSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRR---FSYCI----SDRDDAGVLLLG 250
Query: 177 KDANIQRKDMKTIRM--------FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
+ M + DR + Y + L I V + A G G
Sbjct: 251 HSDLPTFLPLNYTPMYQPALPLPYFDRVA-YSVQLLGIRVGGKHLPIPASVLAPDHTGAG 309
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR--YDSRFRA-- 284
M+D+G TF+ Y + F + + A +D + ++ +D+ FR
Sbjct: 310 QTMVDSGTQFTFLLGDAYSALKAEFTRQ-----ARPLLPALDDPSFAFQEAFDTCFRVPQ 364
Query: 285 --------YASMTFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNS----VV 327
+T F+ A+ V + + E G +C+ +D V+
Sbjct: 365 GRSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVI 424
Query: 328 GAWQQQDTRFVYDLNTGTIQFVPENC 353
G Q + YDL G + P C
Sbjct: 425 GHHHQMNVWVEYDLERGRVGLAPVRC 450
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 106/226 (46%), Gaps = 28/226 (12%)
Query: 4 NYFYTVDV--LFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
NY T+ + G+P+ + ++ DTGS L W QC PC C+ Q P+F+P S+TY +
Sbjct: 91 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVR 150
Query: 62 CDDLICRRPPFRCENG-------------QCVHRINYAGGASASGLVSTETFTFHLKNKL 108
C+ C R G +C + + Y G+ + G+++T+T +
Sbjct: 151 CNASACAD-SLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS-- 207
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME 168
+ G +FGC NR F G AG++G + SL+ Q S G+FSYCL A
Sbjct: 208 --LGGFVFGCGLSNRGL-F-GGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDA 263
Query: 169 ATSI-LRFGKDANIQRKDMKTI---RMFVDRSSH--YYLSLQDISV 208
+ S+ L G DA ++ + RM D + Y+L++ +V
Sbjct: 264 SGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAV 309
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 148/368 (40%), Gaps = 39/368 (10%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
EK+ V + GTP+ + L+FDT S L+WTQC PC++C Q+ +++PN + TY +
Sbjct: 82 QEKHVEPHVFLGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANL 141
Query: 61 PCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
+ Y+ + SG +TETF L N V V + FGC
Sbjct: 142 TSSS----------------YNYTYSKQSFTSGYFATETFA--LGN--VTVANITFGCGT 181
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD-- 178
N+ + +D N+AG+ G + L FSYC + + L +
Sbjct: 182 RNQGY-YD-NVAGVFGVGRGGRGGVSLLNQLGIDRFSYCFSSSGAPGSSAVFLGGSPELA 239
Query: 179 ANIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
N + M D S Y++ L ++V + A + A G +ID+ +
Sbjct: 240 TNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATLVDVAGASSA--EGGGRALVIDSTS 297
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE--DWEYCYRY----DSRFRAYASMTF 290
T + Y V R ++ NAS + C+ + +MT
Sbjct: 298 PVTVLDEATYGPVRRALVAQLAPL-KEANANASAGVGLDLCFELAAGGATPTPPNVTMTL 356
Query: 291 HFD--RADFKVEPTYMYFIFQNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDLNTGTI 346
HFD AD + P G C+ ++ S N V+G+W DT +YDL +
Sbjct: 357 HFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVV 416
Query: 347 QFVPENCA 354
F P +CA
Sbjct: 417 SFQPLDCA 424
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 75/156 (48%), Gaps = 7/156 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP+K ++FDTGS L W QC PC +C+ Q P+F+P+ SSTY + C
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 67 CRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ + +C + + Y + G + +T T + L PG +FGC + N
Sbjct: 209 CQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTL---PGFVFGCGDQNAG 265
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL 160
G + G+ G SL Q + F+YCL
Sbjct: 266 LF--GQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL 299
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 156/382 (40%), Gaps = 53/382 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y ++ GTP+K ++ DTGS ++W C+ C C +S +++P SST ++
Sbjct: 88 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 147
Query: 61 PCDDLICRRP-----PFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C P + C + + Y G+S +G ++ F + P
Sbjct: 148 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 207
Query: 114 --VIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
V FGC S D S + + GI+GF S S+L QL + + +F++CL +
Sbjct: 208 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL----DTI 263
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G N+ + +KT + V HY ++L+ I V + F
Sbjct: 264 NGGGIFAIG---NVVQPKVKTTPL-VPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK-- 317
Query: 228 GGCMIDTGAIATFIQRGPYEVVM-----RHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
G +ID+G T++ Y+ +M +H D F HN E C++Y R
Sbjct: 318 KGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITF--------HNVQE--FLCFQYVGRV 367
Query: 283 -RAYASMTFHFDR-ADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQ 333
+ +TFHF+ V P + YF + +CV S ++G
Sbjct: 368 DDDFPKITFHFENDLPLNVYP-HDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLS 426
Query: 334 DTRFVYDLNTGTIQFVPENCAN 355
+ VYDL I + NC++
Sbjct: 427 NKLVVYDLENQVIGWTEYNCSS 448
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 152/383 (39%), Gaps = 51/383 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G+P + ++ DTGS L W C N + +FNP +SS+Y IPC
Sbjct: 37 NVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCS 92
Query: 64 DLICRR------PPFRCENGQCVHRI-NYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+CR P C+ + H I +YA +S G ++++ F +PG +F
Sbjct: 93 SPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS----ALPGTLF 148
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC + + + D G++G + S + QL FSYC+ +++ +L
Sbjct: 149 GCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPK---FSYCI----SGRDSSGVLL 201
Query: 175 FGKD-----ANIQRKDMKTIRM---FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
FG N+ + I + DR + Y + L I V + + FA G
Sbjct: 202 FGDSHLSWLGNLTYTPLVQISTPLPYFDRVA-YTVQLDGIRVGNKILPLPKSIFAPDHTG 260
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEH----FTSFGRQR-MHNASEDWEYCYRYDS- 280
G M+D+G TF+ Y + F E G + + D CYR +
Sbjct: 261 AGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMD--LCYRVPAG 318
Query: 281 -RFRAYASMTFHFDRADFKVEPTYMYF-----IFQNEGYFCVAISFSDRNS----VVGAW 330
+ +++ F A+ V + + + E +C+ SD V+G
Sbjct: 319 GKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIGHH 378
Query: 331 QQQDTRFVYDLNTGTIQFVPENC 353
QQ+ +DL + FV C
Sbjct: 379 HQQNVWMEFDLVKSRVGFVETRC 401
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 162/382 (42%), Gaps = 53/382 (13%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRIP 61
Y + GTP K + DTGS ++W C+ C C +S +++P SS+ +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 62 CDDLIC-------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLV----- 109
CD+ C + P C +R Y G+S +G +++ + N+L
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQY---NQLSGNAQT 203
Query: 110 --CVPGVIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYA 163
VIFGC + D S + + GI+GF S S L QL S + +FS+CL
Sbjct: 204 RHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL--- 260
Query: 164 YREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALR 223
++ I G+ + + +K+ + + SHY ++LQ I VA + + P F
Sbjct: 261 -DTIKGGGIFAIGE---VVQPKVKSTPLLPNM-SHYNVNLQSIDVAGNALQLPPHIFETS 315
Query: 224 RNGTGGCMIDTGAIATFIQRGPY-EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSR 281
G +ID+G T++ Y +++ F +H Q + + C+ Y +S
Sbjct: 316 EK--RGTIIDSGTTLTYLPELVYKDILAAVFQKH------QDITFRTIQGFLCFEYSESV 367
Query: 282 FRAYASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAI---SFSDRNS----VVGAWQQQ 333
+ +TFHF D V P + YF + +C+ F +++ ++G
Sbjct: 368 DDGFPKITFHFEDDLGLNVYP-HDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLS 426
Query: 334 DTRFVYDLNTGTIQFVPENCAN 355
+ VYDL I + NC++
Sbjct: 427 NKVVVYDLEKQVIGWTDYNCSS 448
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 75/156 (48%), Gaps = 7/156 (4%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP+K ++FDTGS L W QC PC +C+ Q P+F+P+ SSTY + C
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 67 CRRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ + +C + + Y + G + +T T + L PG +FGC + N
Sbjct: 209 CQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTL---PGFVFGCGDQNAG 265
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL 160
G + G+ G SL Q + F+YCL
Sbjct: 266 LF--GQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL 299
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 149/383 (38%), Gaps = 64/383 (16%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P + FL DTGS L W QC PCV+C P++ P + K +PC D
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLYRPTKN---KIVPCVD 113
Query: 65 LICRR------PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C +C++ QC + I YA S+ G++ T++F L N + P + F
Sbjct: 114 QLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRLANSSIVRPSLAF 173
Query: 117 GCSNDNRDFSFD--GNIAGILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATSI 172
GC D + S G+LG SLL QLK + + +CL +
Sbjct: 174 GCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCL-----SIRGGGF 228
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG---- 228
L FG D + + M +YY +PGT +L G
Sbjct: 229 LFFG-DNLVPYSRATWVPMVRSAFKNYY---------------SPGTASLYFGGRSLGVR 272
Query: 229 --GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-- 284
++D+G+ T+ PY+ ++ + ++ + C++ F++
Sbjct: 273 PMEVVLDSGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPS---LPLCWKGKKPFKSVL 329
Query: 285 -----YASMTFHFD---RADFKVEPTYMYFIFQNEGYFCVA------ISFSDRNSVVGAW 330
+ S+ F +A ++ P Y I G C+ I D N +VG
Sbjct: 330 DVKKEFKSLVLSFSNGKKALMEIPPEN-YLIVTKFGNACLGILNGSEIGLKDLN-IVGDI 387
Query: 331 QQQDTRFVYDLNTGTIQFVPENC 353
QD +YD G I ++ C
Sbjct: 388 TMQDQMVIYDNERGQIGWIRAPC 410
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 116/271 (42%), Gaps = 28/271 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAP---IFNPNASSTYKRIPC 62
Y + + GTP + DTGS L W QC C + C++Q+A IFNP SSTY ++ C
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65
Query: 63 DDLICR------RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
C + C E+ C++ + Y G + G + + T +
Sbjct: 66 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRS---IDNF 122
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQL-KSTAQGLFSYCLVYAYREMEATSIL 173
IFGC DN ++G AGI+GF +S Q+ + T FSYC + + +I
Sbjct: 123 IFGCGEDNL---YNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIG 179
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
+ +D N+ M T ++ D Y + D+ V R+ P + + ++D
Sbjct: 180 PYARDINL----MWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT-----IVD 230
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQR 264
+G T+I ++ + + + + G R
Sbjct: 231 SGTADTYILSPVFDALDKAMTKEMQAKGYTR 261
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 150/377 (39%), Gaps = 53/377 (14%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P K FL DTGS L W QC PC +C P++ P + K +PC D
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTKN---KLVPCVD 121
Query: 65 LIC--------RRPPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
+C R+ +C++ QC + I YA S++G++ ++F L N V P +
Sbjct: 122 QLCASLHNGLNRK--HKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLANGSVVRPSL 179
Query: 115 IFGCSNDNRDFSFDGNIA-GILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATS 171
FGC D + S + + G+LG SLL Q K + + +CL +
Sbjct: 180 AFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCL-----SLRGGG 234
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
L FG D + + + M +YY S + F G +LR T +
Sbjct: 235 FLFFGDDL-VPYQRVTWTPMVRSPLRNYY------SPGSASLYF--GDQSLRVKLT-EVV 284
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA------- 284
D+G+ T+ PY+ ++ + R + C++ F++
Sbjct: 285 FDSGSSFTYFAAQPYQALVTALKGDLS---RTLKEVSDPSLPLCWKGKKPFKSVLDVKKE 341
Query: 285 YASMTFHFDRAD--FKVEPTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDTR 336
+ S+ +F + F P Y I G C+ I D S++G QD
Sbjct: 342 FKSLVLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDL-SILGDITMQDQM 400
Query: 337 FVYDLNTGTIQFVPENC 353
+YD G I ++ C
Sbjct: 401 VIYDNEKGQIGWIRAPC 417
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 122/287 (42%), Gaps = 39/287 (13%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRI 60
K+ Y V FGTP+ + ++ DTGS L W QC PC + C Q P+F+P+ SSTY +
Sbjct: 108 KSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAV 167
Query: 61 PCDDLICRRPPF-----RCENGQ-CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
PC C++ C NGQ C I+Y G S G+ + K+KL PG
Sbjct: 168 PCASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGV--------YGKDKLTLAPGA 219
Query: 115 I-----FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA 169
I FGC + S G G+LG SL Q FSYCL +
Sbjct: 220 IVKDFYFGCGHSKS--SLPGLFDGLLGLGRLSESLGAQYGGGGG--FSYCLPAVNSK--- 272
Query: 170 TSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
L FG N + + + ++L I+V ++ P F +GG
Sbjct: 273 PGFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAF------SGG 326
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY 276
++D+G + T +Q Y + F E ++ + +H D + CY
Sbjct: 327 MIVDSGTVVTVLQSTVYRALRAAFREAMKAY--RLVHG---DLDTCY 368
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 151/371 (40%), Gaps = 34/371 (9%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA----------PIFNPNA 53
N +YT + GTPS+ L+ D+GS + + C C C N + P F P+
Sbjct: 89 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 148
Query: 54 SSTYKRIPCD-DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVP 112
SSTY + C+ D C E QC + YA +S+SG++ + +F +++L
Sbjct: 149 SSTYSPVKCNVDCTCDN-----ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQR 203
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
V FGC N F + GI+G S++ QL S+ L Y ++ ++
Sbjct: 204 AV-FGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTM 262
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
+ G A DM RS +Y + L++I VA + P F N G ++
Sbjct: 263 VLGGMPA---PPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF----NSKHGTVL 315
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH- 291
D+G ++ + S + R + + + C+ R + S F
Sbjct: 316 DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYK-DICFAGAGRNVSQLSEVFPD 374
Query: 292 -----FDRADFKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNT 343
+ + P Y++ + EG +C+ + + D +++G ++T YD +
Sbjct: 375 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 434
Query: 344 GTIQFVPENCA 354
I F NC+
Sbjct: 435 EKIGFWKTNCS 445
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 138/353 (39%), Gaps = 22/353 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V FGTP ++ L DT S W C CV C + S P F P S++++ + C
Sbjct: 97 YIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAPIKSTSFRNVSCGSPH 154
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C Y G +S + V +T T +PG FGC N
Sbjct: 155 CKQVPNPTCGGSACAFNFTY-GSSSIAASVVQDTLTLAADP----IPGYTFGCVNKTTGS 209
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
S LG SLL Q ++ + FSYCL +++ + + LR G +R
Sbjct: 210 SAPQQGLLGLGRGPL--SLLSQSQNLYKSTFSYCL-PSFKSINFSGSLRLGPVYQPKRIK 266
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ RSS YY++L I V + P A G + D+G + T +
Sbjct: 267 YTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPV 326
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMY 305
Y V F G + ++ CY ++TF F + + P +
Sbjct: 327 YTAVRNEFRRR---VGPKLPVTTLGGFDTCYNVP---IVVPTITFLFSGMNVALPPDNIV 380
Query: 306 FIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
C+A++ + N +V+ QQQ+ R ++D+ I E C
Sbjct: 381 IHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/399 (26%), Positives = 161/399 (40%), Gaps = 71/399 (17%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCF-----NQSAPIFNPNASSTYK 58
Y+V + FGTPS++ +FDTGS L+W C C +C P F P SS+ +
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSSSR 149
Query: 59 RIPCDDLICR------------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN 106
I C + C+ P R C I G S +G++ +E F
Sbjct: 150 VIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGSTAGILISEKLDF---- 205
Query: 107 KLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYRE 166
+ VP + GCS + AGI GF P SL Q+K + FS+CLV R
Sbjct: 206 PDLTVPDFVVGCSVISTR-----TPAGIAGFGRGPESLPSQMKLKS---FSHCLV--SRR 255
Query: 167 MEATSILR-FGKDANIQRKD-MKTIRMF---------VDRSS---HYYLSLQDISVADHR 212
+ T++ G D K KT + V ++ +YYL+L+ I V
Sbjct: 256 FDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKH 315
Query: 213 IG-----FAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN 267
+ APGT NG GG ++D+G+ TF++R +E+V F +++ R++
Sbjct: 316 VKIPYKFLAPGT-----NGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLE 370
Query: 268 ASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVEPTYMYFIF-QNEGYFCVAISFSDRNS 325
C+ + + F F P YF F N C+ + SD
Sbjct: 371 KVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTV-VSDNTV 429
Query: 326 ----------VVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++G++QQQ+ YDL F + C+
Sbjct: 430 NPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/407 (25%), Positives = 163/407 (40%), Gaps = 87/407 (21%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----------PIFNPNASS 55
Y+V + FGTPS++ +FDTGS L+ CLPC + + S P F P SS
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLV---CLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSS 146
Query: 56 TYKRIPCDDLICR---RPPFRCENGQ---------CVHRINYAGGASASGLVSTETFTFH 103
+ K I C C+ P +C C I G S +G++ TE F
Sbjct: 147 SSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDF- 205
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYA 163
+ VP + GCS + AGI GF P SL Q+ FS+CLV
Sbjct: 206 ---PDLTVPDFVVGCSIISTR-----QPAGIAGFGRGPVSLPSQMNLKR---FSHCLVS- 253
Query: 164 YREMEATSILR-----------------------FGKDANIQRKDMKTIRMFVDRSSHYY 200
R + T++ F K+ N+ K F++ +YY
Sbjct: 254 -RRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNK------AFLE---YYY 303
Query: 201 LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSF 260
L+L+ I V + A NG GG ++D+G+ TF++R +E+V F +++
Sbjct: 304 LNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNY 363
Query: 261 GRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRADFKVE-PTYMYFIF-QNEGYFCVA 317
R++ C+ + + F F + K+E P YF F N C+
Sbjct: 364 TREKDLEKETGLGPCFNISGKGDVTVPELIFEF-KGGAKLELPLSNYFTFVGNTDTVCLT 422
Query: 318 ISFSDRNS----------VVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ SD+ ++G++QQQ+ YDL F + C+
Sbjct: 423 V-VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 138/353 (39%), Gaps = 22/353 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V FGTP ++ L DT S W C CV C + S P F P S++++ + C
Sbjct: 97 YIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAPIKSTSFRNVSCGSPH 154
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C Y G +S + V +T T +PG FGC N
Sbjct: 155 CKQVPNPTCGGSACAFNFTY-GSSSIAASVVQDTLTLATDP----IPGYTFGCVNKTTGS 209
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
S LG SLL Q ++ + FSYCL +++ + + LR G +R
Sbjct: 210 SAPQQGLLGLGRGPL--SLLSQSQNLYKSTFSYCL-PSFKSINFSGSLRLGPVYQPKRIK 266
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ RSS YY++L I V + P A G + D+G + T +
Sbjct: 267 YTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPV 326
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMY 305
Y V F G + ++ CY ++TF F + + P +
Sbjct: 327 YTAVRNEFRRR---VGPKLPVTTLGGFDTCYNVP---IVVPTITFLFSGMNVTLPPDNIV 380
Query: 306 FIFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
C+A++ + N +V+ QQQ+ R ++D+ I E C
Sbjct: 381 IHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/388 (24%), Positives = 155/388 (39%), Gaps = 56/388 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G+P + ++ DTGS L W C N + +FNP +SS+Y IPC
Sbjct: 997 NVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCS 1052
Query: 64 DLICRR------PPFRCENGQCVHRI-NYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
ICR P C+ + H I +YA +S G ++++ F +PG +F
Sbjct: 1053 SPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS----ALPGTLF 1108
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLVYAYREMEATSI 172
GC + + + D G++G + S + QL GL FSYC+ +++ +
Sbjct: 1109 GCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL-----GLPKFSYCI----SGRDSSGV 1159
Query: 173 LRFGK-----DANIQRKDMKTIRM---FVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
L FG N+ + I + DR + Y + L I V + + FA
Sbjct: 1160 LLFGDLHLSWLGNLTYTPLVQISTPLPYFDRVA-YTVQLDGIRVGNKILPLPKSIFAPDH 1218
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEH----FTSFGRQR-MHNASEDWEYCYRYD 279
G G M+D+G TF+ Y + F E G + + D Y
Sbjct: 1219 TGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAG 1278
Query: 280 SRFRAYASMTFHFDRADFKVEPTYMYF-----IFQNEGYFCVAISFSD----RNSVVGAW 330
+ S++ F A+ V + + + NE +C+ SD V+G
Sbjct: 1279 GKLPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHH 1338
Query: 331 QQQDTRFVYDLNTGTIQFVPENCAN-DH 357
QQ+ +DL + F + C + DH
Sbjct: 1339 HQQNVWMEFDL----VAFAADLCGSIDH 1362
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 116/271 (42%), Gaps = 28/271 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAP---IFNPNASSTYKRIPC 62
Y + + GTP + DTGS L W QC C + C++Q+A IFNP SSTY ++ C
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 84
Query: 63 DDLICR------RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
C + C E+ C++ + Y G + G + + T +
Sbjct: 85 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRS---IDNF 141
Query: 115 IFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQL-KSTAQGLFSYCLVYAYREMEATSIL 173
IFGC DN ++G AGI+GF +S Q+ + T FSYC + + +I
Sbjct: 142 IFGCGEDNL---YNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIG 198
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
+ +D N+ M T ++ D Y + D+ V R+ P + + ++D
Sbjct: 199 PYARDINL----MWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT-----IVD 249
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQR 264
+G T+I ++ + + + + G R
Sbjct: 250 SGTADTYILSPVFDALDKAMTKEMQAKGYTR 280
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 153/362 (42%), Gaps = 50/362 (13%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPI--FNPNASSTYKR 59
+++ Y + V G+P +S + DTGS L+W +C N + +AP F+P+ SSTY R
Sbjct: 97 RSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGR 156
Query: 60 IPCDDLICRR-PPFRCENGQ-CVHRINYAGGASASGLVSTETFTFH-----LKNKLVCVP 112
+ C C C++G C + Y G++ +G++STETFTF + V +
Sbjct: 157 VSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSPRQVRIG 216
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
GV FGCS + G+ G +VS + LG S + FSYCLV + A+S
Sbjct: 217 GVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGR-RFSYCLV--PHSVNASSA 273
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG A++ + + +++ S A RI ++
Sbjct: 274 LNFGALADVTEPGAASTPLVGNKTVA--------SAASSRI-----------------IV 308
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR----FRAYASM 288
D+G TF+ ++ T + + + CY R + +
Sbjct: 309 DSGTTLTFLDPSLLGPIVDELSRRIT---LPPVQSPDGLLQLCYNVAGREVEAGESIPDL 365
Query: 289 TFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTG 344
T F A ++P + Q EG C+AI + S++G QQ+ YDL+ G
Sbjct: 366 TLEFGGGAAVALKPENAFVAVQ-EGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAG 424
Query: 345 TI 346
T+
Sbjct: 425 TV 426
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 146/360 (40%), Gaps = 37/360 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P+ ++ + DTGS + W QC PC C ++ +F+P+ASSTY C
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAA 190
Query: 67 C-----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C + C + QC + ++Y G+S +G S++T T + G FGCS
Sbjct: 191 CVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLGSN----AIKGFQFGCSQ- 245
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL-----VYAYREMEATSILRFG 176
+ F G++G SL+ Q T FSYCL + + A S F
Sbjct: 246 SESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPGSSGFLTLGAASRSGFV 305
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
K ++ + T +Y + L+ I V ++ F + G ++D+G
Sbjct: 306 KTPMLRSTQIPT---------YYGVLLEAIRVGGQQLNIPTSVF------SAGSVMDSGT 350
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRA 295
+ T + Y + F + + S + C+ + + + S+ F
Sbjct: 351 VITRLPPTAYSALSSAFKAGMKKYPPAQ---PSGILDTCFDFSGQSSVSIPSVALVFSGG 407
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSV--VGAWQQQDTRFVYDLNTGTIQFVPENC 353
V + + + + + + SD +S+ +G QQ+ +YD+ G + F C
Sbjct: 408 AV-VNLDFNGIMLELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 149/366 (40%), Gaps = 52/366 (14%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+N+ Y + + TP L DTGS L+W +C P + ASS+Y R+PC
Sbjct: 72 QNFEYLMALDVSTPPVRMLALADTGSSLVWLKC---------KLPAAHTPASSSYARLPC 122
Query: 63 DDLICRR--PPFRCE-----NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
D C+ C N CV+R +A G+ +G V+ + FTF + +
Sbjct: 123 DAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTR--------LD 174
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSIL 173
FGC+ S + G++G + P SL+ QL K+ FSYCLV +S L
Sbjct: 175 FGCATRTEGLSVPDD--GLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSL 232
Query: 174 RFGKDANIQRKD-MKTIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
FG A + T + R+ +Y ++L I VA + T L +
Sbjct: 233 NFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTTKL--------I 284
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA------- 284
+D+G + T++ + + ++ + R+ + + C YD R RA
Sbjct: 285 VDSGTMLTYLPKAVLDPLVAALT---AAIKLPRVKSPETLYAVC--YDVRRRAPEDVGKS 339
Query: 285 YASMTFHFDRADFKVEPTYMYFIFQNEG-YFCVAISFSDRNS-VVGAWQQQDTRFVYDLN 342
+T P F+ +N+G C+A+ S ++G QQ+ +DL
Sbjct: 340 IPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLE 399
Query: 343 TGTIQF 348
T+ F
Sbjct: 400 RRTVSF 405
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 151/371 (40%), Gaps = 34/371 (9%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA----------PIFNPNA 53
N +YT + GTPS+ L+ D+GS + + C C C N + P F P+
Sbjct: 88 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 147
Query: 54 SSTYKRIPCD-DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVP 112
SSTY + C+ D C E QC + YA +S+SG++ + +F +++L
Sbjct: 148 SSTYSPVKCNVDCTCDN-----ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQR 202
Query: 113 GVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSI 172
V FGC N F + GI+G S++ QL S+ L Y ++ ++
Sbjct: 203 AV-FGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTM 261
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
+ G A DM RS +Y + L++I VA + P F N G ++
Sbjct: 262 VLGGMPA---PPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF----NSKHGTVL 314
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH- 291
D+G ++ + S + R + + + C+ R + S F
Sbjct: 315 DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYK-DICFAGAGRNVSQLSEVFPD 373
Query: 292 -----FDRADFKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNT 343
+ + P Y++ + EG +C+ + + D +++G ++T YD +
Sbjct: 374 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 433
Query: 344 GTIQFVPENCA 354
I F NC+
Sbjct: 434 EKIGFWKTNCS 444
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 146/364 (40%), Gaps = 29/364 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + L DT + W+ C PC C S F P +SS+Y +PC
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDW 136
Query: 67 CRRPPFRCENGQCVHRINYAGGASA---SGLVSTETFTFHLKNKLV-----CVPGVIFGC 118
C P F E C + + A S + +F L + + + G FGC
Sbjct: 137 C--PLF--EGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAFGC 192
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ + G+LG P SLL Q ST G+FSYCL +YR + LR G
Sbjct: 193 VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLP-SYRSYYFSGSLRLG-- 249
Query: 179 ANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A Q ++++ + + R S YY+++ +SV + G+FA G +ID+G
Sbjct: 250 AAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGT 309
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRA 295
+ T Y + F + + + ++ C+ D A +T H D
Sbjct: 310 VITRWTAPVYAALREEFRRQVAA---PSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGG 366
Query: 296 DFKVEPTYMYFIFQNEGYF-CVAISFSDR-----NSVVGAWQQQDTRFVYDLNTGTIQFV 349
P I + C+A++ + + +VV QQQ+ R V D+ + F
Sbjct: 367 VDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFA 426
Query: 350 PENC 353
E C
Sbjct: 427 REPC 430
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 148/384 (38%), Gaps = 49/384 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP--CVNCFNQSAPIFNPNASSTYKRIP 61
N TV + GTP ++ ++ DTGS L W C P +SA F P AS T+ +P
Sbjct: 62 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVP 121
Query: 62 CDDLICRR----PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
C CR P C+ QC ++YA G+S+ G ++TE FT L
Sbjct: 122 CGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAA---- 177
Query: 116 FGCSNDNRDFSFDG-NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
FGC D S DG AG+LG + S + Q + FSYC+ + + +L
Sbjct: 178 FGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRR---FSYCI----SDRDDAGVLL 230
Query: 175 FGKD---------ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
G + + M + DR + Y + L I V + A
Sbjct: 231 LGHSDLPFLPLNYTPLYQPAMPLP--YFDRVA-YSVQLLGIRVGGKPLPIPASVLAPDHT 287
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS----EDWEYCYRYDSR 281
G G M+D+G TF+ Y + F T +++ + E ++ C+R
Sbjct: 288 GAGQTMVDSGTQFTFLLGDAYSALKAEFSRQ-TKPWLPALNDPNFAFQEAFDTCFRVPQG 346
Query: 282 FRAYA---SMTFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNS----VVGA 329
A ++T F+ A V + + E G +C+ +D V+G
Sbjct: 347 RAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGH 406
Query: 330 WQQQDTRFVYDLNTGTIQFVPENC 353
Q + YDL G + P C
Sbjct: 407 HHQMNVWVEYDLERGRVGLAPIRC 430
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/403 (23%), Positives = 149/403 (36%), Gaps = 63/403 (15%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC----------LPCVNCFNQSAP-----IFNP 51
Y V GTP++ L+ DTGS L W +C + AP F P
Sbjct: 87 YFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRP 146
Query: 52 NASSTYKRIPCDDLICRRP-PFR-----CENGQCVHRINYAGGASASGLVSTETFTFHLK 105
+ S T+ IPC CR PF C + Y G++A G V ++ T L
Sbjct: 147 DKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALS 206
Query: 106 NKLV---CVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVY 162
+ + GV+ GC+ SF + G+L S S + S G FSYCLV
Sbjct: 207 GRAARKAKLRGVVLGCTTSYNGQSFLAS-DGVLSLGYSNISFASRAASRFGGRFSYCLVD 265
Query: 163 AYREMEATSILRFGKDANIQRK-----------------------DMKTIRMFVDRSSH- 198
ATS L FG + + + + +D +
Sbjct: 266 HLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRP 325
Query: 199 -YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHF 257
Y ++++ +SVA + + + + GG ++D+G T + + Y V+ +
Sbjct: 326 FYAVTVKGVSVAGELLKIPRAVWDVEQG--GGAILDSGTSLTMLAKPAYRAVVAALSKRL 383
Query: 258 TSFGRQRMHNASEDWEYCYRYDSRFRAYAS-----MTFHFDRADFKVEPTYMYFIFQNEG 312
R M + ++YCY + S + + + HF + P Y I G
Sbjct: 384 AGLPRVTM----DPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPG 439
Query: 313 YFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
C+ + SV+G QQ+ + YDL ++F C
Sbjct: 440 VKCIGLQEGPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 151/377 (40%), Gaps = 39/377 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-----IFNPNASSTYKRI 60
Y V G+P + DTGS ++W C C NC + S F+ S T +
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 61 PCDDLIC----RRPPFRC-ENGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVC--- 110
C D IC + +C EN QC + Y G+ SG T+TF F L LV
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSS 218
Query: 111 VPGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
P ++FGCS + D + D + GI GF S++ QL S +FS+CL +
Sbjct: 219 AP-IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL---KGD 274
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
+ G+ I M V HY L+L I V + F +
Sbjct: 275 GSGGGVFVLGE---ILVPGM-VYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVF--EASN 328
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AY 285
T G ++DTG T++ + Y++ + + + N E CY + +
Sbjct: 329 TRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG----EQCYLVSTSISDMF 384
Query: 286 ASMTFHF-DRADFKVEPT---YMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYD 340
S++ +F A + P + Y I+ +C+ + + +++G +D FVYD
Sbjct: 385 PSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYD 444
Query: 341 LNTGTIQFVPENCANDH 357
L I + +C +H
Sbjct: 445 LARQRIGWASYDCKCNH 461
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 153/361 (42%), Gaps = 37/361 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP + DTGS L+W QCLPC+ C+ QS PIF+P S+++ +PC+
Sbjct: 92 YLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQN 151
Query: 67 CRR-PPFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C+ C G C + Y G + E T + V VI GC +++
Sbjct: 152 CKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSS----VKSVI-GCGHESGG 206
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGL---FSYCLVYAYREMEATSILRFGKDANI 181
+G++G SL+ Q+ T+ G+ FSYCL A + FG++A +
Sbjct: 207 GFG--FASGVIGLGGGQLSLVSQMSQTS-GISRRFSYCLPTLLS--HANGKINFGQNAVV 261
Query: 182 QRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
+ + + ++YY++L+ IS+ + R + G +ID+G +F
Sbjct: 262 SGPGVVSTPLISKNPVTYYYVTLEAISIGNER--------HMASAKQGNVIIDSGTTLSF 313
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS----MTFHFD-RA 295
+ + Y+ V+ +R+ + W+ C+ D A +S +T F A
Sbjct: 314 LPKELYDGVVSSL---LKVVKAKRVKDPGNFWDLCFD-DGINVATSSGIPIITAQFSGGA 369
Query: 296 DFKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
+ + P + N C+ + S +D ++G + YDL + F P
Sbjct: 370 NVNLLPVNTFQKVANN-VNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTV 428
Query: 353 C 353
C
Sbjct: 429 C 429
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 138/336 (41%), Gaps = 23/336 (6%)
Query: 24 FDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPFRCENGQCVHR 82
DT S + W +PC C S+ +FN AS+TYK + C C++ P C G C
Sbjct: 1 MDTSSDVAW---IPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFN 57
Query: 83 INYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPF 142
+ Y GG+S + +S +T T VPG FGC S LG
Sbjct: 58 LTY-GGSSLAANLSQDTITLATD----AVPGYSFGCIQKATGGSLPAQGLLGLGRGPL-- 110
Query: 143 SLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLS 202
SLL Q ++ Q FSYCL +++ + + LR G +R + R S Y+++
Sbjct: 111 SLLSQTQNLYQSTFSYCLP-SFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVN 169
Query: 203 LQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGR 262
L + V + PG+F + G + D+G + T + Y V F GR
Sbjct: 170 LMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNR---VGR 226
Query: 263 QRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSD 322
+ ++ CY A ++TF F + + P + C+A++ +
Sbjct: 227 NLTVTSLGGFDTCYTVP---IAAPTITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAP 283
Query: 323 RN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
N +V+ QQQ+ R +YD+ + E C
Sbjct: 284 DNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 319
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 147/360 (40%), Gaps = 43/360 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P+K++ +L D+GS + W QC PC+ C +Q P+F+P+ SSTY C
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAA 190
Query: 67 CRRPPFRCENG-----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C + + NG QC + + YA G+S +G S++T + FGCS
Sbjct: 191 CAQ-LGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLALGSNT----ISNFQFGCS-- 243
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ + F+ G++G SL Q T FSYCL ++ L G +
Sbjct: 244 HVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLP---PTPSSSGFLTLGAGTS- 299
Query: 182 QRKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
++ + RSS Y + L+ I V ++ F + G ++D+G I
Sbjct: 300 -----GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGMVMDSGTI 348
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRAD 296
T + R Y + F + R + C+ + + S+ F
Sbjct: 349 ITRLPRTAYSALSSAFKAGMKQY---RPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGA 405
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V I N C+A + + +S +VG QQ+ +YD+ G + F C
Sbjct: 406 V-VNLDANGIILGN----CLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 43/365 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC---LPCVNCFNQSAPIFNPNASSTYKRIPCD 63
Y +V G+P + +L+ DTGS W C V C ++ + + S +
Sbjct: 113 YFAEVKVGSPGQRFWLVVDTGSEFTWLNCSKSFEAVTCASRKCKV---DLSELFSLS--- 166
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDN 122
+C +P + C++ I+YA G+SA G T++ T L N K + + GC+
Sbjct: 167 --VCPKP-----SDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGCTKSM 219
Query: 123 RD-FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN- 180
+ +F+ GILG + S + + + FSYCLV +S L G N
Sbjct: 220 LNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIGGHHNA 279
Query: 181 -----IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
I+R ++ F Y +++ IS+ + P + N GG +ID+G
Sbjct: 280 KLLGEIRRTELILFPPF------YGVNVVGISIGGQMLKIPPQVWDF--NAEGGTLIDSG 331
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW---EYCYRYDS-RFRAYASMTFH 291
T + YE V + T R ED+ E+C+ + + FH
Sbjct: 332 TTLTSLLLPAYEAVFEALTKSLTKVKRV----TGEDFDALEFCFDAEGFDDSVVPRLVFH 387
Query: 292 FDRADFKVEPTYMYFIFQNEGYFCVAISFSDR---NSVVGAWQQQDTRFVYDLNTGTIQF 348
F P Y I C+ I D SV+G QQ+ + +DL+T T+ F
Sbjct: 388 FAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGF 447
Query: 349 VPENC 353
P C
Sbjct: 448 APSTC 452
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/375 (23%), Positives = 159/375 (42%), Gaps = 37/375 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDD-L 65
Y + G+P + L+ DTGS L W +CLPC C I++ S +YK + C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQ 159
Query: 66 ICRRPP----FRCENG-QCVHRINYAGGASASGLVSTETFTFH--LKNKLVCVPGVIFGC 118
+C C G QC Y G+ + G +ST+T + K V V FGC
Sbjct: 160 LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGC 219
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ + + G +GILG + +L QL FS+C + +T ++ FG +
Sbjct: 220 AQGDLELVPTG-ASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFG-N 277
Query: 179 ANIQRKDMKTIRMFVDRS----SHYYLSLQDISVADHRIGFAP-GTFALRRNGTGGCMID 233
A + + ++ + + S Y+++L+ +S+ H + P G+ + +G+
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVVILDSGS------ 331
Query: 234 TGAIATFIQRGPYEVVMRH-FDEHFTSFGRQRMHNASEDWEYCYRY-----DSRFRAYAS 287
+ ++F++ P+ +R F +H + ++ D C++ D R S
Sbjct: 332 --SFSSFVR--PFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPS 387
Query: 288 MTFHFDRADFKVEPTYMYFI----FQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDL 341
++ F+ P+ + +QN C A N +V+G +QQQ+ YD+
Sbjct: 388 LSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDI 447
Query: 342 NTGTIQFVPENCAND 356
+ F +C D
Sbjct: 448 QRSRVGFARASCVID 462
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 92/207 (44%), Gaps = 19/207 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN-CFNQSAPIFNPNASSTYKRIPCDDL 65
Y V V G+P + +FDTGS L WTQC PCV C+ Q IF+P+ S +Y + CD
Sbjct: 89 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 148
Query: 66 ICRRPPFR------CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C + C + C++ I Y G+ + G + E + + FGC
Sbjct: 149 SCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFN---NFQFGCG 205
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+NR G AG+LG + +P SL+ Q +FSYCL + +T L FG
Sbjct: 206 QNNRGLF--GGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSS---SSSTGYLSFGSGD 260
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDI 206
D K ++ Y S+Q +
Sbjct: 261 G----DSKAVKFTPRLPPTVYSSVQKV 283
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 114/256 (44%), Gaps = 33/256 (12%)
Query: 1 HEKNYF-----YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASS 55
H N F + VDV FGTP ++ L+ DTGS + WTQC CVNC S FN +ASS
Sbjct: 117 HNNNLFDEDGNFLVDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASS 176
Query: 56 TYKRIPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
TY C P EN + + Y +++ G +T T +
Sbjct: 177 TYSSGSCI-------PGTVENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVFQ---KFQ 223
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FGC +N+ F + G+LG S + Q S +FSYCL E ++ L F
Sbjct: 224 FGCGRNNKG-DFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCL----PEEDSIGSLLF 278
Query: 176 GKDANIQRKDMKTIRMF-----VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
G+ A Q +K + + S +Y+++L DISV + R+ FA + G
Sbjct: 279 GEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFA-----SPGT 333
Query: 231 MIDTGAIATFIQRGPY 246
+ID+ + T + + Y
Sbjct: 334 IIDSRTVITRLPQRAY 349
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 157/364 (43%), Gaps = 30/364 (8%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C +C P F P+ S TY+ + C
Sbjct: 86 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC- 144
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
P C + QC++ YA +S+SG++ + +F ++L V FGC ND
Sbjct: 145 -----TPDCNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAV-FGCEND 198
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ GI+G S++ QL S+ L Y ++ +++ G
Sbjct: 199 ETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGISP-- 256
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+DM DRS +Y ++L+++ VA ++ P F +G G ++D+G ++
Sbjct: 257 -PEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVF----DGKHGTVLDSGTTYAYL 311
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYR-----YDSRFRAYASMTFHFDRA 295
+ R + S ++++ ++ + C+ +++ + F+
Sbjct: 312 PETAFLAFKRAIMKERNSL--KQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENG 369
Query: 296 -DFKVEP-TYMYFIFQNEGYFCVAISFS---DRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
+ P Y++ + G +C+ + FS D +++G ++T +YD I F
Sbjct: 370 HKLSLSPENYLFRHSKVRGAYCLGV-FSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWK 428
Query: 351 ENCA 354
NC+
Sbjct: 429 TNCS 432
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/361 (22%), Positives = 153/361 (42%), Gaps = 24/361 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P +SSTY+ + C
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 168
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QCV+ YA +++SG++ + +F +++L V FGC N
Sbjct: 169 IDCNC-----DGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAV-FGCENVE 222
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ + GI+G S++ QL S+ L Y ++ +++ G
Sbjct: 223 TGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGGISP--- 279
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM DRS +Y + L+++ VA R+ F +G G ++D+G ++
Sbjct: 280 PSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVF----DGKHGTVLDSGTTYAYLP 335
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF------DRAD 296
+ + S + + + + + C+ + S +F +
Sbjct: 336 EAAFLAFKDAIVKELQSLKQISGPDPNYN-DICFSGAGNDVSQLSKSFPVVDMVFGNGHK 394
Query: 297 FKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + P YM+ + G +C+ I + +D+ +++G ++T +YD I F NC
Sbjct: 395 YSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 454
Query: 354 A 354
A
Sbjct: 455 A 455
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/270 (25%), Positives = 116/270 (42%), Gaps = 16/270 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP ++ L+ DTGS + + C C C P F P SSTY+ + C+
Sbjct: 87 NGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCN 146
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C E QCV+ YA +S+SG++ + +F +++LV IFGC N
Sbjct: 147 IDCTCDN-----ERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELV-PQRAIFGCENQE 200
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ GI+G S++ QL S+ L Y ++ +++ G
Sbjct: 201 TGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISP--- 257
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
M RS +Y + L+ I VA ++ P F +G G ++D+G ++
Sbjct: 258 PSGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIF----DGKHGTVLDSGTTYAYLP 313
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDW 272
+ + TS +++H ++
Sbjct: 314 EAAFTAFKDAMMKELTSL--KQIHGPDPNY 341
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 114/267 (42%), Gaps = 28/267 (10%)
Query: 11 VLFGTPSKSEFLLFDTGSYLIWTQCLPC-VNCFNQSAP---IFNPNASSTYKRIPCDDLI 66
+ GTP + DTGS L W QC C + C++Q+A IFNP SSTY ++ C
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEA 62
Query: 67 CR------RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C + C E+ C++ + Y G + G + + T + IFGC
Sbjct: 63 CNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRS---IDNFIFGC 119
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQL-KSTAQGLFSYCLVYAYREMEATSILRFGK 177
DN ++G AGI+GF +S Q+ + T FSYC + + +I + +
Sbjct: 120 GEDNL---YNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIGPYAR 176
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
D N+ M T ++ D Y + D+ V R+ P + + ++D+G
Sbjct: 177 DINL----MWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT-----IVDSGTA 227
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQR 264
T+I ++ + + + + G R
Sbjct: 228 DTYILSPVFDALDKAMTKEMQAKGYTR 254
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/369 (23%), Positives = 145/369 (39%), Gaps = 41/369 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
+Y+ T+++ G P+K FL DTGS L W QC PC +C P + P + K +PC
Sbjct: 72 HYYVTMNI--GDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKN---KIVPC 126
Query: 63 DDLICRR--PPFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C P +C QC ++I Y AS+ G++ + FT L+N + FGC
Sbjct: 127 AASLCTSLTPNKKCAVPQQCDYQIKYTDKASSLGVLIADNFTLSLRNSSTVRANLTFGCG 186
Query: 120 NDN---RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
D ++ + G+LG SLL QLK QG+ L + + L FG
Sbjct: 187 YDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQ--QGVTKNVLGHCF-STNGGGFLFFG 243
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHR-IGFAPGTFALRRNGTGGCMIDTG 235
D + + + M S +YY D R +G P + D+G
Sbjct: 244 DDI-VPTSRVTWVPMARTTSGNYYSPGSGTLYFDRRSLGMKPMEV----------VFDSG 292
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-------YASM 288
+ + PY+ + + + + C++ F++ + S+
Sbjct: 293 STYAYFAAEPYQATVSALKAGLS---KSLKEVSDVSLPLCWKGQKVFKSVSEVKNDFKSL 349
Query: 289 TFHFDRADFKVEPTYMYFIFQNEGYFCVAI----SFSDRNSVVGAWQQQDTRFVYDLNTG 344
F + P Y I G C+ I + + +++G QD +YD G
Sbjct: 350 FLSFGKNSVMEIPPENYLIVTKYGNVCLGILDGTTAKLKFNIIGDITMQDQMIIYDNEKG 409
Query: 345 TIQFVPENC 353
+ ++ +C
Sbjct: 410 QLGWIRGSC 418
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 143/362 (39%), Gaps = 38/362 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y V G+P K F+ DTGS ++W C PC C N FNP+ SST +I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 61 PCDDLICRRPPFRCE-------NGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVC- 110
PC D C E N C + Y G+ SG ++T F + N+
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209
Query: 111 -VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYR 165
++FGCSN D + D + GI GF S++ QL S + +FS+CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL---KG 266
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
IL G+ I + V HY L+L+ I V ++ F +
Sbjct: 267 SDNGGGILVLGE---IVEPGL-VYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFT--TS 320
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
T G ++D+G ++ G Y+ + + R + ++ + DS F
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTV 380
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQ----NEGYFCVAI--SFSDRNSVVGAWQQQDTRFVY 339
+ + V+P Y + Q N +C+ + + +++G +D FVY
Sbjct: 381 S--LYFMGGVAMTVKPEN-YLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVY 437
Query: 340 DL 341
DL
Sbjct: 438 DL 439
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 142/362 (39%), Gaps = 38/362 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y V G+P K F+ DTGS ++W C PC C N FNP+ SST +I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 61 PCDDLICRRPPFRCE-------NGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVC- 110
PC D C E N C + Y G+ SG ++T F + N+
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTAN 209
Query: 111 -VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYR 165
++FGCSN D + D + GI GF S++ QL S + +FS+CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL---KG 266
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
IL G+ I + V HY L+L+ I V ++ F
Sbjct: 267 SDNGGGILVLGE---IVEPGL-VYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSN- 321
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
T G ++D+G ++ G Y+ + + R + ++ + DS F
Sbjct: 322 -TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTV 380
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQ----NEGYFCVAI--SFSDRNSVVGAWQQQDTRFVY 339
+ + V+P Y + Q N +C+ + + +++G +D FVY
Sbjct: 381 S--LYFMGGVAMTVKPEN-YLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVY 437
Query: 340 DL 341
DL
Sbjct: 438 DL 439
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 144/362 (39%), Gaps = 34/362 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYK------- 58
Y V V GTP K L+FDTGS + WTQC PC +C+ Q IF+P+ S++Y
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSS 208
Query: 59 -RIPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
P C + CV+ I Y + + G TE T + + FG
Sbjct: 209 ICNSLTSATGNTP--GCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAF---NNIYFG 263
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C +N+ AG+LG S++ Q +FSYCL +T L FG
Sbjct: 264 CGQNNQGLFG--GSAGLLGLGRDKLSVVSQTAQKYNKIFSYCLP---SSSSSTGFLTFGG 318
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
A+ K + S Y L ISV ++ + F+ T G +ID+G +
Sbjct: 319 SASKNAK-FTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFS-----TAGAIIDSGTV 372
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRA- 295
T + Y + F + + M A + CY + S + + F F
Sbjct: 373 ITRLPPAAYSALRASFRNLMSKY---PMTKALSILDTCYDFSSYTTISVPKIGFSFSSGI 429
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISF-SDRNSV--VGAWQQQDTRFVYDLNTGTIQFVPEN 352
+ ++ T + + + C+A + SD V G QQ+ YD + G + F P
Sbjct: 430 EVDIDATGILYA-SSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGG 488
Query: 353 CA 354
C+
Sbjct: 489 CS 490
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 149/364 (40%), Gaps = 52/364 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
+Y+ T+++ G P+K FL DTGS L W QC PC +C P++ P A+ + +PC
Sbjct: 52 HYYVTMNI--GNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTAN---RLVPC 106
Query: 63 DDLIC------RRPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+ +C + +C + QC ++I Y AS+ G++ ++F+ +++ + PG+
Sbjct: 107 ANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSNI-RPGLT 165
Query: 116 FGCSNDN---RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEAT 170
FGC D ++ + I G+LG SL+ QLK + + +CL
Sbjct: 166 FGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL-----STNGG 220
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHR-IGFAPGTFALRRNGTGG 229
L FG D + + + M S +YY D R +G P
Sbjct: 221 GFLFFGDDV-VPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV--------- 270
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----- 284
+ D+G+ T+ PY+ V+ + +Q + C++ F++
Sbjct: 271 -VFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQV---SDPTLPLCWKGQKAFKSVFDVK 326
Query: 285 --YASMTFHFDRADFKVE--PTYMYFIFQNEGYFCVAI----SFSDRNSVVGAWQQQDTR 336
+ SM F A P Y I G C+ I + +V+G QD
Sbjct: 327 NEFKSMFLSFSSAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQM 386
Query: 337 FVYD 340
+YD
Sbjct: 387 VIYD 390
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 9/153 (5%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
GT + S+ ++ D+GS + W QC PC + C Q P+F+P S+TY +PC C R
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 71 PFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSF 127
P+R N QC I YA GA+A+G S++ T + V G +FGC++ ++ +F
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD---VVRGFLFGCAHADQGSTF 191
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL 160
++AG L S + Q S +FSYC+
Sbjct: 192 SYDVAGTLALGGGSQSFVQQTASQYSRVFSYCV 224
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 111/269 (41%), Gaps = 20/269 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN---CFNQSAPIFNPNASSTYKRIPCD 63
Y + V G+P+ ++ ++ DTGS + W QC PC C + +F+P ASSTY C
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167
Query: 64 DLICRRPPFRCE-NG-----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C + E NG +C + + Y G++ +G S++ T + V G FG
Sbjct: 168 AAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSD---VVRGFQFG 224
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
CS+ D G++G S + Q + F YCL ++
Sbjct: 225 CSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTLGAPAS 284
Query: 178 DANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
T M + ++Y+ +L+DI+V ++G +P FA G ++D+G
Sbjct: 285 GGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFA------AGSLVDSG 338
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQR 264
+ T + Y + F T + R
Sbjct: 339 TVITRLPPAAYAALSSAFRAGMTRYARAE 367
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 73/142 (51%), Gaps = 13/142 (9%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
YF V V GTPS L+ DTGS L+W QC PC C+ Q +F+P SSTY+R+
Sbjct: 82 ESGEYFALVGV--GTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRV 139
Query: 61 PCDDLICRRPPFR-CEN-----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
PC CR F C++ G C + + Y G+S++G ++T+ F V V
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDTY---VNNV 196
Query: 115 IFGCSNDNRDFSFDGNIAGILG 136
GC DN FD + AG+LG
Sbjct: 197 TLGCGRDNEGL-FD-SAAGLLG 216
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 73/140 (52%), Gaps = 10/140 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP K +++ DTGS ++W QC PC C++Q+ P+F+P S ++ I C
Sbjct: 173 EYFTRLGV--GTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCR 230
Query: 64 DLICRR--PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+C R P C++++ Y G+ G STET TF + VP V GC +D
Sbjct: 231 SPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTF----RGTRVPKVALGCGHD 286
Query: 122 NRDFSFDGNIAGILGFSVSP 141
N F G AG+LG P
Sbjct: 287 NEGL-FVG-AAGLLGLGRQP 304
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/388 (24%), Positives = 144/388 (37%), Gaps = 95/388 (24%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPC----------VNCFNQSAPIFNPNASSTYKRIPC 62
G P + + DTGS L+WTQC C CF Q+ P +N + S T + +PC
Sbjct: 84 IGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRTARAVPC 143
Query: 63 DD---LICRRPP--FRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCV 111
DD +C P C G CV +Y G A G++ T+ FTF + +
Sbjct: 144 DDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGV-ALGVLGTDAFTFPSSSSVT-- 200
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
+ FGC + R +SP +L G A+
Sbjct: 201 --LAFGCVSQTR---------------ISPGALTG----------------------ASG 221
Query: 172 ILRFGKDA-NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG---- 226
I+ G+ A ++ KD S+ YYL L ++ + + G F LR
Sbjct: 222 IIGLGRGALSLNPKDSPF-------STFYYLPLVGLAAGNATVALPAGAFDLREAAPKVW 274
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA--SEDWEYCYRY----DS 280
GG +ID+G+ T + + + + G A E C DS
Sbjct: 275 AGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDGDS 334
Query: 281 -RFRAYASMTFHFDRADFK----VEPTYMYFIFQNEGYFCVAISFS---------DRNSV 326
A S+ FD V P Y+ +C+A+ S + ++
Sbjct: 335 LAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATLPTNETTI 394
Query: 327 VGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+G + QQD R +YDL G + F P NC+
Sbjct: 395 IGNFMQQDMRVLYDLANGLLSFQPANCS 422
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 150/367 (40%), Gaps = 34/367 (9%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+N +YT + GTP + L+ DTGS + + C C +C + P F P AS TY+ + C
Sbjct: 89 RNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC 148
Query: 63 DDLICRRPPFRC----ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
++C + QC + YA +++SG++ + +F +++L IFGC
Sbjct: 149 T--------WQCNCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSEL-SPQRAIFGC 199
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFG 176
ND ++ GI+G S++ QL K FS C A +
Sbjct: 200 ENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGIS 259
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A DM RS +Y + L++I VA R+ P F +G G ++D+G
Sbjct: 260 PPA-----DMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVF----DGKHGTVLDSGT 310
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
++ + + S R + + + C+ + S +F
Sbjct: 311 TYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYN-DICFSGAEINVSQLSKSFPVVEMV 369
Query: 297 F------KVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
F + P Y++ + G +C+ + + +D +++G ++T +YD I
Sbjct: 370 FGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIG 429
Query: 348 FVPENCA 354
F NC+
Sbjct: 430 FWKTNCS 436
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 150/374 (40%), Gaps = 39/374 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-----IFNPNASSTYKRI 60
Y V G+P + DTGS ++W C C NC + S F+ S T +
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 61 PCDDLIC----RRPPFRC-ENGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVC--- 110
C D IC + +C EN QC + Y G+ SG T+TF F L LV
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSS 218
Query: 111 VPGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
P ++FGCS + D + D + GI GF S++ QL S +FS+CL +
Sbjct: 219 AP-IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL---KGD 274
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
+ G+ I M V HY L+L I V + F +
Sbjct: 275 GSGGGVFVLGE---ILVPGM-VYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVF--EASN 328
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AY 285
T G ++DTG T++ + Y++ + + + N E CY + +
Sbjct: 329 TRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG----EQCYLVSTSISDMF 384
Query: 286 ASMTFHF-DRADFKVEP---TYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYD 340
S++ +F A + P + Y I+ +C+ + + +++G +D FVYD
Sbjct: 385 PSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYD 444
Query: 341 LNTGTIQFVPENCA 354
L I + +C+
Sbjct: 445 LARQRIGWASYDCS 458
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 84/360 (23%), Positives = 140/360 (38%), Gaps = 62/360 (17%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP-------PFRC 74
++ DTGS L W QC PC C+ Q P+F+P+ S++Y +PC+ C P C
Sbjct: 124 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 183
Query: 75 ----------ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
++ +C + + Y G+ + G+++T+T + V G +FGC NR
Sbjct: 184 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS----VDGFVFGCGLSNRG 239
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
G+ A + SP G +A L G D + R
Sbjct: 240 LRRPGSAAS--SPTASPPGTSG---------------------DAAGSLSLGGDTSSYRN 276
Query: 185 DMKT--IRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
RM D + + +++ A N ++D+G + T +
Sbjct: 277 ATPVSYTRMIADPAQPPFY-FMNVTGASVGGAAVAAAGLGAAN----VLLDSGTVITRLA 331
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNAS--EDWEYCYRYDSRFRAYAS-MTFHFDR-ADFK 298
Y V F FG +R A + CY +T + AD
Sbjct: 332 PSVYRAVRAEFARQ---FGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMT 388
Query: 299 VEPTYMYFIFQNEG-YFCVA---ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
V+ M F+ + +G C+A +SF D+ ++G +QQ++ R VYD + F E+C+
Sbjct: 389 VDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 149/364 (40%), Gaps = 52/364 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
+Y+ T+++ G P+K FL DTGS L W QC PC +C P++ P A+ + +PC
Sbjct: 52 HYYVTMNI--GNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTAN---RLVPC 106
Query: 63 DDLIC------RRPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
+ +C + +C + QC ++I Y AS+ G++ ++F+ +++ + PG+
Sbjct: 107 ANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSNI-RPGLT 165
Query: 116 FGCSNDN---RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEAT 170
FGC D ++ + I G+LG SL+ QLK + + +CL
Sbjct: 166 FGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL-----STNGG 220
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHR-IGFAPGTFALRRNGTGG 229
L FG D + + + M S +YY D R +G P
Sbjct: 221 GFLFFGDDV-VPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV--------- 270
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----- 284
+ D+G+ T+ PY+ V+ + +Q + C++ F++
Sbjct: 271 -VFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQV---SDPTLPLCWKGQKAFKSVFDVK 326
Query: 285 --YASMTFHFDRADFKVE--PTYMYFIFQNEGYFCVAI----SFSDRNSVVGAWQQQDTR 336
+ SM F A P Y I G C+ I + +V+G QD
Sbjct: 327 NEFKSMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQM 386
Query: 337 FVYD 340
+YD
Sbjct: 387 VIYD 390
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 156/380 (41%), Gaps = 52/380 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC---FNQSAPI--FNPNASSTYKRI 60
Y V GTP ++ L DTGS L+W C PC+ C + PI ++ AS++ ++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 61 PCDDLICRRPPFRCENG-----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
PC D C E+G QC + Y G+ G + + + + VI
Sbjct: 95 PCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT----VI 150
Query: 116 FGCS-NDNRDFSF-DGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREMEATS 171
FGC + D S + + GI+GF S S QL + +F++CL R
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGER---GGG 207
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
IL G N+ D++ + V SHY + LQ ISV + + P F+ + G +
Sbjct: 208 ILVLG---NVIEPDIQYTPL-VPYMSHYNVVLQSISVNNANLTIDPKLFS--NDVMQGTI 261
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF--RAYASMT 289
D+G ++ DE + +F Q + + C SRF + + ++
Sbjct: 262 FDSGTTLAYLP-----------DEAYQAF-TQAVSLVVAPFLLCDTRLSRFIYKLFPNVV 309
Query: 290 FHFDRADFKVEPTYMYFIFQ----NEGYFCV------AISFSDRNSVVGAWQQQDTRFVY 339
+F+ A + P Y I Q N +C+ + + ++ G ++ VY
Sbjct: 310 LYFEGASMTLTPA-EYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVY 368
Query: 340 DLNTGTIQFVPENCANDHFL 359
DL G I + P +C FL
Sbjct: 369 DLERGRIGWRPFDCKTSFFL 388
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 141/361 (39%), Gaps = 38/361 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRIP 61
Y V G+P K F+ DTGS ++W C PC C N FNP+ SST +IP
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 62 CDDLICRRPPFRCE-------NGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVC-- 110
C D C E N C + Y G+ SG ++T F + N+
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 111 VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
++FGCSN D + D + GI GF S++ QL S + +FS+CL
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL---KGS 293
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
IL G+ I + V HY L+L+ I V ++ F
Sbjct: 294 DNGGGILVLGE---IVEPGL-VYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSN-- 347
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
T G ++D+G ++ G Y+ + + R + ++ + DS F +
Sbjct: 348 TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVS 407
Query: 287 SMTFHFDRADFKVEPTYMYFIFQ----NEGYFCVAISFSDRN--SVVGAWQQQDTRFVYD 340
+ V+P Y + Q N +C+ + +++G +D FVYD
Sbjct: 408 --LYFMGGVAMTVKPEN-YLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 464
Query: 341 L 341
L
Sbjct: 465 L 465
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 94/389 (24%), Positives = 153/389 (39%), Gaps = 51/389 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y ++ GTP K ++ DTGS ++W C+ C C +S ++P ASS+ +
Sbjct: 86 LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTV 145
Query: 61 PCDDLICR-----RPPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVPG- 113
CD C + P N C + + Y G+S +G T+ F + PG
Sbjct: 146 SCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGN 205
Query: 114 --VIFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYREM 167
+ FGC + N A GILGF + S+L QL + A+ +F++CL +
Sbjct: 206 ATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCL----DTI 261
Query: 168 EATSILRFGKDA------------NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGF 215
+ I G + + + M + HY ++L+ I V +
Sbjct: 262 KGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQL 321
Query: 216 APGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYC 275
F G +ID+G T++ ++ VM D F+ HN + C
Sbjct: 322 PAHVFETGEK--KGTIIDSGTTLTYLPELVFKQVM---DVVFSKHRDIAFHNLQD--FLC 374
Query: 276 YRYDSRF-RAYASMTFHF-DRADFKVEPTYMYFIFQNEGYFCV-----AISFSDRNSVV- 327
++Y + ++TFHF D V P + YF +CV A+ D +V
Sbjct: 375 FQYSGSVDDGFPTITFHFEDDLALHVYP-HEYFFPNGNDIYCVGFQNGALQSKDGKDIVL 433
Query: 328 -GAWQQQDTRFVYDLNTGTIQFVPENCAN 355
G + VYDL I + NC++
Sbjct: 434 MGDLVLSNKLVVYDLENQVIGWTDYNCSS 462
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 85/369 (23%), Positives = 146/369 (39%), Gaps = 40/369 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P+ SSTY+ + C+
Sbjct: 10 NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN 69
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG-VIFGCSND 121
D C E QCV+ YA +++SG++ + +F N P +FGC N
Sbjct: 70 IDCNCDD-----EKQQCVYERQYAEMSTSSGVLGEDIISF--GNLSALAPQRAVFGCENM 122
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ + GI+G S++ L K FS C A + +
Sbjct: 123 ETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPPS 182
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
N+ + RS +Y + L++I VA + P F +G G ++D+G
Sbjct: 183 NMVFSQSDPV-----RSPYYNIDLKEIHVAGKPLPLNPTVF----DGKHGTILDSGTTYA 233
Query: 240 FIQRGPY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCY--------RYDSRFRAYAS 287
++ + + +M+ G +N + C+ + S F A
Sbjct: 234 YLPEAAFVSFKDAIMKELHSLKPIRGPDPNYN-----DICFSGAGSDISQLSSSFPA-VE 287
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGT 345
M F + Y++ + G +C+ I + D +++G ++T +YD
Sbjct: 288 MVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSK 347
Query: 346 IQFVPENCA 354
I F NC+
Sbjct: 348 IGFWKTNCS 356
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 65/121 (53%), Gaps = 5/121 (4%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + + + G PS + + DTGS L WTQC+PC +C+ Q PI++P+ SSTY + C
Sbjct: 18 NGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 64 DLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C P C + C + Y +S G++S ETFT ++ +P + FGC DN
Sbjct: 78 SSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS----IPHIAFGCGQDN 133
Query: 123 R 123
Sbjct: 134 E 134
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 151/378 (39%), Gaps = 39/378 (10%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-----IFNPNASST 56
+ Y V G+P + DTGS ++W C C NC + S F+ S T
Sbjct: 100 KMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLT 159
Query: 57 YKRIPCDDLIC----RRPPFRC-ENGQCVHRINYAGGASASGLVSTETFTFH--LKNKLV 109
+ C D IC + +C EN QC + Y G+ SG T+TF F L LV
Sbjct: 160 AGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLV 219
Query: 110 C---VPGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVY 162
P ++FGCS + D + D + GI GF S++ QL S +FS+CL
Sbjct: 220 ANSSAP-IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-- 276
Query: 163 AYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL 222
+ + G+ I M V HY L+L I V + F
Sbjct: 277 -KGDGSGGGVFVLGE---ILVPGM-VYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVF-- 329
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
+ T G ++DTG T++ + Y++ + + + N E CY +
Sbjct: 330 EASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG----EQCYLVSTSI 385
Query: 283 R-AYASMTFHF-DRADFKVEP---TYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTR 336
+ S++ +F A + P + Y I+ +C+ + + +++G +D
Sbjct: 386 SDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKV 445
Query: 337 FVYDLNTGTIQFVPENCA 354
FVYDL I + +C+
Sbjct: 446 FVYDLARQRIGWASYDCS 463
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 148/374 (39%), Gaps = 50/374 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V G P K FL DTGS L W QC PC+ C P++ P T + C D
Sbjct: 66 YYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQP----TNDLVVCKD 121
Query: 65 LIC---RRPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
IC +RC++ QC + + YA G S+ G++ + F +L + + P + GC
Sbjct: 122 PICASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCGY 181
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
D + G+LG S++ QL S QGL + + + L FG D
Sbjct: 182 DQLPGIAYHPLDGVLGLGRGSSSIVAQLSS--QGLVRNVVGHCFSR-RGGGYLFFGDDIY 238
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI--DTGAIA 238
K + T M D HY GFA R +G ++ D+G+
Sbjct: 239 DSSKVIWT-PMSRDYLKHY------------TPGFAELILNGRSSGLKNLLVVFDSGSSY 285
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHFD--- 293
T+ Y+ ++ + + + A ED C+R F++ +F
Sbjct: 286 TYFNTQTYQTLLSFIKKDLHG---KPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLA 342
Query: 294 ---------RADFKVEPTYMYFIFQNEGYFCVAI----SFSDRN-SVVGAWQQQDTRFVY 339
++ F+++ Y I ++G C+ I +N +++G Q+ +Y
Sbjct: 343 LSFGSGWKTKSQFEIQ-QESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIY 401
Query: 340 DLNTGTIQFVPENC 353
D I + P NC
Sbjct: 402 DNEKQVIGWQPSNC 415
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 148/363 (40%), Gaps = 43/363 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + L DT + W C C C SA F+P +S++Y+ +PC +
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPL 171
Query: 67 CRRPP-FRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C + P C G C + YA +S +S ++ V FGC R
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYA-DSSLQAALSQDSLAVAGN----AVKAYTFGCL--QR 224
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
G+LG P S L Q K + FSYCL +++ + + LR G++ QR
Sbjct: 225 ATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCL-PSFKSLNFSGTLRLGRNGQPQR 283
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRI---GFAPGTFALRRNGTGGCMIDTGAIATF 240
+ RSS YY+++ I V + F P T A G ++D+G + T
Sbjct: 284 IKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGA-------GTVLDSGTMFTR 336
Query: 241 IQRGPY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
+ Y + V R +S G ++ C +++ A+ +T FD
Sbjct: 337 LVAPAYVAVRDEVRRRVGAPVSSLG---------GFDTC--FNTTAVAWPPVTLLFDGMQ 385
Query: 297 FKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVP 350
+ P I G C+A++ + +V+ + QQQ+ R ++D+ G + F
Sbjct: 386 VTL-PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFAR 444
Query: 351 ENC 353
E C
Sbjct: 445 ERC 447
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 146/370 (39%), Gaps = 41/370 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + L DT + W+ C PC C S F P +SS+Y +PC
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDW 136
Query: 67 CRRPPFRCENGQCVHRINYAGGASA---SGLVSTETFTFHLKNKLV-----CVPGVIFGC 118
C P F E C + + A S + +F L + + + G FGC
Sbjct: 137 C--PLF--EGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAFGC 192
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ + G+LG P SLL Q S G+FSYCL +YR + LR G
Sbjct: 193 VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLP-SYRSYYFSGSLRLG-- 249
Query: 179 ANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A Q ++++ + + R S YY+++ +SV + G+FA G +ID+G
Sbjct: 250 AAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGT 309
Query: 237 IATFIQRGPYEVVMRHFDEH------FTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMT 289
+ T Y + F +TS G ++ C+ D A +T
Sbjct: 310 VITRWTAPVYAALREEFRRQVAAPSGYTSLGA---------FDTCFNTDEVAAGGAPPVT 360
Query: 290 FHFDRADFKVEPTYMYFIFQNEGYF-CVAISFSDR-----NSVVGAWQQQDTRFVYDLNT 343
H D P I + C+A++ + + +VV QQQ+ R V D+
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420
Query: 344 GTIQFVPENC 353
+ F E C
Sbjct: 421 SRVGFAREPC 430
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 157/378 (41%), Gaps = 45/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTP+K ++ DTGS ++W C+ C C +S +++P S + + +
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148
Query: 61 PCDDLICRRP-----PFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPG- 113
CD C P C + I+Y G+S +G T+ ++ + P
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPAN 208
Query: 114 --VIFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREM 167
V FGC N+A GILGF S S+L QL + + +F++CL +
Sbjct: 209 ASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL----DTV 264
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G N+ + +KT + D HY + L+ I V +G F +
Sbjct: 265 NGGGIFAIG---NVVQPKVKTTPLVPDM-PHYNVILKGIDVGGTALGLPTNIF--DSGNS 318
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRH-FDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +ID+G ++ G Y+ + FD+H Q + + S C++Y +
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKH-QDISVQTLQDFS-----CFQYSGSVDDGF 372
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQN-EGYFCV-----AISFSDRNSVV--GAWQQQDTRF 337
+TFHF+ D + + ++FQN + +C+ + D +V G +
Sbjct: 373 PEVTFHFE-GDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLV 431
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 432 LYDLENQAIGWADYNCSS 449
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 155/363 (42%), Gaps = 28/363 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P +SSTY+ + C
Sbjct: 81 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 140
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QCV+ YA +++SG++ + +F +++L V FGC N
Sbjct: 141 IDCNC-----DSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAV-FGCENVE 194
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
+ + GI+G S++ QL K+ FS C Y ++ +++ G
Sbjct: 195 TGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC--YGGMDVGGGAMVLGGISP- 251
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
DM RS +Y + L++I VA R+ F +G G ++D+G +
Sbjct: 252 --PSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVF----DGKHGTVLDSGTTYAY 305
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR-----YDSRFRAYASMTFHFDRA 295
+ + + S + + + + + C+ +++ + F+
Sbjct: 306 LPEAAFLAFKDAIVKELQSLKKISGPDPNYN-DICFSGAGIDVSQLSKSFPVVDMVFENG 364
Query: 296 D-FKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPE 351
+ + P YM+ + G +C+ + + +D+ +++G ++T VYD I F
Sbjct: 365 QKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKT 424
Query: 352 NCA 354
NCA
Sbjct: 425 NCA 427
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 157/378 (41%), Gaps = 45/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTP+K ++ DTGS ++W C+ C C +S +++P S + + +
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148
Query: 61 PCDDLICRRP-----PFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPG- 113
CD C P C + I+Y G+S +G T+ ++ + P
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPAN 208
Query: 114 --VIFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREM 167
V FGC N+A GILGF S S+L QL + + +F++CL +
Sbjct: 209 ASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL----DTV 264
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G N+ + +KT + D HY + L+ I V +G F +
Sbjct: 265 NGGGIFAIG---NVVQPKVKTTPLVSDM-PHYNVILKGIDVGGTALGLPTNIF--DSGNS 318
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRH-FDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +ID+G ++ G Y+ + FD+H Q + + S C++Y +
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKH-QDISVQTLQDFS-----CFQYSGSVDDGF 372
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQN-EGYFCV-----AISFSDRNSVV--GAWQQQDTRF 337
+TFHF+ D + + ++FQN + +C+ + D +V G +
Sbjct: 373 PEVTFHFE-GDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLV 431
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 432 LYDLENQAIGWADYNCSS 449
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 146/370 (39%), Gaps = 41/370 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + L DT + W+ C PC C S F P +SS+Y +PC
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDW 136
Query: 67 CRRPPFRCENGQCVHRINYAGGASA---SGLVSTETFTFHLKNKLV-----CVPGVIFGC 118
C P F E C + + A S + +F L + + + G FGC
Sbjct: 137 C--PLF--EGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAFGC 192
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ + G+LG P SLL Q S G+FSYCL +YR + LR G
Sbjct: 193 VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLP-SYRSYYFSGSLRLG-- 249
Query: 179 ANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A Q ++++ + + R S YY+++ +SV + G+FA G +ID+G
Sbjct: 250 AAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGT 309
Query: 237 IATFIQRGPYEVVMRHFDEH------FTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMT 289
+ T Y + F +TS G ++ C+ D A +T
Sbjct: 310 VITRWTAPVYAALREEFRRQVAAPSGYTSLGA---------FDTCFNTDEVAAGGAPPVT 360
Query: 290 FHFDRADFKVEPTYMYFIFQNEGYF-CVAISFSDR-----NSVVGAWQQQDTRFVYDLNT 343
H D P I + C+A++ + + +VV QQQ+ R V D+
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420
Query: 344 GTIQFVPENC 353
+ F E C
Sbjct: 421 SRVGFAREPC 430
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 124/288 (43%), Gaps = 20/288 (6%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
GT + ++ ++ D+GS + W QC PC C Q P+F+P S+TY +PC C +
Sbjct: 71 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130
Query: 71 PFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSF 127
P+R N QC INY G++A+G S + T + + G FGC++ +R +F
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD---VIRGFRFGCAHADRGSAF 187
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMK 187
D ++AG L SL+ Q + +FSYCL + + + A + +
Sbjct: 188 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS 247
Query: 188 TIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
T + + +Y + L+ I VA + P F + +ID+ I + + Y
Sbjct: 248 TPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVF------SASSVIDSSTIISRLPPTAY 301
Query: 247 EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD 293
+ + F T + R + CY + R S+ FD
Sbjct: 302 QALRAAFRSAMTMY---RAAPPVSILDTCYDFTGVRSITLPSIALVFD 346
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 159/384 (41%), Gaps = 39/384 (10%)
Query: 2 EKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP 61
E +++ + G+ K+ + DTGS + QC ++S P+F+P AS +Y+++P
Sbjct: 95 EDYALFSMQLGIGSLQKNLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVP 148
Query: 62 CDDLICRRPPFRCENGQ----------CVHRINYAGGASASGLVSTETFTFHLKN---KL 108
C +C + NG C + ++Y +++G S + + N +
Sbjct: 149 CISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQA 208
Query: 109 VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREM 167
V V FGC++ + F D GI+GF+ SL QLK G FSYC +
Sbjct: 209 VQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQP 268
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFAL 222
AT ++ G D+ + + + + + RS YY+ L ISV + F L
Sbjct: 269 RATGVIFLG-DSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKL 327
Query: 223 R-RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD-- 279
G GG ++D+G T + Y F S R+++ A+ ++ CY
Sbjct: 328 DPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKV-GAAAGFDDCYNISAG 386
Query: 280 SRFRAYASMTFHFD---RADFKVEPTYMYF-IFQNEGYFCVAISFSDRN-----SVVGAW 330
S + R + + E ++ NE C+AI S ++ +V+G +
Sbjct: 387 SSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNY 446
Query: 331 QQQDTRFVYDLNTGTIQFVPENCA 354
QQ + YD + F +C+
Sbjct: 447 QQSNYLVEYDNERSRVGFERADCS 470
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 65/121 (53%), Gaps = 5/121 (4%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N + + + G PS + + DTGS L WTQC+PC +C+ Q PI++P+ SSTY + C
Sbjct: 18 NGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 64 DLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+C P C + C + Y +S G++S ETFT ++ +P + FGC DN
Sbjct: 78 SSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS----IPHIAFGCGQDN 133
Query: 123 R 123
Sbjct: 134 E 134
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 15/159 (9%)
Query: 11 VLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP 70
V G SK+ ++ DT S L W QC PC++C+NQ PIF P+ SS+Y+ + C+ C+
Sbjct: 67 VTMGLGSKNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSL 126
Query: 71 PFRCEN---------GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
F N C + +NY G+ +G + E +F V V +FGC +
Sbjct: 127 QFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFG----GVSVSDFVFGCGRN 182
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL 160
N+ G ++G++G S SL+ Q +T G+FSYCL
Sbjct: 183 NKGLF--GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCL 219
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 140/368 (38%), Gaps = 27/368 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP + L DT + W C C C +AP FNP +S+T++ +PC
Sbjct: 94 YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPCGAPP 152
Query: 67 CRRPP------FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C + P C ++Y G +S +S + ++ G FGC
Sbjct: 153 CSQAPNPSCTSLAKSKNSCGFSLSY-GDSSLDATLSQDNLAVTANGGVI--KGYTFGCLT 209
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEA-TSILRFGKDA 179
+ + LG F + Q K +G FSYCL YR + L G+
Sbjct: 210 KSNGSAAPAQGLLGLGRGPLGF--VAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKG 267
Query: 180 NIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
+ MKT + R S YY+++ + + + P A G ++D+G +
Sbjct: 268 QPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTM 327
Query: 238 ATFIQRGPYEVVMRHFDEHFT-SFGRQRMHNASEDWEYCYRYDSRFR----AYASMTFHF 292
+ + Y V S R+ AS +D+ + A+ ++T F
Sbjct: 328 FARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVSTVAWPAVTLVF 387
Query: 293 DRADFKVEPTYMYFIFQNEGYF-CVAISFSDRN------SVVGAWQQQDTRFVYDLNTGT 345
P I G C+A++ S + +V+G+ QQQ+ R ++D+
Sbjct: 388 GGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNAR 447
Query: 346 IQFVPENC 353
+ F E C
Sbjct: 448 VGFARERC 455
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 81/352 (23%), Positives = 148/352 (42%), Gaps = 44/352 (12%)
Query: 16 PSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPF 72
P + +L DT S + W QC PC C+ Q+ +++P+ S + + C CR+ P+
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPY 237
Query: 73 R--CEN-----GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + GQC +R+ Y G++ SG + + + ++ VP FGCS+ R
Sbjct: 238 ANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQ---VPKFEFGCSHAARGS 294
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL--VYAYREMEATSILRFGKDANIQR 183
AGI+ SL+ Q + +FSYC +++ + R
Sbjct: 295 FSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRSSSRYAVT 354
Query: 184 KDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQR 243
+KT + Y + L+ I+VA R+ P FA G +D+ + T +
Sbjct: 355 PMLKTPML-------YQVRLEAIAVAGQRLDVPPTVFA------AGAALDSRTVITRLPP 401
Query: 244 GPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY---DSRFRAYASMTFHFDRADFKVE 300
Y+ + F + + + R A+ + CY + S S+ F A +++
Sbjct: 402 TAYQALRSAFRDKMSMY---RPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLD 458
Query: 301 PTYMYFIFQNEGYFCVAISFS---DRNS-VVGAWQQQDTRFVYDLNTGTIQF 348
P+ + F C+A + + DR + ++G Q Q +Y++ G++ F
Sbjct: 459 PSGVLF------GSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGF 504
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 159/379 (41%), Gaps = 44/379 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + G+PSK ++ DTGS ++W C+ C C +S +++P S T + +
Sbjct: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127
Query: 61 PCDDLIC------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN----KLVC 110
C+ C R + EN C + I+Y G++ +G + TF+ N
Sbjct: 128 SCEHNFCSSTYEGRILGCKAEN-PCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQ 186
Query: 111 VPGVIFGCSNDNRDF---SFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYR 165
+IFGC S + + GI+GF + S+L QL ++ + +FS+CL
Sbjct: 187 NSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL----D 242
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
I G+ + +KT + V +HY + L++I V + TF N
Sbjct: 243 TNVGGGIFSIGE---VVEPKVKTTPL-VPNMAHYNVILKNIEVDGDILQLPSDTFD-SEN 297
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
G G +ID+G ++ R Y+ +M R +++ E + C++Y
Sbjct: 298 GK-GTVIDSGTTLAYLPRIVYDQLMSKV---LAKQPRLKVYLVEEQYS-CFQYTGNVDSG 352
Query: 285 YASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRN-------SVVGAWQQQDTR 336
+ + HF D V P F ++ + Y+C+ S +++G + +
Sbjct: 353 FPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKL 412
Query: 337 FVYDLNTGTIQFVPENCAN 355
VYDL TI + NC++
Sbjct: 413 VVYDLENMTIGWTDYNCSS 431
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 144/354 (40%), Gaps = 25/354 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP ++ L DT + W +PC C ++ +F P S+T+K + C
Sbjct: 78 YIVRAKIGTPPQTLLLAMDTSNDAAW---IPCTACDGCASTLFAPEKSTTFKNVSCAAPE 134
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C C + Y + A+ LV +T T VP FGC +
Sbjct: 135 CKQVPNPGCGVSSCNFNLTYGSSSIAANLVQ-DTITLATDP----VPSYTFGCVSKTTGT 189
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
S LG SLL Q ++ Q FSYCL +++ + + LR G A +R
Sbjct: 190 SAPPQGLLGLGRGPL--SLLSQTQNLYQSTFSYCL-PSFKSLNFSGSLRLGPVAQPKRIK 246
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ RSS YY++L+ I V + P A G + D+G + T + P
Sbjct: 247 YTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLV-AP 305
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMY 305
V +R DE G + + ++ CY ++TF F + + P
Sbjct: 306 VYVAVR--DEFRRRVGPKLTVTSLGGFDTCYNVPI---VVPTITFIFTGMNVTL-PQDNI 359
Query: 306 FIFQNEG-YFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
I G C+A++ + N +V+ QQQ+ R +YD+ + E C
Sbjct: 360 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELC 413
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 124/288 (43%), Gaps = 20/288 (6%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
GT + ++ ++ D+GS + W QC PC C Q P+F+P S+TY +PC C +
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 71 PFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSF 127
P+R N QC INY G++A+G S + T + + G FGC++ +R +F
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD---VIRGFRFGCAHADRGSAF 278
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMK 187
D ++AG L SL+ Q + +FSYCL + + + A + +
Sbjct: 279 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS 338
Query: 188 TIRMFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
T + + +Y + L+ I VA + P F + +ID+ I + + Y
Sbjct: 339 TPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVF------SASSVIDSSTIISRLPPTAY 392
Query: 247 EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFD 293
+ + F T + R + CY + R S+ FD
Sbjct: 393 QALRAAFRSAMTMY---RAAPPVSILDTCYDFTGVRSITLPSIALVFD 437
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/372 (25%), Positives = 144/372 (38%), Gaps = 40/372 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDD 64
Y V V GTP++ ++FDTGS L W QC PC + C+ Q P+F P+ SST+ + C
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGA 213
Query: 65 LICR-RPPFRCENG--QCVHRINYAGGASASGLVSTETFTFHL----------KNKLVCV 111
CR R G +C + + Y + G + +T T NKL
Sbjct: 214 RECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKL--- 270
Query: 112 PGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
PG +FGC +N G G+ G SL Q FSYCL A
Sbjct: 271 PGFVFGCGENNTGLF--GQADGLFGLGRGKVSLSSQAAGKFGEGFSYCL--PSSSSSAPG 326
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRS---SHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
L G + ++R+ S YY+ L I VA I + AL
Sbjct: 327 YLSLGTP--VPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP----- 379
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM 288
++D+G + T + Y + F +G +R S + CY + + A S+
Sbjct: 380 -LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSI-LDTCYDFTAHANATVSI 437
Query: 289 T----FHFDRADFKVE-PTYMYFIFQNEGYFCVAISFSDRNS-VVGAWQQQDTRFVYDLN 342
A V+ +Y + A + R++ ++G QQ+ VYD+
Sbjct: 438 PAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVA 497
Query: 343 TGTIQFVPENCA 354
I F + C+
Sbjct: 498 RQKIGFAAKGCS 509
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 148/360 (41%), Gaps = 45/360 (12%)
Query: 24 FDTGSYLIWTQCLPCVNCFNQSAPI------FNPNASSTYKRIPCDDLICRRPPFRCENG 77
DTGS ++W C C NC QS+ + F+ SST IPC DLIC +
Sbjct: 85 IDTGSDILWVNCNTCSNC-PQSSQLGIELNFFDTVGSSTAALIPCSDLICTS-GVQGAAA 142
Query: 78 QCVHRIN-------YAGGASASGLVSTETFTFHL----KNKLVCVPGVIFGCS-NDNRDF 125
+C R+N Y G+ SG ++ F+L + ++FGCS + + D
Sbjct: 143 ECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGDL 202
Query: 126 S-FDGNIAGILGFSVSPFSLLGQLKSTAQGL----FSYCLVYAYREMEATSILRFGKDAN 180
+ D + GI GF P S++ QL S QG+ FS+CL + IL G+
Sbjct: 203 TKTDKAVDGIFGFGPGPLSVVSQLSS--QGITPKVFSHCL---KGDGNGGGILVLGE--- 254
Query: 181 IQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
I + V HY L+LQ I+V + P F++ N GG ++D G +
Sbjct: 255 ILEPSI-VYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNN-RGGTIVDCGTTLAY 312
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYASMTFHFDRADFKV 299
+ + Y+ ++ + + RQ ++ CY + + ++ +F+ V
Sbjct: 313 LIQEAYDPLVTAINTAVSQSARQTNSKGNQ----CYLVSTSIGDIFPLVSLNFEGGASMV 368
Query: 300 EPTYMYFI----FQNEGYFCVAI-SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
Y + +CV + S++G +D VYD+ I + +C+
Sbjct: 369 LKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCS 428
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 146/382 (38%), Gaps = 53/382 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V +L G PSK FL D+GS L W QC PC++C P++ S +P D
Sbjct: 78 LYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSL---VPSKD 134
Query: 65 LICRRPP--------FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C + + +C + + YA + G + ++ L NK V +F
Sbjct: 135 PLCAAVQAGSGHYHNHKEASQRCDYDVAYADHGYSEGFLVRDSVRALLTNKTVLTANSVF 194
Query: 117 GCSNDNRDF--SFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSI 172
GC + R+ D GILG SL Q + + +C+ A R+
Sbjct: 195 GCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRD---GGY 251
Query: 173 LRFGKDANIQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNG----T 227
+ FG D + M + M S HYY + A G P L ++G
Sbjct: 252 MFFGDDL-VSTSAMTWVPMLGRPSIKHYY-----VGAAQMNFGNKP----LDKDGDGKKL 301
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYRYDSRFRAYA 286
GG + D+G+ T+ Y + E+ + G+Q ++S+ + C+R FR+ A
Sbjct: 302 GGIIFDSGSTYTYFTNQAYGAFLSVVKENLS--GKQLEQDSSDSFLSLCWRRKEGFRSVA 359
Query: 287 SMTFHFDRADFKVEPTYM---------YFIFQNEGYFCV------AISFSDRNSVVGAWQ 331
+F K T Y + +G C+ AI D N V+G
Sbjct: 360 EAAAYFKPLTLKFRSTKTKQMEIFPEGYLVVNKKGNVCLGILNGTAIGIVDTN-VLGDIS 418
Query: 332 QQDTRFVYDLNTGTIQFVPENC 353
Q VYD I + +C
Sbjct: 419 FQGQLVVYDNEKNQIGWARSDC 440
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/392 (24%), Positives = 153/392 (39%), Gaps = 66/392 (16%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + GTP +S ++ DTGS L W C N + +FNP+ SS+Y IPC
Sbjct: 67 NVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINS----VFNPHLSSSYTPIPCM 122
Query: 64 DLICRRP------PFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
IC+ P C+ N C ++YA S G ++++TF + PG+IF
Sbjct: 123 SPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQ----PGIIF 178
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
G + + + + D G++G + S + Q+ FSYC+ +A+ +L
Sbjct: 179 GSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPK---FSYCI----SGKDASGVLL 231
Query: 175 FGKDANIQ---------RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
FG DA + M T + DR + Y + L I V + FA
Sbjct: 232 FG-DATFKWLGPLKYTPLVKMNTPLPYFDRVA-YTVRLMGIRVGSKPLQVPKEIFAPDHT 289
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHF------------DEHFTSFGRQRMHNASEDWE 273
G G M+D+G TF+ Y + F D +F + + D
Sbjct: 290 GAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNF-------VFEGAMDLC 342
Query: 274 YCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEG--------YFCVAISFSD--- 322
+ R A ++T F+ A+ V + + +G +C+ SD
Sbjct: 343 FRVRRGGVVPAVPAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLG 402
Query: 323 -RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+G QQ+ +DL + F C
Sbjct: 403 IEAYVIGHHHQQNVWMEFDLVNSRVGFADTKC 434
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/357 (21%), Positives = 148/357 (41%), Gaps = 30/357 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDD 64
Y + V GTP+ ++ + DTGS + W QC PC +C +Q +F+P S+TY C
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSS 189
Query: 65 LICRRPPFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C + C N C + + Y ++ +G ++T + V FGCS
Sbjct: 190 AQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDA---VKNFQFGCS-- 244
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+R F G + G++G SL+ Q +T FSYCL + L
Sbjct: 245 HRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLTLGAAAGGTS 304
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ +T + + + Y + LQ I+VA ++ F +G ++D+G + T +
Sbjct: 305 SSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVF------SGASVVDSGTVITQL 358
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDR-ADFKV 299
Y+ + F + ++ + C+ + + +T F R A +
Sbjct: 359 PPTAYQALRTAFKKEMKAYPSAAPVGI---LDTCFDFSGIKTVRVPVVTLTFSRGAVMDL 415
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN---SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + +++ C+A + + ++ ++G QQ+ ++D+ T+ F P C
Sbjct: 416 DVSGIFYA------GCLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 160/384 (41%), Gaps = 44/384 (11%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
++ Y ++ G + ++LL DTGS L+WTQC C +C P + + S T++ + C
Sbjct: 78 EDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHCHIGDVPPYGRSQSRTFQEVSC 137
Query: 63 DD------------LICRRPP---FRCENGQCVHRINY---AGGASASGLVSTETFTFHL 104
D +PP C NG+C+ + Y G + G +S +TF F
Sbjct: 138 GDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTFHFID 197
Query: 105 KNKLVCVPG--VIFGCSN-DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV 161
+ ++FGC++ +N + GILG + S L Q T FSYC+
Sbjct: 198 DRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTGITK---FSYCVP 254
Query: 162 -----YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFA 216
Y+YR S LRFG A I K + V R YYL L I+ + +
Sbjct: 255 PRMPGYSYRR---HSWLRFGSHAQISGKKVP----LVMRWGKYYLPLTAITYTYNELMSP 307
Query: 217 PGTFALR-RNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYC 275
A + + M+DTG + ++ +++ + S M A+ ++C
Sbjct: 308 VPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKS--ENIMEGATRWPKHC 365
Query: 276 YRYDSRFRAYASMTFHFDRA-DFKVEPTYMYFIFQNEG--YFCVAISFSDRNS--VVGAW 330
Y+ ++T FD D ++ + ++ + C+A++ D +S ++G +
Sbjct: 366 YKRTMDEVKDITVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMF 425
Query: 331 QQQDTRFVYDLNTGTIQFVPENCA 354
Q + YDL + I P CA
Sbjct: 426 AQTNINVGYDLLSREIAMDPIRCA 449
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/362 (21%), Positives = 148/362 (40%), Gaps = 24/362 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C N P F P+ SSTY + C+
Sbjct: 85 NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QC + YA +S+SG++ + +F +++L +FGC N
Sbjct: 145 VDCTC-----DSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQ-RAVFGCENSE 198
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + GI+G S++ QL S+ + Y ++ +++ A
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAP-- 256
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
M RS +Y + L+++ VA + P F +G G ++D+G ++
Sbjct: 257 -PGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF----DGKHGTVLDSGTTYAYLP 311
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH------FDRAD 296
+ + R +++ + C+ R + S F +
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDSNYK-DICFAGAGRNVSQLSEVFPKVDMVFGNGQK 370
Query: 297 FKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y++ + EG +C+ + + D +++G ++T YD + I F NC
Sbjct: 371 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
Query: 354 AN 355
+
Sbjct: 431 SE 432
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 148/383 (38%), Gaps = 57/383 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G+P ++ ++ DTGS L W C N +FNP +SSTY +PC
Sbjct: 58 NVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCS 113
Query: 64 DLICRRP------PFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
ICR P C+ C I+YA S G ++ +TF V PG +
Sbjct: 114 SPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVI----GSVTRPGTL 169
Query: 116 FGC--SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
FGC S + D D G++G + S + QL + FSYC+ + +++ IL
Sbjct: 170 FGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSK---FSYCISGS----DSSGIL 222
Query: 174 RFGKDAN------IQRKDM---KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
G DA+ IQ + T + DR + Y + L+ I V + F
Sbjct: 223 LLG-DASYSWLGPIQYTPLVLQTTPLPYFDRVA-YTVQLEGIRVGSKILSLPKSVFVPDH 280
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGR-----QRMHNASEDWEYCYRYD 279
G G M+D+G TF+ Y + F S R + + D CYR
Sbjct: 281 TGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMD--LCYRVG 338
Query: 280 S----RFRAYASMTFHFDRADFKVEPTYMYFIFQNEG------YFCVAISFSD----RNS 325
S F ++ F A+ V + + G +C SD
Sbjct: 339 SSTRPNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAF 398
Query: 326 VVGAWQQQDTRFVYDLNTGTIQF 348
V+G QQ+ +DL + F
Sbjct: 399 VIGHHHQQNVWMEFDLAKSRVGF 421
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/406 (24%), Positives = 150/406 (36%), Gaps = 77/406 (18%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYK 58
N TV V GTP ++ ++ DTGS L W C N S P FN + SS+Y
Sbjct: 52 NVSLTVPVAVGTPPQNVTMVLDTGSELSWLLC-------NGSYAPPLTPAFNASGSSSYG 104
Query: 59 RIPCDDLICR-------RPPFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKL 108
+PC C PPF C+ + C ++YA +SA G+++T+TF
Sbjct: 105 AVPCPSTACEWRGRDLPVPPF-CDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPP 163
Query: 109 VCVPGVIFGC---------SNDNRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSY 158
V V G FGC +N N + A G+LG + S + Q T F+Y
Sbjct: 164 VAV-GAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQ---TGTRRFAY 219
Query: 159 CLVYAYREMEATSILRFGKDANIQRK-------DMKTIRMFVDRSSHYYLSLQDISVADH 211
C+ E +L G D + ++ + DR + Y + L+ I V
Sbjct: 220 CIAPG----EGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVA-YSVQLEGIRVGCA 274
Query: 212 RIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED 271
+ G G M+D+G TF+ Y FTS R + E
Sbjct: 275 LLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAA----LKAEFTSQARLLLAPLGEP 330
Query: 272 -------WEYCYRYDSRFRAYAS-----MTFHFDRADFKVEPTYMYFIFQN--------E 311
++ C+R A AS + A+ V + ++ E
Sbjct: 331 GFVFQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAE 390
Query: 312 GYFCVAISFSDRNS----VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+C+ SD V+G QQ+ YDL G + F P C
Sbjct: 391 AVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 156/373 (41%), Gaps = 45/373 (12%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRIPCDDLIC 67
G K ++ DTGS +W C+ C C +S +++PN S T K +PCDD C
Sbjct: 80 IGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFC 139
Query: 68 RRP----PFRCENG-QCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP---GVIFGC 118
C G C + I Y G++ SG + TF + L VP VIFGC
Sbjct: 140 TSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGC 199
Query: 119 SNDNR---DFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSIL 173
+ + D ++ GI+GF + S+L QL + + +FS+CL + I
Sbjct: 200 GSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCL----DSISGGGIF 255
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
G+ + + +KT + +HY + L+DI VA I P +G G +ID
Sbjct: 256 AIGE---VVQPKVKTTPLL-QGMAHYNVVLKDIEVAGDPIQL-PSDILDSSSGR-GTIID 309
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY---DSRFRAYASMTF 290
+G ++ P + + ++ +++ ED C+ Y +S + ++ F
Sbjct: 310 SGTTLAYL---PVSIYDQLLEKILAQRSGMKLYLV-EDQFTCFHYSDEESVDDLFPTVKF 365
Query: 291 HFDRA-DFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRFVYDLN 342
F+ P F+F+ E +CV S + ++G + VYDL+
Sbjct: 366 TFEEGLTLTTYPRDYLFLFK-EDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVYDLD 424
Query: 343 TGTIQFVPENCAN 355
I + NC++
Sbjct: 425 NMAIGWADYNCSS 437
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 148/370 (40%), Gaps = 52/370 (14%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y V G P+K F+ DTGS ++W C PC C N FNP++SST RI
Sbjct: 4 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 63
Query: 61 PCDD-----------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH--LKNK 107
C D IC+ ++ C + Y G+ SG ++T F + N+
Sbjct: 64 TCSDDRCTAGFQTGEAICQTS--NSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 121
Query: 108 LVC--VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLV 161
++FGCSN D + D + GI GF S++ QL S + +FS+CL
Sbjct: 122 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL- 180
Query: 162 YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
IL G+ I + V HY L+L+ I+V ++ F
Sbjct: 181 --KGSDNGGGILVLGE---IVEPGL-VYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFT 234
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
T G ++D+G ++ G Y+ + + R + S+ + DS
Sbjct: 235 TSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 292
Query: 282 FRAYASMTFHF-DRADFKVEPTYMYFIFQ----NEGYFCVAISFSDRN-----SVVGAWQ 331
F ++T +F V+P Y + Q N +C+ RN +++G
Sbjct: 293 F---PTVTLYFMGGVAMSVKPEN-YLLQQASVDNSVLWCIGW---QRNQGQEITILGDLV 345
Query: 332 QQDTRFVYDL 341
+D FVYDL
Sbjct: 346 LKDKIFVYDL 355
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 146/378 (38%), Gaps = 52/378 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
N FY V + G P K FL DTGS L W QC PC C P++ P + +PC
Sbjct: 54 NGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQP----SNDLVPC 109
Query: 63 DDLIC----RRPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
D +C RCEN QC + + YA G S+ G++ + F +L N P + G
Sbjct: 110 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 169
Query: 118 CSNDNRDFSFDGN-IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
C D S + + GILG S++ QL + QG+ + + + + L FG
Sbjct: 170 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHN--QGIVRNVVGHCFNS-KGGGYLFFG 226
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI--DT 234
D + M D HY GF F R G + D+
Sbjct: 227 -DGIYDPYRLVWTPMSRDYPKHY------------SPGFGELIFNGRSTGLRNLFVVFDS 273
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF 292
G+ T+ Y+V+ + + + A +D C+R ++ + +F
Sbjct: 274 GSSYTYFNAQAYQVLTSLLNRELAG---KPLREAMDDDTLPLCWRGRKPIKSLRDVRKYF 330
Query: 293 ------------DRADFKVEPTYMYFIFQNEGYFCVAISFS-----DRNSVVGAWQQQDT 335
+A F++ PT Y I + G C+ I + ++++G QD
Sbjct: 331 KPLALSFSSGGRSKAVFEI-PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389
Query: 336 RFVYDLNTGTIQFVPENC 353
VY+ I + NC
Sbjct: 390 MVVYNNEKQAIGWATANC 407
>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
Length = 392
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 88/208 (42%), Gaps = 22/208 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + GTP + + D L+WTQC C CF Q P+F+P AS+TY+ PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110
Query: 67 CRRPPF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C P C C ++ + G G V T+TF + FGC +
Sbjct: 111 CESIPSDSRNCSGNVCAYQASTNAG-DTGGKVGTDTFAVGTAKA-----SLAFGCVVAS- 163
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
D G +GI+G +P+SL+ Q T FSYCL A + S L G A +
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCL--APHDAGRNSALFLGSSAKLAG 218
Query: 184 KDMKTIRMFV-------DRSSHYYLSLQ 204
FV D S++Y + L+
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLE 246
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/406 (24%), Positives = 150/406 (36%), Gaps = 77/406 (18%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYK 58
N TV V GTP ++ ++ DTGS L W C N S P FN + SS+Y
Sbjct: 52 NVSLTVPVAVGTPPQNVTMVLDTGSELSWLLC-------NGSYAPPLTPAFNASGSSSYG 104
Query: 59 RIPCDDLICR-------RPPFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKL 108
+PC C PPF C+ + C ++YA +SA G+++T+TF
Sbjct: 105 AVPCPSTACEWRGRDLPVPPF-CDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPP 163
Query: 109 VCVPGVIFGC---------SNDNRDFSFDGNIA-GILGFSVSPFSLLGQLKSTAQGLFSY 158
V V G FGC +N N + A G+LG + S + Q T F+Y
Sbjct: 164 VAV-GAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQ---TGTRRFAY 219
Query: 159 CLVYAYREMEATSILRFGKDANIQRK-------DMKTIRMFVDRSSHYYLSLQDISVADH 211
C+ E +L G D + ++ + DR + Y + L+ I V
Sbjct: 220 CIAPG----EGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVA-YSVQLEGIRVGCA 274
Query: 212 RIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED 271
+ G G M+D+G TF+ Y FTS R + E
Sbjct: 275 LLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAA----LKAEFTSQARLLLAPLGEP 330
Query: 272 -------WEYCYRYDSRFRAYAS-----MTFHFDRADFKVEPTYMYFIFQN--------E 311
++ C+R A AS + A+ V + ++ E
Sbjct: 331 GFVFQGAFDACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAE 390
Query: 312 GYFCVAISFSDRNS----VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+C+ SD V+G QQ+ YDL G + F P C
Sbjct: 391 AVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/362 (21%), Positives = 147/362 (40%), Gaps = 24/362 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C N P F P+ SSTY + C+
Sbjct: 85 NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QC + YA +S+SG++ + +F +++L +FGC N
Sbjct: 145 VDCTC-----DSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQ-RAVFGCENSE 198
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + GI+G S++ QL S+ + Y ++ +++ A
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAP-- 256
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
M RS +Y + L+++ VA + P F +G G ++D+G ++
Sbjct: 257 -PGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF----DGKHGTVLDSGTTYAYLP 311
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFH------FDRAD 296
+ + R + + + C+ R + S F +
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDPNYK-DICFAGAGRNVSQLSEVFPKVDMVFGNGQK 370
Query: 297 FKVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ P Y++ + EG +C+ + + D +++G ++T YD + I F NC
Sbjct: 371 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
Query: 354 AN 355
+
Sbjct: 431 SE 432
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 153/380 (40%), Gaps = 46/380 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y V G P+K F+ DTGS ++W C PC C N FNP++SST RI
Sbjct: 90 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 149
Query: 61 PCDD-----------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH--LKNK 107
C D IC+ ++ C + Y G+ SG ++T F + N+
Sbjct: 150 TCSDDRCTAGFQTGEAICQTS--NSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 207
Query: 108 LVC--VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLV 161
++FGCSN D + D + GI GF S++ QL S + +FS+CL
Sbjct: 208 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL- 266
Query: 162 YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
IL G+ I + V HY L+L+ I+V ++ F
Sbjct: 267 --KGSDNGGGILVLGE---IVEPGL-VYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFT 320
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
+ T G ++D+G ++ G Y+ + + R + S+ + DS
Sbjct: 321 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 378
Query: 282 FRAYASMTFHF-DRADFKVEPTYMYFIFQ----NEGYFCVAISFSDRN--SVVGAWQQQD 334
F ++T +F V+P Y + Q N +C+ + +++G +D
Sbjct: 379 F---PTVTLYFMGGVAMSVKPEN-YLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKD 434
Query: 335 TRFVYDLNTGTIQFVPENCA 354
FVYDL + + +C+
Sbjct: 435 KIFVYDLANMRMGWADYDCS 454
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 147/374 (39%), Gaps = 50/374 (13%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI-FNPNASSTYKRIPCDDLIC 67
V + GTP +++ ++ DTGS L W QC ++ P F+P SS++ +PC+ +C
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQC----KVPPKTPPTAFDPLLSSSFSVLPCNHSLC 135
Query: 68 --RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
R P P C +N C + YA G A G + E FTF P +I GC+
Sbjct: 136 KPRVPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQ---TTPPLILGCAT 192
Query: 121 DNRDFSFDGNIAGILGFSVS--PFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
D+ D GILG ++ FS L ++ FSYC+ + ++ F
Sbjct: 193 DSSD------TQGILGMNLGRLSFSSLAKISK-----FSYCVPPRRSQSGSSPTGSFYLG 241
Query: 179 ANIQRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
N K + + R S Y L + I + ++ + F +G G
Sbjct: 242 PNPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQ 301
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA---SEDWEYCYRYDSRF--RA 284
+ID+G TF+ Y V E ++ + C+ D+ R
Sbjct: 302 TLIDSGTWFTFLVDEAYSKV----KEEIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRM 357
Query: 285 YASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSD----RNSVVGAWQQQDTRFVYD 340
+M F F+ V G C+ I SD ++++G + QQD +D
Sbjct: 358 IGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVASNIIGNFHQQDLWVEFD 417
Query: 341 LNTGTIQFVPENCA 354
L + F +C+
Sbjct: 418 LVGRRVGFGRTDCS 431
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 70/154 (45%), Gaps = 8/154 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V + GTP DT S LIWTQC PC C++Q P+FNP SSTY +PC
Sbjct: 89 YLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDT 148
Query: 67 CRRPPF-RC---ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C RC ++ C + Y+G A+ G ++ + GV FGCS +
Sbjct: 149 CDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGED----AFRGVAFGCSTSS 204
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLF 156
+ +G++G P SL+ QL G+
Sbjct: 205 TGGAPPPQASGVVGLGRGPLSLVSQLSVRRYGMI 238
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 153/380 (40%), Gaps = 46/380 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y V G P+K F+ DTGS ++W C PC C N FNP++SST RI
Sbjct: 88 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 147
Query: 61 PCDD-----------LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH--LKNK 107
C D IC+ ++ C + Y G+ SG ++T F + N+
Sbjct: 148 TCSDDRCTAGFQTGEAICQTS--NSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 205
Query: 108 LVC--VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLV 161
++FGCSN D + D + GI GF S++ QL S + +FS+CL
Sbjct: 206 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL- 264
Query: 162 YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
IL G+ I + V HY L+L+ I+V ++ F
Sbjct: 265 --KGSDNGGGILVLGE---IVEPGL-VYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFT 318
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR 281
+ T G ++D+G ++ G Y+ + + R + S+ + DS
Sbjct: 319 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 376
Query: 282 FRAYASMTFHF-DRADFKVEPTYMYFIFQ----NEGYFCVAISFSDRN--SVVGAWQQQD 334
F ++T +F V+P Y + Q N +C+ + +++G +D
Sbjct: 377 F---PTVTLYFMGGVAMSVKPEN-YLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKD 432
Query: 335 TRFVYDLNTGTIQFVPENCA 354
FVYDL + + +C+
Sbjct: 433 KIFVYDLANMRMGWADYDCS 452
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 110/259 (42%), Gaps = 17/259 (6%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDD 64
Y + V GTP+ ++ + DTGS + W QC PC +C +Q +F+P S+TY C
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGS 188
Query: 65 LICRR---PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C + C QC + + Y G++ +G ++T + + V FGCS
Sbjct: 189 AQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDA---VKSFQFGCS-- 243
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+R F G + G++G SL+ Q +T FSYCL L A+
Sbjct: 244 HRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASS 303
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
R + F + Y + LQ I+VA + F +G ++D+G + T +
Sbjct: 304 SRYSHTPMVRF-SVPTFYGVFLQGITVAGTMLNVPASVF------SGASVVDSGTVITQL 356
Query: 242 QRGPYEVVMRHFDEHFTSF 260
Y+ + F + ++
Sbjct: 357 PPTAYQALRTAFKKEMKAY 375
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 147/362 (40%), Gaps = 24/362 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C N P F P+ SS+Y + C+
Sbjct: 85 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKCN 144
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QC + YA +S+SG++ + +F +++L IFGC N
Sbjct: 145 VDCTC-----DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQ-HAIFGCENSE 198
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + GI+G S++ QL S+ L Y ++ +++ G +
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGM---LA 255
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM RS +Y + L++I VA + F N G ++D+G ++
Sbjct: 256 PPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIF----NSKHGTVLDSGTTYAYLP 311
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY----RYDSRFRAY---ASMTFHFDRA 295
+ S + R + S + C+ R S+ M F +
Sbjct: 312 EQAFVAFKEAVTSKVHSLKKIRGPDPSYK-DICFAGAGRNVSKLHEVFPDVDMVFGNGQK 370
Query: 296 DFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y++ + +G +C+ + + D +++G ++T YD + I F NC
Sbjct: 371 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430
Query: 354 AN 355
+
Sbjct: 431 SE 432
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 142/355 (40%), Gaps = 50/355 (14%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC---- 67
G P+K FL DTGS L W QC PC +C P++ P A+ + +PC + +C
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTAN---RLVPCANALCTALH 57
Query: 68 --RRPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN-- 122
+ +C + QC ++I Y AS+ G++ ++F+ +++ + PG+ FGC D
Sbjct: 58 SGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSNI-RPGLTFGCGYDQQV 116
Query: 123 -RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFGKDA 179
++ + I G+LG SL+ QLK + + +CL L FG D
Sbjct: 117 GKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL-----STNGGGFLFFGDDV 171
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHR-IGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + + M S +YY D R +G P + D+G+
Sbjct: 172 -VPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDSGSTY 220
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-------YASMTFH 291
T+ PY+ V+ + +Q + C++ F++ + SM
Sbjct: 221 TYFTAQPYQAVVSALKGGLSKSLKQV---SDPTLPLCWKGQKAFKSVFDVKNEFKSMFLS 277
Query: 292 FDRADFKVE--PTYMYFIFQNEGYFCVAI----SFSDRNSVVGAWQQQDTRFVYD 340
F A P Y I G C+ I + +V+G QD +YD
Sbjct: 278 FASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYD 332
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 117/257 (45%), Gaps = 32/257 (12%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP----IFNPNASST 56
HE +F +D+ GTP + + DTGS L W C C + +AP +F+P+ S+T
Sbjct: 71 HEGKFF--MDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTT 128
Query: 57 YKRIPCDDLICRR------PPFRC--ENGQCVHRINYAGGAS---ASGLVSTETFTFHLK 105
Y+ + C C PF C E C++ + Y G S ++G + T+ T
Sbjct: 129 YELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASS 188
Query: 106 NKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQL-KSTAQGLFSYCLVYAY 164
+ + + G IFGCS D+ SF G +G++GF + FS Q+ + T FSYC +
Sbjct: 189 SSI--IDGFIFGCSGDD---SFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFPGDH 243
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
SI + KD + I F DRS Y L D+ V +R+ + R
Sbjct: 244 TAEGFLSIGAYPKDELVY---TNLIPHFGDRSV-YSLQQIDMMVDGNRLQVDQSEYTKRM 299
Query: 225 NGTGGCMIDTGAIATFI 241
++D+G + TF+
Sbjct: 300 -----MVVDSGTVDTFL 311
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 153/372 (41%), Gaps = 39/372 (10%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPF 72
G+ K+ + DTGS + QC ++S P+F+P AS +Y+++PC +C
Sbjct: 5 IGSLQKNLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCISQLCLAVQQ 58
Query: 73 RCENGQ----------CVHRINYAGGASASGLVSTETFTFHLKN---KLVCVPGVIFGCS 119
+ NG C + ++Y +++G S + + N + V V FGC+
Sbjct: 59 QTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCA 118
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQG-LFSYCLVYAYREMEATSILRFGKD 178
+ + F D GI+GF+ SL QLK G FSYC + AT ++ G D
Sbjct: 119 HSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLG-D 177
Query: 179 ANIQRKDMKTIRMFVD-----RSSHYYLSLQDISVADHRIGFAPGTFALR-RNGTGGCMI 232
+ + + + + + RS YY+ L ISV + F L G GG ++
Sbjct: 178 SGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVL 237
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD--SRFRAYASMTF 290
D+G T + Y F S R+++ A+ ++ CY S +
Sbjct: 238 DSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKV-GAAAGFDDCYNISAGSSLPGVPEVRL 296
Query: 291 HFD---RADFKVEPTYMYF-IFQNEGYFCVAISFSDRN-----SVVGAWQQQDTRFVYDL 341
R + + E ++ NE C+AI S ++ +V+G +QQ + YD
Sbjct: 297 SLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDN 356
Query: 342 NTGTIQFVPENC 353
+ F +C
Sbjct: 357 ERSRVGFERADC 368
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 140/365 (38%), Gaps = 43/365 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP ++ L DT + W +PC C ++ +F P S+T+K + C
Sbjct: 97 YIVRAKIGTPPQTLLLAIDTSNDAAW---IPCTACDGCTSTLFAPEKSTTFKNVSCGSPE 153
Query: 67 CRR-PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + P C C + Y G +S + V +T T +PG FGC
Sbjct: 154 CNKVPSPSCGTSACTFNLTY-GSSSIAANVVQDTVTLATDP----IPGYTFGC------- 201
Query: 126 SFDGNIAGILGFSVSP----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
+A G S P SLL Q ++ Q FSYCL +++ + + LR
Sbjct: 202 -----VAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-PSFKSLNFSGSLRL 255
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
G A R + RSS YY++L I V + P A G + D+G
Sbjct: 256 GPVAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSG 315
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDSRFRAYASMTFHFDR 294
+ T + Y V F + + S ++ CY ++TF F
Sbjct: 316 TVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVP---IVAPTITFMFSG 372
Query: 295 ADFKVEPTYMYFIFQNEGYF-CVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQF 348
+ + P I G C+A++ + N +V+ QQQ+ R +YD+ +
Sbjct: 373 MNVTL-PQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGV 431
Query: 349 VPENC 353
E C
Sbjct: 432 ARELC 436
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 133/346 (38%), Gaps = 40/346 (11%)
Query: 20 EFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPF---- 72
+ + DT + W QC PC C+ Q P+F+P SST + C CR P+
Sbjct: 148 QTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGC 207
Query: 73 --RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
R N +C + I Y+ + +G T+T T + V FGCS+ R F
Sbjct: 208 SNRSANAECRYLIEYSDDRATAGTYMTDTLTI---SGTTAVRNFRFGCSHAVRG-RFSDL 263
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIR 190
AG + SLL Q + FSYC+ A A+ L G A +
Sbjct: 264 TAGTMSLGGGAQSLLAQTARSLGNAFSYCVPQA----SASGFLSIGGPATTNSTTVFATT 319
Query: 191 MFVDRS---SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYE 247
V + S Y + LQ I VA R+G P F + G ++D+ A+ T + Y
Sbjct: 320 PLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAF------SAGAVMDSSAVITQLPPTAYR 373
Query: 248 VVMRHFDEHFTSFGRQRMHNASEDWEYCYRY----DSRFRAYASMTFHFDRADFKVEPTY 303
+ R F ++ R A+ + CY + + R A S+ F P
Sbjct: 374 ALRRAFRNAMRAYPRS---GATGTLDTCYDFLGLTNVRVPAV-SLVFGGGAVVVLDPPAV 429
Query: 304 MYFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQF 348
M G + SD +G QQQ +YD+ G + F
Sbjct: 430 MI-----GGCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGF 470
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 139/343 (40%), Gaps = 43/343 (12%)
Query: 16 PSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFRCE 75
PS E L + WTQC PCV C S F+P+AS TY C P
Sbjct: 84 PSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCI-------PSTVG 136
Query: 76 NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGIL 135
N + + Y +++ G +T T + P FGC +N + F G+L
Sbjct: 137 N---TYNMTYGDKSTSVGNYGCDTMTLEPSDVF---PKFQFGCGRNN-EGDFGSGADGML 189
Query: 136 GFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMF--- 192
G S + Q S + +FSYCL E ++ L FG+ A Q +K +
Sbjct: 190 GLGQGQLSTVSQTASKFKKVFSYCL----PEEDSIGSLLFGEKATSQ-SSLKFTSLVNGP 244
Query: 193 ----VDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEV 248
++ S +Y++ L DISV + R+ FA + G +ID+G + T + + Y
Sbjct: 245 GTSGLEESGYYFVKLLDISVGNKRLNVPSSVFA-----SPGTIIDSGTVITCLPQRAYSA 299
Query: 249 VMRHFDEHFTSFGRQRMHNASED-WEYCYRYDSRFRA-YASMTFHF-DRADFKVEPTYMY 305
+ F + + D + CY R + HF + AD ++ +
Sbjct: 300 LTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRV- 358
Query: 306 FIFQNEGY-FCVAISFSDRN------SVVGAWQQQDTRFVYDL 341
I+ N+ C+A + + ++ +++G QQ +YD+
Sbjct: 359 -IWGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDI 400
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 148/366 (40%), Gaps = 30/366 (8%)
Query: 3 KNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPC 62
+N +YT + GTP + L+ DTGS + + C C +C + P F P S TY+ + C
Sbjct: 89 RNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC 148
Query: 63 DDLICRRPPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
C C+N QC + YA +++SG + + +F + +L IFGC N
Sbjct: 149 -TWQC-----NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTEL-SPQRAIFGCEN 201
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKD 178
D ++ GI+G S++ QL K FS C A +
Sbjct: 202 DETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISPP 261
Query: 179 ANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
A DM R RS +Y + L++I VA R+ P F +G G ++D+G
Sbjct: 262 A-----DMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVF----DGKHGTVLDSGTTY 312
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADF- 297
++ + + S R + + + C+ + S +F F
Sbjct: 313 AYLPESAFLAFKHAIMKETHSLKRISGPDPRYN-DICFSGAEIDVSQISKSFPVVEMVFG 371
Query: 298 -----KVEP-TYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFV 349
+ P Y++ + G +C+ + + +D +++G ++T +YD I F
Sbjct: 372 NGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFW 431
Query: 350 PENCAN 355
NC+
Sbjct: 432 KTNCSE 437
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 150/381 (39%), Gaps = 44/381 (11%)
Query: 4 NYFYTVDVLF----GTPSKSEFLLFDTGSYLIWTQC--LPCVNCFNQSAPIFNPNASSTY 57
N+ Y++ ++ GTPS+S+ L+ DTGS L W QC F+P+ SS++
Sbjct: 74 NFKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSF 133
Query: 58 KRIPCDDLIC--RRP----PFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVC 110
+PC +C R P P C+ N C + YA G A G + E FTF +
Sbjct: 134 SDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTF---SNSQT 190
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
P +I GC+ ++ D + GILG ++ S + Q K + FSYC+
Sbjct: 191 TPPLILGCAKESTD------VKGILGMNLGRLSFISQAKISK---FSYCIPTRSNRPGLA 241
Query: 171 SILRFGKDANIQRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFA 221
S F N + K + + S Y + L I + R+ F
Sbjct: 242 STGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFR 301
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTS-FGRQRMHNASEDWEYCYRYDS 280
G+G M+D+G+ T + Y+ V S + ++ ++ D C+ +
Sbjct: 302 PDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADM--CFDGNH 359
Query: 281 RF---RAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQ 333
+ R + F F R + + G CV I S ++++G QQ
Sbjct: 360 QMVIGRLIGDLVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQ 419
Query: 334 DTRFVYDLNTGTIQFVPENCA 354
+ +D+ + F C+
Sbjct: 420 NLWVEFDVANRRVGFSKAECS 440
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 119/280 (42%), Gaps = 31/280 (11%)
Query: 90 SASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAG---ILGFSVSPFSLLG 146
+++G+++TETFTF + FGC +G IAG I+G S P S+L
Sbjct: 2 TSTGVLATETFTFGAHQNFSA--NLTFGCGKLT-----NGTIAGASGIMGVSPGPLSVLK 54
Query: 147 QLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR----KDMKTIRMFVD--RSSHYY 200
QL T FSYCL + + TS + FG A++ + ++TI + + +YY
Sbjct: 55 QLSITK---FSYCLT-PFTD-HKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYY 109
Query: 201 LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSF 260
+ + IS+ R+ ALR +GTGG ++D+ ++ ++ + + E
Sbjct: 110 VPMVGISIGSKRLDVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLP 169
Query: 261 GRQRMHNASEDWEYCYRYDSRFRA----YASMTFHFDRADFKVEPTYMYFIFQNEGYFCV 316
R + +D+ C+ + HF P YF + G C+
Sbjct: 170 AANR---SIDDYPVCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCL 226
Query: 317 AI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
A+ F +V+G QQQ+ +YDL + P C
Sbjct: 227 AVMQAPFEGAPNVIGNVQQQNMHVLYDLGNRKFSYAPTKC 266
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 156/378 (41%), Gaps = 44/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP+K ++ DTGS ++W C+ C C S+ ++N N S T K +
Sbjct: 77 LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLV 136
Query: 61 PCDDLIC-----RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKN---KLVCVP 112
PCD C + P N C + Y G+S +G + + + K
Sbjct: 137 PCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAAN 196
Query: 113 G-VIFGCS---NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYRE 166
G VIFGC + + S + + GILGF S S++ QL T + +F++CL
Sbjct: 197 GSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL----DG 252
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
I G +Q K + + HY +++ + V + F
Sbjct: 253 TNGGGIFVIGH--VVQPK--VNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVF--EAGD 306
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRAY 285
G +ID+G ++ Y+ ++ ++H +++ C++Y DS +
Sbjct: 307 RKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDL---KVHTVRDEYT-CFQYSDSLDDGF 362
Query: 286 ASMTFHFDRA-DFKVEPTYMYFIFQNEGYFCV-----AISFSDRN--SVVGAWQQQDTRF 337
++TFHF+ + KV P F F EG +C+ + DR +++G +
Sbjct: 363 PNVTFHFENSVILKVYPHEYLFPF--EGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLV 420
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 421 LYDLENQAIGWTEYNCSS 438
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 153/385 (39%), Gaps = 64/385 (16%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP------- 61
V + GTP + L+ DTGS L W QC + P+ P +S +
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSSSFSLLP 126
Query: 62 CDDLIC--RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
C+ IC R P P C +N C + YA G A G + E FTF +K + P V
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTF---SKSLSTPPV 183
Query: 115 IFGC---SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
I GC S +NR GILG + S + Q K + FSYC V + T
Sbjct: 184 ILGCAQASTENR---------GILGMNRGRLSFISQAKISK---FSYC-VPSRTGSNPTG 230
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSH-------YYLSLQDISVADHRIGFAPGTFALRR 224
+ G + N + T+ F + S Y L ++ I +A R+ P F
Sbjct: 231 LFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDA 290
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-- 282
G+G MID+G+ T++ YE V + ++ + ++ + C+
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAM-MKKGYVYADVADMCFDAGVTAEV 349
Query: 283 -RAYASMTFHFDRADFKVEPTYMYFIFQNEGYF--------CVAISFSDR----NSVVGA 329
R ++F FD VE F+ + EG CV I S+R ++++G
Sbjct: 350 GRRIGGISFEFDNG---VE----IFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGT 402
Query: 330 WQQQDTRFVYDLNTGTIQFVPENCA 354
QQ+ YDL + F C+
Sbjct: 403 VHQQNMWVEYDLANKRVGFGGAECS 427
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 153/385 (39%), Gaps = 64/385 (16%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIP------- 61
V + GTP + L+ DTGS L W QC + P+ P +S +
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSSSFSLLP 126
Query: 62 CDDLIC--RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGV 114
C+ IC R P P C +N C + YA G A G + E FTF +K + P V
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTF---SKSLSTPPV 183
Query: 115 IFGC---SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS 171
I GC S +NR GILG + S + Q K + FSYC V + T
Sbjct: 184 ILGCAQASTENR---------GILGMNHGRLSFISQAKISK---FSYC-VPSRTGSNPTG 230
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSH-------YYLSLQDISVADHRIGFAPGTFALRR 224
+ G + N + T+ F + S Y L ++ I +A R+ P F
Sbjct: 231 LFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDA 290
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-- 282
G+G MID+G+ T++ YE V + ++ + ++ + C+
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAM-MKKGYVYADVADMCFDAGVTAEV 349
Query: 283 -RAYASMTFHFDRADFKVEPTYMYFIFQNEGYF--------CVAISFSDR----NSVVGA 329
R ++F FD VE F+ + EG CV I S+R ++++G
Sbjct: 350 GRRIGGISFEFDNG---VE----IFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGT 402
Query: 330 WQQQDTRFVYDLNTGTIQFVPENCA 354
QQ+ YDL + F C+
Sbjct: 403 VHQQNMWVEYDLANKRVGFGGAECS 427
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 108/262 (41%), Gaps = 29/262 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y V G+P K F+ DTGS ++W C PC C N FNP+ SST +I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 61 PCDDLICRRPPFRCE-------NGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVC- 110
PC D C E N C + Y G+ SG ++T F + N+
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209
Query: 111 -VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYR 165
++FGCSN D + D + GI GF S++ QL S + +FS+CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL---KG 266
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
IL G+ I + V HY L+L+ I V ++ F +
Sbjct: 267 SDNGGGILVLGE---IVEPGL-VYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFT--TS 320
Query: 226 GTGGCMIDTGAIATFIQRGPYE 247
T G ++D+G ++ G Y+
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYD 342
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 146/391 (37%), Gaps = 56/391 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCFNQSAPI------FNPNASSTY 57
Y+V + FGTP ++ + DTGS ++W C C +C S+ F P SS+
Sbjct: 67 YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSS 126
Query: 58 KRIPCDDLICRRPPF------------RCENGQCVHRINYAGGASASGLVSTETFTFHLK 105
K + C + C C N C + + G + G+ +ET H
Sbjct: 127 KLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSETLHLHSL 186
Query: 106 NKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYR 165
+K P + GCS FS AGI GF SL QL G FSYCL+
Sbjct: 187 SK----PNFLVGCSV----FS-SHQPAGIAGFGRGLSSLPSQL---GLGKFSYCLLSHRF 234
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMF---------VDRSS----HYYLSLQDISVADHR 212
+ + D D KT + VD S +YYL L+ I+V H
Sbjct: 235 DDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHH 294
Query: 213 IGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW 272
+ + +G GG +ID+G TF+ R +E + F + R + +
Sbjct: 295 VKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGL 354
Query: 273 EYCYRY-DSRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAI---------SFSD 322
C+ D++ ++ + +F P YF F C+ +
Sbjct: 355 RPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGG 414
Query: 323 RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++G +Q Q+ YDL + F E C
Sbjct: 415 PGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 154/365 (42%), Gaps = 34/365 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD-D 64
+YT + GTP ++ L+ DTGS L + C C C P F P+ SSTY+ + C +
Sbjct: 91 YYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSME 150
Query: 65 LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C E CV+ YA +S+SG++ + +F +++L +FGC N
Sbjct: 151 CTC-----DSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQ-RTVFGCENVETG 204
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
+ GI+G S++ QL S+ L Y ++ +++ G
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISP---PA 261
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
M RS++Y + L++I +A ++ P F +G G ++D+G ++
Sbjct: 262 GMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVF----DGKYGTILDSGTTYAYLPEP 317
Query: 245 PY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF--------HF 292
+ + +M+ + G R +N + C+ + S TF +
Sbjct: 318 AFKAFKDAIMKELNSLKLIQGPDRNYN-----DICFSGVGSDVSQLSKTFPAVDLVFSNG 372
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
+R E Y++ + G +C+ I + +D+ +++G ++T +YD I F
Sbjct: 373 NRLSLSPE-NYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWK 431
Query: 351 ENCAN 355
NC+
Sbjct: 432 TNCSE 436
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 89/378 (23%), Positives = 154/378 (40%), Gaps = 45/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTP+K ++ DTGS ++W C+ C C +S +++P S + + +
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148
Query: 61 PCDDLICRRP-----PFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVPG- 113
CD C P C + I+Y G+S +G T+ ++ + P
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPAN 208
Query: 114 --VIFGCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREM 167
V FGC N+A GILGF S S+L QL + + +F++CL +
Sbjct: 209 ASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL----DTV 264
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G N+ + +KT + D HY + L+ I V +G F +
Sbjct: 265 NGGGIFAIG---NVVQPKVKTTPLVPDM-PHYNVILKGIDVGGTALGLPTNIF--DSGNS 318
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRH-FDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +ID+G ++ G Y+ + FD+H Q + + S C++Y +
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKH-QDISVQTLQDFS-----CFQYSGSVDDGF 372
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQN-EGYFCVAISFSDRNSVVG-------AWQQQDTRF 337
+TFHF+ D + + ++FQN + +C+ + G +
Sbjct: 373 PEVTFHFE-GDVSLIVSPHDYLFQNGKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSNKLV 431
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 432 LYDLENQAIGWADYNCSS 449
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 154/365 (42%), Gaps = 34/365 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD-D 64
+YT + GTP ++ L+ DTGS L + C C C P F P+ SSTY+ + C +
Sbjct: 91 YYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSME 150
Query: 65 LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
C E CV+ YA +S+SG++ + +F +++L +FGC N
Sbjct: 151 CTC-----DSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQ-RTVFGCENVETG 204
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRK 184
+ GI+G S++ QL S+ L Y ++ +++ G
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISP---PA 261
Query: 185 DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRG 244
M RS++Y + L++I +A ++ P F +G G ++D+G ++
Sbjct: 262 GMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVF----DGKYGTILDSGTTYAYLPEP 317
Query: 245 PY----EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF--------HF 292
+ + +M+ + G R +N + C+ + S TF +
Sbjct: 318 AFKAFKDAIMKELNSLKLIQGPDRNYN-----DICFSGVGSDVSQLSKTFPAVDLVFSNG 372
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
+R E Y++ + G +C+ I + +D+ +++G ++T +YD I F
Sbjct: 373 NRLSLSPE-NYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWK 431
Query: 351 ENCAN 355
NC+
Sbjct: 432 TNCSE 436
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 137/365 (37%), Gaps = 66/365 (18%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y ++ GTP + + +WTQC PC CF Q P+FN R + +
Sbjct: 27 LYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFN--------RYEVETM 78
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
SG+ T+TF + FGC+ D+
Sbjct: 79 F----------------------GDTSGIGGTDTFAIGTATA-----SLAFGCAMDSNIK 111
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
G +G++G +P+SL+GQ+ +TA FSYCL + S L G A +
Sbjct: 112 QLLGA-SGVVGLGRTPWSLVGQMNATA---FSYCLA-PHGAAGKKSALLLGASAKLAGGK 166
Query: 186 MKTIRMFV---DRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
V D SS Y + L+ I D I P NG+ ++DT +F+
Sbjct: 167 SAATTPLVNTSDDSSDYMIHLEGIKFGDVIIEPPP-------NGSV-VLVDTIFGVSFLV 218
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS------MTFHFDRAD 296
+ + + + G M ++ ++ C+ + S + F A
Sbjct: 219 DAAFHAIKKAVT---VAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAA 275
Query: 297 FKVEPTYMYFIFQNEGYFCVA------ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
P Y G C+A ++ + S++G Q++ F++DL+ T+ F P
Sbjct: 276 ALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEP 335
Query: 351 ENCAN 355
+C++
Sbjct: 336 ADCSS 340
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 131/327 (40%), Gaps = 34/327 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + G P + FL DTGS L W QC PC NC P++ P + K +P D
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKP---AKEKIVPPRDS 247
Query: 66 ICRRPPFR---CEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
+C+ CE QC + I YA +S+ G+++ + N +FGC+ D
Sbjct: 248 LCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNGGREKLDFVFGCAYD 307
Query: 122 NRD--FSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFGK 177
+ S GILG S + SL QL S +F +C+ RE + G
Sbjct: 308 QQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCIT---RETNGGGYMFLGD 364
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
D + R M + + Y+ Q ++ D L + + D+G+
Sbjct: 365 DY-VPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQE---------LHAGNSVQVIFDSGSS 414
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFDRAD 296
T++ Y+ ++ E SF + ++ C++ D R+ + + HF R
Sbjct: 415 YTYLPEEMYKNLIDAIKEDSPSFVQD---SSDTTLPLCWKADFSVRSFFKPLNLHFGRRW 471
Query: 297 FKVEPTYM-----YFIFQNEGYFCVAI 318
F V T+ Y I ++G C+ +
Sbjct: 472 FVVPKTFTIVPDDYLIISDKGNVCLGL 498
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 147/338 (43%), Gaps = 40/338 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTPSK ++ DTGS ++W C C C +S +++ AS+T +
Sbjct: 77 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 136
Query: 61 PCDDLICRR---PPFRCENG-QCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP--- 112
CDD C P C+ G QC++ + Y G+S +G + ++ + P
Sbjct: 137 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 196
Query: 113 GVIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCL-------V 161
V+FGC N S + GILGF + S+L QL S+ + +FS+CL +
Sbjct: 197 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI 256
Query: 162 YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTF- 220
+A E+ + RF + + + +F+ R +HY + +++I V + F
Sbjct: 257 FAIGEVVEPKV-RF-----LLMNSVMIVVLFLSR-AHYNVVMKEIEVGGDPLDVPSDAFE 309
Query: 221 ALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS 280
+ R GT +ID+G + P EV + ++ + R+H + + C+ Y
Sbjct: 310 SGDRKGT---IIDSGTTLAYF---PQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTG 362
Query: 281 RF-RAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVA 317
+ ++T HFD++ + Y E +C+
Sbjct: 363 NVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIG 400
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 160/378 (42%), Gaps = 43/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + G+P K ++ DTGS ++W C+ C C +S +++P S T + I
Sbjct: 69 LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELI 128
Query: 61 PCDDLICRR----PPFRCENG-QCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP-- 112
CD C P C++ C + I Y G++ +G + T+ H+ + L P
Sbjct: 129 SCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQN 188
Query: 113 -GVIFGCS---NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYRE 166
+IFGC + S + + GI+GF S S+L QL ++ + +FS+CL
Sbjct: 189 SSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL----DN 244
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
+ I G+ + + T + V R +HY + L+ I V D I P NG
Sbjct: 245 IRGGGIFAIGE---VVEPKVSTTPL-VPRMAHYNVVLKSIEV-DTDILQLPSDIFDSGNG 299
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +ID+G ++ P V + R +++ + + C++Y R +
Sbjct: 300 K-GTIIDSGTTLAYL---PAIVYDELIPKVMARQPRLKLYLVEQQFS-CFQYTGNVDRGF 354
Query: 286 ASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFS---DRN----SVVGAWQQQDTRF 337
+ HF D V P F F+ +G +C+ S +N +++G +
Sbjct: 355 PVVKLHFEDSLSLTVYPHDYLFQFK-DGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLV 413
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 414 IYDLENMAIGWTDYNCSS 431
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 81/341 (23%), Positives = 132/341 (38%), Gaps = 43/341 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
N Y + G+P + FL DTGS L W QC PC +C P++ P + +P
Sbjct: 98 NGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNL---VPL 154
Query: 63 DDLICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
D +C + G QC + I YA +S+ G+++++ L N + G++F
Sbjct: 155 KDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMF 214
Query: 117 GCSNDNRDFSFD--GNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC+ D + + GILG S + SL QL S Q + + L +
Sbjct: 215 GCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLAS--QRIINNVLGHCLTSDATGGGYM 272
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG-TGGCMID 233
F D + M + M S +Y+ + IS ++ R++G T + D
Sbjct: 273 FLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLG------RQDGRTERVVFD 326
Query: 234 TGAIATFIQRGPYEVVMRHF----DEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----- 284
TG+ T+ + Y ++ DE G + C+R R+
Sbjct: 327 TGSSYTYFPKEAYYALVASLKDVSDEGLIQDG------SDPTLPVCWRAKFPIRSVIDVK 380
Query: 285 --YASMTFHFDRADFKVE-----PTYMYFIFQNEGYFCVAI 318
+ +T F + V P Y I N+G C+ I
Sbjct: 381 QFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGI 421
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 97/222 (43%), Gaps = 12/222 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV---NCFNQSAPIFNPNASSTYKRIPCD 63
Y V GTP ++ + DTGS L W QC PC +C++Q P+F+P SS+Y +PC
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 107
Query: 64 DLICRRPPFRCENGQCV----HRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C + + ++Y G++ +G+ S++T T + V G FGC
Sbjct: 108 GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFGCG 164
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ F+G + G+LG SL+ Q T G+FSYCL ++ G
Sbjct: 165 HAQSGL-FNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSG 222
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
+ + ++Y + L ISV ++ FA
Sbjct: 223 AAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA 264
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/412 (23%), Positives = 148/412 (35%), Gaps = 72/412 (17%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----------FNQSAP-------- 47
Y V GTP++ L+ DTGS L W +C + AP
Sbjct: 55 YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114
Query: 48 ---------IFNPNASSTYKRIPCDDLICRRP-PFR-----CENGQCVHRINYAGGASAS 92
+F P+ S T+ IPC C PF C + Y G++A
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAAR 174
Query: 93 GLVSTETFTFHL-------KNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLL 145
G V T++ T L K + + GV+ GC+ SF + G+L S S
Sbjct: 175 GTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLAS-DGVLSLGYSNVSFA 233
Query: 146 GQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQD 205
+ + G FSYCLV ATS L FG + + R S+ + Q
Sbjct: 234 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASAS--RTACAGSAAAPGARQT 291
Query: 206 ISVADHRIG--FAPGTFALRRNGT--------------GGCMIDTGAIATFIQRGPYEVV 249
+ DHR+ +A + +G GG ++D+G T + Y V
Sbjct: 292 PLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSPAYRAV 351
Query: 250 MRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR------AYASMTFHFDRADFKVEPTY 303
+ + R M + ++YCY + S A ++ HF + P
Sbjct: 352 VAALGKKLVGLPRVAM----DPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPK 407
Query: 304 MYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y I G C+ + D SV+G QQ+ + +DL ++F C
Sbjct: 408 SYVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 81/361 (22%), Positives = 147/361 (40%), Gaps = 24/361 (6%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C N P F P+ SS+Y + C+
Sbjct: 86 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCN 145
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QC + YA +S+SG++ + +F +++L +FGC N
Sbjct: 146 VDCTC-----DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQ-RAVFGCENSE 199
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + GI+G S++ QL S+ L Y ++ +++ G A
Sbjct: 200 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAP-- 257
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM RS +Y + L++I VA + F N G ++D+G ++
Sbjct: 258 -SDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVF----NSKHGTVLDSGTTYAYLP 312
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY----RYDSRFRAY---ASMTFHFDRA 295
+ S + R + + + C+ R S+ M F +
Sbjct: 313 EQAFVAFKDAVTSKVHSLKKIRGPDPNYK-DICFAGAGRNVSKLHEVFPDVDMVFGNGQK 371
Query: 296 DFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
Y++ + +G +C+ + + D +++G ++T YD + I F NC
Sbjct: 372 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431
Query: 354 A 354
+
Sbjct: 432 S 432
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 81/341 (23%), Positives = 132/341 (38%), Gaps = 43/341 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
N Y + G+P + FL DTGS L W QC PC +C P++ P + +P
Sbjct: 311 NGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNL---VPL 367
Query: 63 DDLICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
D +C + G QC + I YA +S+ G+++++ L N + G++F
Sbjct: 368 KDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMF 427
Query: 117 GCSNDNRDFSFD--GNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC+ D + + GILG S + SL QL S Q + + L +
Sbjct: 428 GCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLAS--QRIINNVLGHCLTSDATGGGYM 485
Query: 175 FGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG-TGGCMID 233
F D + M + M S +Y+ + IS ++ R++G T + D
Sbjct: 486 FLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLG------RQDGRTERVVFD 539
Query: 234 TGAIATFIQRGPYEVVMRHF----DEHFTSFGRQRMHNASEDWEYCYRYDSRFRA----- 284
TG+ T+ + Y ++ DE G + C+R R+
Sbjct: 540 TGSSYTYFPKEAYYALVASLKDVSDEGLIQDG------SDPTLPVCWRAKFPIRSVIDVK 593
Query: 285 --YASMTFHFDRADFKVE-----PTYMYFIFQNEGYFCVAI 318
+ +T F + V P Y I N+G C+ I
Sbjct: 594 QFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGI 634
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 146/378 (38%), Gaps = 56/378 (14%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
FY V + G P + FL DTGS L W QC PC C P++ P + IPC D
Sbjct: 73 FYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHPLYKP----SNDFIPCKD 128
Query: 65 LICR--RPP--FRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C +P + CE+ QC + I YA S G++ + + + N + + GC
Sbjct: 129 PLCASLQPTDDYTCEDPNQCDYEIKYADQYSTLGVLLNDVYLLNFTNGVQLKVRMALGCG 188
Query: 120 NDNRDFSFDG--NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
D + FS + GILG SL+ QL S QGL + + I FG
Sbjct: 189 YD-QIFSPSTYHPLDGILGLGRGKASLISQLNS--QGLVRNVMGHCLSSRGGGYIF-FGN 244
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI--DTG 235
+ R I +D HY G A F R+ G G I DTG
Sbjct: 245 VYDSSRMSWTPISS-IDSGKHY------------SAGPAELVFGGRKTGVGSLNIIFDTG 291
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRA-------YA 286
+ T+ Y+ ++ ++ R+ + A +D C+ FR+ +
Sbjct: 292 SSYTYFNSQAYQAMISLLNKELH---RKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFK 348
Query: 287 SMTFHFD-----RADFKVEPTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQQDT 335
+T F + F++ P Y I N G C+ I + N ++G D
Sbjct: 349 PLTLSFTNGGRVKPQFEIPPE-AYLIISNMGNVCLGILNGPEVGLGELN-LIGDISMLDK 406
Query: 336 RFVYDLNTGTIQFVPENC 353
V+D I + P +C
Sbjct: 407 VMVFDNEKQLIGWGPADC 424
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 88/376 (23%), Positives = 141/376 (37%), Gaps = 44/376 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYK-RIPCD 63
Y + +L G P+K +L DTGS L W QC PC +C + +++P + R+P
Sbjct: 22 LYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPKKARLVDCRVPLC 81
Query: 64 DLICRRPPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
L+ + + C QC + + YA G+S G++ +T T L N I GC D
Sbjct: 82 ALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTNGTRSKTTAIIGCGYD 141
Query: 122 NRD--FSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGK 177
+ + G++G S + SL QL K + + +CL L FG
Sbjct: 142 QQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAGGS---NGGGYLFFGD 198
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAI 237
V + + S+ + IG G + GG M D+G
Sbjct: 199 S-------------LVPALGMTWTPIMGKSITGN-IGGKSGDADDKTGDIGGVMFDSGTS 244
Query: 238 ATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRA-- 295
T++ Y V+ + G R+ +C+R S F + A + +F
Sbjct: 245 FTYLVPEAYNAVLSAMEMQVEKSGLVRIKT-DNTLPFCWRGPSPFESVADVQRYFKTVTL 303
Query: 296 DFKVEPTYM-----------YFIFQNEGYFCVAI-----SFSDRNSVVGAWQQQDTRFVY 339
DF Y Y I +G C+ I + + +++G + VY
Sbjct: 304 DFGKRNWYSASRVLELSPEGYLIVSTQGNVCLGILDASGASLEVTNIIGDVSMRGYLVVY 363
Query: 340 DLNTGTIQFVPENCAN 355
D I +V NC N
Sbjct: 364 DNARNQIGWVRRNCHN 379
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 99/222 (44%), Gaps = 12/222 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV---NCFNQSAPIFNPNASSTYKRIPCD 63
Y V GTP ++ + DTGS L W QC PC +C++Q P+F+P SS+Y +PC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 64 DLICRR----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C C QC + ++Y G++ +G+ S++T T + V G FGC
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFGCG 256
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ F+G + G+LG SL+ Q T G+FSYCL ++ G
Sbjct: 257 HAQSGL-FNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSG 314
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
+ + ++Y + L ISV ++ FA
Sbjct: 315 AAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA 356
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 150/375 (40%), Gaps = 39/375 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V G+P+K ++ DTGS ++W C+ C NC + S F+ SST +
Sbjct: 82 LYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 61 PCDDLICRRPPFRCENG------QCVHRINYAGGASASGLVSTETF---TFHLKNKLVC- 110
C D IC +G QC + Y G+ +G ++T T L +V
Sbjct: 142 SCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVAN 201
Query: 111 -VPGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYR 165
++FGCS + D + D + GI GF S++ QL S +FS+CL
Sbjct: 202 SSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL---KG 258
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
+L G+ I + V HY L+LQ I+V + FA N
Sbjct: 259 GENGGGVLVLGE---ILEPSI-VYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNN 314
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRA 284
G ++D+G ++ + Y + + F + + ++ CY +S
Sbjct: 315 --QGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ----CYLVSNSVGDI 368
Query: 285 YASMTFHF-DRADFKVEPTYM---YFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVY 339
+ ++ +F A + P + Y + +C+ +R +++G +D FVY
Sbjct: 369 FPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVY 428
Query: 340 DLNTGTIQFVPENCA 354
DL I + NC+
Sbjct: 429 DLANQRIGWADYNCS 443
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 149/361 (41%), Gaps = 45/361 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPCDDL 65
Y + G+P K L+ DTGS L W +C PC +C + F+ AS+TYK + C D
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDC----SSTFDRLASNTYKALTCAD- 57
Query: 66 ICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTF--HLKNKLVCVPGVIFGCSNDNR 123
+ Y G+ G +S +T ++L PG +FGC + +
Sbjct: 58 --------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSLLK 103
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME-ATSILRFGKDANIQ 182
G + GIL S S Q+ FSYCL+ + S + FG +A ++
Sbjct: 104 GL-ISGEV-GILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFG-EAAVE 160
Query: 183 RKDMKTIRM-------FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI-DT 234
K+ + ++ + S +Y + L ISV + R+ +P F NG I D+
Sbjct: 161 LKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFL---NGQDKPTIFDS 217
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRAYASMTFHFD 293
G T + G V + S A + + C+R S + +TFHF+
Sbjct: 218 GTTLTMLPPG----VCDSIKQSLASMVSGAEFVAIKGLDACFRVPPSSGQGLPDITFHFN 273
Query: 294 -RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
ADF P+ ++ C+ ++ S+ G QQQD ++D++ I F +
Sbjct: 274 GGADFVTRPS--NYVIDLGSLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRRIGFKETD 331
Query: 353 C 353
C
Sbjct: 332 C 332
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 151/392 (38%), Gaps = 66/392 (16%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V + G P + +L DTGS L W QC PCV C P++ P++ IPC+D
Sbjct: 37 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSD----LIPCND 92
Query: 65 LICRRPPF----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C+ RCE QC + + YA G S+ G++ + F+ + L P + GC
Sbjct: 93 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 152
Query: 120 NDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
D S + G+LG S+L QL S + + +CL IL FG
Sbjct: 153 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL-----SSLGGGILFFG 207
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
D + + M + S HY ++ + F T L+ T + D+G+
Sbjct: 208 DDLYDSSR-VSWTPMSREYSKHYSPAMGG------ELLFGGRTTGLKNLLT---VFDSGS 257
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF-- 292
T+ Y+ V + + + A +D C++ F + + +F
Sbjct: 258 SYTYFNSKAYQAVTYLLKRELSG---KPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKP 314
Query: 293 ----------DRADFKVEPTYMYFI----------------FQNEGYFCVAI----SFSD 322
+ F++ P I Q +G C+ I
Sbjct: 315 LALSFKTGWRSKTLFEIPPEAYLIISVWFSHTMLKGRFIKMLQMKGNVCLGILNGTEIGL 374
Query: 323 RN-SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+N +++G QD +YD +I ++P +C
Sbjct: 375 QNLNLIGDISMQDQMIIYDNEKQSIGWMPVDC 406
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 110/429 (25%), Positives = 154/429 (35%), Gaps = 102/429 (23%)
Query: 7 YTVDVLFGTPSKSE--FLLFDTGSYLIWTQCLP--CVNCFNQ-----SAPIFNPNASSTY 57
YT+ + G S + L DTGS L+W C P C+ C + S P+ P S
Sbjct: 90 YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDS--- 146
Query: 58 KRIPCDDLICRR-----PP------FRC-----ENGQC-------------------VH- 81
+RIPC +C PP RC E G C H
Sbjct: 147 RRIPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHL 206
Query: 82 ---RINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFS 138
R+ GA AS V+ + FTF + + G G+ GF
Sbjct: 207 RRGRVALGAGARASVAVAVDNFTFACAHTAL-------------------GEPVGVAGFG 247
Query: 139 VSPFSLLGQLKSTAQGLFSYCLV-YAYR--EMEATSILRFGKDANIQRKDMKTIRMFV-- 193
P SL GQL G FSYCLV +++R + S L G+ + FV
Sbjct: 248 RGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYT 307
Query: 194 ---DRSSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYE 247
H Y ++L+ +SV RI P + R G GG ++D+G T + Y
Sbjct: 308 PLLHNPKHPYFYSVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYA 367
Query: 248 VVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMY 305
V F + G R A E CYRY + R + HF P Y
Sbjct: 368 RVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPRRNY 427
Query: 306 FI-FQNE---------GYFCVAISFSDRNS---------VVGAWQQQDTRFVYDLNTGTI 346
F+ F++E C+ + S +G +QQQ VYD++ G +
Sbjct: 428 FMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRV 487
Query: 347 QFVPENCAN 355
F C +
Sbjct: 488 GFARRRCTD 496
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 162/391 (41%), Gaps = 58/391 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCL---PCVNC----FNQSAPIFNPNASSTYKR 59
Y++ + FGTP ++ L+ DTGS L+W C C NC N S+ IF P +SS+ K
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149
Query: 60 IPCDDLIC-----RRPPFRCENGQ---------CVHRINYAGGASASGLVSTETFTFHLK 105
+ C + C + RC + + C + + G G++ +ET K
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGK 209
Query: 106 NKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYR 165
VP I GCS + AGI GF P SL QL FSYCL+
Sbjct: 210 G----VPNFIVGCSVLSTS-----QPAGISGFGRGPPSLPSQLGLKK---FSYCLLSRRY 257
Query: 166 E--MEATSILRFGKDANIQRKDMKTIRMFVDR---------SSHYYLSLQDISVADHRIG 214
+ E++S++ G+ + ++ + FV S +YYL L+ I+V +
Sbjct: 258 DDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVK 317
Query: 215 FAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY 274
+G GG +ID+G T+++ +E+V F++ S R
Sbjct: 318 IPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQS-KRATEVEGITGLRP 376
Query: 275 CYRYDS-RFRAYASMTFHFDRADFKVE-PTYMYFIF-QNEGYFCVAI--------SFSDR 323
C+ ++ +T F R ++E P Y F + C+ I FS
Sbjct: 377 CFNISGLNTPSFPELTLKF-RGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGG 435
Query: 324 NSVV-GAWQQQDTRFVYDLNTGTIQFVPENC 353
+++ G +QQQ+ YDL + F ++C
Sbjct: 436 PAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/362 (22%), Positives = 149/362 (41%), Gaps = 22/362 (6%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA---PIFNPNASSTYKRIPC 62
+YT V GTP++ L+ DTGS + + C C +C + A P F P+ SS+Y+ + C
Sbjct: 98 YYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSC 157
Query: 63 DDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
+ C QC + YA +S+ G++ + F ++L P ++FGC
Sbjct: 158 NSPDCITKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHP-LLFGCETAE 216
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ + GI+G P S++ QL T S+ L Y + S++ A
Sbjct: 217 TGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVL---GAIPP 273
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
M + +RS++Y L L +I V + F NG G ++D+G ++
Sbjct: 274 PPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF----NGRLGTVLDSGTTYAYLP 329
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCYR---YDSRF--RAYASMTFHF--DR 294
++ + S Q + + + C+ DS+ + + + F F ++
Sbjct: 330 DKAFDAFKDAITQQLGSL--QAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGNQ 387
Query: 295 ADFKVEPTYMYFIFQNEGYFCVA-ISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
F Y++ + G +C+ D +++G ++T YD I F NC
Sbjct: 388 KVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQIGFFKTNC 447
Query: 354 AN 355
N
Sbjct: 448 TN 449
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 146/361 (40%), Gaps = 36/361 (9%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCL---PCVNCFNQSAPIFNPNASSTYKRI 60
YF V V GTP+ + ++ DTGS ++W P + Q + A + R
Sbjct: 121 EYFAQVGV--GTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSS--TGAAPAPTPRW 176
Query: 61 PCDDLICRR-PPFRCEN--GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C ICRR C+ C++++ Y G+ +G ++ET TF + V V G
Sbjct: 177 NCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR---VQRVAIG 233
Query: 118 CSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGK 177
C +DN + LG F Q+ + FSYCLV A R+G
Sbjct: 234 CGHDNEGLFIAASGLLGLGRGRLSFP--SQIARSFGRSFSYCLVDRTSSRRARPSRRWGG 291
Query: 178 DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRI-GFAPGTFALR-RNGTGGCMIDTG 235
T RM ++ YY+ L SV R+ G + L G GG ++D+G
Sbjct: 292 ----------TPRM----ATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSG 337
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSR-FRAYASMTFHFDR 294
T + R YE V F + G + ++ CY R +++ H
Sbjct: 338 TSVTRLARPVYEAVRDAFRA--AAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAG 395
Query: 295 ADFKVEPTYMYFI-FQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVYDLNTGTIQFVPEN 352
P Y I G FC A++ +D S++G QQQ R V+D + + FVP++
Sbjct: 396 GASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKS 455
Query: 353 C 353
C
Sbjct: 456 C 456
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 143/378 (37%), Gaps = 52/378 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPC 62
N FY V + G P K FL DTGS L W QC PC C P++ P + +PC
Sbjct: 54 NGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQP----SNDLVPC 109
Query: 63 DDLIC----RRPPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
D +C RCEN QC + + YA G S+ G++ + F +L N P + G
Sbjct: 110 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 169
Query: 118 CSNDNRDFSFDGN-IAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
C D S + + GILG S++ QL + QG+ + + +
Sbjct: 170 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHN--QGIVRNVVGHCFNSKGGGYXFFGD 227
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI--DT 234
+ R + M D HY GF F R G + D+
Sbjct: 228 GIYDPYR--LVWTPMSRDYPKHY------------SPGFGELIFNGRSTGLRNLFVVFDS 273
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF 292
G+ T+ Y+V+ + + + A +D C+R ++ + +F
Sbjct: 274 GSSYTYFNAQAYQVLTSLLNRELAG---KPLREAMDDDTLPLCWRGRKPIKSLRDVRKYF 330
Query: 293 ------------DRADFKVEPTYMYFIFQNEGYFCVAISFS-----DRNSVVGAWQQQDT 335
+A F++ PT Y I + G C+ I + ++++G QD
Sbjct: 331 KPLALSFSSGGRSKAVFEI-PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389
Query: 336 RFVYDLNTGTIQFVPENC 353
VY+ I + NC
Sbjct: 390 MVVYNNEKQAIGWATANC 407
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/369 (23%), Positives = 141/369 (38%), Gaps = 36/369 (9%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQC--LPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
+ + GTPS+S+ L+ DTGS L W QC F+P+ SS++ +PC +
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 141
Query: 67 C--RRP----PFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
C R P P C+ N C + YA G A G + E FTF P +I GC+
Sbjct: 142 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQT---TPPLILGCA 198
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
++ D GILG ++ S + Q K + FSYC+ S F
Sbjct: 199 KESTD------EKGILGMNLGRLSFISQAKISK---FSYCIPTRSNRPGLASTGSFYLGD 249
Query: 180 NIQRKDMKTIRMFVDRSSH---------YYLSLQDISVADHRIGFAPGTFALRRNGTGGC 230
N + K + + S Y + LQ I + R+ F G+G
Sbjct: 250 NPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQT 309
Query: 231 MIDTGAIATFIQRGPYEVVMRHFDEHFTS-FGRQRMHNASEDWEYCYRYDSRF-RAYASM 288
M+D+G+ T + Y+ V S + ++ ++ D + + R +
Sbjct: 310 MVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDL 369
Query: 289 TFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDTRFVYDLNTG 344
F F R + + G CV I S ++++G QQ+ +D+
Sbjct: 370 VFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNR 429
Query: 345 TIQFVPENC 353
+ F C
Sbjct: 430 RVGFSKAEC 438
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 144/376 (38%), Gaps = 36/376 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP--IFNPNASSTYKRIPCDD 64
Y V GTP++ L+ DTGS L W +C + AP +F AS ++ I C
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGD-APRRVFRAAASRSWAPIACSS 170
Query: 65 LICRR-PPFRCEN-----GQCVHRINYAGGASASGLVSTETFTFHLKN--------KLVC 110
C PF N C + Y G++A G+V T++ T L +
Sbjct: 171 DTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRAK 230
Query: 111 VPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEAT 170
+ GV+ GC+ SF + G+L S S + + G FSYCLV AT
Sbjct: 231 LQGVVLGCTASYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNAT 289
Query: 171 SILRFGKDA--------NIQRKDMKTIRMFVDR--SSHYYLSLQDISVADHRIGFAPGTF 220
S L FG + + +DR S Y +++ + VA + +
Sbjct: 290 SYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADVW 349
Query: 221 ALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS 280
+ R GG ++D+G T + Y V+ E R M + +EYCY + +
Sbjct: 350 DVARG--GGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSM----DPFEYCYNWTA 403
Query: 281 RFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFV 338
+ F + P Y + G C+ + SV+G QQD +
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQEGAWPGVSVIGNILQQDHLWE 463
Query: 339 YDLNTGTIQFVPENCA 354
+DL ++F CA
Sbjct: 464 FDLRDRWLRFKHTRCA 479
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/364 (23%), Positives = 146/364 (40%), Gaps = 56/364 (15%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
G+ ++ ++ DT S + W QC PC +C Q+ +++P+ SS+ PC CR
Sbjct: 150 GSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLG 209
Query: 71 PFRCENG------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND-NR 123
P+ NG QC +R+ Y G++++G ++ T + + FGCS+ +
Sbjct: 210 PY--ANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQ 267
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL----------VYAYREMEATSIL 173
SF +GI+ SL Q K+T +FSYCL + + A+
Sbjct: 268 PGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAAS--- 324
Query: 174 RFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMID 233
R+ ++ K + Y + L I VA R+ P FA G ++D
Sbjct: 325 RYAVTPMLRSKAAPML---------YLVRLIAIEVAGKRLPVPPAVFA------AGAVMD 369
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA------YAS 287
+ I T + Y + F ++ R E + CY +
Sbjct: 370 SRTIVTRLPPTAYMALRAAFVAEMRAY---RAAAPKEHLDTCYDFSGAAPGGGGGVKLPK 426
Query: 288 MTFHFDRADFKVE--PTYMYFIFQNEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTG 344
+T FD + VE P+ + +G A + D+ ++G QQQ +Y+++
Sbjct: 427 ITLVFDGPNGAVELDPSGVLL----DGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGA 482
Query: 345 TIQF 348
T+ F
Sbjct: 483 TVGF 486
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 148/369 (40%), Gaps = 39/369 (10%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C P F P SSTY+ + C+
Sbjct: 91 NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCN 150
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QCV+ YA +S+ G++ + +F +++L V FGC
Sbjct: 151 MDCNCDD-----DKEQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAV-FGCETVE 204
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ GI+G SL+ QL S+ L Y ++ S++ G D
Sbjct: 205 TGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFD---Y 261
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI- 241
DM DRS +Y + L I VA ++ F +G G ++D+G ++
Sbjct: 262 PSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVF----DGEHGAVLDSGTTYAYLP 317
Query: 242 ---QRGPYEVVMRHF---------DEHF--TSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
E VMR D +F T F ++ SE S+
Sbjct: 318 DAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSE--------LSKIFPSVE 369
Query: 288 MTFHFDRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGT 345
M F ++ YM+ + G +C+ + + D +++G ++T VYD
Sbjct: 370 MIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSK 429
Query: 346 IQFVPENCA 354
+ F NC+
Sbjct: 430 VGFWRTNCS 438
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 157/378 (41%), Gaps = 44/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTPSK ++ DTGS ++W C+ C C S+ ++ S+T K +
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145
Query: 61 PCDDLICRR----PPFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKN---KLVCVP 112
CD+ C P C N C + Y G+S +G + ++ + +
Sbjct: 146 SCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 113 GVI-FGC-SNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYRE 166
G I FGC + + D G A GILGF S S++ QL ST + +F++CL
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL----DG 261
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTF-ALRRN 225
I G +Q K + V HY +++ + V + + F A R
Sbjct: 262 TNGGGIFAMGH--VVQPK--VNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRK 317
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
GT +ID+G ++ YE ++ + Q +H + ++Y R D F
Sbjct: 318 GT---IIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKCFQYSERVDDGF--- 371
Query: 286 ASMTFHFDRA-DFKVEPTYMYFIFQNEGYFCVAISFS-----DRNSVV--GAWQQQDTRF 337
+ FHF+ + KV P ++FQ E +C+ S DR +V G +
Sbjct: 372 PPVIFHFENSLLLKVYP--HEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLV 429
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL TI + NC++
Sbjct: 430 LYDLENQTIGWTEYNCSS 447
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 146/356 (41%), Gaps = 40/356 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCD 63
F+ V+V FG P ++ L+ DTGS W +C C NC N+ P FNP+ SS+Y C
Sbjct: 128 FFLVNVGFGKPQQNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSC- 186
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI--FGCSND 121
+ + + +NY + + G+ + T P V F
Sbjct: 187 ----------IPSTKTNYTMNYEDNSYSKGVFVCDEVTLK--------PDVFPKFQFGCG 228
Query: 122 NRDFSFDGNIAGILGFSVSP-FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
+ G+ +G+LG + +SL+ Q S + FSYC + + E S+L FG+ A
Sbjct: 229 DSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFSYC--FPHNENTRGSLL-FGEKAI 285
Query: 181 IQRKDMKTIRMFVDRS-SHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
+K R+ S S Y++ L ISVA R+ + FA + G +ID+G + T
Sbjct: 286 SASPSLKFTRLLNPSSGSVYFVELIGISVAKKRLNVSSSLFA-----SPGTIIDSGTVIT 340
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD---SRFRAYASMTFHF-DRA 295
+ YE + F + + + CY R + HF
Sbjct: 341 HLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEV 400
Query: 296 DFKVEPTYMYFIFQNEGYFCVAI---SFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
D + P+ + + + C+A S +++G QQ + VYD+ G + F
Sbjct: 401 DVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGF 456
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 155/381 (40%), Gaps = 51/381 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y ++ GTP K + DTGS ++W C+ C C +S +++P SS+ +
Sbjct: 82 LYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGSTV 141
Query: 61 PCDDLICR-----RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHL----KNKLVCV 111
CD C + P +N C + + Y G+S +G +++ ++
Sbjct: 142 SCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQTRHAN 201
Query: 112 PGVIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREM 167
VIFGC + D S + + GI+GF S S+L QL + + +FS+CL +
Sbjct: 202 ASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL----DTI 257
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
+ I G ++ + +K+ + D HY ++L+ I+V + F
Sbjct: 258 KGGGIFAIG---DVVQPKVKSTPLVPDM-PHYNVNLESINVGGTTLQLPSHMFETGEK-- 311
Query: 228 GGCMIDTGAIATFIQRGPYEVVM-----RHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF 282
G +ID+G T++ Y+ V+ +H D F S +D+ + S
Sbjct: 312 KGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSV---------QDFLCIQYFQSVD 362
Query: 283 RAYASMTFHF-DRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQD 334
+ +TFHF D V P + YF + +C S ++G +
Sbjct: 363 DGFPKITFHFEDDLGLNVYP-HDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSN 421
Query: 335 TRFVYDLNTGTIQFVPENCAN 355
VYDL + + NC++
Sbjct: 422 KVVVYDLENQVVGWTDYNCSS 442
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 99/222 (44%), Gaps = 12/222 (5%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPC---VNCFNQSAPIFNPNASSTYKRIPCD 63
Y V GTP ++ + DTGS L W QC PC +C++Q P+F+P SS+Y +PC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 64 DLICRR----PPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C C QC + ++Y G++ +G+ S++T T + V G FGC
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFGCG 256
Query: 120 NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+ F+G + G+LG SL+ Q T G+FSYCL ++ G
Sbjct: 257 HAQSGL-FNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSG 314
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
+ + ++Y + L ISV ++ FA
Sbjct: 315 AAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA 356
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/378 (23%), Positives = 157/378 (41%), Gaps = 44/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP K+ +L DTGS ++W C+ C C +S+ +++ SS+ K +
Sbjct: 82 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLV 141
Query: 61 PCDDLICRRPPFRCENG-----QCVHRINYAGGASASG-----LVSTETFTFHLKNKLVC 110
PCD C+ G C + Y G+S +G +V + + LK
Sbjct: 142 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTD-SA 200
Query: 111 VPGVIFGC-SNDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYR 165
++FGC + + D S A GILGF + S++ QL S+ + +F++CL
Sbjct: 201 NGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL----N 256
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
+ I G ++ + + + D+ HY +++ + V + + T A +
Sbjct: 257 GVNGGGIFAIG---HVVQPKVNMTPLLPDQ-PHYSVNMTAVQVGHTFLSLSTDTSA--QG 310
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
G +ID+G ++ G YE ++ Q +H+ ++Y D F A
Sbjct: 311 DRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYTCFQYSESVDDGFPA- 369
Query: 286 ASMTFHFDRA-DFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRF 337
+TF F+ KV P ++F + ++C+ S S ++G +
Sbjct: 370 --VTFFFENGLSLKVYP--HDYLFPSVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLV 425
Query: 338 VYDLNTGTIQFVPENCAN 355
YDL I + NC++
Sbjct: 426 FYDLENQAIGWAEYNCSS 443
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 156/396 (39%), Gaps = 67/396 (16%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCF-----NQSAPIFNPNASSTYK 58
Y+ + FGTP ++ L+FDTGS L+W C C C P F P SS+ K
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 59 RIPCDDLICR---RPPFRCENGQC------------VHRINYAGGASASGLVSTETFTFH 103
+ C + C P + + C + + Y G++A GL+ +ET F
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA-GLLLSETLDFP 199
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLV 161
K +P + GCS F +GI GF SL Q+ GL F+YCL
Sbjct: 200 DKK----IPNFVVGCS-----FLSIHQPSGIAGFGRGSESLPSQM-----GLKKFAYCL- 244
Query: 162 YAYRE-----------MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVAD 210
A R+ +++T + G R++ +YYL+++ I V +
Sbjct: 245 -ASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKE--YYYLNIRKIIVGN 301
Query: 211 HRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE 270
+ +G GG +ID+G+ TF+ + EVV R F++ ++ R
Sbjct: 302 QAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLT 361
Query: 271 DWEYCYRYDS-RFRAYASMTFHFDRADFKVEPTYMYF-IFQNEGYFCVAI---------- 318
C+ + + + F F P YF + + G C+ +
Sbjct: 362 GLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGG 421
Query: 319 SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ ++GA+QQQ+ YDL + F + C+
Sbjct: 422 GGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/406 (24%), Positives = 155/406 (38%), Gaps = 71/406 (17%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCFNQSA---PIFNPNASSTYKRI 60
Y GTP + +L DTGS+L W C C NC + SA P+F+P SS+ + +
Sbjct: 99 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158
Query: 61 PCDD-------------LICRRPPFRCENGQCVHRIN-----YA---GGASASGLVSTET 99
C + CRR P C + YA G S +GL+ +T
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 218
Query: 100 FTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FS 157
L+ VPG + GCS S +G+ GF S+ QL GL FS
Sbjct: 219 ----LRAPGRAVPGFVLGCS----LVSVHQPPSGLAGFGRGAPSVPAQL-----GLPKFS 265
Query: 158 YCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSS-------HYYLSLQDISVAD 210
YCL+ + A + M+ + + + +YYL+L+ ++V
Sbjct: 266 YCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGG 325
Query: 211 HRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE 270
+ FA G+GG ++D+G T++ P GR + +E
Sbjct: 326 KAVRLPARAFAGNAAGSGGTIVDSGTTFTYLD--PTVFQPVADAVVAAVGGRYKRSKDAE 383
Query: 271 D---WEYCYRYD--SRFRAYASMTFHFDRADFKVEPTYMYFIFQNEG---YFCVAI---- 318
D C+ +R A ++FHF+ P YF+ G C+A+
Sbjct: 384 DGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDF 443
Query: 319 --------SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
S ++G++QQQ+ YDL + F ++C +
Sbjct: 444 GGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSS 489
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 149/377 (39%), Gaps = 44/377 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA---PI--FNPNASSTYKRI 60
Y VL G+P K ++ DTGS ++W C C C S P+ F+P +SST I
Sbjct: 67 LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 126
Query: 61 PCDDLICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHL---KNKLVCV 111
C D C + G QC++ Y G+ SG ++ F +
Sbjct: 127 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSS 186
Query: 112 PGVIFGCS-NDNRDFS-FDGNIAGILGFSVSPFSLLGQLKSTAQGL----FSYCLVYAYR 165
++FGCS + D + D + GI GF S++ Q+ S QG+ FS+CL
Sbjct: 187 ASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSS--QGITPKVFSHCLKGDGG 244
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
I +D+ V HY L+LQ ISV + P FA N
Sbjct: 245 GGGIL------VLGEIVEEDI-VYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTN 297
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA- 284
G ++D+G ++ Y+ + E + R + ++ CY S +
Sbjct: 298 --RGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ----CYLITSSVKGI 351
Query: 285 YASMTFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRN--SVVGAWQQQDTRF 337
+ +++ +F ++P Y + QN +C+ +++G +D F
Sbjct: 352 FPTVSLNFAGGVSMNLKPED-YLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIF 410
Query: 338 VYDLNTGTIQFVPENCA 354
VYDL I + +C+
Sbjct: 411 VYDLAGQRIGWANYDCS 427
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 153/379 (40%), Gaps = 42/379 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + GTP + ++ DTGS ++W C PC C + F+P SST +
Sbjct: 40 LYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPL 99
Query: 61 PCDDLICRRPPFRCE-----NGQCVHRINYAGGASASGLVSTETFTFH-LKNKLV---CV 111
C D C E + C + Y G+ G ++ F ++ N+ V
Sbjct: 100 SCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNAS 159
Query: 112 PGVIFGCS-NDNRDFSF-DGNIAGILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYREM 167
+ FGCS N + D + D + GI GF + S++ QL S A +FS+CL A
Sbjct: 160 AKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGA---D 216
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
IL G+ I M V HY L+LQ I+V ++ P FA T
Sbjct: 217 PGGGILVLGE---ITEPGM-VYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTN--T 270
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYAS 287
G +ID G ++ YE + + + M + + + D F S
Sbjct: 271 RGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSIDEIF---PS 327
Query: 288 MTFHFDRADFKVEPTYMYFIFQ----NEGYFCVA-------ISFSDRNSVVGAWQQQDTR 336
+T +F+ A ++P Y I Q + +C+ + S + +++G +D
Sbjct: 328 VTLYFEGAPMDLKPK-DYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKV 386
Query: 337 FVYDLNTGTIQFVPENCAN 355
FVYDL I + +C++
Sbjct: 387 FVYDLENQRIGWTSFDCSS 405
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 154/361 (42%), Gaps = 39/361 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV-NCFNQSAPIFNPNASSTYKRIPC-DD 64
Y + G+P K L+ DTGS L W +C PC +C + F+ AS+TYK + C DD
Sbjct: 124 YYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDC----SSTFDRLASNTYKALTCADD 179
Query: 65 LICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRD 124
L R P + R+ ++G + L T ++L PG +FGC + +
Sbjct: 180 L--RLPVLL----RLWRRLFHSGRS----LRDTLKMAGAASDELEEFPGFVFGCGSLLKG 229
Query: 125 FSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREME-ATSILRFGKDANIQR 183
G + GIL S S Q+ FSYCL+ + S + FG +A ++
Sbjct: 230 L-ISGEV-GILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFG-EAAVEL 286
Query: 184 KDMKTIR------MFVDRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMI-DTG 235
K+ + + + SS YY + L ISV + R+ +P TF NG I D+G
Sbjct: 287 KEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFL---NGQDKPTIFDSG 343
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRAYASMTFHFD- 293
T + G V + S A + + C+R S + +TFHF+
Sbjct: 344 TTLTMLPSG----VCDSIKQSLASMVSGAEFVAIKGLDACFRVPPSSGQGLPDITFHFNG 399
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
ADF P+ ++ C+ ++ S+ G QQQD ++D++ I F +C
Sbjct: 400 GADFVTRPS--NYVIDLGSLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 457
Query: 354 A 354
Sbjct: 458 G 458
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 67/129 (51%), Gaps = 16/129 (12%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
YF + V GTP ++ DTGS ++W QC PC C++QS +F+P AS +Y + C
Sbjct: 146 EYFTKIGV--GTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCA 203
Query: 64 DLICRRPPFRCENG-------QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+CR R ++G C++++ Y G+ +G +TET TF + VP V
Sbjct: 204 APLCR----RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGAR---VPRVAL 256
Query: 117 GCSNDNRDF 125
GC +DN
Sbjct: 257 GCGHDNEGL 265
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 149/377 (39%), Gaps = 44/377 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA---PI--FNPNASSTYKRI 60
Y VL G+P K ++ DTGS ++W C C C S P+ F+P +SST I
Sbjct: 82 LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 141
Query: 61 PCDDLICRRPPFRCENG------QCVHRINYAGGASASGLVSTETFTFHL---KNKLVCV 111
C D C + G QC++ Y G+ SG ++ F +
Sbjct: 142 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSS 201
Query: 112 PGVIFGCS-NDNRDFS-FDGNIAGILGFSVSPFSLLGQLKSTAQGL----FSYCLVYAYR 165
++FGCS + D + D + GI GF S++ Q+ S QG+ FS+CL
Sbjct: 202 ASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSS--QGITPKVFSHCLKGDGG 259
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
I +D+ V HY L+LQ ISV + P FA N
Sbjct: 260 GGGIL------VLGEIVEEDI-VYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTN 312
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA- 284
G ++D+G ++ Y+ + E + R + ++ CY S +
Sbjct: 313 --RGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ----CYLITSSVKGI 366
Query: 285 YASMTFHF-DRADFKVEPTYMYFIFQNE----GYFCVAISFSDRN--SVVGAWQQQDTRF 337
+ +++ +F ++P Y + QN +C+ +++G +D F
Sbjct: 367 FPTVSLNFAGGVSMNLKPED-YLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIF 425
Query: 338 VYDLNTGTIQFVPENCA 354
VYDL I + +C+
Sbjct: 426 VYDLAGQRIGWANYDCS 442
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 153/374 (40%), Gaps = 52/374 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC---FNQSAPI--FNPNASSTYKRI 60
Y V GTP ++ L DTGS L+W C PC+ C + PI ++ AS++ ++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 61 PCDDLICRRPPFRCENG-----QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
PC D C E+G QC + Y G+ G + + + + VI
Sbjct: 95 PCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT----VI 150
Query: 116 FGCS-NDNRDFSF-DGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREMEATS 171
FGC + D S + + GI+GF S S QL + +F++CL R
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGER---GGG 207
Query: 172 ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
IL G N+ D++ + V HY + LQ ISV + + P F+ + G +
Sbjct: 208 ILVLG---NVIEPDIQYTPL-VPYMYHYNVVLQSISVNNANLTIDPKLFS--NDVMQGTI 261
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF--RAYASMT 289
D+G ++ DE + +F Q + + C SRF + + ++
Sbjct: 262 FDSGTTLAYLP-----------DEAYQAF-TQAVSLVVAPFLLCDTRLSRFIYKLFPNVV 309
Query: 290 FHFDRADFKVEPTYMYFIFQ----NEGYFCV------AISFSDRNSVVGAWQQQDTRFVY 339
+F+ A + P Y I Q N +C+ + + ++ G ++ VY
Sbjct: 310 LYFEGASMTLTPAE-YLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVY 368
Query: 340 DLNTGTIQFVPENC 353
DL G I + P +C
Sbjct: 369 DLERGRIGWRPFDC 382
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/375 (23%), Positives = 151/375 (40%), Gaps = 39/375 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V G+P+K ++ DTGS ++W C+ C NC + S F+ SST +
Sbjct: 82 LYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 61 PCDDLIC----RRPPFRC--ENGQCVHRINYAGGASASGLVSTETF---TFHLKNKLVC- 110
C D IC + C + QC + Y G+ +G ++T T L +V
Sbjct: 142 SCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVAN 201
Query: 111 -VPGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYR 165
+IFGCS + D + D + GI GF S++ QL S +FS+CL
Sbjct: 202 SSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL---KG 258
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
+L G+ I + V HY L+LQ I+V + FA N
Sbjct: 259 GENGGGVLVLGE---ILEPSI-VYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNN 314
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRY-DSRFRA 284
G ++D+G ++ + Y ++ + F + + ++ CY +S
Sbjct: 315 --QGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ----CYLVSNSVGDI 368
Query: 285 YASMTFHF-DRADFKVEPTYM---YFIFQNEGYFCVAISFSDRN-SVVGAWQQQDTRFVY 339
+ ++ +F A + P + Y +C+ ++ +++G +D FVY
Sbjct: 369 FPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVY 428
Query: 340 DLNTGTIQFVPENCA 354
DL I + +C+
Sbjct: 429 DLANQRIGWADYDCS 443
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 147/364 (40%), Gaps = 29/364 (7%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C P F P SSTY+ + C+
Sbjct: 90 NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCN 149
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QCV+ YA +S+ G++ + +F +++L V FGC
Sbjct: 150 MDCNCDD-----DREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAV-FGCETVE 203
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ GI+G SL+ QL S+ L Y ++ S++ G D
Sbjct: 204 TGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDY--- 260
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
DM DRS +Y + L I VA ++ F +G G ++D+G ++
Sbjct: 261 PSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVF----DGEHGAVLDSGTTYAYLP 316
Query: 243 RGPY---------EV-VMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
+ EV ++ D +F A+ ++ S+ M F
Sbjct: 317 DAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNY---VSELSKIFPSVEMVFKS 373
Query: 293 DRADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
++ YM+ + G +C+ + + D +++G ++T VYD + F
Sbjct: 374 GQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWR 433
Query: 351 ENCA 354
NC+
Sbjct: 434 TNCS 437
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 94/203 (46%), Gaps = 12/203 (5%)
Query: 156 FSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRS--SHYYLSLQDISVADHRI 213
FSYCL + S+L G A KD + + + S S YYLSL+ I V ++
Sbjct: 6 FSYCL--TSMDDSKASVLLLGSLAK-ATKDAISTPLLTNPSQPSFYYLSLEGIPVGGTQL 62
Query: 214 GFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWE 273
F + +G+GG +ID+G T++++ ++ + + F + Q ++S +
Sbjct: 63 SIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEF---ISQSNLQLDKSSSTGLD 119
Query: 274 YCYRYDSRFR--AYASMTFHFDRADFKVEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAW 330
C+ S + FHF D ++ P Y I ++ G C+A+ S+ S+ G
Sbjct: 120 VCFSLPSETTQVEVPKLVFHFKGGDLEL-PAESYMIADSKLGVACLAMGASNGMSIFGNV 178
Query: 331 QQQDTRFVYDLNTGTIQFVPENC 353
QQQ+ +DL TI FVP C
Sbjct: 179 QQQNILVNHDLEKETISFVPTQC 201
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/392 (23%), Positives = 152/392 (38%), Gaps = 58/392 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCF-----NQSAPIFNPNASSTYK 58
Y++D+ GTP ++ + DTGS L+W C C +C P F P SST K
Sbjct: 92 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAK 151
Query: 59 RIPCDDLIC----------RRPPFRCENGQC-----VHRINYAGGASASGLVSTETFTFH 103
+ C + C R P + E+ C + I Y G++A G + + F
Sbjct: 152 LLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTA-GFLLLDNLNFP 210
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-Y 162
K VP + GCS + +GI GF SL Q+ FSYCLV +
Sbjct: 211 GKT----VPQFLVGCSILSIR-----QPSGIAGFGRGQESLPSQMNLKR---FSYCLVSH 258
Query: 163 AYREMEATS--ILRFGKDANIQRKDMKTIRMFVDRSS-------HYYLSLQDISVADHRI 213
+ + +S +L+ + + + + S+ +YYL+L+ + V +
Sbjct: 259 RFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDV 318
Query: 214 GFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHF-TSFGRQRMHNASEDW 272
+G GG ++D+G+ TF++R Y +V + F + ++ R
Sbjct: 319 KIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGL 378
Query: 273 EYCYRYDS-RFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS------ 325
C+ + + +TF F +P YF + SD +
Sbjct: 379 SPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTT 438
Query: 326 ----VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++G +QQQ+ YDL F P +C
Sbjct: 439 GPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 77/168 (45%), Gaps = 16/168 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P + FL DTGS L W QC PCV+C P++ P + K +PC D
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKN---KLVPCVD 113
Query: 65 LICR--------RPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C R QC + I YA S+ G++ T++F L N + PG+ F
Sbjct: 114 QMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAF 173
Query: 117 GCSNDNRDFSFDGNIA--GILGFSVSPFSLLGQLK--STAQGLFSYCL 160
GC D + S A G+LG SLL QLK + + +CL
Sbjct: 174 GCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 156/396 (39%), Gaps = 67/396 (16%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCF-----NQSAPIFNPNASSTYK 58
Y+ + FGTP ++ L+FDTGS L+W C C C P F P SS+ K
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 59 RIPCDDLICR---RPPFRCENGQC------------VHRINYAGGASASGLVSTETFTFH 103
+ C + C P + + C + + Y G++A GL+ +ET F
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA-GLLLSETLDFP 199
Query: 104 LKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLV 161
K +P + GCS F +GI GF SL Q+ GL F+YCL
Sbjct: 200 DKX----IPNFVVGCS-----FLSIHQPSGIAGFGRGSESLPSQM-----GLKKFAYCL- 244
Query: 162 YAYRE-----------MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVAD 210
A R+ +++T + G R++ +YYL+++ I V +
Sbjct: 245 -ASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKE--YYYLNIRKIIVGN 301
Query: 211 HRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE 270
+ +G GG +ID+G+ TF+ + EVV R F++ ++ R
Sbjct: 302 QAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLT 361
Query: 271 DWEYCYRYDS-RFRAYASMTFHFDRADFKVEPTYMYF-IFQNEGYFCVAI---------- 318
C+ + + + F F P YF + + G C+ +
Sbjct: 362 GLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGG 421
Query: 319 SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ ++GA+QQQ+ YDL + F + C+
Sbjct: 422 GGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 147/383 (38%), Gaps = 57/383 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G P ++ ++ DTGS L W C N +FNP +SSTY +PC
Sbjct: 62 NVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCS 117
Query: 64 DLICRRP------PFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
ICR P C+ C I+YA S G ++ ETF V PG +
Sbjct: 118 SPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTL 173
Query: 116 FGC--SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
FGC S + + D G++G + S + QL + FSYC+ + +++ L
Sbjct: 174 FGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSK---FSYCISGS----DSSGFL 226
Query: 174 RFGKDAN------IQRKDM---KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
G DA+ IQ + T + DR + Y + L+ I V + F
Sbjct: 227 LLG-DASYSWLGPIQYTPLVLQSTPLPYFDRVA-YTVQLEGIRVGSKILSLPKSVFVPDH 284
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-----DWEYCYRYD 279
G G M+D+G TF+ Y + F S R+ + + + CY+
Sbjct: 285 TGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSV--LRLVDDPDFVFQGTMDLCYKVG 342
Query: 280 S----RFRAYASMTFHFDRADFKVEPTYMYFIFQNEG------YFCVAISFSD----RNS 325
S F ++ F A+ V + + G +C SD
Sbjct: 343 STTRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAF 402
Query: 326 VVGAWQQQDTRFVYDLNTGTIQF 348
V+G QQ+ +DL + F
Sbjct: 403 VIGHHHQQNVWMEFDLAKSRVGF 425
>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
Length = 330
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 144/349 (41%), Gaps = 46/349 (13%)
Query: 22 LLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRPPFRCENGQCVH 81
L+FDT S L+WTQC PC++C Q+ +++PN + TY + + +
Sbjct: 5 LVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSSN----------------Y 48
Query: 82 RINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSP 141
Y+ + SG +TE TF L N V V + FGC N+ + +D N+AG+ G
Sbjct: 49 NYTYSKQSFTSGYFATE--TFALGN--VTVANITFGCGTRNQGY-YD-NVAGVFGVGRGG 102
Query: 142 FSLLGQLKSTAQGL--FSYCLVYAYREMEATSILRFGKD--ANIQRKDMKTIRMFVDR-- 195
SLL QL G+ FSYC + + L + N + M D
Sbjct: 103 VSLLNQL-----GIDRFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVL 157
Query: 196 SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDE 255
S Y++ L ++V R+ A + A G +ID+ + T + Y V R
Sbjct: 158 KSGYFVKLVGVTVGATRVDVAGASSA--EGGGRALVIDSTSPVTVLDEATYGPVRRALVA 215
Query: 256 HFTSFGRQRMHNASE--DWEYCYRY----DSRFRAYASMTFHFD--RADFKVEPTYMYFI 307
++ NAS + C+ + +MT HFD AD + P
Sbjct: 216 QLAPL-KEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAK 274
Query: 308 FQNEGYFCVAISFSDRNS--VVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
G C+ ++ S N V+G+ DT +YDL + F P +CA
Sbjct: 275 DSAGGLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDCA 323
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/377 (22%), Positives = 156/377 (41%), Gaps = 44/377 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP+KS ++ DTGS ++W C+ C C +S +++P+ SS+ +
Sbjct: 80 LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGV 139
Query: 61 PCDDLICRRP-----PFRCENGQCVHRINYAGGASASGLVSTETFTFHL----KNKLVCV 111
C C P C + I+Y G+S +G T+ ++ +
Sbjct: 140 TCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLAN 199
Query: 112 PGVIFGC-SNDNRDF-SFDGNIAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYREM 167
+ FGC + D S + GILGF S S+L QL + + +F++CL +
Sbjct: 200 TSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCL----DTI 255
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G ++ + + T + V HY ++L+ I V ++ F + +
Sbjct: 256 NGGGIFAIG---DVVQPKVSTTPL-VPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGE--S 309
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAYA 286
G +ID+G ++ Y +M F +G + N +D++ C+RY +
Sbjct: 310 KGTIIDSGTTLAYLPGVVYNAIMSKV---FAQYGDMPLKN-DQDFQ-CFRYSGSVDDGFP 364
Query: 287 SMTFHFDRA-DFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRFV 338
+TFHF+ + P ++FQN +C+ + ++G + +
Sbjct: 365 IITFHFEGGLPLNIHP--HDYLFQNGELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVL 422
Query: 339 YDLNTGTIQFVPENCAN 355
YDL I + NC++
Sbjct: 423 YDLENQVIGWTDYNCSS 439
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 81/166 (48%), Gaps = 11/166 (6%)
Query: 11 VLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRRP 70
V G+P K +++ DTGS + W QC PC +C+ Q+ PIF P+ SS+Y + C+ C+
Sbjct: 57 VGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSL 116
Query: 71 PF-RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDG 129
C N C++ ++Y G+ G +TET T L V GC +DN
Sbjct: 117 DVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASL---NNVAIGCGHDNEGLFVGA 173
Query: 130 NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
LG F Q+ +++ FSYCLV R+ ++ S L F
Sbjct: 174 AGLLGLGGGSLSFP--SQINASS---FSYCLV--NRDTDSASTLEF 212
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 147/363 (40%), Gaps = 35/363 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
+YT V GTP L+ DTGS + + C C +C N P F+P SS+YK + C
Sbjct: 34 YYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS- 92
Query: 66 ICRRPPFRCENGQC----VHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
C G C ++ YA +++SG++ + F + L ++FGC
Sbjct: 93 -------ECSTGFCDGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQR-LVFGCETA 144
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQL--KSTAQGLFSYCLVYAYREMEATSILRFGKDA 179
+D GI+G P S++ QL K+ + +FS C Y + +++ G
Sbjct: 145 ETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLC--YGGMDEGGGAMILGGFQP 202
Query: 180 NIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIAT 239
KDM RS +Y L L+ I V + P F +G G ++D+G
Sbjct: 203 ---PKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVF----DGKYGTVLDSGTTYA 255
Query: 240 FIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDW-EYCY-----RYDSRFRAYASMTFHF- 292
+ ++ E S + + E + + CY + + + S+ F F
Sbjct: 256 YFPGAAFQAFKSAVKEQVGSL--KEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFG 313
Query: 293 DRADFKVEP-TYMYFIFQNEGYFCVAI-SFSDRNSVVGAWQQQDTRFVYDLNTGTIQFVP 350
D + P Y++ + G +C+ + D +++G ++ Y+ +I F+
Sbjct: 314 DGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLK 373
Query: 351 ENC 353
C
Sbjct: 374 TKC 376
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 149/371 (40%), Gaps = 44/371 (11%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ DTGS + + C C C P F P+ SSTY+ + C+
Sbjct: 74 NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCN 133
Query: 64 DLICRRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSND 121
P C E QC + YA +S+SG+++ + +F +++L V FGC N
Sbjct: 134 ------PSCNCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAV-FGCENV 186
Query: 122 NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANI 181
+ GI+G S++ QL S+ L Y ++ +++ G+ +
Sbjct: 187 ETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMV-LGQIS-- 243
Query: 182 QRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+M RS +Y + L+++ VA + P F + G ++D+G +
Sbjct: 244 PPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH----GTVLDSGTTYAYF 299
Query: 242 QRGPYEVV-------MRHF-----------DEHFTSFGRQRMHNASEDWEYCYRYDSRFR 283
+ + +RH D F+ GR+ H + E + S +
Sbjct: 300 PEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQK 359
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNT 343
S + R KV Y IFQN +D +++G ++T YD
Sbjct: 360 LSLSPENYLFRHT-KVSGAYCLGIFQNG---------NDLTTLLGGIVVRNTLVTYDREN 409
Query: 344 GTIQFVPENCA 354
I F NC+
Sbjct: 410 DKIGFWKTNCS 420
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 158/375 (42%), Gaps = 45/375 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI-----FNPNASSTYKRI 60
Y D+ GTP+ ++ DTGS W + C C ++S + ++P +S + K +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 61 PCDDLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP---GVI 115
CDD IC RPP +C + YA G G++ T+ +H L P V
Sbjct: 142 KCDDTICTSRPPCNMTL-RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 200
Query: 116 FGC------SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
FGC S +N + D GI+GF S + L QL + + +FS+CL
Sbjct: 201 FGCGLQQSGSLNNSAVAID----GIIGFGNSNQTALSQLAAAGKTKKIFSHCL----DST 252
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G+ + +KT + + ++ ++L+ I+VA + F + T
Sbjct: 253 NGGGIFAIGE---VVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK--T 307
Query: 228 GGCMIDTGAIATFIQRGPY-EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
G ID+G+ ++ Y E+++ F +H M+N + + + D +F
Sbjct: 308 KGTFIDSGSTLVYLPEIIYSELILAVFAKH-PDITMGAMYNF-QCFHFLGSVDDKF---P 362
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEG-YFC-----VAISFSDRNSVVGAWQQQDTRFVYD 340
+TFHF+ D ++ ++ + EG +C I ++G + VYD
Sbjct: 363 KITFHFEN-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYD 421
Query: 341 LNTGTIQFVPENCAN 355
+ I + NC++
Sbjct: 422 MEKQAIGWTEHNCSS 436
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 142/379 (37%), Gaps = 57/379 (15%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKR---IPCDDL 65
V + GTP + + ++ DTGS L W QC + P + S +PC+
Sbjct: 84 VTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHP 143
Query: 66 IC--RRP----PFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+C R P P C+ N C + YA G A G + E F P +I GC
Sbjct: 144 LCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQ---TTPPIILGC 200
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATS------- 171
+ + D GILG ++ Q K T FSYC+ + + S
Sbjct: 201 ATQSDD------ARGILGMNLGRLGFPSQAKITK---FSYCVPTKQAQPASGSFYLGNNP 251
Query: 172 ---------ILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFAL 222
+L FG+ + D Y L LQ IS+ ++ P F
Sbjct: 252 ASSSFRYVNLLTFGQSQRMPNLD----------PLAYTLPLQGISIGGKKLNIPPSVFKP 301
Query: 223 RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHF-TSFGRQRMHNASEDWEYCYRYDS- 280
G+G MID+G+ T++ Y V+ + + M+ D C+ D+
Sbjct: 302 NAGGSGQTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVAD--ICFDGDAI 359
Query: 281 -RFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDT 335
R M F F++ V P + G C+ + S+R +++G + QQ+
Sbjct: 360 EIGRLVGDMVFEFEKGVQIVIPKERVLATVDGGVHCLGMGRSERLGAGGNIIGNFHQQNL 419
Query: 336 RFVYDLNTGTIQFVPENCA 354
+DL + F +C+
Sbjct: 420 WVEFDLANRRVGFGEADCS 438
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 144/383 (37%), Gaps = 57/383 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G P ++ ++ DTGS L W C N +FNP +SSTY +PC
Sbjct: 62 NVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCS 117
Query: 64 DLICRRP------PFRCENGQ--CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
ICR P C+ C I+YA S G ++ ETF V PG +
Sbjct: 118 SPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTL 173
Query: 116 FGC--SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSIL 173
FGC S + + D G++G + S + QL + FSYC+ +S+
Sbjct: 174 FGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSK---FSYCI-----SGSDSSVF 225
Query: 174 RFGKDAN------IQRKDM---KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
DA+ IQ + T + DR + Y + L+ I V + F
Sbjct: 226 LLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVA-YTVQLEGIRVGSKILSLPKSVFVPDH 284
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-----DWEYCYRYD 279
G G M+D+G TF+ Y + F S R+ + + + CY+
Sbjct: 285 TGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSV--LRLVDDPDFVFQGTMDLCYKVG 342
Query: 280 S----RFRAYASMTFHFDRADFKVEPTYMYFIFQNEG------YFCVAISFSD----RNS 325
S F ++ F A+ V + + G +C SD
Sbjct: 343 STTRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAF 402
Query: 326 VVGAWQQQDTRFVYDLNTGTIQF 348
V+G QQ+ +DL + F
Sbjct: 403 VIGHHHQQNVWMEFDLAKSRVGF 425
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 156/378 (41%), Gaps = 44/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V GTPSK ++ DTGS ++W C+ C C S+ ++N S + K +
Sbjct: 85 LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144
Query: 61 PCDDLICRR----PPFRC-ENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP-- 112
PCD+ C P C N C + Y G+S +G + + + L
Sbjct: 145 PCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSN 204
Query: 113 -GVIFGC-SNDNRDF--SFDGNIAGILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYRE 166
VIFGC + + D + + + GILGF S S++ QL +T + +F++CL
Sbjct: 205 GSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL----DG 260
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
+ I G +Q K + + HY +++ + V + + F
Sbjct: 261 INGGGIFAIGH--VVQPK--VNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEF--EAGD 314
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +ID+G ++ YE ++ ++H +++ C++Y +
Sbjct: 315 RKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDL---KVHIVRDEYT-CFQYSGSVDDGF 370
Query: 286 ASMTFHFDRADF-KVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRF 337
++TFHF+ + F KV P F F EG +C+ S S ++G +
Sbjct: 371 PNVTFHFENSVFLKVHPHEYLFPF--EGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLV 428
Query: 338 VYDLNTGTIQFVPENCAN 355
+YDL I + NC++
Sbjct: 429 LYDLENQAIGWTEYNCSS 446
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 143/359 (39%), Gaps = 37/359 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P+ S+ +L DTGS + W QC PC C +Q+ P+F+P++SSTY C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAA 187
Query: 67 CRRPPFR----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + + QC + + Y G+S +G S++T V FGCS N
Sbjct: 188 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS----AVKSFQFGCS--N 241
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ F+ G++G SL+ Q T FSYCL ++ L G
Sbjct: 242 VESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL---PPTPSSSGFLTLGAAGGSG 298
Query: 183 RKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M RSS Y + LQ I V ++ F + G ++D+G +
Sbjct: 299 TSGFVKTPML--RSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVI 350
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADF 297
T + Y + F + + S + C+ + + + S+ F
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQ---PSGILDTCFDFSGQSSVSIPSVALVFSGGAV 407
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V I N C+A + + +S ++G QQ+ +YD+ G + F C
Sbjct: 408 -VSLDASGIILSN----CLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 139/365 (38%), Gaps = 43/365 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V G+P ++ L DT + W +PC C ++ +F P S+T+K + C
Sbjct: 98 YIVRAKIGSPPQTLLLAMDTSNDAAW---IPCTACDGCTSTLFAPEKSTTFKNVSCGSPQ 154
Query: 67 CRRPP-FRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C + P C C + Y G +S + V +T T +P FGC
Sbjct: 155 CNQVPNPSCGTSACTFNLTY-GSSSIAANVVQDTVTLATDP----IPDYTFGC------- 202
Query: 126 SFDGNIAGILGFSVSP----------FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
+A G S P SLL Q ++ Q FSYCL +++ + + LR
Sbjct: 203 -----VAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-PSFKSLNFSGSLRL 256
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTG 235
G A R + RSS YY++L I V + P A G + D+G
Sbjct: 257 GPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSG 316
Query: 236 AIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS-EDWEYCYRYDSRFRAYASMTFHFDR 294
+ T + Y V F + + S ++ CY ++TF F
Sbjct: 317 TVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVP---IVAPTITFMFSG 373
Query: 295 ADFKVEPTYMYFIFQNEG-YFCVAISFSDRN-----SVVGAWQQQDTRFVYDLNTGTIQF 348
+ + P I G C+A++ + N +V+ QQQ+ R +YD+ +
Sbjct: 374 MNVTL-PEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGV 432
Query: 349 VPENC 353
E C
Sbjct: 433 ARELC 437
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 140/375 (37%), Gaps = 39/375 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP + ++ DTGS ++W C C C S F+P +S T I
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 61 PCDDLIC----RRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG- 113
C D C + C +N C + Y G+ SG ++ F + VP
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 114 ---VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
V+FGCS D + GI GF S++ QL S A +FS+CL E
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---KGE 256
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
IL G+ I +M V HY ++L ISV + P F+ NG
Sbjct: 257 NGGGGILVLGE---IVEPNM-VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS-TSNG 311
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +IDTG ++ Y F E T+ Q + CY + +
Sbjct: 312 Q-GTIIDTGTTLAYLSEAAY----VPFVEAITNAVSQSVRPVVSKGNQCYVITTSVGDIF 366
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNE----GYFCVAISFSDRN--SVVGAWQQQDTRFVY 339
++ +F Y I QN +C+ +++G +D FVY
Sbjct: 367 PPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVY 426
Query: 340 DLNTGTIQFVPENCA 354
DL I + +C+
Sbjct: 427 DLVGQRIGWANYDCS 441
>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
Length = 398
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 93/220 (42%), Gaps = 30/220 (13%)
Query: 1 HEKNYF-----YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASS 55
H N F + VDV FGTP + L+ DTGS + WTQC CVNC S FB +ASS
Sbjct: 117 HNNNLFDEDGNFLVDVAFGTPPQXFXLILDTGSSITWTQCKACVNCLQDSXRYFBXSASS 176
Query: 56 TYKRIPCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVI 115
TY C P EN + + Y +++ G T T +
Sbjct: 177 TYSXGSCI-------PXTVENN---YNMTYGDDSTSVGNYGCXTMTLEPSDVF---QKFQ 223
Query: 116 FGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
FG +N+ F G+LG S + Q S +FSYCL E ++ L F
Sbjct: 224 FGXGRNNKG-DFGSGADGMLGLGQGQLSTVSQTASKFXKVFSYCL----PEEDSIGSLLF 278
Query: 176 GKDANIQRKDMKTIRMF-------VDRSSHYYLSLQDISV 208
G+ A Q +K + + S +Y++ L DISV
Sbjct: 279 GEKATSQSSSLKFTSLVNGPGTSGLXESGYYFVKLLDISV 318
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 140/375 (37%), Gaps = 39/375 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP + ++ DTGS ++W C C C S F+P +S T I
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 61 PCDDLIC----RRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG- 113
C D C + C +N C + Y G+ SG ++ F + VP
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 114 ---VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
V+FGCS D + GI GF S++ QL S A +FS+CL E
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---KGE 256
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
IL G+ I +M V HY ++L ISV + P F+ NG
Sbjct: 257 NGGGGILVLGE---IVEPNM-VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS-TSNG 311
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RAY 285
G +IDTG ++ Y F E T+ Q + CY + +
Sbjct: 312 Q-GTIIDTGTTLAYLSEAAY----VPFVEAITNAVSQSVRPVVSKGNQCYVITTSVGDIF 366
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNE----GYFCVAISFSDRN--SVVGAWQQQDTRFVY 339
++ +F Y I QN +C+ +++G +D FVY
Sbjct: 367 PPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVY 426
Query: 340 DLNTGTIQFVPENCA 354
DL I + +C+
Sbjct: 427 DLVGQRIGWANYDCS 441
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/272 (27%), Positives = 122/272 (44%), Gaps = 18/272 (6%)
Query: 90 SASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLK 149
+ S L+ T +K +P + FGC +NR D AG+LG SL+ QL
Sbjct: 118 TGSDLIWTHKLCKGVKPSKFSIPRIGFGCGVNNRATGMD-QTAGLLGLGRGVLSLVSQLG 176
Query: 150 STAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRS----SHYYLSLQD 205
+ FSYCL + TS L FG A K R + ++ S+YYL+L+
Sbjct: 177 TQK---FSYCLTSIHEN--KTSSLLFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKG 231
Query: 206 ISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRM 265
I+V + F L ++G+GG ++D+G T++Q ++V+ F + Q
Sbjct: 232 ITVGYTLLPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAF---ISQTELQVA 288
Query: 266 HNASEDWEYCYRYDSRFRA---YASMTFHFDRADFKVEPTYMYFIFQNE-GYFCVAISFS 321
++++ + C+ + A + FHF D + P Y + E G C+AI +
Sbjct: 289 NSSTTGLDLCFHLPVKNAAEVKVPKLIFHFKGLDLAL-PVENYMVSDPEMGLICLAIDAT 347
Query: 322 DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
S+ G QQQ+ ++DL T+ VP C
Sbjct: 348 GSLSIFGNIQQQNMLVLHDLKKSTLSLVPTQC 379
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 139/341 (40%), Gaps = 44/341 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNC-----FNQSAPIFNPNASSTYKRI 60
Y + GTP + + D S +W QC C C SAP F S R
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSFHDTRA 155
Query: 61 PCDDLICRRPPFRCENGQCVHRINYAGGAS--ASGLVSTETFTFHLKNKLVCVPGVIFGC 118
P PP C + Y GGA+ +GL++ + F F V GVIFGC
Sbjct: 156 PT------TPP-------CGYSYVYGGGAANTTAGLLAVDAFAF----ATVRADGVIFGC 198
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKD 178
+ + +G+I G++G S + QL+ G FSY L ++ S + F D
Sbjct: 199 A-----VATEGDIGGVIGLGRGELSPVSQLQ---IGRFSYYLA-PDDAVDVGSFILFLDD 249
Query: 179 ANIQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGA 236
A + + + R+S YY+ L I V + GTF L+ +G+GG ++
Sbjct: 250 AKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITI 309
Query: 237 IATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE-DWEYCYRYDSRFRA-YASMTFHF-D 293
TF+ G Y+VV + R + SE + CY +S A SM F
Sbjct: 310 PVTFLDAGAYKVVRQAMASKI----ELRAADGSELGLDLCYTSESLATAKVPSMALVFAG 365
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAI--SFSDRNSVVGAWQQ 332
A ++E +++ G C+ I S + S++G+ Q
Sbjct: 366 GAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQ 406
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 141/365 (38%), Gaps = 38/365 (10%)
Query: 13 FGTPSKSEFLLFDTGSYLIWTQCLPCVNC--FNQSAPIFNPNASSTYKRIPCDDLICRRP 70
GTP + D G L+WTQC C + FNQ P F+P SSTY+ PC +C
Sbjct: 30 IGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALCEFF 89
Query: 71 PF---RCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSF 127
P C C + + SG + T+ V FGC +
Sbjct: 90 PASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAIGTATAA----SVAFGCVMASDIKLM 145
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMK 187
DG +G +G + +P SL+ Q+ TA FS+CL S L F A K
Sbjct: 146 DGGPSGFVGLARTPLSLVAQMNVTA---FSHCLAPHDGGGGKNSRL-FLGAAAKLAGGGK 201
Query: 188 TIRM---FVD------RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ M FV +S +Y ++L+ I D I P ++G ++ T +
Sbjct: 202 SAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAIITVP------QSGR-TVLLQTFSPV 254
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFK 298
+F+ G Y+ + + ++ C++ + F A
Sbjct: 255 SFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSIFDLCFKRGG-VSGAPDVVLTFQGAAAL 313
Query: 299 VEPTYMYFIFQNEGYFCVAISFSDR--------NSVVGAWQQQDTRFVYDLNTGTIQFVP 350
P Y + + CVAI+ S R S++G QQQ+ F+YDL T+ F
Sbjct: 314 TVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEKETLSFEA 373
Query: 351 ENCAN 355
+C++
Sbjct: 374 ADCSS 378
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 143/357 (40%), Gaps = 27/357 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP++ L DT + W C C C S FNP AS++Y+ +PC
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 164
Query: 67 C---RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C P C ++YA +S +S +T V FGC R
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSYA-DSSLQAALSQDTLAVAGD----VVKAYTFGCL--QR 217
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
G+LG P S L Q K FSYCL +++ + + LR G+ N Q
Sbjct: 218 ATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCL-PSFKSLNFSGTLRLGR--NGQP 274
Query: 184 KDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ +KT + + RSS YY+++ I V + A G ++D+G + T +
Sbjct: 275 RRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 334
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD--RADFKV 299
P + +R G + + ++ CY A+ +T FD +
Sbjct: 335 V-APVYLALRDEVRRRVGAGAAAVSSLG-GFDTCYNTTV---AWPPVTLLFDGMQVTLPE 389
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
E ++ + +A + N+ V+ + QQQ+ R ++D+ G + F E+C
Sbjct: 390 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/399 (23%), Positives = 152/399 (38%), Gaps = 65/399 (16%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV V GTP ++ ++ DTGS L W C N AP F+ +ASS+Y +PC
Sbjct: 60 NVSLTVPVAVGTPPQNVTMVLDTGSELSWLLC----NGSRHDAP-FDASASSSYAPVPCS 114
Query: 64 DLIC----RRPPFR--CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C R P R C++ C ++YA +SA GL++ +TF +FG
Sbjct: 115 SPACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSPM-----PALFG 169
Query: 118 C--SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRF 175
C S + + G+LG + S + Q TA F+YC+ + IL
Sbjct: 170 CITSYSSSTDPSETPPTGLLGMNRGGLSFVTQ---TATRRFAYCIAAG----QGPGILLL 222
Query: 176 GKD-------ANIQRK-------DMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
G + + Q++ ++ + DR++ Y + L+ I V +
Sbjct: 223 GGNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAA-YTVQLEGIRVGSALLAIPKHLLT 281
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED-------WEY 274
G G M+D+G TF+ Y + F T + E ++
Sbjct: 282 PDHTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDA 341
Query: 275 CYR-YDSRFRAYAS--------MTFHFDRADFKVEPTYMYFI-----FQNEGYFCVAISF 320
C+R ++R A A+ + +Y + + EG +C+
Sbjct: 342 CFRGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGS 401
Query: 321 SDRNS----VVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
SD V+G QQD YDL + F CA+
Sbjct: 402 SDMAGVSAYVIGHHHQQDVWVEYDLRNARLGFAAARCAD 440
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 82/182 (45%), Gaps = 18/182 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+Y V + G P + +L DTGS L W QC PCV C P++ P++ IPC+D
Sbjct: 56 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSD----LIPCND 111
Query: 65 LICRRPPF----RCENG-QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C+ RCE QC + + YA G S+ G++ + F+ + L P + GC
Sbjct: 112 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 171
Query: 120 NDN-RDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
D S + G+LG S+L QL S + + +CL IL FG
Sbjct: 172 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL-----SSLGGGILFFG 226
Query: 177 KD 178
D
Sbjct: 227 DD 228
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 147/386 (38%), Gaps = 56/386 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + G+P ++ ++ DTGS L W C N +FNP +S TY ++PC
Sbjct: 66 NVSLTVSLTVGSPPQNVTMVLDTGSELSWLHC-KKTQFLNS---VFNPLSSKTYSKVPCL 121
Query: 64 DLICRRP------PFRCENGQCVHRI-NYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
C+ P C+ + H I +YA S G ++ ETF K P IF
Sbjct: 122 SPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK----PATIF 177
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILR 174
GC + + + D G++G + S + Q+ FSYC+ ++ +L
Sbjct: 178 GCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYPK---FSYCI----SGFDSAGVLL 230
Query: 175 FGKDANIQRKDM--------KTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
G + K + T + DR + Y + L+ I V + + F G
Sbjct: 231 LGNASFPWLKPLSYTPLVQISTPLPYFDRVA-YTVQLEGIKVKNKVLSLPKSVFVPDHTG 289
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED-------WEYCYRYD 279
G M+D+G TF+ GP +++ F S R + ++D + CY D
Sbjct: 290 AGQTMVDSGTQFTFL-LGPVYTALKN---EFLSQTRGILKVLNDDNFVFQGAMDLCYLLD 345
Query: 280 S---RFRAYASMTFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSD----RNSVV 327
S + ++ F A+ V + + E +C SD V+
Sbjct: 346 SSRPNLQNLPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVI 405
Query: 328 GAWQQQDTRFVYDLNTGTIQFVPENC 353
G QQ+ +DL I C
Sbjct: 406 GHHHQQNVWMEFDLEKSRIGLADVRC 431
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 5/186 (2%)
Query: 171 SILRFGKDANIQRKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTG 228
S+L G N+ T + + + S YY+SL+ ISV D ++ TF + +G+G
Sbjct: 7 SVLLLGSLPNVNATKQVTTPLITNPLQPSFYYISLEVISVGDTKLSIEQSTFEVSDDGSG 66
Query: 229 GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASM 288
G +ID+G T+I+ ++ + + F T + + D + +
Sbjct: 67 GVIIDSGTTITYIEENAFDSLKKEFTSQ-TKLPVDKSGSTGLDVCFSLPSGKTEVEIPKL 125
Query: 289 TFHFDRADFKVEPTYMYFIFQNE-GYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQ 347
FHF D ++ P Y I + G C+A+ S+ S+ G QQQ+ +DL TI
Sbjct: 126 VFHFKGGDLEL-PGENYMIADSSLGVACLAMGASNGMSIFGNIQQQNILVNHDLQKETIT 184
Query: 348 FVPENC 353
F+P C
Sbjct: 185 FIPTQC 190
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 140/340 (41%), Gaps = 28/340 (8%)
Query: 19 SEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDDLICRR-PPFR-- 73
++ ++ DT S + W QC PC C+ Q +++P SS+ C+ C + P+
Sbjct: 168 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANG 227
Query: 74 -CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR-DFSFDGNI 131
N QC +R+ Y G S +G ++ T V FGCS+ + FSF +
Sbjct: 228 CTNNNQCQYRVRYPDGTSTAGTYISDLLTI---TPATAVRSFQFGCSHGVQGSFSFGSSA 284
Query: 132 AGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRM 191
AGI+ P SL+ Q +T +FS+C R T L + A + ++
Sbjct: 285 AGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFT--LGVPRVAAWRYVLTPMLKN 342
Query: 192 FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMR 251
+ Y + L+ I+VA RI P FA G +D+ T + Y+ + +
Sbjct: 343 PAIPPTFYMVRLEAIAVAGQRIAVPPTVFA------AGAALDSRTAITRLPPTAYQALRQ 396
Query: 252 HFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDR-ADFKVEPTYMYFIFQ 309
F + + + + CY R A +T FD+ A +++P+ + F
Sbjct: 397 AFRDRMAMY---QPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF--- 450
Query: 310 NEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQF 348
+G +D+ ++G Q Q +Y++ + F
Sbjct: 451 -QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGF 489
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 143/357 (40%), Gaps = 27/357 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP++ L DT + W C C C S FNP AS++Y+ +PC
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 111
Query: 67 C---RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
C P C ++YA +S +S +T V FGC R
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYA-DSSLQAALSQDTLAVAGD----VVKAYTFGCL--QR 164
Query: 124 DFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQR 183
G+LG P S L Q K FSYCL +++ + + LR G+ N Q
Sbjct: 165 ATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCL-PSFKSLNFSGTLRLGR--NGQP 221
Query: 184 KDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+ +KT + + RSS YY+++ I V + A G ++D+G + T +
Sbjct: 222 RRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 281
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD--RADFKV 299
P + +R G + + ++ CY A+ +T FD +
Sbjct: 282 V-APVYLALRDEVRRRVGAGAAAVSSLG-GFDTCYNTTV---AWPPVTLLFDGMQVTLPE 336
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
E ++ + +A + N+ V+ + QQQ+ R ++D+ G + F E+C
Sbjct: 337 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 140/340 (41%), Gaps = 28/340 (8%)
Query: 19 SEFLLFDTGSYLIWTQCLPCVN--CFNQSAPIFNPNASSTYKRIPCDDLICRR-PPFR-- 73
++ ++ DT S + W QC PC C+ Q +++P SS+ C+ C + P+
Sbjct: 143 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANG 202
Query: 74 -CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR-DFSFDGNI 131
N QC +R+ Y G S +G ++ T V FGCS+ + FSF +
Sbjct: 203 CTNNNQCQYRVRYPDGTSTAGTYISDLLTI---TPATAVRSFQFGCSHGVQGSFSFGSSA 259
Query: 132 AGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRM 191
AGI+ P SL+ Q +T +FS+C R T L + A + ++
Sbjct: 260 AGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFT--LGVPRVAAWRYVLTPMLKN 317
Query: 192 FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMR 251
+ Y + L+ I+VA RI P FA G +D+ T + Y+ + +
Sbjct: 318 PAIPPTFYMVRLEAIAVAGQRIAVPPTVFA------AGAALDSRTAITRLPPTAYQALRQ 371
Query: 252 HFDEHFTSFGRQRMHNASEDWEYCYRYDS-RFRAYASMTFHFDR-ADFKVEPTYMYFIFQ 309
F + + + + CY R A +T FD+ A +++P+ + F
Sbjct: 372 AFRDRMAMY---QPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF--- 425
Query: 310 NEGYFCVAISFSDR-NSVVGAWQQQDTRFVYDLNTGTIQF 348
+G +D+ ++G Q Q +Y++ + F
Sbjct: 426 -QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGF 464
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 99/406 (24%), Positives = 157/406 (38%), Gaps = 71/406 (17%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCFNQSA---PIFNPNASSTYKRI 60
Y GTP + +L DTGS+L W C C NC + SA P+F+P SS+ + +
Sbjct: 67 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 126
Query: 61 PCDD-------------LICRRPPFRCENGQCVHRIN-----YA---GGASASGLVSTET 99
C + CRR P C + YA G S +GL+ +T
Sbjct: 127 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 186
Query: 100 FTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FS 157
L+ VPG + GCS S +G+ GF S+ QL GL FS
Sbjct: 187 ----LRAPGRAVPGFVLGCS----LVSVHQPPSGLAGFGRGAPSVPAQL-----GLPKFS 233
Query: 158 YCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSS-------HYYLSLQDISVAD 210
YCL+ + A + M+ + + + +YYL+L+ ++V
Sbjct: 234 YCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGG 293
Query: 211 HRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE 270
+ FA G+GG ++D+G T++ P GR + +E
Sbjct: 294 KAVRLPARAFAANAAGSGGTIVDSGTTFTYLD--PTVFQPVADAVVAAVGGRYKRSKDAE 351
Query: 271 D---WEYCYRYD--SRFRAYASMTFHFDRADFKVEPTYMYFIFQNEG---YFCVAI--SF 320
D C+ +R A ++FHF+ P YF+ G C+A+ F
Sbjct: 352 DELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDF 411
Query: 321 SDRNS----------VVGAWQQQDTRFVYDLNTGTIQFVPENCAND 356
S + ++G++QQQ+ YDL + F ++C +
Sbjct: 412 SGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSS 457
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 104/407 (25%), Positives = 156/407 (38%), Gaps = 63/407 (15%)
Query: 7 YTVDVLFGTPSKSE--FLLFDTGSYLIWTQCLP--CVNCFNQSAPIFNPNASSTY----- 57
YT+ + G S + L DTGS L+W C P C+ C + P N N+S+
Sbjct: 83 YTLSLSVGPLSTANPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTD 142
Query: 58 -KRIPCDDLICRR-----PPFR-CENGQCVHRINYAGGASASGLVSTETFTF---HLKNK 107
+RIPC C PP C +C G +AS + + L +
Sbjct: 143 SRRIPCASPFCSAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVAR 202
Query: 108 L----------VCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA-QGLF 156
L V V F C++ + G G+ GF P SL QL A G F
Sbjct: 203 LRRGRVGIAASVAVENFTFACAH-----TALGEPVGVAGFGRGPLSLPAQLAPAALSGRF 257
Query: 157 SYCLVY----AYREMEATSIL---RFGKDANIQRKDMKTIRMFVDRSSHYY-LSLQDISV 208
SYCLV A R + + ++ G+D + + T + + ++Y ++L+ +SV
Sbjct: 258 SYCLVAHSFRADRPIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSV 317
Query: 209 ADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNA 268
RI P + R G GG ++D+G T + Y V F + +R A
Sbjct: 318 GGTRIPARPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAA 377
Query: 269 SED--WEYCYRYD--------SRFRAYASMTFHFDRADFKVEPTYMYFI-FQNE-----G 312
+ CY YD RA + HF V P YF+ F++E G
Sbjct: 378 EDQTGLAPCYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVG 437
Query: 313 YFCVAISFSDRN----SVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
+ D +G +QQQ VYD++ G + F C +
Sbjct: 438 CLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 84/388 (21%), Positives = 142/388 (36%), Gaps = 64/388 (16%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCL----PCVNCFNQSAPIFNPNASST 56
H +FY V + G P+K FL DTGS L W +C PC C P++ P
Sbjct: 35 HPTGHFY-VTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPK---- 89
Query: 57 YKRIPCDDLIC--------RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKL 108
K +PC D +C R E QC ++INYA G ++ G++ + F+ +
Sbjct: 90 -KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSLPTGSAR 148
Query: 109 VCVPGVIFGCSND-----NRDFSFDGNIAGILGFSVSPFSLLGQLK---STAQGLFSYCL 160
+ FGC D + + GILG L+ QLK + ++ + +CL
Sbjct: 149 ----NIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIGHCL 204
Query: 161 VYAYREMEATSILRFGKDANIQRKDMKTIRMF-VDRSSHYYLSLQDISVADHRIGFAPGT 219
+ L G++ N+ + I ++ + R ++Y Q T
Sbjct: 205 -----SSKGGGYLFIGEE-NVPSSHLHIIYIYCISREPNHYSPGQ-------------AT 245
Query: 220 FALRRNGTG----GCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYC 275
L RN G + D+G+ T++ + ++ + + + C
Sbjct: 246 LHLGRNPIGTKPFKAIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLC 305
Query: 276 YRYDSRFRAYASM--------TFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS-- 325
++ F+ + T FD P Y I G C I
Sbjct: 306 WKGPKPFKTVHDLPKEFKSLVTLKFDHGVTMTIPPENYLIITGHGNACFGILELPGYDLF 365
Query: 326 VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+G Q+ ++D G + ++P C
Sbjct: 366 VIGGISMQEQLVIHDNEKGRLAWMPSPC 393
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 87/378 (23%), Positives = 146/378 (38%), Gaps = 57/378 (15%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
FY V + G PSK FL DTGS L W QC +P C P + P+ + + C D
Sbjct: 19 FYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHPYYKPSNN----LVACKD 74
Query: 65 LICRR----PPFRCEN-GQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG-C 118
IC+ RCEN GQC + + YA G S+ G++ + F + ++ P + G C
Sbjct: 75 PICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNLNFTSEKRQSPLLALGLC 134
Query: 119 SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRFG 176
D I G+LG S++ QL + + +CL +
Sbjct: 135 GYDQLPGGTYHPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSGRGGGFLFFGDDLY- 193
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI--DT 234
SS + + + GFA TF + G ++ D+
Sbjct: 194 ------------------DSSRVAWTPMSPNAKHYSPGFAELTFDGKTTGFKNLIVAFDS 235
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF 292
GA T++ Y+ ++ ++ + + A +D C++ F++ + +F
Sbjct: 236 GASYTYLNSQVYQGLISLIKRELST---KPLREALDDQTLPICWKGRKPFKSVRDVKKYF 292
Query: 293 D----------RADFKVE-PTYMYFIFQNEGYFCVA------ISFSDRNSVVGAWQQQDT 335
++ ++E P Y I ++G C+ + +D N V+G QD
Sbjct: 293 KTFALSFANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLN-VIGDISMQDR 351
Query: 336 RFVYDLNTGTIQFVPENC 353
+YD I + P NC
Sbjct: 352 VVIYDNEKQLIGWAPRNC 369
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 88/342 (25%), Positives = 140/342 (40%), Gaps = 25/342 (7%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V GTP ++ L DT + W +PC C ++ +F P S+T+K + C
Sbjct: 93 YIVRAKIGTPPQTLLLAMDTSNDAAW---IPCTACDGCASTLFAPEKSTTFKNVSCAAPE 149
Query: 67 CRRPPFR-CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDF 125
C++ P C + Y + A+ LV +T T VP FGC +
Sbjct: 150 CKQVPNPGCGVSSRNFNLTYGSSSIAANLVQ-DTITLATDP----VPSYTFGCVSKTTGT 204
Query: 126 SFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD 185
S LG SLL Q ++ Q FSYCL +++ + + LR G A +R
Sbjct: 205 SAPPQGLLGLGRGPL--SLLSQTQNLYQSTFSYCL-PSFKSLNFSGSLRLGPVAQPKRIK 261
Query: 186 MKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGP 245
+ RSS YY++L+ I V + P A G + D+G + T + P
Sbjct: 262 YTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLV-AP 320
Query: 246 YEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMY 305
V +R DE G + + ++ CY ++TF F + + P
Sbjct: 321 VYVAVR--DEFRRRVGPKLTVTSLGGFDTCYNVPI---VVPTITFIFTGMNVTL-PQDNI 374
Query: 306 FIFQNEG-YFCVAISFSDRN-----SVVGAWQQQDTRFVYDL 341
I G C+A++ + N +V+ QQQ+ R +YD+
Sbjct: 375 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDV 416
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 141/377 (37%), Gaps = 43/377 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + G+P + ++ DTGS ++W C C C S F+P +S T +
Sbjct: 80 LYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPV 139
Query: 61 PCDDLIC----RRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG- 113
C D C + C +N C + Y G+ SG ++ F + VP
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 114 ---VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKSTAQGL----FSYCLVYAY 164
V+FGCS D + GI GF S++ QL S QGL FS+CL
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLAS--QGLAPRVFSHCL---K 254
Query: 165 REMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
E IL G+ I +M V HY ++L ISV + P F+
Sbjct: 255 GENGGGGILVLGE---IVEPNM-VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS-TS 309
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR- 283
NG G +IDTG ++ Y F E T+ Q + CY +
Sbjct: 310 NGQ-GTIIDTGTTLAYLSEAAY----VPFVEAITNAVSQSVRPVVSKGNQCYVIATSVAD 364
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNE----GYFCVAISFSDRN--SVVGAWQQQDTRF 337
+ ++ +F Y I QN +C+ +++G +D F
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIF 424
Query: 338 VYDLNTGTIQFVPENCA 354
VYDL I + +C+
Sbjct: 425 VYDLVGQRIGWANYDCS 441
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 105/239 (43%), Gaps = 14/239 (5%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N +YT + GTP + L+ D+GS + + C C C N P F P+ SS+Y + C+
Sbjct: 86 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCN 145
Query: 64 -DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
D C + QC + YA +S+SG++ + +F +++L V FGC N
Sbjct: 146 VDCTC-----DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKAQRAV-FGCENSE 199
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F + GI+G S++ QL S+ L Y ++ +++ G
Sbjct: 200 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTP-- 257
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
DM R RS +Y + L++I VA + F + G ++D+G ++
Sbjct: 258 -SDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKH----GTVLDSGTTYAYL 311
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 139/313 (44%), Gaps = 40/313 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP+KS ++ DTGS ++W C+ C C +S ++N + S + K +
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 61 PCDDLICRR----PPFRCE-NGQCVHRINYAGGASASG-----LVSTETFTFHLKNKLVC 110
CDD C + P C+ N C + Y G+S +G +V ++ LK +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQ-TA 197
Query: 111 VPGVIFGCS---NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYR 165
VIFGC + + D S + + GILGF + S++ QL S+ + +F++CL
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----D 253
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
I G+ +Q K + V HY +++ + V + F +
Sbjct: 254 GRNGGGIFAIGR--VVQPK--VNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLF--QPG 307
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
G +ID+G ++ YE +++ ++H +D++ C++Y R
Sbjct: 308 DRKGAIIDSGTTLAYLPEIIYEPLVKKEPA-------LKVHIVDKDYK-CFQYSGRVDEG 359
Query: 285 YASMTFHFDRADF 297
+ ++TFHF+ + F
Sbjct: 360 FPNVTFHFENSVF 372
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 94/419 (22%), Positives = 150/419 (35%), Gaps = 80/419 (19%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC------LPCVNCFNQSAP------------- 47
Y V GTP++ L+ DTGS L W +C P + +AP
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPG-YGYAAPASNDSSTSSLSAA 165
Query: 48 ---------IFNPNASSTYKRIPCDDLICRRP-PFR-----CENGQCVHRINYAGGASAS 92
+F P+ S T+ IPC C PF C + Y G++A
Sbjct: 166 AASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAAR 225
Query: 93 GLVSTETFTFHL-------KNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLL 145
G V T++ T L K + + GV+ GC+ SF + G+L S S
Sbjct: 226 GTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLAS-DGVLSLGYSNISFA 284
Query: 146 GQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD-------------------- 185
+ + G FSYCLV ATS L FG + +
Sbjct: 285 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPG 344
Query: 186 -MKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + +D Y +++ ISV + + + + GG ++D+G T +
Sbjct: 345 GARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKG--GGAILDSGTSLTVLV 402
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR------AYASMTFHFDRAD 296
Y V+ ++ R M + ++YCY + S A + HF +
Sbjct: 403 SPAYRAVVAALNKKLAGLPRVTM----DPFDYCYNWTSPSTGEDLTVAMPELAVHFAGSA 458
Query: 297 FKVEPTYMYFIFQNEGYFCVAISFSDRN--SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
P Y I G C+ + + SV+G QQ+ + +DL ++F C
Sbjct: 459 RLQPPAKSYVIDAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 517
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 143/359 (39%), Gaps = 37/359 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P+ S+ +L DTGS + W QC PC C +Q+ P+F+P++SSTY C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 187
Query: 67 CRRPPFR----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + + QC + + Y G+S +G S++T V FGCS N
Sbjct: 188 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS----AVRSFQFGCS--N 241
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ F+ G++G SL+ Q T FSYCL ++ L G
Sbjct: 242 VESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL---PPTPSSSGFLTLGAAGGSG 298
Query: 183 RKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M RSS Y + LQ I V ++ F + G ++D+G +
Sbjct: 299 TSGFVKTPML--RSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVI 350
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADF 297
T + Y + F + + S + C+ + + + S+ F
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQ---PSGILDTCFDFSGQSSVSIPSVALVFSGGAV 407
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V I N C+A + + +S ++G QQ+ +YD+ G + F C
Sbjct: 408 -VSLDASGIILSN----CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 113/300 (37%), Gaps = 34/300 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
Y V + G P K FL D+GS L W QC PC +C P++ P S K +PC
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKS---KLVPCVH 121
Query: 65 LICRR------PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
+C RC++ QC + I YA S++G++ ++F L N V P V F
Sbjct: 122 RLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAF 181
Query: 117 GCSNDN--RDFSFDGNIAGILGFSVSPFSLLGQLK--STAQGLFSYCLVYAYREMEATSI 172
GC D R G+LG SLL QLK + + +CL +
Sbjct: 182 GCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL-----SLRGGGF 236
Query: 173 LRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI 232
L FG D ++ T ++Y + D +G +
Sbjct: 237 LFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVR----------LAKVVF 286
Query: 233 DTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHF 292
D+G+ T+ PY+ ++ + + R C++ F++ + F
Sbjct: 287 DSGSSFTYFAAKPYQALVTALKDGLS---RTLEEEPDTSLPLCWKGQEPFKSVLDVRKEF 343
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 142/359 (39%), Gaps = 37/359 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P+ S+ +L DTGS + W QC PC C +Q+ P+F+P++SSTY C
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 111
Query: 67 CRRPPFR----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + + QC + + Y G+S +G S++T V FGCS N
Sbjct: 112 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS----AVRSFQFGCS--N 165
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ F+ G++G SL+ Q T FSYCL ++ L G
Sbjct: 166 VESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL---PPTPSSSGFLTLGAAGGSG 222
Query: 183 RKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M RSS Y + LQ I V ++ F + G ++D+G +
Sbjct: 223 TSGFVKTPML--RSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVI 274
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADF 297
T + Y + F + S + C+ + + + S+ F
Sbjct: 275 TRLPPTAYSALSSAFKAGMKQY---PPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAV 331
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V I N C+A + + +S ++G QQ+ +YD+ G + F C
Sbjct: 332 -VSLDASGIILSN----CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 143/359 (39%), Gaps = 37/359 (10%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V G+P+ S+ +L DTGS + W QC PC C +Q+ P+F+P++SSTY C
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 257
Query: 67 CRRPPFR----CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + + QC + + Y G+S +G S++T V FGCS N
Sbjct: 258 CAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS----AVRSFQFGCS--N 311
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
+ F+ G++G SL+ Q T FSYCL ++ L G
Sbjct: 312 VESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL---PPTPSSSGFLTLGAAGGSG 368
Query: 183 RKDMKTIRMFVDRSSH----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
M RSS Y + LQ I V ++ F + G ++D+G +
Sbjct: 369 TSGFVKTPML--RSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVI 420
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYASMTFHFDRADF 297
T + Y + F + + S + C+ + + + S+ F
Sbjct: 421 TRLPPTAYSALSSAFKAGMKQYPPAQ---PSGILDTCFDFSGQSSVSIPSVALVFSGGAV 477
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V I N C+A + + +S ++G QQ+ +YD+ G + F C
Sbjct: 478 -VSLDASGIILSN----CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
Length = 299
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 65/123 (52%), Gaps = 24/123 (19%)
Query: 1 HEKNYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRI 60
H N + +++ GTP+++ + DTGS LIWTQC PC CF+Q PIF+P SS++ ++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKL 150
Query: 61 PCDDLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
PC + H +S G+++TETFTF + V + FGC
Sbjct: 151 PC-------------SSDLYH-------SSTQGVLATETFTFGDAS----VSKIGFGCGE 186
Query: 121 DNR 123
DNR
Sbjct: 187 DNR 189
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 150/365 (41%), Gaps = 34/365 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + + GTP L D L W C C +C F P+ SSTY C+
Sbjct: 97 YLIKISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFF-PSESSTYTSAACESYQ 155
Query: 67 CR-RPPFRCENGQCVHRINYAGGASAS----GLVSTETFTFHLKN-KLVCVPGVIFGCSN 120
C+ C+ C++ +S GLV+ +T +FH + + + P F C
Sbjct: 156 CQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQALSYPNTNFICGT 215
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV-YAYREMEATSILRFGKDA 179
++ + G AGI+G FS+ Q+K G FS CLV Y+ ++ +S + FG
Sbjct: 216 FIDNWHYIG--AGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQ---SSKINFGLKG 270
Query: 180 NIQRKDMKTIRMFVD-RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
+ + + + + D S Y+L L+ +SV +R+ A ++ ++ ID
Sbjct: 271 VVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRV--ANNFYSAPKSNI---YIDWRTTF 325
Query: 239 TFIQRGPYEVVMRHFDE--HFTSFGRQRMHNASEDWEYCYRYDSRFRAYA-SMTFHFDRA 295
T + YE V + + T +N CY+ +S A +T HF A
Sbjct: 326 TSLPHDFYENVEAEVRKAINLTPIN----YNNERKLSLCYKSESDHDFDAPPITMHFTNA 381
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRN-------SVVGAWQQQDTRFVYDLNTGTIQF 348
D ++ P F+ + C A N +V G+WQQ + YDL + T+ F
Sbjct: 382 DVQLSPLNT-FVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVGYDLKSSTVSF 440
Query: 349 VPENC 353
+C
Sbjct: 441 KQADC 445
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 111/266 (41%), Gaps = 24/266 (9%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y + V GTP+ ++ ++ DTGS + W C S+ F+P SSTY C
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSLFFDPGKSSTYTPFSCSSAA 182
Query: 67 CRRPPFR---CE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS--N 120
C R R C N C + + Y G++ +G ++T + K V FGCS +
Sbjct: 183 CTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEK---VENFQFGCSETS 239
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDAN 180
D + + G++G SL+ Q +T FSYCL R ++ L G A+
Sbjct: 240 DPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTR---SSGFLTLG--AS 294
Query: 181 IQRKDMKTIRMFVDRSSH--YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIA 238
T MF R + Y++ LQ I+V + +P FA G ++D+G I
Sbjct: 295 TGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFA------AGSIMDSGTII 348
Query: 239 TFIQRGPYEVVMRHFDEHFTSFGRQR 264
T + Y + F + R R
Sbjct: 349 TRLPPRAYSALSAAFRAGMRRYPRAR 374
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 143/360 (39%), Gaps = 30/360 (8%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y GTP+++ + D + W C AP F+P SSTY+ + C
Sbjct: 107 YVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPVRCGAPQ 164
Query: 67 CRRPPF-RCENG---QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + P C G C ++YA ++ L+ + H + + V FGC +
Sbjct: 165 CSQAPAPSCPGGLGSSCAFNLSYAA-STFQALLGQDALALH--DDVDAVAAYTFGCLHVV 221
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S G++GF P S Q K +FSYCL +Y+ + LR G Q
Sbjct: 222 TGGSVPPQ--GLVGFGRGPLSFPSQTKDVYGSVFSYCL-PSYKSSNFSGTLRLGPAG--Q 276
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K +KT + + R S YY+++ I V + A G ++D G + T
Sbjct: 277 PKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTR 336
Query: 241 IQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFD-RADFKV 299
+ Y V + F S R + ++ CY + ++TF FD R +
Sbjct: 337 LSAPVYAAVR----DVFRSRVRAPVAGPLGGFDTCYNVTI---SVPTVTFSFDGRVSVTL 389
Query: 300 EPTYMYFIFQNEGYFCVAISFSDRN------SVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
+ + G C+A++ + +V+ + QQQ+ R ++D+ G + F E C
Sbjct: 390 PEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 449
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/392 (23%), Positives = 150/392 (38%), Gaps = 58/392 (14%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNCF-----NQSAPIFNPNASSTYK 58
Y++D+ GTP ++ + DTGS L+W C C +C P F P SST K
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147
Query: 59 RIPCDDLICR---RPPFRCENGQC-------------VHRINYAGGASASGLVSTETFTF 102
+ C + C P QC + I Y GA+A G + + F
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATA-GFLLLDNLNF 206
Query: 103 HLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV- 161
K VP + GCS + +GI GF SL Q+ FSYCLV
Sbjct: 207 PGKT----VPQFLVGCSILSIR-----QPSGIAGFGRGQESLPSQMNLKR---FSYCLVS 254
Query: 162 YAYREMEATS--ILRFGKDANIQRKDMKTIRMFVDRSS------HYYLSLQDISVADHRI 213
+ + + +S +L+ + + + + S+ +YY++L+ + V +
Sbjct: 255 HRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDV 314
Query: 214 GFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHF-TSFGRQRMHNASEDW 272
+G GG ++D+G+ TF++R Y +V + F + R+ A
Sbjct: 315 KIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGL 374
Query: 273 EYCYRYDS-RFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS------ 325
C+ + ++ TF F +P YF F + SD +
Sbjct: 375 SPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTA 434
Query: 326 ----VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
++G +QQQ+ YDL F P NC
Sbjct: 435 GPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/425 (24%), Positives = 151/425 (35%), Gaps = 95/425 (22%)
Query: 7 YTVDVLFGTPSKSE--FLLFDTGSYLIWTQCLP--CVNCFNQSA---------------- 46
YT+ + G S + L DTGS L+W C P C+ C +
Sbjct: 90 YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRRI 149
Query: 47 PIFNPNASSTYKRIPCDDL--ICRRPPFRCENGQC-------------------VH---- 81
P +P S+ + P DL + R P E G C H
Sbjct: 150 PCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRG 209
Query: 82 RINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSP 141
R+ GA AS V+ + FTF + + G G+ GF P
Sbjct: 210 RVALGAGARASVAVAVDNFTFACAHTAL-------------------GEPVGVAGFGRGP 250
Query: 142 FSLLGQLKSTAQGLFSYCLV-YAYR--EMEATSILRFGKDANIQRKDMKT----IRMFVD 194
SL GQL G FSYCLV +++R + S L G+ + +T +
Sbjct: 251 LSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLH 310
Query: 195 RSSH---YYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMR 251
H Y ++L+ +SV RI P + R G GG ++D+G T + Y V
Sbjct: 311 NPKHPYFYSVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAE 370
Query: 252 HFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHFDRADFKVEPTYMYFI-F 308
F + G R A E CYRY + R + HF P YF+ F
Sbjct: 371 AFARAMAAAGFARAERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPRRNYFMGF 430
Query: 309 QNE---------GYFCVAISFSDRNS---------VVGAWQQQDTRFVYDLNTGTIQFVP 350
++E C+ + S +G +QQQ VYD++ G + F
Sbjct: 431 KSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFAR 490
Query: 351 ENCAN 355
C +
Sbjct: 491 RRCTD 495
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 144/359 (40%), Gaps = 49/359 (13%)
Query: 22 LLFDTGSYLI---WT----QCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLICRR-PPFR 73
LL D G ++ W+ C C++CF Q P+F PNASST+K PC +C+ P +
Sbjct: 35 LLADGGGAVVPFHWSPELYNCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPK 94
Query: 74 CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFD-GNIA 132
C + C + G G+V+T+TF P R S +
Sbjct: 95 CASDVCAYDGVTGLGGHTVGIVATDTFAIG-----TAAPARPPASGASWRATSTPWAGPS 149
Query: 133 GILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMF 192
G +G +P+SL+ Q+K T FSYCL A + S L G A + T F
Sbjct: 150 GFIGLGRTPWSLVAQMKLTR---FSYCL--APHDTGKNSRLFLGASAKLAGGGAWT--PF 202
Query: 193 V-----DRSSHYY-LSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPY 246
V D S YY + L++I D I T RN ++ T + +
Sbjct: 203 VKTSPNDGMSQYYPIELEEIKAGDATI-----TMPRGRNTV---LVQTAVVRVSLL---V 251
Query: 247 EVVMRHFDEH-FTSFGRQRMHN-ASEDWEYCYRYDSRFRAYASMTFHFDR-ADFKVEPTY 303
+ V + F + S G +E C+ + + F F A V P
Sbjct: 252 DSVYQEFKKAVMASVGAAPTATPVGAPFEVCFP-KAGVSGAPDLVFTFQAGAALTVPPAN 310
Query: 304 MYFIFQNEGYFCVAISFS-------DRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCAN 355
F N+ +S + D +++G++QQ++ ++DL+ + F P +C++
Sbjct: 311 YLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSS 369
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 131/323 (40%), Gaps = 40/323 (12%)
Query: 47 PIFNPNASSTYKRIPCDDLICRRPPFR-CEN--------GQCVHRINYAGGASA----SG 93
P+ P +SS+ + C D C P C N G C + Y G
Sbjct: 13 PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72
Query: 94 LVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQ 153
++ TETFTF + PG+ FGC+ R G +G++G SL+ QL A
Sbjct: 73 ILMTETFTF--GDDAAAFPGIAFGCTL--RSEGGFGTGSGLVGLGRGKLSLVTQLNVEA- 127
Query: 154 GLFSYCLVYAYREMEATSILRFGKDANIQRKD----MKTIRM---FVDRSSHYYLSLQDI 206
F Y L ++ A S + FG A++ + M T + V YY+ L I
Sbjct: 128 --FGYRL---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGI 182
Query: 207 SVADHRIGFAPGTFAL-RRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRM 265
SV + GTF+ R G GG + D+G T + Y +V DE + G Q+
Sbjct: 183 SVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR---DELLSQMGFQKP 239
Query: 266 HNASEDWEY-CYRYDSRFRAYASMTFHFDRA---DFKVEPTYMYFIFQN-EGYFCVAISF 320
A+ D + C+ S + SM HFD D E QN E C ++
Sbjct: 240 PPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVK 299
Query: 321 SDRN-SVVGAWQQQDTRFVYDLN 342
S + +++G Q D V+DL+
Sbjct: 300 SSQALTIIGNIMQMDFHVVFDLS 322
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 138/376 (36%), Gaps = 51/376 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDD 64
+YTV + G P K L DTGS L W QC PC C ++ PN + + C D
Sbjct: 63 YYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKPNGN----LVKCGD 118
Query: 65 LICR----RPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
+C+ P C N QC + + YA S+ G++ + N + P + FGC
Sbjct: 119 PLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLARPILAFGC 178
Query: 119 SNDNRDFSFD--GNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
D + + + AG+LG S+L QL S GL + + E L FG
Sbjct: 179 GYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSL--GLIRNVVGHCLSE-RGGGFLFFG 235
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMI--DT 234
Q + T + + HY + G A F + G + D+
Sbjct: 236 DQLVPQSGVVWTPLLQSSSTQHY------------KTGPADLFFDRKPTSVKGLQLIFDS 283
Query: 235 GAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYASMTFHF 292
G+ T+ ++ ++ + + A+ED C+R F++ +T +F
Sbjct: 284 GSSYTYFNSKAHKALVNLVTNDLRG---KPLSRATEDSSLPICWRGPKPFKSLHDVTSNF 340
Query: 293 ---------DRADFKVEPTYMYFIFQNEGYFCVA------ISFSDRNSVVGAWQQQDTRF 337
+ P Y I G C+ I + N ++G QD
Sbjct: 341 KPLLLSFTKSKNSLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTN-IIGDISLQDKLV 399
Query: 338 VYDLNTGTIQFVPENC 353
+YD I + NC
Sbjct: 400 IYDNEKQQIGWASANC 415
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 132/333 (39%), Gaps = 33/333 (9%)
Query: 43 NQSAPIFNPNASSTYKRIPCD------DL-------ICRRPPFRCENGQCVHRINYAGGA 89
N +F P+ S +++ + C DL +C +P + C++ I+YA G+
Sbjct: 185 NPCKGVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKP-----SDPCLYDISYADGS 239
Query: 90 SASGLVSTETFTFHLKN-KLVCVPGVIFGCSNDNRD-FSFDGNIAGILGFSVSPFSLLGQ 147
SA G T+T T LKN K + + GC+ + +F+ + GILG + S + +
Sbjct: 240 SAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDK 299
Query: 148 LKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDIS 207
FSYCLV +S L G N + + Y +++ IS
Sbjct: 300 AAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKRTELILFPPFYGVNVVGIS 359
Query: 208 VADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN 267
+ + P + N GG +ID+G T + YE V + T R
Sbjct: 360 IGGQMLKIPPQVWDF--NSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRV---- 413
Query: 268 ASEDW---EYCYRYDS-RFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSD- 322
ED+ ++C+ + + FHF P Y I C+ I D
Sbjct: 414 TGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDG 473
Query: 323 --RNSVVGAWQQQDTRFVYDLNTGTIQFVPENC 353
SV+G QQ+ + +DL+T TI F P C
Sbjct: 474 IGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/378 (21%), Positives = 156/378 (41%), Gaps = 44/378 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQS-----APIFNPNASSTYKRI 60
Y + GTP K+ +L DTGS ++W C+ C C +S +++ SS+ K +
Sbjct: 84 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFV 143
Query: 61 PCDDLICRRPPFRCENG-----QCVHRINYAGGASASG-----LVSTETFTFHLKNKLVC 110
PCD C+ G C + Y G+S +G +V + + LK
Sbjct: 144 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTD-SA 202
Query: 111 VPGVIFGCS---NDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYR 165
++FGC + + S + + GILGF + S++ QL S+ + +F++CL
Sbjct: 203 NGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL----N 258
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
+ I G ++ + + + D+ HY +++ + V + + T +
Sbjct: 259 GVNGGGIFAIG---HVVQPKVNMTPLLPDQ-PHYSVNMTAVQVGHAFLSLSTDTST--QG 312
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAY 285
G +ID+G ++ G YE ++ + +H+ ++Y D F A
Sbjct: 313 DRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTCFQYSESVDDGFPA- 371
Query: 286 ASMTFHFDRA-DFKVEPTYMYFIFQNEGYFCVAISFSDRNS-------VVGAWQQQDTRF 337
+TF+F+ KV P ++F + ++C+ S S ++G +
Sbjct: 372 --VTFYFENGLSLKVYP--HDYLFPSGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLV 427
Query: 338 VYDLNTGTIQFVPENCAN 355
YDL I + NC++
Sbjct: 428 FYDLENQVIGWTEYNCSS 445
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 151/364 (41%), Gaps = 41/364 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + F++ DT + + C+ C SA F+PNAS++Y + C
Sbjct: 98 YIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC---SATTFSPNASTSYVPLECSVPQ 154
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + C +G C +YAG ++ LV L+ +P FG N
Sbjct: 155 CSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDS-----LRLATDVIPSYSFGSINAI 209
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S LG SLL Q S G+FSYCL +++ + L+ G Q
Sbjct: 210 SGSSIPAQGLLGLGRGPL--SLLSQTGSLYSGVFSYCLP-SFKSYYFSGSLKLGPVG--Q 264
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K ++T + + R S Y+++L I+V + F A N G +ID+G + T
Sbjct: 265 PKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVITR 324
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
Y V F + F+S G ++ C+ + A A +T HF D
Sbjct: 325 FVEPVYNAVRDEFRKQVTGPFSSLGA---------FDTCFVKNYETLAPA-ITLHFTDLD 374
Query: 297 FKVEPTYMYFIFQNEGYF-CVAISFSDRN------SVVGAWQQQDTRFVYDLNTGTIQFV 349
K+ P I + G C+A++ + +N +V+ +QQQ+ R ++D +
Sbjct: 375 LKL-PLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKVGIA 433
Query: 350 PENC 353
E C
Sbjct: 434 RELC 437
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 147/368 (39%), Gaps = 34/368 (9%)
Query: 9 VDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDDLIC 67
V + GTP +++ ++ DTGS L W QC V F+P+ SS++ +PC+ +C
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141
Query: 68 --RRP----PFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSN 120
R P P C +N C + YA G A G + E TF P +I GC+
Sbjct: 142 KPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQS---TPPLILGCAE 198
Query: 121 DNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV--YAYREMEATSILRFGKD 178
+ D GILG ++ S Q K + FSYC+ A + +T G +
Sbjct: 199 ASTDEK------GILGMNLGRRSFASQAKISK---FSYCVPTRQARAGLSSTGSFYLGNN 249
Query: 179 ANIQRKDMKTIRMFV--DRSSH-----YYLSLQDISVADHRIGFAPGTFALRRNGTGGCM 231
N R + F RS + Y + +Q I + + R+ + F +G G +
Sbjct: 250 PNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTI 309
Query: 232 IDTGAIATFIQRGPYEVVMRHFDEHF-TSFGRQRMHNASEDWEYCYRYDSRFRAYASMTF 290
ID+G+ T++ Y V + ++ D + R +M F
Sbjct: 310 IDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCFDGNPMEIGRLIGNMVF 369
Query: 291 HFDRADFKVEPTYMYFIFQNEGYFCVAISFSDR----NSVVGAWQQQDTRFVYDLNTGTI 346
F++ V + G C+ I S+ ++++G + QQ+ YDL I
Sbjct: 370 EFEKGVEIVIDKWRVLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDLANRRI 429
Query: 347 QFVPENCA 354
+C+
Sbjct: 430 GLGKADCS 437
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 140/377 (37%), Gaps = 43/377 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V GTP + DTGS ++W C C C S F+P +SST I
Sbjct: 74 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 133
Query: 61 PCDDLICRR------PPFRCENGQCVHRINYAGGASASGLVSTETF----TFHLKNKLVC 110
C D C +N QC + Y G+ SG ++ F
Sbjct: 134 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNS 193
Query: 111 VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
V+FGCSN D + D + GI GF S++ QL S A +FS+CL +
Sbjct: 194 TAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL---KGD 250
Query: 167 MEATSILRFGK--DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
IL G+ + NI + V HY L+LQ I+V + FA
Sbjct: 251 SSGGGILVLGEIVEPNIVYTSL------VPAQPHYNLNLQSIAVNGQTLQIDSSVFA--T 302
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-R 283
+ + G ++D+G ++ Y+ F T+ Q +H CY S
Sbjct: 303 SNSRGTIVDSGTTLAYLAEEAYD----PFVSAITASIPQSVHTVVSRGNQCYLITSSVTE 358
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNE--GYFCVAISFSDRN----SVVGAWQQQDTRF 337
+ ++ +F + Y I QN G I F +++G +D
Sbjct: 359 VFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIV 418
Query: 338 VYDLNTGTIQFVPENCA 354
VYDL I + +C+
Sbjct: 419 VYDLAGQRIGWANYDCS 435
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 145/380 (38%), Gaps = 50/380 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYK--RI 60
++F T+++ G P+K FL DTGS L W QC PC+NC ++ P K
Sbjct: 37 HFFVTMNI--GDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKCTEQ 94
Query: 61 PCDDLICR-RPPFRC-ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGC 118
C DL R P +C QC + I Y GG+S G++ ++F+ N + FGC
Sbjct: 95 RCADLYADLRKPMKCGPKNQCHYGIQYVGGSSI-GVLIVDSFSLPASNGTNPT-SIAFGC 152
Query: 119 --SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
+ + + + GILG +LL QLKS QG+ + ++ + L FG
Sbjct: 153 GYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKS--QGVITKHVLGHCISSKGKGFLFFG 210
Query: 177 KDANIQRKDMKTIRMFVDRSSHYYLSLQ---DISVADHRIGFAPGTFALRRNGTGGCMID 233
DA + + M +R +Y Q + I AP + D
Sbjct: 211 -DAKVPTSGVTWSPM--NREHKHYSPRQGTLQFNSNSKPISAAPMEV----------IFD 257
Query: 234 TGAIATFIQRGPYEVVMRHFDEHFTSFGR--QRMHNASEDWEYCYRYDSRFRA------- 284
+GA T+ PY + + + + C++ + R
Sbjct: 258 SGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKC 317
Query: 285 YASMTFHFDRADFKVE---PTYMYFIFQNEGYFCVAI--------SFSDRNSVVGAWQQQ 333
+ S++ F D K P Y I EG+ C+ I S + N ++G
Sbjct: 318 FRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTN-LIGGITML 376
Query: 334 DTRFVYDLNTGTIQFVPENC 353
D +YD + +V C
Sbjct: 377 DQMVIYDSERSLLGWVNYQC 396
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 122/328 (37%), Gaps = 37/328 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V GTP + DTGS ++W C C C S F+P +SST I
Sbjct: 24 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 83
Query: 61 PCDDLICRR------PPFRCENGQCVHRINYAGGASASGLVSTETF----TFHLKNKLVC 110
C D C +N QC + Y G+ SG ++ F
Sbjct: 84 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNS 143
Query: 111 VPGVIFGCSNDNR-DFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
V+FGCSN D + D + GI GF S++ QL S A +FS+CL +
Sbjct: 144 TAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL---KGD 200
Query: 167 MEATSILRFGK--DANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRR 224
IL G+ + NI + V HY L+LQ I+V + FA
Sbjct: 201 SSGGGILVLGEIVEPNIVYTSL------VPAQPHYNLNLQSIAVNGQTLQIDSSVFATSN 254
Query: 225 NGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-R 283
+ G ++D+G ++ Y+ F T+ Q +H A CY S
Sbjct: 255 --SRGTIVDSGTTLAYLAEEAYD----PFVSAITASIPQSVHTAVSRGNQCYLITSSVTE 308
Query: 284 AYASMTFHFDRADFKVEPTYMYFIFQNE 311
+ ++ +F + Y I QN
Sbjct: 309 VFPQVSLNFAGGASMILRPQDYLIQQNS 336
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 78/153 (50%), Gaps = 9/153 (5%)
Query: 14 GTPSKSEFLLFDTGSYLIWTQCLPC--VNCFNQSAPIFNPNASSTYKRIPCDDLICRR-P 70
GT + + ++ D+GS + W QC PC + C Q P+F+P S+TY +PC C R
Sbjct: 155 GTSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLG 214
Query: 71 PFR---CENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSF 127
P+R N QC Y GA+A+G S++ T + V G +FGC++ +R +F
Sbjct: 215 PYRRGCSANVQCQFGFTYTDGATATGTYSSDDLTLGPYD---VVRGFLFGCAHADRGSTF 271
Query: 128 DGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL 160
+++G L S + Q + +FSYC+
Sbjct: 272 SFDVSGTLALGGGAQSFVQQTATQYGRVFSYCI 304
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/376 (23%), Positives = 156/376 (41%), Gaps = 45/376 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI-----FNPNASSTYKRI 60
Y D+ GTP+ ++ DTGS W + C C ++S + ++P +S + K +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 61 PCDDLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP---GVI 115
CDD IC RPP +C + YA G G++ T+ +H L P V
Sbjct: 118 KCDDTICTSRPPCNMTL-RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 176
Query: 116 FGC------SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
FGC S +N + D GI+GF S + L QL + + +FS+CL
Sbjct: 177 FGCGLQQSGSLNNSAVAID----GIIGFGNSNQTALSQLAAAGKTKKIFSHCL----DST 228
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G+ + +KT + + ++ ++L+ I+VA + F + T
Sbjct: 229 NGGGIFAIGE---VVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK--T 283
Query: 228 GGCMIDTGAIATFIQRGPY-EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
G ID+G+ ++ Y E+++ F +H M+N + + + D +F
Sbjct: 284 KGTFIDSGSTLVYLPEIIYSELILAVFAKH-PDITMGAMYNF-QCFHFLGSVDDKF---P 338
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEG-YFC-----VAISFSDRNSVVGAWQQQDTRFVYD 340
+TFHF+ D ++ ++ + EG +C I ++G + VYD
Sbjct: 339 KITFHFEN-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYD 397
Query: 341 LNTGTIQFVPENCAND 356
+ I + N +
Sbjct: 398 MEKQAIGWTEHNSVEE 413
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 152/379 (40%), Gaps = 47/379 (12%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + G+PSK ++ DTGS ++W C+ C C S ++P S T +
Sbjct: 84 LYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TV 141
Query: 61 PCDDLICRR-------PPFRCENGQCVHRINYAGGASASGLVSTETFTF-HLKNKLVCVP 112
CD C P + C RI Y G+S +G +++ + + P
Sbjct: 142 GCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTP 201
Query: 113 ---GVIFGCSND-NRDF-SFDGNIAGILGFSVSPFSLLGQLKST--AQGLFSYCLVYAYR 165
+ FGC D S + GILGF + S+L QL + + +F++CL
Sbjct: 202 SNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL----D 257
Query: 166 EMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRN 225
+ I G N+ + +KT + V +HY ++LQ ISV + TF
Sbjct: 258 TVHGGGIFAIG---NVVQPKVKTTPL-VQNVTHYNVNLQGISVGGATLQLPSSTF--DSG 311
Query: 226 GTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRF-RA 284
+ G +ID+G ++ R Y ++ F + +HN + C+++
Sbjct: 312 DSKGTIIDSGTTLAYLPREVYRTLLTAV---FDKYQDLALHNYQD--FVCFQFSGSIDDG 366
Query: 285 YASMTFHFDRADFKVEPTYMYFIFQNE------GYFCVAISFSDRNSVV--GAWQQQDTR 336
+ +TF F+ + + ++FQNE G+ + D +V G +
Sbjct: 367 FPVVTFSFE-GEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKL 425
Query: 337 FVYDLNTGTIQFVPENCAN 355
VYDL I + NC++
Sbjct: 426 VVYDLEKQVIGWADYNCSS 444
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 129/288 (44%), Gaps = 32/288 (11%)
Query: 79 CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFS 138
C + INY G+ G + E F + V IFGC +N+ G ++G++G
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGT----ILVKDFIFGCGRNNKGLF--GGVSGLMGLG 186
Query: 139 VSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQRKD--MKTIRMFVDRS 196
S SL+ Q G+FSYCL R+ + IL G ++++ R + +M +
Sbjct: 187 RSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLIL--GGNSSVYRNSSPISYAKMIENPQ 244
Query: 197 SH--YYLSLQDISVADHRIGFAPGTFALRRNGTGG--CMIDTGAIATFIQRGPYEVVMRH 252
+ Y+++L IS+ G AL+ G ++D+G + T + Y+ +
Sbjct: 245 LYNFYFINLTGISI---------GGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAE 295
Query: 253 FDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFD-RADFKVEPTYM-YFIFQ 309
F + FT F A + C+ + ++ HF+ A+ V+ T + YF+
Sbjct: 296 FLKQFTGF---PPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKS 352
Query: 310 NEGYFCVAIS---FSDRNSVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
+ C+A++ + D +++G +QQ++ R +YD + F E C+
Sbjct: 353 DASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 150/386 (38%), Gaps = 53/386 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + GTP ++ ++ DTGS L W C + FN S +Y+ IPC
Sbjct: 28 NISLTVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPCS 86
Query: 64 DLICRRP------PFRCE-NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIF 116
C P C+ N C ++YA +S+ G ++++TF + +PG++F
Sbjct: 87 SSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASD----IPGMVF 142
Query: 117 GCSND--NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCL-------VYAYREM 167
GC + + + D G++G + S + Q+ FSYC+ + E
Sbjct: 143 GCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPK---FSYCISGTDFSGMLLLGES 199
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
T + +Q + T + DR + Y + L+ I V+D + F G
Sbjct: 200 NFTWAVPLNYTPLVQ---ISTPLPYFDRIA-YTVQLEGIKVSDRLLPIPKSVFEPDHTGA 255
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEY--------CYRYD 279
G M+D+G TF+ Y + F T F R ED ++ CYR
Sbjct: 256 GQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLR-----VLEDPDFVFQGAMDLCYRVP 310
Query: 280 SRFRA---YASMTFHFDRADFKVEPTYMYF-----IFQNEGYFCVAISFSD----RNSVV 327
R +++ F+ A+ V + + I N+ C++ SD V+
Sbjct: 311 ISQRVLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVI 370
Query: 328 GAWQQQDTRFVYDLNTGTIQFVPENC 353
G QQ+ +DL I C
Sbjct: 371 GHHHQQNVWMEFDLERSRIGLAQVRC 396
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 132/355 (37%), Gaps = 52/355 (14%)
Query: 16 PSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLIC---RRP 70
P ++ + DT L W QC PC C+ Q +F+P S T +PC C R
Sbjct: 158 PILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRY 217
Query: 71 PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
C N QC + ++Y G + SG + T N V FGCS+ R +F +
Sbjct: 218 GAGCSNNQCQYFVDYGDGRATSGTYMVDALTL---NPSTVVMNFRFGCSHAVRG-NFSAS 273
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCL--------VYAYREMEATSILRFGKDANIQ 182
+G + SLL Q +T FSYC+ + + RF + ++
Sbjct: 274 TSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVR 333
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + Y + L+ I V R+ P FA GG ++D+ I T +
Sbjct: 334 NPSII--------PTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLP 379
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY---RYDSRFRAYASMTF------HFD 293
Y + F ++ R+ + CY R+ S S+ F D
Sbjct: 380 PTAYRALRLAFRSAMAAY--PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLD 437
Query: 294 RADFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
VE + G F A+ F +G QQQ +YD+ G++ F
Sbjct: 438 AMGVMVEGCLAF--VPTPGDF--ALGF------IGNVQQQTHEVLYDVGGGSVGF 482
>gi|222615721|gb|EEE51853.1| hypothetical protein OsJ_33366 [Oryza sativa Japonica Group]
Length = 315
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 118/264 (44%), Gaps = 25/264 (9%)
Query: 79 CVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFS 138
C R++Y G+++ G++ +T TF K +PG FGC+ D+ + GN+ G+LG
Sbjct: 20 CPFRVSYQDGSASYGILYQDTLTFSDVQK---IPGFSFGCNMDSFGANEFGNVDGLLGMG 76
Query: 139 VSPFSLLGQLKSTAQGLFSYCLVYAYREM----EATSILRFGKDANIQRKDMKTIRMFVD 194
P S+L Q T FSYCL E + T GK A R D++ +M
Sbjct: 77 AGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT--RTDVRYTKMVAR 133
Query: 195 R--SSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRH 252
+ + +++ L ISV R+G +P F+ + G + D+G+ ++I V+ +
Sbjct: 134 KKNTELFFVDLTAISVDGERLGLSPSVFSRK-----GVVFDSGSELSYIPDRALSVLSQR 188
Query: 253 FDEHFTSFGRQRMHNASEDWEYCYRYDSRFRA-YASMTFHFD---RADFKVEPTYMYFIF 308
E +R E CY S +++ HFD R D ++
Sbjct: 189 IRELLL----KRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSV 244
Query: 309 QNEGYFCVAISFSDRNSVVGAWQQ 332
Q + +C+A + ++ S++G+ Q
Sbjct: 245 QEQDVWCLAFAPNESVSIIGSLIQ 268
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 149/385 (38%), Gaps = 53/385 (13%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCD 63
N TV + GTP ++ ++ DTGS L W C +A F P AS+T+ +PC
Sbjct: 58 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCA-TGRAAAAAADSFRPRASATFAAVPCG 116
Query: 64 DLICRR----PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFG 117
C P C+ +C ++YA G+++ G ++T+ F L FG
Sbjct: 117 SARCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRSA----FG 172
Query: 118 CSNDNRDFSFDG-NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFG 176
C + D S D AG+LG + S + Q ST + FSYC+ + + +L G
Sbjct: 173 CMSAAYDSSPDAVATAGLLGMNRGALSFVTQ-ASTRR--FSYCI----SDRDDAGVLLLG 225
Query: 177 KD------ANIQRKDMKTIRM-FVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGG 229
N T + + DR + Y + L I V + P A G G
Sbjct: 226 HSDLPFLPLNYTPLYQPTPPLPYFDRVA-YSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQ 284
Query: 230 CMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYR--YDSRFRA--- 284
M+D+G TF+ Y V F + + + A ED + ++ +D+ FR
Sbjct: 285 TMVDSGTQFTFLLGDAYSAVKAEFLKQ-----TKPLLPALEDPSFAFQEAFDTCFRVPKG 339
Query: 285 -------YASMTFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNS----VVG 328
+T F+ A V + + E G +C+ +D V+G
Sbjct: 340 RPPPSARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIG 399
Query: 329 AWQQQDTRFVYDLNTGTIQFVPENC 353
Q + YDL G + P C
Sbjct: 400 HHHQMNLWVEYDLERGRVGLAPVKC 424
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 148/351 (42%), Gaps = 41/351 (11%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPIFNPNASSTYKRIPCDDLI 66
Y V V GTP + F++ DT + + C+ C SA F+PNAS++Y + C
Sbjct: 98 YIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC---SATTFSPNASTSYVPLECSVPQ 154
Query: 67 CRRP-PFRCE---NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDN 122
C + C +G C +YAG ++ LV L+ +P FG N
Sbjct: 155 CSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDS-----LRLATDVIPSYSFGSINAI 209
Query: 123 RDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
S LG SLL Q S G+FSYCL +++ + L+ G Q
Sbjct: 210 SGSSIPAQGLLGLGRGPL--SLLSQTGSLYSGVFSYCLP-SFKSYYFSGSLKLGPVG--Q 264
Query: 183 RKDMKTIRMFVD--RSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATF 240
K ++T + + R S Y+++L I+V + F A N G +ID+G + T
Sbjct: 265 PKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVITR 324
Query: 241 IQRGPYEVVMRHFDEH----FTSFGRQRMHNASEDWEYCYRYDSRFRAYASMTFHFDRAD 296
Y V F + F+S G ++ C+ + A A +T HF D
Sbjct: 325 FVEPVYNAVRDEFRKQVTGPFSSLGA---------FDTCFVKNYETLAPA-ITLHFTDLD 374
Query: 297 FKVEPTYMYFIFQNEGYF-CVAISFSDRN------SVVGAWQQQDTRFVYD 340
K+ P I + G C+A++ + +N +V+ +QQQ+ R ++D
Sbjct: 375 LKL-PLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFD 424
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/384 (23%), Positives = 146/384 (38%), Gaps = 49/384 (12%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCL---PCVNCF---NQSAPIFNPNASSTYKRI 60
+T+ + FGTP + L DTGS+++W C C NC + PIFNP SS+ K +
Sbjct: 87 HTIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKIL 146
Query: 61 PCDDLICRR----------PPFRCENGQCVH-----RINYAGGASASGLVSTETFTFHLK 105
C D C P + +C H + Y GA ASG E F K
Sbjct: 147 GCRDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYGTGA-ASGFFLLENLDFPGK 205
Query: 106 NKLVCVPGVIFGCSND-NRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGLFSYCLV--- 161
+ + GC+ +R+ S D + GF + FSL Q+ F+YCL
Sbjct: 206 T----IHKFLVGCTTSADREPSSD----ALAGFGRTMFSLPMQMGVKK---FAYCLNSHD 254
Query: 162 YAYREMEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFA 221
Y IL + ++ D +YYL ++D+ + + +
Sbjct: 255 YDDTRNSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGKYLT 314
Query: 222 LRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDS- 280
+ GG MID+G ++ +++V + + + R CY +
Sbjct: 315 PGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGH 374
Query: 281 RFRAYASMTFHFDRADFKVEPTYMYFIFQNE---GYFCVA-------ISFSDRNSVV-GA 329
+ + + F V P YF+ +E G F V + F+ S++ G
Sbjct: 375 KSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPSIILGN 434
Query: 330 WQQQDTRFVYDLNTGTIQFVPENC 353
+QQ D +DL + F + C
Sbjct: 435 YQQVDHYVEFDLKNERLGFRQQTC 458
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 146/373 (39%), Gaps = 37/373 (9%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAP-----IFNPNASSTYKRI 60
Y V G+P + DTGS ++W C C NC + S F+ S T +
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSV 158
Query: 61 PCDDLIC----RRPPFRC-ENGQCVHRINYAGGASASGLVSTETFTFH--LKNKLVC--V 111
C D IC + +C EN QC + Y G+ SG T+TF F L LV
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSS 218
Query: 112 PGVIFGCSN-DNRDFS-FDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYREM 167
++FGCS + D + D + GI GF S++ QL S +FS+CL +
Sbjct: 219 APIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL---KGDG 275
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
+ G+ I M + HY L+L I V + F + T
Sbjct: 276 SGGGVFVLGE---ILVPGM-VYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVF--EASNT 329
Query: 228 GGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AYA 286
G ++DTG T++ + Y+ + + + N E CY + +
Sbjct: 330 RGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNG----EQCYLVSTSISDMFP 385
Query: 287 SMTFHF-DRADFKVEP---TYMYFIFQNEGYFCVAISFS-DRNSVVGAWQQQDTRFVYDL 341
++ +F A + P + Y + +C+ + + +++G +D FVYDL
Sbjct: 386 PVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDL 445
Query: 342 NTGTIQFVPENCA 354
I + +C+
Sbjct: 446 ARQRIGWANYDCS 458
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/290 (27%), Positives = 111/290 (38%), Gaps = 32/290 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y + GTP + ++ DTGS ++W C C C S F+P +S T I
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 61 PCDDLIC----RRPPFRC--ENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPG- 113
C D C + C +N C + Y G+ SG ++ F + VP
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 114 ---VIFGCSNDNRD--FSFDGNIAGILGFSVSPFSLLGQLKS--TAQGLFSYCLVYAYRE 166
V+FGCS D + GI GF S++ QL S A +FS+CL E
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---KGE 256
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
IL G+ I +M V HY ++L ISV + P F+ NG
Sbjct: 257 NGGGGILVLGE---IVEPNM-VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS-TSNG 311
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY 276
G +IDTG ++ Y F E T+ Q + CY
Sbjct: 312 Q-GTIIDTGTTLAYLSEAAY----VPFVEAITNAVSQSVRPVVSKGNQCY 356
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 141/326 (43%), Gaps = 39/326 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI-----FNPNASSTYKRI 60
Y D+ GTP+ ++ DTGS W + C C ++S + ++P +S + K +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 61 PCDDLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP---GVI 115
CDD IC RPP +C + YA G G++ T+ +H L P V
Sbjct: 142 KCDDTICTSRPPCNMTL-RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 200
Query: 116 FGC------SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
FGC S +N + D GI+GF S + L QL + + +FS+CL
Sbjct: 201 FGCGLQQSGSLNNSAVAID----GIIGFGNSNQTALSQLAAAGKTKKIFSHCL----DST 252
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G+ + +KT + + ++ ++L+ I+VA + F + T
Sbjct: 253 NGGGIFAIGE---VVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK--T 307
Query: 228 GGCMIDTGAIATFIQRGPY-EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
G ID+G+ ++ Y E+++ F +H M+N + + + D +F
Sbjct: 308 KGTFIDSGSTLVYLPEIIYSELILAVFAKH-PDITMGAMYNF-QCFHFLGSVDDKF---P 362
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEG 312
+TFHF+ D ++ ++ + EG
Sbjct: 363 KITFHFEN-DLTLDVYPYDYLLEYEG 387
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 150/332 (45%), Gaps = 42/332 (12%)
Query: 40 NCFNQSAPIFNPNASSTYKRIPCDDLICRR-------PPFRCENGQCVHRINYAGGASAS 92
NCF + + N+ CD +C + P RC + Y +
Sbjct: 7 NCFVKHLTVLAHNS--------CDSPLCHKLDTGVCSPEKRCN-----YTYGYGDNSLTK 53
Query: 93 GLVSTETFTFHLKN-KLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKST 151
G+++ +T TF KLV + +FGC ++N F+ + G++G P SL+ Q+
Sbjct: 54 GVLAQDTATFTSNTGKLVSLSRFLFGCGHNNTG-GFNDHEMGLIGLGGGPTSLISQIGPL 112
Query: 152 AQGL-FSYCLVYAYREMEATSILRFGKDANIQRKDMKTIRMFVDRS---SHYYLSLQDIS 207
G FS CLV +++ +S + FGK + + + T + V R + Y+++L IS
Sbjct: 113 FGGKKFSQCLVPFLTDIKISSRMSFGKGSQVLGDGVVTTPL-VQREQDMTSYFVTLLGIS 171
Query: 208 VADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHN 267
V D + P + + G ++D+G + P ++ R + E + + + N
Sbjct: 172 VEDT---YLPMNSTIEK---GNMLVDSGTPPNIL---PQQLYDRVYVEVKNNVPLELITN 222
Query: 268 -ASEDWEYCYRYDSRFRAYASMTFHFDRADFKVEP--TYMYFIFQNEGYFCVAI-SFSDR 323
S + CYR + + ++T+HF+ A+ + P T++ + +G FC+AI ++++
Sbjct: 223 DPSLGPQLCYRTQTNLKG-PTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNS 281
Query: 324 N-SVVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
N V G + Q + +DL+ + F +C
Sbjct: 282 NGGVYGNFAQSNYLIGFDLDRQVVSFKATDCT 313
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 144/354 (40%), Gaps = 38/354 (10%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCD 63
+ V+V FGTP + L+ DTGS W QC C NC N+ FNP+ SS+Y C
Sbjct: 128 LFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKT--FNPSLSSSYSNRSC- 184
Query: 64 DLICRRPPFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNR 123
+ + + Y + + G+ + T K P FGC D+
Sbjct: 185 ----------IPSTDTNYTMKYEDNSYSKGVFVCDEVTL----KPDVFPKFQFGC-GDSG 229
Query: 124 DFSFDGNIAGILGFSVSP-FSLLGQLKSTAQGLFSYCLVYAYREMEATSILRFGKDANIQ 182
F G +G+LG + +SL+ Q S + FSYC + +E S+L FG+ A
Sbjct: 230 GGEF-GTASGVLGLAKGEQYSLISQTASKFKKKFSYC--FPPKEHTLGSLL-FGEKAISA 285
Query: 183 RKDMKTIRMFVDRSS-HYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFI 241
+K ++ S Y++ L ISVA R+ + FA + G +ID+G + T +
Sbjct: 286 SPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLFA-----SPGTIIDSGTVITRL 340
Query: 242 QRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYD---SRFRAYASMTFHF-DRADF 297
YE + F + + + CY R + HF D
Sbjct: 341 PTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDV 400
Query: 298 KVEPTYMYFIFQNEGYFCVAISFSDRNS---VVGAWQQQDTRFVYDLNTGTIQF 348
+ P+ + + + C+A + S ++G QQ + VYD+ G + F
Sbjct: 401 SLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGF 454
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/400 (24%), Positives = 152/400 (38%), Gaps = 67/400 (16%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQCLP---CVNC-FNQS----APIFNPNASSTYK 58
Y++ + GTPS++ L+ DTGS L+W C C +C F + P F P SS+ K
Sbjct: 84 YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143
Query: 59 RIPCDDLIC-----RRPPFRCEN---------GQCVHRINYAGGASASGLVSTETFTFHL 104
I C + C +C N C I G S +GL+ +ET F
Sbjct: 144 LIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPN 203
Query: 105 KNKLVCVPGVIFGCSNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTAQGL--FSYCLV- 161
K + + GCS GI GF S SL QL GL FSYCLV
Sbjct: 204 KT----ISDFLAGCS-----LLSTRQPEGIAGFGRSQESLPLQL-----GLKKFSYCLVS 249
Query: 162 --YAYREMEATSILRFGKDANIQRKDMKTIRMFVDR---------SSHYYLSLQDISVAD 210
+ + + IL G + + + F +YY+ L+ I V
Sbjct: 250 RRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGK 309
Query: 211 HRIGFAPGTFALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASE 270
+ +G GG ++D+G+ TF++ +E++ + F++ ++
Sbjct: 310 THVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLT 369
Query: 271 DWEYCYRYD-SRFRAYASMTFHFDRADFKVEPTYMYFIFQNEGYFCVAISFSDRNS---- 325
C+ + +TF F P YF F + G C+ I SD +
Sbjct: 370 GLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTI-VSDNAAALGG 428
Query: 326 -----------VVGAWQQQDTRFVYDLNTGTIQFVPENCA 354
++G +QQQ+ YDL F ++CA
Sbjct: 429 DGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 139/381 (36%), Gaps = 63/381 (16%)
Query: 7 YTVDVLFGTPSKSEFLLFDTGSYLIWTQC-LPCVNCFNQSAPIFNPNASSTYKRIPCDDL 65
YTV + G P K L D+GS L W QC PC C ++ PN + + C D
Sbjct: 64 YTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKPN----HNLVQCVDQ 119
Query: 66 ICRRPPFRCE------NGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCS 119
+C E + QC + + YA S+ G++ + F N V P V FGC
Sbjct: 120 LCSEVQLSMEYTCASPDDQCDYEVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCG 179
Query: 120 NDNRDFSFDGNIA--GILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREMEATSILRF 175
D + + A G+LG S+L QL S + +CL L F
Sbjct: 180 YDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCL-----SARGGGFLFF 234
Query: 176 GKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGC----- 230
G D F+ S + S+ S H ++ G L NG
Sbjct: 235 GDD-------------FIPSSGIVWTSMLPSSSEKH---YSSGPAELVFNGKATVVKGLE 278
Query: 231 -MIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASED--WEYCYRYDSRFRAYAS 287
+ D+G+ T+ Y+ V+ + G+Q + A++D C++ F++ +
Sbjct: 279 LIFDSGSSYTYFNSQAYQAVVDLVTQDLK--GKQ-LKRATDDPSLPICWKGAKSFKSLSD 335
Query: 288 MTFHFDRADFKVE---------PTYMYFIFQNEGYFCVAI------SFSDRNSVVGAWQQ 332
+ +F P Y I G C+ I + N ++G
Sbjct: 336 VKKYFKPLALSFTKTKILQMHLPPEAYLIITKHGNVCLGILDGTEVGLENLN-IIGDISL 394
Query: 333 QDTRFVYDLNTGTIQFVPENC 353
QD +YD I +V NC
Sbjct: 395 QDKMVIYDNEKQQIGWVSSNC 415
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 132/353 (37%), Gaps = 48/353 (13%)
Query: 16 PSKSEFLLFDTGSYLIWTQCLPCV--NCFNQSAPIFNPNASSTYKRIPCDDLIC---RRP 70
P ++ + DT L W QC PC C+ Q +F+P S T +PC C R
Sbjct: 142 PILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRY 201
Query: 71 PFRCENGQCVHRINYAGGASASGLVSTETFTFHLKNKLVCVPGVIFGCSNDNRDFSFDGN 130
C N QC + ++Y G + SG + T N V FGCS+ R +F +
Sbjct: 202 GAGCSNNQCQYFVDYGDGRATSGTYMVDALTL---NPSTVVMNFRFGCSHAVRG-NFSAS 257
Query: 131 IAGILGFSVSPFSLLGQLKSTAQGLFSYCL--------VYAYREMEATSILRFGKDANIQ 182
+G + SLL Q +T FSYC+ + + RF + ++
Sbjct: 258 TSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVR 317
Query: 183 RKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGTGGCMIDTGAIATFIQ 242
+ + Y + L+ I V R+ P FA GG ++D+ I T +
Sbjct: 318 NPSII--------PTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLP 363
Query: 243 RGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCY-RYDSRFRAYASMTF------HFDRA 295
Y + F ++ R A D Y + R+ S S+ F D
Sbjct: 364 PTAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAM 423
Query: 296 DFKVEPTYMYFIFQNEGYFCVAISFSDRNSVVGAWQQQDTRFVYDLNTGTIQF 348
VE + G F A+ F +G QQQ +YD+ G++ F
Sbjct: 424 GVMVEGCLAF--VPTPGDF--ALGF------IGNVQQQTHEVLYDVGGGSVGF 466
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 150/383 (39%), Gaps = 50/383 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V G P K + DTGS ++W C PC C +SA +++P SST +
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 61 PCDDLICRRPPFRCENGQCVHRIN-------YAGGASASGLVSTETFTFHL--KNKLV-C 110
C D +C R R QC N Y G+++ G + +++ N L
Sbjct: 61 SCSDPLCVR-GRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 119
Query: 111 VPGVIFGCS-NDNRDFSFDGN-IAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYRE 166
V+FGCS D S + GI+GF S+ QL + +FS+CL R
Sbjct: 120 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 179
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
I + T V S HY + L+ ISV +R+ F+ N
Sbjct: 180 GGILVIGGIAEPG-------MTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFS-STND 231
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AY 285
T G ++D+G + G Y V ++ E TS R+ C+ R +
Sbjct: 232 T-GVIMDSGTTLAYFPSGAYNVFVQAIREA-TSATPVRVQGMDTQ---CFLVSGRLSDLF 286
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEG------YFCVAISFSDRN---------SVVGAW 330
++T +F+ +++P Y ++ +C+ S + +++G
Sbjct: 287 PNVTLNFEGGAMELQPDN-YLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDI 345
Query: 331 QQQDTRFVYDLNTGTIQFVPENC 353
+D VYDL+ I ++ NC
Sbjct: 346 VLKDKLVVYDLDNSRIGWMSYNC 368
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 141/326 (43%), Gaps = 39/326 (11%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSAPI-----FNPNASSTYKRI 60
Y D+ GTP+ ++ DTGS W + C C ++S + ++P +S + K +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 61 PCDDLIC-RRPPFRCENGQCVHRINYAGGASASGLVSTETFTFH-LKNKLVCVP---GVI 115
CDD IC RPP +C + YA G G++ T+ +H L P V
Sbjct: 118 KCDDTICTSRPPCNMTL-RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 176
Query: 116 FGC------SNDNRDFSFDGNIAGILGFSVSPFSLLGQLKSTA--QGLFSYCLVYAYREM 167
FGC S +N + D GI+GF S + L QL + + +FS+CL
Sbjct: 177 FGCGLQQSGSLNNSAVAID----GIIGFGNSNQTALSQLAAAGKTKKIFSHCL----DST 228
Query: 168 EATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNGT 227
I G+ + +KT + + ++ ++L+ I+VA + F + T
Sbjct: 229 NGGGIFAIGE---VVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK--T 283
Query: 228 GGCMIDTGAIATFIQRGPY-EVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFRAYA 286
G ID+G+ ++ Y E+++ F +H M+N + + + D +F
Sbjct: 284 KGTFIDSGSTLVYLPEIIYSELILAVFAKH-PDITMGAMYNF-QCFHFLGSVDDKF---P 338
Query: 287 SMTFHFDRADFKVEPTYMYFIFQNEG 312
+TFHF+ D ++ ++ + EG
Sbjct: 339 KITFHFEN-DLTLDVYPYDYLLEYEG 363
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 150/383 (39%), Gaps = 50/383 (13%)
Query: 6 FYTVDVLFGTPSKSEFLLFDTGSYLIWTQCLPCVNCFNQSA-----PIFNPNASSTYKRI 60
Y V G P K + DTGS ++W C PC C +SA +++P SST +
Sbjct: 28 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 87
Query: 61 PCDDLICRRPPFRCENGQCVHRIN-------YAGGASASGLVSTETFTFHL--KNKLV-C 110
C D +C R R QC N Y G+++ G + +++ N L
Sbjct: 88 SCSDPLCVR-GRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146
Query: 111 VPGVIFGCS-NDNRDFSFDGN-IAGILGFSVSPFSLLGQLKSTAQ--GLFSYCLVYAYRE 166
V+FGCS D S + GI+GF S+ QL + +FS+CL R
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 206
Query: 167 MEATSILRFGKDANIQRKDMKTIRMFVDRSSHYYLSLQDISVADHRIGFAPGTFALRRNG 226
I + T V S HY + L+ ISV +R+ F+ N
Sbjct: 207 GGILVIGGIAEPG-------MTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFS-STND 258
Query: 227 TGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNASEDWEYCYRYDSRFR-AY 285
T G ++D+G + G Y V ++ E TS R+ C+ R +
Sbjct: 259 T-GVIMDSGTTLAYFPSGAYNVFVQAIREA-TSATPVRVQGMDTQ---CFLVSGRLSDLF 313
Query: 286 ASMTFHFDRADFKVEPTYMYFIFQNEG------YFCVAISFSDRN---------SVVGAW 330
++T +F+ +++P Y ++ +C+ S + +++G
Sbjct: 314 PNVTLNFEGGAMELQPDN-YLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDI 372
Query: 331 QQQDTRFVYDLNTGTIQFVPENC 353
+D VYDL+ I ++ NC
Sbjct: 373 VLKDKLVVYDLDNSRIGWMSYNC 395
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 150/390 (38%), Gaps = 56/390 (14%)
Query: 4 NYFYTVDVLFGTPSKSEFLLFDTGSYLIWTQC---------LPCVNCFNQSAPIFNPNAS 54
N TV + GTP ++ ++ DTGS L W C +S F P AS
Sbjct: 60 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGES---FRPRAS 116
Query: 55 STYKRIPCDDLICRR----PPFRCENG--QCVHRINYAGGASASGLVSTETFTFHLKNKL 108
+T+ +PC C P C+ QC ++YA G+++ G ++T+ F L
Sbjct: 117 ATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPL 176
Query: 109 VCVPGVIFGCSNDNRDFSFDG-NIAGILGFSVSPFSLLGQLKSTAQGLFSYCLVYAYREM 167
FGC + D S DG AG+LG + S + Q ST + FSYC+ +
Sbjct: 177 RSA----FGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQ-ASTRR--FSYCI----SDR 225
Query: 168 EATSILRFGKD------ANIQRKDMKTIRM-FVDRSSHYYLSLQDISVADHRIGFAPGTF 220
+ +L G N T+ + + DR + Y + L I V +
Sbjct: 226 DDAGVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVA-YSVQLLGIRVGGKALPIPASVL 284
Query: 221 ALRRNGTGGCMIDTGAIATFIQRGPYEVVMRHFDEHFTSFGRQRMHNAS----EDWEYCY 276
A G G M+D+G TF+ Y + F + R + + S E + C+
Sbjct: 285 APDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRA-LDDPSFAFQEALDTCF 343
Query: 277 RYDSRFRAYAS----MTFHFDRADFKVEPTYMYFIFQNE-----GYFCVAISFSDRNS-- 325
R + ++ +T F+ A+ V + + E G +C+ +D
Sbjct: 344 RVPAGRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLT 403
Query: 326 --VVGAWQQQDTRFVYDLNTGTIQFVPENC 353
V+G Q + YDL G + P C
Sbjct: 404 AYVIGHHHQMNLWVEYDLERGRVGLAPVKC 433
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.327 0.140 0.449
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,942,402,038
Number of Sequences: 23463169
Number of extensions: 248718123
Number of successful extensions: 545092
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 950
Number of HSP's successfully gapped in prelim test: 1288
Number of HSP's that attempted gapping in prelim test: 539754
Number of HSP's gapped (non-prelim): 2600
length of query: 359
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 216
effective length of database: 9,003,962,200
effective search space: 1944855835200
effective search space used: 1944855835200
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 77 (34.3 bits)