BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047238
(446 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 280 bits (717), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 174/449 (38%), Positives = 251/449 (55%), Gaps = 22/449 (4%)
Query: 9 LAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
LA+ F Y ++L L HF S+ GFSL+++ +S ESP YPGN++ ERI ++ E+SK RA
Sbjct: 5 LASPFVYLTILSLIHFAISKPDGFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRA 64
Query: 69 NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC 128
+ +A ++ + F E L +++ D Y V+V IG+P P +L+ DT S L WTQC+PC
Sbjct: 65 HNLA-ITTSSGFSP-EAFRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPC 122
Query: 129 IRCFDQTTPIFDPRASTTYSEIPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRGLAS 185
R F Q PIF+ AS TY ++PC C ++ F+C++ KCVY Y G T G+A+
Sbjct: 123 TRRFRQLPPIFNSTASRTYRDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAA 182
Query: 186 RETFAFPVRNGFTFVPRLAFGCSNDN---SGFAFGGKISGILGFNASPLSLSSQLRNRIQ 242
++ + F FGCS DN S F GK GI+G N SP+SL Q+ + +
Sbjct: 183 QDILQSAENDRIPFY----FGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITK 238
Query: 243 GLFSYC-----LVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
FSYC L ATS+++FG D RR +TP + P+++L+L+++S+
Sbjct: 239 NRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAG 298
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ ++ PPG F + DGTGG IID+GT VT+I Y ++ + G QR+ S
Sbjct: 299 NRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLS 358
Query: 358 QEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSI 414
CY+ +F YPSM FH Q AD+ V+PE +Y DRG FCVA+Q + +I
Sbjct: 359 GYI--CYKQQGHTFHNYPSMAFHFQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTI 416
Query: 415 LGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+GA Q N IYD L F ENC +
Sbjct: 417 IGALNQANTQFIYDAANRQLLFTPENCQD 445
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 166/455 (36%), Positives = 233/455 (51%), Gaps = 56/455 (12%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
M Q P AA Y ++L L F +S+ GF L+LI SPESP YPG L+ SERI ++
Sbjct: 1 MPLCQNFPSAAPLLYLAILSLLSFATSKPNGFRLQLIHRDSPESPFYPGKLTNSERISRL 60
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
E SK RA+ S AF+ P+ + Y V+V IG P P +L+ DT S+L
Sbjct: 61 VEFSKIRAHNFDSGFSSEAFRP------PVFQDFTCYLVKVRIGNPGIPLYLVPDTGSAL 114
Query: 121 VWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVT 180
+WT I F+C+N KC YTRRY G +T
Sbjct: 115 IWTVNNQNI-------------------------------FQCRNNKCSYTRRYDDGSIT 143
Query: 181 RGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF---GGKISGILGFNASPLSLSSQL 237
G+A+++ G +P FGCS DN F+ GK G++G N SP+SL QL
Sbjct: 144 TGVAAQDILQ---SEGSERIP-FYFGCSRDNQNFSVFEHTGKSGGVMGLNTSPVSLLQQL 199
Query: 238 RNRIQGLFSYCLV-----REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE 292
+ Q FSYCL E +S+++FG D RR ++TP++ S RP+++L+LL+
Sbjct: 200 SHITQRRFSYCLNPYQHGSEPPPSSLLRFGNDIRKGRRRFQSTPLMSSPDRPNYFLNLLD 259
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
+++ + PPG F + +DGTGG IID+GT +TFI Y L+ + G QR+
Sbjct: 260 MTVAGQRLHLPPGTFALRQDGTGGTIIDSGTGLTFITQTAYPRLISAFQNYFDHRGFQRV 319
Query: 353 PYNASQEFDYCY--RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP 410
EFD CY R + +F + SMTFH + AD+ VQ + +Y D FCVA+Q P
Sbjct: 320 ---HIPEFDLCYSFRGNHTFHDHASMTFHFERADFTVQADYVYLPMEDDNAFCVALQPTP 376
Query: 411 --KYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ +++GA Q N IYD L F +ENC N
Sbjct: 377 PQQRTVIGAINQGNTRFIYDAAAHQLLFIAENCRN 411
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 137/419 (32%), Positives = 215/419 (51%), Gaps = 23/419 (5%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
G + L+ SP SP PGN+S +ER + + S+ R + + E++ + P+
Sbjct: 54 GLRIDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQM-----SVDEVKAVEAPV 108
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ + +++ IGTP + DT S L WTQC+PC C+ Q TPI+DP S+TYS++
Sbjct: 109 YAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKV 168
Query: 151 PCDDPLCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PC +C++ + C C Y Y T+G+ S E+F ++ +P +AFGC
Sbjct: 169 PCSSSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQS----LPHIAFGCG 224
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA---TSVIKFGRDA 265
+N + G++GF PLSL SQL + FSYCLV ++ TS + G+ A
Sbjct: 225 QENE-GGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTA 283
Query: 266 DVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+ + + +TP++ S RP F YL L IS+G ++ G FD+ DGTGG IID+GT
Sbjct: 284 SLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTT 343
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY--RYDSSFKAYPSMTFHLQE 382
VT++ Y + + ++ S+ ++ ++ D C+ + SS +P++TFH +
Sbjct: 344 VTYLEQSGYDVVKKA---VISSINLPQVD-GSNIGLDLCFEPQSGSSTSHFPTITFHFEG 399
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
AD+ + EN Y G C+A+ SI G QQQN I+YD L F C
Sbjct: 400 ADFNLPKEN-YIYTDSSGIACLAMLPSNGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 123/397 (30%), Positives = 203/397 (51%), Gaps = 20/397 (5%)
Query: 49 GNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
GN ++ ER+ + + K R +++ K +F+ + P+ + + +++ IGTP +
Sbjct: 53 GNYTKFERLQRAMKRGKLRLQRLSA--KTASFES--SVEAPVHAGNGEFLMKLAIGTPAE 108
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQNGK 167
+ DT S L+WTQC+PC CFDQ TPIFDP+ S+++S++PC LC + P +
Sbjct: 109 TYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSDG 168
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C Y Y T+G+ + ETFAF G V ++ FGC DN G F + +G++G
Sbjct: 169 CEYLYSYGDYSSTQGVLATETFAF----GDASVSKIGFGCGEDNDGSGF-SQGAGLVGLG 223
Query: 228 ASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHF- 286
PLSL SQL + FSYCL ++ + ++ ++ TTP++ + +P F
Sbjct: 224 RGPLSLISQLG---EPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFY 280
Query: 287 YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS 346
YL L IS+G ++ F I DG+GG IID+GT +T++ + + L + + L
Sbjct: 281 YLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQL-- 338
Query: 347 LGRQRIPYNASQEFDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCV 404
+ + + S D C+ D+S P + FH + AD + EN + G C+
Sbjct: 339 --KLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGADLKLPAENYIIADSGLGVICL 396
Query: 405 AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ SI G +QQQN+++++DL + F C
Sbjct: 397 TMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 142/432 (32%), Positives = 211/432 (48%), Gaps = 36/432 (8%)
Query: 21 LTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAF 80
L H ++ TGF + L + S + NL++ + + + E R + +M
Sbjct: 30 LNHRHEAKVTGFQIMLEHVDSGK------NLTKFQLLERAIERGSRRLQRLEAM-----L 78
Query: 81 QELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
+ + D Y + ++IGTP +P + DT S L+WTQCQPC +CF+Q+TPIF+
Sbjct: 79 NGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN 138
Query: 141 PRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
P+ S+++S +PC LC+ S C N C YT Y G T+G ET F G
Sbjct: 139 PQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTF----GSV 194
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-S 257
+P + FGC +N GF G +G++G PLSL SQL FSYC+ +T S
Sbjct: 195 SIPNITFGCGENNQGFGQGNG-AGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPS 250
Query: 258 VIKFGRDAD-VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI-MRDGTG 315
+ G A+ V TT I S + +Y+ L +S+G + P AF + +GTG
Sbjct: 251 NLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTG 310
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRY--DSSFK 371
G IID+GT +T+ N YQ++ Q + + + +P +S FD C++ D S
Sbjct: 311 GIIIDSGTTLTYFVNNAYQSVRQEF------ISQINLPVVNGSSSGFDLCFQTPSDPSNL 364
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLN 430
P+ H D + EN YFI P G C+A+ + SI G QQQNML++YD
Sbjct: 365 QIPTFVMHFDGGDLELPSEN-YFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTG 423
Query: 431 VPALRFGSENCA 442
+ F S C
Sbjct: 424 NSVVSFASAQCG 435
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 139/431 (32%), Positives = 208/431 (48%), Gaps = 34/431 (7%)
Query: 21 LTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAF 80
L H + GF + L + S + NL++ E + + E R + +M
Sbjct: 30 LNHHHEPKVAGFQIMLEHVDSGK------NLTKFELLERAVERGSRRLQRLEAM-----L 78
Query: 81 QELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
+ P+ D Y + ++IGTP +P + DT S L+WTQCQPC +CF+Q+TPIF+
Sbjct: 79 NGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN 138
Query: 141 PRASTTYSEIPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
P+ S+++S +PC LC +SP C N C YT Y G T+G ET F G
Sbjct: 139 PQGSSSFSTLPCSSQLCQALQSP-TCSNNSCQYTYGYGDGSETQGSMGTETLTF----GS 193
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-T 256
+P + FGC +N GF G +G++G PLSL SQL FSYC+ + +
Sbjct: 194 VSIPNITFGCGENNQGFGQGNG-AGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSNS 249
Query: 257 SVIKFGRDAD-VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI-MRDGT 314
S + G A+ V TT I S + +Y+ L +S+G + P F + +GT
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY--DSSFKA 372
GG IID+GT +T+ + YQ + Q + + + +S FD C++ D S
Sbjct: 310 GGIIIDSGTTLTYFVDNAYQAVRQAFISQM----NLSVVNGSSSGFDLCFQMPSDQSNLQ 365
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNV 431
P+ H D ++ EN YFI P G C+A+ + SI G QQQN+L++YD
Sbjct: 366 IPTFVMHFDGGDLVLPSEN-YFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGN 424
Query: 432 PALRFGSENCA 442
+ F S C
Sbjct: 425 SVVSFLSAQCG 435
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 139/431 (32%), Positives = 207/431 (48%), Gaps = 34/431 (7%)
Query: 21 LTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAF 80
L H + GF + L + S + NL++ E + + E R + +M
Sbjct: 30 LNHHHEPKVAGFQIMLEHVDSGK------NLTKFELLERAVERGSRRLQRLEAM-----L 78
Query: 81 QELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
+ P+ D Y + ++IGTP +P + DT S L+WTQCQPC +CF+Q+TPIF+
Sbjct: 79 NGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN 138
Query: 141 PRASTTYSEIPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
P+ S+++S +PC LC +SP C N C YT Y G T+G ET F G
Sbjct: 139 PQGSSSFSTLPCSSQLCQALQSP-TCSNNSCQYTYGYGDGSETQGSMGTETLTF----GS 193
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT- 256
+P + FGC +N GF G +G++G PLSL SQL FSYC+ +T
Sbjct: 194 VSIPNITFGCGENNQGFGQGNG-AGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTS 249
Query: 257 SVIKFGRDAD-VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI-MRDGT 314
S + G A+ V TT I S + +Y+ L +S+G + P F + +GT
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY--DSSFKA 372
GG IID+GT +T+ + YQ + Q + + + +S FD C++ D S
Sbjct: 310 GGIIIDSGTTLTYFADNAYQAVRQAFISQM----NLSVVNGSSSGFDLCFQMPSDQSNLQ 365
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNV 431
P+ H D ++ EN YFI P G C+A+ + SI G QQQN+L++YD
Sbjct: 366 IPTFVMHFDGGDLVLPSEN-YFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGN 424
Query: 432 PALRFGSENCA 442
+ F C
Sbjct: 425 SVVSFLFAQCG 435
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 127/406 (31%), Positives = 207/406 (50%), Gaps = 24/406 (5%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELED-IHLPMAKQDLFYSVEVNIGTPMK 108
NL++ ER+ + K R + + +M A + D + P+ + + +++ IG+P +
Sbjct: 63 NLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPR 122
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNG 166
+ DT S L+WTQC+PC +CFDQ+TPIFDP+ S+++ +I C LC + C +
Sbjct: 123 SFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSD 182
Query: 167 KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILG 225
C Y Y T+G+ + ETF F +P L FGC NDN+G F + +G++G
Sbjct: 183 GCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGF-SQGAGLVG 241
Query: 226 FNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADV----RRRDLETTPILLS 280
PLSL SQL+ + F+YCL + S + G A++ + +++TTP++ +
Sbjct: 242 LGRGPLSLVSQLKEQK---FAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKN 298
Query: 281 DLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR 339
+P F YL L IS+G + P F++ DG+GG IID+GT +T++ N + +L
Sbjct: 299 PSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNE 358
Query: 340 YDQILRSLGRQRIPYNASQE--FDYCYRYDSSFK--AYPSMTFHLQEADYIVQPENMYFI 395
+ + + +P + S D C+ + P +TFH + AD + EN
Sbjct: 359 F------IAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIG 412
Query: 396 EPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ G C+AI SI G QQQN ++++DL L F C
Sbjct: 413 DSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 458
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 150/443 (33%), Positives = 221/443 (49%), Gaps = 32/443 (7%)
Query: 13 FSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMA 72
S FS+ F+ F+ + S GFS++LI SP+SP Y +E ++ F + AR
Sbjct: 9 LSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYY----KPTENKYQHF-VDAARR---- 59
Query: 73 SMSKPNAFQELEDIHLPMAK---QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI 129
S+++ N F + D P + Y + ++GTP + + DT S +VW QC+PC
Sbjct: 60 SINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCE 119
Query: 130 RCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQN-GKCVYTRRYHVGDVTRGLASR 186
+C++QTTPIF+P S++Y IPC LC S C + C Y Y ++G S
Sbjct: 120 QCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSV 179
Query: 187 ETFAFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLF 245
+T + +G P++ GC DN+G FGG SGI+G P+SL +QL + I G F
Sbjct: 180 DTLSLESTSGSPVSFPKIVIGCGTDNAG-TFGGASSGIVGLGGGPVSLITQLGSSIGGKF 238
Query: 246 SYCLV----REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIV 300
SYCLV +E A+S++ FG A V + +TP++ D P FY L L S+G V
Sbjct: 239 SYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKD--PVFYFLTLQAFSVGNKRV 296
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF 360
F G D G IID+GT +T I + Y L +++ L R P +Q+F
Sbjct: 297 EF--GGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVK-LDRVDDP---NQQF 350
Query: 361 DYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-SILGAWQ 419
CY S+ +P +T H + AD + + F+ G C A Q P+ SI G
Sbjct: 351 SLCYSLKSNEYDFPIITVHFKGADVELHSIST-FVPITDGIVCFAFQPSPQLGSIFGNLA 409
Query: 420 QQNMLIIYDLNVPALRFGSENCA 442
QQN+L+ YDL + F +C
Sbjct: 410 QQNLLVGYDLQQKTVSFKPTDCT 432
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/406 (31%), Positives = 208/406 (51%), Gaps = 24/406 (5%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELED-IHLPMAKQDLFYSVEVNIGTPMK 108
NL++ ER+ + K R + + +M A + D + P+ + + +++ IG+P +
Sbjct: 318 NLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPR 377
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNG 166
+ DT S L+WTQC+PC +CFDQ+TPIFDP+ S+++ +I C LC + C +
Sbjct: 378 SFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSD 437
Query: 167 KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILG 225
C Y Y T+G+ + ETF F +P L FGC NDN+G F + +G++G
Sbjct: 438 GCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGF-SQGAGLVG 496
Query: 226 FNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADV----RRRDLETTPILLS 280
PLSL SQL+ + F+YCL ++ S + G A++ + +++TTP++ +
Sbjct: 497 LGRGPLSLVSQLKEQ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKN 553
Query: 281 DLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR 339
+P F YL L IS+G + P F++ DG+GG IID+GT +T++ N + +L
Sbjct: 554 PSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNE 613
Query: 340 YDQILRSLGRQRIPYNASQE--FDYCYRYDSSFK--AYPSMTFHLQEADYIVQPENMYFI 395
+ + + +P + S D C+ + P +TFH + AD + EN
Sbjct: 614 F------IAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIG 667
Query: 396 EPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ G C+AI SI G QQQN ++++DL L F C
Sbjct: 668 DSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 713
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/374 (32%), Positives = 194/374 (51%), Gaps = 32/374 (8%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+ +P+ + + ++++IGTP + DT S LVWTQC+PC+ CF+Q+TP+FDP +S+
Sbjct: 91 LQVPVHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSS 150
Query: 146 TYSEIPCDDPLCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
TY+ +PC LC KC + KC YT Y T+G+ + ETF T +P +
Sbjct: 151 TYAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTL----AKTKLPDV 206
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCLVR-EMEATSVIKF 261
AFGC + N G F + +G++G PLSL SQL N+ FSYCL + + S +
Sbjct: 207 AFGCGDTNEGDGF-TQGAGLVGLGRGPLSLVSQLGLNK----FSYCLTSLDDTSKSPLLL 261
Query: 262 GRDADV-----RRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
G A + ++TTP++ + +P F Y++L +++G + P AF + DGTG
Sbjct: 262 GSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTG 321
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPY--NASQEFDYCYRYDSSF-- 370
G I+D+GT +T++ +Q Y + ++ Q ++P + D C+ +S
Sbjct: 322 GVIVDSGTSITYLE-------LQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVD 374
Query: 371 -KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDL 429
P + FHL AD + EN ++ G C+ + SI+G +QQQN+ +YD+
Sbjct: 375 QVEVPKLVFHLDGADLDLPAENYMVLDSGSGALCLTVMGSRGLSIIGNFQQQNIQFVYDV 434
Query: 430 NVPALRFGSENCAN 443
L F CA
Sbjct: 435 GENTLSFAPVQCAK 448
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 131/405 (32%), Positives = 202/405 (49%), Gaps = 23/405 (5%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
++++ + + SKAR + S++ A + + + + Y + + IGTP +
Sbjct: 44 TEAQLLSRAVRRSKARVAALQSLATTTAADAITVARILVLASEGEYLMSMGIGTPPRYYS 103
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK--CQNGKCV 169
+ DT S L+WTQC PC+ C DQ TP FDP S +Y+++PC+ P+C + + C CV
Sbjct: 104 AILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPMCNALYYPLCYRNVCV 163
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y T G+ S ETF F + VPR+AFGC N N+G F G SG++GF
Sbjct: 164 YQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGSLFNG--SGMVGFGRG 221
Query: 230 PLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRD------LETTPILLSDL 282
PLSL SQL + FSYCL M S + FG A + +++TP +++
Sbjct: 222 PLSLVSQLGSP---RFSYCLTSFMSPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPG 278
Query: 283 RP-HFYLHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
P +YL++ IS+G ++ P F I DGTGG IID+G+ +T++ Y + Q +
Sbjct: 279 LPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAF 338
Query: 341 -DQILRSLGRQRIPYNASQEFDYCYRYDS---SFKAYPSMTFHLQEADYIVQPENMYFIE 396
DQ+ L + + D C+ + P + FH + A+ + EN I+
Sbjct: 339 ADQVGLPLTNAT---SLADVLDTCFVWPPPPRKIVTMPELAFHFEGANMELPLENYMLID 395
Query: 397 PDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D G C+AI SI+G++Q QN ++YD L F C
Sbjct: 396 GDTGNLCLAIAASDDGSIIGSFQHQNFHVLYDNENSLLSFTPATC 440
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 150/443 (33%), Positives = 220/443 (49%), Gaps = 32/443 (7%)
Query: 13 FSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMA 72
S FS+ F+ F+ + S GFS++LI SP+SP Y +E ++ F + AR
Sbjct: 9 LSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYY----KPTENKYQHF-VDAARR---- 59
Query: 73 SMSKPNAFQELEDIHLPMAK---QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI 129
S+++ N F + D P + Y + ++GTP + + DT S +VW QC+PC
Sbjct: 60 SINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCE 119
Query: 130 RCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQN-GKCVYTRRYHVGDVTRGLASR 186
+C++QTTPIF+P S++Y IPC LC S C + C Y Y ++G S
Sbjct: 120 QCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSV 179
Query: 187 ETFAFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLF 245
+T + +G P+ GC DN+G FGG SGI+G P+SL +QL + I G F
Sbjct: 180 DTLSLESTSGSPVSFPKTVIGCGTDNAG-TFGGASSGIVGLGGGPVSLITQLGSSIGGKF 238
Query: 246 SYCLV----REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIV 300
SYCLV +E A+S++ FG A V + +TP++ D P FY L L S+G V
Sbjct: 239 SYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKD--PVFYFLTLQAFSVGNKRV 296
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF 360
F G D G IID+GT +T I + Y L +++ L R P +Q+F
Sbjct: 297 EF--GGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVK-LDRVDDP---NQQF 350
Query: 361 DYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-SILGAWQ 419
CY S+ +P +T H + AD + + F+ G C A Q P+ SI G
Sbjct: 351 SLCYSLKSNEYDFPIITAHFKGADIELHSIST-FVPITDGIVCFAFQPSPQLGSIFGNLA 409
Query: 420 QQNMLIIYDLNVPALRFGSENCA 442
QQN+L+ YDL + F +C
Sbjct: 410 QQNLLVGYDLQQKTVSFKPTDCT 432
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 133/404 (32%), Positives = 207/404 (51%), Gaps = 24/404 (5%)
Query: 50 NLSQSERIHKMFEISKAR-----ANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIG 104
NL++ ER+ + K+R A +A+ S P++ +LE P+ + Y +E+ IG
Sbjct: 59 NLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLE---APIHAGNGEYLIELAIG 115
Query: 105 TPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFK 162
TP + DT S L+WTQC+PC RC+ Q TPIFDP+ S+++S++ C LC +
Sbjct: 116 TPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSST 175
Query: 163 CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
C +G C Y Y +T+G+ + ETF F V + FGC DN G F + SG
Sbjct: 176 CSDG-CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGF-EQASG 233
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVR-RRDLETTPILLS 280
++G PLSL SQL+ + FSYCL + SV+ G V+ +++ TTP+L +
Sbjct: 234 LVGLGRGPLSLVSQLKEQ---RFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKN 290
Query: 281 DLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR 339
L+P F YL L IS+G + F++ DG GG IID+GT +T+++ Y+ L +
Sbjct: 291 PLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKE 350
Query: 340 YDQILRSLGRQRIPYNASQEFDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEP 397
+ S + + +S D C+ S+ P + FH + D + EN +
Sbjct: 351 F----ISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLELPAENYMIGDS 406
Query: 398 DRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ G C+A+ SI G QQQN+L+ +DL + F +C
Sbjct: 407 NLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 209/404 (51%), Gaps = 34/404 (8%)
Query: 49 GNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
GN ++ ER+ + + + R +++ K +F+ + P+ + + + + IGTP +
Sbjct: 53 GNYTKFERLQRAVKRGRLRLQRLSA--KTASFEP--SVEAPVHAGNGEFLMNLAIGTPAE 108
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQNGK 167
+ DT S L+WTQC+PC CFDQ TPIFDP S+++S++PC LC + P +
Sbjct: 109 TYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG 168
Query: 168 CVYTRRYHVGD--VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILG 225
C Y RY GD T+G+ + ETF F G V ++ FGC DN G A+ + +G++G
Sbjct: 169 CEY--RYSYGDHSSTQGVLATETFTF----GDASVSKIGFGCGEDNRGRAY-SQGAGLVG 221
Query: 226 FNASPLSLSSQLRNRIQGL--FSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSD 281
PLSL SQL G+ FSYCL + + + S + G +A V+ TP++ +
Sbjct: 222 LGRGPLSLISQL-----GVPKFSYCLTSIDDSKGISTLLVGSEATVKSA--IPTPLIQNP 274
Query: 282 LRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
RP F YL L IS+G ++ F I DG+GG IID+GT +T++++ + L + +
Sbjct: 275 SRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEF 334
Query: 341 DQILRSLGRQRIPYNASQEFDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPD 398
+ + + + S E + C+ D S P + FH + D + EN Y IE
Sbjct: 335 ISQM----KLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLKLPKEN-YIIEDS 389
Query: 399 RGR-FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
R C+ + SI G +QQQN+++++DL + F C
Sbjct: 390 ALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 138/397 (34%), Positives = 202/397 (50%), Gaps = 29/397 (7%)
Query: 57 IHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQ--DLFYSVEVNIGTPMKPQHLLF 114
+ + + S+ R + S N Q ++DI P+ Y +++ IGTP +
Sbjct: 1 MKRAIQRSQERLEKLQITSAVNTHQ-MKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIM 59
Query: 115 DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP--FKCQN-GKCVYT 171
DT S LVWT+C PC C T+ I+DP +S+TYS++ C LC+ P F C N G C Y
Sbjct: 60 DTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLCQSSLCQPPSIFSCNNDGDCEYV 117
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
Y T G+ S ETF+ ++ +P + FGC +DN GF K+ G++GF L
Sbjct: 118 YPYGDRSSTSGILSDETFSISSQS----LPNITFGCGHDNQGFD---KVGGLVGFGRGSL 170
Query: 232 SLSSQLRNRIQGLFSYCLVREMEA--TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLH 289
SL SQL + FSYCLV ++ TS + G A + + +TP++ S H+YL
Sbjct: 171 SLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLS 230
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L IS+G + P G FDI DG+GG IID+GT +TF++ Y + + ++ S+
Sbjct: 231 LEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEA---MVSSI-- 285
Query: 350 QRIPYNASQEFDYCY-RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI-- 406
+P A + D C+ + SS +PSMTFH + ADY V EN F + C+A+
Sbjct: 286 -NLP-QADGQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLFPDSTSDIVCLAMMP 343
Query: 407 --QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ +I G QQQN I+YD L F C
Sbjct: 344 TNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 209/404 (51%), Gaps = 34/404 (8%)
Query: 49 GNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
GN ++ ER+ + + + R +++ K +F+ + P+ + + + + IGTP +
Sbjct: 53 GNYTKFERLQRAVKRGRLRLQRLSA--KTASFEP--SVEAPVHAGNGEFLMNLAIGTPAE 108
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQNGK 167
+ DT S L+WTQC+PC CFDQ TPIFDP S+++S++PC LC + P +
Sbjct: 109 TYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG 168
Query: 168 CVYTRRYHVGD--VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILG 225
C Y RY GD T+G+ + ETF F G V ++ FGC DN G A+ + +G++G
Sbjct: 169 CEY--RYSYGDHSSTQGVLATETFTF----GDASVSKIGFGCGEDNRGRAY-SQGAGLVG 221
Query: 226 FNASPLSLSSQLRNRIQGL--FSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSD 281
PLSL SQL G+ FSYCL + + + S + G +A V+ TP++ +
Sbjct: 222 LGRGPLSLISQL-----GVPKFSYCLTSIDDSKGISTLLVGSEATVKSA--IPTPLIQNP 274
Query: 282 LRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
RP F YL L IS+G ++ F I DG+GG IID+GT +T++++ + L + +
Sbjct: 275 SRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEF 334
Query: 341 DQILRSLGRQRIPYNASQEFDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPD 398
+ + + + S E + C+ D S P + FH + D + EN Y IE
Sbjct: 335 ISQM----KLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKEN-YIIEDS 389
Query: 399 RGR-FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
R C+ + SI G +QQQN+++++DL + F C
Sbjct: 390 ALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 194/404 (48%), Gaps = 32/404 (7%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
NL++ E I + + + R + +M Q I P+ D Y + V IGTP
Sbjct: 54 NLTKYELIKRAIKRGERRMRSINAM-----LQSSSGIETPVYAGDGEYLMNVAIGTPDSS 108
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGK 167
+ DT S L+WTQC+PC +CF Q TPIF+P+ S+++S +PC+ C+ C N +
Sbjct: 109 FSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNE 168
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C YT Y G T+G + ETF F + VP +AFGC DN GF G +G++G
Sbjct: 169 CQYTYGYGDGSTTQGYMATETFTFETSS----VPNIAFGCGEDNQGFGQGNG-AGLIGMG 223
Query: 228 ASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRDLETTPILLSDLRP-H 285
PLSL SQL G FSYC+ ++ S + G A +T ++ S L P +
Sbjct: 224 WGPLSLPSQLG---VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTY 280
Query: 286 FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY-DQIL 344
+Y+ L I++G + P F + DGTGG IID+GT +T++ Y + Q + DQI
Sbjct: 281 YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI- 339
Query: 345 RSLGRQRIPY--NASQEFDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRG 400
+P +S C++ D S P ++ + +N+ I P G
Sbjct: 340 ------NLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNI-LISPAEG 392
Query: 401 RFCVAIQDDPKY--SILGAWQQQNMLIIYDLNVPALRFGSENCA 442
C+A+ + SI G QQQ ++YDL A+ F C
Sbjct: 393 VICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 436
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 136/439 (30%), Positives = 206/439 (46%), Gaps = 48/439 (10%)
Query: 30 TGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASM------SKPNAFQEL 83
+GF L L + S + NL++ ++I + R N + ++ SKP+ +
Sbjct: 43 SGFRLSLRHVDSGK------NLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPD---DT 93
Query: 84 EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRA 143
+I P + +E++IG P + DT S L+WTQC+PC CFDQ TPIFDP
Sbjct: 94 NNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEK 153
Query: 144 STTYSEIPCDDPLC----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
S++YS++ C LC RS C Y Y TRGL + ETF F N
Sbjct: 154 SSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS--- 210
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATS 257
+ + FGC +N G F + SG++G PLSL SQL+ + FSYCL + + EA+S
Sbjct: 211 ISGIGFGCGVENEGDGF-SQGSGLVGLGRGPLSLISQLK---ETKFSYCLTSIEDSEASS 266
Query: 258 VIKFGR---------DADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAF 307
+ G A + +T +L + +P F YL L I++G + F
Sbjct: 267 SLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTF 326
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN--ASQEFDYCYR 365
++ DGTGG IID+GT +T++ ++ L + + R +P + S D C++
Sbjct: 327 ELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTS------RMSLPVDDSGSTGLDLCFK 380
Query: 366 YDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNM 423
+ K A P M FH + AD + EN + G C+A+ SI G QQQN
Sbjct: 381 LPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNF 440
Query: 424 LIIYDLNVPALRFGSENCA 442
+++DL + F C
Sbjct: 441 NVLHDLEKETVSFVPTECG 459
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 130/403 (32%), Positives = 208/403 (51%), Gaps = 23/403 (5%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELED-IHLPMAKQDLFYSVEVNIGTPMK 108
NL++ ER+ + K+R + +M + + ED + P+ + Y +E+ IGTP
Sbjct: 60 NLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGEYLMELAIGTPPV 119
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNG 166
+ DT S L+WTQC+PC +C+ Q TPIFDP+ S+++S++ C LC + C +G
Sbjct: 120 SYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPSSTCSDG 179
Query: 167 KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGF 226
C Y Y +T+G+ + ETF F V + FGC DN G F + SG++G
Sbjct: 180 -CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGF-EQASGLVGL 237
Query: 227 NASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVR-RRDLETTPILLSDLR 283
PLSL SQL+ + FSYCL M+ T S++ G V+ +++ TTP+L + L+
Sbjct: 238 GRGPLSLVSQLK---EPRFSYCLT-PMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQ 293
Query: 284 PHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
P F YL L IS+G + F++ DG GG IID+GT +T+I ++ L + +
Sbjct: 294 PSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEF-- 351
Query: 343 ILRSLGRQRIPYN--ASQEFDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPD 398
+ + ++P + +S D C+ S+ P + FH + D + EN + +
Sbjct: 352 ----ISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSN 407
Query: 399 RGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+A+ SI G QQQN+L+ +DL + F +C
Sbjct: 408 LGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 128/404 (31%), Positives = 199/404 (49%), Gaps = 24/404 (5%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
++++ + + S+AR + S++ I L ++ + Y ++V IG+P +
Sbjct: 45 TKAQLLSRAVARSRARVAALQSLATAADAITAARILLRFSEGE--YLMDVGIGSPPRYFS 102
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK--CQNGKCV 169
+ DT S L+WTQC PC+ C +Q TP F+P ST+Y+ +PC +C + + C CV
Sbjct: 103 AMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYSPLCFQNACV 162
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y + G+ + ETF F + VPR++FGC N N+G F G SG++GF
Sbjct: 163 YQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNG--SGMVGFGRG 220
Query: 230 PLSLSSQLRNRIQGLFSYCLVREME-ATSVIKFGRDADVRRRD------LETTPILLSDL 282
LSL SQL + FSYCL M ATS + FG A + + +++TP +++
Sbjct: 221 ALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPA 277
Query: 283 RPHFY-LHLLEISIGRHIVRFPPGAFDIMR-DGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
P Y L++ IS+ ++ P F I DGTGG IID+GT VTF+ Y + +
Sbjct: 278 LPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAF 337
Query: 341 DQILRSLGRQRIPYNASQEFDYCYRYDSS---FKAYPSMTFHLQEADYIVQPENMYFIEP 397
+ +G R S FD C+++ P M H AD + EN ++
Sbjct: 338 ---VAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMVMDG 394
Query: 398 DRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+A+ SI+G++Q QN ++YDL L F C
Sbjct: 395 GTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFVPAPC 438
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 144/461 (31%), Positives = 228/461 (49%), Gaps = 45/461 (9%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSE--------STGFSLKLIPIFSPESPLYPGNLS 52
MA++ +L L + F+ +F F++S GF KL + S + NL+
Sbjct: 1 MANMSSLSLVVALAIFAFVFSHAFSTSRRVLEHPKVQNGFRAKLKHVDSGK------NLT 54
Query: 53 QSERIHKMFEISKARA---NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
+ ERI + + R MA ++ N+ +I P+ + + +++ IGTP +
Sbjct: 55 KFERIQHGVKRGRHRLQRFKAMALVASSNS-----EIDAPVLPGNGEFLMKLAIGTPPET 109
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGK 167
+ DT S L+WTQC+PC +CFDQ TPIFDP+ S+++S++ C LC + C +G
Sbjct: 110 YSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCSDG- 168
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C Y Y T+G+ + ET F G VP +AFGC DN G F + SG++G
Sbjct: 169 CEYLYGYGDYSSTQGMLASETLTF----GKVSVPEVAFGCGEDNEGSGF-SQGSGLVGLG 223
Query: 228 ASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRD--LETTPILLSDLR 283
PLSL SQL+ + FSYCL V + +A++++ G A V+ D ++TTP++ + +
Sbjct: 224 RGPLSLVSQLK---EPKFSYCLTSVDDTKASTLL-MGSLASVKASDSEIKTTPLIQNSAQ 279
Query: 284 PHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
P F YL L IS+G + F + DG+GG IID+GT +T++ + + + +
Sbjct: 280 PSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTS 339
Query: 343 ILRSLGRQRIPYNASQEFDYCYRYDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRG 400
+ + + S + C+ S P + FH AD + EN + G
Sbjct: 340 QI----NLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGADLELPAENYMIADASMG 395
Query: 401 RFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A+ SI G QQQNML+++DL L F C
Sbjct: 396 VACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 128/404 (31%), Positives = 199/404 (49%), Gaps = 24/404 (5%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
++++ + + S+AR + S++ I L ++ + Y ++V IG+P +
Sbjct: 42 TKAQLLSRAVARSRARVAALQSLATAADAITAARILLRFSEGE--YLMDVGIGSPPRYFS 99
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK--CQNGKCV 169
+ DT S L+WTQC PC+ C +Q TP F+P ST+Y+ +PC +C + + C CV
Sbjct: 100 AMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYSPLCFQNACV 159
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y + G+ + ETF F + VPR++FGC N N+G F G SG++GF
Sbjct: 160 YQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNG--SGMVGFGRG 217
Query: 230 PLSLSSQLRNRIQGLFSYCLVREME-ATSVIKFGRDADVRRRD------LETTPILLSDL 282
LSL SQL + FSYCL M ATS + FG A + + +++TP +++
Sbjct: 218 ALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPA 274
Query: 283 RPHFY-LHLLEISIGRHIVRFPPGAFDIMR-DGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
P Y L++ IS+ ++ P F I DGTGG IID+GT VTF+ Y + +
Sbjct: 275 LPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAF 334
Query: 341 DQILRSLGRQRIPYNASQEFDYCYRYDSS---FKAYPSMTFHLQEADYIVQPENMYFIEP 397
+ +G R S FD C+++ P M H AD + EN ++
Sbjct: 335 ---VAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMVMDG 391
Query: 398 DRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+A+ SI+G++Q QN ++YDL L F C
Sbjct: 392 GTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFVPAPC 435
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 138/442 (31%), Positives = 208/442 (47%), Gaps = 32/442 (7%)
Query: 15 YFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASM 74
+FS+ F+ F+ ++ GFS++LI S +SPLY ++ + S RAN+
Sbjct: 11 FFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANHFYKY 70
Query: 75 SKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ 134
S N Q +P + Y + ++GTP + + DT S +VW QC+PC C++Q
Sbjct: 71 SLANIPQSTV---IPDIGE---YLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQ 124
Query: 135 TTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAF 191
TTP+F+P S++Y IPC LC+S C + C Y+ Y + G S +T
Sbjct: 125 TTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTL 184
Query: 192 PVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
NG T P + GC +N ++ G SGI+GF + P S +QL + G FSYCL
Sbjct: 185 ESTNGLTVSFPNIVIGCGTNNI-LSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLT 243
Query: 251 -------REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRF- 302
+ ATS + FG A V + TTPIL D +YL L S+G V
Sbjct: 244 PLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIG 303
Query: 303 --PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF 360
P G D G IID+GT +T + Y L +++ L R P +Q
Sbjct: 304 GVPNG------DNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVK-LERVDDP---TQTL 353
Query: 361 DYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQ 420
+ CY + +P +T H + AD + P + F+ G FC+A + ++I G Q
Sbjct: 354 NLCYSVKAEGYDFPIITMHFKGADVDLHPIST-FVSVADGVFCLAFESSQDHAIFGNLAQ 412
Query: 421 QNMLIIYDLNVPALRFGSENCA 442
QN+++ YDL + F +C
Sbjct: 413 QNLMVGYDLQQKIVSFKPSDCT 434
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 134/373 (35%), Positives = 188/373 (50%), Gaps = 41/373 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP P + DT S L+WTQC PC +CF Q TP+++P +STT++ +PC+
Sbjct: 92 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151
Query: 156 L--CRS---------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPRL 203
L C + P C C Y Y G T ETF F G VP +
Sbjct: 152 LSVCAAALAGTGTAPPPGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPGI 207
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLV--REMEATSVI 259
AFGCS +SGF SG++G LSL SQL G+ FSYCL ++ +TS +
Sbjct: 208 AFGCSTASSGFN-ASSASGLVGLGRGRLSLVSQL-----GVPKFSYCLTPYQDTNSTSTL 261
Query: 260 KFGRDADVR-RRDLETTPILLSD----LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
G A + + +TP + S + +YL+L IS+G + PP AF + DGT
Sbjct: 262 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 321
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--- 371
GG IID+GT +T + N YQ Q ++ + +A D C+ SS
Sbjct: 322 GGLIIDSGTTITLLGNTAYQ---QVRAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPP 378
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDL 429
A PSMT H AD +V P + Y + D G +C+A+Q+ D + +ILG +QQQNM I+YD+
Sbjct: 379 AMPSMTLHFNGAD-MVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDI 437
Query: 430 NVPALRFGSENCA 442
L F C+
Sbjct: 438 GQETLSFAPAKCS 450
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 133/371 (35%), Positives = 186/371 (50%), Gaps = 37/371 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP P + DT S L+WTQC PC +CF Q TP+++P +STT++ +PC+
Sbjct: 32 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91
Query: 156 L--CRS---------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPRL 203
L C + P C C Y Y G T ETF F G VP +
Sbjct: 92 LSVCAAALAGTGTAPPPGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPGI 147
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKF 261
AFGCS +SGF SG++G LSL SQL FSYCL ++ +TS +
Sbjct: 148 AFGCSTASSGFN-ASSASGLVGLGRGRLSLVSQLGVPK---FSYCLTPYQDTNSTSTLLL 203
Query: 262 GRDADVR-RRDLETTPILLSD----LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
G A + + +TP + S + +YL+L IS+G + PP AF + DGTGG
Sbjct: 204 GPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGG 263
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK---AY 373
IID+GT +T + N YQ Q ++ + +A D C+ SS A
Sbjct: 264 LIIDSGTTITLLGNTAYQ---QVRAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPPAM 320
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDLNV 431
PSMT H AD +V P + Y + D G +C+A+Q+ D + +ILG +QQQNM I+YD+
Sbjct: 321 PSMTLHFNGAD-MVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQ 379
Query: 432 PALRFGSENCA 442
L F C+
Sbjct: 380 ETLSFAPAKCS 390
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 134/439 (30%), Positives = 207/439 (47%), Gaps = 48/439 (10%)
Query: 30 TGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASM------SKPNAFQEL 83
+GF L L + S + NL++ ++I + R N + ++ S P+ +
Sbjct: 44 SGFRLSLRHVDSGK------NLTKIQKIQRGINRGFHRLNRLGAVAVLAVASNPD---DT 94
Query: 84 EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRA 143
+I P + +E++IG P + DT S L+WTQC+PC CFDQ TPIFDP
Sbjct: 95 NNIKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEK 154
Query: 144 STTYSEIPCDDPLC----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
S++YS++ C LC RS C Y Y TRGL + ETF F N
Sbjct: 155 SSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENS--- 211
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATS 257
+ + FGC +N G F + SG++G PLSL SQL+ + FSYCL + + EA+S
Sbjct: 212 ISGIGFGCGVENEGDGF-SQGSGLVGLGRGPLSLISQLK---ETKFSYCLTSIEDSEASS 267
Query: 258 VIKFGR---------DADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAF 307
+ G A++ +T +L + +P F YL L I++G + F
Sbjct: 268 SLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTF 327
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN--ASQEFDYCYR 365
++ DGTGG IID+GT +T++ ++ L + + R +P + S D C++
Sbjct: 328 ELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTS------RMSLPVDDSGSTGLDLCFK 381
Query: 366 YDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNM 423
++ K A P + FH + AD + EN + G C+A+ SI G QQQN
Sbjct: 382 LPNAAKNIAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNF 441
Query: 424 LIIYDLNVPALRFGSENCA 442
+++DL + F C
Sbjct: 442 NVLHDLEKETVTFVPTECG 460
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 134/373 (35%), Positives = 190/373 (50%), Gaps = 41/373 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP P + DT S L+WTQC PC +CF Q TP+++P +STT++ +PC+
Sbjct: 90 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149
Query: 156 L--CRS---------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPRL 203
L C + P C C Y Y G T ETF F G + VP +
Sbjct: 150 LSVCAAALAGTGTAPPPGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRVPGI 205
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLV--REMEATSVI 259
AFGCS +SGF SG++G LSL SQL G+ FSYCL ++ +TS +
Sbjct: 206 AFGCSTASSGFN-ASSASGLVGLGRGRLSLVSQL-----GVPKFSYCLTPYQDTNSTSTL 259
Query: 260 KFGRDADVR-RRDLETTPILLSD----LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
G A + + +TP + S + +YL+L IS+G + PP AF + DGT
Sbjct: 260 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGT 319
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--- 371
GG IID+GT +T + N YQ Q ++ + +A+ D C+ SS
Sbjct: 320 GGLIIDSGTTITLLGNTAYQ---QVRAAVVSLVTLPTTDGSAATGLDLCFMLPSSTSAPP 376
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDL 429
A PSMT H AD +V P + Y + D G +C+A+Q+ D + +ILG +QQQNM I+YD+
Sbjct: 377 AMPSMTLHFNGAD-MVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDI 435
Query: 430 NVPALRFGSENCA 442
L F C+
Sbjct: 436 GQETLSFAPAKCS 448
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 176/364 (48%), Gaps = 33/364 (9%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC- 157
+E++IG P + DT S L+WTQC+PC CFDQ TPIFDP S++YS++ C LC
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 158 ---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
RS C Y Y TRGL + ETF F N + + FGC +N G
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS---ISGIGFGCGVENEGD 117
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGR--------- 263
F + SG++G PLSL SQL+ + FSYCL + + EA+S + G
Sbjct: 118 GF-SQGSGLVGLGRGPLSLISQLK---ETKFSYCLTSIEDSEASSSLFIGSLASGIVNKT 173
Query: 264 DADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
A + +T +L + +P F YL L I++G + F++ DGTGG IID+G
Sbjct: 174 GASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSG 233
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN--ASQEFDYCYRYDSSFK--AYPSMTF 378
T +T++ ++ L + + R +P + S D C++ + K A P M F
Sbjct: 234 TTITYLEETAFKVLKEEFTS------RMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIF 287
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
H + AD + EN + G C+A+ SI G QQQN +++DL + F
Sbjct: 288 HFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVP 347
Query: 439 ENCA 442
C
Sbjct: 348 TECG 351
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 132/398 (33%), Positives = 195/398 (48%), Gaps = 30/398 (7%)
Query: 64 SKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
SK A+ + + P D P+A Y +++GTP K ++ DT S L+W
Sbjct: 7 SKLAASSLITSEVPYPPSVSTDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWI 66
Query: 124 QCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQNGKCVYTRRYHVGDVTRG 182
QC+PC CF+Q PIFDP S++Y+ + C D LC S P K + C Y+ Y G TRG
Sbjct: 67 QCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPDCDYSYGYGDGSGTRG 126
Query: 183 LASRETFAFPVRNGFTFVPR-LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI 241
S ET G + +AFGC + N G +F SG++G LS SQL +
Sbjct: 127 TLSSETVTLTSTQGEKLAAKNIAFGCGHLNRG-SF-NDASGLVGLGRGNLSFVSQLGDLF 184
Query: 242 QGLFSYCLVREMEA---TSVIKFGRDADV----RRRDLETTPILLS-DLRPHFYLHLLEI 293
FSYCLV +A TS + FG ++ ++ TP++ + + +Y+ L +I
Sbjct: 185 GHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDI 244
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP 353
SI +R P G+FDI DG+GG I D+GT +T + + PYQ +LR+L R +I
Sbjct: 245 SIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQI-------VLRAL-RSKIS 296
Query: 354 Y----NASQEFDYCYRYDSSFKAY----PSMTFHLQEADYIVQPENMYFIEPDRGRF-CV 404
+ +S D CY S +Y P+M FH + ADY + EN + D G C+
Sbjct: 297 FPKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCL 356
Query: 405 A-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
A + + I G QQN ++YD+ + + C
Sbjct: 357 AMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 119/377 (31%), Positives = 191/377 (50%), Gaps = 33/377 (8%)
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+ +P+ + + ++++IGTP + DT S LVWTQC+PC+ CF+Q+TP+FDP +S
Sbjct: 106 DLQVPVHAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSS 165
Query: 145 TTYSEIPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV 200
+TYS +PC LC S C YT Y T+G+ + ETF T +
Sbjct: 166 STYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTL----AKTKL 221
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVI 259
P +AFGC + N G F + +G++G PLSL SQL G FSYCL + + S +
Sbjct: 222 PGVAFGCGDTNEGDGF-TQGAGLVGLGRGPLSLVSQLG---LGKFSYCLTSLDDTSKSPL 277
Query: 260 KFGRDADVRRRD-----LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDG 313
G A + ++TTP++ + +P F Y+ L +++G + P AF + DG
Sbjct: 278 LLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDG 337
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPY--NASQEFDYCYRYDSSF 370
TGG I+D+GT +T++ +Q Y + ++ Q ++P ++ D C++ +S
Sbjct: 338 TGGVIVDSGTSITYLE-------LQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASG 390
Query: 371 ---KAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLII 426
P + H AD + EN ++ G C+ + SI+G +QQQN+ +
Sbjct: 391 VDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGLSIIGNFQQQNIQFV 450
Query: 427 YDLNVPALRFGSENCAN 443
YD++ L F CA
Sbjct: 451 YDVDKDTLSFAPVQCAK 467
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 192/402 (47%), Gaps = 29/402 (7%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
NL++ E I + + + R + +M Q I P+ Y + V IGTP
Sbjct: 54 NLTKYELIKRAIKRGERRMRSINAM-----LQSSSGIETPVYAGSGEYLMNVAIGTPASS 108
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQNGKC 168
+ DT S L+WTQC+PC +CF Q TPIF+P+ S+++S +PC+ C+ P + C
Sbjct: 109 LSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSESCYNDC 168
Query: 169 VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNA 228
YT Y G T+G + ETF F + VP +AFGC DN GF G +G++G
Sbjct: 169 QYTYGYGDGSSTQGYMATETFTFETSS----VPNIAFGCGEDNQGFGQGNG-AGLIGMGW 223
Query: 229 SPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLSDLRP-HF 286
PLSL SQL G FSYC+ + S + G A +T ++ S L P ++
Sbjct: 224 GPLSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYY 280
Query: 287 YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILR 345
Y+ L I++G + P F + DGTGG IID+GT +T++ Y + Q + DQI
Sbjct: 281 YITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINL 340
Query: 346 SLGRQRIPYN-ASQEFDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRF 402
S P + +S C++ D S P ++ + EN+ I P G
Sbjct: 341 S------PVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEENV-LISPAEGVI 393
Query: 403 CVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
C+A+ + SI G QQQ ++YDL A+ F C
Sbjct: 394 CLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 435
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 143/460 (31%), Positives = 224/460 (48%), Gaps = 43/460 (9%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSE--------STGFSLKLIPIFSPESPLYPGNLS 52
MA + +L + F++ F F++S GF ++L + S + NL+
Sbjct: 1 MASMTSLCFVLALAMFTIFFSPAFSTSRRALEHPKMQKGFRVRLKHVDSGK------NLT 54
Query: 53 QSERIHKMFEISKARANYMASMS-KPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
+ ERI + + R + +M+ ++ E+E LP + + +++ IGTP +
Sbjct: 55 KLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLPGNGE---FLMKLAIGTPPETYS 111
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGKCV 169
+ DT S L+WTQC+PC +CF Q+TPIFDP+ S+++S++ C LC + C NG C
Sbjct: 112 AILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNG-CE 170
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y T+G+ + ET F G VP +AFGC DN G F + +G++G
Sbjct: 171 YLYSYGDYSSTQGILASETLTF----GKASVPNVAFGCGADNEGSGF-SQGAGLVGLGRG 225
Query: 230 PLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRD--LETTPILLSDLRPHF 286
PLSL SQL+ + FSYCL + TS + G A V ++TTP++ S P F
Sbjct: 226 PLSLVSQLK---EPKFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSF 282
Query: 287 -YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILR 345
YL L IS+G + F + DG+GG IID+GT +T++ + + + +
Sbjct: 283 YYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFT---- 338
Query: 346 SLGRQRIPYNASQE--FDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR 401
+ +P ++S D C+ S+ P + FH AD + EN + G
Sbjct: 339 --AKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGADLELPAENYMIGDSSMGV 396
Query: 402 FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A+ SI G QQQNML+++DL L F C
Sbjct: 397 ACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 133/408 (32%), Positives = 189/408 (46%), Gaps = 27/408 (6%)
Query: 42 PESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEV 101
P SPL + I K A A +SK + E P+A + Y +++
Sbjct: 28 PSSPLRSNTSKTTTEI--FLAAVKRGAERRAQLSK-HILAEGRLFSTPVASGNGEYLIDI 84
Query: 102 NIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-P 160
+ G+P + ++ DT S L+WTQC PC C + IFDP S+TY + C C S P
Sbjct: 85 SFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASNFCSSLP 144
Query: 161 FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI 220
F+ C Y Y G T G S ET +P +AFGC + N G +F G
Sbjct: 145 FQSCTTSCKYDYMYGDGSSTSGALSTETVTVGTGT----IPNVAFGCGHTNLG-SFAGA- 198
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILL 279
+GI+G PLSL SQ + FSYCLV TS + G A + T +L
Sbjct: 199 AGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGDSA--AAGGVAYTALLT 256
Query: 280 SDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQ 338
+ P F Y L IS+ V +P G F I G GGFI+D+GT +T++ G + L+
Sbjct: 257 NTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNALVA 316
Query: 339 RYDQILRSLGRQRIPYNASQ----EFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMY 393
+ +P+ + DYC+ + YP+MTFH + ADY + PEN++
Sbjct: 317 AL--------KAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENVF 368
Query: 394 FIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+A+ +SI+G QQQN LI++DL + F NC
Sbjct: 369 VALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/398 (32%), Positives = 195/398 (48%), Gaps = 30/398 (7%)
Query: 64 SKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
SK A+ + + P D P+A Y +++GTP K ++ DT S L+W
Sbjct: 7 SKLAASSLITSEVPYPPSVSTDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWI 66
Query: 124 QCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQNGKCVYTRRYHVGDVTRG 182
QC+PC CF+Q PIFDP S++Y+ + C D LC S P K + C Y+ Y G TRG
Sbjct: 67 QCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPNCDYSYGYGDGSGTRG 126
Query: 183 LASRETFAFPVRNGFTFVPR-LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI 241
S ET G + +AFGC + N G +F SG++G LS SQL +
Sbjct: 127 TLSSETVTLTSTQGEKLAAKNIAFGCGHLNRG-SF-NDASGLVGLGRGNLSFVSQLGDLF 184
Query: 242 QGLFSYCLVREMEA---TSVIKFGRDADV----RRRDLETTPILLS-DLRPHFYLHLLEI 293
FSYCLV +A TS + FG ++ ++ TP++ + + +Y+ L +I
Sbjct: 185 GHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDI 244
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP 353
SI +R P G+FDI DG+GG I D+GT +T + + PYQ +LR+L R ++
Sbjct: 245 SIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQI-------VLRAL-RSKVS 296
Query: 354 Y----NASQEFDYCYRYDSSFKAY----PSMTFHLQEADYIVQPENMYFIEPDRGRF-CV 404
+ +S D CY S +Y P+M FH + AD+ + EN + D G C+
Sbjct: 297 FPEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCL 356
Query: 405 A-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
A + + I G QQN ++YD+ + + C
Sbjct: 357 AMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 131/371 (35%), Positives = 181/371 (48%), Gaps = 45/371 (12%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDPLC 157
+ + IGTP P + DT S L+WTQC PC R CF Q TP+++P +STT+S +PC+ L
Sbjct: 87 MTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSSLG 146
Query: 158 RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF----------VPRLAFGC 207
C C+Y Y G T+ F FTF VP +AFGC
Sbjct: 147 LCAPAC---ACMYNMTYGSG---------WTYVFQGTETFTFGSSTPADQVRVPGIAFGC 194
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDA 265
SN +SGF SG++G LSL SQL FSYCL ++ +TS + G A
Sbjct: 195 SNASSGFN-ASSASGLVGLGRGSLSLVSQLGAPK---FSYCLTPYQDTNSTSTLLLGPSA 250
Query: 266 DVRRRDL-ETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+ + +TP + S ++YL+L IS+G + PP AF + DGTGG IID+GT
Sbjct: 251 SLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTT 310
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA---YPSMTFHLQ 381
+T + N YQ Q +L + +A+ D C+ SS A PSMT H
Sbjct: 311 ITMLGNTAYQ---QVRAAVLSLVTLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFD 367
Query: 382 EADYIVQPENMYF----IEPDRGRFCVAIQ-----DDPKYSILGAWQQQNMLIIYDLNVP 432
AD ++ +N + D +C+A+Q D SILG +QQQNM I+YD+
Sbjct: 368 GADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKE 427
Query: 433 ALRFGSENCAN 443
L F C+
Sbjct: 428 TLSFAPAKCST 438
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 129/367 (35%), Positives = 184/367 (50%), Gaps = 30/367 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP P + DT S L+WTQC PC +CF+Q P+++P +STT+S +PC+
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 171
Query: 156 L--CRSPFKCQNG----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCS 208
L C C+Y + Y G T G+ ETF F VP +AFGCS
Sbjct: 172 LSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCS 230
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDAD 266
N +S G +G++G LSL SQL G FSYCL ++ +TS + G A
Sbjct: 231 NASSSDWNGS--AGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAA 285
Query: 267 VRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
+ + +TP + S R ++YL+L IS+G + PGAF + DGTGG IID+G
Sbjct: 286 LNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSG 345
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----YPSMTF 378
T +T + N YQ + ++ +L + + S D C+ + A PSMT
Sbjct: 346 TTITSLANAAYQQVRAAVKSLVTTL--PTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTL 403
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
H AD +V P + Y I G +C+A+ Q D S G +QQQNM I+YD+ L F
Sbjct: 404 HFDGAD-MVLPADSYMIS-GSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSF 461
Query: 437 GSENCAN 443
C+
Sbjct: 462 APAKCST 468
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 134/423 (31%), Positives = 210/423 (49%), Gaps = 27/423 (6%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GFS+ LI SP SP Y +L+ S+RI S + N AS S N + LE + +P
Sbjct: 28 GFSIDLIHRDSPLSPFYKPSLTPSDRIINTALRSIYQLN-RASHSDLNEKKTLERVRIPN 86
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ Y + IGTP + + DTAS L+W QC PC CF Q TP+F+P S+T++ +
Sbjct: 87 HGE---YLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANL 143
Query: 151 PCDDPLCRSP--FKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
CD C S + C C+YT Y G T+G+ E+ F + TF P+ FG
Sbjct: 144 SCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQT-VTF-PKTIFG 201
Query: 207 C-SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV-IKFGRD 264
C SN++ K++GI+G A PLSL SQL ++I FSYCL+ +++ +KFG D
Sbjct: 202 CGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGND 261
Query: 265 ADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
+ + +TP+++ P +Y LHL+ I+IG+ +++ G IID GT
Sbjct: 262 TTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQ-----VRTTDHTNGNIIIDLGT 316
Query: 324 PVTFIRNGPYQTLMQRYDQILR-SLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQE 382
+T++ Y + + L S + IPY FD+C+ ++ +P + F
Sbjct: 317 VLTYLEVNFYHNFVTLLREALGISETKDDIPY----PFDFCFPNQANI-TFPKIVFQFTG 371
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQDD---PKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
A + P+N++F D C+A+ D +S+ G Q + + YD + F
Sbjct: 372 AKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPA 431
Query: 440 NCA 442
+C+
Sbjct: 432 DCS 434
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 131/405 (32%), Positives = 198/405 (48%), Gaps = 31/405 (7%)
Query: 50 NLSQSERIHKMFEISK---ARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTP 106
NL++ +RI + + R N M + NA +I+ P+ + + + + IGTP
Sbjct: 55 NLTKFQRIQHGIKRANHRLERLNAMVLAASSNA-----EINSPVLSGNGEFLMNLAIGTP 109
Query: 107 MKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQN 165
+ + DT S L+WTQC+PC +CFDQ +PIFDP+ S+++S++ C LC++ P +
Sbjct: 110 PETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCS 169
Query: 166 GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILG 225
C Y Y T+G + ETF F G +P + FGC DN G F + SG++G
Sbjct: 170 DSCEYLYTYGDYSSTQGTMATETFTF----GKVSIPNVGFGCGEDNEGDGF-TQGSGLVG 224
Query: 226 FNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRD--LETTPILLSDL 282
PLSL SQL+ + FSYCL + TS + G A V + TTP++ + L
Sbjct: 225 LGRGPLSLVSQLK---EAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPL 281
Query: 283 RPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
+P F YL L IS+G + F + DGTGG IID+GT +T++ + + + +
Sbjct: 282 QPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFT 341
Query: 342 QILRSLGRQRIPYNASQE--FDYCYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEP 397
+ +P + S + CY D+S P + H AD + EN +
Sbjct: 342 S------QMGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFTGADLELPGENYMIADS 395
Query: 398 DRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
G C+A+ SI G QQQNM + +DL L F NC
Sbjct: 396 SMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCG 440
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/375 (31%), Positives = 184/375 (49%), Gaps = 34/375 (9%)
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+ +P+ + + ++V+IGTP + DT S LVWTQC+PC+ CF Q+TP+FDP +S
Sbjct: 93 DLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 152
Query: 145 TTYSEIPCDDPLCRS--PFKCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP 201
+TY+ +PC C KC + KC YT Y T+G+ + ETF +P
Sbjct: 153 STYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LP 208
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVR-EMEATSV 258
+ FGC + N G F + +G++G PLSL SQL GL FSYCL + S
Sbjct: 209 GVVFGCGDTNEGDGF-SQGAGLVGLGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSP 262
Query: 259 IKFGRDADV-----RRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRD 312
+ G A + ++TTP++ + +P F Y+ L I++G + P AF + D
Sbjct: 263 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 322
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRYDSSF 370
GTGG I+D+GT +T++ Y+ L + + + +P + D C+R +
Sbjct: 323 GTGGVIVDSGTSITYLEVQGYRALKKAF------AAQMALPAADGSGVGLDLCFRAPAKG 376
Query: 371 ---KAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLII 426
P + FH AD + EN ++ G C+ + SI+G +QQQN +
Sbjct: 377 VDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFV 436
Query: 427 YDLNVPALRFGSENC 441
YD+ L F C
Sbjct: 437 YDVGHDTLSFAPVQC 451
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 132/429 (30%), Positives = 207/429 (48%), Gaps = 40/429 (9%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHK--MFEISKA-RANYMASMSKPNAFQELEDIH 87
GFS+ LIP SP SPLY ++Q+E + + I+++ R N++ +S P L I
Sbjct: 25 GFSIDLIPRHSPISPLYNSQMTQTELVKSAALRSITRSKRVNFIGQISPP-----LSPII 79
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
P+ Y + ++GTP + +FDT S L W QC PC C+ Q P+FDP S+TY
Sbjct: 80 TPIPDHGE-YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTY 138
Query: 148 SEIPCDDPLC----RSPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPV----RNGFT 198
++PC+ C ++ +C + K C+Y +Y T G +T +F + G T
Sbjct: 139 VDVPCESQPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGAT 198
Query: 199 FVPRLAFGCS-NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEAT 256
F P+ FGC+ N F K +G +G PLSL+SQL ++I FSYC+V +T
Sbjct: 199 F-PKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTST 257
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTG 315
+KFG A ++ +TP +++ P +Y+ LE I++G+ ++ G
Sbjct: 258 GKLKFGSMAPT--NEVVSTPFMINPSYPSYYVLNLEGITVGQK---------KVLTGQIG 306
Query: 316 GFIIDTGTPV-TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
G II P+ T + G Y + + + + +A F+YC R ++ +P
Sbjct: 307 GNIIIDSVPILTHLEQGIYTDFISSVKEAINV----EVAEDAPTPFEYCVRNPTNLN-FP 361
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
FH AD ++ P+NM FI D C+ + SI G W Q N + YDL +
Sbjct: 362 EFVFHFTGADVVLGPKNM-FIALDNNLVCMTVVPSKGISIFGNWAQVNFQVEYDLGEKKV 420
Query: 435 RFGSENCAN 443
F NC+
Sbjct: 421 SFAPTNCST 429
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 135/435 (31%), Positives = 211/435 (48%), Gaps = 25/435 (5%)
Query: 15 YFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASM 74
+FS+ F+ F+ S FS +LI S +SPLY ++ + + S RAN +
Sbjct: 11 FFSLCFIISFSHSLRNSFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLFKD 70
Query: 75 SKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ 134
S N + +++ + + YSV GTP + + DT S +VW QC+PC +C+ Q
Sbjct: 71 SLSNTPES--TVYVNGGEYLMTYSV----GTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQ 124
Query: 135 TTPIFDPRASTTYSEIPCDDPLCRSP--FKC-QNGKCVYTRRYHVGDVTRGLASRETFAF 191
TTPIF+P S++Y IPC LC+S C + C YT + ++G S ET
Sbjct: 125 TTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTL 184
Query: 192 PVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC-- 248
G + P+ GC ++N G F G+ SGI+G P+SL++QL++ I G FSYC
Sbjct: 185 DSTTGHSVSFPKTVIGCGHNNRGM-FQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLL 243
Query: 249 -LVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
L+ + TS + FG A V + +TP + D + +YL L S+G + F
Sbjct: 244 PLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIE-----F 298
Query: 308 DIMRDG-TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY 366
+++ D G I+D+GT +T + + Y L Q+++ L R P +Q + CY
Sbjct: 299 EVLDDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVK-LDRVDDP---NQLLNLCYSI 354
Query: 367 DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLII 426
S +P +T H + AD + P + + D G C+A I G Q N+L+
Sbjct: 355 TSDQYDFPIITAHFKGADIKLNPISTFAHVAD-GVVCLAFTSSQTGPIFGNLAQLNLLVG 413
Query: 427 YDLNVPALRFGSENC 441
YDL + F +C
Sbjct: 414 YDLQQNIVSFKPSDC 428
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 119/375 (31%), Positives = 184/375 (49%), Gaps = 34/375 (9%)
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+ +P+ + + ++V+IGTP + DT S LVWTQC+PC+ CF Q+TP+FDP +S
Sbjct: 83 DLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 142
Query: 145 TTYSEIPCDDPLCRS--PFKCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP 201
+TY+ +PC C KC + KC YT Y T+G+ + ETF +P
Sbjct: 143 STYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LP 198
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVR-EMEATSV 258
+ FGC + N G F + +G++G PLSL SQL GL FSYCL + S
Sbjct: 199 GVVFGCGDTNEGDGF-SQGAGLVGLGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSP 252
Query: 259 IKFGRDADV-----RRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRD 312
+ G A + ++TTP++ + +P F Y+ L I++G + P AF + D
Sbjct: 253 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 312
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRYDSSF 370
GTGG I+D+GT +T++ Y+ L + + + +P + D C+R +
Sbjct: 313 GTGGVIVDSGTSITYLEVQGYRALKKAF------AAQMALPAADGSGVGLDLCFRAPAKG 366
Query: 371 ---KAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLII 426
P + FH AD + EN ++ G C+ + SI+G +QQQN +
Sbjct: 367 VDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFV 426
Query: 427 YDLNVPALRFGSENC 441
YD+ L F C
Sbjct: 427 YDVGHDTLSFAPVQC 441
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 179/353 (50%), Gaps = 16/353 (4%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++++GTP + + DT S L W QC PC RCF+Q P+F P AS++YS C D L
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSL 67
Query: 157 CRS---PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + P C Y+ Y G TRG + ET NG T R+ FGC ++ G
Sbjct: 68 CDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTL---NGSTLA-RIGFGCGHNQEG 123
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRRD 271
F G G++G PLSL SQL + +FSYCLV + S I FG A+ R
Sbjct: 124 -TFAGA-DGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRAS 181
Query: 272 LETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
TP+L ++ P ++Y+ + IS+G V PP AF I +G GG I+D+GT +T+ R
Sbjct: 182 F--TPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRL 239
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPE 390
+ ++ + + PY + +D SS PSMT HL D+ +
Sbjct: 240 AAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLT-LPSMTVHLTNVDFEIPVS 298
Query: 391 NMYFIEPDRGR-FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
N++ + + G C A+ ++SI+G QQQN LI+ D+ + F + +C+
Sbjct: 299 NLWVLVDNFGETVCTAMSTSDQFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/368 (35%), Positives = 185/368 (50%), Gaps = 31/368 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP P + DT S L+WTQC PC +CF+Q P+++P +STT+S +PC+
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 173
Query: 156 L--CRSPFKCQNG----KCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFGCS 208
L C C+Y + Y G T G+ ETF F VP +AFGCS
Sbjct: 174 LSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCS 232
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDAD 266
N +S G +G++G LSL SQL G FSYCL ++ +TS + G A
Sbjct: 233 NASSSDWNGS--AGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAA 287
Query: 267 VRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
+ + +TP + S R ++YL+L IS+G + PGAF + DGTGG IID+G
Sbjct: 288 LNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSG 347
Query: 323 TPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----YPSMT 377
T +T + N YQ + Q++ +L + + S D C+ + A PSMT
Sbjct: 348 TTITSLANAAYQQVRAAVKSQLVTTL--PTVDGSDSTGLDLCFALPAPTSAPPAVLPSMT 405
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALR 435
H AD +V P + Y I G +C+A+ Q D S G +QQQNM I+YD+ L
Sbjct: 406 LHFDGAD-MVLPADSYMIS-GSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETLS 463
Query: 436 FGSENCAN 443
F C+
Sbjct: 464 FAPAKCST 471
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 117/361 (32%), Positives = 178/361 (49%), Gaps = 17/361 (4%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
D Y +E+ IGTP + + DT S L+WTQC PC+ C DQ TP FDP S+TY +
Sbjct: 87 ASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLG 146
Query: 152 CDDPLCRSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C P C + + C CVY Y T G+ + ETF F + +PR++FGC N
Sbjct: 147 CSAPACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGN 206
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVR 268
N+G G SG++GF LSL SQL + FSYCL + S + FG A +
Sbjct: 207 LNAGSLANG--SGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVRSRLYFGAYATLN 261
Query: 269 RRD---LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDTGT 323
+ +++TP +++ P Y L++ IS+G + + P I DGTGG IID+GT
Sbjct: 262 STNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGT 321
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK---AYPSMTFHL 380
+T++ Y + + + L S + + D C+++ + P + H
Sbjct: 322 TITYLAEPAYYAVREAFVLYLNST-LPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHF 380
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
AD+ + +N ++P G C+A+ SI+G++Q QN ++YDL L F
Sbjct: 381 DGADWELPLQNYMLVDPSTGGLCLAMATSSDGSIIGSYQHQNFNVLYDLENSLLSFVPAP 440
Query: 441 C 441
C
Sbjct: 441 C 441
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 206/437 (47%), Gaps = 31/437 (7%)
Query: 20 FLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNA 79
FL++ + GF+ LI SP+SP Y + S+R+ S +R + +S+ +A
Sbjct: 19 FLSNANAKSKLGFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDA 78
Query: 80 FQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
I L + Y + +++GTP P + DT S L+WTQC+PC C+ Q P+F
Sbjct: 79 SDNAPQIDLTSNSGE--YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLF 136
Query: 140 DPRASTTYSEIPCDDPLC-----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF--- 191
DP+AS+TY ++ C C ++ ++ C Y+ Y T+G + +T
Sbjct: 137 DPKASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGST 196
Query: 192 ---PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
PV+ + + GC ++N+G F K SGI+G +SL +QL + I G FSYC
Sbjct: 197 DTRPVQ-----LKNIIIGCGHNNAG-TFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYC 250
Query: 249 LV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG 305
LV E + TS I FG +A V + +TP++ +YL L IS+G V++P
Sbjct: 251 LVPLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGS 310
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
G G IID+GT +T + P + + D + S+ ++ + CY
Sbjct: 311 D---SGSGEGNIIIDSGTTLTLL---PTEFYSELEDAVASSIDAEK-KQDPQTGLSLCYS 363
Query: 366 YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLI 425
K P++T H AD ++P N F++ C A + P +SI G Q N L+
Sbjct: 364 ATGDLKV-PAITMHFDGADVNLKPSNC-FVQISEDLVCFAFRGSPSFSIYGNVAQMNFLV 421
Query: 426 IYDLNVPALRFGSENCA 442
YD + F +CA
Sbjct: 422 GYDTVSKTVSFKPTDCA 438
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 126/413 (30%), Positives = 199/413 (48%), Gaps = 34/413 (8%)
Query: 49 GNLSQSERIHKMFEISKAR-ANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPM 107
GN S+ + + + S R + +A + A D+ +P+ + + ++V IGTP
Sbjct: 51 GNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMDVAIGTPA 110
Query: 108 KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQN 165
+ DT S LVWTQC+PC+ CF Q+TP+FDP +S+TY+ +PC LC C +
Sbjct: 111 LSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDLPTSTCTS 170
Query: 166 G-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGIL 224
KC YT Y T+G+ + ETF +P +AFGC + N G F + +G++
Sbjct: 171 ASKCGYTYTYGDASSTQGVLASETFTLGKEK--KKLPGVAFGCGDTNEGDGF-TQGAGLV 227
Query: 225 GFNASPLSLSSQLRNRIQGL--FSYCL--VREMEATSVIKFGRDADVRRRD-----LETT 275
G PLSL SQL GL FSYCL + + + S + G A ++TT
Sbjct: 228 GLGRGPLSLVSQL-----GLDKFSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTT 282
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
P++ + +P F Y+ L +++G + P AF I DGTGG I+D+GT +T++ Y+
Sbjct: 283 PLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYR 342
Query: 335 TLMQRYDQILRSLGRQRIPYNASQE--FDYCYRYDSSF---KAYPSMTFHLQ-EADYIVQ 388
L + + + + +P E D C++ + P + H AD +
Sbjct: 343 ALKKAF------VAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADLDLP 396
Query: 389 PENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
EN ++ G C+ + SI+G +QQQN +YD+ L F C
Sbjct: 397 AENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVAGDTLSFAPVQC 449
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 123/372 (33%), Positives = 176/372 (47%), Gaps = 26/372 (6%)
Query: 89 PMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
P+A Y V +GTP + ++ DT S L W QC PC +C+ Q +F P ST+++
Sbjct: 5 PVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFT 64
Query: 149 EIPCDDPLCRS-PF-KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAF 205
++ C LC PF C CVY Y G +T G +T NG VP AF
Sbjct: 65 KLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAF 124
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME---ATSVIKFG 262
GC +DN G +F G GILG PLS SQL++ G FSYCLV + TS + FG
Sbjct: 125 GCGHDNEG-SFAGA-DGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFG 182
Query: 263 RDADVRRRDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
A D++ PIL + P ++Y+ L IS+G +++ FDI G G I D+
Sbjct: 183 DAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDS 242
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY-------- 373
GT VT Q Y ++L ++ + Y S++ D R D +
Sbjct: 243 GTTVT-------QLAEAAYKEVLAAMNASTMAY--SRKIDDISRLDLCLSGFPKDQLPTV 293
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPA 433
P+MTFH + D ++ P N + +C A+ P +I+G+ QQQN + YD
Sbjct: 294 PAMTFHFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYYDTAGRK 353
Query: 434 LRFGSENCANGR 445
L F ++C R
Sbjct: 354 LGFVPKDCVGRR 365
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 118/372 (31%), Positives = 182/372 (48%), Gaps = 34/372 (9%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
+P+ + + ++V+IGTP + DT S LVWTQC+PC+ CF Q+TP+FDP +S+TY
Sbjct: 65 VPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTY 124
Query: 148 SEIPCDDPLCRS--PFKCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
+ +PC C KC + KC YT Y T+G+ + ETF +P +
Sbjct: 125 ATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVV 180
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVR-EMEATSVIKF 261
FGC + N G F + +G++G PLSL SQL GL FSYCL + S +
Sbjct: 181 FGCGDTNEGDGF-SQGAGLVGLGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLL 234
Query: 262 GRDADV-----RRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
G A + ++TTP++ + +P F Y+ L I++G + P AF + DGTG
Sbjct: 235 GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 294
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRYDSSF--- 370
G I+D+GT +T++ Y+ L + + + +P + D C+R +
Sbjct: 295 GVIVDSGTSITYLEVQGYRALKKAF------AAQMALPAADGSGVGLDLCFRAPAKGVDQ 348
Query: 371 KAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDL 429
P + FH AD + EN ++ G C+ + SI+G +QQQN +YD+
Sbjct: 349 VEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDV 408
Query: 430 NVPALRFGSENC 441
L F C
Sbjct: 409 GHDTLSFAPVQC 420
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 145/416 (34%), Positives = 198/416 (47%), Gaps = 45/416 (10%)
Query: 55 ERIHKMFEISKARANYMASMSKPNAF--QELEDIHLPMAKQDLFYSVEVNIGTPMKPQH- 111
ER+ +M S+ARA AS+ + Q + +P + + Y + NIGTP +PQ
Sbjct: 49 ERLSRMAVRSRARA---ASLYQRGGHYGQPVTATAVPSSGE---YLIHFNIGTP-RPQRV 101
Query: 112 -LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR-------SPFKC 163
L DT S LVWTQC PC CFDQ P+FDP S+T+ + C DP+CR S
Sbjct: 102 ALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACAL 161
Query: 164 QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF----TFVPRLAFGCSNDNSGFAFGGK 219
+ +C Y Y +T G ++TF F NG V LAFGC + N+G F
Sbjct: 162 KTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTG-VFASN 220
Query: 220 ISGILGFNASPLSLSSQLRNRIQGLFSYCLVR----EMEATSVIKFGRDADVRRRD---- 271
SGI GF PLSL SQLR G FSYCL E TS + G + R
Sbjct: 221 ESGIAGFGRGPLSLPSQLR---VGRFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGP 277
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+TPI+ S P F YL L I++G+ + F + +DG+GG +ID+GT VT
Sbjct: 278 FRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTF-- 335
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFD--YCYRYDSSFKA--YPSMTFHLQEADYI 386
P Q ++ + L R Y+ + E C++ K P + FHL AD
Sbjct: 336 -PAAVFEQLKNEFVAQLPLPR--YDNTSEVGNLLCFQRPKGGKQVPVPKLIFHLASADMD 392
Query: 387 VQPENMYFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ EN + D G C+ I + ++G +QQQNM I+YD+ L F S C
Sbjct: 393 LPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQC 448
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 133/379 (35%), Positives = 189/379 (49%), Gaps = 52/379 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP + DT S L+WTQC PC +CF Q TP+++P +STT++ +PC+
Sbjct: 86 YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSS 145
Query: 156 L--CRS-------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF----PVRNGFTFVPR 202
L C + P C C+Y Y G T ETF F P T VP
Sbjct: 146 LSMCAAALAGTTPPPGC---TCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQ--TGVPG 199
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLV--REMEATSV 258
+AFGCSN + GF SG++G LSL SQL G+ FSYCL ++ +TS
Sbjct: 200 IAFGCSNASGGFNTS-SASGLVGLGRGSLSLVSQL-----GVPKFSYCLTPYQDTNSTST 253
Query: 259 IKFGRDADVRRRD-LETTPILLSD----LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ G A + + +TP + S + ++YL+L IS+G + P A + DG
Sbjct: 254 LLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADG 313
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY----NASQEFDYCYRYDSS 369
TGGFIID+GT +T + N YQ + ++ +P +A+ D C+ SS
Sbjct: 314 TGGFIIDSGTTITLLGNTAYQQVRAAVVSLV------TLPTTDGGSAATGLDLCFELPSS 367
Query: 370 FKA---YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNML 424
A PSMT H AD +V P + Y + D +C+A+Q+ D SILG +QQQNM
Sbjct: 368 TSAPPTMPSMTLHFDGAD-MVLPADSYMML-DSNLWCLAMQNQTDGGVSILGNYQQQNMH 425
Query: 425 IIYDLNVPALRFGSENCAN 443
I+YD+ L F C+
Sbjct: 426 ILYDVGQETLTFAPAKCST 444
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 129/370 (34%), Positives = 182/370 (49%), Gaps = 37/370 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDD 154
Y + ++IGTP + DT S L+WTQC PC +CF Q P+++P +STT+ +PC+
Sbjct: 92 YLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNS 151
Query: 155 PL--CRS-------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPRLA 204
L C P C C+Y + Y G T G+ ETF F VP +A
Sbjct: 152 SLSMCAGVLAGKAPPPGC---ACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPGIA 207
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFG 262
FGCSN +S G +G++G LSL SQL G FSYCL ++ +TS + G
Sbjct: 208 FGCSNASSSDWNGS--AGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLLG 262
Query: 263 RDADVRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
A + + +TP + S + ++YL+L IS+G + P AF + DGTGG I
Sbjct: 263 PSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLI 322
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY---DSSFKAYPS 375
ID+GT +T + N YQ + ++ I + S D CY S+ A PS
Sbjct: 323 IDSGTTITSLVNAAYQQVRAAVQSLVT---LPAIDGSDSTGLDLCYALPTPTSAPPAMPS 379
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPA 433
MT H AD +V P + Y I G +C+A+ Q D S G +QQQNM I+YD+
Sbjct: 380 MTLHFDGAD-MVLPADSYMIS-GSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEM 437
Query: 434 LRFGSENCAN 443
L F C+
Sbjct: 438 LSFAPAKCST 447
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 139/424 (32%), Positives = 193/424 (45%), Gaps = 44/424 (10%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAF---------QELEDIHLPMAKQDLFYSVE 100
L + E I + + SKARA ++ + F +E E A DL Y ++
Sbjct: 42 ELPKRELIRRAMQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLD 101
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP 160
+ +GTP +P L DT S L+WTQC C C Q P+F PR S++Y + C LC
Sbjct: 102 LAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDI 161
Query: 161 FK---CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
+ C Y Y G T G + E F F +G T L FGC N G
Sbjct: 162 LHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMNVGSL-- 219
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDADVRRRD----- 271
SGI+GF PLSL SQL R FSYCL + S ++FG ADV D
Sbjct: 220 NNASGIVGFGRDPLSLVSQLSIR---RFSYCLTPYASSRKSTLQFGSLADVGLYDDATGP 276
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++TTPIL S P F Y+ +++G +R P AF + DG+GG IID+GT +T
Sbjct: 277 VQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLF-- 334
Query: 331 GPYQTLMQRYDQILRSLGRQ-RIPY--NASQEFDYCYRYDSSFK---------AYPSMTF 378
P L +++R+ Q R+P+ +S + C+ + A P M F
Sbjct: 335 -PVAVLA----EVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVF 389
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFG 437
H Q AD + EN + RG CV + D + +G + QQ+M ++YDL L F
Sbjct: 390 HFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFA 449
Query: 438 SENC 441
C
Sbjct: 450 PVEC 453
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 139/424 (32%), Positives = 193/424 (45%), Gaps = 44/424 (10%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAF---------QELEDIHLPMAKQDLFYSVE 100
L + E I + + SKARA ++ + F +E E A DL Y ++
Sbjct: 42 ELPKRELIRRAMQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLD 101
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP 160
+ +GTP +P L DT S L+WTQC C C Q P+F PR S++Y + C LC
Sbjct: 102 LAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDI 161
Query: 161 FK---CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
+ C Y Y G T G + E F F +G T L FGC N G
Sbjct: 162 LHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMNVGSL-- 219
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDADVRRRD----- 271
SGI+GF PLSL SQL R FSYCL + S ++FG ADV D
Sbjct: 220 NNASGIVGFGRDPLSLVSQLSIR---RFSYCLTPYASSRKSTLQFGSLADVGLYDDATGP 276
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++TTPIL S P F Y+ +++G +R P AF + DG+GG IID+GT +T
Sbjct: 277 VQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLF-- 334
Query: 331 GPYQTLMQRYDQILRSLGRQ-RIPY--NASQEFDYCYRYDSSFK---------AYPSMTF 378
P L +++R+ Q R+P+ +S + C+ + A P M F
Sbjct: 335 -PAAVLA----EVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVF 389
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFG 437
H Q AD + EN + RG CV + D + +G + QQ+M ++YDL L F
Sbjct: 390 HFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFA 449
Query: 438 SENC 441
C
Sbjct: 450 PVEC 453
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 138/425 (32%), Positives = 197/425 (46%), Gaps = 45/425 (10%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNA---FQELEDIHLPMA--------KQDLFYS 98
LS+SE I + + SKARA ++++ A F D DL Y
Sbjct: 44 QLSRSELIRRAMQRSKARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYV 103
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
V++ IGTP +P L DT S L+WTQC PC C Q P+F P S +Y + C LC
Sbjct: 104 VDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQLCS 163
Query: 159 SPFK--CQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNG--FTFVPRLAFGCSNDNSG 213
C+ C Y Y G +T G+ + E F F G VP L FGC + N G
Sbjct: 164 DILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP-LGFGCGSMNVG 222
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDADVRRRD- 271
G SGI+GF +PLSL SQL R FSYCL S + FG + D
Sbjct: 223 SLNNG--SGIVGFGRNPLSLVSQLSIR---RFSYCLTSYGSGRKSTLLFGSLSGGVYGDA 277
Query: 272 ---LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
++TTP+L S P F Y+HL +++G +R P AF + DG+GG I+D+GT +T
Sbjct: 278 TGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTL 337
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRYDSSFK--------AYPSMT 377
+ +++ + Q L R+P+ + E C+ ++++ P M
Sbjct: 338 LPGAVLAEVVRAFRQQL------RLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMV 391
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRF 436
FH Q+AD + N + +GR C+ + D S +G QQ+M ++YDL L F
Sbjct: 392 FHFQDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSF 451
Query: 437 GSENC 441
C
Sbjct: 452 APAQC 456
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 136/436 (31%), Positives = 206/436 (47%), Gaps = 42/436 (9%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSK-PNAFQELED 85
+++ GF LKL + + S ++ + + + SKAR + S + P +
Sbjct: 24 NDNVGFQLKLTHVDAGTS------YTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITA 77
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+ + Y V++ IGTP + DT S L+WTQC PC+ C DQ TP FD + S
Sbjct: 78 ARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSA 137
Query: 146 TYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDV--TRGLASRETFAFPVRNGFTF-V 200
TY +PC C S C CVY +Y+ GD T G+ + ETF F N
Sbjct: 138 TYRALPCRSSRCASLSSPSCFKKMCVY--QYYYGDTASTAGVLANETFTFGAANSTKVRA 195
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVI 259
+AFGC + N+G SG++GF PLSL SQL FSYCL + AT S +
Sbjct: 196 TNIAFGCGSLNAGDL--ANSSGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRL 250
Query: 260 KFGRDADVRRRD------LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRD 312
FG A++ + +++TP +++ P+ Y L L IS+G ++ P F I D
Sbjct: 251 YFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDD 310
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE----FDYCYRY-- 366
GTGG IID+GT +T+++ Y+ + R L IP A + D C+++
Sbjct: 311 GTGGVIIDSGTSITWLQQ-------DAYEAVRRGL-VSAIPLPAMNDTDIGLDTCFQWPP 362
Query: 367 -DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLI 425
+ P + FH A+ + PEN I G C+ + +I+G +QQQN+ +
Sbjct: 363 PPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHL 422
Query: 426 IYDLNVPALRFGSENC 441
+YD+ L F C
Sbjct: 423 LYDIGNSFLSFVPAPC 438
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 128/435 (29%), Positives = 198/435 (45%), Gaps = 26/435 (5%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
G +++LI SP+SPLYPGNL E+I + A ++ SM N + + P+
Sbjct: 13 GLTMELIHKDSPQSPLYPGNLPPGEQILQPAACPFAGLHHQTSMMSTNK-AVMNRMMSPL 71
Query: 91 AK--QDLFYSVEVNIG--------TPMKPQHLLFDTASSLVWTQCQPCIR----CFDQTT 136
+ +V +G T K + DT + L W QC+ C CF
Sbjct: 72 TSYGDPFLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKD 131
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG 196
P + S +Y + C+ P +C+ G C Y Y G T G + ETF F +G
Sbjct: 132 PPYTSSQSKSYKPVSCNQHSFCEPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHG 191
Query: 197 -FTFVPRLAFGCSND--NSGFAF---GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
T + ++FGCS D N +AF +SG+LG P S +QL + G FSYC+
Sbjct: 192 KHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCIT 251
Query: 251 REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM 310
+ ++FG+ V+ ++L+TT I+ ++++LL IS+ + +
Sbjct: 252 ANNTHNTYLRFGKHV-VKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVR 310
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY--DS 368
+DG+ G IID GT T + + TL L S + D CY D+
Sbjct: 311 KDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDA 370
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLII 426
K P +TFHL+ AD V+PE ++ G+ FC+++ D +I+GA+QQ +
Sbjct: 371 GRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSKTIIGAYQQMKQKFV 430
Query: 427 YDLNVPALRFGSENC 441
YD L FG E+C
Sbjct: 431 YDTKARVLSFGPEDC 445
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 135/445 (30%), Positives = 204/445 (45%), Gaps = 35/445 (7%)
Query: 16 FSVLFLTHFTSSEST------GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARAN 69
FS+LFL S S GF+++LI SP+SP+Y + + +RI S R
Sbjct: 5 FSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHRNT 64
Query: 70 YMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI 129
+ E + P+ Y VE+++GTP + DT S ++WTQC+PC
Sbjct: 65 VVL---------ESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCS 115
Query: 130 RCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQ-NGKCVYTRRYHVGDVTRGLAS 185
C+ Q P+FDP STTY + C P+C C + +C+Y+ Y ++G +
Sbjct: 116 NCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLA 175
Query: 186 RETFAFPVRNGFTFV-PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
+T +G PR GC +DN+G F +SGI+G P SL +QL G
Sbjct: 176 VDTVTMQSTSGRPVAFPRTVIGCGHDNAG-TFNANVSGIVGLGRGPASLVTQLGPATGGK 234
Query: 245 FSYCLV----REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHI 299
FSYCL+ ++ + FG +A+V +TPI S FY L L +S+G
Sbjct: 235 FSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTK 294
Query: 300 VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
FP GA + G IID+GT +T++ + + Q + SL + P S+
Sbjct: 295 FNFPEGASKL--GGESNIIIDSGTTLTYLPSALLNSFGSAISQSM-SLPHAQDP---SEF 348
Query: 360 FDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSIL--GA 417
DYC+ + P +T H + AD +Q EN+ F+ C+A P +I G
Sbjct: 349 LDYCFATTTDDYEMPPVTMHFEGADVPLQRENL-FVRLSDDTICLAFGSFPDDNIFIYGN 407
Query: 418 WQQQNMLIIYDLNVPALRFGSENCA 442
Q N L+ YD+ A+ F +C
Sbjct: 408 IAQSNFLVGYDIKNLAVSFQPAHCG 432
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 134/420 (31%), Positives = 194/420 (46%), Gaps = 26/420 (6%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYM-ASMSKPNAFQELEDIHLP 89
GFS+++I S SP + +Q +R+ S RAN++ S PN+ + L
Sbjct: 28 GFSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNSPETTVISALG 87
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
Y + ++GTP + DT S ++W QCQPC +C++QTTPIFD S TY
Sbjct: 88 E------YLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKT 141
Query: 150 IPCDDPLCRS---PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAF 205
+PC C+S F C+Y+ Y G + G S ET NG P
Sbjct: 142 LPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVI 201
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME-ATSVIKFGRD 264
GC N+ K SGI+G P+SL +QL G FSYCLV + A+S + FG
Sbjct: 202 GCGRYNA-IGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNA 260
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRF-PPGAFDIMRDGTGGFIIDTGT 323
A V R +TP+ + ++L L S+GR+ + F PG+ G G IID+GT
Sbjct: 261 AVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGS-----GGKGNIIIDSGT 315
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY--DSSFKAYPSMTFHLQ 381
+T + NG Y L + + L R R P +Q CY+ D + P +T H
Sbjct: 316 TLTALPNGVYSKLEAAVAKTVI-LQRVRDP---NQVLGLCYKVTPDKLDASVPVITAHFS 371
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
AD + N F++ C A Q ++ G QQN+L+ YDL + + F +C
Sbjct: 372 GADVTLNAINT-FVQVADDVVCFAFQPTETGAVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 135/422 (31%), Positives = 192/422 (45%), Gaps = 42/422 (9%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMA------KQDLFYSVEVNI 103
LS+ E I + SKARA ++++ F + P DL Y V++ I
Sbjct: 43 QLSRPELIRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAI 102
Query: 104 GTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPF-- 161
GTP +P L DT S L+WTQC PC C Q P+F P S +Y + C LC
Sbjct: 103 GTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHH 162
Query: 162 KCQN-GKCVYTRRYHVGDVTRGLASRETFAFP----VRNGFTFVPRLAFGCSNDNSGFAF 216
C+ C Y Y G +T G+ + E F F T VP L FGC + N G
Sbjct: 163 SCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LGFGCGSVNVGSLN 221
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRD---- 271
G SGI+GF +PLSL SQL R FSYCL S + FG +D D
Sbjct: 222 NG--SGIVGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGR 276
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++TTP+L S P F Y+H +++G +R P AF + DG+GG I+D+GT +T +
Sbjct: 277 VQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPA 336
Query: 331 GPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRYDSSFK--------AYPSMTFHL 380
+++ + Q L R+P+ + E C+ ++++ P M H
Sbjct: 337 AVLAEVVRAFRQQL------RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHF 390
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
Q AD + N + RGR C+ + D S +G QQ+M ++YDL L
Sbjct: 391 QGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPA 450
Query: 440 NC 441
C
Sbjct: 451 RC 452
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 194/402 (48%), Gaps = 34/402 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF-----YSVEVNIG 104
+++E + +M S+ARA A P+ + P+A Y + IG
Sbjct: 43 GFTRNELLRRMVLRSRARA---AKQLCPSRSGTPVRVTAPVASGSHVVGYTEYLIHFGIG 99
Query: 105 TPMKPQH--LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--P 160
TP +PQ L DT S +VWTQC+PC CF Q P FD AS T + C DP+CR+ P
Sbjct: 100 TP-RPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPICRALRP 158
Query: 161 FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGK 219
C G C Y Y VT G ++++F F + G VP L FGC N+G F
Sbjct: 159 HACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNTG-NFHSN 217
Query: 220 ISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPI 277
+GI GF PLSL QL FSYC + E ++T V G AD R T PI
Sbjct: 218 ETGIAGFGRGPLSLPRQLG---VSSFSYCFTTIFESKSTPVFLGGAPADGLRAH-ATGPI 273
Query: 278 LLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
L + P ++YL L I++G+ + P AF + DG+GG IID+GT +T +
Sbjct: 274 LSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVF 333
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQE-----FDYCYRYDSSFKAYPSMTFHLQEADYIVQ 388
++L ++ + + YN + E F D+S P MT HL+ AD+ +
Sbjct: 334 RSL---WEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLHLEGADWELP 390
Query: 389 PENMYFIEPDRGRFCVAI-QDDPKYSILGAWQQQNMLIIYDL 429
EN PD + CV + D +++G +QQQNM I++DL
Sbjct: 391 RENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDL 432
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 137/428 (32%), Positives = 194/428 (45%), Gaps = 48/428 (11%)
Query: 50 NLSQSERIHKMFEISKARANYMA-----SMSKPNAFQELEDIH----LPM-AKQDLFYSV 99
+S+ E I + + SKARA ++ S P + + H +P+ DL Y +
Sbjct: 46 QMSRRELIRRAMQRSKARAAALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDLEYLI 105
Query: 100 EVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS 159
++ IGTP +P L DT S L+WTQC PC C Q P+F P AS++Y + C LC
Sbjct: 106 DLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCND 165
Query: 160 PF--KCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
CQ C Y Y G T G+ + E F F +G L FGC N G
Sbjct: 166 ILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMNVGSLN 225
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFG-------RDADVR 268
G SGI+GF PLSL SQL R FSYCL S + FG D
Sbjct: 226 NG--SGIVGFGRDPLSLVSQLSIR---RFSYCLTPYTSTRKSTLMFGSLSDGVFEGDDAA 280
Query: 269 RRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
++TT +L S P F Y+ +++G +R P AF + DG+GG I+D+GT +T
Sbjct: 281 TGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTL 340
Query: 328 IRNGPYQTLMQRYDQILRSLGRQ-RIPYNASQEFD--YCY----------RYDSSFKAYP 374
P L ++LR+ Q R+P+ +S D C+ ++ + P
Sbjct: 341 F---PAAVLT----EVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVP 393
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-SILGAWQQQNMLIIYDLNVPA 433
M FH Q AD + N +P RG C+ + D + +G + QQ+M ++YDL
Sbjct: 394 RMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLEAET 453
Query: 434 LRFGSENC 441
L F C
Sbjct: 454 LSFAPAQC 461
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 134/446 (30%), Positives = 219/446 (49%), Gaps = 44/446 (9%)
Query: 16 FSVLFLTHFTSSESTG--FSLKLIPIFSPESPLYPGNLSQSERIHKMFE--ISKARANYM 71
S + + H +++E FS+ LI SP+SPLY + + +ER+ + F +S + A+
Sbjct: 17 LSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDRFFRRFMSFSEASIS 76
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC 131
+ +P P++ + Y ++++IGTP + ++DT S L+WTQC PC+ C
Sbjct: 77 PNTPEP-----------PVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSC 125
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGK--CVYTRRYHVGDVTRGLASRE 187
+ Q P+FDP ST++ E+ C+ CR C + C ++ Y G + +G+ + E
Sbjct: 126 YKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATE 185
Query: 188 TFAFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG--L 244
T +G T + + FGC ++NSG F G+ G PLSL+SQ+ + +
Sbjct: 186 TLTLNSNSGQPTSILNIVFGCGHNNSG-TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRK 244
Query: 245 FSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVR 301
FS CLV + TS I FG +A+V D+ +TP++ D ++++ L IS+G +
Sbjct: 245 FSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFP 304
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
F + + G ID GTP T + Y L+Q ++ IP Q+ D
Sbjct: 305 FSSSSPMATK---GNVFIDAGTPPTLLPRDFYNRLVQGV--------KEAIPMEPVQDPD 353
Query: 362 ----YCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD-DPKYSILG 416
CYR ++ P +T H AD ++P N FI P G +C A+Q D I G
Sbjct: 354 LQPQLCYR-SATLIDGPILTAHFDGADVQLKPLNT-FISPKEGVYCFAMQPIDGDTGIFG 411
Query: 417 AWQQQNMLIIYDLNVPALRFGSENCA 442
+ Q N LI +DL+ + F + +C
Sbjct: 412 NFVQMNFLIGFDLDGKKVSFKAVDCT 437
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 173/357 (48%), Gaps = 34/357 (9%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--P 160
IGTP + DT S LVWTQC+PC+ CF Q+TP+FDP +S+TY+ +PC C
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 161 FKCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGK 219
KC + KC YT Y T+G+ + ETF +P + FGC + N G F +
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVFGCGDTNEGDGF-SQ 287
Query: 220 ISGILGFNASPLSLSSQLRNRIQGL--FSYCLVR-EMEATSVIKFGRDADV-----RRRD 271
+G++G PLSL SQL GL FSYCL + S + G A +
Sbjct: 288 GAGLVGLGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 342
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++TTP++ + +P F Y+ L I++G + P AF + DGTGG I+D+GT +T++
Sbjct: 343 VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 402
Query: 331 GPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRYDSSF---KAYPSMTFHLQ-EAD 384
Y+ L + + + +P + D C+R + P + FH AD
Sbjct: 403 QGYRALKKAF------AAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGAD 456
Query: 385 YIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ EN ++ G C+ + SI+G +QQQN +YD+ L F C
Sbjct: 457 LDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 129/421 (30%), Positives = 190/421 (45%), Gaps = 25/421 (5%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GFS+++I S SP + +Q +R+ S RAN+ K +
Sbjct: 28 GFSVEMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRANHFHKAHKA--------AKATI 79
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ D Y + ++G P + + DT S ++W QC+PC +C++QTT IFDP S TY +
Sbjct: 80 TQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKIL 139
Query: 151 PCDDPLCRS--PFKCQNGK---CVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLA 204
P C+S C + C YT Y G ++G S ET NG + R
Sbjct: 140 PFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTV 199
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL---FSYCLVREMEATSVIKF 261
GC +N+ +F GK SGI+G P+SL +QLR R + FSYCL +S + F
Sbjct: 200 IGCGRNNT-VSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNF 258
Query: 262 GRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
G A V +TPI+ D + +YL L S+G + + F +F G IID+
Sbjct: 259 GDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGN--IIIDS 316
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
GT +T + N Y L ++ L R + P ++ CYR P + H
Sbjct: 317 GTTLTLLPNDIYSKLESAVADLVE-LDRVKDPL---KQLSLCYRSTFDELNAPVIMAHFS 372
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
AD + N FIE ++G C+A I G QQN L+ YDL + F +C
Sbjct: 373 GADVKLNAVNT-FIEVEQGVTCLAFISSKIGPIFGNMAQQNFLVGYDLQKKIVSFKPTDC 431
Query: 442 A 442
+
Sbjct: 432 S 432
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 134/438 (30%), Positives = 202/438 (46%), Gaps = 44/438 (10%)
Query: 15 YFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASM 74
YFS+ F+ + + + GFS++LI S +SPLY ++ + I S RAN+
Sbjct: 11 YFSLCFIISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRANHFYKT 70
Query: 75 SKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ 134
+ N Q +P + Y + ++GTP + + DT S +VW QC+PC C++Q
Sbjct: 71 ALTNTPQSTV---IPDHGE---YLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQ 124
Query: 135 TTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR 194
TTP F P S+TY IPC LC+S +G S +T
Sbjct: 125 TTPKFKPSKSSTYKNIPCSSDLCKSG-------------------QQGNLSVDTLTLESS 165
Query: 195 NGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--- 250
G P+ GC DN+ +F G SGI+G P SL +QL + I FSYCL+
Sbjct: 166 TGHPISFPKTVIGCGTDNT-VSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNP 224
Query: 251 REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM 310
E TS + FG A V + +TPI+ D +YL L S+G + F G+ +
Sbjct: 225 VESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEF-EGSSNGG 283
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF 370
+G IID+GT +T I Y L ++++ L R P ++ F+ CY S
Sbjct: 284 HEGN--IIIDSGTTLTVIPTDVYNNLESAVLELVK-LKRVNDP---TRLFNLCYSVTSDG 337
Query: 371 KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY------SILGAWQQQNML 424
+P +T H + AD + P + F++ G C+A + SI G QQN+L
Sbjct: 338 YDFPIITTHFKGADVKLHPIST-FVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLL 396
Query: 425 IIYDLNVPALRFGSENCA 442
+ YDL + F +C+
Sbjct: 397 VGYDLQQKIVSFKPTDCS 414
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 183/385 (47%), Gaps = 42/385 (10%)
Query: 84 EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRA 143
+D+ +P+ + + +++++GTP P + DT S LVWTQC+PC+ CF+QTTP+FDP A
Sbjct: 103 KDLQVPVHAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAA 162
Query: 144 STTYSEIPCDDPLCRSPFKCQNGKCV----------YTRRYHVGDVTRGLASRETFAFPV 193
S+TY+ +PC LC YT Y T+G+ + ETF
Sbjct: 163 SSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLAR 222
Query: 194 RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCLVRE 252
+ VP +AFGC + N G F + +G++G PLSL SQL +R FSYCL
Sbjct: 223 QK----VPGVAFGCGDTNEGDGF-TQGAGLVGLGRGPLSLVSQLGIDR----FSYCLTSL 273
Query: 253 MEATS------VIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPG 305
+A G A +TTP++ + +P F Y+ L +++G + P
Sbjct: 274 DDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSS 333
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE--FDYC 363
AF I DGTGG I+D+GT +T++ Y+ L + + + +P + E D C
Sbjct: 334 AFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAF------VAHMSLPTVDASEIGLDLC 387
Query: 364 YR-----YDSSFKA-YPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILG 416
++ D + P + H AD + EN ++ G C+ + SI+G
Sbjct: 388 FQGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLSIIG 447
Query: 417 AWQQQNMLIIYDLNVPALRFGSENC 441
+QQQN +YD+ L F C
Sbjct: 448 NFQQQNFQFVYDVAGDTLSFAPAEC 472
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 127/365 (34%), Positives = 177/365 (48%), Gaps = 31/365 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ IGTP + DT S L+WTQC PC+ C DQ TP F P S TY +PC PL
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 157 CRS---PFKCQNGKCVYTRRYHVGD--VTRGLASRETFAFPVRNGF-TFVPRLAFGCSND 210
C + P Q CVY +Y+ GD T G+ + ETF F N V +AFGC N
Sbjct: 152 CAALPYPACFQRSVCVY--QYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKF-------G 262
NSG SG++G PLSL SQL FSYCL + S + F G
Sbjct: 210 NSGQL--ANSSGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNG 264
Query: 263 RDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+A +++TP++++ P Y + L IS+G+ + P F I DGTGG ID+
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ-EFDYCYRY---DSSFKAYPSMT 377
GT +T+++ Y + +LR L P N ++ + C+ + S P M
Sbjct: 325 GTSLTWLQQDAYDAVRHELVSVLRPLP----PTNDTEIGLETCFPWPPPPSVAVTVPDME 380
Query: 378 FHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
H A+ V PEN I+ G C+A+ +I+G +QQQNM I+YD+ L F
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSF 440
Query: 437 GSENC 441
C
Sbjct: 441 VPAPC 445
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 127/365 (34%), Positives = 178/365 (48%), Gaps = 31/365 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ IGTP + DT S L+WTQC PC+ C DQ TP F P S TY +PC PL
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 157 CRS---PFKCQNGKCVYTRRYHVGD--VTRGLASRETFAFPVRNGF-TFVPRLAFGCSND 210
C + P Q CVY +Y+ GD T G+ + ETF F N V +AFGC N
Sbjct: 152 CAALPYPACFQRSVCVY--QYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKF-------G 262
NSG SG++G PLSL SQL FSYCL + S + F G
Sbjct: 210 NSGQL--ANSSGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNG 264
Query: 263 RDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+A +++TP++++ P Y + L IS+G+ + P F I DGTGG ID+
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ-EFDYCYRY---DSSFKAYPSMT 377
GT +T+++ Y + + +LR L P N ++ + C+ + S P M
Sbjct: 325 GTSLTWLQQDAYDAVRRELVSVLRPLP----PTNDTEIGLETCFPWPPPPSVAVTVPDME 380
Query: 378 FHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
H A+ V PEN I+ G C+A+ +I+G +QQQNM I+YD+ L F
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSF 440
Query: 437 GSENC 441
C
Sbjct: 441 VPAPC 445
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 142/458 (31%), Positives = 219/458 (47%), Gaps = 33/458 (7%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
M V L L+ FF FS+ F+ + S GFS++LI S +SP Y ++ + +
Sbjct: 1 MNTVSFLTLSFFFLCFSI----SFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDA 56
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
S R N+ S N+ + + + D Y + ++GTP + + DT S +
Sbjct: 57 VHRSINRVNH----SNKNSLASTPESTVISYEGD--YIMSYSVGTPPIKSYGIVDTGSDI 110
Query: 121 VWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGK-CVYTRRYHVG 177
VW QC+PC +C++QTTP F+P S++Y I C LC+S C + K C Y+ Y
Sbjct: 111 VWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQ 170
Query: 178 DVTRGLASRETFAFPVRNG--FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSS 235
++G S ET G +F P+ GC +N G +F SG++G P SL +
Sbjct: 171 SHSQGDLSLETLTLESTTGRPVSF-PKTVIGCGTNNIG-SFKRVSSGVVGLGGGPASLIT 228
Query: 236 QLRNRIQGLFSYCLVR------EME-ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYL 288
QL I G FSYCLVR M +S + FG A V ++ +TPI+ D +YL
Sbjct: 229 QLGPSIGGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYL 288
Query: 289 HLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
+ S+G V F + + G IID+ T VTF+ + Y L I+ +
Sbjct: 289 TIEAFSVGDKRVEFAGSSKGVEE---GNIIIDSSTIVTFVPSDVYTKLNS---AIVDLVT 342
Query: 349 RQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ 407
+R+ + +Q+F CY S + +P MT H + AD ++ N F+E R C A
Sbjct: 343 LERVD-DPNQQFSLCYNVSSDEEYDFPYMTAHFKGADILLYATNT-FVEVARDVLCFAFA 400
Query: 408 DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCANGR 445
+I G++ QQ+ ++ YDL + F S +C G+
Sbjct: 401 PSNGGAIFGSFSQQDFMVGYDLQQKTVSFKSVDCTEGQ 438
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 124/379 (32%), Positives = 191/379 (50%), Gaps = 43/379 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--------RCFDQTTPIFDPRASTTYS 148
Y + ++IGTP + DT S L+WTQC PC +CF Q+ +++P +STT+
Sbjct: 87 YIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFG 146
Query: 149 EIPCDDPL--CRS---PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG--FTFVP 201
+PC+ PL C + P C+Y + Y G T G+ S ETF F + VP
Sbjct: 147 VLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTG-WTAGVQSVETFTFGSSSTPPAVRVP 205
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVI 259
+AFGCSN +S G +G++G +SL SQL G FSYCL ++ +TS +
Sbjct: 206 NIAFGCSNASSNDWNGS--AGLVGLGRGSMSLVSQLG---AGAFSYCLTPFQDANSTSTL 260
Query: 260 KFGRDADVRRRD---LETTPILL----SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
G A + + +TP + + + ++YL+L IS+G + PP AF + D
Sbjct: 261 LLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFSLRAD 320
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY----NASQEFDYCYRYDS 368
GTGG IID+GT +T + + Y Q+ +RSL R+P + S D C+ +
Sbjct: 321 GTGGLIIDSGTTITTLVDSAY----QQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKA 376
Query: 369 SF--KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP--KYSILGAWQQQNML 424
S A PSMT H + +V P Y I G +C+A+++ S++G +QQQN+
Sbjct: 377 STPPPAMPSMTLHFEGGADMVLPVENYMIL-GSGVWCLAMRNQTVGAMSMVGNYQQQNIH 435
Query: 425 IIYDLNVPALRFGSENCAN 443
++YD+ L F C++
Sbjct: 436 VLYDVRKETLSFAPAVCSS 454
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 136/453 (30%), Positives = 207/453 (45%), Gaps = 32/453 (7%)
Query: 4 VQALPLAAFFSYFSVLFLTHFTS---SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
++ + FF+ V FL + GFS+ LI SP SP + + +Q+ER+
Sbjct: 1 MEGFGVKIFFNVVVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDA 60
Query: 61 FEISKARANYMASMSKPNAFQE--LEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTAS 118
F S +R +P A ++ +P A + Y + + IGTP P + DT S
Sbjct: 61 FRRSVSRVGRF----RPTAMTSDGIQSRIVPSAGE---YLMNLYIGTPPVPVIAIVDTGS 113
Query: 119 SLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK----CQNGKCVYTRRY 174
L WTQC+PC C+ Q P+FDP+ S+TY + C C + K + KC + Y
Sbjct: 114 DLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSY 173
Query: 175 HVGDVTRGLASRETFAFPVRNG--FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
G T G + ET G +F P AFGC + +SG F SGI+G LS
Sbjct: 174 ADGSFTGGNLASETLTVDSTAGKPVSF-PGFAFGCGH-SSGGIFDKSSSGIVGLGGGELS 231
Query: 233 LSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLH 289
L SQL++ I GLFSYCL+ + +S I FG V +TP++ +YL
Sbjct: 232 LISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLT 291
Query: 290 LLEISIGRHIVRFPPGAFDIMRD-GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
L IS+G+ R P + + G I+D+GT TF+ Y L + + S+
Sbjct: 292 LEGISVGKK--RLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEK---SVANSIK 346
Query: 349 RQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD 408
+R+ + + F CY + A P +T H ++A+ +QP N F+ C +
Sbjct: 347 GKRV-RDPNGIFSLCYNTTAEINA-PIITAHFKDANVELQPLNT-FMRMQEDLVCFTVAP 403
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+LG Q N L+ +DL + F + +C
Sbjct: 404 TSDIGVLGNLAQVNFLVGFDLRKKRVSFKAADC 436
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 122/370 (32%), Positives = 186/370 (50%), Gaps = 39/370 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP + DT S L+WTQC PC +CF Q ++P +STT+ +PC+
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSS 147
Query: 156 --LCRS---PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF---PVRNGFTFVPRLAFGC 207
+C + P C+Y + Y G T G+ S ETF F P T VP +AFGC
Sbjct: 148 VSMCAALAGPSPPPGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQ--TRVPGIAFGC 204
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDA 265
SN +S G +G++G +SL SQL G+FSYCL ++ +TS + G A
Sbjct: 205 SNASSDDWNGS--AGLVGLGRGSMSLVSQLG---AGMFSYCLTPFQDANSTSTLLLGPSA 259
Query: 266 DVRRRDLETTPILLSD----LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ + TTP + S + ++YL+L ISIG + PP AF + DGTGG IID+
Sbjct: 260 ALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDS 319
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY---NASQEFDYCYRYDSSFK---AYPS 375
GT +T + + YQ + + ++ +P + S D C+ S + PS
Sbjct: 320 GTTITSLVDAAYQQVRAAIESLV------TLPVADGSDSTGLDLCFALTSETSTPPSMPS 373
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP--KYSILGAWQQQNMLIIYDLNVPA 433
MTFH AD ++ +N + G +C+A+++ S G +QQQN+ ++YD++
Sbjct: 374 MTFHFDGADMVLPVDNYMIL--GSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEET 431
Query: 434 LRFGSENCAN 443
L F C+
Sbjct: 432 LSFAPAKCST 441
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 122/424 (28%), Positives = 200/424 (47%), Gaps = 21/424 (4%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI 86
+ + GF+ +L+ SP+SPLY + +R +K S +R ++ + + +E+E
Sbjct: 26 AHNAGFTTELVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESE 85
Query: 87 HLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTT 146
+ + Y + +++GTP + DT S L+WTQC PC +C+ Q P+FDP++S T
Sbjct: 86 IIANGGE---YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKT 142
Query: 147 YSEIPCDDPLCRS---PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVP 201
Y ++ CD C++ C + + C Y+ Y T G + +T P NG + P
Sbjct: 143 YRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFP 202
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV----REMEATS 257
+ GC N+G F K SGI+G P+SL SQ+ + + G FSYCLV +S
Sbjct: 203 KTVIGCGRRNNG-TFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSS 261
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ FGR+A V +++TP++ + +YL L +S+G + F G
Sbjct: 262 KLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEF---GGSSFGGSEGNI 318
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT 377
IID+GT +T P + + ++ +AS +CYR K P +T
Sbjct: 319 IIDSGTSLTLF---PVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLKV-PVIT 374
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
H AD ++Q N + + D C+A +I G Q N LI YD+ ++ F
Sbjct: 375 AHFNGADVVLQTLNTFILISDD-VLCLAFNSTQSGAIFGNVAQMNFLIGYDIQGKSVSFK 433
Query: 438 SENC 441
+C
Sbjct: 434 PTDC 437
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 132/446 (29%), Positives = 217/446 (48%), Gaps = 44/446 (9%)
Query: 16 FSVLFLTHFTSSESTG--FSLKLIPIFSPESPLYPGNLSQSERIHKMFE--ISKARANYM 71
S + + H +++E FS+ LI SP+SPLY + + +ER+ + F +S + A+
Sbjct: 17 LSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDRFFRRFMSFSEASIS 76
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC 131
+ +P P++ + Y ++++IGTP + ++DT S L+WTQC PC+ C
Sbjct: 77 PNTPEP-----------PVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSC 125
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGK--CVYTRRYHVGDVTRGLASRE 187
+ Q P+FDP ST++ E+ C+ CR C + C ++ Y G + +G+ + E
Sbjct: 126 YKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATE 185
Query: 188 TFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG--L 244
T +G + + FGC ++NSG F G+ G PLSL+SQ+ + +
Sbjct: 186 TLTLNSNSGQPXSIXNIVFGCGHNNSG-TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRK 244
Query: 245 FSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVR 301
FS CLV + TS I FG +A+V + +TP++ D ++++ L IS+G +
Sbjct: 245 FSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFP 304
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
F + + G ID GTP T + Y L+Q ++ IP Q+ D
Sbjct: 305 FSSSSPMATK---GNVFIDAGTPPTLLPRDFYNRLVQGV--------KEAIPMEPVQDPD 353
Query: 362 ----YCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD-DPKYSILG 416
CYR ++ P +T H AD ++P N FI P G +C A+Q D I G
Sbjct: 354 LQPQLCYR-SATLIDGPILTAHFDGADVQLKPLNT-FISPKEGVYCFAMQPIDGDTGIFG 411
Query: 417 AWQQQNMLIIYDLNVPALRFGSENCA 442
+ Q N LI +DL+ + F + +C
Sbjct: 412 NFVQMNFLIGFDLDGKKVSFKAVDCT 437
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 135/447 (30%), Positives = 207/447 (46%), Gaps = 34/447 (7%)
Query: 13 FSYFSVLFLTHFTSSEST----GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
F ++ L+ +S E+ GFS+ LI SP SP Y +L+ SERI S +R
Sbjct: 6 FMILALFSLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMSRL 65
Query: 69 NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC 128
++ N E + +P + Y + IG+P + + DT SSL+W QC PC
Sbjct: 66 QRVSHFLDENKLPE--SLLIPDKGE---YLMRFYIGSPPVERLAMVDTGSSLIWLQCSPC 120
Query: 129 IRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPFKCQN-GKCVYTRRYHVGDVTRGL 183
CF Q TP+F+P S+TY CD C S C G+C+Y Y + G+
Sbjct: 121 HNCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGI 180
Query: 184 ASRETFAFPVRNGFTFV--PRLAFGCSNDNSGFAF-GGKISGILGFNASPLSLSSQLRNR 240
ET +F G V P FGC DN+ + K+ GI G A PLSL SQL +
Sbjct: 181 LGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ 240
Query: 241 IQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRH 298
I FSYCL+ + +TS +KFG +A + + +TP+++ P +Y L+L ++IG+
Sbjct: 241 IGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQK 300
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
+V G D G +ID+GTP+T++ N Y + + L Q +P
Sbjct: 301 VVS--TGQTD------GNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLP----S 348
Query: 359 EFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILG 416
C+ ++ A P + F A ++P+N+ D C+A+ S+ G
Sbjct: 349 PLKTCFPNRANL-AIPDIAFQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFG 407
Query: 417 AWQQQNMLIIYDLNVPALRFGSENCAN 443
+ Q + + YDL + F +CA
Sbjct: 408 SIAQYDFQVEYDLEGKKVSFAPTDCAK 434
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 134/454 (29%), Positives = 211/454 (46%), Gaps = 31/454 (6%)
Query: 3 HVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHK--M 60
H A AA S + L T + + S+ F++ LI SP SP Y ++++S+ I M
Sbjct: 2 HALAFFFAASCSLLATLPFTEPSKTPSS-FTIDLIHHDSPPSPFYNSSMTRSQLIRNAAM 60
Query: 61 FEISKARANYMASMSKPNAFQEL--EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTAS 118
IS+A ++ N +E E I +P Y + + IGTP + + DT S
Sbjct: 61 RSISRANQLSLSLSHSLNQLKESSPEPIIIPNNGN---YLMRIYIGTPSVERLAIADTGS 117
Query: 119 SLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPFKCQN-GKCVYT 171
L W QC PC +CF Q TP++DP S+T++ +PCD C S + C + G C+Y
Sbjct: 118 DLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGDCIYA 177
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFA-FGGKISGILGFNASP 230
Y + G S ++ + + ++ FGC N A GK +GI+G A P
Sbjct: 178 YTYGDNSYSYGGLSSDSIRLMLLQ-LHYNSKICFGCGFQNKFTADKSGKTTGIVGLGAGP 236
Query: 231 LSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLH 289
LSL SQL + I FSYCL+ + S +KFG A V+ + +TP+++ P +YL+
Sbjct: 237 LSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLN 296
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L I++G V+ G D G IID+G+ +T++ Y + + +
Sbjct: 297 LEGITVGAKTVK--TGQTD------GNIIIDSGSTLTYLEESFYNEFVSLVKETVAVEED 348
Query: 350 QRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPEN-MYFIEPDRGRFCVAIQD 408
Q IPY FD+C+ Y P + FH D +++P N + IE + V
Sbjct: 349 QYIPY----PFDFCFTYKEGMSTPPDVVFHFTGGDVVLKPMNTLVLIEDNLICSTVVPSH 404
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+I G Q + + YD+ + F +C+
Sbjct: 405 FDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDCS 438
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 175/365 (47%), Gaps = 29/365 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +E+ IGTP P L DT S L WTQCQPC CF Q TPI+D S+++S +PC
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASAT 152
Query: 157 CRSPFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C + +N C Y Y G + G+ ET FP G + V +AFGC DN
Sbjct: 153 CLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVS-VGGIAFGCGVDN 211
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR--EMEATSVIKFGRDADVRR 269
G ++ +G +G LSL +QL G FSYCL S + FG A++
Sbjct: 212 GGLSY--NSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLGSPVLFGALAELAA 266
Query: 270 ----RDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+++TP++ S P +Y LE IS+G + P G FD+ DG+GG I+D+GT
Sbjct: 267 PSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTT 326
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFHLQ 381
TF+ ++ ++ +L RQ + NAS C+ + A P M H
Sbjct: 327 FTFLVESAFRVVVDHVAGVL----RQPV-VNASSLDSPCFPAATGEQQLPAMPDMVLHFA 381
Query: 382 -EADYIVQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGS 438
AD + +N + FC+ I P SILG +QQQN+ +++D+ V L F
Sbjct: 382 GGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQLSFMP 441
Query: 439 ENCAN 443
+C
Sbjct: 442 TDCGK 446
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 126/422 (29%), Positives = 197/422 (46%), Gaps = 26/422 (6%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF++ LI SP SP Y + +RI+ S +R ++ ++ + + + +
Sbjct: 31 GFTVDLIHRDSPLSPFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAAESDVTS 90
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ + Y + +++GTP + DT S L+WTQC+PC RC+ Q P+FDP++S TY +
Sbjct: 91 NRGE--YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDF 148
Query: 151 PCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVPRLAFGC 207
CD C C C Y Y T G + +T G P+ GC
Sbjct: 149 SCDARQCSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGC 208
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIKFGRD 264
++N G F K SGI+G A PLSL SQ+ + + G FSYCLV +S + FG +
Sbjct: 209 GHENDG-TFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSN 267
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
A V +++TP+L S+ FY LE +S+G ++F + + G G IID+GT
Sbjct: 268 AVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSS---LGTGEGNIIIDSGT 324
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQ---RIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
+T + + + L ++G Q R + S CY S K P++T H
Sbjct: 325 TLTIVPDDFFSNLST-------AVGNQVEGRRAEDPSGFLSVCYSATSDLKV-PAITAHF 376
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSE 439
AD ++P N F++ C+A SI G Q N L+ Y++ +L F
Sbjct: 377 TGADVKLKPINT-FVQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPT 435
Query: 440 NC 441
+C
Sbjct: 436 DC 437
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 133/450 (29%), Positives = 200/450 (44%), Gaps = 29/450 (6%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
M+H L L L+ F+ + +GFS+++I S SP Y +Q +R+
Sbjct: 1 MSHSSCLTLVLL-----CLYNICFSEALKSGFSVEIIHRDSSRSPFYRATETQFQRVTNA 55
Query: 61 FEISKARANYMASMSK-PNAFQELEDIHLPMAK-QDLFYSVEVNIGTPMKPQHLLFDTAS 118
S RAN+ +S NA + P+ D Y + ++GTP P + + DTAS
Sbjct: 56 VRRSMNRANHFNQISVYSNAVES------PVTLLDDGDYLMSYSLGTPPFPVYGIVDTAS 109
Query: 119 SLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGK---CVYTRR 173
++W QCQ C C++ T+P+FDP S TY +PC C+S C + + C +T
Sbjct: 110 DIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVN 169
Query: 174 YHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
Y G ++G ET N F PR GC N+ +F GI+G P+S
Sbjct: 170 YKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIR-NTNVSFDSI--GIVGLGGGPVS 226
Query: 233 LSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE 292
L QL + I FSYCL + +S +KFG A V +T I+ D + +YL L
Sbjct: 227 LVPQLSSSISKKFSYCLAPISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEA 286
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
S+G + + F + G G IID+GT T + + Y L +++ L R
Sbjct: 287 FSVGNNRIEFRSSS--SRSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVK-LERAED 343
Query: 353 PYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY 412
P ++F CY+ P +T H AD + N + + R C+A
Sbjct: 344 PL---KQFSLCYKSTYDKVDVPVITAHFSGADVKLNALNTFIVASHR-VVCLAFLSSQSG 399
Query: 413 SILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+I G QQN L+ YDL + F +C
Sbjct: 400 AIFGNLAQQNFLVGYDLQRKIVSFKPTDCT 429
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 132/365 (36%), Positives = 180/365 (49%), Gaps = 31/365 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + IGTP +P L DT S L+WTQCQPC CFDQ P FDP S+T S CD L
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 94
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C+ SP N CVYT Y VT G + F F V G + VP +AFGC
Sbjct: 95 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF-VGAGAS-VPGVAFGCG 152
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADV 267
N+G F +GI GF PLSL SQL+ G FS+C A S + AD+
Sbjct: 153 LFNNG-VFKSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADL 208
Query: 268 ---RRRDLETTPIL---LSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
+ ++TTP++ ++ P +YL L I++G + P AF + +GTGG IID
Sbjct: 209 FSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAF-ALTNGTGGTIID 267
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFH 379
+GT +T + YQ + D+ + +P NA+ + C+ S K P + H
Sbjct: 268 SGTSITSLPPQVYQVVR---DEFAAQIKLPVVPGNATGHYT-CFSAPSQAKPDVPKLVLH 323
Query: 380 LQEADYIVQPENMYFIEPDRGR---FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
+ A + EN F PD C+AI + +I+G +QQQNM ++YDL L F
Sbjct: 324 FEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSF 383
Query: 437 GSENC 441
+ C
Sbjct: 384 VAAQC 388
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 192/405 (47%), Gaps = 24/405 (5%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
++ + + + S AR + S++ + + + D Y +E+ IGTP +
Sbjct: 45 TEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEMGIGTPTRYYS 104
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK--CQNGKCV 169
+ DT S L+WTQC PC+ C DQ TP FDP S TY + C P C + + C CV
Sbjct: 105 AILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCV 164
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y T G+ + ETF F +P ++FGC N N+G G SG++GF
Sbjct: 165 YQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLANG--SGMVGFGRG 222
Query: 230 PLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRD-----LETTPILLSDLR 283
LSL SQL + FSYCL + S + FG A + + +++TP +++
Sbjct: 223 SLSLVSQLGSP---RFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPAL 279
Query: 284 PHFY-LHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDTGTPVTFIRNGPYQTLMQRY- 340
P Y L++ IS+G +++ P F I DGTGG IID+GT +T++ Y + +
Sbjct: 280 PTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFA 339
Query: 341 DQILRSLGRQRIPYNASQEFDYCYRYDSSFK---AYPSMTFHLQEADYIVQPENMYFIEP 397
QI L + + D C+++ + P + H AD+ + +N ++P
Sbjct: 340 SQITLPL----LNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELPLQNYMLVDP 395
Query: 398 DR-GRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+A+ SI+G++Q QN ++YDL + F C
Sbjct: 396 STGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 135/456 (29%), Positives = 210/456 (46%), Gaps = 34/456 (7%)
Query: 4 VQALPLAAFFSYFSVLFLTHFTS---SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
++ + FF+ V FL H + GFS+ LI SP SP + + +++ER+
Sbjct: 1 MEVFGVKIFFNVVVVGFLFHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDA 60
Query: 61 FEISKARANYM--ASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTAS 118
F S +R ++M+ L +P A + Y + ++IGTP P + DT S
Sbjct: 61 FHRSASRVGRFRQSAMTSDGIQSRL----VPSAGE---YIMNLSIGTPPVPVIAIVDTGS 113
Query: 119 SLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS---PFKCQNG-KCVYTRRY 174
L WTQC+PC C+ Q P FDP+ S+TY + C C + C+NG KC + Y
Sbjct: 114 DLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSY 173
Query: 175 HVGDVTRGLASRETFAFPVRNG--FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
G T G + ET G +F P AFGC + SG F SGI+G + LS
Sbjct: 174 ADGSFTGGNLAVETLTVASTAGKPVSF-PGFAFGCVH-RSGGIFDEHSSGIVGLGVAELS 231
Query: 233 LSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLH 289
+ SQL++ I G FSYCL+ + +S I FGR V +TP+++ ++YL
Sbjct: 232 MISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLI 291
Query: 290 LLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
LE S+G+ + + G G I+D+GT T++ P + ++ + + S+
Sbjct: 292 TLEGFSVGKKRLSY-KGFSKKAEVEEGNIIVDSGTTYTYL---PLEFYVKLEESVAHSIK 347
Query: 349 RQRI--PYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
+R+ P S CY P +T H ++A+ +QP N F+ C +
Sbjct: 348 GKRVRDPNGISS---LCYNTTVDQIDAPIITAHFKDANVELQPWNT-FLRMQEDLVCFTV 403
Query: 407 QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
ILG Q N L+ +DL + F + +C
Sbjct: 404 LPTSDIGILGNLAQVNFLVGFDLRKKRVSFKAADCT 439
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 128/411 (31%), Positives = 190/411 (46%), Gaps = 35/411 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTP-MK 108
++ E + +M S+ARA + S A + + Y + ++IG P +
Sbjct: 45 GFTKRELLRRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSEYLIHLSIGAPRSQ 104
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC--RSPFKCQNG 166
P L DT S +VWTQC+PC CF Q P FD AS T + C DPLC S C
Sbjct: 105 PVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLH 164
Query: 167 KCVYTRRYHVGDVTRGLASRETFAFP--VRNGFTFVPRLAFGCSNDNSGFAFGGKISGIL 224
C Y Y G ++ G R++F F G VP + FGC N+G F +GI
Sbjct: 165 GCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAG-RFLQTETGIA 223
Query: 225 GFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDADVRRRDLETTPIL----L 279
GF PLSL SQL+ R FSYC EA +S + G D++ T PIL +
Sbjct: 224 GFGRGPLSLPSQLKVR---QFSYCFTTRFEAKSSPVFLGGAGDLKAH--ATGPILSTPFV 278
Query: 280 SDLRP-----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
L P H+ L +++G+ + P +I DG+G ID+GT +T + ++
Sbjct: 279 RSLPPGTDNSHYVLSFKGVTVGKTRLPVP----EIKADGSGATFIDSGTDITTFPDAVFR 334
Query: 335 TLMQRYDQILRSLGRQRIPYN-ASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
L + + + +P N + E D C+ +D A P + FHL+ AD+ + EN
Sbjct: 335 QLKSAF------IAQAALPVNKTADEDDICFSWDGKKTAAMPKLVFHLEGADWDLPRENY 388
Query: 393 YFIEPDRGRFCVAIQDDPKY--SILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + G+ CVA+ + +++G +QQQN I+YDL L C
Sbjct: 389 VTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAAGKLLLVPAQC 439
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 192/405 (47%), Gaps = 24/405 (5%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
++ + + + S AR + S++ + + + D Y +E+ IGTP +
Sbjct: 45 TEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEMGIGTPTRYYS 104
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK--CQNGKCV 169
+ DT S L+WTQC PC+ C DQ TP FDP S TY + C P C + + C CV
Sbjct: 105 AILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCV 164
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y T G+ + ETF F +P ++FGC N N+G G SG++GF
Sbjct: 165 YQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLANG--SGMVGFGRG 222
Query: 230 PLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRD-----LETTPILLSDLR 283
LSL SQL + FSYCL + S + FG A + + +++TP +++
Sbjct: 223 SLSLVSQLGSP---RFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPAL 279
Query: 284 PHFY-LHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDTGTPVTFIRNGPYQTLMQRY- 340
P Y L++ IS+G +++ P F I DGTGG IID+GT +T++ Y + +
Sbjct: 280 PTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFA 339
Query: 341 DQILRSLGRQRIPYNASQEFDYCYRYDSSFK---AYPSMTFHLQEADYIVQPENMYFIEP 397
QI L + + D C+++ + P + H AD+ + +N ++P
Sbjct: 340 SQITLPL----LNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELPLQNYMLVDP 395
Query: 398 DR-GRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+A+ SI+G++Q QN ++YDL + F C
Sbjct: 396 STGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 143/460 (31%), Positives = 211/460 (45%), Gaps = 37/460 (8%)
Query: 4 VQALPLAAFFSYFSVLFLTHFT------SSESTGFSLKLIPIFSPESPLYPGNLSQSERI 57
++ L F +V+F HF+ +S GFS LI SP SP Y + +Q +R+
Sbjct: 1 MEGFSLKFLFYTLAVIFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRL 60
Query: 58 HKMFEISKARANYM-ASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDT 116
K F S +RAN+ A+ N+ Q P+ + Y + +++GTP H + DT
Sbjct: 61 QKAFHRSISRANHFRANGVSTNSIQS------PVISNNGEYLMNISLGTPPVSMHGIADT 114
Query: 117 ASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNG-----KCVYT 171
S L+W QC+PC C++Q PIFDP S TY + C+ C S Q G C+Y+
Sbjct: 115 GSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKSC-SNLGGQGGCSDDNTCIYS 173
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASP 230
Y G T G + +T G VP++ FGC ++N G F SG++G P
Sbjct: 174 YSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGCGHNNGG-TFELHGSGLVGLGGGP 232
Query: 231 LSLSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY 287
LS+ SQLR I G FSYCLV + +S + FG V +TP+ +Y
Sbjct: 233 LSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYY 292
Query: 288 LHLLEISIGRHIVR---FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQIL 344
L L +S+G + F + G IID+GT +T + Y TL ++
Sbjct: 293 LTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLES---NVV 349
Query: 345 RSLGRQ--RIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRF 402
++G + R P N F CY S + P++T H AD ++P N F++ F
Sbjct: 350 SAIGGKPVRDPNNV---FSLCYSNLSGLR-IPTITAHFVGADLELKPLNT-FVQVQEDLF 404
Query: 403 CVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
C A+ +I G Q N L+ YDL + F +C
Sbjct: 405 CFAMIPVSDLAIFGNLAQMNFLVGYDLKSRTVSFKPTDCT 444
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 174/360 (48%), Gaps = 24/360 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +E+ IGTP P L DT S L WTQCQPC CF Q TP++DP AS+T+S +PC
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSAT 125
Query: 157 CRSPFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAF--PVRNGFTFVPRLAFGCSN 209
C ++ +N C Y Y G + G+ ET V V +AFGC
Sbjct: 126 CLPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGT 185
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKF--GRDADV 267
DN G + +G +G LSL +QL G FSYCL +T F G A++
Sbjct: 186 DNGGDSL--NSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSTMDSPFFLGTLAEL 240
Query: 268 R--RRDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+++TP+L S L P ++++L IS+G + P G FD+ DG GG ++D+GT
Sbjct: 241 APGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTT 300
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EA 383
T + ++ ++ R Q+L + P NAS C+ P + H A
Sbjct: 301 FTILAKSGFREVVDRVAQLL-----GQPPVNASSLDSPCFPSPDGEPFMPDLVLHFAGGA 355
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
D + +N D FC+ I P +S LG +QQQN+ +++D+ V L F +C+
Sbjct: 356 DMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDCS 415
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 122/421 (28%), Positives = 197/421 (46%), Gaps = 22/421 (5%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF+++LI SP+SP Y + ++RI S +R ++ + + F + +
Sbjct: 28 GFTVELINRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDIFTDTAQSEMIS 87
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ + Y ++ ++GTP + DT S L+WTQC+PC +C++Q P+FDP++S+TY +I
Sbjct: 88 NQGE--YLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDI 145
Query: 151 PCDDPLC---RSPFKCQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVPRL 203
C C + C N C Y+ Y T G + +T +G +P+
Sbjct: 146 SCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKA 205
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIK 260
GC ++N G +F K SGI+G P+SL SQL + I G FSYCLV +S +
Sbjct: 206 IIGCGHNNGG-SFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLN 264
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
FG + V +++TP++ D ++L L +S+G ++FP +F G IID
Sbjct: 265 FGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSE---GNIIID 321
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
+GT +T + L + + + S CY D+ K +PS+T H
Sbjct: 322 SGTTLTLFPEDFFSELSSAVQDAVAGTPVE----DPSGILSLCYSIDADLK-FPSITAHF 376
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
AD + P N F++ C A +I G Q N L+ YDL + F +
Sbjct: 377 DGADVKLNPLNT-FVQVSDTVLCFAFNPINSGAIFGNLAQMNFLVGYDLEGKTVSFKPTD 435
Query: 441 C 441
C
Sbjct: 436 C 436
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 174/357 (48%), Gaps = 24/357 (6%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
V + +GTP + ++ DT S L W Q +PC CF+Q PIFDP S+TY++I C C
Sbjct: 27 VPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSACA 86
Query: 159 SPFKCQN----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
Q C+Y Y G VTRG S+ET G + FG S N+G
Sbjct: 87 DLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAG----EEVKFGASVYNTGT 142
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA---TSVIKFGRDADVRRRD 271
GILG P+S+ SQL + + FSYCLV + A TS + FG DA V +
Sbjct: 143 FGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFG-DAAVPSGE 201
Query: 272 LETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++ TPI+ + P ++Y+ + IS+G ++ ++I G+GG IID+GT +T+++
Sbjct: 202 VQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQ 261
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRY-DSSFKAYPSMTFHLQEADYIVQ 388
+ L+ Y + R P S D C+ + +P+MT HL + ++
Sbjct: 262 EVFNALVAAYTS------QVRYPTTTSATGLDLCFNTRGTGSPVFPAMTIHL-DGVHLEL 314
Query: 389 PENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
P FI + C+A D +I G QQQN I+YDL+ + F +CA+
Sbjct: 315 PTANTFISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCAS 371
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 140/423 (33%), Positives = 197/423 (46%), Gaps = 43/423 (10%)
Query: 50 NLSQSERIHKMFEISKARANYM--ASMSKPNAFQELEDIH-----LPM-AKQDLFYSVEV 101
LS+ E + + + SKARA + A + N +D + LP+ DL Y V++
Sbjct: 49 QLSRRELVRRAVQRSKARAAALSVARLGGSNKGARQQDQNQQQPGLPVRPSGDLEYLVDL 108
Query: 102 NIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPF 161
+GTP +P L DT S L+WTQC PC C Q PIF P AS++Y + C LC
Sbjct: 109 AVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGELCNDIL 168
Query: 162 --KCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA----FGCSNDNSGF 214
CQ C Y Y G TRG+ + E F F + +L+ FGC N G
Sbjct: 169 HHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGS 228
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFG--RDA--DVRR 269
G SGI+GF +PLSL SQL R FSYCL S + FG R D
Sbjct: 229 LNNG--SGIVGFGRAPLSLVSQLAIR---RFSYCLTPYASGRKSTLLFGSLRGGVYDAAT 283
Query: 270 RDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
++TT +L S P F Y+ +++G +R P AF + DG+GG I+D+GT +T
Sbjct: 284 ATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLF 343
Query: 329 RNGPYQTLMQRYDQILRSLGRQ-RIPY--NASQEFD--YCYRYDSSF----KAYPSMTFH 379
P L +++R+ Q R+P+ N S D C+ +S P M FH
Sbjct: 344 ---PAPVLA----EVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFH 396
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-SILGAWQQQNMLIIYDLNVPALRFGS 438
LQ AD + N + +G C+ + D + +G + QQ+M ++YDL L F
Sbjct: 397 LQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLSFAP 456
Query: 439 ENC 441
C
Sbjct: 457 AQC 459
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 177/375 (47%), Gaps = 18/375 (4%)
Query: 76 KPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT 135
P+AF ++ P+ + Y + + +G+P + ++ DT S L W QC PC C+ Q
Sbjct: 19 SPDAFGS-QEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQP 77
Query: 136 TPIFDPRASTTYSEIPCDDPLCRS---PFK-CQNGKCVYTRRYHVGDVTRGLASRETFAF 191
P FDP S ++ + C D LC P K C C Y Y T G + ET +
Sbjct: 78 GPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETISL 137
Query: 192 PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR 251
G VP AFGC N G F G +G++G PLSL+SQL + FSYCLV
Sbjct: 138 NNGAGTQSVPNFAFGCGTQNLG-TFAGA-AGLVGLGQGPLSLNSQLSHTFANKFSYCLVS 195
Query: 252 -EMEATSVIKFGRDADVRRRDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDI 309
+ S + FG A +++ T I+++ P ++Y+ L I +G + P F I
Sbjct: 196 LNSLSASPLTFGSIA--AAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAI 253
Query: 310 -MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
G GG IID+GT +T + Y +++ Y+ + Y D C+
Sbjct: 254 DQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYG----LDLCFNIAG 309
Query: 369 -SFKAYPSMTFHLQEADYIVQPENMY-FIEPDRGRFCVAIQDDPKYSILGAWQQQNMLII 426
S + P M F Q AD+ ++ EN++ ++ C+A+ +SI+G QQQN L++
Sbjct: 310 VSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGGSQGFSIIGNIQQQNHLVV 369
Query: 427 YDLNVPALRFGSENC 441
YDL + F + +C
Sbjct: 370 YDLEAKKIGFATADC 384
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 118/348 (33%), Positives = 170/348 (48%), Gaps = 35/348 (10%)
Query: 114 FDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYT 171
DT S L+WTQC PC+ C DQ TP FD + S TY +PC C S C CVY
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVY- 59
Query: 172 RRYHVGDV--TRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNA 228
+Y+ GD T G+ + ETF F N +AFGC + N+G SG++GF
Sbjct: 60 -QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDL--ANSSGMVGFGR 116
Query: 229 SPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRD------LETTPILLSD 281
PLSL SQL FSYCL + AT S + FG A++ + +++TP +++
Sbjct: 117 GPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINP 173
Query: 282 LRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
P+ Y L L IS+G ++ P F I DGTGG IID+GT +T+++ Y
Sbjct: 174 ALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQ-------DAY 226
Query: 341 DQILRSLGRQRIPYNASQE----FDYCYRY---DSSFKAYPSMTFHLQEADYIVQPENMY 393
+ + R L IP A + D C+++ + P + FH A+ + PEN
Sbjct: 227 EAVRRGL-VSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYM 285
Query: 394 FIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
I G C+ + +I+G +QQQN+ ++YD+ L F C
Sbjct: 286 LIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 197/424 (46%), Gaps = 33/424 (7%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF+++LI SP+SP+Y + R+ S + + + + P+
Sbjct: 29 GFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVT----------NTVEAPI 78
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
Y +++++GTP P + DT S ++WTQC+PC C+ Q P+F+P STTY ++
Sbjct: 79 YNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKV 138
Query: 151 PCDDPLCRSPFKCQNGK------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRL 203
C P+C F ++ C Y+ Y ++G + +T +G PR
Sbjct: 139 SCSSPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRT 196
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIK 260
A GC +DN+G +F +SGI+G P SL Q+ + + G FSYCL + ++ +
Sbjct: 197 AIGCGHDNAG-SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
FG +A+V +TPI +SD FY L L +S+GR+ + + + G II
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTA--NSILGGKANIII 313
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
D+GT +T + Y + I S+ QR + +Q +YC+ + P + H
Sbjct: 314 DSGTTLTLLPVDLYHNFAK---AISNSINLQRTD-DPNQFLEYCFETTTDDYKVPFIAMH 369
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+ A+ +Q EN+ I C+A D SI G Q N L+ YD+ +L F
Sbjct: 370 FEGANLRLQRENV-LIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFK 428
Query: 438 SENC 441
NC
Sbjct: 429 PMNC 432
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 132/437 (30%), Positives = 204/437 (46%), Gaps = 43/437 (9%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYM--ASMSKPNAFQELE 84
+++ GF LKL + + S ++ + + + SKAR + A++S +
Sbjct: 23 NDNVGFQLKLTHVDAGTS------YTKPQLLSRAIARSKARVAALQSAAVSPAPVADPIT 76
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
+ + Y V++ IGTP + DT S L+WTQC PC+ C Q TP FD + S
Sbjct: 77 AARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRS 136
Query: 145 TTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDV--TRGLASRETFAFPVRNGFTF- 199
TY +PC C S C CVY +Y+ GD T G+ + ETF F +
Sbjct: 137 ATYRALPCRSSRCAALSSPSCFKKMCVY--QYYYGDTASTAGVLANETFTFGAASSTKVR 194
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SV 258
++FGC + N+G SG++GF PLSL SQL FSYCL + T S
Sbjct: 195 AANISFGCGSLNAGEL--ANSSGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSPTPSR 249
Query: 259 IKFGRDADVRRRD------LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMR 311
+ FG A++ + +++TP +++ P+ Y L + IS+G + P F I
Sbjct: 250 LYFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAIND 309
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE----FDYCYRY- 366
DGTGG IID+GT +T+++ Y+ + R L IP A + D C+++
Sbjct: 310 DGTGGVIIDSGTSITWLQQ-------DAYEAVRRGLA-STIPLPAMNDTDIGLDTCFQWP 361
Query: 367 --DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNML 424
+ P FH A+ + PEN I G C+A+ +I+G +QQQN+
Sbjct: 362 PPPNVTVTVPDFVFHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLH 421
Query: 425 IIYDLNVPALRFGSENC 441
++YD+ L F C
Sbjct: 422 LLYDIANSFLSFVPAPC 438
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 139/412 (33%), Positives = 199/412 (48%), Gaps = 38/412 (9%)
Query: 50 NLSQSERIHKMFEISKARA-NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
L+ E + +M SKARA ++S + D +P + Y V + IGTP +
Sbjct: 38 GLAARELMQRMALRSKARAARRLSSSASAPVSPGTYDNGVPTTE----YLVHLAIGTPPQ 93
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--------SP 160
P L DT S L+WTQCQPC CFDQ P FDP S+T S CD LC+ SP
Sbjct: 94 PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSP 153
Query: 161 FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI 220
N CVYT Y VT G + F F V G + VP +AFGC N+G F
Sbjct: 154 KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF-VGAGAS-VPGVAFGCGLFNNG-VFKSNE 210
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV---RRRDLETT 275
+GI GF PLSL SQL+ G FS+C V ++ ++V+ AD+ R +++T
Sbjct: 211 TGIAGFGRGPLSLPSQLK---VGNFSHCFTAVNGLKPSTVL-LDLPADLYKSGRGAVQST 266
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
P++ + P F YL L I++G + P F +++GTGG IID+GT +T + Y+
Sbjct: 267 PLIQNPANPTFYYLSLKGITVGSTRLPVPESEF-TLKNGTGGTIIDSGTAMTSLPTRVYR 325
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFD--YCYRYDSSFKAY-PSMTFHLQEADYIVQPEN 391
+ + + ++P + D +C K Y P + H + A + EN
Sbjct: 326 LVRDAFA------AQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPREN 379
Query: 392 MYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
F D G C+AI + + + +G +QQQNM ++YDL L F C
Sbjct: 380 YVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 134/401 (33%), Positives = 187/401 (46%), Gaps = 36/401 (8%)
Query: 53 QSERIHKMFEISKARANY----MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
Q + + + ISKA AN +A +S F P + + Y ++ +GTP
Sbjct: 93 QRDVLRAAWIISKAAANGTPPPVAGLSSARGFVAPVVSRAPTSGE---YIAKIAVGTPGV 149
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK-----C 163
L DTAS L W QCQPC RC+ Q+ P+FDPR ST+Y E+ + C++ +
Sbjct: 150 EALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDA 209
Query: 164 QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGI 223
+ G CVYT Y G T G ET F G +PR++ GC +DN G FG +GI
Sbjct: 210 KRGTCVYTVGYGDGSTTVGDFIEETLTF---AGGVRLPRISIGCGHDNKGL-FGAPAAGI 265
Query: 224 LGFNASPLSLSSQLRNRIQGLFSYCLVREMEA----TSVIKFGRDADVRRRDLETTPILL 279
LG +S +Q+ + G FSYCLV + +S + FG A + TP +L
Sbjct: 266 LGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPGSLSSTLTFGAGAVDTSPPVSFTPTVL 323
Query: 280 SDLRPHF-YLHLLEISIGRHIVRFPP-GAFDIMRD---GTGGFIIDTGTPVTFIRNGPYQ 334
+ P F Y+ L IS+G VR P D+ D G GG I+D+GT VT + Y
Sbjct: 324 NLNMPTFYYVRLTGISVGG--VRVPGVTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYT 381
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQ-EADYIVQPENM 392
+ + LG+ I S FD CY K P+++ H + +QP+N
Sbjct: 382 AFRDAFRAVAVDLGQVSIG-GPSGFFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKN- 439
Query: 393 YFIEPDR-GRFCVAIQ--DDPKYSILGAWQQQNMLIIYDLN 430
Y I D G C A D SI+G QQQ I+YD+
Sbjct: 440 YLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIG 480
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/402 (30%), Positives = 179/402 (44%), Gaps = 28/402 (6%)
Query: 55 ERIHKMFEI---SKARANYMASMSKPNAFQEL------EDIHLPMAKQDLFYSVEVNIGT 105
R H + ++ ARA Y+AS P A+Q + + + Y V V IG+
Sbjct: 76 SRRHAVLDLVARDNARAEYLASRLSPAAYQPTGFSGSESKVVSGLDEGSGEYFVRVGIGS 135
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC 163
P Q+L+ D+ S ++W QC+PC+ C+ Q P+FDP S T+S +PC +CR+ C
Sbjct: 136 PPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCRTLRTSGC 195
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
+G C Y Y G T+G + ET G T V +A GC + N G G +G
Sbjct: 196 GDSGGCDYEVSYGDGSYTKGALALETLTL----GGTAVEGVAIGCGHRNRGLFVG--AAG 249
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDL 282
+LG P+SL QL G FSYCL + V+ GR V + P++ +
Sbjct: 250 LLGLGWGPMSLVGQLGGAAGGAFSYCLASRGAGSLVL--GRSEAVPEGAVW-VPLVRNPQ 306
Query: 283 RPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
P F Y+ L I +G + F + DG GG ++DTGT VT + Y L +
Sbjct: 307 APSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFV 366
Query: 342 QILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRG 400
+ +L R P D CY P+++F+ A + P +E D G
Sbjct: 367 AAVGAL--PRAP--GVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEVDGG 422
Query: 401 RFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+C+A SILG QQ+ + I D + FG C
Sbjct: 423 IYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 138/412 (33%), Positives = 195/412 (47%), Gaps = 41/412 (9%)
Query: 50 NLSQSERIHKMFEISKARANYMASMS-----KPNAFQELEDIHLPMAKQDLFYSVEVNIG 104
LS E + +M SKARA + S S P A+ D +PM + Y + + IG
Sbjct: 47 GLSGRELMRRMALRSKARAPRLLSSSATAPVSPGAY----DDGVPMTE----YLLHLAIG 98
Query: 105 TPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPF 161
TP +P L DT S LVWTQCQPC CF+Q+ P +D S+T++ CD C+ S
Sbjct: 99 TPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT 158
Query: 162 KCQN---GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
C N C Y+ Y T G ET +F VP + FGC +N+G F
Sbjct: 159 MCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGAS---VPGVVFGCGLNNTGI-FRS 214
Query: 219 KISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADV---RRRDLET 274
+GI GF PLSL SQL+ G FS+C S + F AD+ R ++T
Sbjct: 215 NETGIAGFGRGPLSLPSQLK---VGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQT 271
Query: 275 TPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TP++ + P F YL L I++G + P AF +++GTGG IID+GT T + P
Sbjct: 272 TPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAF-ALKNGTGGTIIDSGTAFTSL---PP 327
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPSMTFHLQEADYIVQPEN 391
+ +D+ + +P N + C+ KA P + H + A + EN
Sbjct: 328 RVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGATMHLPREN 386
Query: 392 MYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
F D G C+AI + + +I+G +QQQNM ++YDL L F C
Sbjct: 387 YVFEAKDGGNCSICLAIIEG-EMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 139/412 (33%), Positives = 199/412 (48%), Gaps = 38/412 (9%)
Query: 50 NLSQSERIHKMFEISKARA-NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
L+ E + +M SKARA ++S + D +P + Y V + IGTP +
Sbjct: 38 GLAARELMQRMALRSKARAARRLSSSASAPVSPGTYDNGVPTTE----YLVHLAIGTPPQ 93
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--------SP 160
P L DT S L+WTQCQPC CFDQ P FDP S+T S CD LC+ SP
Sbjct: 94 PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSP 153
Query: 161 FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI 220
N CVYT Y VT G + F F V G + VP +AFGC N+G F
Sbjct: 154 KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF-VGAGAS-VPGVAFGCGLFNNG-VFKSNE 210
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV---RRRDLETT 275
+GI GF PLSL SQL+ G FS+C V ++ ++V+ AD+ R +++T
Sbjct: 211 TGIAGFGRGPLSLPSQLK---VGNFSHCFTAVNGLKPSTVL-LDLPADLYKSGRGAVQST 266
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
P++ + P F YL L I++G + P F +++GTGG IID+GT +T + Y+
Sbjct: 267 PLIQNPANPTFYYLSLKGITVGSTRLPVPESEF-ALKNGTGGTIIDSGTAMTSLPTRVYR 325
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFD--YCYRYDSSFKAY-PSMTFHLQEADYIVQPEN 391
+ + + ++P + D +C K Y P + H + A + EN
Sbjct: 326 LVRDAFA------AQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPREN 379
Query: 392 MYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
F D G C+AI + + + +G +QQQNM ++YDL L F C
Sbjct: 380 YVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 128/429 (29%), Positives = 208/429 (48%), Gaps = 26/429 (6%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI 86
S++ GFS++LI S SP Y +Q +RI + S RA+Y+ + + +
Sbjct: 22 SQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKPT 81
Query: 87 HLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTT 146
+P A +Y + +IGTP + + DT S +W QC+PC C +QT+PIF+P S+T
Sbjct: 82 IIPYAGS--YYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSST 139
Query: 147 YSEIPCDDPLCR--SPFKCQNG---KCVYTRRYHVGDVTRGLASRETFAFPVRNG--FTF 199
Y I C P+C+ +C + KC Y Y ++G S++T +G +F
Sbjct: 140 YKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISF 199
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEAT 256
P++ GC + NS G SGI+GF S+ SQL + I G FSYCL + +
Sbjct: 200 -PKIVIGCGHKNS-LTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANIS 257
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
S + FG A V + +TP++ S +++ +L S+G HI++ + ++ D G
Sbjct: 258 SKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSS--LIPDNEGN 315
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY--P 374
+ID+G+ +T + N Y L +++ L R + P +Q+ CY+ ++ K Y P
Sbjct: 316 AVIDSGSTITQLPNDVYSQLETAVISMVK-LKRVKDP---TQQLSLCYK--TTLKKYEVP 369
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLNVPA 433
+T H + AD + N FI+ + C A + + G QQN L+ YD
Sbjct: 370 IITAHFRGADVKLNAFNT-FIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNI 428
Query: 434 LRFGSENCA 442
+ F NC
Sbjct: 429 ISFKPTNCT 437
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 200/433 (46%), Gaps = 50/433 (11%)
Query: 39 IFSPESPLYPGN----LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQD 94
+ SPE+ P + L++ E +H+M A + S S A ++ D
Sbjct: 359 MLSPEAARPPRDGGRSLTRREVLHRM------AARLLFSASGRAASARVDPGPYANGVPD 412
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
Y V + IGTP +P L+ DT S LVWTQC+PC CF + DP S+T+ +PC
Sbjct: 413 TEYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSS 472
Query: 155 PLCRSPFKCQNGK-------CVYTRRYHVGDVTRGLASRETFAFPVRNGF--TFVPRLAF 205
P+C + GK CVY Y G +T G ETF F +G VP LAF
Sbjct: 473 PVCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAF 532
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVI---- 259
GC N+G F +GI GF LSL SQL+ FS+C + E +SV+
Sbjct: 533 GCGLFNNGI-FTSNETGIAGFGRGALSLPSQLK---VDNFSHCFTAITGSEPSSVLLGLP 588
Query: 260 -KFGRDADVRRRDLETTPIL--LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
DAD +++TP++ S LR +YL L I++G + P F + +DGTGG
Sbjct: 589 ANLYSDAD---GAVQSTPLVQNFSSLRA-YYLSLKGITVGSTRLPIPESTFALKQDGTGG 644
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN---ASQEFDYCYRYDSSFKA- 372
IID+GT +T + Y+ + + + R+P + +S C+ + +A
Sbjct: 645 TIIDSGTGMTTLPQDAYKLVHDAFT------AQVRLPVDNATSSSLSRLCFSFSVPRRAK 698
Query: 373 --YPSMTFHLQEADYIVQPENMYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYD 428
P + H + A + EN F D G C+AI +I+G +QQQN+ ++YD
Sbjct: 699 PDVPKLVLHFEGATLDLPRENYMFEFEDAGGSVTCLAINAGDDLTIIGNYQQQNLHVLYD 758
Query: 429 LNVPALRFGSENC 441
L L F C
Sbjct: 759 LVRNMLSFVPAQC 771
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 137/412 (33%), Positives = 195/412 (47%), Gaps = 41/412 (9%)
Query: 50 NLSQSERIHKMFEISKARANYMASMS-----KPNAFQELEDIHLPMAKQDLFYSVEVNIG 104
LS E + +M SKARA + S S P A+ D +PM + Y + + IG
Sbjct: 47 GLSGRELMRRMALRSKARAPRLLSSSATAPVSPGAY----DDGVPMTE----YLLHLAIG 98
Query: 105 TPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPF 161
TP +P L DT S LVWTQCQPC CF+Q+ P +D S+T++ CD C+ S
Sbjct: 99 TPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT 158
Query: 162 KCQN---GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
C N C ++ Y T G ET +F VP + FGC +N+G F
Sbjct: 159 MCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAGAS---VPGVVFGCGLNNTGI-FRS 214
Query: 219 KISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADV---RRRDLET 274
+GI GF PLSL SQL+ G FS+C S + F AD+ R ++T
Sbjct: 215 NETGIAGFGRGPLSLPSQLK---VGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQT 271
Query: 275 TPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TP++ + P F YL L I++G + P AF +++GTGG IID+GT T + P
Sbjct: 272 TPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAF-ALKNGTGGTIIDSGTAFTSL---PP 327
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPSMTFHLQEADYIVQPEN 391
+ +D+ + +P N + C+ KA P + H + A + EN
Sbjct: 328 RVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGATMHLPREN 386
Query: 392 MYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
F D G C+AI + + +I+G +QQQNM ++YDL L F C
Sbjct: 387 YVFEAKDGGNCSICLAIIEG-EMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 196/424 (46%), Gaps = 33/424 (7%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF+++LI SP+SP+Y + R+ S + + + + P+
Sbjct: 29 GFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVT----------NTVEAPI 78
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
Y +++++GTP P + DT S ++WTQC PC C+ Q P+F+P STTY ++
Sbjct: 79 YNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKV 138
Query: 151 PCDDPLCRSPFKCQNGK------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRL 203
C P+C F ++ C Y+ Y ++G + +T +G PR
Sbjct: 139 SCSSPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRT 196
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIK 260
A GC +DN+G +F +SGI+G P SL Q+ + + G FSYCL + ++ +
Sbjct: 197 AIGCGHDNAG-SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
FG +A+V +TPI +SD FY L L +S+GR+ + + + G II
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTA--NSILGGKANIII 313
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
D+GT +T + Y + I S+ QR + +Q +YC+ + P + H
Sbjct: 314 DSGTTLTLLPVDLYHNFAK---AISNSINLQRTD-DPNQFLEYCFETTTDDYKVPFIAMH 369
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+ A+ +Q EN+ I C+A D SI G Q N L+ YD+ +L F
Sbjct: 370 FEGANLRLQRENV-LIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFK 428
Query: 438 SENC 441
NC
Sbjct: 429 PMNC 432
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 123/365 (33%), Positives = 177/365 (48%), Gaps = 32/365 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD--- 153
Y V + IGTP +P L DT S L+WTQC+PC+ CFDQ P FD S+T + +PC+
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCESTQ 94
Query: 154 ---DPLCRSPFKCQNG--KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
DP K C Y Y VT GL + + F F T +P + FGC
Sbjct: 95 CKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAG---TSLPGVTFGCG 151
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADV 267
+N+G F +GI GF PLSL SQL+ G FS+C A S + AD+
Sbjct: 152 LNNTG-VFNSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADL 207
Query: 268 ---RRRDLETTPIL---LSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
+ ++TTP++ ++ P +YL L I++G + P AF + +GTGG IID
Sbjct: 208 FSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAF-ALTNGTGGTIID 266
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFH 379
+GT +T + YQ + D+ + +P NA+ + C+ S K P + H
Sbjct: 267 SGTSITSLPPQVYQVVR---DEFAAQIKLPVVPGNATGHYT-CFSAPSQAKPDVPKLVLH 322
Query: 380 LQEADYIVQPENMYFIEPDRGR---FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
+ A + EN F PD C+AI + +I+G +QQQNM ++YDL L F
Sbjct: 323 FEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSF 382
Query: 437 GSENC 441
+ C
Sbjct: 383 VAAQC 387
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 126/430 (29%), Positives = 206/430 (47%), Gaps = 45/430 (10%)
Query: 43 ESPLYPGNLSQSERIHKMFEISKARANYMAS-----MSKPNAFQELEDIHL-PMAKQDLF 96
+ P +LS+ + + SK RA ++ + +S D+ L P++ Q
Sbjct: 33 DHPYAGSSLSRHDVVRHGARASKTRAAWLTAKLAGVLSNRRGGVSPADVRLSPLSDQG-- 90
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ----PCIRCFDQTTPIFDPRASTTYSEIPC 152
+S+ V IGTP +P+ L+ DT S L+WTQC+ + + P++DP S+T++ +PC
Sbjct: 91 HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPC 150
Query: 153 DDPLCRS---PFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
D LC+ FK +CVY Y LAS ETF F R + RL FGC
Sbjct: 151 SDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAVGVLAS-ETFTFGARRAVSL--RLGFGC 207
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDAD 266
++G G +GILG + LSL +QL +IQ FSYCL + TS + FG AD
Sbjct: 208 GALSAGSLIGA--TGILGLSPESLSLITQL--KIQ-RFSYCLTPFADKKTSPLLFGAMAD 262
Query: 267 VRR----RDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ R R ++TT I+ + ++ ++Y+ L+ IS+G + P + + DG GG I+D+
Sbjct: 263 LSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDS 322
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-------YP 374
G+ V ++ ++ + + ++ R + ++++ C+ A P
Sbjct: 323 GSTVAYLVEAAFEAVKEAVMDVV----RLPVANRTVEDYELCFVLPRRTAAAAMEAVQVP 378
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNV 431
+ H +V P + YF EP G C+A+ D SI+G QQQNM +++D+
Sbjct: 379 PLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQH 438
Query: 432 PALRFGSENC 441
F C
Sbjct: 439 HKFSFAPTQC 448
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 137/425 (32%), Positives = 202/425 (47%), Gaps = 31/425 (7%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GFS+ LI SP SP Y +L+ SERI S +R N ++ N E + +P
Sbjct: 31 GFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRSSSRLNRVSHFLDENNLPE--SLLIPE 88
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ Y + + IGTP + + DT S L+W QC PC CF Q TP+F+P S+T+
Sbjct: 89 NGE---YLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAA 145
Query: 151 PCDDPLCRS--PFKCQNGK---CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV--PRL 203
CD C S P + Q GK C+Y+ Y T G+ ET +F V P
Sbjct: 146 TCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSS 205
Query: 204 AFGCS-NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKF 261
FGC +N F K++G++G PLSL SQL +I FSYCL+ +TS +KF
Sbjct: 206 IFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPFSSNSTSKLKF 265
Query: 262 GRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
G +A V + +TP+++ L P FY L+L ++IG+ +V P G D G IID
Sbjct: 266 GSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVV--PTGRTD------GNIIID 317
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
+GT +T++ Y + ++L Q +P+ F +C+ Y P + F
Sbjct: 318 SGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPF----PFKFCFPYRD--MTIPVIAFQF 371
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
A +QP+N+ DR C+A+ SI G Q + ++YDL + F
Sbjct: 372 TGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAP 431
Query: 439 ENCAN 443
+C
Sbjct: 432 TDCTK 436
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 138/448 (30%), Positives = 216/448 (48%), Gaps = 36/448 (8%)
Query: 11 AFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANY 70
AF+S S LF T S S GF++ LI SP SP Y +L+ S+RI S +R N
Sbjct: 10 AFYS-VSSLFSTEANESPS-GFTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSISRLNR 67
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
++++ N +L L + + Y + IGTP + DT S L+W QC PC
Sbjct: 68 VSNLLDQN--NKLPQSVLILHNGE--YLMRFYIGTPPVERLATADTGSDLIWVQCSPCAS 123
Query: 131 CFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQ-----NGKCVYTRRYHVGD---VTRG 182
CF Q+TP+F P S+T+ C C Q +G+C+YT +Y GD + G
Sbjct: 124 CFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSGECIYTYKY--GDQYSFSEG 181
Query: 183 LASRETFAFPVRNGFTFV--PRLAFGCSNDNSGFAFGG-KISGILGFNASPLSLSSQLRN 239
L S ET F + G V P FGC N+ F K++GI+G A PLSL SQ+ +
Sbjct: 182 LLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGD 241
Query: 240 RIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGR 297
+I FSYCL+ +TS +KFG ++ + + +TP+++ P +Y L+L +++ +
Sbjct: 242 QIGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQ 301
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
V P G+ D G IID+GT +T++ Y + SL + + +
Sbjct: 302 KTV--PTGSTD------GNVIIDSGTLLTYLGESFYYNFAASLQE---SLAVELV-QDVL 349
Query: 358 QEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSIL 415
+C+ Y +F +P + F A ++P N++ + DR C+ I SI
Sbjct: 350 SPLPFCFPYRDNF-VFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIF 408
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENCAN 443
G++ Q + + YDL + F +C+
Sbjct: 409 GSFSQIDFQVEYDLEGKKVSFQPTDCSK 436
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 127/424 (29%), Positives = 200/424 (47%), Gaps = 23/424 (5%)
Query: 30 TGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLP 89
+GFS+ LI SP SP Y +L+ SERI S AR+ +S+ N + I +P
Sbjct: 27 SGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSFARSKRRLRLSQ-NDDRSPGTITIP 85
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+ Y + IGTP + + DT S L+W QC PC +C Q P+FDPR S+T+
Sbjct: 86 -DEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKT 144
Query: 150 IPCDDPLCR----SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+PCD C S C ++G+C Y Y + G+ E+ F +N P+L
Sbjct: 145 VPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKL 204
Query: 204 AFGCS-NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKF 261
FGC+ ++N + G++G PLSL SQL +I FSYC +TS ++F
Sbjct: 205 TFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRF 264
Query: 262 GRDADVRR-RDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
G DA V++ + + +TP+++ + P ++YL+L +SIG V+ D G +I
Sbjct: 265 GNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTD------GNILI 318
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
D+GT T ++ Y + ++ + +IP +++C+ K +P + F
Sbjct: 319 DSGTSFTILKQSFYNKFVALVKEVY-GVEAVKIP---PLVYNFCFENKGKRKRFPDVVFL 374
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAI-QDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
A V N++ E + VA+ D SI G Q + YDL + F
Sbjct: 375 FTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAP 434
Query: 439 ENCA 442
+CA
Sbjct: 435 ADCA 438
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 131/428 (30%), Positives = 201/428 (46%), Gaps = 33/428 (7%)
Query: 25 TSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELE 84
T + + GFS KLI SP SP Y N ++ +++ K + ++
Sbjct: 23 TEAYNKGFSFKLIHKNSPNSPFYKSNNFHKNKLRSFYQVPKKSFVQKSPYTR-------- 74
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
+ + Y +++ +G+P + L DT S LVW QC PC C+ Q +P+F+P S
Sbjct: 75 -----VTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRS 129
Query: 145 TTYSEIPCDDPLCR-SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVP 201
TYS IPC+ C + C K C Y+ Y VT+G+ +RE F +G V
Sbjct: 130 KTYSPIPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVG 189
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL-FSYCLV---REMEATS 257
+ FGC + NSG F GI+G PLSL SQ+ FS CLV + +
Sbjct: 190 DIIFGCGHSNSG-TFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSG 248
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
I FG ++DV + TTP+ + + + + L IS+G VRF + + + G
Sbjct: 249 TINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRF--NSSETLSKGN--I 304
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT 377
+ID+GTP T+I Y+ L++ L + P +Q CYR +++ + P +T
Sbjct: 305 MIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQ---LCYRSETNLEG-PILT 360
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALR 435
H + AD + P FI P G FC A+ D Y I G + Q N+L+ +DL+ +
Sbjct: 361 AHFEGADVQLLPIQT-FIPPKDGVFCFAMAGSTDGDY-IFGNFAQSNILMGFDLDRKTIS 418
Query: 436 FGSENCAN 443
F +C N
Sbjct: 419 FKPTDCTN 426
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/364 (31%), Positives = 166/364 (45%), Gaps = 30/364 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +GTP + ++ DT S L W QC PC C+ Q +F P ST+++++ C L
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62
Query: 157 CRS-PF-KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFGCSNDNSG 213
C P+ C CVY Y G ++ G +T NG VP AFGC +DN G
Sbjct: 63 CNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEG 122
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME---ATSVIKFGRDA----- 265
+F G GILG PLS SQL+ G FSYCLV + TS + FG A
Sbjct: 123 -SFAGA-DGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFP 180
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
V+ L T P + ++Y+ L IS+G ++ AFDI G G I D+GT V
Sbjct: 181 GVKYISLLTNP----KVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTV 236
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPY----NASQEFDYCYR--YDSSFKAYPSMTFH 379
T Q + + ++L ++ + Y + S D C + PSMTFH
Sbjct: 237 T-------QLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFH 289
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
+ D + P N + +C ++ P +I+G+ QQQN + YD + F +
Sbjct: 290 FEGGDMELPPSNYFIFLESSQSYCFSMVSSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPK 349
Query: 440 NCAN 443
+C
Sbjct: 350 SCVG 353
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 134/398 (33%), Positives = 189/398 (47%), Gaps = 41/398 (10%)
Query: 64 SKARANYMASMS-----KPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTAS 118
SKARA + S S P A+ D +PM + Y + + IGTP +P L DT S
Sbjct: 5 SKARAPRLLSSSATAPVSPGAY----DDGVPMTE----YLLHLAIGTPPQPVQLTLDTGS 56
Query: 119 SLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQN---GKCVYTR 172
LVWTQCQPC CF+Q+ P +D S+T++ CD C+ S C N C Y+
Sbjct: 57 VLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSY 116
Query: 173 RYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
Y T G ET +F VP + FGC +N+G F +GI GF PLS
Sbjct: 117 SYGDKSATIGFLDVETVSFVAGAS---VPGVVFGCGLNNTGI-FRSNETGIAGFGRGPLS 172
Query: 233 LSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADV---RRRDLETTPILLSDLRPHF-Y 287
L SQL+ G FS+C S + F AD+ R ++TTP++ + P F Y
Sbjct: 173 LPSQLK---VGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYY 229
Query: 288 LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSL 347
L L I++G + P AF +++GTGG IID+GT T + P + +D+ +
Sbjct: 230 LSLKGITVGSTRLPVPESAF-ALKNGTGGTIIDSGTAFTSL---PPRVYRLVHDEFAAHV 285
Query: 348 GRQRIPYNASQEFDYCYRYDSSFKA--YPSMTFHLQEADYIVQPENMYFIEPDRGR--FC 403
+P N + C+ KA P + H + A + EN F D G C
Sbjct: 286 KLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFEAKDGGNCSIC 344
Query: 404 VAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+AI + + +I+G +QQQNM ++YDL L F C
Sbjct: 345 LAIIEG-EMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 178/405 (43%), Gaps = 32/405 (7%)
Query: 58 HKMFEI---SKARANYMASMSKP-----NAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
H + ++ ARA Y+AS P + F + + + Y V V IG+P
Sbjct: 78 HAVLDLVSRDNARAEYLASRLSPAYQPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTE 137
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQN-G 166
Q+L+ D+ S ++W QC+PC+ C+ Q P+FDP +S T+S + C +CR+ C + G
Sbjct: 138 QYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDSG 197
Query: 167 KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGF 226
C Y Y G T+G + ET G T V +A GC + N G G +G+LG
Sbjct: 198 GCEYEVSYGDGSYTKGTLALETLTL----GGTAVEGVAIGCGHRNRGLFVG--AAGLLGL 251
Query: 227 NASPLSLSSQLRNRIQGLFSYCLVRE-------MEATSVIKFGRDADVRRRDLETTPILL 279
P+SL QL G FSYCL +A + GR V + P++
Sbjct: 252 GWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPEGAVW-VPLVR 310
Query: 280 SDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQ 338
+ P F Y+ + I +G + G F + DG GG ++DTGT VT + Y L
Sbjct: 311 NPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRD 370
Query: 339 RYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEP 397
+ + +L R P D CY P+++F+ A + P +E
Sbjct: 371 AFVGAVGAL--PRAP--GVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEV 426
Query: 398 DRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D G +C+A SILG QQ+ + I D + FG C
Sbjct: 427 DGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGFGPATC 471
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 159/354 (44%), Gaps = 19/354 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +G+P Q+L+ D+ S ++W QC+PC +C+ QT P+FDP AS+++S + C +
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 157 CRS------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
CR+ GKC Y+ Y G T+G + ET G T V +A GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRR 269
NSG G +G+LG +SL QL G+FSYCL R + GR V
Sbjct: 246 NSGLFVG--AAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPV 303
Query: 270 RDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
+ + + +Y+ L I +G + G F + DG GG ++DTGT VT +
Sbjct: 304 GAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTAVTRLP 363
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQ 388
Y L +D + +L R A D CY P+++F+ + +
Sbjct: 364 REAYAALRGAFDGAMGALPRSP----AVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTL 419
Query: 389 PENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P +E FC+A SILG QQ+ + I D + FG C
Sbjct: 420 PARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 126/426 (29%), Positives = 193/426 (45%), Gaps = 35/426 (8%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEIS-KARANYMASMSKPNAFQELEDIHLP 89
GF++ LI SP+SP Y + S+R+ S ++ + + PN+ Q
Sbjct: 25 GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSF------ 78
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+ Y + ++IGTP P + DT S L+WTQC PC C+ QT+P+FDP+ S+TY +
Sbjct: 79 ITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRK 138
Query: 150 IPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF------PVRNGFTF 199
+ C CR + C YT Y T+G + +T PV
Sbjct: 139 VSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVS----- 193
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEAT 256
+ + GC ++N+G F SGI+G SL SQLR I G FSYCLV E T
Sbjct: 194 LRNMIIGCGHENTG-TFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLT 252
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
S I FG + V + +T ++ D +++L+L IS+G ++F F G G
Sbjct: 253 SKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGT---GEGN 309
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
+ID+GT +T + + Y L + ++ S + + CYR SSFK P +
Sbjct: 310 IVIDSGTTLTLLPSNFYYEL----ESVVASTIKAERVQDPDGILSLCYRDSSSFKV-PDI 364
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
T H + D + N F+ C A + + +I G Q N L+ YD + F
Sbjct: 365 TVHFKGGDVKLGNLNT-FVAVSEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSF 423
Query: 437 GSENCA 442
+C+
Sbjct: 424 KKTDCS 429
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 175/364 (48%), Gaps = 25/364 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +++GTP + DT S L WTQC PC CF Q TP++DP S+T+S++PC P
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASP 155
Query: 156 LCR---SPFK-CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR----LAFGC 207
LC+ S F+ C CVY RY VG T G + +T A +G +AFGC
Sbjct: 156 LCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFGC 214
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDAD 266
S N G G SGI+G S LSL SQ+ G FSYCL + +A S I FG A+
Sbjct: 215 STANGGDMDGA--SGIVGLGRSALSLLSQIG---VGRFSYCLRSDADAGASPILFGALAN 269
Query: 267 VRRRDLETTPILLSDL-----RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
V +++T +L + + P++Y++L I++G + F G GG I+D+
Sbjct: 270 VTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDS 329
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
GT T++ Y L Q + + ++ G A +FD C+ ++ P + F
Sbjct: 330 GTTFTYLAEAGYTMLRQAF--LSQTAGLLTRVSGAQFDFDLCFEAGAADTPVPRLVFRFA 387
Query: 382 EADYIVQPENMYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
P YF D G C+ + S++G Q ++ ++YDL+ F
Sbjct: 388 GGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATFSFAPA 447
Query: 440 NCAN 443
+CA+
Sbjct: 448 DCAS 451
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 176/362 (48%), Gaps = 26/362 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +E++IGTP + + DT S L WT C PC C+ Q P+FDP+ STTY I CD L
Sbjct: 72 YLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKL 131
Query: 157 CR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCS 208
C SP K +C YT Y +TRG+ ++ET G + + + FGC
Sbjct: 132 CHKLDTGVCSPQK----RCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCG 187
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL-FSYCLV---REMEATSVIKFGRD 264
++N+G F GI+G P+SL SQ+ + G FS CLV ++ +S + FG+
Sbjct: 188 HNNTG-GFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKG 246
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+ V + + +TP++ + +++ LL IS+ + F + ++ + G +D+GTP
Sbjct: 247 SKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEK---GNMFLDSGTP 303
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEAD 384
T + P Q Q Q+ + + + + CYR ++ + P +T H + AD
Sbjct: 304 PTIL---PTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLRG-PVLTAHFEGAD 359
Query: 385 YIVQPENMYFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ P FI P G FC+ + + G + Q N LI +DL+ + F ++C
Sbjct: 360 VKLSPTQT-FISPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDCTK 418
Query: 444 GR 445
+
Sbjct: 419 HK 420
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 128/372 (34%), Positives = 182/372 (48%), Gaps = 36/372 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP + + DT S LVWTQC PC RCF Q +P+++P +S T+ +PC
Sbjct: 97 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 156
Query: 156 L--CRSPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFG 206
L C + + C Y + Y G T GL ETF F VP +AFG
Sbjct: 157 LNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFG 215
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFG-- 262
CSN +S G LG SL SQL G+FSYCL ++ ++ S + G
Sbjct: 216 CSNASSDDWNGSAGLVGLGRGGL--SLVSQL---AAGMFSYCLTPFQDTKSKSTLLLGPA 270
Query: 263 -RDADVRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
A + + +TP + S +P ++YL+L IS+G + PPGAF + DGTGG
Sbjct: 271 AAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGL 330
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSSF---KAY 373
IID+GT +T + + Y +R +RSL + + ++ D C+ SS
Sbjct: 331 IIDSGTTITSLVDAAY----KRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 386
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNV 431
PSMT H +V P Y I D G +C+A+ Q D + S LG +QQQN+ I+YD+
Sbjct: 387 PSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQK 445
Query: 432 PALRFGSENCAN 443
L F C+
Sbjct: 446 ETLSFAPAKCST 457
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 169/355 (47%), Gaps = 27/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +G+P + +++ DT S + W QCQPC C+ Q+ P+FDP ST+Y+ + CD+P
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPR 222
Query: 157 CR--SPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C C+N G C+Y Y G T G + ET V +A GC +DN
Sbjct: 223 CHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGCGHDNE 279
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRD 271
G +G+L PLS SQ+ FSYCLV R+ ++S ++FG AD
Sbjct: 280 GLFV--GAAGLLALGGGPLSFPSQISATT---FSYCLVDRDSPSSSTLQFGDAADAE--- 331
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
T P++ S F Y+ L IS+G I+ PP AF + G GG I+D+GT VT +++
Sbjct: 332 -VTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTAVTRLQS 390
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRY-DSSFKAYPSMTFHLQEADYIVQ 388
Y L D +R G Q +P + FD CY D + P+++ +
Sbjct: 391 SAYAALR---DAFVR--GTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRL 445
Query: 389 PENMYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G +C+A + SI+G QQQ + +D + F S C
Sbjct: 446 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 128/372 (34%), Positives = 182/372 (48%), Gaps = 36/372 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP + + DT S LVWTQC PC RCF Q +P+++P +S T+ +PC
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 156 L--CRSPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFG 206
L C + + C Y + Y G T GL ETF F VP +AFG
Sbjct: 152 LNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFG 210
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFG-- 262
CSN +S G LG SL SQL G+FSYCL ++ ++ S + G
Sbjct: 211 CSNASSDDWNGSAGLVGLGRGGL--SLVSQL---AAGMFSYCLTPFQDTKSKSTLLLGPA 265
Query: 263 -RDADVRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
A + + +TP + S +P ++YL+L IS+G + PPGAF + DGTGG
Sbjct: 266 AAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGL 325
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSSF---KAY 373
IID+GT +T + + Y +R +RSL + + ++ D C+ SS
Sbjct: 326 IIDSGTTITSLVDAAY----KRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 381
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNV 431
PSMT H +V P Y I D G +C+A+ Q D + S LG +QQQN+ I+YD+
Sbjct: 382 PSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQK 440
Query: 432 PALRFGSENCAN 443
L F C+
Sbjct: 441 ETLSFAPAKCST 452
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 137/463 (29%), Positives = 204/463 (44%), Gaps = 58/463 (12%)
Query: 16 FSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMS 75
F + L F+ +ES L + S ++ E + +M SKAR + S +
Sbjct: 20 FPCVLLLTFSLAESAALRADLTHVDSGR------GFTKHELLRRMVARSKARLASLRSSA 73
Query: 76 KPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLF--DTASSLVWTQCQPCIRCFD 133
A D H Y + + IGTP +PQ ++ DT S LVWTQC C CFD
Sbjct: 74 CDTALTAPVD-HGGSDVGSSEYLIHLGIGTP-RPQRVVLHLDTGSDLVWTQCA-CTVCFD 130
Query: 134 QTTPIFDPRASTTYSEIPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASR 186
Q P+F S T+S +PC DPLC S ++ C Y Y +T G +
Sbjct: 131 QPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAE 190
Query: 187 ETFAF--PVR-NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG 243
+TF F P R + VP + FGC N G F SGI GF PLSL SQL+ R
Sbjct: 191 DTFTFKAPDRADTAAAVPNIRFGCGMMNYGL-FTPNQSGIAGFGTGPLSLPSQLKVR--- 246
Query: 244 LFSYCL--VREMEATSVIKFGRDADVRRRD---LETTPILLSDL------RPHFYLHLLE 292
FSYC + E + VI G ++ +++TP +P ++L L
Sbjct: 247 RFSYCFTAMEESRVSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRG 306
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
+++G + F F + DG+GG ID+GT +TF +++L + + ++
Sbjct: 307 VTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAF--------VAQV 358
Query: 353 PYNASQEFD-----YCYRYDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRG----- 400
P ++ + C+ + K A P + HL+ AD+ + EN D G
Sbjct: 359 PLPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLEGADWELPRENYVLDNDDDGSGAGR 418
Query: 401 RFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ CV I + +I+G +QQQNM I+YDL + F C
Sbjct: 419 KLCVVILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPARC 461
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 136/441 (30%), Positives = 195/441 (44%), Gaps = 45/441 (10%)
Query: 14 SYFSVLFLTHF------TSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKAR 67
S+ ++LF T F + + + GF+L+LI S +SP Y ++ ERI S R
Sbjct: 5 SFLTLLFFTIFCFIISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSINR 64
Query: 68 ANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
N+ S + Q + Y + +IGTP DT S LVW QC+P
Sbjct: 65 VNHFYKYSLTSTPQSTVN------SDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEP 118
Query: 128 CIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRE 187
C +C+ Q TPIFDP S++Y IPC C S R DV RG S E
Sbjct: 119 CKQCYPQITPIFDPSLSSSYQNIPCLSDTCHS------------MRTTSCDV-RGYLSVE 165
Query: 188 TFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
T G++ P+ GC N+G F G SGI+G + P+SL SQL I G FS
Sbjct: 166 TLTLDSTTGYSVSFPKTMIGCGYRNTG-TFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFS 224
Query: 247 YCLVREM-EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG 305
YCL + +TS + FG A V TTPI+ D + +YL L S+G ++ F
Sbjct: 225 YCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGP 284
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
+ G +ID+GT TF+ PY + + + + + + + F CY
Sbjct: 285 TYG---GNEGNILIDSGTTFTFL---PYDVYYRFESAVAEYINLEHVE-DPNGTFKLCYN 337
Query: 366 YDSSFKAYPSMTFHLQEAD----YIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQ 421
P +T H + AD YI FI+ G C+A + +I G QQ
Sbjct: 338 VAYHGFEAPLITAHFKGADIKLYYIST-----FIKVSDGIACLAFIPS-QTAIFGNVAQQ 391
Query: 422 NMLIIYDLNVPALRFGSENCA 442
N+L+ Y+L + F +C
Sbjct: 392 NLLVGYNLVQNTVTFKPVDCT 412
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 128/372 (34%), Positives = 182/372 (48%), Gaps = 36/372 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + IGTP + + DT S LVWTQC PC RCF Q +P+++P +S T+ +PC
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 156 L--CRSPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFG 206
L C + + C Y + Y G T GL ETF F VP +AFG
Sbjct: 152 LNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFG 210
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFG-- 262
CSN +S G LG SL SQL G+FSYCL ++ ++ S + G
Sbjct: 211 CSNASSDDWNGSAGLVGLGRGGL--SLVSQL---AAGMFSYCLTPFQDTKSKSTLLLGPA 265
Query: 263 -RDADVRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
A + + +TP + S +P ++YL+L IS+G + PPGAF + DGTGG
Sbjct: 266 AAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGL 325
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSSF---KAY 373
IID+GT +T + + Y +R +RSL + + ++ D C+ SS
Sbjct: 326 IIDSGTTITSLVDAAY----KRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 381
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNV 431
PSMT H +V P Y I D G +C+A+ Q D + S LG +QQQN+ I+YD+
Sbjct: 382 PSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQK 440
Query: 432 PALRFGSENCAN 443
L F C+
Sbjct: 441 ETLSFAPAKCST 452
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 176/367 (47%), Gaps = 19/367 (5%)
Query: 89 PMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
P++ Y +E++IGTP + DT S L+W QC PC C+ Q P+FDP++S+TYS
Sbjct: 51 PVSVHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYS 110
Query: 149 EIPCDDPLCRSPFKCQ----NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-L 203
I C + C YT Y +T G+ ++ET G + +
Sbjct: 111 NIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGV 170
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLV---REMEATSVI 259
FGC ++N+G F K GI+G PLSL SQ+ + G +FS CLV TS +
Sbjct: 171 IFGCGHNNNG-VFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPM 229
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
FG+ ++V + +TP++ + FY + LL IS+ + F G+ + G +
Sbjct: 230 SFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDGS-SLEPITKGNMV 288
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTF 378
ID+GTP T + Y L++ ++ + IP + + + CYR ++ K ++T
Sbjct: 289 IDSGTPTTLLPEDFYHRLVE---EVRNKVALDPIPIDPTLGYQLCYRTPTNLKG-TTLTA 344
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDLNVPALRF 436
H + AD ++ P + FI G FC A +Y I G Q N LI +DL + F
Sbjct: 345 HFEGADVLLTPTQI-FIPVQDGIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSF 403
Query: 437 GSENCAN 443
+ +C N
Sbjct: 404 KATDCTN 410
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 134/443 (30%), Positives = 203/443 (45%), Gaps = 45/443 (10%)
Query: 16 FSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA--NYMAS 73
S++ LT S +G+ L L + S G +++E + + S+ RA Y A+
Sbjct: 7 LSLVLLTSLAVSAPSGYRLVLTHVDSK------GGYTKTELMRRAVHRSRLRALSGYDAT 60
Query: 74 MSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFD 133
+ ++ Q + Y +E+ IG P P L DT S L WTQCQPC CF
Sbjct: 61 SPRLHSVQ-------------VEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFP 107
Query: 134 QTTPIFDPRASTTYSEIPCDDPLCRSPFKCQN----GKCVYTRRYHVGDVTRGLASRETF 189
Q TP++DP AS+T+S +PC C P +N C Y Y G + G+ ET
Sbjct: 108 QDTPVYDPSASSTFSPLPCSSATCL-PIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETL 166
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL 249
+ V +AFGC DN G + +G +G LSL +QL G FSYCL
Sbjct: 167 TLGPSSAPVSVGGVAFGCGTDNGGDSL--NSTGTVGLGRGTLSLLAQLG---VGKFSYCL 221
Query: 250 VREMEAT--SVIKFGRDADVR--RRDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPP 304
+ S G A++ +++TP+L S P +++ L IS+G + P
Sbjct: 222 TDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPN 281
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
G FD+ DGTGG I+D+GT T + ++ ++ R + R LG+ P NAS C+
Sbjct: 282 GTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGR---VARVLGQP--PVNASSLDAPCF 336
Query: 365 RYDSSFKAY-PSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDP--KYSILGAWQQ 420
+ Y P + H AD + +N + FC+ I S+LG +QQ
Sbjct: 337 PAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQ 396
Query: 421 QNMLIIYDLNVPALRFGSENCAN 443
QN+ +++D V L F +C+
Sbjct: 397 QNIQMLFDTTVGQLSFLPTDCSK 419
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 126/451 (27%), Positives = 210/451 (46%), Gaps = 35/451 (7%)
Query: 13 FSYFSVLFLTHFTSSESTG----FSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
F Y S+L ++ F +S S+ +++LI SP SPLY + + S+R++ A
Sbjct: 6 FLYCSLLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLN---------A 56
Query: 69 NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC 128
++ S+S+ F D+ + Y + ++IGTP + DT S L W QC+PC
Sbjct: 57 AFLRSISRSRRFTTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPC 116
Query: 129 IRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK----CQNGKCVYTRRYHVGD--VTRG 182
+C+ Q +P+FD + S+TY CD C++ + C K + RY GD T+G
Sbjct: 117 QQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKG 176
Query: 183 LASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI 241
+ ET + +G + P FGC +N G F SGI+G PLSL SQL + I
Sbjct: 177 DVATETISIDSSSGSSVSFPGTVFGCGYNNGG-TFEETGSGIIGLGGGPLSLVSQLGSSI 235
Query: 242 QGLFSYCL---VREMEATSVIKFGRDA----DVRRRDLETTPILLSDLRPHFYLHLLEIS 294
FSYCL TSVI G ++ + TTP++ D +++L L ++
Sbjct: 236 GKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVT 295
Query: 295 IGRHIVRFPPGAFDIMRDG---TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR 351
+G+ + + G + + TG IID+GT +T + +G Y ++ + R
Sbjct: 296 VGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVS 355
Query: 352 IPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK 411
P +C++ P++T H AD + P N F++ + C+++ +
Sbjct: 356 DPQGL---LTHCFKSGDKEIGLPAITMHFTNADVKLSPINA-FVKLNEDTVCLSMIPTTE 411
Query: 412 YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+I G Q + L+ YDL + F +C+
Sbjct: 412 VAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 158/354 (44%), Gaps = 19/354 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +G+P Q+L+ D+ S ++W QC+PC +C+ QT P+FDP AS+++S + C +
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 157 CRS------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
CR+ GKC Y+ Y G T+G + ET G T V +A GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRR 269
NSG G +G+LG +SL QL G+FSYCL R + GR V
Sbjct: 246 NSGLFVGA--AGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPV 303
Query: 270 RDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
+ + + +Y+ L I +G + F + DG GG ++DTGT VT +
Sbjct: 304 GAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLP 363
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQ 388
Y L +D + +L R A D CY P+++F+ + +
Sbjct: 364 REAYAALRGAFDGAMGALPRSP----AVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTL 419
Query: 389 PENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P +E FC+A SILG QQ+ + I D + FG C
Sbjct: 420 PARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 169/355 (47%), Gaps = 27/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +G+P + +++ DT S + W QCQPC C+ Q+ P+FDP ST+Y+ + CD+P
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPR 226
Query: 157 CR--SPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C C+N G C+Y Y G T G + ET V +A GC +DN
Sbjct: 227 CHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGCGHDNE 283
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRD 271
G +G+L PLS SQ+ FSYCLV R+ ++S ++FG AD
Sbjct: 284 GLFV--GAAGLLALGGGPLSFPSQISATT---FSYCLVDRDSPSSSTLQFGDAADAE--- 335
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
T P++ S F Y+ L +S+G I+ PP AF + G GG I+D+GT VT +++
Sbjct: 336 -VTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQS 394
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRY-DSSFKAYPSMTFHLQEADYIVQ 388
Y L D +R G Q +P + FD CY D + P+++ +
Sbjct: 395 SAYAALR---DAFVR--GTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRL 449
Query: 389 PENMYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G +C+A + SI+G QQQ + +D + F + C
Sbjct: 450 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 161/358 (44%), Gaps = 25/358 (6%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCD 153
L + V V GTP + ++FDT S + W QC PC C+ Q PIFDP S TYS +PC
Sbjct: 133 LEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCG 192
Query: 154 DPLCRSP--FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
P C + KC NG C+Y Y G + G+ S ET + +P AFGC N
Sbjct: 193 HPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRA---LPGFAFGCGQTN 249
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
G G + G++G LSLSSQ G FSYCL + + G D
Sbjct: 250 LGDF--GDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPASNDD 307
Query: 272 LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++ T ++ P FY + L+ I IG +I+ PP F DGT +D+GT +T++
Sbjct: 308 VQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLF--TDDGT---FLDSGTILTYLPP 362
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY-PSMTFHLQEADYIVQP 389
Y L R+ + Q P A FD CY + + P+++F +
Sbjct: 363 EAYTALRDRFKFTM----TQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDLS 418
Query: 390 ENMYFIEPDR---GRFCVAIQDDPK---YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
I PD C+ P ++I+G QQ+N +IYD+ + F S +C
Sbjct: 419 FFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 133/461 (28%), Positives = 205/461 (44%), Gaps = 40/461 (8%)
Query: 4 VQALPLAAFFSYFSVLFLTHFTS-----SESTGFSLKLIPIFSPESPLYPGNLSQSERIH 58
++ L F +++ L HF+ ++ GF+ I SP SP Y + ++ +R+
Sbjct: 1 MEGFNLKFVFCTLAIIILIHFSEHSHAEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQ 60
Query: 59 KMFEISKARANYMASM-SKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTA 117
K F S R N+ +M + PN DI + Y + +++GTP P + DT
Sbjct: 61 KAFRRSILRGNHFRAMRASPN------DIQSDVISGGGAYLMNISLGTPPVPMLGIADTG 114
Query: 118 SSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS---PFKC-QNGKCVYTRR 173
S L+W QC PC C++Q P+FDP+ S TY + CD+ C+ C + C Y+
Sbjct: 115 SDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYS 174
Query: 174 YHVGDVTRGLASRETFAFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
Y TRG S +T G P +AFGC +DN G F K G++G PLS
Sbjct: 175 YGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGCGHDNGG-TFNEKDGGLIGLGGGPLS 233
Query: 233 LSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLH 289
L QL + + G FSYCLV + +S I FG+ V +TP++ +YL
Sbjct: 234 LVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLT 293
Query: 290 LLEISIGRHIVRF--------PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
L +S+G V F P A + G IID+GT +T + P
Sbjct: 294 LEGLSVGSETVAFKGFSENKSSPAAVE-----EGNIIIDSGTTLTLL---PQDFYTDVES 345
Query: 342 QILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR 401
+ ++G Q + + F CY ++ + P++T H AD + P N F++
Sbjct: 346 ALTNAIGGQTTT-DPNGIFSLCYSSVNNLE-IPTITAHFTGADVQLPPLNT-FVQVQEDL 402
Query: 402 FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
C ++ +I G Q N L+ YDL + F +C
Sbjct: 403 VCFSMIPSSNLAIFGNLAQINFLVGYDLKNNKVSFKQTDCT 443
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/353 (32%), Positives = 170/353 (48%), Gaps = 24/353 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + V IG P K +++ DT S + W QC+PC C+ Q PIFDP +S+++S + C P
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQ 219
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
CR+ F C+N C+Y Y G T G + ET +F V ++A GC +DN G
Sbjct: 220 CRNLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGS---VDKVAIGCGHDNEGL 276
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G++G PLSL+SQ++ FSYCLV R+ +S ++F + D
Sbjct: 277 FV--GAAGLIGLGGGPLSLTSQIK---ASSFSYCLVNRDSVDSSTLEFNS---AKPSDSV 328
Query: 274 TTPILL-SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T PI S + +Y+ + +S+G + PP F++ G GG I+D GT VT ++
Sbjct: 329 TAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQA 388
Query: 333 YQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPE 390
Y L + ++ + L P + FD CY S P++ F + P
Sbjct: 389 YNALRDTFVKLTKDL-----PSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPP 443
Query: 391 NMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ Y I D G FC+A SI+G QQQ + YDL + F S C
Sbjct: 444 SNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 158/354 (44%), Gaps = 28/354 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +G+P Q+L+ D+ S ++W QC+PC +C+ QT P+FDP AS+++S + C +
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 157 CRS------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
CR+ GKC Y+ Y G T+G + ET G T V +A GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRR 269
NSG G +G+LG +SL QL G+FSYCL R + GR V R
Sbjct: 246 NSGLFVG--AAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPR 303
Query: 270 RDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
++ +Y+ L I +G + F + DG GG ++DTGT VT +
Sbjct: 304 GRRASS---------FYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLP 354
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQ 388
Y L +D + +L R A D CY P+++F+ + +
Sbjct: 355 REAYAALRGAFDGAMGALPRS----PAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTL 410
Query: 389 PENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P +E FC+A SILG QQ+ + I D + FG C
Sbjct: 411 PARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 34/416 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
LS E +H+M SKAR+ + +S A ++ D Y V + IGTP +P
Sbjct: 66 GLSTRELLHRMAARSKARSARL--LSGRAASARVDPGSYTDGVPDTEYLVHMAIGTPPQP 123
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKC---- 163
L+ DT S L WTQC PC+ CF Q+ P F+P S T+S +PCD +CR + C
Sbjct: 124 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 183
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRN---GFTFVPRLAFGCSNDNSGFAFGGK 219
NG CVY Y +T G +TF+F + G VP L FGC N+G F
Sbjct: 184 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGI-FVSN 242
Query: 220 ISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVI-----KFGRDADVRRRDL 272
+GI GF+ LS+ +QL+ FSYC + E + V DA +
Sbjct: 243 ETGIAGFSRGALSMPAQLKVDN---FSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGV 299
Query: 273 ETTPILL---SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
+ L+ S +Y+ L +++G + P F + DGTGG I+D+GT +T +
Sbjct: 300 VQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLP 359
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQ 388
Y + + + + + + S C+ K P++ H + A +
Sbjct: 360 EAVYNLVCDAF----VAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLP 415
Query: 389 PEN-MYFIEPDRG--RFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
EN M+ IE G C+AI S++G +QQQNM ++YDL L F C
Sbjct: 416 RENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 128/425 (30%), Positives = 194/425 (45%), Gaps = 29/425 (6%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQS-ERIHKMFEISKARANYMASMSKPNAFQELEDIHL- 88
G ++L I SPL P N S + + + F+ R N + S + + + ++ L
Sbjct: 70 GVKIRLDHIHGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNN-GTYSTMSNLPLQ 128
Query: 89 PMAKQDLF-YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
P +K Y V GTP K L+ DT S + W QC+PC C+ Q PIF+P+ S++Y
Sbjct: 129 PGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSY 188
Query: 148 SEIPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
+ C C + C+ G CVY Y G ++G S+ET G P A
Sbjct: 189 KHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTL----GSDSFPSFA 244
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRD 264
FGC + N+G G +G+LG + LS SQ +++ G FSYCL + +TS F
Sbjct: 245 FGCGHTNTGLFKGS--AGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVG 302
Query: 265 ADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
P++ + P FY + L IS+G + PP G GG I+D+GT
Sbjct: 303 QGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVL-----GRGGTIVDSGT 357
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQ- 381
+T + Y L + R+L + P++ D CY S S P++TFH Q
Sbjct: 358 VITRLVPQAYDALKTSFRSKTRNLPSAK-PFSI---LDTCYDLSSYSQVRIPTITFHFQN 413
Query: 382 EADYIVQPENMYF-IEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYDLNVPALRFG 437
AD V + F I+ D + C+A + +I+G +QQQ M + +D + F
Sbjct: 414 NADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFA 473
Query: 438 SENCA 442
+CA
Sbjct: 474 PGSCA 478
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 139/452 (30%), Positives = 211/452 (46%), Gaps = 23/452 (5%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
M H +L + Y ++ FL + GFS+++I S SP Y +Q +R+
Sbjct: 4 MPHSSSLAIVLLCLYINISFLNAL---DGGGFSVEIIHRDSSRSPYYRPTETQFQRVANA 60
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
S RAN+ +KPN + Y + ++GTP + DT S +
Sbjct: 61 LRRSINRANHF---NKPNLVASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDI 117
Query: 121 VWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-----PFKCQNGKCVYTRRYH 175
+W QCQPC C++QTTPIFDP S TY +PC +C+S N +C YT Y
Sbjct: 118 IWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYG 177
Query: 176 VGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLS 234
++G S ET +G + P+ GC ++N G F + SGI+G P+SL
Sbjct: 178 DNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGCGHNNKG-TFQREGSGIVGLGGGPVSLI 236
Query: 235 SQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLL 291
SQL + I G FSYCL + ++S + FG +A V R +TPI+ + ++L L
Sbjct: 237 SQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLE 296
Query: 292 EISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR 351
S+G + + F + G G IID+GT +T + P + + ++ +R
Sbjct: 297 AFSVGDNRIEFGS-SSFESSGGEGNIIIDSGTTLTIL---PEDDYLNLESAVADAIELER 352
Query: 352 IPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP 410
+ + S+ CYR SS + P +T H + AD + P + FIE D G C A +
Sbjct: 353 VE-DPSKFLRLCYRTTSSDELNVPVITAHFKGADVELNPIST-FIEVDEGVVCFAFRSSK 410
Query: 411 KYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
I G QQN+L+ YDL + F +C
Sbjct: 411 IGPIFGNLAQQNLLVGYDLVKQTVSFKPTDCT 442
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 134/459 (29%), Positives = 204/459 (44%), Gaps = 31/459 (6%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
MA V ++ ++ F ++ S++ + + GFS LI S SPLY + +R+
Sbjct: 1 MAAVSSIYVSLFIAFISMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNS 60
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
F S +RAN KPN+ + + Y + ++IG P + DT S L
Sbjct: 61 FHRSISRANRF----KPNSISARALVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDL 116
Query: 121 VWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPFKCQN----GKCVYTR 172
+W QCQPC C+ Q +PIFDPR S++Y + C + C C C YT
Sbjct: 117 IWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTY 176
Query: 173 RYHVGDVTRGLASRETFAFPVRNGFT-----FVPRLAFGCSNDNSGFAFGGKISGILGFN 227
Y + G + E F N T + +AFGC N G F SGI+G
Sbjct: 177 SYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGG-TFDELGSGIIGLG 235
Query: 228 ASPLSLSSQLRNRIQGLFSYCLVREMEA---TSVIKFGRDADV--RRRDLETTPILLSDL 282
+SL SQL ++ G FSYCLV E TS I FG D ++ ++ +TP+L
Sbjct: 236 GGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKP 295
Query: 283 RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
++YL L IS+ R P G IID+GT +TF+ + + L ++
Sbjct: 296 ETYYYLTLEAISVENK--RLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEE 353
Query: 343 ILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRF 402
++ R P+ F+ C++ + + + P +T H AD +QP N F + +
Sbjct: 354 AVKG-ERVSDPHGL---FNICFKDEKAIE-LPIITAHFTGADVELQPVNT-FAKVEEDLL 407
Query: 403 CVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C + +I G Q N L+ YDL A+ F +C
Sbjct: 408 CFTMIPSNDIAIFGNLAQMNFLVGYDLEKKAVSFLPTDC 446
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 134/455 (29%), Positives = 208/455 (45%), Gaps = 50/455 (10%)
Query: 8 PLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKAR 67
PL A S ++ LT S S+G+ L L + S L+++E + + S+ R
Sbjct: 7 PLQALMS--CLVLLTSLAVSASSGYRLALTHVDSKI------GLTKTELMRRAAHRSRLR 58
Query: 68 A--NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
A Y A+ + ++ Q + Y +E+ IGTP P L DT S L WTQC
Sbjct: 59 ALSGYDANSPRLHSVQ-------------VEYLMELAIGTPPVPFVALADTGSDLTWTQC 105
Query: 126 QPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGK-----CVYTRRYHVGDVT 180
QPC CF Q TP++DP AS+T+S +PC C + +N C Y Y G +
Sbjct: 106 QPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAYS 165
Query: 181 RGLASRETFAF--PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR 238
G+ ET V V +AFGC DN G + +G +G LSL +QL
Sbjct: 166 AGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSL--NSTGTVGLGRGTLSLLAQLG 223
Query: 239 NRIQGLFSYCLVREMEAT--SVIKFGRDADVR--RRDLETTPILLSDLRPHFYLHLLE-I 293
G FSYCL +T S G A++ +++TP+L S L P Y+ L+ I
Sbjct: 224 ---VGKFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGI 280
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP 353
++G + P FD+ + TGG ++D+GT + + ++ ++ Q+L + P
Sbjct: 281 TLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVL-----GQPP 335
Query: 354 YNASQEFDYCYRYDSSFKAYPSM---TFHLQ-EADYIVQPENMYFIEPDRGRFCVAI-QD 408
NAS C+ + + P M H AD + +N + FC+ I
Sbjct: 336 VNASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGT 395
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+S+LG +QQQN+ +++D+ V L F +C+
Sbjct: 396 TSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDCSK 430
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 170/367 (46%), Gaps = 16/367 (4%)
Query: 84 EDIHLPMAKQDLFYSVEVN--IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
++I M D + VN +G P PQ + DT S L+W QC+PC CF Q+TPIFDP
Sbjct: 76 DEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDP 135
Query: 142 RASTTYSEIPCDDPLC-RSPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRN-GF 197
S+TY ++ D P+C SP K N +C+Y Y G + G + E F + G
Sbjct: 136 SKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGT 195
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
V + FGC + N G F G+ SGILG +A S+ S+L +R FSYC+ +
Sbjct: 196 VTVSSVVFGCGHSNRG-RFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHY 250
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
D + + +TP +Y+ L IS+G + P F G GG
Sbjct: 251 THNQLVLGDGVKMEGSSTP--FHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGV 308
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT 377
++D+GT TF+ + L +++R +Q I Y R + + +P +
Sbjct: 309 VMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 368
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY---SILGAWQQQNMLIIYDLNVPAL 434
FH E +V N F++ ++ FC+A+ + S++G QQ+ + YDL +
Sbjct: 369 FHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRV 428
Query: 435 RFGSENC 441
F +C
Sbjct: 429 YFQRTDC 435
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 122/407 (29%), Positives = 190/407 (46%), Gaps = 36/407 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELE---DIHLPMAKQDLFYSVEVNIGTP 106
+ +SE I + S AR +MA+ + +++ + D+ P+ Y +++++GTP
Sbjct: 5 GVKRSEAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTP 64
Query: 107 MKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQN 165
K + DT S LVW Q +PC C T IFDPR S+T+ E+ C LC P C+
Sbjct: 65 GKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCTELPGSCEP 122
Query: 166 GK--CVYTRRYHVGDVTRGLASRETFAFPVRNGFT-FVPRLAFGCSNDNSGFAFGGKISG 222
G C Y+ Y G+ T G +R+T + +G + P A GC NSGF + G
Sbjct: 123 GSSACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGFD---GVDG 178
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDADVRRRDLETTPIL-L 279
++G P+SL+SQL I FSYCLV +S + FG A + +++T I
Sbjct: 179 LVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPP 238
Query: 280 SDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQ 338
SD P +YL + I++ + P G IID+GT +T++ +G Y ++
Sbjct: 239 SDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTYVPSGVYGRVLS 287
Query: 339 RYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPENMYFIEP 397
R + ++ +L R +S D CY S+ +P++T L A N + +
Sbjct: 288 RMESMV-TLPRVD---GSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVD 343
Query: 398 DRG-RFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D G C+A+ SI+G QQ I+YD L F C
Sbjct: 344 DSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 122/407 (29%), Positives = 189/407 (46%), Gaps = 36/407 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELE---DIHLPMAKQDLFYSVEVNIGTP 106
+ +SE I + S AR +MA+ + +++ + D+ P+ Y +++++GTP
Sbjct: 5 GVKRSEAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTP 64
Query: 107 MKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQN 165
K + DT S LVW Q +PC C T IFDPR S+T+ E+ C LC P C+
Sbjct: 65 GKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCAELPGSCEP 122
Query: 166 GK--CVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
G C Y+ Y G+ T G +R+T + +G P A GC NSGF + G
Sbjct: 123 GSSTCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGFD---GVDG 178
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDADVRRRDLETTPIL-L 279
++G P+SL+SQL I FSYCLV +S + FG A + +++T I
Sbjct: 179 LVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPP 238
Query: 280 SDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQ 338
SD P +YL + I++ + P G IID+GT +T++ +G Y ++
Sbjct: 239 SDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTYVPSGVYGRVLS 287
Query: 339 RYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPENMYFIEP 397
R + ++ +L R +S D CY S+ +P++T L A N + +
Sbjct: 288 RMESMV-TLPRVD---GSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVD 343
Query: 398 DRG-RFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D G C+A+ SI+G QQ I+YD L F C
Sbjct: 344 DSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 170/367 (46%), Gaps = 16/367 (4%)
Query: 84 EDIHLPMAKQDLFYSVEVN--IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
++I M D + VN +G P PQ + DT S L+W QC+PC CF Q+TPIFDP
Sbjct: 44 DEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDP 103
Query: 142 RASTTYSEIPCDDPLC-RSPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRN-GF 197
S+TY ++ D P+C SP K N +C+Y Y G + G + E F + G
Sbjct: 104 SKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGT 163
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
V + FGC + N G F G+ SGILG +A S+ S+L +R FSYC+ +
Sbjct: 164 VTVSSVVFGCGHSNRG-RFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHY 218
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
D + + +TP +Y+ L IS+G + P F G GG
Sbjct: 219 THNQLVLGDGVKMEGSSTP--FHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGV 276
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT 377
++D+GT TF+ + L +++R +Q I Y R + + +P +
Sbjct: 277 VMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 336
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY---SILGAWQQQNMLIIYDLNVPAL 434
FH E +V N F++ ++ FC+A+ + S++G QQ+ + YDL +
Sbjct: 337 FHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRV 396
Query: 435 RFGSENC 441
F +C
Sbjct: 397 YFQRTDC 403
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 170/367 (46%), Gaps = 16/367 (4%)
Query: 84 EDIHLPMAKQDLFYSVEVN--IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
++I M D + VN +G P PQ + DT S L+W QC+PC CF Q+TPIFDP
Sbjct: 44 DEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDP 103
Query: 142 RASTTYSEIPCDDPLC-RSPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRN-GF 197
S+TY ++ D P+C SP K N +C+Y Y G + G + E F + G
Sbjct: 104 SKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGT 163
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
V + FGC + N G F G+ SGILG +A S+ S+L +R FSYC+ +
Sbjct: 164 VTVSSVVFGCGHSNRG-RFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHY 218
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
D + + +TP +Y+ L IS+G + P F G GG
Sbjct: 219 THNQLVLGDGVKMEGSSTP--FHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGV 276
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT 377
++D+GT TF+ + L +++R +Q I Y R + + +P +
Sbjct: 277 VMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 336
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY---SILGAWQQQNMLIIYDLNVPAL 434
FH E +V N F++ ++ FC+A+ + S++G QQ+ + YDL +
Sbjct: 337 FHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRV 396
Query: 435 RFGSENC 441
F +C
Sbjct: 397 YFQRTDC 403
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 122/370 (32%), Positives = 180/370 (48%), Gaps = 29/370 (7%)
Query: 84 EDIHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
ED+ P+ A+ Y V +G P KP +++ DT S + W QC+PC C+ Q+ PIF
Sbjct: 140 EDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIF 199
Query: 140 DPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
DP AS++Y+ + CD C+ C+NGKC+Y Y G T G ET +F G
Sbjct: 200 DPTASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSF----GA 255
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEAT 256
V R+A GC +DN G +G+LG PLSL+SQ++ FSYCLV R+ +
Sbjct: 256 GSVNRVAIGCGHDNEGLFV--GSAGLLGLGGGPLSLTSQIKATS---FSYCLVDRDSGKS 310
Query: 257 SVIKFGRDADVRRRDLETTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
S ++F R D P+L + + +Y+ L +S+G IV PP F + + G G
Sbjct: 311 STLEFNSP---RPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAG 367
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--Y 373
G I+D+GT +T +R Y ++ + + +L P FD CY SS ++
Sbjct: 368 GVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLR----PAEGVALFDTCYDL-SSLQSVRV 422
Query: 374 PSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNV 431
P+++FH P Y I D G +C A SI+G QQQ + +DL
Sbjct: 423 PTVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLAN 482
Query: 432 PALRFGSENC 441
+ F C
Sbjct: 483 SLVGFSPNKC 492
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/396 (31%), Positives = 188/396 (47%), Gaps = 22/396 (5%)
Query: 55 ERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLF 114
+RI K ++K A ++++ A + ++ MA+ Y + +GTPM+ Q+++
Sbjct: 156 QRIEKRLRLNKDPAGSHENVAEVAA-EFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVL 214
Query: 115 DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTR 172
DT S +VW QC+PC +C+ Q PIF+P S ++S + C+ +C + C G C+Y
Sbjct: 215 DTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKV 274
Query: 173 RYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
Y G T G + E F G T V +A GC +DN+G +G+LG A LS
Sbjct: 275 SYGDGSYTIGSFATEMLTF----GTTSVRNVAIGCGHDNAGLFV--GAAGLLGLGAGLLS 328
Query: 233 LSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHL 290
SQL + FSYCLV R E++ ++FG ++ L TP+L + P F Y+ L
Sbjct: 329 FPSQLGTQTGRAFSYCLVDRFSESSGTLEFGPESVPLGSIL--TPLLTNPSLPTFYYVPL 386
Query: 291 LEISIGRHIV-RFPPGAFDI-MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
+ IS+G ++ PP F I G GGFI+D+GT VT ++ Y + + R L
Sbjct: 387 ISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLP 446
Query: 349 RQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAI 406
+ FD CY P++ FH ++ P Y I D G FC A
Sbjct: 447 KAE----GVSIFDTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAF 502
Query: 407 Q-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
SI+G QQQ + + +D + F C
Sbjct: 503 APATSDLSIMGNIQQQGIRVSFDTANSLVGFALRQC 538
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 181/405 (44%), Gaps = 36/405 (8%)
Query: 55 ERIHKMFEISKARANYMA----SMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQ 110
+R+ + ++ AR + A S SK + + L A Y V V +GTP +
Sbjct: 96 DRVDSIHRLAAARPSSTADDPSSASKGVSLPARRGVPLGTAN----YIVSVGLGTPKRDL 151
Query: 111 HLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKC 168
++FDT S L W QC+PC C+ Q P+FDP STTYS +PC CR C +GKC
Sbjct: 152 LVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLDSGSCSSGKC 211
Query: 169 VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL---AFGCSNDNSGFAFGGKISGILG 225
Y Y T G +R+T + + +L FGC +D++G GK G+ G
Sbjct: 212 RYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLF--GKADGLFG 269
Query: 226 FNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH 285
+SL+SQ + FSYCL A + G A R T + SD
Sbjct: 270 LGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYLSLGSAAPPNAR--FTAMVTRSDTPSF 327
Query: 286 FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILR 345
+YL+L+ I + VR P F T G +ID+GT +T + + Y L + ++R
Sbjct: 328 YYLNLVGIKVAGRTVRVSPAVFR-----TPGTVIDSGTVITRLPSRAYAALRSSFAGLMR 382
Query: 346 SLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIE----PDRG 400
+R P A D CY + K PS+ + N+ F E ++
Sbjct: 383 RYSYKRAP--ALSILDTCYDFTGRNKVQIPSVALLFDGGATL----NLGFGEVLYVANKS 436
Query: 401 RFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ C+A DD +ILG QQ+ ++YD+ + FG++ C+
Sbjct: 437 QACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 165/374 (44%), Gaps = 37/374 (9%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTT 146
LP+ + Y V V +GTP K L+FDT S L WTQCQPC++ C+ Q PIFDP AS T
Sbjct: 147 LPLGTGN--YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKT 204
Query: 147 YSEIPCDDPLC--------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
YS I C C SP C + CVY +Y T G +++T + F
Sbjct: 205 YSNISCTSTACSGLKSATGNSP-GCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVF- 262
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV 258
FGC +N G GK +G++G PLS+ Q + FSYCL +
Sbjct: 263 --DGFMFGCGQNNRGLF--GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGH 318
Query: 259 IKFGRDADVR-----RRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ FG V+ + + TP S +++ +L IS+G + P F
Sbjct: 319 LTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ----- 373
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN-ASQEFDYCYRYDS-SFK 371
G IID+GT +T + + Y +L + Q + + P A D CY + +
Sbjct: 374 NAGTIIDSGTVITRLPSTVYGSLKSTFKQFMS-----KYPTAPALSLLDTCYDLSNYTSI 428
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYD 428
+ P ++F+ + N I + C+A DD I G QQQ + ++YD
Sbjct: 429 SIPKISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYD 488
Query: 429 LNVPALRFGSENCA 442
+ L FG + C+
Sbjct: 489 VAGGQLGFGYKGCS 502
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 121/430 (28%), Positives = 199/430 (46%), Gaps = 30/430 (6%)
Query: 32 FSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMA 91
S++LI SP SPLY + ++R++ F S +R+ + N D+ +
Sbjct: 26 LSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRL------NNILSQTDLQSGLI 79
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
D + + + IGTP + DT S L W QC+PC +C+ + PIFD + S+TY P
Sbjct: 80 GADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEP 139
Query: 152 CDDPLCR----SPFKCQNGKCVYTRRYHVGD--VTRGLASRETFAFPVRNGF-TFVPRLA 204
CD C S C K V RY GD ++G + ET + +G P
Sbjct: 140 CDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTV 199
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE---MEATSVIKF 261
FGC +N G F SGI+G LSL SQL + I FSYCL + TSVI
Sbjct: 200 FGCGYNNGG-TFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINL 258
Query: 262 GRDAD----VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFD-----IMRD 312
G ++ + + +TP++ + R ++YL L IS+G+ + + +++ I +
Sbjct: 259 GTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSE 318
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
+G IID+GT +T + +G + ++++ R P +C++ S+
Sbjct: 319 TSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGL---LSHCFKSGSAEIG 375
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
P +T H AD + P N F++ C+++ + +I G + Q + L+ YDL
Sbjct: 376 LPEITVHFTGADVRLSPINA-FVKVSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETR 434
Query: 433 ALRFGSENCA 442
+ F +C+
Sbjct: 435 TVSFQRMDCS 444
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 167/353 (47%), Gaps = 24/353 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +G P K +++ DT S + W QCQPC C+ Q+ PIF P AS++YS + CD
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQ 218
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C S C+NG+C Y Y G T G ET +F G V +A GC +DN G
Sbjct: 219 CNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSF---GGSGTVNSIALGCGHDNEGL 275
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG PLSL+SQL+ FSYCLV R+ A+S + F D
Sbjct: 276 FV--GAAGLLGLGGGPLSLTSQLKATS---FSYCLVNRDSAASSTLDFNS---APVGDSV 327
Query: 274 TTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
P+L S + +Y+ L +S+G ++R P F + G GG I+D GT +T +++
Sbjct: 328 IAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQSEA 387
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD--SSFKAYPSMTFHLQEADYIVQPE 390
Y +L + S+ R + FD CY SS K P+++FH P
Sbjct: 388 YNSLRDSF----VSMSRHLRSTSGVALFDTCYDLSGQSSVKV-PTVSFHFDGGKSWDLPA 442
Query: 391 NMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G +C A SI+G QQQ + +DL + F + C
Sbjct: 443 ANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/416 (30%), Positives = 188/416 (45%), Gaps = 34/416 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
LS E + +M SKAR+ + +S A ++ D Y V + IGTP +P
Sbjct: 66 GLSTRELLRRMAARSKARSARL--LSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQP 123
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKC---- 163
L+ DT S L WTQC PC+ CF Q+ P F+P S T+S +PCD +CR + C
Sbjct: 124 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 183
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRN---GFTFVPRLAFGCSNDNSGFAFGGK 219
NG CVY Y +T G +TF+F + G VP L FGC N+G F
Sbjct: 184 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGI-FVSN 242
Query: 220 ISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVI-----KFGRDADVRRRDL 272
+GI GF+ LS+ +QL+ FSYC + E + V DA +
Sbjct: 243 ETGIAGFSRGALSMPAQLKVDN---FSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGV 299
Query: 273 ETTPILL---SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
+ L+ S +Y+ L +++G + P F + DGTGG I+D+GT +T +
Sbjct: 300 VQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLP 359
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQ 388
Y + + + + + + S C+ K P++ H + A +
Sbjct: 360 EAVYNLVCDAF----VAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLP 415
Query: 389 PEN-MYFIEPDRG--RFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
EN M+ IE G C+AI S++G +QQQNM ++YDL L F C
Sbjct: 416 RENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/416 (30%), Positives = 188/416 (45%), Gaps = 34/416 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
LS E + +M SKAR+ + +S A ++ D Y V + IGTP +P
Sbjct: 40 GLSTRELLRRMAARSKARSARL--LSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQP 97
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKC---- 163
L+ DT S L WTQC PC+ CF Q+ P F+P S T+S +PCD +CR + C
Sbjct: 98 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 157
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRN---GFTFVPRLAFGCSNDNSGFAFGGK 219
NG CVY Y +T G +TF+F + G VP L FGC N+G F
Sbjct: 158 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGI-FVSN 216
Query: 220 ISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVI-----KFGRDADVRRRDL 272
+GI GF+ LS+ +QL+ FSYC + E + V DA +
Sbjct: 217 ETGIAGFSRGALSMPAQLKVDN---FSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGV 273
Query: 273 ETTPILL---SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
+ L+ S +Y+ L +++G + P F + DGTGG I+D+GT +T +
Sbjct: 274 VQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLP 333
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQ 388
Y + + + + + + S C+ K P++ H + A +
Sbjct: 334 EAVYNLVCDAF----VAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLP 389
Query: 389 PEN-MYFIEPDRG--RFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
EN M+ IE G C+AI S++G +QQQNM ++YDL L F C
Sbjct: 390 RENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNMHVLYDLANDMLSFVPARC 445
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 122/397 (30%), Positives = 189/397 (47%), Gaps = 24/397 (6%)
Query: 55 ERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLF 114
+RI + ++ K A +++ A + ++ M + Y + IGTP + Q+++
Sbjct: 113 QRIERKLKLKKDPAGSYENVAGVTA-EFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVL 171
Query: 115 DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTR 172
DT S +VW QC+PC C+ Q PIF+P +S ++S + CD +C C G C+Y
Sbjct: 172 DTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEV 231
Query: 173 RYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
Y G T G + ET F G T + +A GC +DN G +G+LG A LS
Sbjct: 232 SYGDGSYTVGSYATETLTF----GTTSIQNVAIGCGHDNVGLFV--GAAGLLGLGAGSLS 285
Query: 233 LSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHL 290
+QL + FSYCLV R+ E++ ++FG ++ TP++ + P F YL +
Sbjct: 286 FPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPIGSIF--TPLVANPFLPTFYYLSM 343
Query: 291 LEISIGRHIV-RFPPGAFDI-MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
+ IS+G I+ P AF I G GG IID+GT VT ++ Y L + G
Sbjct: 344 VAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIA-----G 398
Query: 349 RQRIP-YNASQEFDYCYRYDS-SFKAYPSMTFHLQE-ADYIVQPENMYFIEPDRGRFCVA 405
Q +P + FD CY + + P++ FH A +I+ +N G FC A
Sbjct: 399 TQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFA 458
Query: 406 IQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D SI+G QQQ + + +D + F + C
Sbjct: 459 FAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 126/456 (27%), Positives = 211/456 (46%), Gaps = 39/456 (8%)
Query: 9 LAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
L FF +FSV T +S FS++LI SP SP+Y ++ ++R++ F S +R+
Sbjct: 6 LLCFFLFFSV---TLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRS 62
Query: 69 ---NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
N+ S + D+ + D + + + IGTP + DT S L W QC
Sbjct: 63 RRFNHQLSQT---------DLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQC 113
Query: 126 QPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPFKCQNGKCVYTRRYHVGD--V 179
+PC +C+ + PIFD + S+TY PCD C+ + C + RY GD
Sbjct: 114 KPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSF 173
Query: 180 TRGLASRETFAFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR 238
++G + ET + +G P FGC +N G F SGI+G LSL SQL
Sbjct: 174 SKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGG-TFDETGSGIIGLGGGHLSLISQLG 232
Query: 239 NRIQGLFSYCLVRE---MEATSVIKFGRDAD----VRRRDLETTPILLSDLRPHFYLHLL 291
+ I FSYCL + TSVI G ++ + + +TP++ + ++YL L
Sbjct: 233 SSISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLE 292
Query: 292 EISIGRHIVRFPPGAFD-----IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS 346
IS+G+ + + +++ I+ + +G IID+GT +T + G + ++ +
Sbjct: 293 AISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTG 352
Query: 347 LGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
R P +C++ S+ P +T H AD + P N F++ C+++
Sbjct: 353 AKRVSDPQGL---LSHCFKSGSAEIGLPEITVHFTGADVRLSPINA-FVKLSEDMVCLSM 408
Query: 407 QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ +I G + Q + L+ YDL + F +C+
Sbjct: 409 VPTTEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 163/373 (43%), Gaps = 35/373 (9%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTT 146
LP+ + Y V V +GTP K L+FDT S L WTQCQPC++ C+ Q PIFDP S T
Sbjct: 147 LPLGTGN--YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKT 204
Query: 147 YSEIPCDDPLCRSPFK-------CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
YS I C C S C + CVY +Y T G +++ + F
Sbjct: 205 YSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDVF-- 262
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVI 259
FGC +N G GK +G++G PLS+ Q + FSYCL + +
Sbjct: 263 -DGFMFGCGQNNKGLF--GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHL 319
Query: 260 KFGRDADVR-----RRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
FG V+ + + TP S ++++ +L IS+G + P F
Sbjct: 320 TFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ-----N 374
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN-ASQEFDYCYRYDS-SFKA 372
G IID+GT +T + + Y +L + Q + + P A D CY + + +
Sbjct: 375 AGTIIDSGTVITRLPSTAYGSLKSAFKQFMS-----KYPTAPALSLLDTCYDLSNYTSIS 429
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDL 429
P ++F+ + N I + C+A DD I G QQQ + ++YD+
Sbjct: 430 IPKISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDV 489
Query: 430 NVPALRFGSENCA 442
L FG + C+
Sbjct: 490 AGGQLGFGYKGCS 502
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 129/444 (29%), Positives = 206/444 (46%), Gaps = 35/444 (7%)
Query: 18 VLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKP 77
+F T +S+ S++LI SP SPLY + S+R++ A ++ S+S+
Sbjct: 15 TIFFTSTSSAHRKNLSVELIHRDSPHSPLYNPQHTVSDRLN---------AAFLRSISRS 65
Query: 78 NAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
F D+ + Y + ++IGTP + DT S L W QC+PC +C+ Q TP
Sbjct: 66 RRFSTKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTP 125
Query: 138 IFDPRASTTYSEIPCDDPLCRSPFKCQNG------KCVYTRRYHVGD--VTRGLASRETF 189
+FD + S+TY CD C + + + G C Y RY GD T+G + ET
Sbjct: 126 LFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKY--RYSYGDESFTKGEVATETI 183
Query: 190 AFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
+ +G P AFGC +N G F SGI+G PLSL SQL + I FSYC
Sbjct: 184 SIDSSSGSPVSFPGTAFGCGYNNGG-TFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYC 242
Query: 249 LVR---EMEATSVIKFGRDADVRR--RD--LETTPILLSDLRPHFYLHLLEISIGRHIVR 301
L TSVI G ++ + +D + TTP++ D +++L L I++G+ +
Sbjct: 243 LSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLP 302
Query: 302 FP-PGAFDIMRDG--TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
+ G + + R TG IID+GT +T + +G Y ++ + R P
Sbjct: 303 YTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGI-- 360
Query: 359 EFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAW 418
+C++ P++T H AD + P N F++ C+++ + +I G
Sbjct: 361 -LTHCFKSGDKEIGLPTITMHFTGADVKLSPINS-FVKLSEDIVCLSMIPTTEVAIYGNM 418
Query: 419 QQQNMLIIYDLNVPALRFGSENCA 442
Q + L+ YDL + F +C+
Sbjct: 419 VQMDFLVGYDLETKTVSFQRMDCS 442
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 172/359 (47%), Gaps = 25/359 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +EV+IGTP + + DT S L WT C PC +C+ Q PIFDP+ ST+Y I CD L
Sbjct: 25 YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKL 84
Query: 157 CR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCS 208
C SP K C YT Y +T+G+ ++ET G + + + FGC
Sbjct: 85 CHKLDTGVCSPQK----HCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCG 140
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL-FSYCLV---REMEATSVIKFGRD 264
++N+G F + GI+G P+S SQ+ + G FS CLV ++ +S + G+
Sbjct: 141 HNNTG-GFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKG 199
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
++V + + +TP++ + +++ LL IS+G + F + + G +D+GTP
Sbjct: 200 SEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGN--VFLDSGTP 257
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEAD 384
T + P Q + Q+ + + + + CYR ++ + P +T H + D
Sbjct: 258 PTIL---PTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRG-PVLTAHFEGGD 313
Query: 385 YIVQPENMYFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ P F+ P G FC+ + + G + Q N LI +DL+ + F +C
Sbjct: 314 VKLLPTQT-FVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCT 371
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 172/355 (48%), Gaps = 23/355 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + IGTP + Q+++ DT S +VW QC+PC C+ Q PIF+P +S ++S + CD +
Sbjct: 8 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAV 67
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C C G C+Y Y G T G + ET F G T + +A GC +DN G
Sbjct: 68 CSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTF----GTTSIQNVAIGCGHDNVGL 123
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG A LS +QL + FSYCLV R+ E++ ++FG ++
Sbjct: 124 FV--GAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPIGSIF- 180
Query: 274 TTPILLSDLRPHF-YLHLLEISIGRHIV-RFPPGAFDI-MRDGTGGFIIDTGTPVTFIRN 330
TP++ + P F YL ++ IS+G I+ P AF I G GG IID+GT VT ++
Sbjct: 181 -TPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQT 239
Query: 331 GPYQTLMQRYDQILRSLGRQRIP-YNASQEFDYCYRYDS-SFKAYPSMTFHLQE-ADYIV 387
Y L + G Q +P + FD CY + + P++ FH A +I+
Sbjct: 240 SAYDALRDAFIA-----GTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFIL 294
Query: 388 QPENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+N G FC A D SI+G QQQ + + +D + F + C
Sbjct: 295 PAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 164/353 (46%), Gaps = 25/353 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG P P +++ DT S + W QC PC C++QT PIF+P +S +++ + C+
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETEQ 210
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C+S +C+NG C+Y Y G T G ET G T + +A GC ++N G
Sbjct: 211 CKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL----GSTSLGNIAIGCGHNNEGL 266
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGR--DADVRRRD 271
+G+LG LS SQL FSYCLV R+ ++TS + F D
Sbjct: 267 FI--GAAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPITPDAVTAP 321
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
L P +L FYL L +S+G ++ P +F + DG GG I+D+GT VT ++
Sbjct: 322 LHRNP----NLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTT 377
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPE 390
Y L + + L R FD CY S + P+++FH + + P
Sbjct: 378 VYNVLRDAFVKSTHDLQTAR----GVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433
Query: 391 NMYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC A D SILG QQQ + +DL + F C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 115/355 (32%), Positives = 166/355 (46%), Gaps = 24/355 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG+P + +++ DT S + W QCQPC C+ Q+ P+FDP S +Y+ + CD P
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSPR 228
Query: 157 CR--SPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
CR C+N G C+Y Y G T G + ET T V +A GC +DN
Sbjct: 229 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTL---GDSTPVTNVAIGCGHDNE 285
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRD 271
G +G+L PLS SQ+ FSYCLV R+ A S ++FG AD D
Sbjct: 286 GLFV--GAAGLLALGGGPLSFPSQISAST---FSYCLVDRDSPAASTLQFG--ADGAEAD 338
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDI-MRDGTGGFIIDTGTPVTFIR 329
T P++ S F Y+ L IS+G + P AF + G+GG I+D+GT VT ++
Sbjct: 339 TVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVTRLQ 398
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTFHLQEADYIVQ 388
+ Y L + + SL R + FD CY D + P+++ + +
Sbjct: 399 SSAYAALRDAFVRGTPSLPRT----SGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRL 454
Query: 389 PENMYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G +C+A + SI+G QQQ + +D + F C
Sbjct: 455 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 132/446 (29%), Positives = 203/446 (45%), Gaps = 22/446 (4%)
Query: 6 ALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISK 65
+L L + +++ FL + GFS+++I S SPLY + +R+ S
Sbjct: 9 SLALVLLWCLYNISFL----KANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAVRRSI 64
Query: 66 ARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
R N+ AF + + Y + ++G+P + DT S ++W QC
Sbjct: 65 NRGNHFK-----KAFVSTDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQC 119
Query: 126 QPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRG 182
+PC C+ QTTPIFDP S TY +PC C R+ + C Y+ Y G + G
Sbjct: 120 EPCEDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDG 179
Query: 183 LASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI 241
S ET +G + P+ GC ++N G F + SGI+G P+SL SQL + I
Sbjct: 180 DLSVETLTLGSTDGSSVHFPKTVIGCGHNNGG-TFQEEGSGIVGLGGGPVSLISQLSSSI 238
Query: 242 QGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRH 298
G FSYCL E ++S + FG A V R +TP+ + + ++L L S+G +
Sbjct: 239 GGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDN 298
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
+ F + G G IID+GT +T + Y L +++ L R R P S+
Sbjct: 299 RIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIK-LERARDP---SK 354
Query: 359 EFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAW 418
CY+ S P +T H + AD + P + F+ ++G C A +I G
Sbjct: 355 LLSLCYKTTSDELDLPVITAHFKGADVELNPIST-FVPVEKGVVCFAFISSKIGAIFGNL 413
Query: 419 QQQNMLIIYDLNVPALRFGSENCANG 444
QQN+L+ YDL + F +C G
Sbjct: 414 AQQNLLVGYDLVKKTVSFKPTDCTKG 439
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/427 (29%), Positives = 190/427 (44%), Gaps = 36/427 (8%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF+ LI SP+SP Y + S+R+ S R F E + P
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRV-----------FHFTEKDNTPQ 78
Query: 91 AKQDLF-----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+ DL Y + V+IGTP P + DT S L+WTQC PC C+ Q P+FDP+ S+
Sbjct: 79 PQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSS 138
Query: 146 TYSEIPCDDPLC-----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF- 199
TY ++ C C ++ + C Y+ Y T+G + +T +
Sbjct: 139 TYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQ 198
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEAT 256
+ + GC ++N+G F K SGI+G P+SL QL + I G FSYCLV + + T
Sbjct: 199 LKNIIIGCGHNNAG-TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 257
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
S I FG +A V + +TP++ + F YL L IS+G +++ + G
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEG 314
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPS 375
IID+GT +T + P + + D + S+ ++ + CY K P
Sbjct: 315 NIIIDSGTTLTLL---PTEFYSELEDAVASSIDAEK-KQDPQSGLSLCYSATGDLKV-PV 369
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
+T H AD + N F++ C A + P +SI G Q N L+ YD +
Sbjct: 370 ITMHFDGADVKLDSSNA-FVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVS 428
Query: 436 FGSENCA 442
F +CA
Sbjct: 429 FKPTDCA 435
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/427 (29%), Positives = 190/427 (44%), Gaps = 36/427 (8%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF+ LI SP+SP Y + S+R+ S R F E + P
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRV-----------FHFTEKDNTPQ 78
Query: 91 AKQDLF-----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+ DL Y + V+IGTP P + DT S L+WTQC PC C+ Q P+FDP+ S+
Sbjct: 79 PQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSS 138
Query: 146 TYSEIPCDDPLC-----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF- 199
TY ++ C C ++ + C Y+ Y T+G + +T +
Sbjct: 139 TYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQ 198
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEAT 256
+ + GC ++N+G F K SGI+G P+SL QL + I G FSYCLV + + T
Sbjct: 199 LKNIIIGCGHNNAG-TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 257
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
S I FG +A V + +TP++ + F YL L IS+G +++ + G
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEG 314
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPS 375
IID+GT +T + P + + D + S+ ++ + CY K P
Sbjct: 315 NIIIDSGTTLTLL---PTEFYSELEDAVASSIDAEK-KQDPQSGLSLCYSATGDLKV-PV 369
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
+T H AD + N F++ C A + P +SI G Q N L+ YD +
Sbjct: 370 ITMHFDGADVKLDSSNA-FVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVS 428
Query: 436 FGSENCA 442
F +CA
Sbjct: 429 FKPTDCA 435
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 116/398 (29%), Positives = 189/398 (47%), Gaps = 38/398 (9%)
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
F + R N KP+ Q P++ D Y +E++IGTP + DT S L
Sbjct: 30 FSVKLIRRNSSHDSYKPSTIQS------PVSAYDCEYLMELSIGTPPIKIYAEADTGSDL 83
Query: 121 VWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC--------DDPLCRSPFKCQNGKCVYTR 172
VW QC PC +C+ Q P+FDPR+S++Y+ I C D LC + K C YT
Sbjct: 84 VWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQK----TCNYTY 139
Query: 173 RYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
Y +T+G+ ++ET G + FGC ++NSG F + G++G PL
Sbjct: 140 SYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSG--FNDREMGLIGLGRGPL 197
Query: 232 SLSSQLRNRIQG---LFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPH 285
SL SQ+ + + +FS CLV + TS + FG+ ++V +TP++ D +
Sbjct: 198 SLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGY 257
Query: 286 FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILR 345
F LL IS+ + F G+ + G +ID+GT +T++ Y L++ Q+
Sbjct: 258 F-ATLLGISVEDINLPFSNGS-SLGTITKGNILIDSGTTITYLPEEFYHRLIE---QVRN 312
Query: 346 SLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVA 405
+ + + ++ CY+ ++ P++T H + D ++ P M FI FC A
Sbjct: 313 KVALEPFRIDG---YELCYQTPTNLNG-PTLTIHFEGGDVLLTPAQM-FIPVQDDNFCFA 367
Query: 406 IQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ D + +Y G + Q N LI +DL + F + +C
Sbjct: 368 VFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDCT 405
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 167/360 (46%), Gaps = 22/360 (6%)
Query: 89 PMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
P+A + Y ++++ G P + + DT S L W QC PC C++ + FDP S +Y
Sbjct: 82 PVASGNGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYK 141
Query: 149 EIPCDDPLCRS-PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
+ C C+ PF+ C Y Y G T G S + V G +P +AFGC
Sbjct: 142 TLGCGSNFCQDLPFQSCAASCQYDYMYGDGSSTSGALSTDD----VTIGTGKIPNVAFGC 197
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
N N G G++G PLSL SQL FSYCLV + +T
Sbjct: 198 GNSNLGTFA--GAGGLVGLGKGPLSLVSQLGGTATKKFSYCLV-PLGSTKTSPLYIGDST 254
Query: 268 RRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ TP+L ++ P FY L+ IS+ V +P FDI G GG I+D+GT +T
Sbjct: 255 LAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLT 314
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPY-NASQEF---DYCYRYDS-SFKAYPSMTFHLQ 381
++ + ++ ++ +L + +PY A F +YC+ + YP++ FH
Sbjct: 315 YLD-------VDAFNPMVAAL-KAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFN 366
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
AD + P+N + G C+A+ +SI G QQ N +I++DL + F S NC
Sbjct: 367 GADVALAPDNTFIALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 113/363 (31%), Positives = 177/363 (48%), Gaps = 37/363 (10%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
+E+ IGTP L DT S L+W QC PC+ C+ Q P+FDP S+TY+ I CD PLC
Sbjct: 70 MEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPLCH 129
Query: 159 -------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSND 210
SP K +C YT Y +T+G+ +++T F G + R FGC ++
Sbjct: 130 KLDTGVCSPEK----RCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHN 185
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLV---REMEATSVIKFGRDAD 266
N+G F G++G P SL SQ+ G FS CLV +++ +S + FG+ +
Sbjct: 186 NTG-GFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQ 244
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
V + TTP++ + +++ LL IS+ FP + G ++D+GTP
Sbjct: 245 VLGNGVVTTPLVPREKDTSYFVTLLGISV--EDTYFPMNS----TIGKANMLVDSGTPPI 298
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ P Q + + ++ + + I + S CYR ++ K P++TFH A+ +
Sbjct: 299 LL---PQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTNLKG-PTLTFHFVGANVL 354
Query: 387 VQPENMYFIEP---DRGRFCVAI----QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
+ P FI P +G FC+AI DP + G + Q N LI +DL+ + F
Sbjct: 355 LTPIQT-FIPPTPQTKGIFCLAIYNRTNSDP--GVYGNFAQSNYLIGFDLDRQVVSFKPT 411
Query: 440 NCA 442
+C
Sbjct: 412 DCT 414
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 119/401 (29%), Positives = 177/401 (44%), Gaps = 27/401 (6%)
Query: 58 HKMFEISK---ARANYMASMSKPNAF--QELEDIHLPMAKQDLFYSVEVNIGTPMKPQHL 112
H M ++ AR Y+ P + ++ +++ Y V V +G+P Q+L
Sbjct: 89 HAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVSGISEGSGEYFVRVGVGSPPTEQYL 148
Query: 113 LFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR-----SPFKCQNGK 167
+ D+ S ++W QC+PC C+ Q P+FDP AS +++ +PCD +CR S +G
Sbjct: 149 VVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSGVCRTLPGGSSGCADSGA 208
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C Y Y G T+G+ + ET F T V +A GC + N G G +G+LG
Sbjct: 209 CRYQVSYGDGSYTQGVLAMETLTF---GDSTPVQGVAIGCGHRNRGLFVG--AAGLLGLG 263
Query: 228 ASPLSLSSQLRNRIQGLFSYCLV-REMEA-TSVIKFGRDADVRRRDLETTPILLSDLRPH 285
P+SL QL G FSYCL R +A + FGRD D P+L + +P
Sbjct: 264 WGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLVFGRD-DAMPVGAVWVPLLRNAQQPS 322
Query: 286 FYLHLLEISIGRHIVRFP--PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQI 343
FY ++ +G R P G FD+ DG GG ++DTGT VT + Y L D
Sbjct: 323 FY-YVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALR---DAF 378
Query: 344 LRSLGRQRIPYNASQEFDYCYRYD--SSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR 401
++G D CY +S + + ++ + P +E G
Sbjct: 379 ASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRDGAALTLPARNLLVEMGGGV 438
Query: 402 FCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+C+A SILG QQQ + I D + FG C
Sbjct: 439 YCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPSTC 479
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 167/352 (47%), Gaps = 23/352 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG P + +++ DT S + W QC PC C+ QT PIF+P +S++Y + CD P
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 210
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C + +C+N C+Y Y G T G + ET G T V +A GC + N G
Sbjct: 211 CNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI----GSTLVQNVAVGCGHSNEGL 266
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG L+L SQL FSYCLV R+ ++ S ++FG D
Sbjct: 267 FV--GAAGLLGLGGGLLALPSQLNTTS---FSYCLVDRDSDSASTVEFGTSLP---PDAV 318
Query: 274 TTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
P+L + L +YL L IS+G +++ P +F++ G+GG IID+GT VT ++ G
Sbjct: 319 VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGI 378
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPEN 391
Y +L + + L + FD CY + P++ FH + P
Sbjct: 379 YNSLRDSFLKGTSDLEKAA----GVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAK 434
Query: 392 MYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC+A +I+G QQQ + +DL + F S C
Sbjct: 435 NYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 119/374 (31%), Positives = 172/374 (45%), Gaps = 37/374 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +E+ IGTP P L DT S L WTQC+PC CF Q TPI+D AS ++S +PC
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154
Query: 157 C----RSPFKC---QNGKCVYTRRYHVGDVTRGLASRETFAF----PVRNG-FTFVPRLA 204
C RS C C Y Y G + G+ ET F P G V +A
Sbjct: 155 CLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVA 214
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR--EMEATSVIKFG 262
FGC DN G ++ +G +G LSL +QL G FSYCL S + FG
Sbjct: 215 FGCGVDNGGLSY--NSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLGSPVLFG 269
Query: 263 RDAD------VRRRDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
A+ + +++TP++ P +Y+ L IS+G + P G FD+ DG+G
Sbjct: 270 SLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSG 329
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPS 375
G I+D+GT T + ++ ++ +L Q + NAS C+ + + P
Sbjct: 330 GMIVDSGTIFTVLVESAFRVVVNHVAGVL----NQPV-VNASSLDSPCFPATAGEQQLPD 384
Query: 376 MTFHLQE----ADYIVQPENMYFIEPDRGRFCVAIQDDPKY--SILGAWQQQNMLIIYDL 429
M L AD + +N + FC+ I P SILG +QQQN+ +++D+
Sbjct: 385 MPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDI 444
Query: 430 NVPALRFGSENCAN 443
V L F +C+
Sbjct: 445 TVGQLSFVPTDCSK 458
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 165/351 (47%), Gaps = 29/351 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ IGTP + DT S +WTQC PC+ C++QT PIFDP S+T+ EI CD
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT-- 122
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFA 215
+ C Y Y T+G ET +G FV P GC +NSGF
Sbjct: 123 -------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFK 175
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETT 275
G +G++G + P SL +Q+ GL SYC + TS I FG +A V + +T
Sbjct: 176 PG--FAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG--KGTSKINFGANAIVAGDGVVST 231
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
+ + +P F YL+L +S+G + F ++ G +ID+G+ +T+
Sbjct: 232 TVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALK---GNIVIDSGSTLTYFPESYCN 288
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPENMY 393
+ + +Q++ ++ R P + CY Y + +P +T H AD ++ NMY
Sbjct: 289 LVRKAVEQVVTAV---RFP----RSDILCY-YSKTIDIFPVITMHFSGGADLVLDKYNMY 340
Query: 394 FIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
G FC+AI + +I G Q N L+ YD + + F NC+
Sbjct: 341 VASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 166/352 (47%), Gaps = 20/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP K +++ DT S + W QC PC C+ Q+ PIFDP +S+T+ + C DP
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPK 223
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C S C++ KC+Y Y G T G + +T F V +A GC +DN G
Sbjct: 224 CASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTFGESGK---VNDVALGCGHDNEGL 280
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LS+++Q++ + FSYCLV R+ +S + F + D
Sbjct: 281 F--TGAAGLLGLGGGALSMTNQIKAKS---FSYCLVDRDSAKSSSLDF-NSVQIGAGD-A 333
Query: 274 TTPILL-SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T P+L S + +Y+ L S+G V P F++ G GG I+D GT VT ++
Sbjct: 334 TAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQA 393
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPEN 391
Y +L + ++ + P + FD CY + S S P++TFH + P
Sbjct: 394 YNSLRDAFVKLTTDFKKGTSPISL---FDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAK 450
Query: 392 MYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC A SI+G QQQ I YDL + + C
Sbjct: 451 NYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 169/358 (47%), Gaps = 33/358 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +G P + +++ DT S + W QCQPC C+ Q+ P++DP ST+Y+ + CD P
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSPR 222
Query: 157 CR--SPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAF----PVRNGFTFVPRLAFGCS 208
CR C+N G C+Y Y G T G + ET PV N +A GC
Sbjct: 223 CRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAPVSN-------VAIGCG 275
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADV 267
+DN G +G+L PLS SQ+ FSYCLV R+ ++S ++FG
Sbjct: 276 HDNEGLFV--GAAGLLALGGGPLSFPSQISATT---FSYCLVDRDSPSSSTLQFGDS--- 327
Query: 268 RRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ T P++ S F Y+ L IS+G + P AF + G+GG I+D+GT VT
Sbjct: 328 -EQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVT 386
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADY 385
+++G Y L + + Q +SL R + FD CY S P++ +
Sbjct: 387 RLQSGAYGALREAFVQGTQSLPRA----SGVSLFDTCYDLAGRSSVQVPAVALWFEGGGE 442
Query: 386 IVQPENMYFIEPD-RGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ P Y I D G +C+A SI+G QQQ + + +D + F ++ C
Sbjct: 443 LKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 165/351 (47%), Gaps = 29/351 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ IGTP + DT S +WTQC PC+ C++QT PIFDP S+T+ EI CD
Sbjct: 59 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT-- 116
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFA 215
+ C Y Y T+G ET +G FV P GC +NSGF
Sbjct: 117 -------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFK 169
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETT 275
G +G++G + P SL +Q+ GL SYC + TS I FG +A V + +T
Sbjct: 170 PG--FAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG--KGTSKINFGANAIVAGDGVVST 225
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
+ + +P F YL+L +S+G + F ++ G +ID+G+ +T+
Sbjct: 226 TVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALK---GNIVIDSGSTLTYFPESYCN 282
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPENMY 393
+ + +Q++ ++ R P + CY Y + +P +T H AD ++ NMY
Sbjct: 283 LVRKAVEQVVTAV---RFP----RSDILCY-YSKTIDIFPVITMHFSGGADLVLDKYNMY 334
Query: 394 FIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
G FC+AI + +I G Q N L+ YD + + F NC+
Sbjct: 335 VASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/362 (31%), Positives = 181/362 (50%), Gaps = 30/362 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +E+ IGTP DT S L+W QC PC+ C++Q P+FDP S+TY+ I CD PL
Sbjct: 64 YLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPL 123
Query: 157 CRSPF--KCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNS 212
C P+ +C K C YT Y +T+G+ ++ET G + + FGC ++N+
Sbjct: 124 CYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNT 183
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLV---REMEATSVIKFGRDADVR 268
G F G++G P SL SQ+ G FS CLV ++ +S + FG+ ++V
Sbjct: 184 G-NFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEVL 242
Query: 269 RRDLETTPILLSDL-RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ TTP++ + +Y+ LL IS+ + P I + G ++D+GTP
Sbjct: 243 GEGVVTTPLVQREQDMTSYYVTLLGISVED---TYLPMNSTIEK---GNMLVDSGTPPNI 296
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
+ P Q + Y ++ + + I + S CYR ++ K P++T+H + A+ ++
Sbjct: 297 L---PQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKG-PTLTYHFEGANLLL 352
Query: 388 QPENMYFIEP---DRGRFCVAIQD----DPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
P FI P +G FC+AI + DP I G + Q N LI +DL+ + F +
Sbjct: 353 TPIQT-FIPPTPETKGVFCLAITNCANSDP--GIYGNFAQTNYLIGFDLDRQIVSFKPTD 409
Query: 441 CA 442
C
Sbjct: 410 CT 411
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 134/456 (29%), Positives = 210/456 (46%), Gaps = 28/456 (6%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTG-FSLKLIPIFSPESPLYPGNLSQSERIHK 59
MA L+ F + +++ T T+S + G F+ LI SP SPLY + +R+
Sbjct: 1 MAAFSITHLSLFVIFVALISKTSLTASMNNGSFTASLIHRDSPISPLYNPKNTYFDRLQS 60
Query: 60 MFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASS 119
F S +RAN PN+ + + + Y + ++IGTP ++ DT S
Sbjct: 61 SFHRSISRANRFT----PNSVSAAKTLEYDIIPGGGEYFMRISIGTPPIEVLVIADTGSD 116
Query: 120 LVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC---RSPFKCQNG-----KCVYT 171
L+W QCQPC C+ Q +PIF+P+ S+TY + C+ C S + + C Y+
Sbjct: 117 LIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYS 176
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
Y T G + E F N + LAFGC N N G F SGI+G L
Sbjct: 177 YSYGDHSFTMGYLATERFIIGSTN--NSIQELAFGCGNSNGG-NFDEVGSGIVGLGGGSL 233
Query: 232 SLSSQLRNRIQGLFSYCLVREMEATSV----IKFGRDADVRRRDL-ETTPILLSDLRPHF 286
SL SQL +I FSYCLV +E ++ I FG ++ + D +TP++ + +
Sbjct: 234 SLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETFY 293
Query: 287 YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS 346
YL L IS+G + + D + G IID+GT +TF+ + Y L ++ +
Sbjct: 294 YLTLEAISVGNERLAYENSRNDGNVE-KGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEG 352
Query: 347 LGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
+R+ + + F C+R D P +T H +AD ++P N F + + C +
Sbjct: 353 ---ERVS-DPNGIFSICFR-DKIGIELPIITVHFTDADVELKPINT-FAKAEEDLLCFTM 406
Query: 407 QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+I G Q N L+ YDL+ + F +C+
Sbjct: 407 IPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/353 (32%), Positives = 163/353 (46%), Gaps = 25/353 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG P P +++ DT S + W QC PC C++QT P F+P +S +++ + C+
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETEQ 210
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C+S +C+NG C+Y Y G T G ET G T + +A GC ++N G
Sbjct: 211 CKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL----GSTSLGNIAIGCGHNNEGL 266
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGR--DADVRRRD 271
+G+LG LS SQL FSYCLV R+ ++TS + F D
Sbjct: 267 FI--GAAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPITPDAVTAP 321
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
L P +L FYL L +S+G ++ P +F + DG GG I+D+GT VT ++
Sbjct: 322 LHRNP----NLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTT 377
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPE 390
Y L + + L R FD CY S + P+++FH + + P
Sbjct: 378 VYNVLRDAFVKSTHDLQTAR----GVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433
Query: 391 NMYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC A D SILG QQQ + +DL + F C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 128/428 (29%), Positives = 201/428 (46%), Gaps = 25/428 (5%)
Query: 26 SSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMS-KPNAFQELE 84
+S GFSL LI SP SPLY N + +R+ F S +R N + + N+FQ
Sbjct: 28 ASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLRNAFSRSISRVNVFKTKAVDINSFQN-- 85
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+ +P + Y ++++IGTP+ ++ DT S L W QC PC C+ Q +P+FDP S
Sbjct: 86 DL-VPNGGE---YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRS 141
Query: 145 TTYSEIPCDDPLCR----SPFKCQNGKCVYTRRYHVGD--VTRGLASRETFAFPVRNGF- 197
++Y + C C S C + Y GD T G + E F +
Sbjct: 142 SSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRP 201
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REME 254
+ + FGC N G F SGI+G LSL SQL + I+G FSYCLV +
Sbjct: 202 VHLSPIVFGCGTGNGG-TFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSN 260
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
TS IKFG D+ + + +TP++ ++Y+ L IS+G + + G + +
Sbjct: 261 VTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVE-K 319
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
G IID+GT +TF+ + + L + ++ +++ +R+ + F C+R P
Sbjct: 320 GNVIIDSGTTLTFLDSEFFTELERVLEETVKA---ERVS-DPRGLFSVCFRSAGDID-LP 374
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
+ H +AD +QP N F++ D C + + I G Q + L+ YDL +
Sbjct: 375 VIAVHFNDADVKLQPLNT-FVKADEDLLCFTMISSNQIGIFGNLAQMDFLVGYDLEKRTV 433
Query: 435 RFGSENCA 442
F +C
Sbjct: 434 SFKPTDCT 441
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/447 (28%), Positives = 197/447 (44%), Gaps = 30/447 (6%)
Query: 12 FFSYFSVLFL---THFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
F+S +LF + +++ GFS++LI S +SP Y S +R+ + S R
Sbjct: 3 FYSSLLLLFCFCRVSVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNMKHSTNRV 62
Query: 69 NYMASMSK--PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
+Y+ + PN + + P Y + IGTP + + DTA+ +W QC
Sbjct: 63 HYLNHVFSFPPNKVPNI--VVSPFMGDG--YIISFLIGTPPFQLYGVMDTANDNIWFQCN 118
Query: 127 PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGK---CVYTRRYHVGDVTR 181
PC CF+ T+P+FDP S+TY IPC P C++ C + C Y+ Y ++
Sbjct: 119 PCKPCFNTTSPMFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQ 178
Query: 182 GLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR 240
G S +T N + GC + N G G +SG +G PLS SQL +
Sbjct: 179 GDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKG-PLEGYVSGNIGLGRGPLSFISQLNSS 237
Query: 241 IQGLFSYCLVREMEATSV---IKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
I G FSYCLV + + FG + V +TPI ++ + L +S+G
Sbjct: 238 IGGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIG--YSTTLNALSVGD 295
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
HI++F D G IID+GT +T + Y R + I+ S+ + + +
Sbjct: 296 HIIKFENSTSK--NDNLGNTIIDSGTTLTILPENVY----SRLESIVTSMVKLERAKSPN 349
Query: 358 QEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY--SIL 415
Q+F CY+ P +T H AD + N ++ D C A + +I+
Sbjct: 350 QQFKLCYKATLKNLDVPIITAHFNGADVHLNSLNTFY-PIDHEVVCFAFVSVGNFPGTII 408
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENCA 442
G QQN L+ +DL + F +C
Sbjct: 409 GNIAQQNFLVGFDLQKNIISFKPTDCT 435
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 130/441 (29%), Positives = 194/441 (43%), Gaps = 53/441 (12%)
Query: 4 VQALPLAAFFSYFSVLFLTHFTS---SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
++ + FF+ V FL + GFS+ LI SP SP + + +Q+ER+
Sbjct: 1 MEGFGVKIFFNVVVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDA 60
Query: 61 FEISKARANYMASMSKPNAFQE--LEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTAS 118
F S +R +P A ++ +P A + Y + + IGTP P + DT S
Sbjct: 61 FRRSVSRVGRF----RPTAMTSDGIQSRIVPSAGE---YLMNLYIGTPPVPVIAIVDTGS 113
Query: 119 SLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK----CQNGKCVYTRRY 174
L WTQC+PC C+ Q P+FDP+ S+TY + C C + K + KC + Y
Sbjct: 114 DLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSY 173
Query: 175 HVGDVTRGLASRETFAFPVRNG--FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLS 232
G T G + ET G +F P AFGC + +SG F SGI+G LS
Sbjct: 174 ADGSFTGGNLASETLTVDSTAGKPVSF-PGFAFGCGH-SSGGIFDKSSSGIVGLGGGELS 231
Query: 233 LSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYL 288
L SQL++ I GLFSYCL+ + +S I FG V +TP+ L P+ Y
Sbjct: 232 LISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRL----PYKGYS 287
Query: 289 HLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
E+ G I+D+GT TF+ Y L + + S+
Sbjct: 288 KKTEVE-------------------EGNIIVDSGTTYTFLPQEFYSKLEK---SVANSIK 325
Query: 349 RQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD 408
+R+ + + F CY + A P +T H ++A+ +QP N F+ C +
Sbjct: 326 GKRV-RDPNGIFSLCYNTTAEINA-PIITAHFKDANVELQPLNT-FMRMQEDLVCFTVAP 382
Query: 409 DPKYSILGAWQQQNMLIIYDL 429
+LG Q N L+ +DL
Sbjct: 383 TSDIGVLGNLAQVNFLVGFDL 403
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 88/216 (40%), Gaps = 23/216 (10%)
Query: 242 QGLFSYC--LVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHI 299
G+FS C E+ A + +DA+V + L T + DL +I + ++
Sbjct: 333 NGIFSLCYNTTAEINAPIITAHFKDANVELQPLNTFMRMQEDLVCFTVAPTSDIGVLGNL 392
Query: 300 --VRFPPGAFDIMRD---------GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
V F G FD+ + G I+D+GT T++ P + ++ + + S+
Sbjct: 393 AQVNFLVG-FDLRKKRGFSKKAEVEEGNIIVDSGTTYTYL---PLEFYVKLEESVAHSIK 448
Query: 349 RQRI--PYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
+R+ P S CY P +T H ++A+ +QP N F+ C +
Sbjct: 449 GKRVRDPNGISS---LCYNTTVDQIDAPIITAHFKDANVELQPWNT-FLRMQEDLVCFTV 504
Query: 407 QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
ILG Q N L+ +DL + F + +C
Sbjct: 505 LPTSDIGILGNLAQVNFLVGFDLRKKRVSFKAADCT 540
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 171/365 (46%), Gaps = 26/365 (7%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y + +GTP + +++ DT S +VW QC PCI+C+ QT P+FDP S +++
Sbjct: 138 LAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFAN 197
Query: 150 IPCDDPLCRS---PFKCQNGK--CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
IPC PLCR P C K C+Y Y G T G S ET F T V R+
Sbjct: 198 IPCGSPLCRRLDYP-GCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRG----TRVGRVV 252
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFG 262
GC +DN G +G+LG LS SQ+ R FSYCL ++ S I FG
Sbjct: 253 LGCGHDNEGLFV--GAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFG 310
Query: 263 RDADVRRRDLETTPILLS-DLRPHFYLHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIID 320
A R TP+L + L +Y+ LL IS+ G + F + G GG IID
Sbjct: 311 DSAISRTTRF--TPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIID 368
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFH 379
+GT VT + Y L + ++ + +R P + FD C+ + P++ H
Sbjct: 369 SGTSVTRLTRAAYVALRDAF--LVGASNLKRAPEFS--LFDTCFDLSGKTEVKVPTVVLH 424
Query: 380 LQEADYIVQPENMYFIEPDR-GRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFG 437
+ AD + P + Y I D G FC A SI+G QQQ ++YDL + F
Sbjct: 425 FRGAD-VPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFA 483
Query: 438 SENCA 442
CA
Sbjct: 484 PRGCA 488
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 122/390 (31%), Positives = 183/390 (46%), Gaps = 31/390 (7%)
Query: 63 ISKARANYMASMSKPNAFQELEDIHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTAS 118
ISKA +++M E +DI P+ + Y V IG P + +++ DT S
Sbjct: 114 ISKADLKPISTMYT----TEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGS 169
Query: 119 SLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGKCVYTRRYHV 176
+ W QC PC C+ QT PIF+P +S++Y + CD P C + +C+N C+Y Y
Sbjct: 170 DVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGD 229
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQ 236
G T G + ET G T V +A GC + N G +G+LG L+L SQ
Sbjct: 230 GSYTVGDFATETLTI----GSTLVQNVAVGCGHSNEGLFV--GAAGLLGLGGGLLALPSQ 283
Query: 237 LRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEIS 294
L FSYCLV R+ ++ S + FG D P+L + L +YL L IS
Sbjct: 284 LNTTS---FSYCLVDRDSDSASTVDFGTSL---SPDAVVAPLLRNHQLDTFYYLGLTGIS 337
Query: 295 IGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY 354
+G +++ P +F++ G+GG IID+GT VT ++ Y +L + + +L ++
Sbjct: 338 VGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSF--VKGTLDLEKAAG 395
Query: 355 NASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDR-GRFCVAIQ-DDPK 411
A FD CY + P++ FH + P Y I D G FC+A
Sbjct: 396 VA--MFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASS 453
Query: 412 YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+I+G QQQ + +DL + F S C
Sbjct: 454 LAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 153/353 (43%), Gaps = 39/353 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +G+P Q+L+ D+ S ++W QC+PC +C+ QT P+FDP AS+++S + C +
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAI 189
Query: 157 CRS------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
CR+ GKC Y+ Y G T+G + ET G T V +A GC +
Sbjct: 190 CRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGCGHR 245
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR 270
NSG G +G+LG +SL QL G+FSYCL R
Sbjct: 246 NSGLFVG--AAGLLGLGWGAMSLVGQLGGAAGGVFSYCLA------------------SR 285
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
L S +Y+ L I +G + F + DG GG ++DTGT VT +
Sbjct: 286 GAGGAGSLASSF---YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPR 342
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQP 389
Y L +D + +L R A D CY P+++F+ + + P
Sbjct: 343 EAYAALRGAFDGAMGALPRSP----AVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLP 398
Query: 390 ENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+E FC+A SILG QQ+ + I D + FG C
Sbjct: 399 ARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 123/446 (27%), Positives = 199/446 (44%), Gaps = 53/446 (11%)
Query: 12 FFSYFSVLFLT----HFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKAR 67
F+S F +L T +++ GF+++LI S SP Y +Q +RI + S R
Sbjct: 3 FYSSFVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINR 62
Query: 68 ANYMASMSK--PNAFQELEDIHLPMAK-QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQ 124
Y+ + PN Q++ P++ Y + +IGTP + L DT + +W Q
Sbjct: 63 VRYLNHVFSFSPNKIQDV-----PLSSFMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQ 117
Query: 125 CQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLA 184
C+PC C +QT+P+F P S+TY IPC P+C++ +++G T L
Sbjct: 118 CKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPICKN-----------ADGHYLGVDTLTLN 166
Query: 185 SRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
S +N + GC + N G G +SG +G PLS SQL + I G
Sbjct: 167 SNNGTPISFKN-------IVIGCGHRNQG-PLEGYVSGNIGLARGPLSFISQLNSSIGGK 218
Query: 245 FSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVR 301
FSYCLV + +S + FG + V +TPI + +++ L S+G HI++
Sbjct: 219 FSYCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPIKEEN---GYFVSLEAFSVGDHIIK 275
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
D G IID+GT +T + Y R + ++ + + + + SQ+F+
Sbjct: 276 LE------NSDNRGNSIIDSGTTMTILPKDVY----SRLESVVLDMVKLKRVKDPSQQFN 325
Query: 362 YCYRYDSS--FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS---ILG 416
CY+ S+ +T H ++ + N ++ D C A +S I G
Sbjct: 326 LCYQTTSTTLLTKVLIITAHFSGSEVHLNALNTFYPITDE-VICFAFVSGGNFSSLAIFG 384
Query: 417 AWQQQNMLIIYDLNVPALRFGSENCA 442
QQN L+ +DLN + F +C
Sbjct: 385 NVVQQNFLVGFDLNKKTISFKPTDCT 410
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 173/367 (47%), Gaps = 23/367 (6%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
LP A + F SV +GTP P L+ DT S +VW QC+PC+ C+ Q +P++DPR S+TY
Sbjct: 92 LPFASGEYFASV--GVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTY 149
Query: 148 SEIPCDDPLCRSPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
++ PC P CR+P C G C Y Y T G + + F + T V +
Sbjct: 150 AQTPCSPPQCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVF---SNDTSVGNVTL 206
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL---VREMEATSVIKFG 262
GC +DN G G +G+LG S ++Q+ + F+YCL R ++S + FG
Sbjct: 207 GCGHDNEGLF--GSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFG 264
Query: 263 RDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRH-IVRFPPGAFDI-MRDGTGGFII 319
R A + TP+ + RP +Y+ ++ S+G + F + + G GG ++
Sbjct: 265 RTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVV 323
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTF 378
D+GT +T Y L +D +G +++ S FD CY A P +
Sbjct: 324 DSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISV-FDACYDLRGVAVADAPGVVL 382
Query: 379 HLQ-EADYIVQPENMYFIEPDRGRF-CVAIQ--DDPKYSILGAWQQQNMLIIYDLNVPAL 434
H AD + PEN Y + + GR+ C A++ S++G QQ +++D+ +
Sbjct: 383 HFAGGADVALPPEN-YLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERV 441
Query: 435 RFGSENC 441
F C
Sbjct: 442 GFEPNGC 448
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 130/421 (30%), Positives = 185/421 (43%), Gaps = 47/421 (11%)
Query: 53 QSERIHKMFEISKARANY----MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
Q + + + ISKA AN + +S P + + Y ++ +GTP
Sbjct: 89 QRDELRAAWIISKAAANGTPPPVVGLSTGRGLVAPVVSRAPTSGE---YMAKIAVGTPAV 145
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK-----C 163
L DTAS L W QCQPC RC+ Q+ P+FDPR ST+Y E+ D P C++ +
Sbjct: 146 QALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSGGGDA 205
Query: 164 QNGKCVYTRRY---------HVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
+ G C+YT +Y VGD+ TFA VR + L+ GC +DN G
Sbjct: 206 KRGTCIYTVQYGDGHGSTSTSVGDLVE---ETLTFAGGVRQAY-----LSIGCGHDNKGL 257
Query: 215 AFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCLVREMEA----TSVIKFGRDADVRR 269
FG +GILG +S+ Q+ FSYCLV + +S + FG A
Sbjct: 258 -FGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTS 316
Query: 270 RDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPP-GAFDIMRD---GTGGFIIDTGTP 324
TP +L+ P F Y+ L+ +S+G VR P D+ D G GG I+D+GT
Sbjct: 317 PPASFTPTVLNQNMPTFYYVRLIGVSVGG--VRVPGVTERDLQLDPYTGRGGVILDSGTT 374
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEA 383
VT + Y + SLG Q S FD CY + P+++ H
Sbjct: 375 VTRLARPAYVAFRDAFRAAATSLG-QVSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGG 433
Query: 384 -DYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ +QP+N RG C A D S++G QQ ++YDL + F N
Sbjct: 434 VEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQRVGFAPNN 493
Query: 441 C 441
C
Sbjct: 494 C 494
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 165/372 (44%), Gaps = 31/372 (8%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y ++ +GTP+ P ++ DT S +VW QC PC RC+DQ+ +FDPRAS +Y
Sbjct: 140 LAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGA 199
Query: 150 IPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
+ C PLCR + C+Y Y G VT G + ET F VPR+A
Sbjct: 200 VDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASG---ARVPRVAL 256
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-------REMEATSV 258
GC +DN G F + S LS SQ+ R FSYCLV +S
Sbjct: 257 GCGHDNEGL-FVAAAGLLGLGRGS-LSFPSQISRRFGRSFSYCLVDRTSSSASATSRSST 314
Query: 259 IKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMR----DG 313
+ FG A TP++ + + +Y+ L+ IS+G R P A +R G
Sbjct: 315 VTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGG--ARVPGVAVSDLRLDPSTG 372
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA- 372
GG I+D+GT VT + Y L + L R+ FD CY S K
Sbjct: 373 RGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGL---RLSPGGFSLFDTCYDL-SGLKVV 428
Query: 373 -YPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDL 429
P+++ H P Y I D RG FC A D SI+G QQQ +++D
Sbjct: 429 KVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDG 488
Query: 430 NVPALRFGSENC 441
+ L F + C
Sbjct: 489 DGQRLGFVPKGC 500
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 181/355 (50%), Gaps = 25/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTTPIFDPRASTTYSEIPCD 153
Y ++ +G P+K +L+ DT S + W QCQPC C+ Q PIFDP++S++YS + C+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 154 DPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C+ C + C+Y Y G T G + ET +F N +P L GC +DN
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS---IPNLPIGCGHDN 264
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRR 270
G +G++G +SLSSQL+ FSYCLV + +++S ++F +
Sbjct: 265 EGLFA--GGAGLIGLGGGAISLSSQLK---ASSFSYCLVNLDSDSSSTLEFNSNMP---S 316
Query: 271 DLETTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
D T+P++ +D + Y+ ++ IS+G + P F+I G GG I+D+GT ++ +
Sbjct: 317 DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLP 376
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
+ Y++L + + ++ SL P FD CY + S P++ F L E +
Sbjct: 377 SDVYESLREAFVKLTSSLS----PAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRL 432
Query: 389 PENMYFIEPDR-GRFCVA-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G +C+A I+ SI+G++QQQ + + YDL + F + C
Sbjct: 433 PARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 120/393 (30%), Positives = 187/393 (47%), Gaps = 35/393 (8%)
Query: 62 EISKARANYMASMSKPNAFQELEDIHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTA 117
+ISK+ + + KP ED+ P+ ++ Y V +G P + +++ DT
Sbjct: 128 DISKSDLKPLETEIKP------EDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTG 181
Query: 118 SSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGKCVYTRRYH 175
S + W QCQPC C+ QT PIFDP AS+TY+ + C C S C++G+C+Y Y
Sbjct: 182 SDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYG 241
Query: 176 VGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSS 235
G T G + E+ +F V +A GC +DN G +G+LG PLSL++
Sbjct: 242 DGSYTFGDFATESVSFGNSGS---VKNVALGCGHDNEGLFV--GAAGLLGLGGGPLSLTN 296
Query: 236 QLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEI 293
QL+ FSYCLV R+ +S + F ++ D T P++ + + +Y+ L +
Sbjct: 297 QLKATS---FSYCLVNRDSAGSSTLDF--NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGM 351
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP 353
S+G +V P F + G GG I+D GT +T ++ Y L + ++ ++L
Sbjct: 352 SVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLT--- 408
Query: 354 YNASQEFDYCYRYDSSFKA---YPSMTFHLQEADYIVQPENMYFIEPDR-GRFCVAIQ-D 408
+A FD C YD S +A P+++FH + P Y I D G +C A
Sbjct: 409 -SAVALFDTC--YDLSGQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPT 465
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
SI+G QQQ + +DL + F C
Sbjct: 466 TSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 166/392 (42%), Gaps = 24/392 (6%)
Query: 55 ERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLF 114
+ IH+M + +S SK + + L A Y V V +GTP + ++F
Sbjct: 152 DSIHRM--TAGPWTAGQSSASKGVSLPAHRGLRLGTAN----YIVSVGLGTPRRDLLVVF 205
Query: 115 DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRY 174
DT S L W QC+PC C+ Q P+FDP STTYS +PC C C +GKC Y Y
Sbjct: 206 DTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQECLDSGTCSSGKCRYEVVY 265
Query: 175 HVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLS 234
T G +R+T + + FGC +D++G G+ G+ G +SL+
Sbjct: 266 GDMSQTDGNLARDTLTLGPSS--DQLQGFVFGCGDDDTGLF--GRADGLFGLGRDRVSLA 321
Query: 235 SQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEIS 294
SQ R FSYCL A + G A T + SD +YL L+ I
Sbjct: 322 SQAAARYGAGFSYCLPSSWRAEGYLSLGSAAAPPHAQF-TAMVTRSDTPSFYYLDLVGIK 380
Query: 295 IGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY 354
+ VR P F G +ID+GT +T + + Y L + +R +R P
Sbjct: 381 VAGRTVRVAPAVFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMRRY--KRAP- 432
Query: 355 NASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDP 410
A D CY + K PS+ + +R + C+A DD
Sbjct: 433 -ALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYVANRSQACLAFASNGDDT 491
Query: 411 KYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
ILG QQ+ ++YDL + FG++ C+
Sbjct: 492 SVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 178/371 (47%), Gaps = 29/371 (7%)
Query: 84 EDIHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
ED+ P+ ++ Y V +G P + +++ DT S + W QCQPC C+ QT PIF
Sbjct: 3 EDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIF 62
Query: 140 DPRASTTYSEIPCDDPLCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
DP AS+TY+ + C C S C++G+C+Y Y G T G + E+ +F
Sbjct: 63 DPTASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGS- 121
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEAT 256
V +A GC +DN G +G+LG PLSL++QL+ FSYCLV R+ +
Sbjct: 122 --VKNVALGCGHDNEGLFV--GAAGLLGLGGGPLSLTNQLKATS---FSYCLVNRDSAGS 174
Query: 257 SVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
S + F ++ D T P++ + + +Y+ L +S+G +V P F + G G
Sbjct: 175 STLDF--NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNG 232
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--- 372
G I+D GT +T ++ Y L + ++ ++L +A FD C YD S +A
Sbjct: 233 GIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLT----SAVALFDTC--YDLSGQASVR 286
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLN 430
P+++FH + P Y I D G +C A SI+G QQQ + +DL
Sbjct: 287 VPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLA 346
Query: 431 VPALRFGSENC 441
+ F C
Sbjct: 347 NNRMGFSPNKC 357
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 175/377 (46%), Gaps = 36/377 (9%)
Query: 82 ELEDIHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
+ ED+ P+ ++ Y + +GTP K +L+ DT S + W QC+PC C+ Q+ P
Sbjct: 143 QTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDP 202
Query: 138 IFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRN 195
+F+P +S+TY + C P C C++ KC+Y Y G T G + +T F
Sbjct: 203 VFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG 262
Query: 196 GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REME 254
+ +A GC +DN G +G+LG LS+++Q++ FSYCLV R+
Sbjct: 263 K---INNVALGCGHDNEGLFT--GAAGLLGLGGGVLSITNQMKATS---FSYCLVDRDSG 314
Query: 255 ATSVIKF------GRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAF 307
+S + F G DA T P+L + + +Y+ L S+G V P F
Sbjct: 315 KSSSLDFNSVQLGGGDA--------TAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIF 366
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD 367
D+ G+GG I+D GT VT ++ Y +L + ++ +L + ++ FD CY +
Sbjct: 367 DVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGS---SSISLFDTCYDFS 423
Query: 368 S-SFKAYPSMTFHLQEADYIVQPENMYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNML 424
S S P++ FH + P Y I D G FC A SI+G QQQ
Sbjct: 424 SLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTR 483
Query: 425 IIYDLNVPALRFGSENC 441
I YDL+ + C
Sbjct: 484 ITYDLSKNVIGLSGNKC 500
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 124/403 (30%), Positives = 199/403 (49%), Gaps = 40/403 (9%)
Query: 51 LSQSERIHKMFEISKARANYMA-SMSKPNAFQELE-DIHLPMAKQDLFYSVEVNIGTPMK 108
++ + I M +IS AR Y+ S+ K + + D+H + K LF+ V ++G P
Sbjct: 22 VTPEDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQAI-KTSLFF-VNFSVGQPPV 79
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQ--TTPIFDPRASTTYSEIPCDDPLCRSP--FKCQ 164
PQ + DT SSL+W QC PC C P+F+P S+T+ E CDD CR C
Sbjct: 80 PQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCS 139
Query: 165 NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-LAFGCSNDNSGFAFGGKISGI 223
+ KCVY + Y G ++G+ ++E F NG T V + +AFGC ++N G + +GI
Sbjct: 140 SNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHEN-GEQLESEFTGI 198
Query: 224 LGFNASPLSLSSQLRNRIQGLFSYC---LVREMEATSVIKFGRDADVRRRDLETTPILLS 280
LG A P SL+ QL ++ FSYC L + + + G DAD+ + TPI
Sbjct: 199 LGLGAKPTSLAVQLGSK----FSYCIGDLANKNYGYNQLVLGEDADILG---DPTPIEFE 251
Query: 281 DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
+Y++L IS+G + P F R G I+DTGT T++ + Y+ L Y
Sbjct: 252 TENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSRTGVILDTGTLYTWLADIAYREL---Y 307
Query: 341 DQILRSLGRQRIPYNASQEFDYCY--RYDSSFKAYPSMTFHLQ-EADYIVQPENMYF--I 395
++I +S+ ++ ++F CY R + +P +TFH A+ ++ +M++
Sbjct: 308 NEI-KSILDPKLERFWFRDF-LCYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFYPMT 365
Query: 396 EPD--RGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDL 429
E D FC++++ + ++ +G QQ I YDL
Sbjct: 366 ESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDL 408
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 175/377 (46%), Gaps = 36/377 (9%)
Query: 82 ELEDIHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
+ ED+ P+ ++ Y + +GTP K +L+ DT S + W QC+PC C+ Q+ P
Sbjct: 143 QTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDP 202
Query: 138 IFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRN 195
+F+P +S+TY + C P C C++ KC+Y Y G T G + +T F
Sbjct: 203 VFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG 262
Query: 196 GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REME 254
+ +A GC +DN G +G+LG LS+++Q++ FSYCLV R+
Sbjct: 263 K---INNVALGCGHDNEGLFT--GAAGLLGLGGGVLSITNQMKATS---FSYCLVDRDSG 314
Query: 255 ATSVIKF------GRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAF 307
+S + F G DA T P+L + + +Y+ L S+G V P F
Sbjct: 315 KSSSLDFNSVQLGGGDA--------TAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIF 366
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD 367
D+ G+GG I+D GT VT ++ Y +L + ++ +L + ++ FD CY +
Sbjct: 367 DVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGS---SSISLFDTCYDFS 423
Query: 368 S-SFKAYPSMTFHLQEADYIVQPENMYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNML 424
S S P++ FH + P Y I D G FC A SI+G QQQ
Sbjct: 424 SLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTR 483
Query: 425 IIYDLNVPALRFGSENC 441
I YDL+ + C
Sbjct: 484 ITYDLSKNVIGLSGNKC 500
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 164/366 (44%), Gaps = 39/366 (10%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q + Y IGTP +P + D A LVWTQC+ C RCF+Q TP+FDP AS TY PC
Sbjct: 47 QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPC 106
Query: 153 DDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC S C C Y + GD T G +TFA T LAFGC
Sbjct: 107 GTPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAV-----GTAKASLAFGCV 160
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADV 267
+ GG SGI+G +P SL +Q FSYCL + S + G A +
Sbjct: 161 VASDIDTMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGRNSALFLGSSAKL 216
Query: 268 R-RRDLETTPIL-----LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+TP + +DL ++ + L + G ++ PP ++ +DT
Sbjct: 217 AGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVL--------LDT 268
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
+P++F+ +G YQ + + + ++G + + FD C+ + A P + F +
Sbjct: 269 FSPISFLVDGAYQAVKK---AVTAAVGAPPM-ATPVEPFDLCFPKSGASGAAPDLVFTFR 324
Query: 382 EADYIVQPENMYFIEPDRGRFCVA------IQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
+ P Y ++ G C+A + + S+LG+ QQ+N+ ++DL+ L
Sbjct: 325 GGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLS 384
Query: 436 FGSENC 441
F +C
Sbjct: 385 FEPADC 390
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 172/364 (47%), Gaps = 33/364 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y++ +++GTP+ ++ DT S L+WTQC PC +CF Q P F P +S+T+S++PC
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C+ S C CVY +Y G T G + ET ++ G P +AFGCS +N
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATET----LKVGDASFPSVAFGCSTEN- 199
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDADVRRRD 271
G SGI G LSL QL G FSYCL A S I FG A++ +
Sbjct: 200 --GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDGN 254
Query: 272 LETTPILLS-DLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGFIIDTGTPVTFI 328
+++TP + + + P ++Y++L I++G + F ++G GG I+D+GT +T++
Sbjct: 255 VQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYL 314
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK---AYPSMTFHLQEADY 385
Y+ + Q + S N ++ D C++ A PS+
Sbjct: 315 AKDGYEMVKQAF----LSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAE 370
Query: 386 IVQPENMYFIEPD-RGRFCVA------IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
P +E D +G VA + D S++G Q +M ++YDL+ F
Sbjct: 371 YAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAP 430
Query: 439 ENCA 442
+CA
Sbjct: 431 ADCA 434
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 180/355 (50%), Gaps = 25/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTTPIFDPRASTTYSEIPCD 153
Y ++ +G P+K +L+ DT S + W QCQPC C+ Q PIFDP++S++YS + C+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 154 DPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C+ C + C+Y Y G T G + ET +F N +P L GC +DN
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS---IPNLPIGCGHDN 264
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRR 270
G +G++G +SLSSQL+ FSYCLV + +++S ++F
Sbjct: 265 EGLFA--GGAGLIGLGGGAISLSSQLK---ASSFSYCLVNLDSDSSSTLEFN---SYMPS 316
Query: 271 DLETTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
D T+P++ +D + Y+ ++ IS+G + P F+I G GG I+D+GT ++ +
Sbjct: 317 DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLP 376
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
+ Y++L + + ++ SL P FD CY + S P++ F L E +
Sbjct: 377 SDVYESLREAFVKLTSSLS----PAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRL 432
Query: 389 PENMYFIEPDR-GRFCVA-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G +C+A I+ SI+G++QQQ + + YDL + F + C
Sbjct: 433 PARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/434 (28%), Positives = 195/434 (44%), Gaps = 22/434 (5%)
Query: 16 FSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMS 75
FS L++ + GF+ LI SP+SP Y + S+RI S R ++ +S
Sbjct: 15 FSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLS 74
Query: 76 KPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT 135
+ +A L + Y + +++GTP P + DT S+L+WTQC+PC C+ Q
Sbjct: 75 EMDA--SLNSPQTDITPCGGEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQV 132
Query: 136 TPIFDPRASTTYSEIPCDDPLC-----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFA 190
P+FDP+AS+TY ++ C C ++ ++ C Y Y G T G + +T
Sbjct: 133 DPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLT 192
Query: 191 F-PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL 249
N + + GC +N+ F K SG++G +SL QL + I G FSYCL
Sbjct: 193 LGSTDNRPVQLKNIIIGCGQNNA-VTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCL 251
Query: 250 VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI 309
V E + TS I FG +A V +TP+++ +YL L IS+G ++ P
Sbjct: 252 VPENDQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIK- 310
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS 369
G +ID+GT +T + Y + ++ + + +S CY +
Sbjct: 311 -----GNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSS----LCYNATAD 361
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYD 428
P +T H + AD + P N +F + C+A + I G Q+N L+ YD
Sbjct: 362 LN-IPVITMHFEGADVKLYPYNSFF-KVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYD 419
Query: 429 LNVPALRFGSENCA 442
+ F +CA
Sbjct: 420 TASKTMSFKPTDCA 433
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/434 (26%), Positives = 194/434 (44%), Gaps = 26/434 (5%)
Query: 15 YFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASM 74
+F + +F+ + G S+++I +SPLY +++ +R + + S R NY
Sbjct: 11 FFYLCCFIYFSHASKKGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHRSINRVNYFTKE 70
Query: 75 SKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ 134
N Q + + + + + YSV GTP + DT S++VW QCQPC CF+Q
Sbjct: 71 FSLNKNQPVSTLTPELGEYLISYSV----GTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQ 126
Query: 135 TTPIFDPRASTTYSEIPCDDPLCR----SPFKCQNGK--CVYTRRYHVGDVTRGLASRET 188
T+PIF+P S++Y IPC C+ + C NG C Y+ Y ++G S ++
Sbjct: 127 TSPIFNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDS 186
Query: 189 FAFPVRNGFTFV-PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL-RNRIQGLFS 246
+G + + P + GC + N + SG++G P+SL Q+ + + FS
Sbjct: 187 LTLDSTSGSSVLFPNIVIGCGHINV-LQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFS 245
Query: 247 YCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRF 302
YCL+ + ++S + FG D V + +TP++ + + ++Y LE S+G + + +
Sbjct: 246 YCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEY 305
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
+ T +ID+GTP+T + N L+ Q ++ L R P
Sbjct: 306 GERS----NASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVK-LPRIEPP---DHHLSL 357
Query: 363 CYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQN 422
CY P +T H AD + +F D G C I G Q N
Sbjct: 358 CYNTTGKQLNVPDITAHFNGADVKLNSNGTFFPFED-GIMCFGFISSNGLEIFGNIAQNN 416
Query: 423 MLIIYDLNVPALRF 436
+LI YDL + F
Sbjct: 417 LLIDYDLEKEIISF 430
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 172/363 (47%), Gaps = 32/363 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y++ +++GTP+ ++ DT S L+WTQC PC +CF Q P F P +S+T+S++PC
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C+ S C CVY +Y G T G + ET ++ G P +AFGCS +N
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATET----LKVGDASFPSVAFGCSTEN- 199
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDADVRRRD 271
G SGI G LSL QL G FSYCL A S I FG A++ +
Sbjct: 200 --GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDGN 254
Query: 272 LETTPILLS-DLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGFIIDTGTPVTFI 328
+++TP + + + P ++Y++L I++G + F ++G GG I+D+GT +T++
Sbjct: 255 VQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYL 314
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--AYPSMTFHLQEADYI 386
Y+ + Q + S N ++ D C++ A PS+
Sbjct: 315 AKDGYEMVKQAF----LSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEY 370
Query: 387 VQPENMYFIEPD-RGRFCVA------IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
P +E D +G VA + D S++G Q +M ++YDL+ F
Sbjct: 371 AVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPA 430
Query: 440 NCA 442
+CA
Sbjct: 431 DCA 433
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 132/432 (30%), Positives = 206/432 (47%), Gaps = 54/432 (12%)
Query: 27 SESTGFSLKLIPIFSPE-SPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELED 85
++++GF+++LI SP SP Y +S+ +H M + F +
Sbjct: 3 ADNSGFTIQLIRHNSPNYSPFY-----KSDELH------------MHRLGSNGVFTRV-- 43
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+ Y +++ +GTP + L DT S LVW QC PC C+ Q +P+F+P S
Sbjct: 44 -----TSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSN 98
Query: 146 TYSEIPCDDPLCRSPF--KCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVP 201
TY+ IPCD C S F C K C Y+ Y VT+G+ +RET F +G V
Sbjct: 99 TYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVG 158
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL-FSYCLV---REMEATS 257
+ FGC + NSG F GI+G PLSL SQ N FS CLV +
Sbjct: 159 DIVFGCGHSNSG-TFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLG 217
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
I FG +DV + TP++ + + + + L IS+G V F + +++ G
Sbjct: 218 TISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSF--NSSEMLSKGN--I 273
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR--IPYNASQEF--DYCYRYDSSFKAY 373
+ID+GTP T++ + YD++++ L Q +P + + CYR +++ +
Sbjct: 274 MIDSGTPATYLP-------QEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEG- 325
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDLNV 431
P + H + AD + P FI P G FC A+ D +Y I G + Q N+LI +DL+
Sbjct: 326 PILIAHFEGADVQLMPIQT-FIPPKDGVFCFAMAGTTDGEY-IFGNFAQSNVLIGFDLDR 383
Query: 432 PALRFGSENCAN 443
+ F + +C+N
Sbjct: 384 KTVSFKATDCSN 395
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 175/353 (49%), Gaps = 24/353 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG+P K +++ DT S + W QC PC C+ Q PIF+P S++Y+ + C+
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQ 214
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C+S +C+N C+Y Y G T G + ET +G + +A GC +DN G
Sbjct: 215 CKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITL---DGSASLNNVAIGCGHDNEGL 271
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LS SQ+ FSYCLV R+ ++ S ++F ++ + +
Sbjct: 272 FV--GAAGLLGLGGGSLSFPSQIN---ASSFSYCLVNRDTDSASTLEF--NSPIPSHSV- 323
Query: 274 TTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T P+L ++ L +YL + I +G ++ P +F++ G GG I+D+GT VT +++
Sbjct: 324 TAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDV 383
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDS-SFKAYPSMTFHLQEADYIVQPE 390
Y +L D +R G Q +P + FD CY S S P+++FH + Y+ P
Sbjct: 384 YNSLR---DSFVR--GTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPA 438
Query: 391 NMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC A SI+G QQQ + YDL+ + F C
Sbjct: 439 KNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 164/366 (44%), Gaps = 39/366 (10%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q + Y IGTP +P + D A LVWTQC+ C RCF+Q TP+FDP AS TY PC
Sbjct: 47 QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPC 106
Query: 153 DDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC S C C Y + GD T G +TFA T LAFGC
Sbjct: 107 GTPLCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAV-----GTAKASLAFGCV 160
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADV 267
+ GG SGI+G +P SL +Q FSYCL + S + G A +
Sbjct: 161 VASDIDTMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGKNSALFLGSSAKL 216
Query: 268 R-RRDLETTPIL-----LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+TP + +DL ++ + L + G ++ PP ++ +DT
Sbjct: 217 AGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVL--------LDT 268
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
+P++F+ +G YQ + + + ++G + + FD C+ + A P + F +
Sbjct: 269 FSPISFLVDGAYQAVKK---AVTVAVGAPPM-ATPVEPFDLCFPKSGASGAAPDLVFTFR 324
Query: 382 EADYIVQPENMYFIEPDRGRFCVA------IQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
+ P Y ++ G C+A + + S+LG+ QQ+N+ ++DL+ L
Sbjct: 325 GGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLS 384
Query: 436 FGSENC 441
F +C
Sbjct: 385 FEPADC 390
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 169/356 (47%), Gaps = 30/356 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +G P KP +++ DT S + W QCQPC C+ QT PIFDPR+S++++ +PC+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C++ C+ KC+Y Y G T G ET F + +A GC +DN G
Sbjct: 215 CQALETSGCRASKCLYQVSYGDGSFTVGEFVTETLTF---GNSGMINDVAVGCGHDNEGL 271
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG PLSL+SQ++ FSYCLV R+ ++S ++F A D
Sbjct: 272 FV--GSAGLLGLGGGPLSLTSQMK---ASSFSYCLVDRDSSSSSDLEFNSAAP---SDSV 323
Query: 274 TTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
P+L S + +Y+ L +S+G ++ PP F + G GG I+D+GT +T ++
Sbjct: 324 NAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQA 383
Query: 333 YQTLMQRYDQILRSLGRQRIPY----NASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIV 387
Y TL + R PY N FD CY S + P+++F +
Sbjct: 384 YNTLRDAF--------VSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQ 435
Query: 388 QPENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G FC A SI+G QQQ + YDL + F C
Sbjct: 436 LPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 179/418 (42%), Gaps = 36/418 (8%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLP----MAKQDLFYSVEVNIGT 105
N + E + + K RA ++ + + + P +A+ Y ++ +GT
Sbjct: 78 NATAGELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSGLAQGSGEYFTKIGVGT 137
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPF 161
P ++ DT S +VW QC PC RC++Q+ P+FDPR S++Y + C LCR
Sbjct: 138 PATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGC 197
Query: 162 KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKIS 221
+ G C+Y Y G VT G ET F G V R+A GC +DN G +
Sbjct: 198 DLRRGACMYQVAYGDGSVTAGDFVTETLTFA---GGARVARVALGCGHDNEGLFV--AAA 252
Query: 222 GILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA----------TSVIKFGRDADVRRRD 271
G+LG LS +Q+ R FSYCLV + +S + FG V
Sbjct: 253 GLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGA-GSVGASS 311
Query: 272 LETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMR----DGTGGFIIDTGTPVT 326
TP++ + + +Y+ L+ IS+G R P A +R G GG I+D+GT VT
Sbjct: 312 ASFTPMVRNPRMETFYYVQLVGISVGG--ARVPGVAESDLRLDPSTGRGGVIVDSGTSVT 369
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADY 385
+ Y L + + G R+ FD CY P+++ H
Sbjct: 370 RLARASYSALRDAFRAA--AAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAE 427
Query: 386 IVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D RG FC A D SI+G QQQ +++D + + F + C
Sbjct: 428 AALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/409 (27%), Positives = 174/409 (42%), Gaps = 32/409 (7%)
Query: 58 HKMFEI---SKARANYMAS----MSKPNAFQELED-IHLPMAKQDLFYSVEVNIGTPMKP 109
H + ++ ARA Y+A+ +P F E + + + Y V V++G+P
Sbjct: 124 HAVLDLVARDNARAEYLATRLSPAYQPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTE 183
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGK 167
Q+L+ D+ S ++W QC+PC+ C+ Q P+FDP S T+S + C +CR C +G+
Sbjct: 184 QYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRILPTSACGDGE 243
Query: 168 ---CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGIL 224
C Y Y G T+G + ET G T V + GC + N G G +G++
Sbjct: 244 LGGCEYEVSYADGSYTKGALALETLTL----GGTAVEGVVIGCGHRNRGLFVG--AAGLM 297
Query: 225 GFNASPLSLSSQLRNRIQGLFSYCLVREM--------EATSVIKFGRDADVRRRDLETTP 276
G P+SL QL + G FSYCL + + GR V + P
Sbjct: 298 GLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAV-WVP 356
Query: 277 ILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQT 335
++ + P F Y+ L I +G + G F + DG G ++DTGT VT + Y
Sbjct: 357 LVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAA 416
Query: 336 LMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYF 394
L + L R +S D CY P+++F ++
Sbjct: 417 LRDAFVGALAG-AVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFCFDGDARLILAARNVL 475
Query: 395 IEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+E D G +C+A SI+G QQ + I D + FG NC
Sbjct: 476 LEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITVDSANGYIGFGPANCG 524
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 176/385 (45%), Gaps = 35/385 (9%)
Query: 73 SMSKPNAFQEL--EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+ KP +E D+ M + Y V + +G+P + Q+++ D+ S ++W QC+PC +
Sbjct: 108 AAGKPTYAEEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQ 167
Query: 131 CFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRET 188
C+ Q+ P+F+P S++Y+ + C +C C G+C Y Y G T+G + ET
Sbjct: 168 CYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALET 227
Query: 189 FAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
F G T + +A GC + N G G +G+LG + P+S QL + G FSYC
Sbjct: 228 LTF----GRTLIRNVAIGCGHHNQGMFVGA--AGLLGLGSGPMSFVGQLGGQAGGTFSYC 281
Query: 249 LV-REMEATSVIKFGRDADVRRRDLETTPI------LLSDLRPHFYLHLLEISIGRHIVR 301
LV R ++++ +++FGR+A P+ L+ + R + ++ +G +R
Sbjct: 282 LVSRGIQSSGLLQFGREA---------VPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLR 332
Query: 302 FP--PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
P F + G GG ++DTGT VT + Y+ + +L R +
Sbjct: 333 VPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRA----SGVSI 388
Query: 360 FDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDR-GRFCVAIQ-DDPKYSILG 416
FD CY P+++F+ + P + I D G FC A SI+G
Sbjct: 389 FDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIG 448
Query: 417 AWQQQNMLIIYDLNVPALRFGSENC 441
QQ+ + I D + FG C
Sbjct: 449 NIQQEGIEISVDGANGFVGFGPNVC 473
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 133/462 (28%), Positives = 199/462 (43%), Gaps = 49/462 (10%)
Query: 9 LAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
+AAF + +L L + S + ++L + + Y G +ER+ + + S R
Sbjct: 1 MAAFLVWI-LLLLPYVAISSTASHGVRLELTHADDRGGYVG----AERVRRAADRSHRRV 55
Query: 69 N-YMASMSKPNAFQEL-----------EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDT 116
N ++ ++ P++ L +H A Y V++ IGTP P + DT
Sbjct: 56 NGFLGAIEGPSSTARLGIDGAGAGGAEASVHASTAT----YLVDIAIGTPPLPLTAVLDT 111
Query: 117 ASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDPLC---RSPF-KCQ--NGKCV 169
S L+WTQC PC RCF Q P++ P S TY+ + C P+C +SP+ +C + C
Sbjct: 112 GSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCA 171
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y G T G+ + ETF T V +AFGC +N G SG++G
Sbjct: 172 YYFSYGDGTSTDGVLATETFTL---GSDTAVRGVAFGCGTENLGST--DNSSGLVGMGRG 226
Query: 230 PLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRDLETTPILLS------DL 282
PLSL SQL FSYC A S + G A + +TTP + S
Sbjct: 227 PLSLVSQLGVT---RFSYCFTPFNATAASPLFLGSSARLSSA-AKTTPFVPSPSGGARRR 282
Query: 283 RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
++YL L I++G ++ P F + G GG IID+GT T + + L +
Sbjct: 283 SSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEESAFVALA----R 338
Query: 343 ILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGR 401
L S R + A C+ S P + H AD ++ E+ + G
Sbjct: 339 ALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGV 398
Query: 402 FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
C+ + S+LG+ QQQN I+YDL L F C
Sbjct: 399 ACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKCGE 440
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 168/356 (47%), Gaps = 30/356 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +G P KP +++ DT S + W QCQPC C+ QT PIFDPR+S++++ +PC+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C++ C+ KC+Y Y G T G ET F + +A GC +DN G
Sbjct: 215 CQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTF---GNSGMINNVAVGCGHDNEGL 271
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LSL+SQ++ FSYCLV R+ ++S ++F A D
Sbjct: 272 FV--GSAGLLGLGGGSLSLTSQMK---ASSFSYCLVDRDSSSSSDLEFNSAAP---SDSV 323
Query: 274 TTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
P+L S + +Y+ L +S+G ++ PP F + G GG I+D+GT +T ++
Sbjct: 324 NAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQA 383
Query: 333 YQTLMQRYDQILRSLGRQRIPY----NASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIV 387
Y TL + R PY N FD CY S + P+++F +
Sbjct: 384 YNTLRDAF--------VSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQ 435
Query: 388 QPENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G FC A SI+G QQQ + YDL + F C
Sbjct: 436 LPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 164/355 (46%), Gaps = 24/355 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG+P + +++ DT S + W QCQPC C+ Q+ P+FDP S +Y+ + CD
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQR 225
Query: 157 CR--SPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
CR C+N G C+Y Y G T G + ET T V +A GC +DN
Sbjct: 226 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAIGCGHDNE 282
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRD 271
G +G+L PLS SQ+ FSYCLV R+ A S ++FG A
Sbjct: 283 GLFV--GAAGLLALGGGPLSFPSQISAST---FSYCLVDRDSPAASTLQFGDGA--AEAG 335
Query: 272 LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDI-MRDGTGGFIIDTGTPVTFIR 329
T P++ S F Y+ L IS+G + P AF + G+GG I+D+GT VT ++
Sbjct: 336 TVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQ 395
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTFHLQEADYIVQ 388
+ Y L + Q SL R + FD CY D + P+++ + +
Sbjct: 396 SAAYAALRDAFVQGAPSLPRT----SGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRL 451
Query: 389 PENMYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G +C+A + SI+G QQQ + +D A+ F C
Sbjct: 452 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 122/452 (26%), Positives = 212/452 (46%), Gaps = 30/452 (6%)
Query: 6 ALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISK 65
+L LA + S + + + +++ + + KLI S PLY N + +R + S
Sbjct: 14 SLTLAFYLS--TAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSI 71
Query: 66 ARANYMASMSK--PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
R +++ S K + E +P + F V ++IG+P Q ++ DT SSL+W
Sbjct: 72 ERFDFLESKIKELKSVGNEARSSLIPFNRGSGFL-VNLSIGSPPVTQLVVVDTGSSLLWV 130
Query: 124 QCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQN-GKCVYTRRYHVGDVT 180
QC PCI CF Q+T FDP S ++ + C P + +KC + Y RY GD +
Sbjct: 131 QCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSS 190
Query: 181 RGLASRETFAFPVRN-GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASP-LSLSSQLR 238
+G+ ++E+ F + G + FGC + N +G+ G A P +++++QL
Sbjct: 191 QGILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG 250
Query: 239 NRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI 295
N+ FSYC+ + + + G+ + + ++TP+ + H+Y+ L IS+
Sbjct: 251 NK----FSYCIGDINNPLYTHNHLVLGQGSYIEG---DSTPLQIH--FGHYYVTLQSISV 301
Query: 296 GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN 355
G ++ P AF I DG+GG +ID+G T + NG ++ L +++ L +RIP
Sbjct: 302 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL-LERIPTQ 360
Query: 356 ASQEFDYCYR--YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI----QDD 409
E C++ +P++TFH +V F + RFC+AI +
Sbjct: 361 RKFE-GLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEL 419
Query: 410 PKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
S++G QQN + +DL + F +C
Sbjct: 420 LNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 133/462 (28%), Positives = 199/462 (43%), Gaps = 49/462 (10%)
Query: 9 LAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
+AAF + +L L + S + ++L + + Y G +ER+ + + S R
Sbjct: 1 MAAFLVWI-LLLLPYVAISSTASHGVRLELTHADDRGGYVG----AERVRRAADRSHRRV 55
Query: 69 N-YMASMSKPNAFQEL-----------EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDT 116
N ++ ++ P++ L +H A Y V++ IGTP P + DT
Sbjct: 56 NGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTAT----YLVDIAIGTPPLPLTAVLDT 111
Query: 117 ASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDPLC---RSPF-KCQ--NGKCV 169
S L+WTQC PC RCF Q P++ P S TY+ + C P+C +SP+ +C + C
Sbjct: 112 GSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCA 171
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y G T G+ + ETF T V +AFGC +N G SG++G
Sbjct: 172 YYFSYGDGTSTDGVLATETFTL---GSDTAVRGVAFGCGTENLGST--DNSSGLVGMGRG 226
Query: 230 PLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRDLETTPILLS------DL 282
PLSL SQL FSYC A S + G A + +TTP + S
Sbjct: 227 PLSLVSQLGVT---RFSYCFTPFNATAASPLFLGSSARLSSA-AKTTPFVPSPSGGARRR 282
Query: 283 RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
++YL L I++G ++ P F + G GG IID+GT T + + L +
Sbjct: 283 SSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALA----R 338
Query: 343 ILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR 401
L S R + A C+ S P + H AD ++ E+ + G
Sbjct: 339 ALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGV 398
Query: 402 FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
C+ + S+LG+ QQQN I+YDL L F C
Sbjct: 399 ACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKCGE 440
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 112/343 (32%), Positives = 170/343 (49%), Gaps = 19/343 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + V++GTP + +L+ DT S ++W QC PC+ C+ Q +FDP S+TYS + C+
Sbjct: 37 YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSRQ 96
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF--TFVPRLAFGCSNDNS 212
C + C KC+Y Y G + G + + + +G + ++ GC +DN
Sbjct: 97 CLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNE 156
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEAT--SVIKFGRDADVRR 269
G+ +G+LG PLS +Q+ + G FSYCL R+ ++T S + FG DA V
Sbjct: 157 GYFV--GAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSSLIFG-DAAVPP 213
Query: 270 RDLETTPILLSDLR--PHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ TP S+LR +YL + IS+G I+ P AF + G GG IID+GT VT
Sbjct: 214 AGVRFTP-QASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTR 272
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTFHLQEADYI 386
++N Y +L + + R+ + FD CY D S P++T H Q +
Sbjct: 273 LQNAAYASLREAF----RAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADL 328
Query: 387 VQPENMYFIEPDRGR-FCVAIQDDPKYSILGAWQQQNMLIIYD 428
P + Y + D FC+A SI+G QQQ +IYD
Sbjct: 329 KLPASNYLVPVDNSSTFCLAFAGTTGPSIIGNIQQQGFRVIYD 371
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 136/463 (29%), Positives = 203/463 (43%), Gaps = 44/463 (9%)
Query: 4 VQALPLAAFFSYFSVLFLTHFTS-----SESTGFSLKLIPIFSPESPLYPGNLSQSERIH 58
++ L F +++FL +F ++ GF+ I SP SP Y + ++ +R+
Sbjct: 1 MEGFNLKFVFCLLAIIFLIYFAKHSQAEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQ 60
Query: 59 KMFEISKARANYMASM-SKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTA 117
K F S R N+ ++ + PN DI + Y + +++GTP + DT
Sbjct: 61 KAFRRSILRGNHFRAIRASPN------DIQSNVISGGGSYLMNISLGTPPVSMLGIADTG 114
Query: 118 SSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKC----VYTRR 173
S L+W QC PC C+ Q P+FDP+ S TY + C++ C+ Q G C T
Sbjct: 115 SDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDFCQD--LGQQGSCGDDNTCTSS 172
Query: 174 YHVGD--VTRGLASRETFAFPVRNGF-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASP 230
Y GD TR S ETF G P LAFGC + N G F K SG++G P
Sbjct: 173 YSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSNGG-TFNEKDSGLIGLGGGP 231
Query: 231 LSLSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY 287
LSL QL +++ G FSYCLV + A+S I FG+ A V +TP++ +Y
Sbjct: 232 LSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTPDTFYY 291
Query: 288 LHLLEISIGRHIVRFP--------PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR 339
L L +S+G V F P A + IID+GT +T + P
Sbjct: 292 LTLEGMSLGSEKVAFKGFSKNKSSPAAAE-----ESNIIIDSGTTLTLL---PRDFYTDM 343
Query: 340 YDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDR 399
+ + +G Q + F CY + P++T H AD + P N F++
Sbjct: 344 ESALTKVIGGQTTT-DPRGTFSLCYSGVKKLE-IPTITAHFIGADVQLPPLNT-FVQAQE 400
Query: 400 GRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
C ++ +I G Q N L+ YDL + F +C
Sbjct: 401 DLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCT 443
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 120/426 (28%), Positives = 184/426 (43%), Gaps = 19/426 (4%)
Query: 20 FLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNA 79
L S S LI I+S SP P N + + + R ++ S+ +
Sbjct: 40 ILNRKVGKRSHSVSFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSS- 98
Query: 80 FQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
+E + ++P+ Y ++V+ GTP + + L DT S + W C+ C C T PIF
Sbjct: 99 -KEDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCH-STAPIF 156
Query: 140 DPRASTTYSEIPCDDPLCRS-PFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
DP S++Y CD C+ C N KC + Y G G + + G
Sbjct: 157 DPAKSSSYKPFACDSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITL----GS 212
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
++P +FGC+ S + LG + L + G FSYCL ++
Sbjct: 213 QYLPNFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSG 272
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGG 316
+ G++A V L+ T ++ P FY L+ IS+G + P A +I G G
Sbjct: 273 SLVLGKEAAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVP--ATNIASGG--G 328
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
IID+GT +T++ Y+ L + Q L SL Q P ++ D CY SS P++
Sbjct: 329 TIIDSGTTITYLVPSAYKDLRDAFRQQLSSL--QPTPV---EDMDTCYDLSSSSVDVPTI 383
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
T HL +V P+ I + G C+A SI+G QQQN I++D+ + F
Sbjct: 384 TLHLDRNVDLVLPKENILITQESGLSCLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGF 443
Query: 437 GSENCA 442
E CA
Sbjct: 444 AQEQCA 449
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 156/353 (44%), Gaps = 27/353 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + V GTP K Q ++FDT S++ W QC+PC + C+ Q P+FDP S+TY I C
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSA 75
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C S C CVY Y G T G + ETF N F FGC +N G
Sbjct: 76 ACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVFN---NFIFGCGQNNQG 132
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G +G++G SP SL+SQL + +FSYCL AT + G R
Sbjct: 133 LFTGA--AGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNP----LRTPG 186
Query: 274 TTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T +L + P Y + L+ IS+G + F + G IID+GT +T +
Sbjct: 187 YTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSV-----GTIIDSGTVITRLPPTA 241
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SSFKAYPSMTFHLQEADYIVQPEN 391
Y L + + R A+ D CY + ++ +P++ H D +
Sbjct: 242 YGALRTAFRAAMTQYTRAA----AASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAG 297
Query: 392 MYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
++++ + C+A D + I+G QQ+ M + YD + + F + C
Sbjct: 298 VFYVISSS-QVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 169/372 (45%), Gaps = 46/372 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + +GTP +P L DT S LVWTQC PC CFDQ P+ DP AS+TY+ +PC
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143
Query: 157 CRS-PFKC-------QNGKCVYTRRYHVGD--VTRGLASRETFAFPVRNGFT---FVPRL 203
CR+ PF + C+Y YH GD +T G + + F F G RL
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYA--YHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRL 201
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFG 262
FGC + N G F +GI GF SL SQL FSYC E+ +S++ G
Sbjct: 202 TFGCGHLNKGV-FQSNETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMFESKSSLVTLG 257
Query: 263 RD-----ADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+ ++ TTPIL + +P Y L L IS+G+ + P F
Sbjct: 258 GSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR-------S 310
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ--EFDYCYRYDSSF---- 370
IID+G +T + Y+ + + + +P + + D C+ +
Sbjct: 311 TIIDSGASITTLPEEVYEAVKAEF------AAQVGLPPSGVEGSALDLCFALPVTALWRR 364
Query: 371 KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP-KYSILGAWQQQNMLIIYDL 429
A PS+T HL+ AD+ + N F + C+ + P + +++G +QQQN ++YDL
Sbjct: 365 PAVPSLTLHLEGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDL 424
Query: 430 NVPALRFGSENC 441
L F C
Sbjct: 425 ENDRLSFAPARC 436
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 176/374 (47%), Gaps = 45/374 (12%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L +++ V+IGTP +P+ L+ DT S L+WTQC+ + P++DP S++++ PCD
Sbjct: 87 LHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDG 146
Query: 155 PLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
LC + C KC+YT Y T+G + ETF F + L FGC
Sbjct: 147 RLCETGSFNTKNCSRNKCIYTYNYGSA-TTKGELASETFTFGEHRRVSV--SLDFGCGKL 203
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDADVR 268
SG G SGILG + LSL SQL+ FSYCL + TS I FG AD+
Sbjct: 204 TSGSLPGA--SGILGISPDRLSLVSQLQIP---RFSYCLTPFLDRNTTSHIFFGAMADLS 258
Query: 269 RRDLETTPILLSDLRP-------HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ T PI + L ++Y+ L+ IS+G + P +F I RDG+GG +D+
Sbjct: 259 KYR-TTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDS 317
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY-NASQEFDYCYRYDSSFK--------- 371
G + + + L + + ++ +P NA+ D+ Y Y+ F+
Sbjct: 318 GDTTGMLPSVVMEALKEAMVEAVK------LPVVNAT---DHGYEYELCFQLPRNGGGAV 368
Query: 372 ----AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIY 427
P + +H ++ + Y +E GR C+ I + +I+G +QQQNM +++
Sbjct: 369 ETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGAIIGNYQQQNMHVLF 428
Query: 428 DLNVPALRFGSENC 441
D+ F C
Sbjct: 429 DVENHEFSFAPTQC 442
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 118/378 (31%), Positives = 182/378 (48%), Gaps = 39/378 (10%)
Query: 94 DLFYSVEVNIGTPMKPQH--LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
+ Y+V V +GT ++ L D A+ W QC PC C Q P+FDP S T+ +
Sbjct: 98 SMVYAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVS 157
Query: 152 CDDP-LCRSPFK-CQNGKCVYTRRYHVGDVTRGLASRETFAFPV-RNGFTFVPRLAFGCS 208
+ LCR P+ Q+G+C + Y G G +R+TF+FP N F +P + FGC+
Sbjct: 158 GHNAVLCRPPYHPLQDGRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVFGCA 217
Query: 209 NDNSGFAFGGKISGILGFN----ASPLS-LSSQLRNRIQGLFSYC-LVREMEATSVIKFG 262
N + F G ++G+LG PL+ QL + G FSYC +V A S ++FG
Sbjct: 218 NRIARFDTHGALAGVLGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAYSFLRFG 277
Query: 263 RD------ADVRRRDLET-TPILLSDLRPHFYLHLLEISIGRHIVRFP---PGAFDIMRD 312
D A V R+ + P S+ +Y+ L IS+G +R P P F+ +
Sbjct: 278 NDIPSQPPAGVHRQSMAVLAPTTTSEA---YYVKLAGISVG--ALRVPGVTPEMFERDQH 332
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS-LGRQRIPYNASQEFDYC-YRYDSSF 370
G GG ID GT +T I QT + +R L R R + S C +R +
Sbjct: 333 GRGGCAIDIGTKMTAI----VQTAYAHVEAAVRGHLQRNRARFVQSPGHHLCVHRTPAIE 388
Query: 371 KAYPSMTFHLQEADYI-VQPENMYFI--EPDRGR--FCVAIQDDPKYSILGAWQQQNMLI 425
+ PSMT H ++ V+P++++ + P G C+ + D + +++GA QQ +
Sbjct: 389 ERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPDAEMTVIGAMQQIDTRF 448
Query: 426 IYDL--NVPALRFGSENC 441
I+DL N+P + F E+C
Sbjct: 449 IFDLHNNIPIVSFNPEDC 466
>gi|255563737|ref|XP_002522870.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537954|gb|EEF39568.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 341
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 133/251 (52%), Gaps = 9/251 (3%)
Query: 205 FGCSNDN---SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSV 258
FGCS DN S F+ GK GI+G N SP+S+ QLRN FSYCL ATS+
Sbjct: 91 FGCSKDNRNFSAFSRTGKTDGIMGLNMSPVSILQQLRNVTNQRFSYCLTPYGSRPPATSL 150
Query: 259 IKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
++FG D R +TP + P+++L+LL++S+ +R PP F + RDGTGG I
Sbjct: 151 LRFGNDISTWGRGFYSTPFVDPPDMPNYFLNLLDLSVAGQRLRLPPETFALKRDGTGGTI 210
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY-NASQEFDYCYRYDSSFKAYPSMT 377
ID+GT +T + Y+ L+ G R+ + + E Y + + +F+ + S+T
Sbjct: 211 IDSGTGLTLVVQPAYRHLLGALQNHFDHHGFHRVHIPDTNLELRYNFAQNRTFQNHASLT 270
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDLNVPALR 435
+H Q AD+ V+P Y + D FCVA+ +I+GA Q N +Y+ L+
Sbjct: 271 YHFQGADFTVEPRYAYVVYNDENAFCVALLASHIEGRAIIGALHQANTRFVYNAAKRRLK 330
Query: 436 FGSENCANGRQ 446
F +EN N ++
Sbjct: 331 FKAENFQNDKR 341
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 44/88 (50%)
Query: 28 ESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIH 87
E GFSL+LI SPESP YPG L+ SERI ++ E S RA ++S K N+ E
Sbjct: 3 EPNGFSLELIHGDSPESPFYPGKLTDSERISRLVESSIIRAQVLSSYLKYNSTSGPEAYR 62
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFD 115
P+ ++ P +LF+
Sbjct: 63 FPVYIWIASCVASTDVLEPTNNSRILFN 90
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 118/361 (32%), Positives = 171/361 (47%), Gaps = 34/361 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + +++ DT S ++W QC PC +C+ QT P+F+P AS+TY ++PC PL
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPL 212
Query: 157 CRS--PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C+ C+N + C Y Y G T G S ET F + + R+A GC +DN G
Sbjct: 213 CKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ----VIRRVALGCGHDNEG 268
Query: 214 FAFGGKISGILGFNAS--PLSLSSQLRNRIQGLFSYCLV-REMEAT-SVIKFGRDADVRR 269
G LG + P +Q R FSYCLV R T S + FG+ A +
Sbjct: 269 LFIGAAGLLGLGRGSLSFPSQTGAQFSKR----FSYCLVDRSASGTASSLIFGKAAIPKS 324
Query: 270 RDLETTPILLS-DLRPHFYLHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
TP+L + L +Y+ L+ IS+ GR + P F + G GG IID+GT VT
Sbjct: 325 AIF--TPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTR 382
Query: 328 IRNGPYQTLMQRY---DQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPSMTFHLQE 382
+ + Y T+ + L+S G + FD CY S K P++ FH Q
Sbjct: 383 LVDSAYSTMRDAFRVGTGNLKSAGGFSL-------FDTCYDL-SGLKTVKVPTLVFHFQG 434
Query: 383 ADYIVQPENMYFIEPD-RGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+I P Y I D FC A + SI+G QQQ +++D + F + +
Sbjct: 435 GAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGS 494
Query: 441 C 441
C
Sbjct: 495 C 495
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 169/357 (47%), Gaps = 27/357 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + Q+++ DT S + W QC+PC C+ Q PIF+P S ++S + CD +
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAV 216
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C + C +G C+Y Y G + G + ET F G T V +A GC + N G
Sbjct: 217 CSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTF----GTTSVANVAIGCGHKNVGL 272
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFG-RDADVRR--R 270
+G+LG A LS +Q+ + FSYCLV RE +++ ++FG + V
Sbjct: 273 FI--GAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSVPVGSIFT 330
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIV-RFPPGAFDI-MRDGTGGFIIDTGTPVTFI 328
LE P L +YL + IS+G ++ PP F I G GGFIID+GT VT +
Sbjct: 331 PLEKNP----HLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRL 386
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIP-YNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYI 386
Y + + G ++P +A FD CY F + P++ FH +
Sbjct: 387 VTSAYDAVRDAF-----VAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASL 441
Query: 387 VQPENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ P Y I D G FC A SI+G QQQ++ + +D + F + C
Sbjct: 442 ILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 119/355 (33%), Positives = 167/355 (47%), Gaps = 41/355 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + IGTP +P L DT S L+WTQCQPC CFDQ P FDP S+T S CD L
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 148
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
C+ G L + F F V G + VP +AFGC N+G F
Sbjct: 149 CQ------------------GLPVASLPRSDKFTF-VGAGAS-VPGVAFGCGLFNNG-VF 187
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADV---RRRDL 272
+GI GF PLSL SQL+ G FS+C A S + AD+ + +
Sbjct: 188 KSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV 244
Query: 273 ETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+TTP++ + P F YL L I++G + P F +++GTGG IID+GT +T +
Sbjct: 245 QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEF-ALKNGTGGTIIDSGTAMTSLPTR 303
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFD--YCYRYDSSFKAY-PSMTFHLQEADYIVQ 388
Y+ + + + ++P + D +C K Y P + H + A +
Sbjct: 304 VYRLVRDAFA------AQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLP 357
Query: 389 PENMYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
EN F D G C+AI + + + +G +QQQNM ++YDL L F C
Sbjct: 358 RENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 164/366 (44%), Gaps = 39/366 (10%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q + Y IGTP +P + D A LVWTQC+ C RCF+Q TP+FDP AS TY PC
Sbjct: 47 QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPC 106
Query: 153 DDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC S C C Y + GD T G +TFA T LAFGC
Sbjct: 107 GTPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAV-----GTAKASLAFGCV 160
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADV 267
+ GG SGI+G +P SL +Q FSYCL + S + G A +
Sbjct: 161 VASDIDTMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGKNSALFLGSSAKL 216
Query: 268 R-RRDLETTPIL-----LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+TP + +DL ++ + L + G ++ PP ++ +DT
Sbjct: 217 AGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVL--------LDT 268
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
+P++F+ +G YQ + + + ++G + + FD C+ + A P + F +
Sbjct: 269 FSPISFLVDGAYQAVKK---AVTVAVGAPPM-ATPVEPFDLCFPKSGASGAAPDLVFTFR 324
Query: 382 EADYIVQPENMYFIEPDRGRFCVA------IQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
+ + Y ++ G C+A + + S+LG+ QQ+N+ ++DL+ L
Sbjct: 325 GGAAMTVAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLS 384
Query: 436 FGSENC 441
F +C
Sbjct: 385 FEPADC 390
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 127/431 (29%), Positives = 189/431 (43%), Gaps = 37/431 (8%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQS-ERIHKMFEISKARANYMASMSKPNAFQELEDIHLP 89
G ++L I SPL P N S + + + FE AR N + S N+ +LP
Sbjct: 69 GVKIRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSK---NSGPYTTMSNLP 125
Query: 90 MAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+ Y V GTP K L+ DT S L W QC+PC C+ Q IF+P+ S+
Sbjct: 126 LQSGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSS 185
Query: 146 TYSEIPCDDPLC-------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
+Y +PC C +P C G CVY Y G ++G S+ET G
Sbjct: 186 SYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTL----GSD 241
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV 258
AFGC + N+G G SG+LG + LS SQ +++ G F+YCL +TS
Sbjct: 242 SFQNFAFGCGHTNTGLFKGS--SGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTST 299
Query: 259 IKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
F TP++ + + P FY + L IS+G + PP G G
Sbjct: 300 GSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL-----GRGST 354
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSM 376
I+D+GT +T + Y L + R L + P++ D CY + P++
Sbjct: 355 IVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAK-PFSI---LDTCYDLSRHSQVRIPTI 410
Query: 377 TFHLQ-EADYIVQPENMYF-IEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYDLNV 431
TFH Q AD V + ++ + C+A + ++I+G +QQQ M + +D
Sbjct: 411 TFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGA 470
Query: 432 PALRFGSENCA 442
+ F S +CA
Sbjct: 471 GRIGFASGSCA 481
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 179/373 (47%), Gaps = 40/373 (10%)
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ----PCIRCFDQTTPIFDPRASTTYSE 149
D +S+ V I ++P+ L+ DT S L+WTQC+ + P++DP S+T++
Sbjct: 13 DQGHSLTVGI---VQPRKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAF 69
Query: 150 IPCDDPLCRS---PFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
+PC D LC+ FK +CVY Y LAS ETF F R + RL
Sbjct: 70 LPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAVGVLAS-ETFTFGARRAVSL--RLG 126
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGR 263
FGC ++G G +GILG + LSL +QL+ IQ FSYCL + TS + FG
Sbjct: 127 FGCGALSAGSLIGA--TGILGLSPESLSLITQLK--IQ-RFSYCLTPFADKKTSPLLFGA 181
Query: 264 DADVRR----RDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
AD+ R R ++TT I+ + + ++Y+ L+ IS+G + P + + DG GG I
Sbjct: 182 MADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTI 241
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA------ 372
+D+G+ V ++ ++ + + ++ R + ++++ C+ A
Sbjct: 242 VDSGSTVAYLVEAAFEAVKEAVMDVV----RLPVANRTVEDYELCFVLPRRTAAAAMEAV 297
Query: 373 -YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYD 428
P + H +V P + YF EP G C+A+ D SI+G QQQNM +++D
Sbjct: 298 QVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 357
Query: 429 LNVPALRFGSENC 441
+ F C
Sbjct: 358 VQHHKFSFAPTQC 370
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 125/434 (28%), Positives = 187/434 (43%), Gaps = 51/434 (11%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF-----------YS 98
N + +E + + + + RA ++ S + N + + L + + Y
Sbjct: 83 NATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLVAPVVSRAPTSGDYI 142
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
++ +GTP L DTAS L W QCQPC RC+ Q+ P+FDPR ST+Y E+ D P C+
Sbjct: 143 AKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQ 202
Query: 159 SPFK-----CQNGKCVYTRRYHVGD--------VTRGLASRETFAFPVRNGFTFVPRLAF 205
+ + + G C+YT Y GD V + TFA VR + L+
Sbjct: 203 ALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAY-----LSI 257
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCLVREMEA----TSVIK 260
GC +DN G FG +GILG + +S+ Q+ FSYCLV + +S +
Sbjct: 258 GCGHDNKGL-FGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLT 316
Query: 261 FGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPP-GAFDIMRD---GTG 315
FG A TP +L+ P F Y+ L+ +S+G VR P D+ D G G
Sbjct: 317 FGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGG--VRVPGVTERDLQLDPYTGHG 374
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK---- 371
G I+D+GT VT + Y + LG Q S FD CY
Sbjct: 375 GVILDSGTTVTRLARPAYTAFRDAFRAAATGLG-QVSTGGPSGLFDTCYTVGGRAGLRHC 433
Query: 372 -AYPSMTFHLQEA-DYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQNMLIIY 427
P+++ H + +QP+N RG C A D S++G QQ ++Y
Sbjct: 434 VKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVY 493
Query: 428 DLNVPALRFGSENC 441
D+ + F +C
Sbjct: 494 DIGGQRVGFAPNSC 507
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 131/400 (32%), Positives = 180/400 (45%), Gaps = 32/400 (8%)
Query: 56 RIHKMFEISKARANYMASMSKPNAFQELED-IHLPMAKQDLFYSVEVNIGTPMKPQHLLF 114
R+ K+ + N +SKP + +A+ Y + +GTP K +++
Sbjct: 4 RVKKLSSLGATSRN----LSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVL 59
Query: 115 DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYT 171
DT S +VW QC PC C+ QT P+F+P S +++++ C PLCR SP Q C+Y
Sbjct: 60 DTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQ 119
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
Y G T G ET F T V ++A GC +DN G +G+LG L
Sbjct: 120 VSYGDGSYTTGEFVTETLTFRR----TKVEQVALGCGHDNEGLFV--GAAGLLGLGRGGL 173
Query: 232 SLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRRDLETTPILLS-DLRPHFYL 288
S SQ FSYCLV ++ S + FG A R TP+L + L +Y+
Sbjct: 174 SFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSA--VSRTARFTPLLTNPRLDTFYYV 231
Query: 289 HLLEISIGRHIVR-FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSL 347
LL IS+G V F + R G GG IID GT VT + Y L + SL
Sbjct: 232 ELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSL 291
Query: 348 GRQRIPYNASQEFDYCYRYDSSFKA---YPSMTFHLQEADYIVQPENMYFIEPD-RGRFC 403
+ P + FD C YD S K P++ H + AD + P + Y I D GRFC
Sbjct: 292 --KSAPEFS--LFDTC--YDLSGKTTVKVPTVVLHFRGAD-VSLPASNYLIPVDGSGRFC 344
Query: 404 VAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
A SI+G QQQ ++YDL + F CA
Sbjct: 345 FAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 384
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 112/370 (30%), Positives = 163/370 (44%), Gaps = 28/370 (7%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y ++ +GTP P ++ DT S +VW QC PC RC++Q+ +FDPR S +Y+
Sbjct: 133 LAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNA 192
Query: 150 IPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
+ C PLCR + C+Y Y G VT G + ET F G V R+A
Sbjct: 193 VGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFA---GGARVARVAL 249
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA------TSVI 259
GC +DN G F + S LS +Q+ R FSYCLV + +S +
Sbjct: 250 GCGHDNEGL-FVAAAGLLGLGRGS-LSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTV 307
Query: 260 KFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMR----DGT 314
FG A TP++ + + +Y+ L+ IS+G R P A +R G
Sbjct: 308 TFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGG--ARVPGVANSDLRLDPSSGR 365
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAY 373
GG I+D+GT VT + Y L + L R+ FD CY
Sbjct: 366 GGVIVDSGTSVTRLARPAYSALRDAFRGAAAGL---RLSPGGFSLFDTCYDLSGRKVVKV 422
Query: 374 PSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNV 431
P+++ H P Y I D +G FC A D SI+G QQQ +++D +
Sbjct: 423 PTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDG 482
Query: 432 PALRFGSENC 441
+ F + C
Sbjct: 483 QRVAFTPKGC 492
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 107/352 (30%), Positives = 160/352 (45%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + + +G+P + Q+++ D+ S +VW QCQPC +C+ QT P+FDP S ++ +PC +
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSV 201
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C C G C Y Y G T+G + ET F G T V +A GC + N G
Sbjct: 202 CERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTF----GRTVVRNVAIGCGHRNRGM 257
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG +SL QL + G FSYCLV R ++ ++FGR A
Sbjct: 258 FV--GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGA--MPVGAA 313
Query: 274 TTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
P++ + P F Y+ L + +G V F + G GG ++DTGT VT I
Sbjct: 314 WIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVA 373
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
Y + +L R + FD CY + P+++F+ + P
Sbjct: 374 YVAFRDAFIGQTGNLPRA----SGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPAR 429
Query: 392 MYFIEPDR-GRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ I D G FC A P SI+G QQ+ + I +D + FG C
Sbjct: 430 NFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 130/421 (30%), Positives = 195/421 (46%), Gaps = 29/421 (6%)
Query: 36 LIPIFSPESP--LYPGNLSQ-SERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
L + S E+P L+ L++ + R+ + ++ A + + ++ F + +A+
Sbjct: 85 LDALSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSS--SVTSGLAQ 142
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Y + +GTP + ++ DT S +VW QC PC +C+ QT P+F+P S +++ IPC
Sbjct: 143 GSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPC 202
Query: 153 DDPLCR---SPFKCQNGK--CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
PLCR SP C K C+Y Y G T G S ET F R T V R+A GC
Sbjct: 203 GSPLCRRLDSP-GCSTKKHICLYQVSYGDGSFTYGEFSTETLTF--RG--TRVGRVALGC 257
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDA 265
+DN G +G+LG LS SQ+ R FSYCLV ++ S + FG D+
Sbjct: 258 GHDNEGLFI--GAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFG-DS 314
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+ R T + L +Y+ LL +S+ G + F + G GG IID+GT
Sbjct: 315 AISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTS 374
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEA 383
VT + Y L + +L +R P + FD C+ + P++ H + A
Sbjct: 375 VTRLTRPAYVALRDAFRVGASNL--KRAPEFS--LFDTCFDLSGKTEVKVPTVVLHFRGA 430
Query: 384 DYIVQPENMYFIEPDR-GRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D + P + Y I D G FC A SI+G QQQ ++YDL + F C
Sbjct: 431 D-VSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489
Query: 442 A 442
A
Sbjct: 490 A 490
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 165/353 (46%), Gaps = 22/353 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP K +L+ DT S + W QC+PC C+ Q+ P+F+P +S+TY + C P
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C C++ KC+Y Y G T G + +T F + +A GC +DN G
Sbjct: 222 CSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK---INDVALGCGHDNEGL 278
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LS+++Q++ FSYCLV R+ +S + F V+ +
Sbjct: 279 FT--GAAGLLGLGGGALSITNQMKATS---FSYCLVDRDSGKSSSLDFNS---VQLGSGD 330
Query: 274 TTPILLSD--LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
T LL + + +Y+ L S+G V P FD+ G+GG I+D GT VT ++
Sbjct: 331 ATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQ 390
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPE 390
Y +L + ++ +L + ++ FD CY + S S P++ FH + P
Sbjct: 391 AYNSLRDAFLKLTTNLKKGT---SSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPA 447
Query: 391 NMYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC A SI+G QQQ I YDL + C
Sbjct: 448 KNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 130/430 (30%), Positives = 194/430 (45%), Gaps = 38/430 (8%)
Query: 36 LIPIFSPESPLYPGNLSQSERIH----KMFEISKARANYMASMSKPNAFQ-------ELE 84
LI +F+ P + + R+H K E K + Y+ S S P + E+
Sbjct: 14 LIILFALTCPKQCTSYRFTLRLHTKSIKTKESPKIKPGYLHSKSTPAPSRLDNLWTTEIA 73
Query: 85 DI--HLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPR 142
DI H+ + ++IG P PQ LL DT S L W QC PC +C+ QT P F P
Sbjct: 74 DIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPS 132
Query: 143 ASTTYSEIPCDDPLCRSP--FKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRN-GFT 198
S+TY C+ P F+ + G C Y RY TRG+ ++E F + G
Sbjct: 133 RSSTYRNASCESAPHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLI 192
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV 258
P + FGC DNSGF + SG+LG P + S RN FSYC ++ T
Sbjct: 193 SKPNIVFGCGQDNSGFT---QYSGVLGLG--PGTFSIVTRN-FGSKFSYCFGSLIDPTYP 246
Query: 259 IKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
F + R + + TP+ + + +YL L IS+G ++ PG F R GG +
Sbjct: 247 HNFLILGNGARIEGDPTPLQI--FQDRYYLDLQAISLGEKLLDIEPGIFQRYRS-KGGTV 303
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY--PSM 376
IDTG T + Y+TL + D +L + R+ + Q ++CY + Y P +
Sbjct: 304 IDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWE--QYTNHCYEGNLKLDLYGFPVV 361
Query: 377 TFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQ----DDPKYSILGAWQQQNMLIIYDLNV 431
TFH A+ + E+++ FC+A+ DD S++GA QQN + Y+L
Sbjct: 362 TFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDD--MSVIGAMAQQNYNVGYNLRT 419
Query: 432 PALRFGSENC 441
+ F +C
Sbjct: 420 MKVYFQRTDC 429
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 165/355 (46%), Gaps = 29/355 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG P P +++ DT S + W QC PC C+ Q PIF+P +ST+YS + CD
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQ 203
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C+S +C+N C+Y Y G T G ET G V +A GC ++N G
Sbjct: 204 CQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITL----GSASVDNVAIGCGHNNEGL 259
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LS SQ+ FSYCLV R+ ++ S ++F
Sbjct: 260 FI--GAAGLLGLGGGKLSFPSQIN---ASSFSYCLVDRDSDSASTLEFNS---ALLPHAI 311
Query: 274 TTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T P+L + +L +Y+ + +S+G ++ P F++ G GG IID+GT VT ++
Sbjct: 312 TAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAA 371
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSSFKA---YPSMTFHLQEADYIVQ 388
Y L + + G + +P + FD C YD S K P++TFHL +
Sbjct: 372 YNALRDAFVK-----GTKDLPVTSEVALFDTC--YDLSRKTSVEVPTVTFHLAGGKVLPL 424
Query: 389 PENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P Y I D G FC A SI+G QQQ + +DL + F C
Sbjct: 425 PATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 162/366 (44%), Gaps = 32/366 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V+ ++GTP + HL+ DT S L + QC PC C++Q P++ P S+T++ +PCD
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSAE 93
Query: 157 C-------------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
C P G C Y RY T G+ + ET G V +
Sbjct: 94 CLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV----GGIRVNHV 149
Query: 204 AFGCSNDNSG-FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK-- 260
AFGC N N G F G G+LG LS +SQ + F+YCL + TSV
Sbjct: 150 AFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSVFSSL 206
Query: 261 -FGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
FG D DL+ TP++ + L P +Y+ ++ I G + P A+ I G GG I
Sbjct: 207 IFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGNGGTI 266
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMT 377
D+GT VT+ Y ++ ++ +S+ R P + Q C YPS T
Sbjct: 267 FDSGTTVTYWSPQAYARIIAAFE---KSVPYPRAP-PSPQGLPLCVNVSGIDHPIYPSFT 322
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALR 435
+ + YFIE C+A+ + ++++G QQN L+ YD +
Sbjct: 323 IEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEHRIG 382
Query: 436 FGSENC 441
F NC
Sbjct: 383 FAHANC 388
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 169/362 (46%), Gaps = 21/362 (5%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y + +GTP K +++ DT S +VW QC PC +C+ QT P+FDP+ S ++S
Sbjct: 140 LAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSS 199
Query: 150 IPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
I C PLC SP C+Y Y G T G S ET F R T VP++A G
Sbjct: 200 ISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTF--RG--TRVPKVALG 255
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRD 264
C +DN G +G+LG LS +Q R FSYCLV ++ S + FG+
Sbjct: 256 CGHDNEGLFV--GAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQS 313
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
A V R + T I L +YL L IS+ G + F + G GG IID+GT
Sbjct: 314 A-VSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGT 372
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQE 382
VT + Y +L + L +R P + FD C+ + P++ H +
Sbjct: 373 SVTRLTRRAYVSLRDAFRAGAADL--KRAPDYS--LFDTCFDLSGKTEVKVPTVVMHFRG 428
Query: 383 ADYIVQPENMYFIEPD-RGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
AD + P Y I D G FC A SI+G QQQ +++D+ + F +
Sbjct: 429 AD-VSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARG 487
Query: 441 CA 442
CA
Sbjct: 488 CA 489
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 128/398 (32%), Positives = 177/398 (44%), Gaps = 28/398 (7%)
Query: 56 RIHKMFEISKARANYMASMSKPNAFQELED-IHLPMAKQDLFYSVEVNIGTPMKPQHLLF 114
R+ K+ + N +SKP + +A+ Y + +GTP K +++
Sbjct: 91 RVKKLSSLGATSRN----LSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVL 146
Query: 115 DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYT 171
DT S +VW QC PC C+ QT P+F+P S +++++ C PLCR SP Q C+Y
Sbjct: 147 DTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQ 206
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
Y G T G ET F T V ++A GC +DN G +G+LG L
Sbjct: 207 VSYGDGSYTTGEFVTETLTFRR----TKVEQVALGCGHDNEGLFV--GAAGLLGLGRGGL 260
Query: 232 SLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRRDLETTPILLS-DLRPHFYL 288
S SQ FSYCLV ++ S + FG A R TP+L + L +Y+
Sbjct: 261 SFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSA--VSRTARFTPLLTNPRLDTFYYV 318
Query: 289 HLLEISIGRHIVR-FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSL 347
LL IS+G V F + R G GG IID GT VT + Y L + SL
Sbjct: 319 ELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSL 378
Query: 348 GRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVA 405
+ P + FD CY P++ H + AD + P + Y I D GRFC A
Sbjct: 379 --KSAPEFS--LFDTCYDLSGKTTVKVPTVVLHFRGAD-VSLPASNYLIPVDGSGRFCFA 433
Query: 406 IQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
SI+G QQQ ++YDL + F CA
Sbjct: 434 FAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 170/361 (47%), Gaps = 34/361 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +G+P + +FDT S L WTQC+PC+ C+ Q IFDP S +YS + CD P
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 206
Query: 156 LCR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C SP C + C+Y RY G + G +RE + + F FGC
Sbjct: 207 SCEKLESATGNSP-GCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFN---NFQFGC 262
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
+N G FGG +G+LG +PLSL SQ + +FSYCL +T + FG D
Sbjct: 263 GQNNRGL-FGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSSTGYLSFG-SGDG 319
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ ++ TP ++ P FY L ++ IS+G + P F T G IID+GT ++
Sbjct: 320 DSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFS-----TAGTIIDSGTVIS 374
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPSMTFHLQ-EA 383
+ Y ++ + + +++ R + D CY S +K P + + A
Sbjct: 375 RLPPTVYSSVQKVFRELMSDYPRVK----GVSILDTCYDL-SKYKTVKVPKIILYFSGGA 429
Query: 384 DYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ + PE + ++ + C+A DD + +I+G QQ+ + ++YD + F
Sbjct: 430 EMDLAPEGIIYVL-KVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSG 488
Query: 441 C 441
C
Sbjct: 489 C 489
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 162/369 (43%), Gaps = 26/369 (7%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y ++ +GTP P ++ DT S +VW QC PC RC+DQ+ +FDPR S +Y
Sbjct: 135 LAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGA 194
Query: 150 IPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
+ C PLCR + C+Y Y G VT G + ET F G V R+A
Sbjct: 195 VGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA---GGARVARIAL 251
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA------TSVI 259
GC +DN G F + S LS +Q+ R FSYCLV + +S +
Sbjct: 252 GCGHDNEGL-FVAAAGLLGLGRGS-LSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTV 309
Query: 260 KFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD---GTG 315
FG A TP++ + + +Y+ L+ IS+G V + D+ D G G
Sbjct: 310 TFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADS-DLRLDPSSGRG 368
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYP 374
G I+D+GT VT + Y L + L R+ FD CY P
Sbjct: 369 GVIVDSGTSVTRLARPAYSALRDAFRAAAAGL---RLSPGGFSLFDTCYDLSGRKVVKVP 425
Query: 375 SMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVP 432
+++ H P Y I D +G FC A D SI+G QQQ +++D +
Sbjct: 426 TVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQ 485
Query: 433 ALRFGSENC 441
+ F + C
Sbjct: 486 RVGFVPKGC 494
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 157/354 (44%), Gaps = 26/354 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QC+PC ++C+ Q P+FDP S+TY+ + C D
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDS 222
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETF--AFPVRNGFTFVPRLAFGCSNDN 211
C C G C+Y +Y G T G +++T A GF FGC N
Sbjct: 223 ACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFR------FGCGEKN 276
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
+G GK +G++G SL+ Q N+ G F+YCL T + FG +
Sbjct: 277 NGLF--GKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGNNAR 334
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
L TP+L + +Y+ + I +G V F T G ++D+GT +T +
Sbjct: 335 L--TPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPAT 387
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPE 390
Y L +D+++ + G ++ P D CY + S P+++ Q +
Sbjct: 388 AYTALSSAFDKVMLARGYKKAP--GYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDV 445
Query: 391 NMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + C+A DD +I+G QQ+ ++YDL + F +C
Sbjct: 446 SGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 157/354 (44%), Gaps = 26/354 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QC+PC ++C+ Q P+FDP S+TY+ + C D
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDS 222
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETF--AFPVRNGFTFVPRLAFGCSNDN 211
C C G C+Y +Y G T G +++T A GF FGC N
Sbjct: 223 ACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFR------FGCGEKN 276
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
+G GK +G++G SL+ Q N+ G F+YCL T + FG +
Sbjct: 277 NGLF--GKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGNNAR 334
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
L TP+L + +Y+ + I +G V F T G ++D+GT +T +
Sbjct: 335 L--TPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPAT 387
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPE 390
Y L +D+++ + G ++ P D CY + S P+++ Q +
Sbjct: 388 AYTALSSAFDKVMLARGYKKAP--GYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDV 445
Query: 391 NMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + C+A DD +I+G QQ+ ++YDL + F +C
Sbjct: 446 SGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 187/393 (47%), Gaps = 47/393 (11%)
Query: 82 ELEDIHLPMAK-QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-------CFD 133
L +P+A D +S+ V IGTP +P+ L+ DT S L+WTQC R
Sbjct: 68 NLSAADVPVAPLSDQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASR 127
Query: 134 QTTPIFDPRASTTYSEIPCDDPLCRS---PFK--CQNGKCVYTRRYHVGDVTRGLASRET 188
Q P+++PR S++++ +PC D LC+ +K +N +C+Y Y + LAS ET
Sbjct: 128 QREPLYEPRRSSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAEAGGVLAS-ET 186
Query: 189 FAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
F F V N +P L FGC ++G G SG++G + +SL SQL FSYC
Sbjct: 187 FTFGV-NAKVSLP-LGFGCGALSAGDLVGA--SGLMGLSPGIMSLVSQLS---VPRFSYC 239
Query: 249 LVREME-ATSVIKFGRDADVRRRDLETTPILLSDLR------PHFYLHLLEISIGRHIVR 301
L E TS + FG AD+RR T S LR ++Y+ L+ +S+G +
Sbjct: 240 LTPFAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLD 299
Query: 302 FPPGAFDIMR-DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF 360
P + +++ DG+GG I+D+G+ ++++ ++ + + + + R+P +
Sbjct: 300 VPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAV------RLPVANGTDE 353
Query: 361 DY-----CYRYDSSFK----AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK 411
DY C+ + P + H + P + YF EP G C+A+ P
Sbjct: 354 DYDDYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPD 413
Query: 412 ---YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
SI+G QQQNM +++D+ F C
Sbjct: 414 GFGVSIIGNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 175/367 (47%), Gaps = 34/367 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V+ +GTP + L+ D+ S L+W QC PC++C+ Q TP++ P S+T++ +PC P
Sbjct: 65 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPE 124
Query: 157 C-----RSPFKCQ---NGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPRLAFGC 207
C F C G C Y RY +++G+ + E+ VR + ++AFGC
Sbjct: 125 CLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVR-----IDKVAFGC 179
Query: 208 SNDNSG-FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IKFGR 263
DN G FA G G+LG PLS SQ+ F+YCLV ++ TSV + FG
Sbjct: 180 GRDNQGSFAAAG---GVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGD 236
Query: 264 DADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
+ DL+ TPI+ + P +Y+ + ++ +G + A+ + G GG I D+G
Sbjct: 237 ELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSG 296
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDSSFK-AYPSMTFHL 380
T VT+ Y+ ++ +D+ + R P AS Q D C + ++PS T L
Sbjct: 297 TTVTYWLPPAYRNILAAFDKNV------RYPRAASVQGLDLCVDVTGVDQPSFPSFTIVL 350
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPK----YSILGAWQQQNMLIIYDLNVPALRF 436
+ YF++ C+A+ P ++ +G QQN L+ YD + F
Sbjct: 351 GGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGF 410
Query: 437 GSENCAN 443
C++
Sbjct: 411 APAKCSS 417
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 176/375 (46%), Gaps = 38/375 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V IGTP K L+ DT S L W QC PC CF+Q P +DP+ S+++ I C DP
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPR 149
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C P K +N C Y Y T G + ETF + ++ F V +
Sbjct: 150 CHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENV 209
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N G G SG+LG PLS SSQL++ FSYCLV T+V +
Sbjct: 210 MFGCGHWNRGLFHGA--SGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI 267
Query: 261 FGRDADVRRR-DLETTPILLSDLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D D+ +L T ++ P +Y+ + I +G ++ P +++ DG GG
Sbjct: 268 FGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVGG 327
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKAY 373
I+D+GT +++ YQ + + + ++ Y Q+F D CY S +
Sbjct: 328 TIVDSGTTLSYFTEPAYQIIKDAFVKKVKG-------YPIVQDFPILDPCYNV-SGVEKI 379
Query: 374 PSMTFHLQEADYIVQ--PENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQQQNMLIIYD 428
F + AD V P YFI D C+AI P+ SI+G +QQQN ++YD
Sbjct: 380 DLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYD 439
Query: 429 LNVPALRFGSENCAN 443
L + NCA+
Sbjct: 440 TKKSRLGYAPMNCAD 454
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 166/360 (46%), Gaps = 30/360 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + IG+P + +++ DT S + W QC PC C+ Q+ P+FDP S++Y+ +PCD P
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSPH 255
Query: 157 CRS--PFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
CR+ C N CVY Y G T G + ET +G V +A GC
Sbjct: 256 CRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLG-GDGSAAVHDVAIGCG 314
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADV 267
+DN G +G+L PLS SQ+ FSYCLV R+ + S ++FG
Sbjct: 315 HDNEGLFV--GAAGLLALGGGPLSFPSQISAT---EFSYCLVDRDSPSASTLQFG----A 365
Query: 268 RRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVR-FPPGAFDIMRDGTGGFIIDTGTPV 325
T P++ S F Y+ L IS+G + PP AF + G+GG I+D+GT V
Sbjct: 366 SDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDSGTAV 425
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDS-SFKAYPSMTFHLQEA 383
T +++ Y L D +R G Q +P + FD CY S P+++ +
Sbjct: 426 TRLQSSAYSALR---DAFVR--GTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFEGG 480
Query: 384 DYIVQPENMYFIEPD-RGRFCVA-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ P Y I D G +C+A SI+G QQQ + + +D + F C
Sbjct: 481 GELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 117/436 (26%), Positives = 191/436 (43%), Gaps = 29/436 (6%)
Query: 22 THFTSSESTGFSLKL--------IPIFSPESPLYPGNLSQSERIHKMFEISKARANYMAS 73
THF+ ++ ++L+L + + L+ ++R+ + + +S
Sbjct: 49 THFSDDSNSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASS 108
Query: 74 MSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFD 133
S+ D+ M + Y V + +G+P + Q+++ D+ S +VW QCQPC C+
Sbjct: 109 DSRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYK 168
Query: 134 QTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF 191
Q+ P+FDP S +Y+ + C +C C +G C Y Y G T+G + ET F
Sbjct: 169 QSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF 228
Query: 192 PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV- 250
T V +A GC + N G F G + S +S QL + G F YCLV
Sbjct: 229 AK----TVVRNVAMGCGHRNRGM-FIGAAGLLGIGGGS-MSFVGQLSGQTGGAFGYCLVS 282
Query: 251 REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHI-VRFPPGAFDI 309
R ++T + FGR+A P++ + P FY L+ + + P G FD+
Sbjct: 283 RGTDSTGSLVFGREA--LPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDL 340
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS 369
G GG ++DTGT VT + G Y + +L R + FD CY S
Sbjct: 341 TETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRA----SGVSIFDTCYDL-SG 395
Query: 370 FKA--YPSMTFHLQEADYIVQPENMYFIE-PDRGRFCVAIQDDPK-YSILGAWQQQNMLI 425
F + P+++F+ E + P + + D G +C A P SI+G QQ+ + +
Sbjct: 396 FVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQV 455
Query: 426 IYDLNVPALRFGSENC 441
+D + FG C
Sbjct: 456 SFDGANGFVGFGPNVC 471
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 115/358 (32%), Positives = 172/358 (48%), Gaps = 39/358 (10%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSK-PNAFQELED 85
+++ GF LKL + + S ++ + + + SKAR + S + P +
Sbjct: 24 NDNVGFQLKLTHVDAGTS------YTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITA 77
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+ + Y V++ IGTP + DT S L+WTQC PC+ C DQ TP FD + S
Sbjct: 78 ARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSA 137
Query: 146 TYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDV--TRGLASRETFAFPVRNGFTF-V 200
TY +PC C S C CVY +Y+ GD T G+ + ETF F N
Sbjct: 138 TYRALPCRSSRCASLSSPSCFKKMCVY--QYYYGDTASTAGVLANETFTFGAANSTKVRA 195
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVI 259
+AFGC + N+G SG++GF PLSL SQL FSYCL + AT S +
Sbjct: 196 TNIAFGCGSLNAGDL--ANSSGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRL 250
Query: 260 KFGRDADVRRRD------LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRD 312
FG A++ + +++TP +++ P+ Y L L IS+G ++ P F I D
Sbjct: 251 YFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDD 310
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE----FDYCYRY 366
GTGG IID+GT +T+++ Y+ + R L IP A + D C+++
Sbjct: 311 GTGGVIIDSGTSITWLQQ-------DAYEAVRRGL-VSAIPLTAMNDTDIGLDTCFQW 360
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/342 (33%), Positives = 164/342 (47%), Gaps = 17/342 (4%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +++GTP + +L+ DT S ++W QC PC+ C+ Q+ IFDP S+TYS + C
Sbjct: 58 YFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQ 117
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF--TFVPRLAFGCSNDNS 212
C + CQ KC+Y Y G T G + + +G + ++ GC +DN
Sbjct: 118 CLNLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNE 177
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEAT--SVIKFGRDADVRR 269
G+ +G+LG PLS +Q+ + G FSYCL RE ++T S + FG +A V
Sbjct: 178 GYFV--GAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFG-EAAVPP 234
Query: 270 RDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
TP + P F YL + IS+G I+ P AF + G GG IID+GT VT +
Sbjct: 235 AGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRL 294
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIV 387
+N Y +L + R+ P FD CY P++T H Q +
Sbjct: 295 QNAAYASLRDAF----RAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLK 350
Query: 388 QPENMYFIEPDRGR-FCVAIQDDPKYSILGAWQQQNMLIIYD 428
P + Y I D FC+A SI+G QQQ +IYD
Sbjct: 351 LPASNYLIPVDNSNTFCLAFAGTTGPSIIGNIQQQGFRVIYD 392
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 122/425 (28%), Positives = 188/425 (44%), Gaps = 46/425 (10%)
Query: 37 IPIFSPESPLYPGNLSQSE-RIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDL 95
+P+ P P S E + + S+AR+ Y+ MS+ + HL + L
Sbjct: 61 VPLVHRHGPCAPSTRSSDEPSLSERLRRSRARSKYI--MSRASKSNVSIPTHLGGSVDSL 118
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCD 153
Y V V +GTP Q LL DT S L W QC PC C+ Q P+FDP S+TY+ IPC+
Sbjct: 119 EYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCN 178
Query: 154 DPLCRSPFK------CQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
CR + C +G +C Y Y G T G+ S ET + G T V
Sbjct: 179 TDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLT--MAPGVT-VKD 235
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
FGC +D G K G+LG +P SL Q + G FSYCL + + G
Sbjct: 236 FHFGCGHDQDG--PNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQAGFLALG 293
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
+ TP ++ + + + +++ I++G + PP AF +GG IID+G
Sbjct: 294 APVN-DASGFVFTP-MVREQQTFYVVNMTGITVGGEPIDVPPSAF------SGGMIIDSG 345
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYP--SMTFH 379
T VT +++ Y L + + + + P + E D CY + S P ++TF
Sbjct: 346 TVVTELQHTAYAALQAAFRKAMAAY-----PLLPNGELDTCYNFTGHSNVTVPRVALTFS 400
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRF 436
+ P+ + C+A Q+ D + ILG Q+ + ++YD+ + F
Sbjct: 401 GGATVDLDVPDGILLDN------CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGF 454
Query: 437 GSENC 441
G++ C
Sbjct: 455 GADAC 459
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 154/354 (43%), Gaps = 24/354 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP + ++FDT S W QCQPC+ C+ Q P+FDP S TY+ I C
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 155
Query: 156 LCRSPF--KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y +Y G T G +++T + + FGC N G
Sbjct: 156 YCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTL----AYDTIKNFRFGCGEKNRG 211
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F+YCL T + G A L
Sbjct: 212 LF--GRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAANARL- 268
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TP+L+ +Y+ + I +G H++ P F T G ++D+GT +T + Y
Sbjct: 269 -TPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFS-----TAGTLVDSGTVITRLPPSAY 322
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFHLQEADYIVQPE 390
L + + ++ LG P A D CY + A P+++ Q +
Sbjct: 323 APLRSAFSKAMQGLGYSAAP--AFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 380
Query: 391 NMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ D + C+A DD +I+G QQ+ ++YD+ + F C
Sbjct: 381 SGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 159/351 (45%), Gaps = 28/351 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ +GTP + DT S + WTQC PC+ C++Q PIFDP S+T+ E CD
Sbjct: 65 YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD--- 121
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFA 215
C Y Y T G + ET +G FV P GC ++NS F
Sbjct: 122 --------GHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSWFK 173
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETT 275
SG++G N P SL +Q+ GL SYC + TS I FG +A V + +T
Sbjct: 174 --PSFSGMVGLNWGPSSLITQMGGEYPGLMSYCF--SGQGTSKINFGANAIVAGDGVVST 229
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
+ ++ +P F YL+L +S+G + F + G +ID+GT +T+
Sbjct: 230 TMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALE---GNIVIDSGTTLTYFPVSYCN 286
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEA-DYIVQPENMY 393
+ Q + ++ ++ R P CY D + +P +T H D ++ NMY
Sbjct: 287 LVRQAVEHVVTAV-RAADPTGNDM---LCYNSD-TIDIFPVITMHFSGGVDLVLDKYNMY 341
Query: 394 FIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ G FC+AI + +I G Q N L+ YD + + F NC+
Sbjct: 342 MESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 156/352 (44%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V + +GTP ++FDT S W QCQPC+ C+ Q +FDP S+TY+ + C P
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241
Query: 156 LCRSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y+ +Y G + G + +T + + V FGC N G
Sbjct: 242 ACSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 298
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++CL T + FG + +
Sbjct: 299 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAVGARQ 356
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F T G I+D+GT +T + Y
Sbjct: 357 TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFS-----TAGTIVDSGTVITRLPPAAY 411
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P ++ Q Y+ +
Sbjct: 412 SSLRSAFASAMAARGYKKAP--ALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASG 469
Query: 393 YFIEPDRGRFCV---AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+ A +DD I+G Q + ++YD+ + F C
Sbjct: 470 IMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 162/355 (45%), Gaps = 20/355 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP K +++ DT S +VW QC+PC +C+ QT IFDP S +++ IPC PL
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPL 189
Query: 157 CR---SP-FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
CR SP +N C Y Y G T G S ET F R VPR+A GC +DN
Sbjct: 190 CRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF--RRA--AVPRVAIGCGHDNE 245
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRR 270
G +G+LG LS +Q R FSYCL + S I FG D+ V R
Sbjct: 246 GLFV--GAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFG-DSAVSRT 302
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVR-FPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
T + L +Y+ LL IS+G VR F + G GG IID+GT VT +
Sbjct: 303 ARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLT 362
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
Y +L + L +R P FD CY S P++ H + AD +
Sbjct: 363 RPAYVSLRDAFRVGASHL--KRAP--EFSLFDTCYDLSGLSEVKVPTVVLHFRGADVSLP 418
Query: 389 PENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
N + G FC A SI+G QQQ +++DL + F CA
Sbjct: 419 AANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGCA 473
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 169/363 (46%), Gaps = 22/363 (6%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y + +GTP + +++ DT S +VW QC PC RC+ Q+ P+FDPR S +++
Sbjct: 119 LAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFAS 178
Query: 150 IPCDDPLCR---SP-FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
I C PLC SP Q C+Y Y G T G S ET F T V R+A
Sbjct: 179 IACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRR----TRVARVAL 234
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGR 263
GC +DN G +G+LG LS SQ R FSYCLV ++ S + FG
Sbjct: 235 GCGHDNEGLFV--GAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFG- 291
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIIDTG 322
D+ V R T + L +Y+ LL IS+ G + F + + G GG IID+G
Sbjct: 292 DSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSG 351
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQ 381
T VT + Y + +L +R P + FD C+ + P++ H +
Sbjct: 352 TSVTRLTRPAYIAFRDAFRAGASNL--KRAPQFS--LFDTCFDLSGKTEVKVPTVVLHFR 407
Query: 382 EADYIVQPENMYFIEPD-RGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
AD + P + Y I D G FC+A SI+G QQQ ++YDL + F
Sbjct: 408 GAD-VSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPH 466
Query: 440 NCA 442
CA
Sbjct: 467 GCA 469
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 170/358 (47%), Gaps = 27/358 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V IG+P K Q+L+ DT S + W QC PC C+ Q +FDPRAS+++ + C P
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C+ + +C+Y Y G T G + ++F+ V G T + FGC +DN
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFS--VSRGRT--SPVVFGCGHDNE 129
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE---MEATSVIKFGRDADVRR 269
G +G+LG A LS SQL +R FSYCLV + A+S + FG A
Sbjct: 130 GLFV--GAAGLLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFGDSALPTS 184
Query: 270 RDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD-GTGGFIIDTGTPVTF 327
T +L + L +Y L ISIG ++ P AF + G GG IID+GT VT
Sbjct: 185 ASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTR 244
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDS-SFKAYPSMTFHLQEADY 385
+ Y + + RS Q++P A FD CY + + + P+++FH +
Sbjct: 245 LPTYAYTVMRDAF----RS-ATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGAS 299
Query: 386 IVQPENMYFIEPD-RGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ P + Y + D G FC A SI+G QQQ M + DL+ + F C
Sbjct: 300 VQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 123/360 (34%), Positives = 171/360 (47%), Gaps = 30/360 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + +++ DT S +VW QC PC RC+ Q+ PIFDPR S TY+ IPC P
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
CR + + C+Y Y G T G S ET F RN V +A GC +DN
Sbjct: 202 CRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRN---RVKGVALGCGHDNE 257
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRR 270
G +G+LG LS Q +R FSYCLV ++ S + FG A R
Sbjct: 258 GLFV--GAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAA--VSR 313
Query: 271 DLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGA----FDIMRDGTGGFIIDTGTPV 325
TP+L + L +Y+ LL IS+G V PG F + + G GG IID+GT V
Sbjct: 314 IARFTPLLSNPKLDTFYYVELLGISVGGTRV---PGVAASLFKLDQIGNGGVIIDSGTSV 370
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEAD 384
T + Y + + ++L +R P + FD C+ + + P++ H + AD
Sbjct: 371 TRLIRPAYIAMRDAFRVGAKAL--KRAPDFS--LFDTCFDLSNMNEVKVPTVVLHFRGAD 426
Query: 385 YIVQPENMYFIEPD-RGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ P Y I D G+FC A SI+G QQQ ++YDL + F CA
Sbjct: 427 -VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 125/360 (34%), Positives = 172/360 (47%), Gaps = 30/360 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + +++ DT S +VW QC PC RC+ Q+ PIFDPR S TY+ IPC P
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
CR + + C+Y Y G T G S ET F RN V +A GC +DN
Sbjct: 202 CRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-RN---RVKGVALGCGHDNE 257
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRR 270
G +G+LG LS Q +R FSYCLV ++ S + FG A R
Sbjct: 258 GLFV--GAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAA--VSR 313
Query: 271 DLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGA----FDIMRDGTGGFIIDTGTPV 325
TP+L + L +Y+ LL IS+G V PG F + + G GG IID+GT V
Sbjct: 314 IARFTPLLSNPKLDTFYYVGLLGISVGGTRV---PGVTASLFKLDQIGNGGVIIDSGTSV 370
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEAD 384
T + Y + + ++L +R P N S FD C+ + + P++ H + AD
Sbjct: 371 TRLIRPAYIAMRDAFRVGAKTL--KRAP-NFSL-FDTCFDLSNMNEVKVPTVVLHFRRAD 426
Query: 385 YIVQPENMYFIEPD-RGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ P Y I D G+FC A SI+G QQQ ++YDL + F CA
Sbjct: 427 -VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 106/352 (30%), Positives = 160/352 (45%), Gaps = 28/352 (7%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y +++ +GTP DT S L+WTQC PC C+ Q PIFDP S+T+ E
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEK----- 114
Query: 156 LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGF 214
+C C Y Y ++G + ET +G FV P GC +++S F
Sbjct: 115 ------RCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWF 168
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
SG++G + P SL +Q+ GL SYC + TS I FG +A V + +
Sbjct: 169 K--PTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFAS--QGTSKINFGTNAIVAGDGVVS 224
Query: 275 TPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
T + L+ +P +YL+L +S+G V F + G IID+GT +T+
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYC 281
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPENM 392
+ + D + ++ R P CY Y + +P +T H AD ++ NM
Sbjct: 282 NLVREAVDHYVTAV-RTADPTGNDM---LCY-YTDTIDIFPVITMHFSGGADLVLDKYNM 336
Query: 393 YFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
Y RG FC+AI + P+ +I G Q N L+ YD + + F NC+
Sbjct: 337 YIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 178/372 (47%), Gaps = 36/372 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y++ ++IGTP +L DT SSL+WTQC PC C + P F P +S+T+S++PC L
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 157 CR---SPF-KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C+ SP+ C CVY Y +G T G + ET G +F P +AFGCS +N
Sbjct: 150 CQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHV---GGASF-PGVAFGCSTEN- 203
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRD 271
G SGI+G SPLSL SQ+ G FSYCL + +A S I FG A V +
Sbjct: 204 --GVGNSSSGIVGLGRSPLSLVSQVG---VGRFSYCLRSDADAGDSPILFGSLAKVTGGN 258
Query: 272 LETTPILLSDLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDG----TGGFIIDTGTP 324
+++TP+L + P ++Y++L I++G + F R GG I+D+GT
Sbjct: 259 VQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTT 318
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS--FKAYPSMTFHLQ- 381
+T++ Y + + + + + FD C+ ++ P T L+
Sbjct: 319 LTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVLRF 378
Query: 382 --EADYIVQPENMYFIEP--DRGRFCVA------IQDDPKYSILGAWQQQNMLIIYDLNV 431
A+Y V+ + + +GR V + SI+G Q ++ ++YDL+
Sbjct: 379 AGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDG 438
Query: 432 PALRFGSENCAN 443
F +CAN
Sbjct: 439 GMFSFAPADCAN 450
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 167/377 (44%), Gaps = 41/377 (10%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y ++ +GTP P ++ DT S +VW QC PC RC+DQ+ P+FDPR S++Y
Sbjct: 133 LAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGA 192
Query: 150 IPCDDPLCRSPFKCQNGKCVYTRR-------YHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+ C PLCR + +G C RR Y G VT G + ET F G V R
Sbjct: 193 VDCAAPLCR---RLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFA---GGARVAR 246
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM--------- 253
+A GC +DN G +G+LG LS +Q+ R FSYCLV
Sbjct: 247 VALGCGHDNEGLFV--AAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAAS 304
Query: 254 -EATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMR 311
+S + FG + TP++ + + +Y+ L+ IS+G R P A +R
Sbjct: 305 RSRSSTVTFGPPS---ASAASFTPMVRNPRMETFYYVQLVGISVGG--ARVPGVAESDLR 359
Query: 312 ----DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD 367
G GG I+D+GT VT + Y L + L R+ FD CY
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGL---RLSPGGFSLFDTCYDLG 416
Query: 368 S-SFKAYPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNML 424
P+++ H P Y I D RG FC A D SI+G QQQ
Sbjct: 417 GRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFR 476
Query: 425 IIYDLNVPALRFGSENC 441
+++D + + F + C
Sbjct: 477 VVFDGDGQRVGFAPKGC 493
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 173/372 (46%), Gaps = 44/372 (11%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q L Y V V +G + ++ DT S L W QCQPC RC++Q P+F+P S +Y + C
Sbjct: 131 QTLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLC 188
Query: 153 DDPLCRSPFKCQNGK----------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
P C+S + G C Y Y G TRG E T V
Sbjct: 189 SSPTCQS-LQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDL---GNSTAVNN 244
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKF 261
FGC +N G FGG SG++G S LSL SQ G+FSYCL + E EA+ +
Sbjct: 245 FIFGCGRNNQGL-FGGA-SGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETEASGSLVM 302
Query: 262 GRDADVRRRDLETTPILLSDLRPH-----FYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
G ++ V + TTPI + + P+ ++L+L I++G V+ P +F G G
Sbjct: 303 GGNSSVYKN---TTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAP--SF-----GKDG 352
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPS 375
+ID+GT +T + YQ L + + + G P A D C+ + P+
Sbjct: 353 MMIDSGTVITRLPPSIYQALKDEF--VKQFSGFPSAP--AFMILDTCFNLSGYQEVEIPN 408
Query: 376 MTFHLQ-EADYIVQPENM-YFIEPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLN 430
+ H + A+ V + YF++ D + C+AI + + I+G +QQ+N +IYD
Sbjct: 409 IKMHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTK 468
Query: 431 VPALRFGSENCA 442
L F +E C
Sbjct: 469 GSMLGFAAEACT 480
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 157/355 (44%), Gaps = 26/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP + ++FDT S W QCQPC+ C+ Q P+FDP S TY+ I C
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 220
Query: 156 LCRSPF--KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y +Y G T G +++T + + FGC N G
Sbjct: 221 YCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTL----AYDTIKNFRFGCGEKNRG 276
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F+YCL T + G A L
Sbjct: 277 LF--GRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAANARL- 333
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TP+L+ +Y+ + I +G H++ P F T G ++D+GT +T + Y
Sbjct: 334 -TPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFS-----TAGTLVDSGTVITRLPPSAY 387
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFHLQEADYI-VQP 389
L + + ++ LG P A D CY + A P+++ Q + V
Sbjct: 388 APLRSAFSKAMQGLGYSAAP--AFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 445
Query: 390 ENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ ++ D + C+A DD +I+G QQ+ ++YD+ + F C
Sbjct: 446 SGILYVA-DVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 160/352 (45%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + +G+P + Q+++ D+ S +VW QCQPC C+ Q+ P+FDP S TY+ I CD +
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSV 196
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C C +G+C Y Y G TRG + ET F G + +A GC + N G
Sbjct: 197 CDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF----GRVLIRNIAIGCGHMNRGM 252
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG +S QL + G FSYCLV R E+T ++FGR A
Sbjct: 253 FI--GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGA--MPVGAA 308
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHI-VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
P++ + P FY L I V P F++ G GG ++DTGT VT +
Sbjct: 309 WVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPA 368
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
Y+ + +L R + FD CY + P+++F+ + P
Sbjct: 369 YEAFRDTFIGQTANLPRS----DRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPAR 424
Query: 392 MYFIEPD-RGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ I D G FC A SI+G QQ+ + I D + + FG C
Sbjct: 425 NFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 120/379 (31%), Positives = 166/379 (43%), Gaps = 50/379 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + +GTP +P L DT S LVWTQC PC CF Q P+ DP AS+TY+ +PC P
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGAPR 151
Query: 157 CRS-PF-KCQNG----------KCVYTRRYHVGD--VTRGLASRETFAFPVRN--GFTFV 200
CR+ PF C G C Y YH GD VT G + + F F N G + +
Sbjct: 152 CRALPFTSCGGGGRSSWGNGNRSCAYI--YHYGDKSVTVGEIATDRFTFGGDNGDGDSRL 209
Query: 201 P--RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TS 257
P RL FGC + N G F +GI GF SL SQL FSYC E+ +S
Sbjct: 210 PTRRLTFGCGHFNKGV-FQSNETGIAGFGRGRWSLPSQLNVTT---FSYCFTSMFESKSS 265
Query: 258 VIKFG---------RDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAF 307
++ G A ++ TTP+L + +P Y L L IS+G+ + P
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKL 325
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD 367
IID+G +T + Y+ + + +G D C+
Sbjct: 326 R-------STIIDSGASITTLPEAVYEAVKAEFAA---QVGLPPTGVVEGSALDLCFALP 375
Query: 368 SSF----KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP-KYSILGAWQQQN 422
+ PS+T HL AD+ + N F + CV + P +++G +QQQN
Sbjct: 376 VTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQN 435
Query: 423 MLIIYDLNVPALRFGSENC 441
++YDL L F C
Sbjct: 436 THVVYDLENDWLSFAPARC 454
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 170/397 (42%), Gaps = 43/397 (10%)
Query: 82 ELEDIHLPMAKQDLF--------------YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
+LE +H A DL Y + +G P ++ DT S L+W QC P
Sbjct: 63 QLESLHSATAAADLLRSPVMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLP 122
Query: 128 CIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK-----CQNGKCVYTRRYHVGDVTRG 182
C RC+ Q TP++DPR S T+ IPC P CR + + G CVY Y G + G
Sbjct: 123 CRRCYRQVTPLYDPRNSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSG 182
Query: 183 LASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ 242
+ +T P T V + GC +DN G +G+LG LS +QL
Sbjct: 183 DLATDTLVLPDD---TRVHNVTLGCGHDNEGLL--ASAAGLLGAGRGQLSFPTQLAPAYG 237
Query: 243 GLFSYCL----VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISI-G 296
+FSYCL R ++S + FGR ++ TP+ + RP +Y+ ++ S+ G
Sbjct: 238 HVFSYCLGDRMSRARNSSSYLVFGRTPELPSTAF--TPLRTNPRRPSLYYVDMVGFSVGG 295
Query: 297 RHIVRFPPGAFDIM-RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN 355
+ F + + G GG ++D+GT ++ Y + + + G +R+ N
Sbjct: 296 ERVAGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRL-RN 354
Query: 356 ASQEFDYCYRYDSSFKA----YPSMTFHLQEADYIVQPENMYFIE----PDRGRFCVAIQ 407
FD CY + PS+ H A + P+ Y I R FC+ +Q
Sbjct: 355 KFSVFDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQ 414
Query: 408 -DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D ++LG QQQ +++D+ + F C+
Sbjct: 415 AADDGLNVLGNVQQQGFGVVFDVERGRIGFTPNGCSG 451
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 121/435 (27%), Positives = 185/435 (42%), Gaps = 42/435 (9%)
Query: 33 SLKLIPIFSPESPLYPGNLSQSERIHK-MFEISKARANYMAS-----MSKPNAFQELEDI 86
SL+++ P S L +++ H + + R Y+ S + + N+ +EL+
Sbjct: 62 SLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDST 121
Query: 87 HLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDP 141
LP L Y V V +GTP + L+FDT S L WTQC+PC C+ Q IFDP
Sbjct: 122 TLPAKSGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDP 181
Query: 142 RASTTYSEIPCDDPLC--------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV 193
S++Y I C LC +S C+Y +Y + G S+E
Sbjct: 182 SKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITA 241
Query: 194 RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM 253
+ V FGC DN G F G +G++G P+S Q + +FSYCL
Sbjct: 242 TD---IVDDFLFGCGQDNEGL-FSGS-AGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTS 296
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRD 312
+ + FG A +L+ TP+ FY L ++ IS+G + P A
Sbjct: 297 SSLGHLTFGASA-ATNANLKYTPLSTISGDNTFYGLDIVGISVGG--TKLP--AVSSSTF 351
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY-NASQEFDYCYRYDSSFK 371
GG IID+GT +T + Y L + Q G ++ P N FD CY + S +K
Sbjct: 352 SAGGSIIDSGTVITRLAPTAYAALRSAFRQ-----GMEKYPVANEDGLFDTCYDF-SGYK 405
Query: 372 --AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLII 426
+ P + F + P I + C+A +D +I G QQ+ + ++
Sbjct: 406 EISVPKIDFEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVV 465
Query: 427 YDLNVPALRFGSENC 441
YD+ + FG+ C
Sbjct: 466 YDVEGGRIGFGAAGC 480
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 169/358 (47%), Gaps = 27/358 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V IG+P K Q+L+ DT S + W QC PC C+ Q +FDPRAS+++ + C P
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C+ + +C+Y Y G T G + ++F V G T + FGC +DN
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFL--VSRGRT--SPVVFGCGHDNE 129
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE---MEATSVIKFGRDADVRR 269
G +G+LG A LS SQL +R FSYCLV + A+S + FG A
Sbjct: 130 GLFV--GAAGLLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFGDSALPTS 184
Query: 270 RDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD-GTGGFIIDTGTPVTF 327
T +L + L +Y L ISIG ++ P AF + G GG IID+GT VT
Sbjct: 185 ASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTR 244
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDS-SFKAYPSMTFHLQEADY 385
+ Y + + RS Q++P A FD CY + + + P+++FH +
Sbjct: 245 LPTYAYTVMRDAF----RS-ATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGAS 299
Query: 386 IVQPENMYFIEPD-RGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ P + Y + D G FC A SI+G QQQ M + DL+ + F C
Sbjct: 300 VQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 110/354 (31%), Positives = 165/354 (46%), Gaps = 22/354 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +G P + Q ++ DT S + W QC+PC C+ Q+ PI++P S++Y + C L
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANL 204
Query: 157 CR--SPFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C+ C +NG C+Y Y G T+G + ET G + +A GC +DN G
Sbjct: 205 CQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTL----GGAPLQNVAIGCGHDNEG 260
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDL 272
+G+LG LS SQL + +FSYCLV R+ E++S ++FGR A L
Sbjct: 261 LFV--GAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDRDSESSSTLQFGRAAVPNGAVL 318
Query: 273 ETTPILL-SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
P+L S L +Y+ L IS+G ++ F I G GG I+D+GT VT ++
Sbjct: 319 --APMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRLQTA 376
Query: 332 PYQTLMQRYDQILRSLGRQRIP-YNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQP 389
Y +L + G + +P + FD CY S P++ FH + P
Sbjct: 377 AYDSLRDAF-----RAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLP 431
Query: 390 ENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y + D G FC A SI+G QQQ + + +D + F C
Sbjct: 432 AKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 171/353 (48%), Gaps = 25/353 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + V IG P +++ DT S + W QC PC C+ Q+ PIFDP +S +YS I CD P
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAPQ 208
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C+S +C+NG C+Y Y G T G + ET G V +A GC ++N G
Sbjct: 209 CKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL----GTAAVENVAIGCGHNNEGL 264
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LS +Q+ FSYCLV R+ +A S ++F R++
Sbjct: 265 FV--GAAGLLGLGGGKLSFPAQVNATS---FSYCLVNRDSDAVSTLEFNSPLP---RNVV 316
Query: 274 TTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T P+ + +L +YL L IS+G + P F++ G GG IID+GT VT +R+
Sbjct: 317 TAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEV 376
Query: 333 YQTLMQRYDQILRSLGRQRIP-YNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPE 390
Y L + + G + IP N FD CY S P+++FH E + P
Sbjct: 377 YDALRDAFVK-----GAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPA 431
Query: 391 NMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC A SI+G QQQ + +D+ + F +++C
Sbjct: 432 RNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 123/360 (34%), Positives = 170/360 (47%), Gaps = 30/360 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + +++ DT S +VW QC PC RC+ Q+ PIFDPR S TY+ IPC P
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
CR + + C+Y Y G T G S ET F RN V +A GC +DN
Sbjct: 202 CRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-RN---RVKGVALGCGHDNE 257
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRR 270
G +G+LG LS Q +R FSYCLV ++ S + FG A R
Sbjct: 258 GLFV--GAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAA--VSR 313
Query: 271 DLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGA----FDIMRDGTGGFIIDTGTPV 325
TP+L + L +Y+ LL IS+G V PG F + + G GG IID+GT V
Sbjct: 314 IARFTPLLSNPKLDTFYYVGLLGISVGGTRV---PGVTASLFKLDQIGNGGVIIDSGTSV 370
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEAD 384
T + Y + + ++L +R P FD C+ + + P++ H + AD
Sbjct: 371 TRLIRPAYIAMRDAFRVGAKTL--KRAP--DFSLFDTCFDLSNMNEVKVPTVVLHFRGAD 426
Query: 385 YIVQPENMYFIEPD-RGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ P Y I D G+FC A SI+G QQQ ++YDL + F CA
Sbjct: 427 -VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 189/398 (47%), Gaps = 39/398 (9%)
Query: 78 NAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
NA + + + L Y V + +GTP L+ DT S + W QC PC C P
Sbjct: 119 NALTGFTSPVVTLGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRP 178
Query: 138 IFDPRASTTYSEIPCDDPLCRS------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFA 190
F+PR S+++ ++PC C + PF +G+ C+++ +Y G ++ GL + ET A
Sbjct: 179 PFNPRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA 238
Query: 191 FPVRNGFTFVP----RLAFGCSN-DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLF 245
N P + GC++ D G G SG+LG + P+S SQL +R F
Sbjct: 239 GNTPNFGDGEPVKLSNITLGCADIDREGLPTGA--SGLLGMDRRPISFPSQLSSRYARKF 296
Query: 246 SYCL---VREMEATSVIKFGRDADVRRRDLETTPILLSDLRP-----HFYLHLLEISIGR 297
S+C + + ++ ++ FG ++D+ L TP++ + P ++Y+ L+ IS+
Sbjct: 297 SHCFPDKIAHLNSSGLVFFG-ESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDE 355
Query: 298 HIVRFPPGAFDIMR-DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+ FDI + G+GG IID+GT T+++ +Q + + + + R+ ++ N+
Sbjct: 356 SRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDNS 413
Query: 357 SQEFDYCYRYDSSFKA-----YPSMTFHLQEADYIVQPENMYFI----EPDRGRFCVAIQ 407
F CY S A PS+T H + +V P+N I ++ C+A Q
Sbjct: 414 G--FTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ 471
Query: 408 --DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D ++I+G +QQQN+ + YDL L CA
Sbjct: 472 MSGDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQCAT 509
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 130/441 (29%), Positives = 200/441 (45%), Gaps = 44/441 (9%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSK--PNAFQELE 84
S GF+ +LI SP SP Y ++ + RI S++R NY+ ++K NA
Sbjct: 3 SNEVGFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDV 62
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQT---TPIFD 140
+ + + Y + NIG P DT++ L+W QC C +C + T F
Sbjct: 63 SLSPTLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFL 122
Query: 141 PRASTTYSEIPCDDPLCRS--PFKCQNGK---CVYTRRYHVGDVTRGLASRETFAFPVRN 195
S TY PC C S F+ N C Y Y T G+ S ++F F +
Sbjct: 123 SSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSD 182
Query: 196 GFTF-VPRLAFGCS-----NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL 249
G V L FGCS D + +G +G N +PLSL SQL + FSYCL
Sbjct: 183 GMLVDVGFLNFGCSEAPLTGDEQSY------TGNVGLNQTPLSLISQLGIK---KFSYCL 233
Query: 250 V--REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
V + +TS + FG + TP+L + +Y+ +L ISIG F G F
Sbjct: 234 VPFNNLGSTSKMYFGS---LPVTSGGQTPLLYPN-SDAYYVKVLGISIGNDEPHF-DGVF 288
Query: 308 DI--MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
D+ +RD G+IIDTG + + + +L+ ++ + R+ P + F+ C+
Sbjct: 289 DVYEVRD---GWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDP---KERFELCFE 342
Query: 366 YDSS--FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVA-IQDDPKYSILGAWQQQN 422
++ +++P +T H AD I+ E+ + D G FC+A ++ SILG +Q QN
Sbjct: 343 LQNANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQN 402
Query: 423 MLIIYDLNVPALRFGSENCAN 443
+ YDL + F +CA+
Sbjct: 403 YHVGYDLEAQVISFAPVDCAD 423
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 167/363 (46%), Gaps = 38/363 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP + +FDT S L WTQC+PC R C+ Q PIF+P ST+Y+ I C P
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 156 LC--------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C SP C CVY +Y + G +++ A + F FGC
Sbjct: 198 TCDELKSGTGNSP-SCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDVFN---NFLFGC 253
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
+N G G ++G++G + LSL SQ + LFSYCL +T + FG
Sbjct: 254 GQNNRGLFVG--VAGLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSSSTGYLTFGSGGGT 311
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ ++ TP L++ P FY L+L+ IS+G + F T G IID+GT ++
Sbjct: 312 SKA-VKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFS-----TAGTIIDSGTVIS 365
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNA-SQEFDYCY---RYDSSFKAYPSMTFHLQE 382
+ Y L + Q + + P A + D CY +YD+ P + + +
Sbjct: 366 RLPPTAYSDLRASFQQQM-----SKYPKAAPASILDTCYDFSQYDT--VDVPKINLYFSD 418
Query: 383 -ADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
A+ + P +++I + + C+A D +ILG QQ+ ++YD+ + F
Sbjct: 419 GAEMDLDPSGIFYIL-NISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAP 477
Query: 439 ENC 441
C
Sbjct: 478 GGC 480
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 160/351 (45%), Gaps = 28/351 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ +GTP DT S L+WTQC PC C+ Q PIFDP S+T+ E
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEK------ 114
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFA 215
+C C Y Y ++G + ET +G FV P GC +++S F
Sbjct: 115 -----RCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFK 169
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETT 275
SG++G + P SL +Q+ GL SYC + TS I FG +A V + +T
Sbjct: 170 --PTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFAS--QGTSKINFGTNAIVAGDGVVST 225
Query: 276 PILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
+ L+ +P +YL+L +S+G V F + G IID+GT +T+
Sbjct: 226 TMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYCN 282
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPENMY 393
+ + D + ++ R P CY Y + +P +T H AD ++ NMY
Sbjct: 283 LVREAVDHYVTAV-RTADPTGNDM---LCY-YTDTIDIFPVITMHFSGGADLVLDKYNMY 337
Query: 394 FIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
RG FC+AI + P+ +I G Q N L+ YD + + F NC+
Sbjct: 338 IETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 117/376 (31%), Positives = 171/376 (45%), Gaps = 49/376 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ-TTPIFDPRASTTYSEIPCDDP 155
Y + V++GTP +P L DT S LVWTQC PC+ CF+Q P+ DP AS+T++ +PCD P
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCDAP 149
Query: 156 LCRS-PFKCQNGK------CVYTRRYHVGD--VTRGLASRETFAFPVRN--GFTFVPRLA 204
LCR+ PF G+ CVY YH GD +T G + ++F F + G R+
Sbjct: 150 LCRALPFTSCGGRSWGDRSCVYV--YHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVT 207
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR--EMEATSVIKFG 262
FGC + N G F +GI GF SL SQL FSYC + +++SV+ G
Sbjct: 208 FGCGHINKGI-FQANETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMFDTKSSSVVTLG 263
Query: 263 RDA--------DVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDG 313
A D+ TT ++ + +P Y + L IS+G V P +R
Sbjct: 264 AAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE---SRLRSS 320
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP--YNASQEFDYCYRYDSSF- 370
T IID+G +T + Y+ + + + + +P S D C+ +
Sbjct: 321 T---IIDSGASITTLPEDVYEAVKAEF------VSQVGLPAAAAGSAALDLCFALPVAAL 371
Query: 371 ---KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP--KYSILGAWQQQNMLI 425
A P++T HL P Y E R + D + ++G +QQQN +
Sbjct: 372 WRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHV 431
Query: 426 IYDLNVPALRFGSENC 441
+YDL L F C
Sbjct: 432 VYDLENDVLSFAPARC 447
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 112/423 (26%), Positives = 194/423 (45%), Gaps = 32/423 (7%)
Query: 35 KLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQD 94
KLI S SP + N S +ER ++ + S R Y+ + K + +++L + +
Sbjct: 37 KLIHWGSILSPYFNPNASVAERAERIVKTSATRIAYLYAQIKGDIHMNDFELNLLPSTYE 96
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+ V ++G P PQ + DT S+++W +C PC RC Q P+ DP S+TY+ +PC +
Sbjct: 97 PLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTN 156
Query: 155 PLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRN-GFTFVPRLAFGCSND 210
+C S + + +C Y Y G + G+ + E F + G VP + FGCS++
Sbjct: 157 TMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHE 216
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME---ATSVIKFGRDADV 267
N + + +G+ G S +++ ++ FSYCL + + + FG A+
Sbjct: 217 NGDYK-DRRFTGVFGLGKGITSFVTRMGSK----FSYCLGNIADPHYGYNQLVFGEKANF 271
Query: 268 RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+TP L + H+Y+ L IS+G + AF M+ +ID+GT +T+
Sbjct: 272 EGY---STP--LKVVNGHYYVTLEGISVGEKRLDIDSTAFS-MKGNEKSALIDSGTALTW 325
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYI 386
+ ++ L Q+L + +P+ Y +P +TFH AD
Sbjct: 326 LAESAFRALDNEVRQLLDGV---LMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLD 382
Query: 387 VQPENMYF-IEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ E+M++ PD C+A++ D +S++G QQ + YDLN L F
Sbjct: 383 LDTESMFYQATPD--ILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQR 440
Query: 439 ENC 441
+C
Sbjct: 441 IDC 443
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/465 (26%), Positives = 213/465 (45%), Gaps = 43/465 (9%)
Query: 6 ALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISK 65
+L LA + S + + + +++ + + KLI S PLY N + +R + S
Sbjct: 14 SLTLAFYLS--TAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSI 71
Query: 66 ARANYMASMSK--PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
R +++ S K + E +P + F V ++IG+P Q ++ DT SSL+W
Sbjct: 72 ERFDFLESKIKELKSVGNEARSSLIPFNRGSGFL-VNLSIGSPPVTQLVVVDTGSSLLWV 130
Query: 124 QCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQN-GKCVYTRRYHVGDVT 180
QC PCI CF Q+T FDP S ++ + C P + +KC + Y RY GD +
Sbjct: 131 QCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSS 190
Query: 181 RGLASRETFAFPVRN-GFTF-------------VPRLAFGCSNDNSGFAFGGKISGILGF 226
+G+ ++E+ F + G F + FGC + N +G+ G
Sbjct: 191 QGILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYNGVFGL 250
Query: 227 NASP-LSLSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETTPILLSDL 282
A P +++++QL N+ FSYC+ + + + G+ + + ++TP+ +
Sbjct: 251 GAYPHITMATQLGNK----FSYCIGDINNPLYTHNHLVLGQGSYIEG---DSTPLQIH-- 301
Query: 283 RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
H+Y+ L IS+G ++ P AF I DG+GG +ID+G T + NG ++ L
Sbjct: 302 FGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVD 361
Query: 343 ILRSLGRQRIPYNASQEFDYCYR--YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRG 400
+++ L +RIP E C++ +P++TFH +V F +
Sbjct: 362 LMKGL-LERIPTQRKFE-GLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGD 419
Query: 401 RFCVAI----QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
RFC+AI + S++G QQN + +DL + F +C
Sbjct: 420 RFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 464
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/369 (29%), Positives = 168/369 (45%), Gaps = 29/369 (7%)
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+ M Y V + +G+P + Q+++ D+ S +VW QC+PC RC+ Q+ P+FDP S
Sbjct: 131 DVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADS 190
Query: 145 TTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
++++ + C +C C G+C Y Y G T+G + ET G +
Sbjct: 191 SSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTV----GQVMIRD 246
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKF 261
+A GC + N G +G+LG +S QL + G FSYCLV R +T ++F
Sbjct: 247 VAIGCGHTNQGMFI--GAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEF 304
Query: 262 GRDADVRRRDLETTPILLSDLR----PHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
GR A L +S +R P F Y+ L I +G V P F + GT G
Sbjct: 305 GRGA------LPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNG 358
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YP 374
++DTGT VT Y + +L R P FD CY + F++ P
Sbjct: 359 VVMDTGTAVTRFPTAAYVAFRDSFTAQTSNL--PRAP--GVSIFDTCYDLN-GFESVRVP 413
Query: 375 SMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVP 432
+++F+ + + P + I D G FC+A P SI+G QQ+ + I +D
Sbjct: 414 TVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANG 473
Query: 433 ALRFGSENC 441
+ FG C
Sbjct: 474 FVGFGPNIC 482
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 164/365 (44%), Gaps = 19/365 (5%)
Query: 84 EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRA 143
D+ M + Y V + +G+P + Q+++ D+ S +VW QC+PC +C+ QT P+FDP
Sbjct: 30 SDVVSGMNQGSGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPAD 89
Query: 144 STTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP 201
S ++ + C +C C +G+C Y Y G T+G + ET F G T V
Sbjct: 90 SASFMGVSCSSAVCDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTF----GRTVVR 145
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIK 260
+A GC + N G +G+LG +S QL + FSYCLV R ++
Sbjct: 146 NVAIGCGHSNRGMFV--GAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLE 203
Query: 261 FGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
FG +A P++ + P F Y+ LL + +G V F + G+GG ++
Sbjct: 204 FGSEA--MPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVM 261
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTF 378
DTGT VT Y+ + + ++L R + FD CY P+++F
Sbjct: 262 DTGTAVTRFPTVAYEAFRNAFIEQTQNLPRA----SGVSIFDTCYNLFGFLSVRVPTVSF 317
Query: 379 HLQEADYIVQPENMYFIE-PDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRF 436
+ + P N + I D G FC A P SILG QQ+ + I D + F
Sbjct: 318 YFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGF 377
Query: 437 GSENC 441
G C
Sbjct: 378 GPNIC 382
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 169/364 (46%), Gaps = 58/364 (15%)
Query: 89 PMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
P++ + Y ++++IGTP + ++DT S L+WTQC PC+ C+ Q P+FDP ST++
Sbjct: 16 PVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFK 75
Query: 149 EIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
E+ C+ CR + + T + + FGC
Sbjct: 76 EVSCESQQCR----------------------------------LLDTPTSILNIVFGCG 101
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG--LFSYCLV---REMEATSVIKFGR 263
++NSG F G+ G PLSL+SQ+ + + FS CLV + TS I FG
Sbjct: 102 HNNSG-TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGP 160
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
+A+V D+ +TP++ D ++++ L IS+G + F + + G ID GT
Sbjct: 161 EAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATK---GNVFIDAGT 217
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD----YCYRYDSSFKAYPSMTFH 379
P T + Y L+Q ++ IP Q+ D CYR ++ P +T H
Sbjct: 218 PPTLLPRDFYNRLVQGV--------KEAIPMEPVQDPDLQPQLCYR-SATLIDGPILTAH 268
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
AD ++P N FI P G +C A+Q D I G + Q N LI +DL+ + F +
Sbjct: 269 FDGADVQLKPLNT-FISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKA 327
Query: 439 ENCA 442
+C
Sbjct: 328 VDCT 331
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/377 (31%), Positives = 176/377 (46%), Gaps = 41/377 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT-TPIFDPRASTTYSEIPCDDP 155
Y V++ IG P + L+ DT S LVW +C C C + +F PR S+T+S C DP
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 142
Query: 156 LC-------RSPFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVPRL 203
+C R+P +C + + C Y Y G +T GL +RET + +G + +
Sbjct: 143 VCRLVPKPGRAP-RCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSV 201
Query: 204 AFGC-----SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA--- 255
AFGC SG +F G +G++G P+S +SQL R FSYCL+ +
Sbjct: 202 AFGCGFRISGQSVSGTSFNGA-NGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP 260
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGT 314
TS + G D + L TP+L + L P F Y+ L + + +R P ++I G
Sbjct: 261 TSYLIIGDGGDAVSK-LFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGN 319
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ---EFDYCYRYDSSFK 371
GG ++D+GT + F+ + Y+ ++ Q R ++P NA + FD C K
Sbjct: 320 GGTVMDSGTTLAFLADPAYRLVIAAVKQ------RIKLP-NADELTPGFDLCVNVSGVTK 372
Query: 372 A---YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD-DPK--YSILGAWQQQNMLI 425
P + F V P YFIE + C+AIQ DPK +S++G QQ L
Sbjct: 373 PEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLF 432
Query: 426 IYDLNVPALRFGSENCA 442
+D + L F CA
Sbjct: 433 EFDRDRSRLGFSRRGCA 449
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/438 (27%), Positives = 190/438 (43%), Gaps = 34/438 (7%)
Query: 22 THFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYM---------- 71
THF+ S+ ++L+L+ S Y + R+H R + +
Sbjct: 49 THFSDESSSKYTLRLLHRDRFPSVTY---RNHHHRLHARMRRDTDRVSAILRRISGKVIP 105
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC 131
+S S+ DI M + Y V + +G+P + Q+++ D+ S +VW QCQPC C
Sbjct: 106 SSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC 165
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETF 189
+ Q+ P+FDP S +Y+ + C +C C +G C Y Y G T+G + ET
Sbjct: 166 YKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETL 225
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL 249
F T V +A GC + N G F G + S +S QL + G F YCL
Sbjct: 226 TFAK----TVVRNVAMGCGHRNRGM-FIGAAGLLGIGGGS-MSFVGQLSGQTGGAFGYCL 279
Query: 250 V-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHI-VRFPPGAF 307
V R ++T + FGR+A P++ + P FY L+ + + P G F
Sbjct: 280 VSRGTDSTGSLVFGREA--LPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 337
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD 367
D+ G GG ++DTGT VT + Y + +L R + FD CY
Sbjct: 338 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRA----SGVSIFDTCYDL- 392
Query: 368 SSFKA--YPSMTFHLQEADYIVQPENMYFIE-PDRGRFCVAIQDDPK-YSILGAWQQQNM 423
S F + P+++F+ E + P + + D G +C A P SI+G QQ+ +
Sbjct: 393 SGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGI 452
Query: 424 LIIYDLNVPALRFGSENC 441
+ +D + FG C
Sbjct: 453 QVSFDGANGFVGFGPNVC 470
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 159/354 (44%), Gaps = 28/354 (7%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y +++ +GTP DT S ++WTQC PC C+ Q PIFDP S+T+ E
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQ----- 474
Query: 156 LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGF 214
+C C Y Y ++G+ + ET P +G FV GC DN+
Sbjct: 475 ------RCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTNL 528
Query: 215 AFGG---KISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
+ G SGI+G N PLSL SQ+ GL SYC + TS I FG +A V
Sbjct: 529 QYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF--SGQGTSKINFGTNAIVAGDG 586
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+ + P +YL+L +S+ +++ F G ID+GT +T+
Sbjct: 587 TVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPF---HAEDGNIFIDSGTTLTYFPMS 643
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPE 390
+ + +Q++ ++ ++P S CY Y + +P +T H AD ++
Sbjct: 644 YCNLVREAVEQVVTAV---KVPDMGSDNL-LCY-YSDTIDIFPVITMHFSGGADLVLDKY 698
Query: 391 NMYFIEPDRGRFCVAIQ-DDPKY-SILGAWQQQNMLIIYDLNVPALRFGSENCA 442
NMY G FC+AI +DP ++ G Q N L+ YD + + F NC+
Sbjct: 699 NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCS 752
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 101/342 (29%), Positives = 152/342 (44%), Gaps = 32/342 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y +++ +GTP DT S L+WTQC PC C+ Q PIFDP S+T++E
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQ----- 135
Query: 156 LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCS-----N 209
+C C Y Y ++G+ + ET +G FV GC
Sbjct: 136 ------RCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTDL 189
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
DNSGFA SGI+G N P SL SQ+ GL SYC + TS I FG +A V
Sbjct: 190 DNSGFA--SSSSGIVGLNMGPRSLISQMDLPYPGLISYCF--SGQGTSKINFGTNAIVAG 245
Query: 270 RDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
+ + P +YL+L +S+ + + F G +ID+G+ VT+
Sbjct: 246 DGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPF---HAEDGNIVIDSGSTVTYFP 302
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQ 388
+ + +Q++ ++ R+P + + CY + + +P +T H AD ++
Sbjct: 303 VSYCNLVRKAVEQVVTAV---RVPDPSGNDM-LCY-FSETIDIFPVITMHFSGGADLVLD 357
Query: 389 PENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYD 428
NMY G FC+AI + +I G Q N L+ YD
Sbjct: 358 KYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYD 399
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/353 (33%), Positives = 171/353 (48%), Gaps = 25/353 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + V IG P +++ DT S + W QC PC C+ Q+ PIFDP +S +YS I CD+P
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQ 208
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C+S +C+NG C+Y Y G T G + ET G V +A GC ++N G
Sbjct: 209 CKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL----GSAAVENVAIGCGHNNEGL 264
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LS +Q+ FSYCLV R+ +A S ++F R+
Sbjct: 265 FV--GAAGLLGLGGGKLSFPAQVNATS---FSYCLVNRDSDAVSTLEFNSPLP---RNAA 316
Query: 274 TTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T P++ + +L +YL L IS+G + P +F++ G GG IID+GT VT +R+
Sbjct: 317 TAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEV 376
Query: 333 YQTLMQRYDQILRSLGRQRIP-YNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPE 390
Y L + + G + IP N FD CY S P+++F E + P
Sbjct: 377 YDALRDAFVK-----GAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPA 431
Query: 391 NMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D G FC A SI+G QQQ + +D+ + F ++C
Sbjct: 432 RNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 160/355 (45%), Gaps = 22/355 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + +++ DT S +VW QC PC +C+ Q+ PIF+P S +++ IPC PL
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPL 169
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
CR S + C+Y Y G T G + ET F + ++A GC + N
Sbjct: 170 CRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNK----IAKVALGCGHHNE 225
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRR 270
G +G+LG LS SQ R FSYCLV ++ S + FG DA + R
Sbjct: 226 GLFV--GAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFG-DAAISRL 282
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVR-FPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
T I L +Y+ L+ IS+G VR P F + G GG IID+GT VT +
Sbjct: 283 ARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLT 342
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD--SSFKAYPSMTFHLQEADYIV 387
Y L + R L R FD CY SS K P++ H + AD +
Sbjct: 343 RPAYTALRDAFRVGARHLKRG----PEFSLFDTCYDLSGQSSVKV-PTVVLHFRGADMAL 397
Query: 388 QPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
N + G FC A SI+G QQQ ++YDL + F C
Sbjct: 398 PATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 174/375 (46%), Gaps = 41/375 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT--PIFDPRASTTYSEIPCDD 154
Y++ +++GTP ++ DT S+L+W QC PC RCF + T P+ P S+T+S +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 155 PLCR------SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C+ P C C Y Y G T G + ET V +G TF P++AFGC
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLT--VGDG-TF-PKVAFGC 205
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME--ATSVIKFGRDA 265
S +N SGI+G PLSL SQL G FSYCL +M S I FG A
Sbjct: 206 STENG----VDNSSGIVGLGRGPLSLVSQL---AVGRFSYCLRSDMADGGASPILFGSLA 258
Query: 266 DVRRRD-LETTPILLS---DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGFIID 320
+ R +++TP+L + H+Y++L I++ + F + G GG I+D
Sbjct: 259 KLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVD 318
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----YPSM 376
+GT +T++ Y + Q + + +L + A + D CY+ + P +
Sbjct: 319 SGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRL 378
Query: 377 TFHLQEADYIVQPENMYF--IEPD-RGRFCVA------IQDDPKYSILGAWQQQNMLIIY 427
P YF +E D +GR VA DD SI+G Q +M ++Y
Sbjct: 379 ALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLY 438
Query: 428 DLNVPALRFGSENCA 442
D++ F +CA
Sbjct: 439 DIDGGMFSFAPADCA 453
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 123/409 (30%), Positives = 180/409 (44%), Gaps = 37/409 (9%)
Query: 54 SERIHKMFEISKARANYMASMSKPNAFQELEDIHLP-MAKQDLFYSVEVNIGTPMKPQHL 112
S R+ + ++ A + P + + + +++ Y + + +GTP ++
Sbjct: 92 SLRVESLTSLAAVSAGRNVTKRPPRSAGGFSGVVISGLSQGSGEYFMRLGVGTPATNMYM 151
Query: 113 LFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKC---QNG 166
+ DT S +VW QC PC C++Q+ P+F+P S T++ +PC LCR +C ++
Sbjct: 152 VLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLDDSSECVSRRSK 211
Query: 167 KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGF 226
C+Y Y G T G S ET F V +A GC +DN G +G+LG
Sbjct: 212 ACLYQVSYGDGSFTVGDFSTETLTFHGAR----VDHVALGCGHDNEGLFV--GAAGLLGL 265
Query: 227 NASPLSLSSQLRNRIQGLFSYCLVREM------EATSVIKFGRDADVRRRDLETTPILLS 280
LS SQ +NR G FSYCLV + S I FG A + TP+L +
Sbjct: 266 GRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTAVF--TPLLTN 323
Query: 281 -DLRPHFYLHLLEISIGRHIVRFPPGA----FDIMRDGTGGFIIDTGTPVTFIRNGPYQT 335
L +YL LL IS+G V PG F + G GG IID+GT VT + Y
Sbjct: 324 PKLDTFYYLQLLGISVGGSRV---PGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVA 380
Query: 336 LMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMY 393
L + LG R+ S FD C+ + P++ FH + + N
Sbjct: 381 LRDAF-----RLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGEVSLPASNYL 435
Query: 394 FIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
++GRFC A SI+G QQQ + YDL + F S C
Sbjct: 436 IPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 186/386 (48%), Gaps = 39/386 (10%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+ + L Y V + +GTP L+ DT S + W QC PC C P F+PR S+++ +
Sbjct: 132 LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFK 191
Query: 150 IPCDDPLCRS------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP- 201
+PC C + PF +G+ C+++ +Y G ++ GL + ET A N P
Sbjct: 192 LPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPV 251
Query: 202 ---RLAFGCSN-DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL---VREME 254
+ GC++ D G G SG+LG + P+S SQL +R FS+C + +
Sbjct: 252 KLSNITLGCADIDREGLPTGA--SGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLN 309
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRP-----HFYLHLLEISIGRHIVRFPPGAFDI 309
++ ++ FG ++D+ L TP++ + P ++Y+ L+ IS+ + FDI
Sbjct: 310 SSGLVFFG-ESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDI 368
Query: 310 MR-DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
+ G+GG IID+GT T+++ +Q + + + + R+ ++ N+ F CY S
Sbjct: 369 DKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDNSG--FTPCYNITS 424
Query: 369 SFKA-----YPSMTFHLQEADYIVQPENMYFI----EPDRGRFCVA--IQDDPKYSILGA 417
A PS+T H + +V P+N I ++ C+A + D ++I+G
Sbjct: 425 GTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGN 484
Query: 418 WQQQNMLIIYDLNVPALRFGSENCAN 443
+QQQN+ + YDL L CA
Sbjct: 485 YQQQNLWVEYDLEKLRLGIAPAQCAT 510
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/426 (27%), Positives = 180/426 (42%), Gaps = 19/426 (4%)
Query: 20 FLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNA 79
L S S LI I+S SP P N + + + R ++ S+ +
Sbjct: 40 ILNRKVGKRSHSVSFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSS- 98
Query: 80 FQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
++ + ++P+ Y ++V+ GTP + + L DT S + W C+ C C T PIF
Sbjct: 99 -KQDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCH-STAPIF 156
Query: 140 DPRASTTYSEIPCDDPLCRS-PFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
DP S++Y CD C+ C N KC + Y G G + + G
Sbjct: 157 DPAKSSSYKPFACDSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITL----GS 212
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
++P +FGC+ S LG + L + G FSYCL ++
Sbjct: 213 QYLPNFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSG 272
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+ G++A V L+ T ++ P FY + L IS+G + P +I G G
Sbjct: 273 SLVLGKEAAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVP--GTNIASGG--G 328
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
IID+GT +T + Y L + Q L SL Q P ++ D CY SS P++
Sbjct: 329 TIIDSGTTITHLVPSAYTALRDAFRQQLSSL--QPTPV---EDMDTCYDLSSSSVDVPTI 383
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
T HL +V P+ I + G C+A SI+G QQQN I++D+ + F
Sbjct: 384 TLHLDRNVDLVLPKENILITQESGLACLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGF 443
Query: 437 GSENCA 442
E CA
Sbjct: 444 AQEQCA 449
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 167/362 (46%), Gaps = 29/362 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
+S+ V +GTP +P ++ D S L+WTQC Q P+FD S+++S +PCD L
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKL 166
Query: 157 CRSPF----KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C + C + KC Y Y + T G+ + ETF F +G + L FGC +
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYGIMTAT-GVLATETFTFGAHHGVS--ANLTFGCGKLAN 223
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME-ATSVIKFGRDADVRR-- 269
G + SGILG + PLS+ QL FSYCL + TS + FG AD+ +
Sbjct: 224 GTI--AEASGILGLSPGPLSMLKQLAIT---KFSYCLTPFADRKTSPVMFGAMADLGKYK 278
Query: 270 --RDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
++T P+L + + ++Y+ ++ +S+G + P I DGTGG ++D+ T +
Sbjct: 279 TTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLA 338
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----YPSMTFHLQE 382
++ + L + + ++ R + ++ C+ P + H
Sbjct: 339 YLVEPAFTELKKAVMEGIKLPVANR----SVDDYPVCFELPRGMSMEGVQVPPLVLHFDG 394
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQDDP---KYSILGAWQQQNMLIIYDLNVPALRFGSE 439
+ P + YF EP G C+A+ P +++G QQQNM ++YD+ +
Sbjct: 395 DAEMSLPRDNYFQEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPT 454
Query: 440 NC 441
C
Sbjct: 455 KC 456
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 126/364 (34%), Positives = 163/364 (44%), Gaps = 26/364 (7%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y + +GTP + +++ DT S +VW QC PC +C+ Q P+FDP S TY+
Sbjct: 122 LAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAG 181
Query: 150 IPCDDPLCR---SPFKCQNGK--CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
IPC PLCR SP C N C Y Y G T G S ET F T V R+A
Sbjct: 182 IPCGAPLCRRLDSP-GCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRR----TRVTRVA 236
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEAT-SVIKFG 262
GC +DN G +G+LG LS Q R FSYCLV R A S + FG
Sbjct: 237 LGCGHDNEGLFI--GAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFG 294
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVR-FPPGAFDIMRDGTGGFIIDT 321
D+ V R T I L +YL LL IS+G VR F + G GG IID+
Sbjct: 295 -DSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDS 353
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDSSFKA-YPSMTFH 379
GT VT + Y L + +G + A FD C+ + P++ H
Sbjct: 354 GTSVTRLTRPAYIALRDAF-----RVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLH 408
Query: 380 LQEADYIVQPENMYFIEPDR-GRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFG 437
+ AD + P Y I D G FC A SI+G QQQ + +DL + F
Sbjct: 409 FRGAD-VSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFA 467
Query: 438 SENC 441
C
Sbjct: 468 PRGC 471
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 162/368 (44%), Gaps = 29/368 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +N+G P ++ DT S L+W QC PC C+ Q TP++DPR+S+T+ IPC P
Sbjct: 88 YFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPR 147
Query: 157 CRSPFK-----CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
CR + + G CVY Y G + G + + FP T V + GC +DN
Sbjct: 148 CRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDD---THVHNVTLGCGHDN 204
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC----LVREMEATSVIKFGRDADV 267
G +G+LG LS +QL +FSYC L R +S + FGR +
Sbjct: 205 VGLLE--SAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTPEP 262
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISI-GRHIVRFPPGAFDIM-RDGTGGFIIDTGTP 324
TP+ + RP +Y+ ++ S+ G + F + + G GG ++D+GT
Sbjct: 263 PSTAF--TPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVDSGTA 320
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY--RYDSSFKA---YPSMTFH 379
++ Y + +D + G R FD CY R + + A PS+ H
Sbjct: 321 ISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIVLH 380
Query: 380 LQEADYIVQPENMYFIEPDRGR----FCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPAL 434
+ P+ Y I G FC+ +Q D ++LG QQQ +++D+ +
Sbjct: 381 FAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVERGRI 440
Query: 435 RFGSENCA 442
F C+
Sbjct: 441 GFTPNGCS 448
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 155/352 (44%), Gaps = 21/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC + C++Q +FDP S+TY+ + C P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 238
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 239 ACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 295
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++CL T + FG + R L
Sbjct: 296 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAAR--LT 351
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L+ + +Y+ L I +G ++ P F T G I+D+GT +T + Y
Sbjct: 352 TTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVF-----ATAGTIVDSGTVITRLPPAAY 406
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 407 SSLRSAFAAAMSARGYKKAP--AVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASG 464
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+A +D I+G Q + + YD+ + F C
Sbjct: 465 IMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 126/462 (27%), Positives = 203/462 (43%), Gaps = 33/462 (7%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSL--KLIPIFSPESPLYPGNLSQSERIH 58
+A+ + + S +FL+ F ++ FS +LI I SP SP + + + + R+
Sbjct: 5 VAYHNFISFTSLIIILSTVFLSSFAIIQADKFSFTAELIHIDSPNSPFFNASETTTHRLA 64
Query: 59 KMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTAS 118
K + S R + +S + E +H + D Y +++ IGTP H DT S
Sbjct: 65 KALQRSANRVARLNPLSNSD-----EGVHASIFSGDGNYLMKLLIGTPPTEIHAAIDTGS 119
Query: 119 SLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR-SPFKCQNGK-CVYT-RRYH 175
+++W C C CF+Q++ IF+P AS+TY + PCD C + CQ+ C+Y+ H
Sbjct: 120 NVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKH 179
Query: 176 VGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLS 234
+ G + +T +G F +P F C N F G G++G LSL+
Sbjct: 180 QLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFVCGNSIYK-TFAGV--GVIGLGRGALSLT 236
Query: 235 SQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRDLETTPILLSDLR--PHFYLHLL 291
S+L + G FSYCL S I FG + + DLE L R ++Y+ L
Sbjct: 237 SKLYHLSDGKFSYCLADYYSKQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLE 296
Query: 292 EISIG--RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
IS+G R + + F G +ID+GT T + Y L +
Sbjct: 297 GISVGEKRQDLYYVDDPF---APPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPE-NP 352
Query: 350 QRIPYNASQEFDY--------CYRYDSSFKAYPSMTFHLQEADYIVQPENMYF-IEPDRG 400
Q P+N+ F C+ Y K +P +T H +AD + +N + + D
Sbjct: 353 QNHPHNSRFPFSMDNTLKLSPCFWYYPELK-FPKITIHFTDADVELSDDNSFIRVAEDVV 411
Query: 401 RFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
F A + ++ G+WQQ N ++ YDL + F +C+
Sbjct: 412 CFAFAATQPGQSTVYGSWQQMNFILGYDLKRGTVSFKRTDCS 453
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 118/414 (28%), Positives = 194/414 (46%), Gaps = 36/414 (8%)
Query: 51 LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQ 110
++ + I + +IS AR Y+ + + + A + + V ++G P PQ
Sbjct: 50 ITPEDHIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQ 109
Query: 111 HLLFDTASSLVWTQCQPCIRCFDQ--TTPIFDPRASTTYSEIPCDDPLCRSPFKCQNG-- 166
+ DT SSL+W QCQPC C P+F+P S+T+ E CDD CR G
Sbjct: 110 LTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSS 169
Query: 167 -KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-LAFGCSNDNSGFAFGGKISGIL 224
KCVY + Y G ++G+ ++E F NG T V + +AFGC +N G +GIL
Sbjct: 170 NKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYEN-GEQLESHFTGIL 228
Query: 225 GFNASPLSLSSQLRNRIQGLFSYC---LVREMEATSVIKFGRDADVRRRDLETTPILLSD 281
G A P SL+ QL ++ FSYC L + + + G DAD+ + TPI
Sbjct: 229 GLGAKPTSLAVQLGSK----FSYCIGDLANKNYGYNQLVLGEDADILG---DPTPIEFET 281
Query: 282 LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
+Y++L IS+G + P F R G I+D+GT T++ + Y+ L Y+
Sbjct: 282 ENSIYYMNLEGISVGDTQLNIEPVVFK-RRGPRTGVILDSGTLYTWLADIAYREL---YN 337
Query: 342 QILRSLGRQRIPYNASQEFDYCY--RYDSSFKAYPSMTFHLQ-EADYIVQPENMYF--IE 396
+I +S+ ++ ++F CY R +P +TFH A+ ++ +M++ E
Sbjct: 338 EI-KSILDPKLERFWFRDF-LCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLSE 395
Query: 397 PDR-GRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
P+ FC++++ + +++ +G QQ I YDL + +C
Sbjct: 396 PNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCV 449
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 109/340 (32%), Positives = 158/340 (46%), Gaps = 23/340 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V IG P +L+ DT S + W QC PC C+ Q PIF+P +S ++S + C+
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQ 208
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
CRS +C+N C+Y Y G T G ET G V +A GC ++N G
Sbjct: 209 CRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITL----GSAPVDNVAIGCGHNNEGL 264
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG LS SQ+ FSYCLV R+ E+ S ++F +
Sbjct: 265 FV--GAAGLLGLGGGSLSFPSQINATS---FSYCLVDRDSESASTLEFNS---TLPPNAV 316
Query: 274 TTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
+ P+L + L +Y+ L +S+G +V P AF I G GG I+D+GT +T ++
Sbjct: 317 SAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDV 376
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
Y +L + + R L N FD CY S P+++FH + + P
Sbjct: 377 YNSLRDAFVKRTRDLPST----NGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAK 432
Query: 392 MYFIEPD-RGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDL 429
Y + D G FC A SI+G QQQ ++YDL
Sbjct: 433 NYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDL 472
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 169/375 (45%), Gaps = 36/375 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT-TPIFDPRASTTYSEIPCDDP 155
Y V + IGTP + L+ DT S L+W +C PC C ++ F R STTYS I C P
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSP 145
Query: 156 LCR-----SPFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAF 205
C+ P C + C Y Y T G S+E G + L+F
Sbjct: 146 QCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSF 205
Query: 206 GCSNDNSGFAFGGK----ISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA---TSV 258
GC SG + G G++G +P+S SSQL R FSYCL+ + TS
Sbjct: 206 GCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPTSF 265
Query: 259 IKFGRDADV---RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFP--PGAFDIMRDG 313
+ G +V ++ + TP+L++ L P FY ++ + + V+ P P + I G
Sbjct: 266 LTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIK-GVYVNGVKLPINPSVWSIDDLG 324
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA--SQEFDYCYRYDSSFK 371
GG IID+GT +TFI Y +++ + + R ++P A + FD C +
Sbjct: 325 NGGTIIDSGTTLTFITEPAYTEILKAFKK------RVKLPSPAEPTPGFDLCMNVSGVTR 378
Query: 372 -AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIY 427
A P M+F+L P YFIE C+A+Q D +S+LG QQ L+ +
Sbjct: 379 PALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEF 438
Query: 428 DLNVPALRFGSENCA 442
D + L F CA
Sbjct: 439 DRDKSRLGFTRRGCA 453
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 118/431 (27%), Positives = 185/431 (42%), Gaps = 37/431 (8%)
Query: 33 SLKLIPIFSPESPLYPGNLSQSERIHK-MFEISKARANYMAS-----MSKPNAFQELEDI 86
SL+++ P S L +++ H + + R Y+ S + N +EL+
Sbjct: 66 SLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDST 125
Query: 87 HLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDP 141
LP L Y V V +GTP + L+FDT S L WTQC+PC C+ Q PIFDP
Sbjct: 126 TLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDP 185
Query: 142 RASTTYSEIPCDDPLC---RSP--FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG 196
S++Y+ I C LC RS + C+Y +Y ++RG S+E +
Sbjct: 186 SKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD- 244
Query: 197 FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT 256
V FGC DN G G +G++G + P+S Q + +FSYCL +
Sbjct: 245 --IVHDFLFGCGQDNEGLFRG--TAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPSSL 300
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTG 315
+ FG A +L+ TP FY L ++ IS+G + P A G
Sbjct: 301 GHLTFGASA-ATNANLKYTPFSTISGENSFYGLDIVGISVGG--TKLP--AVSSSTFSAG 355
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--AY 373
G IID+GT +T + Y L + Q + + + Y ++ D CY + S +K +
Sbjct: 356 GSIIDSGTVITRLPPTAYAALRSAFRQFMM---KYPVAY-GTRLLDTCYDF-SGYKEISV 410
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYDLN 430
P + F + P + C+A + +I G QQ+ + ++YD+
Sbjct: 411 PRIDFEFAGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVE 470
Query: 431 VPALRFGSENC 441
+ FG+ C
Sbjct: 471 GGRIGFGAAGC 481
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 171/373 (45%), Gaps = 33/373 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT-TPIFDPRASTTYSEIPCDDP 155
Y V++ IG P + L+ DT S LVW +C C C + +F PR S+T+S C DP
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 143
Query: 156 LCRSPFK------CQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVPRLA 204
+CR K C + + C Y Y G +T GL +RET + +G + +A
Sbjct: 144 VCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVA 203
Query: 205 FGC-----SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA---T 256
FGC SG +F G +G++G P+S +SQL R FSYCL+ + T
Sbjct: 204 FGCGFRISGQSVSGTSFNGA-NGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPT 262
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
S + G D + L TP+L + L P F Y+ L + + +R P ++I G G
Sbjct: 263 SYLIIGNGGDGISK-LFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNG 321
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--- 372
G ++D+GT + F+ Y++++ +R + I + FD C K
Sbjct: 322 GTVVDSGTTLAFLAEPAYRSVI----AAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKI 377
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD-DPK--YSILGAWQQQNMLIIYDL 429
P + F V P YFIE + C+AIQ DPK +S++G QQ L +D
Sbjct: 378 LPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDR 437
Query: 430 NVPALRFGSENCA 442
+ L F CA
Sbjct: 438 DRSRLGFSRRGCA 450
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 130/363 (35%), Positives = 164/363 (45%), Gaps = 24/363 (6%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y + +GTP + +++ DT S +VW QC PC +C+ QT +FDP S TY+
Sbjct: 111 LAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAG 170
Query: 150 IPCDDPLCR---SPFKCQNGK--CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
IPC PLCR SP C N C Y Y G T G S ET F RN T R+A
Sbjct: 171 IPCGAPLCRRLDSP-GCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFR-RNRVT---RVA 225
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEAT-SVIKFG 262
GC +DN G +G+LG LS Q R FSYCLV R A S + FG
Sbjct: 226 LGCGHDNEGLF--TGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFG 283
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVR-FPPGAFDIMRDGTGGFIIDT 321
D+ V R T I L +YL LL IS+G VR F + G GG IID+
Sbjct: 284 -DSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDS 342
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL 380
GT VT + Y L + L +R P FD C+ + P++ H
Sbjct: 343 GTSVTRLTRPAYIALRDAFRIGASHL--KRAP--EFSLFDTCFDLSGLTEVKVPTVVLHF 398
Query: 381 QEADYIVQPENMYFIEPDR-GRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ AD + P Y I D G FC A SI+G QQQ I YDL + F
Sbjct: 399 RGAD-VSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAP 457
Query: 439 ENC 441
C
Sbjct: 458 RGC 460
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 123/401 (30%), Positives = 176/401 (43%), Gaps = 37/401 (9%)
Query: 59 KMFEISKARANYMASMSKPNAFQELEDI----HLPMAKQDLFYSVEVNIGTPMKPQHLLF 114
K E SK + Y+ S S P + L+++ H+ + ++IG P PQ LL
Sbjct: 38 KTQESSKIKIGYLHSKSTPAS--RLDNLWTVSHVTPIPNPAAFLANISIGNPPVPQLLLI 95
Query: 115 DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP--FKCQ-NGKCVYT 171
DT S L W C PC +C+ QT P F P S+TY C P F+ + G C Y
Sbjct: 96 DTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQYH 154
Query: 172 RRYHVGDVTRGLASRETFAFPVR-NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASP 230
RY TRG+ + E F +G + FGC DNSGF K SG+LG P
Sbjct: 155 LRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSGFT---KYSGVLGLG--P 209
Query: 231 LSLSSQLRNRIQGLFSYCLVREMEAT---SVIKFGRDADVRRRDLETTPILLSDLRPHFY 287
+ S RN FSYC T +++ G A + E P L + +Y
Sbjct: 210 GTFSIVTRN-FGSKFSYCFGSLTNPTYPHNILILGNGAKI-----EGDPTPLQIFQDRYY 263
Query: 288 LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSL 347
L L IS G ++ PG F R GG +IDTG T + Y+TL + D +L +
Sbjct: 264 LDLQAISFGEKLLDIEPGTFQRYRS-QGGTVIDTGCSPTILAREAYETLSEEIDFLLGEV 322
Query: 348 GRQRIPYNASQEFDYCYRYDSSFKAY--PSMTFHLQ-EADYIVQPENMYFIEPDRGRFCV 404
R+ ++ Q CY + Y P +TFH A+ + E+++ FC+
Sbjct: 323 LRRVKDWD--QYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCL 380
Query: 405 AIQ----DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
A+ DD S++GA QQN + Y+L + F +C
Sbjct: 381 AMTMNTFDD--MSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/375 (30%), Positives = 173/375 (46%), Gaps = 41/375 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT--PIFDPRASTTYSEIPCDD 154
Y++ +++GTP ++ DT S+L+W QC PC RCF + T P+ P S+T+S +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 155 PLCR------SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C+ P C C Y Y G T G + ET V +G TF P++AFGC
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLT--VGDG-TF-PKVAFGC 205
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME--ATSVIKFGRDA 265
S +N SGI+G PLSL SQL G FSYCL +M S I FG A
Sbjct: 206 STENG----VDNSSGIVGLGRGPLSLVSQL---AVGRFSYCLRSDMADGGASPILFGSLA 258
Query: 266 DVRRRD-LETTPILLS---DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGFIID 320
+ +++TP+L + H+Y++L I++ + F + G GG I+D
Sbjct: 259 KLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVD 318
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----YPSM 376
+GT +T++ Y + Q + + +L + A + D CY+ + P +
Sbjct: 319 SGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRL 378
Query: 377 TFHLQEADYIVQPENMYF--IEPD-RGRFCVA------IQDDPKYSILGAWQQQNMLIIY 427
P YF +E D +GR VA DD SI+G Q +M ++Y
Sbjct: 379 ALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLY 438
Query: 428 DLNVPALRFGSENCA 442
D++ F +CA
Sbjct: 439 DIDGGMFSFAPADCA 453
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 126/438 (28%), Positives = 190/438 (43%), Gaps = 56/438 (12%)
Query: 25 TSSESTGFSLKLIPIFSPESPL--YPGNLSQSERIHKMFEISKARANYMAS-----MSKP 77
T T SL+++ P S L + G + + K R Y+ S + +
Sbjct: 63 TKGPKTKASLEVVHKHGPCSQLNDHDGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQD 122
Query: 78 NAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CF 132
++ +EL+ LP L Y V V +GTP + L+FDT S L WTQC+PC R C+
Sbjct: 123 SSVEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCY 182
Query: 133 DQTTPIFDPRASTTYSEIPC-------------DDPLCRSPFKCQNGKCVYTRRYHVGDV 179
Q IFDP ST+YS I C +DP C + K C+Y +Y
Sbjct: 183 KQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKA----CIYGIQYGDSSF 238
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
+ G SRE + V FGC +N G FGG +G++G P+S Q
Sbjct: 239 SVGYFSRERLTVTATD---VVDNFLFGCGQNNQGL-FGGS-AGLIGLGRHPISFVQQTAA 293
Query: 240 RIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRH 298
+ + +FSYCL +T + FG A R L+ TP FY L + I++G
Sbjct: 294 KYRKIFSYCLPSTSSSTGHLSFGPAATGRY--LKYTPFSTISRGSSFYGLDITAIAVGG- 350
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
V+ P + TGG IID+GT +T + Y L + Q G + P
Sbjct: 351 -VKLPVSSSTF---STGGAIIDSGTVITRLPPTAYGALRSAFRQ-----GMSKYPSAGEL 401
Query: 359 E-FDYCYRYDSSFKAY--PSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQ---DDPK 411
D CY S +K + P++ F + + P+ + F+ + + C+A DD
Sbjct: 402 SILDTCYDL-SGYKVFSIPTIEFSFAGGVTVKLPPQGILFVASTK-QVCLAFAANGDDSD 459
Query: 412 YSILGAWQQQNMLIIYDL 429
+I G QQ+ + ++YD+
Sbjct: 460 VTIYGNVQQRTIEVVYDV 477
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 163/364 (44%), Gaps = 19/364 (5%)
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+ M + Y V + +G+P + Q+++ D+ S +VW QCQPC +C+ Q+ P+FDP S
Sbjct: 128 DVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADS 187
Query: 145 TTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+++ + C +C C G+C Y Y G T+G + ET F G T V
Sbjct: 188 ASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRS 243
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKF 261
+A GC + N G +G+LG +S QL + G FSYCLV R +++ + F
Sbjct: 244 VAIGCGHRNRGMFV--GAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVF 301
Query: 262 GRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
GR+A P++ + P F Y+ L + +G V F + G GG ++D
Sbjct: 302 GREA--LPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMD 359
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFH 379
TGT VT + YQ + +L R FD CY P+++F+
Sbjct: 360 TGTAVTRLPTLAYQAFRDAFLAQTANLPRA----TGVAIFDTCYDLLGFVSVRVPTVSFY 415
Query: 380 LQEADYIVQPENMYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+ P + I D G FC A SILG QQ+ + I +D + FG
Sbjct: 416 FSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFG 475
Query: 438 SENC 441
C
Sbjct: 476 PNIC 479
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 113/355 (31%), Positives = 172/355 (48%), Gaps = 25/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTTPIFDPRASTTYSEIPCD 153
Y + +G P++ + DT S + W QCQPC C+ Q PIFDP++S++YS + CD
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 154 DPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C C C+Y Y G T G + ETF+F N +P L GC +DN
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS---IPNLPIGCGHDN 300
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRR 270
G G++G +SLSSQL FSYCLV + E++S + F D +
Sbjct: 301 EGLFV--GADGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESSSTLDFNAD---QPS 352
Query: 271 DLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
D T+P++ +D P F Y+ ++ +S+G + +F+I G+GG I+D+GT +T I
Sbjct: 353 DSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIP 412
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
+ Y L + + ++L P FD CY S S P++ F L + +
Sbjct: 413 SDVYDVLRDAFVGLTKNLP----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQL 468
Query: 389 PENMYFIEPDR-GRFCVA-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P I+ D G FC+A + SI+G QQQ + + YDL + F ++ C
Sbjct: 469 PAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 117/374 (31%), Positives = 173/374 (46%), Gaps = 37/374 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ +GTP K L+ DT S L W QC PC CF+Q P ++P S++Y I C DP
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPR 229
Query: 157 CR---SP-----FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C+ SP K +N C Y Y G T G + ETF + + F V +
Sbjct: 230 CQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDV 289
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N GF G+LG PLS SQL++ FSYCL TSV +
Sbjct: 290 MFGCGHWNKGFFH--GAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLI 347
Query: 261 FGRDAD-VRRRDLETTPILLSDLRPH---FYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D + + +L T +L + P +YL + I +G ++ P + +G GG
Sbjct: 348 FGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGG 407
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF--DYCYRYDSSFKA-Y 373
IID+G+ +TF + Y + + +++ ++ Q+I A+ +F CY + +
Sbjct: 408 TIIDSGSTLTFFPDSAYDVIKEAFEKKIK---LQQI---AADDFIMSPCYNVSGAMQVEL 461
Query: 374 PSMTFHLQEADYIVQPENMYF--IEPDRGRFCVAIQDDPKYS---ILGAWQQQNMLIIYD 428
P H + P YF EPD C+AI P +S I+G QQN I+YD
Sbjct: 462 PDYGIHFADGAVWNFPAENYFYQYEPDE-VICLAILKTPNHSHLTIIGNLLQQNFHILYD 520
Query: 429 LNVPALRFGSENCA 442
+ L + CA
Sbjct: 521 VKRSRLGYSPRRCA 534
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 139/468 (29%), Positives = 201/468 (42%), Gaps = 46/468 (9%)
Query: 4 VQALPLAAFFSY-FSVLFLTHFTSSESTGFSLKL-----IPIFSPESP--LYPGNLSQ-S 54
V LP +A S+ S F S +T S+ L + FS SP L+ L + S
Sbjct: 35 VNTLPSSATLSWPESKSFSDESVSESTTSLSVHLSHVDALSSFSDASPVDLFKLRLQRDS 94
Query: 55 ERIHKMFEISKARANYMASMSKPNAFQELEDIHLP-MAKQDLFYSVEVNIGTPMKPQHLL 113
R+ + ++ A+ P + + +++ Y + + +GTP +++
Sbjct: 95 LRVKSITSLAAVSTGRNATKRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMV 154
Query: 114 FDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKC---QNGK 167
DT S +VW QC PC C++Q+ IFDP+ S T++ +PC LCR +C ++
Sbjct: 155 LDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKT 214
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C+Y Y G T G S ET F V + GC +DN G +G+LG
Sbjct: 215 CLYQVSYGDGSFTEGDFSTETLTFHGAR----VDHVPLGCGHDNEGLFV--GAAGLLGLG 268
Query: 228 ASPLSLSSQLRNRIQGLFSYCLVREM------EATSVIKFGRDADVRRRDLETTPILLS- 280
LS SQ ++R G FSYCLV + S I FG DA + TP+L +
Sbjct: 269 RGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVF--TPLLTNP 326
Query: 281 DLRPHFYLHLLEISIGRHIVRFPPGA----FDIMRDGTGGFIIDTGTPVTFIRNGPYQTL 336
L +YL LL IS+G V PG F + G GG IID+GT VT + Y L
Sbjct: 327 KLDTFYYLQLLGISVGGSRV---PGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVAL 383
Query: 337 MQRYDQILRSLGRQRIPYNASQE-FDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYF 394
+ LG ++ S FD C+ + P++ FH + + N
Sbjct: 384 RDAF-----RLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLI 438
Query: 395 IEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
GRFC A SI+G QQQ + YDL + F S C
Sbjct: 439 PVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 109/340 (32%), Positives = 156/340 (45%), Gaps = 24/340 (7%)
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQN--GK 167
++ DT S + W QCQPC C+ Q+ P+FDP S +Y+ + CD CR C+N G
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C+Y Y G T G + ET T V +A GC +DN G +G+L
Sbjct: 61 CLYEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAIGCGHDNEGLFV--GAAGLLALG 115
Query: 228 ASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHF 286
PLS SQ+ FSYCLV R+ A S ++FG A T P++ S F
Sbjct: 116 GGPLSFPSQISAST---FSYCLVDRDSPAASTLQFGDGA--AEAGTVTAPLVRSPRTSTF 170
Query: 287 -YLHLLEISIGRHIVRFPPGAFDI-MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQIL 344
Y+ L IS+G + P AF + G+GG I+D+GT VT +++ Y L + Q
Sbjct: 171 YYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGA 230
Query: 345 RSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTFHLQEADYIVQPENMYFIEPD-RGRF 402
SL R + FD CY D + P+++ + + P Y I D G +
Sbjct: 231 PSLPRT----SGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTY 286
Query: 403 CVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A + SI+G QQQ + +D A+ F C
Sbjct: 287 CLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 153/354 (43%), Gaps = 24/354 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V + +GTP ++FDT S W QCQPC+ C+ Q P+F P S TY+ I C
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSS 224
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G T G +++T G+ V FGC N G
Sbjct: 225 YCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTL----GYDTVKDFRFGCGEKNRG 280
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
GK +G++G S+ Q ++ G+F+YC+ T + F +
Sbjct: 281 LF--GKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLDF-GPGAPAAANAR 337
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TP+L+ + +Y+ + I +G H++ P F G ++D+GT +T + Y
Sbjct: 338 LTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFS-----DAGALVDSGTVITRLPPSAY 392
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYR---YDSSFKAYPSMTFHLQEADYIVQPE 390
+ L + + + LG + P A D CY Y S A P+++ Q +
Sbjct: 393 EPLRSAFAKGMEGLGYKTAP--AFSILDTCYDLTGYQGSI-ALPAVSLVFQGGACLDVDA 449
Query: 391 NMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ D + C+A DD +I+G QQ+ ++YDL + F C
Sbjct: 450 SGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 122/376 (32%), Positives = 180/376 (47%), Gaps = 36/376 (9%)
Query: 50 NLSQSERIHKMFEISKARA-NYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
L+ E + +M SKARA ++S + D +P + Y V + IGTP +
Sbjct: 38 GLAARELMQRMALRSKARAARRLSSSASAPVSPGTYDNGVPTTE----YLVHLAIGTPPQ 93
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--------SP 160
P L DT S L+WTQCQPC CFDQ P FDP S+T S CD LC+ SP
Sbjct: 94 PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSP 153
Query: 161 FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI 220
N CVYT Y VT G + F F V G + VP +AFGC N+G F
Sbjct: 154 KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF-VGAGAS-VPGVAFGCGLFNNG-VFKSNE 210
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV---RRRDLETT 275
+GI GF PLSL SQL+ G FS+C V ++ ++V+ AD+ R +++T
Sbjct: 211 TGIAGFGRGPLSLPSQLK---VGNFSHCFTAVNGLKPSTVL-LDLPADLYKSGRGAVQST 266
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
P++ + P F YL L I++G + P F +++GTGG IID+GT +T + Y+
Sbjct: 267 PLIQNPANPTFYYLSLKGITVGSTRLPVPESEF-ALKNGTGGTIIDSGTAMTSLPTRVYR 325
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFD--YCYRYDSSFKAY-PSMTFHLQEADYIVQPEN 391
+ + + ++P + D +C K Y P + H + A + EN
Sbjct: 326 LVRDAFA------AQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPREN 379
Query: 392 MYFIEPDRGRFCVAIQ 407
+++ R + ++
Sbjct: 380 YVWLKHYPKRLLIRVK 395
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 161/358 (44%), Gaps = 26/358 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + DT S W QC+PC C++Q +FDP S+TYS+I C
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRE 193
Query: 157 CRS-----PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C+ C + KC Y Y T G +R+T + VP FGC ++
Sbjct: 194 CQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDA---VPGFVFGCGHN 250
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR 270
N+G +F G+I G+LG SLSSQ+ R FSYCL AT + F A
Sbjct: 251 NAG-SF-GEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLSFSGAAAAAPT 308
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+ + T ++ +YL+L I++ ++ PP F G IID+GT + +
Sbjct: 309 NAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATA----AGTIIDSGTAFSCLPP 364
Query: 331 GPYQTLMQRYDQILRSLGR-QRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYI-V 387
Y L + ++GR +R P +S FD CY PS+ + + +
Sbjct: 365 SAYAALRS---SVRSAMGRYKRAP--SSTIFDTCYDLTGHETVRIPSVALVFADGATVHL 419
Query: 388 QPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
P + + + + C+A DD +LG QQ+ + +IYD++ + FG+ CA
Sbjct: 420 HPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGCA 477
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 163/363 (44%), Gaps = 32/363 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + + +GTP +++ DT S +VW QC PC C++QT IFDP+ S T++ +PC L
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRL 194
Query: 157 CR---SPFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
CR +C ++ C+Y Y G T G S ET F V + GC +D
Sbjct: 195 CRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR----VDHVPLGCGHD 250
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM------EATSVIKFGRD 264
N G +G+LG LS SQ +NR G FSYCLV + S I FG +
Sbjct: 251 NEGLFV--GAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFG-N 307
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGA----FDIMRDGTGGFIID 320
A V + + T + L +YL LL IS+G V PG F + G GG IID
Sbjct: 308 AAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRV---PGVSESQFKLDATGNGGVIID 364
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFH 379
+GT VT + Y L + L + +R P + FD C+ + P++ FH
Sbjct: 365 SGTSVTRLTQPAYVALRDAFR--LGATKLKRAP--SYSLFDTCFDLSGMTTVKVPTVVFH 420
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ + N GRFC A SI+G QQQ + YDL + F S
Sbjct: 421 FGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 480
Query: 439 ENC 441
C
Sbjct: 481 RAC 483
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/349 (30%), Positives = 165/349 (47%), Gaps = 25/349 (7%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTTPIFDPRASTTYSEIPCDDPLCR- 158
+G P +P + DT S + W QC PC C++Q TPIFDP S++Y+ + CD C+
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62
Query: 159 -SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
C C+Y Y G T G + ET F N +P ++ GC +DN G
Sbjct: 63 LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNS---IPNISIGCGHDNEGLFV- 118
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRDLETTP 276
G++G +S+SSQL+ FSYCLV + + S + F D D +P
Sbjct: 119 -GADGLIGLGGGAISISSQLK---ASSFSYCLVDIDSPSFSTLDFNTDP---PSDSLISP 171
Query: 277 ILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQT 335
++ +D P F Y+ ++ +S+G + F+I G GG I+D+GT +T + + Y+
Sbjct: 172 LVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEV 231
Query: 336 LMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYF 394
L + + + +L P FD CY S S P++ F L + + P
Sbjct: 232 LREAFLGLTTNLP----PAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCL 287
Query: 395 IEPDR-GRFCVA-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
I+ D G FC+A + SI+G +QQQ + + YDL + F + C
Sbjct: 288 IQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/355 (31%), Positives = 173/355 (48%), Gaps = 25/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTTPIFDPRASTTYSEIPCD 153
Y + +G P++ + DT S + W QCQPC C+ Q PIFDP++S++YS + CD
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 154 DPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C C C+Y Y G T G + ETF+F N +P L GC +DN
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS---IPNLPIGCGHDN 300
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRR 270
G +G++G +SLSSQL FSYCLV + E++S + F D +
Sbjct: 301 EGLFV--GAAGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESSSTLDFNAD---QPS 352
Query: 271 DLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
D T+P++ +D P F Y+ ++ +S+G + +F+I G+GG I+D+GT +T I
Sbjct: 353 DSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIP 412
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
+ Y L + + ++L P FD CY S S P++ F L + +
Sbjct: 413 SDVYDVLRDAFVGLTKNLP----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQL 468
Query: 389 P-ENMYFIEPDRGRFCVA-IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P +N F G FC+A + SI+G QQQ + + YDL + F ++ C
Sbjct: 469 PAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 114/407 (28%), Positives = 170/407 (41%), Gaps = 41/407 (10%)
Query: 59 KMFEISKARANYMAS-MSKPNA---FQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQ 110
++ + +AR N + S +SK A E + LP Y V V +GTP
Sbjct: 58 EILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDL 117
Query: 111 HLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDPLCRS-------PFK 162
L+FDT S L WTQCQPC+R C+DQ PIF+P ST+Y + C C S
Sbjct: 118 SLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGS 177
Query: 163 CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
C C+Y +Y + G ++E F + F V FGC +N G G ++G
Sbjct: 178 CSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGV---YFGCGENNQGLFTG--VAG 232
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPI-LLSD 281
+LG LS SQ +FSYCL T + FG R ++ TPI ++D
Sbjct: 233 LLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITD 290
Query: 282 LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
+ L+++ I++G + P F T G +ID+GT +T + Y L +
Sbjct: 291 GTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVITRLPPKAYAALRSSFK 345
Query: 342 QILRSLGRQRIPYNASQE-FDYCYRYDSSFKAY--PSMTFHLQEADYIVQPENMYFIEPD 398
+ + P + D C+ S FK P + F + F
Sbjct: 346 AKM-----SKYPTTSGVSILDTCFDL-SGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK 399
Query: 399 RGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ C+A DD +I G QQQ + ++YD + F C+
Sbjct: 400 ISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 446
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 124/459 (27%), Positives = 193/459 (42%), Gaps = 45/459 (9%)
Query: 9 LAAFFSYFSVL--FLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKA 66
+A F + VL F +S STG L++ + Y + ER+ + +S+
Sbjct: 1 MARTFVFLLVLLCFRASLVTSSSTGAGLRMKLTHVDDKAGY----TTEERVRRAVAVSRE 56
Query: 67 RANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
R Y + + D+ P+ Y E IG P + L DT S+L+WTQC
Sbjct: 57 RLAYT---QQQQQLRASGDVSAPVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCG 113
Query: 127 PCI---RCFDQTTPIFDPRASTTYSEIPCDDP--LCRSP---FKCQNGKCVYTRRYHVGD 178
C Q P ++ S+T++ +PC D LC + +G C + Y G
Sbjct: 114 TTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLCGLDGSCTFAASYGAGS 173
Query: 179 VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN--SGFAFGGKISGILGFNASPLSLSSQ 236
V S T AF ++G +L FGC + + A G SG++G LSL SQ
Sbjct: 174 V---FGSLGTEAFTFQSG---AAKLGFGCVSLTRITKGALNGA-SGLIGLGRGRLSLVSQ 226
Query: 237 LRNRIQGLFSYCL---VREMEATSVIKFGRDADVRRRDLETTPILLSD------LRPHFY 287
FSYCL +R A+S + G A + T I +Y
Sbjct: 227 TGAT---KFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYY 283
Query: 288 LHLLEISIGRHIVRFPPGAFDIMRDG----TGGFIIDTGTPVTFIRNGPYQTLMQRYDQI 343
L L+ IS+G + P AF++ R +GG IIDTG+PVT + Y L D++
Sbjct: 284 LPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALS---DEV 340
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFC 403
R L R + A D C K P + FH + Y+ D+ C
Sbjct: 341 ARQLNRSLVQPPADTGLDLCVARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKSTAC 400
Query: 404 VAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ I++ +++G +QQQ++ ++YD+ L F + +C+
Sbjct: 401 MLIEEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADCS 439
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 114/407 (28%), Positives = 170/407 (41%), Gaps = 41/407 (10%)
Query: 59 KMFEISKARANYMAS-MSKPNA---FQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQ 110
++ + +AR N + S +SK A E + LP Y V V +GTP
Sbjct: 86 EILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDL 145
Query: 111 HLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDPLCRS-------PFK 162
L+FDT S L WTQCQPC+R C+DQ PIF+P ST+Y + C C S
Sbjct: 146 SLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGS 205
Query: 163 CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
C C+Y +Y + G ++E F + F V FGC +N G G ++G
Sbjct: 206 CSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGV---YFGCGENNQGLFTG--VAG 260
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPI-LLSD 281
+LG LS SQ +FSYCL T + FG R ++ TPI ++D
Sbjct: 261 LLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITD 318
Query: 282 LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
+ L+++ I++G + P F T G +ID+GT +T + Y L +
Sbjct: 319 GTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVITRLPPKAYAALRSSFK 373
Query: 342 QILRSLGRQRIPYNASQE-FDYCYRYDSSFKAY--PSMTFHLQEADYIVQPENMYFIEPD 398
+ + P + D C+ S FK P + F + F
Sbjct: 374 AKM-----SKYPTTSGVSILDTCFDL-SGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK 427
Query: 399 RGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ C+A DD +I G QQQ + ++YD + F C+
Sbjct: 428 ISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 157/351 (44%), Gaps = 28/351 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ +GTP + DT S + WTQC PC+ C+ Q PIFDP S+T+ E
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEK------ 433
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFA 215
+C + C Y Y T+G + +T +G FV GC +NS F
Sbjct: 434 -----RCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSWFR 488
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETT 275
G +G N PLSL +Q+ GL SYC TS I FG +A V + +T
Sbjct: 489 --PSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAG--NGTSKINFGTNAIVGGGGVVST 544
Query: 276 PILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
+ ++ RP F YL+L +S+G + F + G +ID+GT +T+
Sbjct: 545 TMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALE---GNIVIDSGTTLTYFPESYCN 601
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPENMY 393
+ Q + ++ + +P D Y ++ + +P +T H AD ++ NM+
Sbjct: 602 LVRQAVEHVVPA-----VPAADPTGNDLLCYYSNTTEIFPVITMHFSGGADLVLDKYNMF 656
Query: 394 FIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
G FC+AI + + +I G Q N L+ YD + + F NC+
Sbjct: 657 MESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 150/336 (44%), Gaps = 46/336 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ IGTP + DT S L+WTQC PC+ C+DQ PIFDP S+T+ E C+ P
Sbjct: 65 YLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNTP- 123
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGFA 215
+ C Y Y T+G + ET +G FV P GCS +NSG
Sbjct: 124 --------DHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSG 175
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETT 275
F SGI+G + LSL SQ+ G S F + A
Sbjct: 176 FRPSSSGIVGLSRGSLSLISQMGGAYPG---------DGVVSTTMFAKTAK--------- 217
Query: 276 PILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQT 335
R +YL+L +S+G + F + G +ID+GTP+T+ Y
Sbjct: 218 -------RGQYYLNLDAVSVGDTRIETVGTPFHALN---GNIVIDSGTPLTYFPVS-YCN 266
Query: 336 LMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPENMYF 394
L+++ + R + R+ + S+ CY Y ++ + +P +T H AD ++ NMY
Sbjct: 267 LVRK--AVERVVTADRV-VDPSRNDMLCY-YSNTIEIFPVITVHFSGGADLVLDKYNMYM 322
Query: 395 IEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYD 428
G FC+AI + + +I G Q N L+ YD
Sbjct: 323 ELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 171/373 (45%), Gaps = 55/373 (14%)
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC--- 157
+G ++ DTAS L W QCQPC C DQ P+FDP +S +Y+ +PC+ C
Sbjct: 122 ATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDAL 181
Query: 158 -------RSPFKCQNGK---CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
SP N + C Y Y G +RG+ +R+ ++ FV FGC
Sbjct: 182 RVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEGFV----FGC 237
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFGRDAD 266
N G FGG SG++G S +SL SQ ++ G+FSYCL +RE ++ + G D+
Sbjct: 238 GTSNQGAPFGGT-SGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGSSGSLVLGDDSS 296
Query: 267 VRRRDLETTPILLSDL--------RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
R +TPI+ + + P ++L+L I++G V P F R I
Sbjct: 297 AYRN---STPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESP--WFSAGR-----VI 346
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKA-YP 374
ID+GT +T + + Y+ + Q Y + F D C+ + P
Sbjct: 347 IDSGTIITTL-------VPSVYNAVRAEFLSQLAEYPQAPAFSILDTCFNLTGLKEVQVP 399
Query: 375 SMTFHLQEADYIVQPEN---MYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYD 428
S+ F E V+ ++ +YF+ D + C+A ++ + SI+G +QQ+N+ +I+D
Sbjct: 400 SLKFVF-EGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFD 458
Query: 429 LNVPALRFGSENC 441
+ F E C
Sbjct: 459 TLGSQIGFAQETC 471
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 112/407 (27%), Positives = 170/407 (41%), Gaps = 41/407 (10%)
Query: 59 KMFEISKARANYMAS-MSKP---NAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQ 110
++ + +AR N + S +SK N + + LP Y V V +GTP
Sbjct: 87 EILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDL 146
Query: 111 HLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDPLCRS-------PFK 162
L+FDT S L WTQCQPC+R C+DQ PIF+P ST+Y + C C S
Sbjct: 147 SLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGS 206
Query: 163 CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
C C+Y +Y + G +++ F + F V FGC +N G G ++G
Sbjct: 207 CSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFDGV---YFGCGENNQGLFTG--VAG 261
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPI-LLSD 281
+LG LS SQ +FSYCL T + FG R ++ TPI ++D
Sbjct: 262 LLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITD 319
Query: 282 LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
+ L+++ I++G + P F T G +ID+GT +T + Y L +
Sbjct: 320 GTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVITRLPPKAYAALRSSFK 374
Query: 342 QILRSLGRQRIPYNASQE-FDYCYRYDSSFKAY--PSMTFHLQEADYIVQPENMYFIEPD 398
+ + P + D C+ S FK P + F + F
Sbjct: 375 AKM-----SKYPTTSGVSILDTCFDL-SGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK 428
Query: 399 RGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ C+A DD +I G QQQ + ++YD + F C+
Sbjct: 429 ISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 181/385 (47%), Gaps = 50/385 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ-TTPIFDPRASTTYSEIPCDDP 155
Y V +++GTP +P L DT S LVWTQC PC+ CFDQ P+ DP AS+T++ + CD P
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRCDAP 153
Query: 156 LCRS-PF-KCQNG-------KCVYTRRYHVGD--VTRGLASRETFAF-PVRN---GFTFV 200
+CR+ PF C G CVY YH GD +T G + + F F P N G
Sbjct: 154 VCRALPFTSCGRGGSSWGERSCVYV--YHYGDKSITVGKLASDRFTFGPGDNADGGGVSE 211
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVI 259
RL FGC + N G F +GI GF SL SQL FSYC E+T S++
Sbjct: 212 RRLTFGCGHFNKGI-FQANETGIAGFGRGRWSLPSQLGVTS---FSYCFTSMFESTSSLV 267
Query: 260 KFG-RDADVR-RRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFP-PGAFDIMRDGTG 315
G A++ +++TP+L +P Y L L I++G R P P +R+ +
Sbjct: 268 TLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVG--ATRIPIPERRQRLREASA 325
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQR---------IPYNASQEFDYCYR 365
IID+G +T + Y+ + + Q+ + +P A+ + + +R
Sbjct: 326 --IIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWR 383
Query: 366 YDSSFKA----YPSMTFHL-QEADYIVQPENMYFIEPDRGRFCV----AIQDDPKYSILG 416
+ +A P + FHL AD+ + EN F + C+ A + ++G
Sbjct: 384 WRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIG 443
Query: 417 AWQQQNMLIIYDLNVPALRFGSENC 441
+QQQN ++YDL L F C
Sbjct: 444 NYQQQNTHVVYDLENDVLSFAPARC 468
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 165/364 (45%), Gaps = 36/364 (9%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCD 153
L + V V GTP + L+FDT S + W QC PC C+ Q PIFDP S TYS +PC
Sbjct: 118 LEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCG 177
Query: 154 DPLCRSP-FKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
P C + KC NG C+Y +Y G T G+ S ET + +P AFGC N
Sbjct: 178 HPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARA---LPGFAFGCGETN 234
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
G G + G++G LSLSSQ FSYCL + + G D
Sbjct: 235 LGDF--GDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASGSD 292
Query: 272 -LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
+ T ++ P FY + L+ I +G ++ PP F RDGT ++D+GT +T++
Sbjct: 293 GVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILF--TRDGT---LLDSGTVLTYLP 347
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS---------SFKAYPSMTFHL 380
Y L R+ + Q P A FD CY + SFK +F L
Sbjct: 348 PEAYTALRDRFKFTM----TQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDL 403
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYDLNVPALRFG 437
++ P++ P G C+A P ++I+G QQ+N +IYD+ + F
Sbjct: 404 SPFGVLIFPDD---TAPATG--CLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFV 458
Query: 438 SENC 441
S +C
Sbjct: 459 SGSC 462
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 165/351 (47%), Gaps = 26/351 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V IGTP K L+FDT S L+WTQC+PC C+ + P+FDP S ++ +PC L
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPK-VPVFDPTKSASFKGLPCSSKL 190
Query: 157 CRSPFK-CQNGKCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVPRLAFGCSNDNSGF 214
C+S + C + KC Y Y + G + ET +F ++ F + GCS+ SG
Sbjct: 191 CQSIRQGCSSPKCTYLTAYVDNSSSTGTLATETISFSHLKYDFK---NILIGCSDQVSGE 247
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
+ G SGI+G N SP+SL+SQ N LFSYC+ +T + FG D+
Sbjct: 248 SLGE--SGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTGHLTFGGKVP---NDVRF 302
Query: 275 TPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
+P+ + + + + IS+G + AF I ID+G +T + Y
Sbjct: 303 SPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAS------TIDSGAVLTRLPPKAYS 356
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEF-DYCYRYDS-SFKAYPSMTFHLQEA-DYIVQPEN 391
L + ++++ P +F D CY + + S A PS++ + + +
Sbjct: 357 ALRSVFREMMKGY-----PLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSG 411
Query: 392 MYFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + P +C+A + D + SI G +QQ+ +++D + F C
Sbjct: 412 IMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 152/352 (43%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC+ C++Q +FDP S+TY+ I C P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAP 239
Query: 156 LCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 240 ACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 296
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++CL T + FG +
Sbjct: 297 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAAGARL 354
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F T G I+D+GT +T + Y
Sbjct: 355 TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFT-----TAGTIVDSGTVITRLPPAAY 409
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 410 SSLRSAFASAMAARGYKKAP--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 467
Query: 393 YFIEPDRGRFCV---AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+ A +D I+G Q + + YD+ + F C
Sbjct: 468 IMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 116/352 (32%), Positives = 166/352 (47%), Gaps = 29/352 (8%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE-IPCDDPLCRSPF 161
+GTP P L + + L+W P CF+Q P F+P T+S +P C SP
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPL---TFSRGLPFAS--CGSPK 55
Query: 162 KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKIS 221
N CVYT Y VT G + F F V G + VP +AFGC N+G F +
Sbjct: 56 FWPNQTCVYTYSYGDKSVTTGFLEVDKFTF-VGAGAS-VPGVAFGCGLFNNG-VFKSNET 112
Query: 222 GILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADV---RRRDLETTPI 277
GI GF PLSL SQL+ G FS+C A S + AD+ + ++TTP+
Sbjct: 113 GIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPL 169
Query: 278 L---LSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
+ ++ P +YL L I++G + P AF + +GTGG IID+GT +T + P
Sbjct: 170 IQYAKNEANPTLYYLSLKGITVGSTRLPVPESAF-ALTNGTGGTIIDSGTSITSL---PP 225
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENM 392
Q D+ + +P NA+ + C+ S K P + H + A + EN
Sbjct: 226 QVYQVVRDEFAAQIKLPVVPGNATGHYT-CFSAPSQAKPDVPKLVLHFEGATMDLPRENY 284
Query: 393 YFIEPDRGR---FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
F PD C+AI + +I+G +QQQNM ++YDL L F + C
Sbjct: 285 VFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 114/442 (25%), Positives = 194/442 (43%), Gaps = 45/442 (10%)
Query: 12 FFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANY 70
FF+ + L + G +L++ ++SP SP +P L E + +M +AR +
Sbjct: 12 FFTLAQGMHLNPKCGIQDQGSNLQVFHVYSPCSPFWPSKPLKWEESVLQMQAKDQARLQF 71
Query: 71 MASMSKPNAFQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
++S+ + +P+A Q Y V IGTP + L DT++ W C
Sbjct: 72 LSSLVARKSV-------VPIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPC 124
Query: 126 QPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNGKCVYTRRYHVGDVTRGL 183
C+ C ++ +F+ STT+ + C+ P C+ KC C + Y + L
Sbjct: 125 SGCVGC---SSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAANL 181
Query: 184 ASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG 243
S++ + +P FGC + +G + + G+LG P+SL SQ +N Q
Sbjct: 182 -SQDVVTLATDS----IPSYTFGCLTEATGSSIPPQ--GLLGLGRGPMSLLSQTQNLYQS 234
Query: 244 LFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIV 300
FSYCL R + + ++ G +R ++TTP+L + R +Y++L+ I +GR +V
Sbjct: 235 TFSYCLPSFRSLNFSGSLRLGPVGQPKR--IKTTPLLKNPRRSSLYYVNLMAIRVGRRVV 292
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF 360
PP A G I D+GT T + Y + D + +G + + F
Sbjct: 293 DIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAV---RDAFRKRVGNATV--TSLGGF 347
Query: 361 DYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSIL 415
D CY +S P++TF + + P+N+ C+A+ P +++
Sbjct: 348 DTCY---TSPIVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVI 404
Query: 416 GAWQQQNMLIIYDLNVPALRFG 437
QQQN I++D VP R G
Sbjct: 405 ANMQQQNHRILFD--VPNSRLG 424
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 119/433 (27%), Positives = 181/433 (41%), Gaps = 50/433 (11%)
Query: 37 IPIFSPESPLYPGNLSQSE--RIHKMFEISKARANYMASMSKPNAFQELEDI----HLPM 90
+P+ P P LS + ++AR+ Y+ S + D+ HL
Sbjct: 58 VPLVHRHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDADVSIPTHLGG 117
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYS 148
+ L Y V V +GTP Q LL DT S L W QCQPC C+ Q P+FDP S+TY+
Sbjct: 118 SVDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYA 177
Query: 149 EIPCDDPLCRS------PFKCQNG----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
IPC+ CR C +G +C + Y G TRG+ S ET A
Sbjct: 178 PIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPG---V 234
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV 258
V FGC +D G K G+LG +P SL Q + G FSYCL
Sbjct: 235 AVKDFRFGCGHDQDG--ANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGF 292
Query: 259 IKFGRDADVRRRDLETTPILLSDL----RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G + T+ + + + + +++ I++G + PP AF +
Sbjct: 293 LALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAF------S 346
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAY 373
GG IID+GT VT +++ Y L + + + + P + E D CY + S
Sbjct: 347 GGMIIDSGTVVTELQHTAYNALQAAFRKAMAAY-----PLVRNGELDTCYDFSGYSNVTL 401
Query: 374 P--SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYD 428
P ++TF + P + + C+A Q+ D + ILG Q+ + ++YD
Sbjct: 402 PKVALTFSGGATIDLDVPNGILLDD------CLAFQESGPDDQPGILGNVNQRTLEVLYD 455
Query: 429 LNVPALRFGSENC 441
+ F + C
Sbjct: 456 AGRGRVGFRAAVC 468
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 155/352 (44%), Gaps = 32/352 (9%)
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPFKCQNGK 167
++ DT S +VW QC PC RC++Q+ P+FDPR S++Y + C LCR + G
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C+Y Y G VT G ET F G V R+A GC +DN G +G+LG
Sbjct: 61 CMYQVAYGDGSVTAGDFVTETLTFA---GGARVARVALGCGHDNEGLFV--AAAGLLGLG 115
Query: 228 ASPLSLSSQLRNRIQGLFSYCLVREMEA----------TSVIKFGRDADVRRRDLETTPI 277
LS +Q+ R FSYCLV + +S + FG V TP+
Sbjct: 116 RGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGA-GSVGASSASFTPM 174
Query: 278 LLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMR----DGTGGFIIDTGTPVTFIRNGP 332
+ + + +Y+ L+ IS+G R P A +R G GG I+D+GT VT +
Sbjct: 175 VRNPRMETFYYVQLVGISVGG--ARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARAS 232
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPEN 391
Y L + + G R+ FD CY P+++ H P
Sbjct: 233 YSALRDAFRAA--AAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPE 290
Query: 392 MYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I D RG FC A D SI+G QQQ +++D + + F + C
Sbjct: 291 NYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/410 (25%), Positives = 170/410 (41%), Gaps = 29/410 (7%)
Query: 42 PESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEV 101
P + L + ++ + IH+ + + A K I L Y V +
Sbjct: 95 PHAELLNDDQARVDSIHRKIAAAASPVLDQARGKKGVTLPAQRGISLGTGN----YVVSM 150
Query: 102 NIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--- 158
+GTP + ++FDT S L W QC PC C++Q P+FDP S+TYS +PC P C+
Sbjct: 151 GLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPECQGLD 210
Query: 159 SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
S ++ KC Y Y T G +R+T + +P FGC ++G G
Sbjct: 211 SRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD---VLPGFVFGCGEQDTGLF--G 265
Query: 219 KISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG--RDADVRRRDLETTP 276
+ G++G +SLSSQ ++ FSYCL A + G A+ R +ET
Sbjct: 266 RADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPSAAGYLSLGGPAPANARFTAMETR- 324
Query: 277 ILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTL 336
D +Y+ L+ + + VR P F G +ID+GT +T + Y L
Sbjct: 325 ---HDSPSFYYVRLVGVKVAGRTVRVSPIVFS-----AAGTVIDSGTVITRLPPRVYAAL 376
Query: 337 MQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFI 395
+ + + G +R P A D CY + PS+ + +
Sbjct: 377 RSAFARSMGRYGYKRAP--ALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVLY 434
Query: 396 EPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ C+A D I+G QQ+ + ++YD+ + FG+ C+
Sbjct: 435 VAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 174/387 (44%), Gaps = 55/387 (14%)
Query: 85 DIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
D +P+ Q L Y V V +G + ++ DT S L W QCQPC RC++Q P+F+
Sbjct: 50 DTQIPLTSGIRLQSLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFN 107
Query: 141 PRASTTYSEIPCDDPLCRSPFKCQNGK----------CVYTRRYHVGDVTRGLASRETFA 190
P S +Y + C+ CRS + G C Y Y G T G E
Sbjct: 108 PSKSPSYRTVLCNSLTCRS-LQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLN 166
Query: 191 FPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL- 249
G T V FGC N G FGG SG++G + LSL SQ+ G+FSYCL
Sbjct: 167 L----GNTTVNNFIFGCGRKNQGL-FGGA-SGLVGLGRTDLSLISQISPMFGGVFSYCLP 220
Query: 250 VREMEATSVIKFGRDADVRRRDLETTPI-----LLSDLRPHFYLHLLEISIGRHIVRFPP 304
E EA+ + G ++ V + TTPI + + L P ++L+L I++G V+ P
Sbjct: 221 TTEAEASGSLVMGGNSSVYK---NTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPS 277
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---D 361
D M IID+GT ++ + YQ L + +Q Y ++ F D
Sbjct: 278 FGKDRM-------IIDSGTVISRLPPSIYQALKAEFV-------KQFSGYPSAPSFMILD 323
Query: 362 YCYRYDSSFKA-YPSMTFHLQ-EADYIVQPENMYF-IEPDRGRFCVAIQDDP---KYSIL 415
C+ + P + + + A+ V +++ ++ D + C+AI P + I+
Sbjct: 324 SCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGII 383
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENCA 442
G +QQ+N IIYD L F E C+
Sbjct: 384 GNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 124/439 (28%), Positives = 188/439 (42%), Gaps = 44/439 (10%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSE-RIHKMFEISKARANYMASMSKPNAFQ---- 81
+ES GFS +I ++ N +Q+ H+ +R++ + +A Q
Sbjct: 25 AESRGFSGTMIRRGRTDTTTAAINFTQAALESHRRLSFLASRSSQVDKPQSSSASQLSNN 84
Query: 82 ELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
+ + + L M Y +E +IGTP + L DT S L+WT+C + + P
Sbjct: 85 DTDTVPLRMDGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHP 144
Query: 142 RASTTYSEIPCDDPLCR-----SPFKCQNG--KCVYTRRYHVGD---VTRGLASRETFAF 191
AS+T++ +PC D LC S +C G +C Y Y +GD T+G ETF
Sbjct: 145 NASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTL 204
Query: 192 PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR 251
G VP + FGC+ G G+ +G++G PLSL SQL G F YCL
Sbjct: 205 ----GGDAVPGVGFGCTTALEGDY--GEGAGLVGLGRGPLSLVSQLD---AGTFMYCLTA 255
Query: 252 EMEATSVIKFGRDADVRRRD--LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI 309
+ S + FG A + +++T +L S + ++L I+IG A
Sbjct: 256 DASKASPLLFGALATMTGAGAGVQSTGLLAS--TTFYAVNLRSITIGS--------ATTA 305
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS 369
G GG + D+GT +T++ Y + SL P F+ CY S
Sbjct: 306 GVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSL----TPVEGRYGFEACYEKPDS 361
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDL 429
+ P+M H + P Y +E D G C +Q P SI+G Q N L+++D+
Sbjct: 362 ARLIPAMVLHFDGGADMALPVANYVVEVDDGVVCWVVQRSPSLSIIGNIMQMNYLVLHDV 421
Query: 430 NVPALRFGSENC----ANG 444
L F NC ANG
Sbjct: 422 RKSVLSFQPANCDSYKANG 440
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 155/352 (44%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V + +GTP ++FDT S W QC+PC + C++Q +FDP S+T + I C P
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAP 245
Query: 156 LCRSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y +Y G + G + +T + + + FGC N G
Sbjct: 246 ACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAIKGFRFGCGERNEG 302
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++C T + FG +
Sbjct: 303 LF--GEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSSPAVSTKL 360
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L+ + +Y+ L I +G ++ PP F T G I+D+GT +T + Y
Sbjct: 361 TTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFT-----TAGTIVDSGTVITRLPPAAY 415
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 416 SSLRSAFASAIAARGYKKAP--ALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASG 473
Query: 393 YFIEPDRGRFCV---AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+ A ++D I+G Q + ++YD+ + F C
Sbjct: 474 IIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 170/367 (46%), Gaps = 40/367 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +E+ IGTP P L DT S L WTQC+PC CF Q TPI+D S+++S +PC
Sbjct: 83 YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142
Query: 157 CRSPF--KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C + +C RY D G S E V +AFGC DN G
Sbjct: 143 CLPIWSSRCSTPSATCRYRYAYDD---GAYSPECAGISVGG-------IAFGCGVDNGGL 192
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR--EMEATSVIKFGRDADVRRRD- 271
++ +G +G LSL +QL G FSYCL +S + FG A++
Sbjct: 193 SY--NSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLSSPVFFGSLAELAASSA 247
Query: 272 ------LETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDTGT 323
+++TP++ S P +Y+ L IS+G + P G FD+ DG+GG I+D+GT
Sbjct: 248 SADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGT 307
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS----FKAYPSMTFH 379
T + ++ ++ D + LG+ + NAS C+ ++ P M H
Sbjct: 308 IFTILVETGFRVVV---DHVAGVLGQPVV--NASSLDRPCFPAPAAGVQELPDMPDMVLH 362
Query: 380 LQ-EADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
AD + +N + FC+ I + S+LG +QQQN+ +++D+ V L F
Sbjct: 363 FAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSF 422
Query: 437 GSENCAN 443
+C+
Sbjct: 423 MPTDCSK 429
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 166/396 (41%), Gaps = 54/396 (13%)
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
A +S FQE + LP+ Y+V V +GTP K L+FDT S L WTQC+P
Sbjct: 105 ARLSSHGVFQE-KQATLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEP 163
Query: 128 CIR-CFDQTTPIFDPRASTTYSEIPCDDPLCR-----SPFKCQNGKCVYTRRYHVGDVTR 181
C + C+ Q P DP ST+Y I C C+ C + C+Y +Y G +
Sbjct: 164 CAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSI 223
Query: 182 GLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI 241
G + ET N F FGC NSG G +G+LG + LSL SQ +
Sbjct: 224 GFFATETLTLSSSNVF---KNFLFGCGQQNSGLFRGA--AGLLGLGRTKLSLPSQTAQKY 278
Query: 242 QGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLR--PHFYLHLLEISIGRHI 299
+ LFSYCL + + FG + ++ TP L D + P + L + E+S+G +
Sbjct: 279 KKLFSYCLPASSSSKGYLSFGGQVS---KTVKFTP-LSEDFKSTPFYGLDITELSVGGNK 334
Query: 300 VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
+ F T G +ID+GT +T + + Y L + +++ +
Sbjct: 335 LSIDASIFS-----TSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPST----DGYSI 385
Query: 360 FDYCYRYDS-----------SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ- 407
FD CY + SFK M + I+ P N + C+A
Sbjct: 386 FDTCYDFSKNETIKIPKVGVSFKGGVEMDIDVSG---ILYPVN------GLKKVCLAFAG 436
Query: 408 --DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
DD K +I G QQ+ ++YD + F C
Sbjct: 437 NGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/398 (28%), Positives = 171/398 (42%), Gaps = 47/398 (11%)
Query: 71 MASMSKPNAFQELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
+ +M+ Q + + +P+ + L Y V V +G K L+ DT S L W QCQ
Sbjct: 108 IKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ 165
Query: 127 PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---------SPFKCQNG----KCVYTRR 173
PC C++Q P++DP S++Y + C+ C+ P NG C Y
Sbjct: 166 PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVS 225
Query: 174 YHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSL 233
Y G TRG + E+ G T + L FGC +N G FGG SG++G S +SL
Sbjct: 226 YGDGSYTRGDLASESIVL----GDTKLENLVFGCGRNNKGL-FGGA-SGLMGLGRSSVSL 279
Query: 234 SSQLRNRIQGLFSYCL-VREMEATSVIKFGRDADVRRRDLET--TPILLS-DLRPHFYLH 289
SQ G+FSYCL E A+ + FG D V + TP++ + LR + L+
Sbjct: 280 VSQTLKTFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILN 339
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L SIG V +F G +ID+GT +T + Y+ + + + G
Sbjct: 340 LTGASIGG--VELKTLSFGR------GILIDSGTVITRLPPSIYKAVKTEFLKQFS--GF 389
Query: 350 QRIPYNASQEFDYCYR---YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
P D C+ Y+ M F + YF++PD C+A+
Sbjct: 390 PSAP--GYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDASLVCLAL 447
Query: 407 QD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + I+G +QQ+N +IYD L ENC
Sbjct: 448 ASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 169/383 (44%), Gaps = 70/383 (18%)
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP 160
+G ++ DTAS L W QC PC C DQ P+FDP +S +Y+ +PCD P C +
Sbjct: 145 ATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDAL 204
Query: 161 FK------------CQNGK---CVYTRRYHVGDVTRGLAS--RETFAFPVRNGFTFVPRL 203
+ C G+ C Y Y G +RG+ + R + A V +GF
Sbjct: 205 QQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGEVIDGFV----- 259
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC--LVREMEATSVIKF 261
FGC N G FGG SG++G S LSL SQ ++ G+FSYC L RE +A+ +
Sbjct: 260 -FGCGTSNQGPPFGGT-SGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVL 317
Query: 262 GRDADVRRRDLETTPILLSDL---------RPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
G D R +TP++ + + P + ++L I++G V
Sbjct: 318 GDDPSAYRN---STPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVE----------- 363
Query: 313 GTGGF----IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYR 365
+ GF I+D+GT +T + + Y+ + Q Y + F D C+
Sbjct: 364 -STGFSARAIVDSGTVITSL-------VPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFN 415
Query: 366 YDSSFKA-YPSMTFHLQEADYIVQPEN--MYFIEPDRGRFCVAI---QDDPKYSILGAWQ 419
+ PS+T + +YF+ D + C+A+ + + + SI+G +Q
Sbjct: 416 MTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQ 475
Query: 420 QQNMLIIYDLNVPALRFGSENCA 442
Q+N+ +++D + + F E C
Sbjct: 476 QKNLRVVFDTSASQVGFAQETCG 498
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 170/380 (44%), Gaps = 43/380 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +++ +GTP K L+ DT S L W QC PC CF+Q + P+ S+TY I C DP
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPR 230
Query: 157 CR-----SPF---KCQNGKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C+ P K +N C Y Y G T G + ETF + + F V +
Sbjct: 231 CQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDV 290
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N GF +G SG+LG P+S SQ+++ FSYCL TSV +
Sbjct: 291 MFGCGHWNKGFFYGA--SGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLI 348
Query: 261 FGRDAD-VRRRDLETTPILLSDLRPH---FYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D + + +L T +L + P +YL + I +G ++ + +G
Sbjct: 349 FGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAA 408
Query: 317 F-----IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY--CYRYDSS 369
IID+G+ +TF + Y + + +++ ++ Q+I A+ +F CY +
Sbjct: 409 DAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIK---LQQI---AADDFVMSPCYNVSGA 462
Query: 370 FK--AYPSMTFHLQEADYIVQPENMYF--IEPDRGRFCVAIQDDPKYS---ILGAWQQQN 422
P H + P YF EPD C+AI P +S I+G QQN
Sbjct: 463 MMQVELPDFGIHFADGGVWNFPAENYFYQYEPDE-VICLAIMKTPNHSHLTIIGNLLQQN 521
Query: 423 MLIIYDLNVPALRFGSENCA 442
I+YD+ L + CA
Sbjct: 522 FHILYDVKRSRLGYSPRRCA 541
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 115/401 (28%), Positives = 187/401 (46%), Gaps = 41/401 (10%)
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC 131
AS + P + Q + + ++ Y ++V +GTP K L+ DT S L W QC PCI C
Sbjct: 170 ASSTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIAC 229
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCR--------SPFKCQNGKCVYTRRYHVGDVTRGL 183
F+Q+ P +DP+ S+++ I C DP C+ +P K +N C Y Y G T G
Sbjct: 230 FEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGD 289
Query: 184 ASRETFAFPV-----RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR 238
+ ETF + ++ V + FGC + N G +G+LG PLS +SQ++
Sbjct: 290 FALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFH--GAAGLLGLGKGPLSFASQMQ 347
Query: 239 NRIQGLFSYCLVREMEATSV---IKFGRDADVRRRDLETTPILL---------SDLRPHF 286
+ FSYCLV SV + FG D ++L + P L + +
Sbjct: 348 SLYGQSFSYCLVDRNSNASVSSKLIFGED-----KELLSHPNLNFTSFGGGKDGSVDTFY 402
Query: 287 YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS 346
Y+ + + + +++ P + + +G GG IID+GT +T+ Y+ + + + + ++
Sbjct: 403 YVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIK- 461
Query: 347 LGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQ--PENMYFIEPDRGRFCV 404
G + + CY S + F + AD V P YFI+ D C+
Sbjct: 462 -GYELV--EGLPPLKPCYNV-SGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCL 517
Query: 405 AIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
AI +P+ SI+G +QQQN I+YD+ L + CA+
Sbjct: 518 AILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCAD 558
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 153/347 (44%), Gaps = 19/347 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC+ C++Q +FDP S+TY+ + C P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 240 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 296
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++CL T + FG + R
Sbjct: 297 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAARARL 354
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F T G I+D+GT +T + Y
Sbjct: 355 TTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVF-----ATAGTIVDSGTVITRLPPAAY 409
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 410 SSLRYAFAAAMAARGYKKAP--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 467
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
+ C+A +D I+G Q + + YD+ + F
Sbjct: 468 IMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 167/374 (44%), Gaps = 36/374 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP K L+ DT S L W QC PC CF+Q P +DP S++Y I C D
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSR 240
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C P K +N C Y Y T G + ETF + + V +
Sbjct: 241 CHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENV 300
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIK 260
FGC + N G +G+LG PLS SSQL++ FSYCLV + +S +
Sbjct: 301 MFGCGHWNRGLFH--GAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLI 358
Query: 261 FGRDADVRRR-DLETTPILLSDLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D D+ +L T ++ P +Y+ + I +G +V P + I DG+GG
Sbjct: 359 FGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGG 418
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKA- 372
IID+GT +++ YQ + + + ++ Y ++F + CY +
Sbjct: 419 TIIDSGTTLSYFAEPAYQVIKEAFMAKVKG-------YPVVKDFPVLEPCYNVTGVEQPD 471
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQDDP--KYSILGAWQQQNMLIIYDL 429
P + P YFIE + R C+AI P SI+G +QQQN I+YD
Sbjct: 472 LPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDT 531
Query: 430 NVPALRFGSENCAN 443
L F CA+
Sbjct: 532 KKSRLGFAPTKCAD 545
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 157/352 (44%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + +G+P + Q+++ D+ S +VW QC+PC +C+ QT P+FDP S ++ + C +
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAV 102
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C C +G+C Y Y G T+G + ET G T V +A GC + N G
Sbjct: 103 CDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTL----GRTVVQNVAIGCGHMNQGM 158
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDLE 273
+G+LG +S QL FSYCLV R + ++FG +A
Sbjct: 159 FV--GAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEA--MPVGAA 214
Query: 274 TTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
P++ + P ++Y+ L + +G V F++ G GG ++DTGT VT
Sbjct: 215 WIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVA 274
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
Y+ + +L R + FD CY P+++F+ + P N
Sbjct: 275 YEAFRDAFIDQTGNLPRA----SGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPAN 330
Query: 392 MYFIE-PDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ I D G FC A P SILG QQ+ + I D + FG C
Sbjct: 331 NFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 114/376 (30%), Positives = 175/376 (46%), Gaps = 41/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP K L+ DT S L W QC PCI CF+Q+ P +DP+ S+++ I C DP
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 256
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR--NG---FTFVPRL 203
C+ P K +N C Y Y G T G + ETF + NG V +
Sbjct: 257 CQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENV 316
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N G +G+LG PLS +SQ+++ FSYCLV SV +
Sbjct: 317 MFGCGHWNRGLFH--GAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLI 374
Query: 261 FGRDADVRRRDLETTPILL---------SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMR 311
FG D ++L + P L + +Y+ + + + +++ P + +
Sbjct: 375 FGED-----KELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSS 429
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK 371
+G GG IID+GT +T+ Y+ + + + + ++ G Q + CY S +
Sbjct: 430 EGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIK--GYQLV--EGLPPLKPCYNV-SGIE 484
Query: 372 AYPSMTFHLQEADYIVQ--PENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIY 427
F + AD V P YFI D C+AI +P+ SI+G +QQQN I+Y
Sbjct: 485 KMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILY 544
Query: 428 DLNVPALRFGSENCAN 443
D+ L + CA+
Sbjct: 545 DMKKSRLGYAPMKCAD 560
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 112/362 (30%), Positives = 161/362 (44%), Gaps = 32/362 (8%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC---IRCFDQTTPIFDPRASTTYSEIP 151
L + V V +GTP +P L+FDT S L W QCQPC C Q P+FDP S+TY+ +
Sbjct: 142 LEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVH 201
Query: 152 CDDPLCRSPFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C +P C + N C+Y RY G T G+ SR+T A T P FGC
Sbjct: 202 CGEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALTGFP---FGCG 258
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVR 268
N G G++ G+LG LSL SQ +FSYCL T + G
Sbjct: 259 TRNLGDF--GRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATPATD 316
Query: 269 RRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ T +L P FY + L+ I IG +++ PP F GG ++D+GT +T+
Sbjct: 317 TGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGGTLLDSGTVLTY 371
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTFHLQ----- 381
+ Y L R+ R + P + D CY + S P+++F
Sbjct: 372 LPAQAYALLRDRF----RLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAVF 427
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSE 439
E D+ M F++ + G A D SI+G QQ++ +IYD+ + F
Sbjct: 428 ELDFF---GVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPA 484
Query: 440 NC 441
+C
Sbjct: 485 SC 486
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 172/371 (46%), Gaps = 35/371 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +G P + L+ DT S L W QC+PC CFDQ+ P+FDP ST++ IPC+
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146
Query: 157 C---------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRN--GFTFVPRLAF 205
C + K C Y Y T G + E+ + + + + +
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR-IQGLFSYCLV---REMEATSVIKF 261
GC + N G G+LG LS SQLR+ I FSYCLV + +S I F
Sbjct: 207 GCGHSNKGLFQ--GAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISF 264
Query: 262 GRDADVRRR--DLETTPILLSD--LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
G + R ++ TP + ++ + +YL + I I + ++ P F I +G+GG
Sbjct: 265 GAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGT 324
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD---YCYRYDSSFKA-Y 373
IID+GT +T++ Y+ + + RI Y + FD CY +
Sbjct: 325 IIDSGTTLTYLNRDAYRAVESAF--------LARISYPRADPFDILGICYNATGRAAVPF 376
Query: 374 PSMTFHLQEADYIVQPENMYFIEPD--RGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNV 431
P+++ Q + P+ YFI+PD + C+AI SI+G +QQQN+ +YD+
Sbjct: 377 PALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQH 436
Query: 432 PALRFGSENCA 442
L F + +C+
Sbjct: 437 ARLGFANTDCS 447
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 152/352 (43%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC+ C++Q +FDP S+TY+ + C P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAP 238
Query: 156 LC--RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 239 ACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 295
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++CL T + FG +
Sbjct: 296 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAAGARL 353
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F T G I+D+GT +T + Y
Sbjct: 354 TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF-----ATAGTIVDSGTVITRLPPPAY 408
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 409 SSLRSAFVSAMAARGYKKAP--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASG 466
Query: 393 YFIEPDRGRFCV---AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+ A +D I+G Q + + YD+ + F C
Sbjct: 467 IMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 164/362 (45%), Gaps = 41/362 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP K L FDT S L WTQC+PC+ CF Q P FDP ST+Y + C
Sbjct: 140 YVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSE 199
Query: 156 LCRSPFK-------CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C+ + C + C+Y +Y G T G + ET A + F FGCS
Sbjct: 200 FCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATETLAIASSDVFK---NFLFGCS 255
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVR 268
++ G F G +G+LG SP++L SQ N+ + LFSYCL +T + FG +
Sbjct: 256 EESRG-TFNGT-TGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPSSTGHLSFGVEVSQA 313
Query: 269 RRDLETTPILLSDLRPHFYLHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ +P L+ + L+ + IS+ GR + P I R IID+GT TF
Sbjct: 314 AKSTPISP----KLKQLYGLNTVGISVRGREL----PINGSISRT-----IIDSGTTFTF 360
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD---SSFKAYPSMTFHLQEAD 384
+ + Y L + +++ + N + F CY + + P ++ E
Sbjct: 361 LPSPTYSALGSAFREMMANYTLT----NGTSSFQPCYDFSNIGNGTLTIPGISIFF-EGG 415
Query: 385 YIVQPENMYFIEPDRG--RFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
V+ + + P G C+A D D ++I G +QQ+ +IYD+ + F +
Sbjct: 416 VEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPK 475
Query: 440 NC 441
C
Sbjct: 476 GC 477
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 172/372 (46%), Gaps = 35/372 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V IGTP K L+ DT S L W QC PCI CF+Q+ P +DP+ S+++ I C DP
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPR 251
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR--NGFT---FVPRL 203
C+ P K +N C Y Y T G + ETF + NG + V +
Sbjct: 252 CKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENV 311
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N G +G+LG PLS +SQL++ FSYCLV TSV +
Sbjct: 312 MFGCGHWNRGLFH--GAAGLLGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDTSVSSKLI 369
Query: 261 FGRDADVRRR-DLETTPILLSD---LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D ++ +L T + + + +Y+ + I + +++ P + + ++G GG
Sbjct: 370 FGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGG 429
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFK-A 372
IID+GT +T+ Y+ + + + + ++ Y + F CY K
Sbjct: 430 TIIDSGTTLTYFAEPAYEIIKEAFMKKIKG-------YELVEGFPPLKPCYNVSGIEKME 482
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLN 430
P + P YFI+ + C+AI PK SI+G +QQQN I+YD+
Sbjct: 483 LPDFGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMK 542
Query: 431 VPALRFGSENCA 442
L + C
Sbjct: 543 KSRLGYAPMKCT 554
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 121/427 (28%), Positives = 191/427 (44%), Gaps = 53/427 (12%)
Query: 49 GNLSQSERIHKMFEISKARANYM-------ASMSKPNAFQELEDIHLPMAK----QDLFY 97
G S++E H + AR + + + +A + +P+ + L Y
Sbjct: 54 GGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASASKLAQVPVTSGARLRTLNY 113
Query: 98 SVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC 157
V IG ++ DTAS L W QC+PC C DQ P+FDP +S +Y+ +PC+ C
Sbjct: 114 VATVGIGG--GEATVIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSC 171
Query: 158 --------RSPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
S C Q C YT Y G +RG+ + + + + FV FGC
Sbjct: 172 DALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQGFV----FGC 227
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFGRDAD 266
N G FGG SG++G S LSL SQ ++ G+FSYCL +E ++ + G DA
Sbjct: 228 GTSNQG-PFGGT-SGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGSSGSLVLGDDAS 285
Query: 267 VRRRDLETTPI----LLSD-LRPHFYL-HLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
V R +TPI ++SD L+ FYL +L I++G V+ P G G I+D
Sbjct: 286 VYRN---STPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSP----GFSAGGGGKAIVD 338
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFH 379
+GT +T + Y + + L Q P++ D C+ + PS+
Sbjct: 339 SGTIITSLVPSVYAAVRAEFVSQLAEY-PQAAPFSI---LDTCFDLTGLREVQVPSLKLV 394
Query: 380 LQ-EADYIVQPEN-MYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
A+ V + +Y + D + C+A ++ + I+G +QQ+N+ +I+D +
Sbjct: 395 FDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQI 454
Query: 435 RFGSENC 441
F E C
Sbjct: 455 GFAQETC 461
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 112/375 (29%), Positives = 172/375 (45%), Gaps = 38/375 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V IGTP + L+ DT S L W QC PC CF Q P +DP+ S+++ I C DP
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPR 251
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C P K +N C Y Y T G + ETF + ++ F V +
Sbjct: 252 CHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENV 311
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N G +G+LG PLS SSQL++ FSYCLV T+V +
Sbjct: 312 MFGCGHWNRGLFH--GAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI 369
Query: 261 FGRDADVRRR-DLETTPILLSDLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D D+ ++ T ++ P +Y+ + I +G +++ P + + +G GG
Sbjct: 370 FGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGG 429
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFK-A 372
I+D+GT +++ Y+ + + + ++ Y ++F D CY K
Sbjct: 430 TIVDSGTTLSYFAEPSYEIIKDAFVKKVKG-------YPVIKDFPILDPCYNVSGVEKME 482
Query: 373 YPSMTFHLQEADYIVQPENMYFI--EPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYD 428
P ++ P YFI EP+ C+AI P+ SI+G +QQQN I+YD
Sbjct: 483 LPEFRILFEDGAVWNFPVENYFIKLEPEE-IVCLAILGTPRSALSIIGNYQQQNFHILYD 541
Query: 429 LNVPALRFGSENCAN 443
L + CA+
Sbjct: 542 TKKSRLGYAPMKCAD 556
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/428 (26%), Positives = 189/428 (44%), Gaps = 26/428 (6%)
Query: 26 SSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMA---SMSKPNAFQE 82
+S S + LKL+ + P + R + + RA + + KP E
Sbjct: 62 ASSSAKYKLKLV--HRDKVPTFNTYHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYAAE 119
Query: 83 L--EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
D+ M + Y V + +G+P + Q+++ D+ S ++W QC+PC +C+ Q+ P+F+
Sbjct: 120 AFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFN 179
Query: 141 PRASTTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
P S+++S + C +C C G+C Y Y G T+G + ET F G T
Sbjct: 180 PADSSSFSGVSCASTVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITF----GRT 235
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATS 257
+ +A GC + N G +G+LG P+S QL + G FSYCLV R +E++
Sbjct: 236 LIRNVAIGCGHHNQGMFV--GAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESSG 293
Query: 258 VIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+++FGR+A P++ + + +Y+ L + +G V F + G GG
Sbjct: 294 LLEFGREA--MPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGG 351
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPS 375
++DTGT VT + Y+ + +L R + FD CY P+
Sbjct: 352 VVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRA----SGVSIFDTCYDLFGFVSVRVPT 407
Query: 376 MTFHLQEADYIVQPENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPA 433
++F+ + P + I D G FC A SI+G QQ+ + I D
Sbjct: 408 VSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGF 467
Query: 434 LRFGSENC 441
+ FG C
Sbjct: 468 VGFGPNVC 475
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/455 (27%), Positives = 184/455 (40%), Gaps = 64/455 (14%)
Query: 9 LAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARA 68
+AAF + +L L + S + ++L + + Y G +ER+ + + S R
Sbjct: 1 MAAFLVWI-LLLLPYVAISSTASHGVRLELTHADDRGGYVG----AERVRRAADRSHRRV 55
Query: 69 N-YMASMSKPNAFQEL-----------EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDT 116
N ++ ++ P++ L +H A Y V++ IGTP P + DT
Sbjct: 56 NGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTAT----YLVDIAIGTPPLPLTAVLDT 111
Query: 117 ASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDPLC---RSPF-KCQ--NGKCV 169
S L+WTQC PC RCF Q P++ P S TY+ + C P+C +SP+ +C + C
Sbjct: 112 GSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCA 171
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y Y G T G+ + ETF T V +AFGC +N G SG++G
Sbjct: 172 YYFSYGDGTSTDGVLATETFTL---GSDTAVRGVAFGCGTENLGST--DNSSGLVGMGRG 226
Query: 230 PLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLH 289
PLSL SQL V + RR P
Sbjct: 227 PLSLVSQL-------------------GVTR-------PRRSCRARAAARGGGAPTTTSP 260
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L I++G ++ P F + G GG IID+GT T + + L + L S R
Sbjct: 261 LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALA----RALASRVR 316
Query: 350 QRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD 408
+ A C+ S P + H AD ++ E+ + G C+ +
Sbjct: 317 LPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVACLGMVS 376
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
S+LG+ QQQN I+YDL L F C
Sbjct: 377 ARGMSVLGSMQQQNTHILYDLERGILSFEPAKCGE 411
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 165/381 (43%), Gaps = 48/381 (12%)
Query: 93 QDLFYSVEVNIG----TPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
Q L Y +++G +P ++ DT S L W QC+PC C+ Q P+FDP S TY+
Sbjct: 140 QTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYA 199
Query: 149 EIPCDDPLCRSPFKCQNG-------------KCVYTRRYHVGDVTRGLASRETFAFPVRN 195
+ C+ C + G KC Y Y G +RG+ + +T A
Sbjct: 200 AVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---- 255
Query: 196 GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREM 253
G + FGC N G FGG +G++G + LSL SQ +R G+FSYCL
Sbjct: 256 GGASLGGFVFGCGLSNRGL-FGG-TAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSG 313
Query: 254 EATSVIKFGRDADVRRRDLETTPI----LLSD-LRPHFY-LHLLEISIGRHIVRFPPGAF 307
+A+ + G D TTP+ +++D +P FY L++ ++G A
Sbjct: 314 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT-------AL 366
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRY 366
G +ID+GT +T + Y+ + + +R G P D CY
Sbjct: 367 AAQGLGASNVLIDSGTVITRLAPSVYRAVRAEF---MRQFGAAGYPAAPGFSILDTCYDL 423
Query: 367 DSSFKA-YPSMTFHLQ-EADYIVQPENMYF-IEPDRGRFCVAIQD---DPKYSILGAWQQ 420
+ P +T L+ AD V M F + D + C+A+ + + I+G +QQ
Sbjct: 424 TGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQ 483
Query: 421 QNMLIIYDLNVPALRFGSENC 441
+N ++YD L F E+C
Sbjct: 484 KNKRVVYDTLGSRLGFADEDC 504
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 125/434 (28%), Positives = 176/434 (40%), Gaps = 54/434 (12%)
Query: 49 GNLSQSERIHKMFEISKARANYMASMSKPNAFQELED---------------IHLPMAKQ 93
G+ + S+R+ + + S R N + + + P A L +H A
Sbjct: 41 GDFTGSDRVRRAADRSHRRVNGLLAAAPPPAASTLRSDGGGGGACAATAAASVHASTAT- 99
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPC 152
Y V+ IGTP + DT S L+WTQC PC RCF Q P++ P S TY+ + C
Sbjct: 100 ---YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSC 156
Query: 153 DDPLCRS---------------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
LC + + G C Y Y G T G+ + ETF F
Sbjct: 157 GSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF---GAG 213
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEA 255
T V LAFGC DN G SG++G PLSL SQL FSYC +
Sbjct: 214 TTVHDLAFGCGTDNLGGT--DNSSGLVGMGRGPLSLVSQLGVTK---FSYCFTPFNDTTT 268
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMR 311
+S + G A + ++TP + S P ++YL L I++G ++ P F +
Sbjct: 269 SSPLFLGSSASLSPA-AKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTA 327
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK 371
G GG IID+GT T + + L + + S F
Sbjct: 328 SGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAV 387
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDR--GRFCVAIQDDPKYSILGAWQQQNMLIIYDL 429
P + H AD + P + +E DR G C+ I S+LG+ QQQNM + YD+
Sbjct: 388 DVPRLVLHFDGADMEL-PRSSAVVE-DRVAGVACLGIVSARGMSVLGSMQQQNMHVRYDV 445
Query: 430 NVPALRFGSENCAN 443
L F NC
Sbjct: 446 GRDVLSFEPANCGE 459
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 171/371 (46%), Gaps = 35/371 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +G P + L+ DT S L W QC+PC CFDQ+ P+FDP ST++ IPC+
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230
Query: 157 C---------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRN--GFTFVPRLAF 205
C + K C Y Y T G + E+ + + + + +
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR-IQGLFSYCLV---REMEATSVIKF 261
GC + N G G+LG LS SQLR+ I FSYCLV + +S I F
Sbjct: 291 GCGHSNKGLFQ--GAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISF 348
Query: 262 GRDADVRRR--DLETTPILLSD--LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
G + R + TP + ++ + +YL + I I + ++ P F I +G+GG
Sbjct: 349 GAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGT 408
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD---YCYRYDSSFKA-Y 373
IID+GT +T++ Y+ + + RI Y + FD CY +
Sbjct: 409 IIDSGTTLTYLNRDAYRAVESAF--------LARISYPRADPFDILGICYNATGRTAVPF 460
Query: 374 PSMTFHLQEADYIVQPENMYFIEPD--RGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNV 431
P+++ Q + P+ YFI+PD + C+AI SI+G +QQQN+ +YD+
Sbjct: 461 PTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQH 520
Query: 432 PALRFGSENCA 442
L F + +C+
Sbjct: 521 ARLGFANTDCS 531
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 170/381 (44%), Gaps = 49/381 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT---PIFDPRASTTYSEIPCD 153
Y VE+ +GTP K L+ DT S L W QC P + ++ P +D +S++Y EIPC
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 86
Query: 154 DPLC-------------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR------ 194
D C +SP C YT Y T G+ + ET + R
Sbjct: 87 DDECLFLPAPIGSSCSIKSPSPCD-----YTYGYSDQSRTTGILAYETISMKSRKRSGKR 141
Query: 195 --NGFTFVPRL---AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR-IQGLFSYC 248
N T R+ A GCS ++ G +F G SG+LG P+SL++Q R+ + G+FSYC
Sbjct: 142 AGNHKTRTIRIKNVALGCSRESVGASFLGA-SGVLGLGQGPISLATQTRHTALGGIFSYC 200
Query: 249 LV---REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISI-GRHIVRFP 303
LV R A+S + GR R R L TPI+ + F Y+++ +++ G+ +
Sbjct: 201 LVDYLRGSNASSFLVMGR---TRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 257
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYC 363
+ I DG G I D+GT ++++R Y ++ + + Q IP + F+ C
Sbjct: 258 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP----EGFELC 313
Query: 364 YRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQ 420
Y K P + Q + P N Y + CVA+Q +ILG Q
Sbjct: 314 YNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQ 373
Query: 421 QNMLIIYDLNVPALRFGSENC 441
Q+ I YDL + F C
Sbjct: 374 QDHHIEYDLAKARIGFKWSPC 394
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 171/367 (46%), Gaps = 34/367 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V+ +GTP + L+ D+ S L+W QC PC +C+ Q +P++ P S+T+S +PC
Sbjct: 64 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSSD 123
Query: 157 C-----RSPFKCQ---NGKCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVPRLAFGC 207
C F C G C Y Y ++G+ + E+ VR + ++AFGC
Sbjct: 124 CLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGVR-----IDKVAFGC 178
Query: 208 SNDNSG-FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IKFGR 263
+DN G FA G G+LG PLS SQ+ F+YCLV ++ TSV + FG
Sbjct: 179 GSDNQGSFAAAG---GVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSSLIFGD 235
Query: 264 DADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
+ D++ TPI+ + P +Y+ + ++++G + A++I G GG I D+G
Sbjct: 236 ELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFDSG 295
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQ 381
T +T+ Y ++ +D + + + Q D C + ++PS T
Sbjct: 296 TTLTYWFPSAYSHILAAFDSGVHYPRAESV-----QGLDLCVELTGVDQPSFPSFTIEFD 350
Query: 382 EADYIVQPE-NMYFIEPDRGRFCVAIQDDPK----YSILGAWQQQNMLIIYDLNVPALRF 436
+ + QPE YF++ C+A+ ++ +G QQN + YD + F
Sbjct: 351 DGA-VFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLIGF 409
Query: 437 GSENCAN 443
C++
Sbjct: 410 APAKCSS 416
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 156/320 (48%), Gaps = 31/320 (9%)
Query: 144 STTYSEIPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG 196
S+T+ + C DP+CR S +N +C Y Y +T G ++TF F NG
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61
Query: 197 F-TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA 255
V LAFGC + N+G F SGI GF P SL SQL+ G FSYCL E+
Sbjct: 62 VPVAVSELAFGCGDYNTGL-FVSNESGIAGFGRGPQSLPSQLK---VGRFSYCLTLVTES 117
Query: 256 -TSVIKFGR--DADVRRRD----LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAF 307
+SV+ G D D R ++TPI+ + L P FY L L I++G+ + F F
Sbjct: 118 KSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVF 177
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD--YCYR 365
+ +DG+GG +ID+GT +T + ++ L + +++ R Y+ + E C+R
Sbjct: 178 ALKKDGSGGTVIDSGTSLTTLPEAVFELLQE---ELVAQFPLPR--YDNTPEVGDRLCFR 232
Query: 366 YDSSFKA--YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQ 421
K P + HL AD + +N + EPD G C+ I +D ++G +QQQ
Sbjct: 233 RPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQ 292
Query: 422 NMLIIYDLNVPALRFGSENC 441
NM ++YD+ L F C
Sbjct: 293 NMHVVYDVENNKLLFAPAQC 312
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/438 (28%), Positives = 195/438 (44%), Gaps = 48/438 (10%)
Query: 34 LKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQ 93
L LI SP SPL+ NL+ S+R+ +A+++ ++S+ + + + LP +
Sbjct: 29 LDLIHRDSPLSPLHTPNLTFSDRL---------QASFLRAISRQSRHVDFQTDLLPSGGE 79
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
Y + ++IGTP P + DT S L W Q +PC +C+ Q PIFDP STT+ ++PC
Sbjct: 80 ---YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCT 136
Query: 154 DPLC----RSPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C S C + C YT Y T G + +T V N + +AFGC
Sbjct: 137 TAPCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVT--VGNASVQIRNVAFGCG 194
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV----------REMEATSV 258
N G F + SGI+G LS SQL + I FSYCL+ + ATS
Sbjct: 195 TRNGG-NFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSR 253
Query: 259 IKFGRD-----ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRF-----PPGAFD 308
I FG + + TTP++ + ++YL + I++GR + + ++D
Sbjct: 254 IVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYD 313
Query: 309 IMRDGT---GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
+ G IID+GT +TF+ Y L ++ + +R+ + F C++
Sbjct: 314 SGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEA---ALVEEIKMERVNDVKNSMFSLCFK 370
Query: 366 YDSSFKAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNML 424
P M H + AD ++P N F+ + G C + I G Q N +
Sbjct: 371 SGKEEVELPLMKVHFRGGADVELKPVNT-FVRAEEGLVCFTMLPTNDVGIYGNLAQMNFV 429
Query: 425 IIYDLNVPALRFGSENCA 442
+ YDL + F +C+
Sbjct: 430 VGYDLGKRTVSFLPADCS 447
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 105/407 (25%), Positives = 173/407 (42%), Gaps = 42/407 (10%)
Query: 61 FEISKARANYMAS-----MSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQH 111
+ R Y+ S + + N ++L+ LP L Y V V +GTP +
Sbjct: 1 MNLDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLS 60
Query: 112 LLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDPLCRS------PFKCQ 164
L+FDT S L WTQC+PC C+ Q IFDP S++Y+ I C LC +C
Sbjct: 61 LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECS 120
Query: 165 ---NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKIS 221
+ C+Y +Y + G S+E + V FGC DN G F G +
Sbjct: 121 SSTDASCIYDAKYGDNSTSVGFLSQERLTITATD---IVDDFLFGCGQDNEGL-FNGS-A 175
Query: 222 GILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSD 281
G++G P+S+ Q + +FSYCL + + FG A + T +S
Sbjct: 176 GLMGLGRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLTFGASAATNASLIYTPLSTISG 235
Query: 282 LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYD 341
+ L ++ IS+G + P + GG IID+GT +T + Y L +
Sbjct: 236 DNSFYGLDIVSISVGG--TKLPAVSSSTFS--AGGSIIDSGTVITRLAPTVYAALRSAFR 291
Query: 342 QILRSLGRQRIPY-NASQEFDYCYRYDSSFK--AYPSMTFHLQEADYI-VQPENMYFIEP 397
+ + ++ P N + D CY S +K + P + F + + + +E
Sbjct: 292 RXM-----EKYPVANEAGLLDTCYDL-SGYKEISVPRIDFEFSGGVTVELXHRGILXVES 345
Query: 398 DRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
++ + C+A D ++ G QQ+ + ++YD+ + FG+ C
Sbjct: 346 EQ-QVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 160/358 (44%), Gaps = 36/358 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + V GTP + Q ++FDT S + W QC+PC +RC+ Q P+FDP S+TY + C +P
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSCTEP 75
Query: 156 LC--RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C S C + C+Y Y G T G + +TF F FGC +N+G
Sbjct: 76 ACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKF---KNFIFGCGQNNTG 132
Query: 214 FAFGGKISGILGFN-ASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDL 272
G +G++G +S SL+SQ+ + +FSYCL AT + G
Sbjct: 133 LFQG--TAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNP-------- 182
Query: 273 ETTP---ILLSDLR-PHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ TP +L+D R P Y + L+ IS+G + F + G IID+GT +T
Sbjct: 183 QNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSV-----GTIIDSGTVITR 237
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SSFKAYPSMTFHLQEADYI 386
+ Y L +R+ Q A D CY + ++ YP + H D
Sbjct: 238 LPPTAYSAL----KTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVR 293
Query: 387 VQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ ++F+ + + C+A D I+G QQ M + YD + + F + C
Sbjct: 294 IPATGVFFVF-NSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 167/364 (45%), Gaps = 49/364 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y++ +++GTP+ ++ DT S L+WTQC PC +CF Q P F P +S+T+S++PC
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 157 CR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C+ S C CVY +Y G T G + ET ++ G P +AFGCS +N
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATET----LKVGDASFPSVAFGCSTENG 200
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRDADVRRRD 271
LG QL + G FSYCL A S I FG A++ +
Sbjct: 201 -----------LG----------QLDLGV-GRFSYCLRSGSAAGASPILFGSLANLTDGN 238
Query: 272 LETTPILLS-DLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGFIIDTGTPVTFI 328
+++TP + + + P ++Y++L I++G + F ++G GG I+D+GT +T++
Sbjct: 239 VQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYL 298
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK---AYPSMTFHLQEADY 385
Y+ + Q + S N ++ D C++ A PS+
Sbjct: 299 AKDGYEMVKQAF----LSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAE 354
Query: 386 IVQPENMYFIEPD-RGRFCVAI------QDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
P +E D +G VA + D S++G Q +M ++YDL+ F
Sbjct: 355 YAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAP 414
Query: 439 ENCA 442
+CA
Sbjct: 415 ADCA 418
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 117/426 (27%), Positives = 180/426 (42%), Gaps = 49/426 (11%)
Query: 33 SLKLIPIFSPESPL--YPGNLSQSERIHKMFEISKARANYMAS-----MSKPNAFQELED 85
SL+++ P S L + G ++ K R Y+ S + + ++ EL+
Sbjct: 70 SLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDS 129
Query: 86 IHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFD 140
+ LP L Y V V +GTP + L+FDT S L WTQC+PC R C+ Q IFD
Sbjct: 130 VTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFD 189
Query: 141 PRASTTYSEIPCDDPLCRSPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAF 191
P ST+YS I C LC + C+Y +Y + G SRE +
Sbjct: 190 PSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSV 249
Query: 192 PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR 251
+ V FGC +N G FGG +G++G P+S Q + +FSYCL
Sbjct: 250 TATD---IVDNFLFGCGQNNQGL-FGGS-AGLIGLGRHPISFVQQTAAVYRKIFSYCLPA 304
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIM 310
+T + FG ++ TP FY L + IS+G + F
Sbjct: 305 TSSSTGRLSFGT---TTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFS-- 359
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSS 369
TGG IID+GT +T + Y L + Q G + P D CY S
Sbjct: 360 ---TGGAIIDSGTVITRLPPTAYTALRSAFRQ-----GMSKYPSAGELSILDTCYDL-SG 410
Query: 370 FKAY--PSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNM 423
++ + P + F + + P+ + ++ + + C+A DD +I G QQ+ +
Sbjct: 411 YEVFSIPKIDFSFAGGVTVQLPPQGILYVASAK-QVCLAFAANGDDSDVTIYGNVQQKTI 469
Query: 424 LIIYDL 429
++YD+
Sbjct: 470 EVVYDV 475
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/346 (28%), Positives = 154/346 (44%), Gaps = 21/346 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +GTP + +L+ DT S + W QC PC C+ Q +F+P +S+++ + C L
Sbjct: 16 YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSSFKVLDCSSSL 75
Query: 157 CRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF--PVRNGFTFVPRLAFGCSNDNS 212
C + C + KC+Y Y G T G + G + + GC +DN
Sbjct: 76 CLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCGHDNE 135
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEAT--SVIKFGRDADVRR 269
G G +GILG PLS + L + +FSYCL RE + S + FG DA +
Sbjct: 136 GTF--GTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKSTLVFG-DAAIPH 192
Query: 270 RDLETTPILLSDLRP----HFYLHLLEISIGRHIV-RFPPGAFDIMRDGTGGFIIDTGTP 324
+ + P ++Y+ + IS+G +++ P F + G GG I D+GT
Sbjct: 193 TATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDSGTT 252
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQ-E 382
+T + Y + + R+ + FD CY + + + P++TFH Q +
Sbjct: 253 ITRLEARAYTAVRDAF----RAATMHLTSAADFKIFDTCYDFTGMNSISVPTVTFHFQGD 308
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYD 428
D + P N + FC A S++G QQQ+ +IYD
Sbjct: 309 VDMRLPPSNYIVPVSNNNIFCFAFAASMGPSVIGNVQQQSFRVIYD 354
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 161/362 (44%), Gaps = 42/362 (11%)
Query: 104 GTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKC 163
G+P ++ DT S L W QC+PC C+ Q P+FDP S TY+ + C+ C + K
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKA 256
Query: 164 QNG----------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
G +C Y Y G +RG+ + +T A G + FGC N G
Sbjct: 257 ATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVAL----GGASLDGFVFGCGLSNRG 312
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRD 271
FGG +G++G + LSL SQ R G+FSYCL +A+ + G DA R
Sbjct: 313 L-FGG-TAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYR-- 368
Query: 272 LETTPI----LLSD-LRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
TTP+ +++D +P FY L++ ++G A G +ID+GT +
Sbjct: 369 -NTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT-------ALAAQGLGASNVLIDSGTVI 420
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQ-EA 383
T + Y+ + + + + G P D CY + P +T L+ A
Sbjct: 421 TRLAPSVYRGVRAEFTRQFAAAGYPTAP--GFSILDTCYDLTGHDEVKVPLLTLRLEGGA 478
Query: 384 DYIVQPENMYF-IEPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
+ V M F + D + C+A+ + + I+G +QQ+N ++YD L F E
Sbjct: 479 EVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADE 538
Query: 440 NC 441
+C
Sbjct: 539 DC 540
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 152/352 (43%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V + +GTP ++FDT S W QC+PC + C+ Q +FDP S+TY+ I C P
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAP 220
Query: 156 LCRSPF--KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y +Y G + G + +T + + + FGC N G
Sbjct: 221 ACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAIKGFRFGCGERNEG 277
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++C T + FG +
Sbjct: 278 LY--GEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSLPAVSAKL 335
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L+ + +Y+ L I +G ++ P F T G I+D+GT +T + Y
Sbjct: 336 TTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFT-----TSGTIVDSGTVITRLPPAAY 390
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 391 SSLRSAFASAMAERGYKKAP--ALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHASG 448
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+ ++D I+G Q + ++YD+ + F C
Sbjct: 449 IIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/407 (29%), Positives = 179/407 (43%), Gaps = 52/407 (12%)
Query: 50 NLSQS-ERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
NL+++ + H+ + AR + AS S Q + Y + +IGTP +
Sbjct: 42 NLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQ--------LDSGGGAYDMTFSIGTPPQ 93
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQNG 166
L DT S L+W +C C RC Q +P + P S+++S++PC LC +C G
Sbjct: 94 ELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPSSQCSAG 153
Query: 167 --KCVYTRRYHVGD----VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI 220
+C Y Y + T+G ETF G VP + FGC+ + G
Sbjct: 154 GAECDYKYSYGLASDPHHYTQGYLGSETFTL----GSDAVPGIGFGCTTMSEGGYG--SG 207
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLS 280
SG++G PLSL SQL G FSYCL + TS + FG A + +++TP+L +
Sbjct: 208 SGLVGLGRGPLSLVSQLN---VGAFSYCLTSDAAKTSPLLFGSGA-LTGAGVQSTPLLRT 263
Query: 281 DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY----QTL 336
++ ++L ISI GA G+ G I D+GT V F+ Y + +
Sbjct: 264 STY-YYTVNLESISI---------GAATTAGTGSSGIIFDSGTTVAFLAEPAYTLAKEAV 313
Query: 337 MQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIE 396
+ + + + GR ++ C++ +S +PSM H D + EN YF
Sbjct: 314 LSQTTNLTMASGR--------DGYEVCFQ--TSGAVFPSMVLHFDGGDMDLPTEN-YFGA 362
Query: 397 PDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D C +Q P SI+G Q N I YD+ L F NC N
Sbjct: 363 VDDSVSCWIVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANCDN 409
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 153/354 (43%), Gaps = 27/354 (7%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + + +GTP DT S L+WTQC PC C+ Q PIFDP S+T+ E
Sbjct: 60 IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEK----- 114
Query: 156 LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAFGCSNDNSGF 214
+C C Y Y + G+ + ET +G FV + GC +NS
Sbjct: 115 ------RCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSNL 168
Query: 215 ---AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
+ SGI+G N P SL SQ+ I GL SYC + TS I FG +A V
Sbjct: 169 MTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCF--SSQGTSKINFGTNAVVAGDG 226
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+ + +P +YL+L +S+G + F G ID+GT T++
Sbjct: 227 TVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQD---GNIFIDSGTTYTYL--- 280
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPE 390
P + + S+ + S E CY +D + + +P +T H AD ++
Sbjct: 281 PTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWD-TMEIFPVITLHFAGGADLVLDKY 339
Query: 391 NMYFIEPDRGRFCVAIQ-DDPKY-SILGAWQQQNMLIIYDLNVPALRFGSENCA 442
NMY G FC+AI DP +I G N+L+ YD + + F NC+
Sbjct: 340 NMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/434 (27%), Positives = 191/434 (44%), Gaps = 49/434 (11%)
Query: 41 SPESPLYPGNLSQS----ERIHK----MFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
S +SP P N + R+H+ + IS + +A + K + L++ + P +
Sbjct: 5 SADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTN-PFLQ 63
Query: 93 QDLF-------------YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
QD Y V + +GTP + +++ DT S ++W QC PC C+ QT P+F
Sbjct: 64 QDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLF 123
Query: 140 DPRASTTYSEIPCDDPLCRSPF--KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
+P S+T+ I C LC+ C+ +C+Y Y G T G S ET +F G
Sbjct: 124 NPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF----GS 179
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEAT 256
V +A GC ++N G +G+LG LS SQ+ +FSYCL RE +
Sbjct: 180 NAVNSVAIGCGHNNQGLFT--GAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGS 237
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI-MRDGTG 315
+ FG A V TT + L +Y+ ++ I +G V P G+ + G G
Sbjct: 238 VPLIFGNQA-VASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNG 296
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-----FDYCYRYDS-S 369
G I+D+GT VT + Y + + R +P +A FD CY S
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAF--------RAGMPSDAKMTSGFSLFDTCYDLSGRS 348
Query: 370 FKAYPSMTFHLQEADYIVQP-ENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIY 427
P+++F + P +N+ + G +C+A + +SI+G QQQ+ + +
Sbjct: 349 SIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSF 408
Query: 428 DLNVPALRFGSENC 441
D + G+ C
Sbjct: 409 DSTGNRVGIGANQC 422
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 160/362 (44%), Gaps = 32/362 (8%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC---IRCFDQTTPIFDPRASTTYSEIP 151
L + V V +GTP +P L+FDT S L W QCQPC C Q P+FDP S+TY+ +
Sbjct: 147 LEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVH 206
Query: 152 CDDPLCRSPFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C +P C + N C+Y Y G T G+ SR+T A P FGC
Sbjct: 207 CGEPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRALAGFP---FGCG 263
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVR 268
N G G++ G+LG LSL SQ +FSYCL T + G
Sbjct: 264 TRNLGDF--GRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATPATD 321
Query: 269 RRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ T +L P FY + L+ I IG +I+ PP F GG ++D+GT +T+
Sbjct: 322 TGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGGTLLDSGTVLTY 376
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQ----- 381
+ Y+ L R+ R + P + D CY + + P+++F
Sbjct: 377 LPAQAYELLRDRF----RLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVF 432
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSE 439
E D+ M F++ + G A D SI+G QQ++ +IYD+ + F
Sbjct: 433 ELDFF---GVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPA 489
Query: 440 NC 441
+C
Sbjct: 490 SC 491
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 172/371 (46%), Gaps = 61/371 (16%)
Query: 102 NIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-- 159
+G ++ DTAS L W QC PC C DQ P+FDP +S +Y+ +PC+ C +
Sbjct: 130 TVGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQ 189
Query: 160 ---------PFKCQNGKCVYTRRYHVGDVTRGLASRE--TFAFPVRNGFTFVPRLAFGCS 208
+ C YT Y G ++G+ + + + A V +GF FGC
Sbjct: 190 VATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFV------FGCG 243
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFGRDADV 267
N G FGG SG++G S LSL SQ ++ G+FSYCL ++E E++ + G D V
Sbjct: 244 TSNQG-PFGGT-SGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSV 301
Query: 268 RRRDLETTPILLSDL------RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
R +TPI+ + + P ++++L I+IG V G I+D+
Sbjct: 302 YRN---STPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGK----------VIVDS 348
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKAY--PSM 376
GT +T + Y + + Q Y + F D C+ + F+ PS+
Sbjct: 349 GTIITSLVPSVYNAVKAEFLS-------QFAEYPQAPGFSILDTCFNL-TGFREVQIPSL 400
Query: 377 TFHLQEADYIVQPEN---MYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLN 430
F E + V+ ++ +YF+ D + C+A ++ + + SI+G +QQ+N+ +I+D
Sbjct: 401 KFVF-EGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTL 459
Query: 431 VPALRFGSENC 441
+ F E C
Sbjct: 460 GSQIGFAQETC 470
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 151/342 (44%), Gaps = 28/342 (8%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
V++ +G P + +++FD + W QCQPCI+C+DQ IFDP S++Y+ + C+ C
Sbjct: 189 VQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLSCETKHCN 248
Query: 159 ---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFA 215
+ +G C Y Y G T G+ ET +F +V R++ GCSN N G
Sbjct: 249 LLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG---WVDRVSLGCSNKNQGPF 305
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME--ATSVIKFGR---DADVRRR 270
G G G LS S++ SYCLV + ++S ++F V+ +
Sbjct: 306 VGSD--GTFGLGRGSLSFPSRIN---ASSMSYCLVESKDGYSSSTLEFNSPPCSGSVKAK 360
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
L+ +Y+ L I +G + P F I G GG I+ + + +T + N
Sbjct: 361 LLQN-----PKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITMLEN 415
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQP 389
Y + + + L R + A +FD CY S+ P + F + + + P
Sbjct: 416 DTYNVVRDAFVAKTQHLERLK----AFLQFDTCYNLSSNNTVELPILEFEVNDGKSWLLP 471
Query: 390 ENMYFIEPDR-GRFCVAIQ-DDPKYSILGAWQQQNMLIIYDL 429
+ Y D+ G FC A +SILG QQ + +DL
Sbjct: 472 KESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDL 513
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/434 (27%), Positives = 191/434 (44%), Gaps = 49/434 (11%)
Query: 41 SPESPLYPGNLSQS----ERIHK----MFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
S +SP P N + R+H+ + IS + +A + K + L++ + P +
Sbjct: 5 SADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTN-PFLQ 63
Query: 93 QDLF-------------YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
QD Y V + +GTP + +++ DT S ++W QC PC C+ QT P+F
Sbjct: 64 QDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLF 123
Query: 140 DPRASTTYSEIPCDDPLCRSPF--KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
+P S+T+ I C LC+ C+ +C+Y Y G T G S ET +F G
Sbjct: 124 NPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF----GS 179
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEAT 256
V +A GC ++N G +G+LG LS SQ+ +FSYCL RE +
Sbjct: 180 NAVNSVAIGCGHNNQGLFT--GAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGS 237
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI-MRDGTG 315
+ FG A V TT + L +Y+ ++ I +G V P G+ + G G
Sbjct: 238 VPLIFGNQA-VASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNG 296
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-----FDYCYRYDS-S 369
G I+D+GT VT + Y + + R +P +A FD CY S
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAF--------RAGMPSDAKMTSGFSLFDTCYDLSGRS 348
Query: 370 FKAYPSMTFHLQEADYIVQP-ENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIY 427
P+++F + P +N+ + G +C+A + +SI+G QQQ+ + +
Sbjct: 349 SIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSF 408
Query: 428 DLNVPALRFGSENC 441
D + G+ C
Sbjct: 409 DSTGNRVGIGANQC 422
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 174/376 (46%), Gaps = 41/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V IG+P K L+ DT S L W QC PC CF+Q P +DP+ S ++ I C+DP
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV------RNGFTFVPR 202
C+ P K + C Y Y T G + ETF + ++ F V
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---I 259
+ FGC + N G G L PLS SSQL++ FSYCLV TSV +
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGL--GRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKL 373
Query: 260 KFGRDADVRRR-DLETTPILLSDLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
FG D D+ +L T ++ P +YL + I +G ++ P +++ DG G
Sbjct: 374 IFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAG 433
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY---CYRYDSSFK- 371
G IID+GT +++ + Y+ + + + + ++ Y ++F CY + +
Sbjct: 434 GTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-------YKLVEDFPILHPCYNVSGTDEL 486
Query: 372 AYPSMTFHLQEADYIVQ--PENMYFIEPDR-GRFCVAIQDDPK--YSILGAWQQQNMLII 426
+P F +Q AD V P YFI + C+A+ PK SI+G +QQQN I+
Sbjct: 487 NFPE--FLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHIL 544
Query: 427 YDLNVPALRFGSENCA 442
YD L + CA
Sbjct: 545 YDTKNSRLGYAPMRCA 560
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 174/376 (46%), Gaps = 41/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V IG+P K L+ DT S L W QC PC CF+Q P +DP+ S ++ I C+DP
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV------RNGFTFVPR 202
C+ P K + C Y Y T G + ETF + ++ F V
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---I 259
+ FGC + N G G L PLS SSQL++ FSYCLV TSV +
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGL--GRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKL 373
Query: 260 KFGRDADVRRR-DLETTPILLSDLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
FG D D+ +L T ++ P +YL + I +G ++ P +++ DG G
Sbjct: 374 IFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAG 433
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY---CYRYDSSFK- 371
G IID+GT +++ + Y+ + + + + ++ Y ++F CY + +
Sbjct: 434 GTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-------YKLVEDFPILHPCYNVSGTDEL 486
Query: 372 AYPSMTFHLQEADYIVQ--PENMYFIEPDR-GRFCVAIQDDPK--YSILGAWQQQNMLII 426
+P F +Q AD V P YFI + C+A+ PK SI+G +QQQN I+
Sbjct: 487 NFPE--FLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHIL 544
Query: 427 YDLNVPALRFGSENCA 442
YD L + CA
Sbjct: 545 YDTKNSRLGYAPMRCA 560
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 173/372 (46%), Gaps = 61/372 (16%)
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS- 159
+G ++ DTAS L W QC PC C DQ P+FDP +S +Y+ +PC+ C +
Sbjct: 128 ATVGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDAL 187
Query: 160 ----------PFKCQNGKCVYTRRYHVGDVTRGLASRE--TFAFPVRNGFTFVPRLAFGC 207
+ C YT Y G ++G+ + + + A V +GF FGC
Sbjct: 188 QVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFV------FGC 241
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFGRDAD 266
N G FGG SG++G S LSL SQ ++ G+FSYCL ++E E++ + G D
Sbjct: 242 GTSNQG-PFGGT-SGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTS 299
Query: 267 VRRRDLETTPILLSDL------RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
V R +TPI+ + + P ++++L I+IG V G I+D
Sbjct: 300 VYRN---STPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGK----------VIVD 346
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKAY--PS 375
+GT +T + + Y+ + Q Y + F D C+ + F+ PS
Sbjct: 347 SGTIITSL-------VPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNL-TGFREVQIPS 398
Query: 376 MTFHLQEADYIVQPEN---MYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDL 429
+ F E + V+ ++ +YF+ D + C+A ++ + + SI+G +QQ+N+ +I+D
Sbjct: 399 LKFVF-EGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDT 457
Query: 430 NVPALRFGSENC 441
+ F E C
Sbjct: 458 LGSQIGFAQETC 469
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 132/459 (28%), Positives = 206/459 (44%), Gaps = 54/459 (11%)
Query: 9 LAAFFSYF--SVLFLTHFTSSE----STGFSLKLIPIFSPESPLYPGNLSQSERIHKMFE 62
+AA S F +LFL F+ + GF+ L S SPL +LS +R+ F
Sbjct: 1 MAATISLFFHLILFLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAFR 60
Query: 63 ISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVW 122
S +R+ + + + + L+ P + + Y + V+IGTP + DT S L W
Sbjct: 61 RSLSRSAALLNRAATSGAVGLQSSIGPGSGE---YLMSVSIGTPPVDYLGIADTGSDLTW 117
Query: 123 TQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKC----VYTRRYHVGD 178
QC PC++C+ Q PIF+P ST++S +PC+ C + +G C V Y GD
Sbjct: 118 AQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHA---VDDGHCGVQGVCDYSYTYGD 174
Query: 179 VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS-GFAFGGKISGILGFNASPLSLSSQL 237
T S+ F + + GC + +S GF F SG++G LSL SQ+
Sbjct: 175 RTY---SKGDLGFEKITIGSSSVKSVIGCGHASSGGFGFA---SGVIGLGGGQLSLVSQM 228
Query: 238 R--NRIQGLFSYCLVREM-EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEIS 294
+ I FSYCL + A I FG +A V + +TP++ + ++Y+ L IS
Sbjct: 229 SQTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAIS 288
Query: 295 IG--RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
IG RH+ AF G IID+GT +T + Y ++ +++++ R +
Sbjct: 289 IGNERHM------AFAKQ----GNVIIDSGTTLTILPKELYDGVVSSLLKVVKA-KRVKD 337
Query: 353 PYNASQEFDYCYRYDSSFKA-----YPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAI 406
P+ + D C +D A P +T H A+ + P N + D C+ +
Sbjct: 338 PHGS---LDLC--FDDGINAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVN-CLTL 391
Query: 407 QD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ ++ I+G Q N LI YDL L F CA
Sbjct: 392 KAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 161/367 (43%), Gaps = 41/367 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + IGTP +P + A VWTQC PC RCF Q P+F+ AS+TY PC
Sbjct: 27 LYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTA 86
Query: 156 LCRS--PFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
LC S C +G C Y GD T G+ +TFA T LAFGC+ D++
Sbjct: 87 LCESVPASTCSGDGVCSYEVETMFGD-TSGIGGTDTFAI-----GTATASLAFGCAMDSN 140
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA--TSVIKFGRDADVR-R 269
G SG++G +P SL Q+ FSYCL A S + G A +
Sbjct: 141 IKQLLGA-SGVVGLGRTPWSLVGQMNATA---FSYCLAPHGAAGKKSALLLGASAKLAGG 196
Query: 270 RDLETTPIL-LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
+ TTP++ SD + +HL I G I+ PP ++ +DT V+F+
Sbjct: 197 KSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPNGSVVL--------VDTIFGVSFL 248
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY------RYDSSFKAYPSMTFHLQE 382
+ +Q + + + ++G + ++ FD C+ +S P + Q
Sbjct: 249 VDAAFQAIKK---AVTVAVGAAPM-ATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQG 304
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQDDP------KYSILGAWQQQNMLIIYDLNVPALRF 436
A + P + Y + G C+A+ + SILG Q+N+ ++DL+ L F
Sbjct: 305 AAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSF 364
Query: 437 GSENCAN 443
+C++
Sbjct: 365 EPADCSS 371
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 112/370 (30%), Positives = 171/370 (46%), Gaps = 32/370 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP + ++ DT S L W QC PC+ CF+Q P+FDP AS++Y + C D
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQR 208
Query: 157 C------RSPFKCQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVR--NGFTFVPRLAF 205
C +P C+ C Y Y T G + E+F + V + F
Sbjct: 209 CGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVF 268
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE-MEATSVIKFGRD 264
GC + N G +G+LG PLS +SQLR FSYCLV +A S + FG D
Sbjct: 269 GCGHRNRGLFH--GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGSKVVFGED 326
Query: 265 ADVRRR-DLETTPI--LLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
V L+ T S +Y+ L + +G ++ +D+ +DG+GG IID+
Sbjct: 327 YLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDS 386
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFK-AYPSMT 377
GT +++ YQ + Q + ++ L Y +F + CY + P ++
Sbjct: 387 GTTLSYFVEPAYQVIRQAFVDLMSRL------YPLIPDFPVLNPCYNVSGVERPEVPELS 440
Query: 378 FHLQEADYIVQPENMYFI--EPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPA 433
+ P YF+ +PD G C+A++ P+ SI+G +QQQN ++YDL
Sbjct: 441 LLFADGAVWDFPAENYFVRLDPD-GIMCLAVRGTPRTGMSIIGNFQQQNFHVVYDLQNNR 499
Query: 434 LRFGSENCAN 443
L F CA
Sbjct: 500 LGFAPRRCAE 509
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 115/376 (30%), Positives = 170/376 (45%), Gaps = 38/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP K L+ DT S L W QC PC CF Q +DP+ S ++ I C+DP
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPR 221
Query: 157 C------RSPFKCQ--NGKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C P +C+ N C Y Y T G + ETF + R+ V +
Sbjct: 222 CSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENM 281
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N G SG+LG PLS SSQL++ FSYCLV T+V +
Sbjct: 282 MFGCGHWNRGLFS--GASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI 339
Query: 261 FGRDAD-VRRRDLETTPIL---LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D D + +L T + + + +Y+ + I +G + P ++I DG GG
Sbjct: 340 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGG 399
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
IID+GT +++ Y+ + ++ + ++ + + D C+ +
Sbjct: 400 TIIDSGTTLSYFAEPAYEIIKNKFAEKMK---ENYLVFRDFPVLDPCF----NVSGIEEN 452
Query: 377 TFHLQE-----ADYIVQ--PENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIY 427
HL E AD V P FI C+AI PK +SI+G +QQQN I+Y
Sbjct: 453 NIHLPELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILY 512
Query: 428 DLNVPALRFGSENCAN 443
D + L F CA+
Sbjct: 513 DTKMSRLGFTPTKCAD 528
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 130/425 (30%), Positives = 193/425 (45%), Gaps = 21/425 (4%)
Query: 29 STGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHL 88
++GFS+++I S SPLY + +R+ S RAN+ +K +
Sbjct: 32 NSGFSVEMIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHF---NKKSFVASTNTAES 88
Query: 89 PMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
+ Y + ++GTP + DT S + W QCQ C C++QTTPIFDP S TY
Sbjct: 89 TVKASQGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYK 148
Query: 149 EIPCDDPLCRSPF---KCQNGK--CVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPR 202
+PC +C+S C + K C YT +Y G ++G S ET NG + P
Sbjct: 149 TLPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPN 208
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVI 259
GC ++N G F G+ SG++G P+SL SQL + I G FSYCL + ++S +
Sbjct: 209 TVIGCGHNNKG-TFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKL 267
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRF-PPGAFDIMRDGTGGF 317
FG A V +TP++ FY LE S+G + F + +G G
Sbjct: 268 NFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNI 327
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSM 376
IID+GT +T + Y L +++ R P N CY+ S + P +
Sbjct: 328 IIDSGTTLTLLPQEDYSNLESAVADAIQA-NRVSDPSNF---LSLCYQTTPSGQLDVPVI 383
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
T H + AD + P + F++ G C A SI G Q N+L+ YDL + F
Sbjct: 384 TAHFKGADVELNPIST-FVQVAEGVVCFAFHSSEVVSIFGNLAQLNLLVGYDLMEQTVSF 442
Query: 437 GSENC 441
+C
Sbjct: 443 KPTDC 447
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 29/372 (7%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
LP + F +V +GTP ++ DT S +VW QC PC C+ Q+ +FDPR S +Y
Sbjct: 121 LPQGSGEYF--AQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSY 178
Query: 148 SEIPCDDPLCR--SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ + C P+CR C + C+Y Y G VT G + ET F R V R+
Sbjct: 179 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF-ARG--ARVQRV 235
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-------T 256
A GC +DN G SG+LG LS SQ+ FSYCLV + +
Sbjct: 236 AIGCGHDNEGLFIAA--SGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRS 293
Query: 257 SVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD--- 312
S + FG A TP+ + + +Y+HLL S+G V+ + D+ +
Sbjct: 294 STVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTT 352
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFK 371
G GG I+D+GT VT + Y+ + + L R+ FD CY
Sbjct: 353 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL---RVSPGGFSLFDTCYNLSGRRVV 409
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDL 429
P+++ HL + P Y I D G FC A+ D SI+G QQQ +++D
Sbjct: 410 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 469
Query: 430 NVPALRFGSENC 441
+ + F ++C
Sbjct: 470 DAQRVGFVPKSC 481
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/376 (29%), Positives = 169/376 (44%), Gaps = 39/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT---PIFDPRASTTYSEIPCD 153
Y VE+ +GTP K L+ DT S L W QC P + ++ P +D +S++Y EIPC
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 118
Query: 154 DPLCR---SPFK-----CQNGKCVYTRRYHVGDVTRGLASRETFAFPVR--------NGF 197
D C+ +P C YT Y T G+ + ET + R N
Sbjct: 119 DDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHK 178
Query: 198 T---FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR-IQGLFSYCLV--- 250
T + +A GCS ++ G +F G SG+LG P+SL++Q R+ + G+FSYCLV
Sbjct: 179 TRRIRIKNVALGCSRESVGASFLGA-SGVLGLGQGPISLATQTRHTALGGIFSYCLVDYL 237
Query: 251 REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISI-GRHIVRFPPGAFD 308
R A+S + GR R L TPI+ + F Y+++ +++ G+ + +
Sbjct: 238 RGSNASSFLVMGR---THWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWG 294
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
I DG G I D+GT ++++R Y ++ + + Q IP + F+ CY
Sbjct: 295 IDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP----EGFELCYNVTR 350
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQQNMLI 425
K P + Q + P N Y + CVA+Q +ILG QQ+ I
Sbjct: 351 MEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDHHI 410
Query: 426 IYDLNVPALRFGSENC 441
YDL + F C
Sbjct: 411 EYDLAKARIGFKWSPC 426
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 163/376 (43%), Gaps = 30/376 (7%)
Query: 84 EDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
ED +P+A Y +++ GTP + + + DT S++ W C PC C + P F
Sbjct: 107 EDADIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-F 165
Query: 140 DPRASTTYSEIPCDDPLCRSPFKCQNG----KCVYTRRYHVGDVTRGLASRETFAFPVRN 195
+P S+TY+ + C C+ C C T+RY + S ET +
Sbjct: 166 EPSKSSTYNYLTCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSV---- 221
Query: 196 GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA 255
G V FGCSN G + ++GF +PLS SQ FSYCL +
Sbjct: 222 GSQQVENFVFGCSNAARGLI--QRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSS 279
Query: 256 --TSVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRD 312
T + G++A + + L+ TP+L + P F Y+ L IS+G +V P G +
Sbjct: 280 AFTGSLLLGKEA-LSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDES 338
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
G IID+GT +T + Y + + L +L + + FD CY S
Sbjct: 339 TGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMA----SPTDLFDTCYNRPSGDVE 394
Query: 373 YPSMTFHLQEA-DYIVQPENMYFIEPDRGR-FCVAI-----QDDPKYSILGAWQQQNMLI 425
+P +T H + D + +N+ + D G C+A D S G +QQQ + I
Sbjct: 395 FPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRI 454
Query: 426 IYDLNVPALRFGSENC 441
++D+ L SENC
Sbjct: 455 VHDVAESRLGIASENC 470
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 122/441 (27%), Positives = 182/441 (41%), Gaps = 49/441 (11%)
Query: 28 ESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQEL---- 83
S S+ L+ + P +P N+ + I + S+AR NY+ S + + +
Sbjct: 51 SSATVSMSLVHRYGPCAPSQYSNV-PTPSISETLRRSRARTNYIMSQASKSMGMGMASTP 109
Query: 84 --EDIHLPMAKQ------DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFD 133
+D + + + L Y V + GTP PQ LL DT S + W QC PC +C+
Sbjct: 110 DDDDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYP 169
Query: 134 QTTPIFDPRASTTYSEIPCDDPLCRSPFK-----CQNG--KCVYTRRYHVGDVTRGLASR 186
Q P+FDP S+TY+ I C+ CR C +G +C Y+ Y G +RG+ S
Sbjct: 170 QKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSN 229
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
ET G T V FGC D G + K G+LG +P+SL Q + G FS
Sbjct: 230 ETLTL--APGIT-VEDFHFGCGRDQRGPS--DKYDGLLGLGGAPVSLVVQTSSVYGGAFS 284
Query: 247 YCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYL-HLLEISIGRHIVRFPPG 305
YCL + G + TP+ FY+ + IS+G + P
Sbjct: 285 YCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQS 344
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
AF GG IID+GT T + Y L + L++ P S +FD CY
Sbjct: 345 AF------RGGMIIDSGTVDTELPETAYNALEAALRKALKAY-----PLVPSDDFDTCYN 393
Query: 366 YDS-SFKAYPSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQ 420
+ S P + F I + N + C+A Q+ D I+G Q
Sbjct: 394 FTGYSNITVPRVAFTFSGGATIDLDVPNGILVND-----CLAFQESGPDDGLGIIGNVNQ 448
Query: 421 QNMLIIYDLNVPALRFGSENC 441
+ + ++YD + F + C
Sbjct: 449 RTLEVLYDAGRGNVGFRAGAC 469
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 168/369 (45%), Gaps = 31/369 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y VEV +GTP + ++ DT S L W QC PC+ CFDQ P+FDP AST+Y + C D
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTR 209
Query: 157 C------RSPFKCQNGK---CVYTRRYHVGDVTRGLASRETFAFPVRNGFT-FVPRLAFG 206
C +P C++ + C Y Y T G + E F + + V + G
Sbjct: 210 CGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLG 269
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRD- 264
C + N G +G+LG PLS +SQLR FSYCLV A S I FG D
Sbjct: 270 CGHRNRGLFH--GAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVFGDDN 327
Query: 265 ADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMR-DGTGGFIIDTG 322
+ L T S F Y+ L I +G ++ P + + + DG+GG IID+G
Sbjct: 328 VLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSG 387
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY---CYRYDSSFKAYPSMTFH 379
T +++ Y+ + Q + + R Y +F CY S + F
Sbjct: 388 TTLSYFPEPAYKAIRQAF------VDRMDKAYPLIADFPVLSPCYNV-SGVERVEVPEFS 440
Query: 380 LQEADYIVQ--PENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPAL 434
L AD V P YFI D G C+A+ P+ SI+G +QQQN ++YDL+ L
Sbjct: 441 LLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRL 500
Query: 435 RFGSENCAN 443
F CA
Sbjct: 501 GFAPRRCAE 509
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 153/347 (44%), Gaps = 19/347 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP+ ++FDT S W QCQPC + C++Q +FDP S+TY+ + C P
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 240 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 296
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++CL T + FG +
Sbjct: 297 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAASARL 354
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F T G I+D+GT +T + Y
Sbjct: 355 TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF-----ATAGTIVDSGTVITRLPPAAY 409
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 410 SSLRYAFAAAMAARGYKKAP--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 467
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
+ C+A +D I+G Q + + YD+ + F
Sbjct: 468 IMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 29/372 (7%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
LP + F +V +GTP ++ DT S +VW QC PC C+ Q+ +FDPR S +Y
Sbjct: 115 LPQGSGEYF--AQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSY 172
Query: 148 SEIPCDDPLCR--SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ + C P+CR C + C+Y Y G VT G + ET F R V R+
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF-ARG--ARVQRV 229
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-------T 256
A GC +DN G SG+LG LS SQ+ FSYCLV + +
Sbjct: 230 AIGCGHDNEGLFIAA--SGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRS 287
Query: 257 SVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD--- 312
S + FG A TP+ + + +Y+HLL S+G V+ + D+ +
Sbjct: 288 STVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTT 346
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFK 371
G GG I+D+GT VT + Y+ + + L R+ FD CY
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL---RVSPGGFSLFDTCYNLSGRRVV 403
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDL 429
P+++ HL + P Y I D G FC A+ D SI+G QQQ +++D
Sbjct: 404 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 463
Query: 430 NVPALRFGSENC 441
+ + F ++C
Sbjct: 464 DAQRVGFVPKSC 475
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 112/426 (26%), Positives = 178/426 (41%), Gaps = 53/426 (12%)
Query: 49 GNLSQSERIHKMFEIS--------KARANYMASMSKPNAFQELEDIHLPMAK----QDLF 96
G S+SER E ++ N++ + + + + +P+ Q L
Sbjct: 62 GECSESERKGDWVEKQLVLDGLHVRSIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLN 121
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + +G+ + ++ DT S L W QC+PC C++Q P+F P S +Y I C+
Sbjct: 122 YIVTMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTT 179
Query: 157 CRSPFKCQNGK-------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C+S G C Y Y G T G E F G V FGC
Sbjct: 180 CQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGF----GGISVSNFVFGCGR 235
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
+N G FGG SG++G S LS+ SQ G+FSYCL + A+ + G + V
Sbjct: 236 NNKGL-FGGA-SGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASGSLVMGNQSGV 293
Query: 268 RRRDLETTPILLSDLRPH------FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ TPI + + P+ + L+L I +G + +F G GG I+D+
Sbjct: 294 FKN---VTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF-----GNGGVILDS 345
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR---YDSSFKAYPSMTF 378
GT ++ + Y+ L ++ + G P D C+ YD SM F
Sbjct: 346 GTVISRLAPSVYKALKAKFLEQFS--GFPSAP--GFSILDTCFNLTGYDQVNIPTISMYF 401
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
+ Y ++ D R C+A + D+ + I+G +QQ+N ++YD + +
Sbjct: 402 EGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVG 461
Query: 436 FGSENC 441
F E C
Sbjct: 462 FAKEPC 467
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 129/468 (27%), Positives = 209/468 (44%), Gaps = 41/468 (8%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTG---FSLKLIPIFSPESPLYPGNLSQSERI 57
M + L +F F++ L+ ++ + + KLI S SP Y N S +R
Sbjct: 1 MEVTSSFTLKSFLLTFTITLLSLALTTNTKPNKPVTTKLIHRDSIFSPAYNPNDSIKDRA 60
Query: 58 HKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF----------YSVEVNIGTPM 107
+M + S AR +Y+ ++SK N+ D A D + + V +IG P
Sbjct: 61 KRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADDAYEASLLSELCTFLVNFSIGQPP 120
Query: 108 KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGK 167
PQ+ + DT SSL W QC+PCI C Q P+++P +S+TY D +
Sbjct: 121 VPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVSCSDFDRTDTTFTATHGSD 180
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPV-RNGFTFVPRLAFGCSNDNSGF-AFGGKISGILG 225
C Y++ Y TRG +RE F +G T + + FGC ++N+ G SG+ G
Sbjct: 181 CNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGCGHNNTQLPGPTGYASGVFG 240
Query: 226 FNASPLSLSSQLRNRIQGL-FSYCLVREMEATSVIKFGRDADVRRRDLE--TTPILLSDL 282
S S+ S+L G FSYC+ + + F R + +E +TP++
Sbjct: 241 LGDSGSSIISKL-----GFGFSYCIGNIGDP--LYGFHRLTLGNKLKIEGYSTPLV---P 290
Query: 283 RPHFYLHLLEISIGRHIVRFPPGAFD-IMRDGTGG-FIIDTGTPVTFIRNGPYQTLMQRY 340
R +Y+ L+ ISIG+ + P F + +G +ID+G +++I Y + +
Sbjct: 291 RGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATLSYIPRQAYNVVRDKV 350
Query: 341 DQILRS-LGRQRIPYNASQEFDYCY--RYDSSFKAYPSMTFHLQE-ADYIVQPENMYFIE 396
IL L R R ++ CY + + + +P TFHL + AD + Q E ++F
Sbjct: 351 SSILSGFLSRYRY---IARHLSLCYIGKLNQDLQGFPDATFHLADGADLVFQVEGLFFQY 407
Query: 397 PDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D C+A+ + D + ++G QQ + YDL L F C
Sbjct: 408 TDN-VLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQKLYFQRIEC 454
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 125/447 (27%), Positives = 207/447 (46%), Gaps = 30/447 (6%)
Query: 4 VQALPLAAFFSYFSVLFLTH--FTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMF 61
V +LPL F ++F++ + F+S + T KLI S SP Y N + ++R +
Sbjct: 8 VSSLPLI-FSTHFALTIANNLEFSSIQPTRLVTKLIHRDSIVSPYYRSNDTVADRTERTM 66
Query: 62 EISKARANYM-ASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
+ S AR +Y+ A + + +L P A + LF V ++G P PQ + DT SSL
Sbjct: 67 KASLARLSYLYAKIERDFDINDLWLNLHPSASEPLFL-VNFSMGQPPVPQLAIMDTGSSL 125
Query: 121 VWTQCQPCIRCFDQTT-PIFDPRASTTYSEIPCDDPLCR-SPF-KCQ-NGKCVYTRRYHV 176
+W QC PC C Q P+FDP S+TY + C + +CR +P +C + +CVY + Y
Sbjct: 126 LWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNIICRYAPSGECDSSSQCVYNQTYVE 185
Query: 177 GDVTRGLASRETFAFPVRN-GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSS 235
G + G+ + E F + G V + FGCS+ N + + +G+ G + S+ +
Sbjct: 186 GLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRNGNYK-DRRFTGVFGLGSGITSVVN 244
Query: 236 QLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI 295
Q+ ++ FSYC+ + ++ + +TP+ + D H+ + L IS+
Sbjct: 245 QMGSK----FSYCIGNIADPDYSYNQLVLSEGVNMEGYSTPLDVVD--GHYQVILEGISV 298
Query: 296 GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN 355
G + P AF + IID+GT T++ Y+ L + ++ L R P+
Sbjct: 299 GETRLVIDPSAFK-RTEKQRRVIIDSGTAPTWLAENEYRALER---EVRNLLDRFLTPFM 354
Query: 356 ASQEFDYCYRYDSSFKAYPSMTFHLQE-ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSI 414
Y + +P++TFH E AD +V E + V +D +S+
Sbjct: 355 RESFLCYKGKVGQDLVGFPAVTFHFAEGADLVVDTE--------MRQASVYGKDFKDFSV 406
Query: 415 LGAWQQQNMLIIYDLNVPALRFGSENC 441
+G QQ + YDLN L F +C
Sbjct: 407 IGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 114/396 (28%), Positives = 172/396 (43%), Gaps = 28/396 (7%)
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+AS + A Q + + M Y ++V +G+P K L+ DT S L W QC PC
Sbjct: 129 VASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHD 188
Query: 131 CFDQTTPIFDPRASTTYSEIPCDDPLCR--------SPFKCQNGKCVYTRRYHVGDVTRG 182
CF Q +DP+AS +Y I C+DP C P K N C Y Y T G
Sbjct: 189 CFQQNGAFYDPKASASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTG 248
Query: 183 LASRETFAFPVRNG-----FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL 237
+ ETF + V + FGC + N G +G+LG PLS SSQL
Sbjct: 249 DFAVETFTVNLTTSGGSSELYNVENMMFGCGHWNRGLFH--GAAGLLGLGRGPLSFSSQL 306
Query: 238 RNRIQGLFSYCLVREMEATSV---IKFGRDADVRRR-DLETTPILLSD---LRPHFYLHL 290
++ FSYCLV T+V + FG D D+ +L T + + +Y+ +
Sbjct: 307 QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQI 366
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ 350
I + ++ P ++I DG GG IID+GT +++ Y+ + + + ++ G+
Sbjct: 367 KSIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAE--KAKGKY 424
Query: 351 RIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD 409
+ Y D C+ P + + P FI + C+AI
Sbjct: 425 PV-YRDFPILDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGT 483
Query: 410 PK--YSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
PK +SI+G +QQQN I+YD L + CA+
Sbjct: 484 PKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCAD 519
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 170/381 (44%), Gaps = 53/381 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE------- 149
Y++E+ +G+P K + + DT S LVW QC+PC +C+ Q+ PI+DP AS+T+++
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63
Query: 150 ---IPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-PRLAF 205
+P C S K C+Y +Y T+G + ET G + P F
Sbjct: 64 CQSLPASG--CSSSAK----TCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQF 117
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIKFG 262
GC NSG +FGG +GI+G +SLS+QL + I FSYCLV + TS + FG
Sbjct: 118 GCGRLNSG-SFGGA-AGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFG 175
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDI------------ 309
A + +TPI+ + R +Y LE IS+G + A D
Sbjct: 176 SSASTGSGAI-STPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVR 234
Query: 310 -MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY--NASQEFDYCYRY 366
+ +GG I D+GT +T + + Y + + + +P +S FD CY
Sbjct: 235 ALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSV------SLPTVDASSSGFDLCYDV 288
Query: 367 DSS--FKAYPSMTFHLQEADYIVQPENMYFIEPDRGR--FCVAIQDDPKYSILGAWQ--Q 420
S FK +P++T + + P+ YF+ D C+A+ + Q
Sbjct: 289 SKSKNFK-FPALTLAFKGTKF-SPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQ 346
Query: 421 QNMLIIYDLNVPALRFGSENC 441
QN ++YD + C
Sbjct: 347 QNYHVVYDRGTSTISMSPAQC 367
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 160/360 (44%), Gaps = 36/360 (10%)
Query: 102 NIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPF 161
+IG P PQ + DT SSL W C PC C Q+ PIFDP S+TYS + C + C
Sbjct: 98 SIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCSE--CN--- 152
Query: 162 KCQ--NGKCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVPRLAFGC----SNDNSGF 214
KC NG+C Y+ Y ++G+ +RE + VP L FGC S ++G+
Sbjct: 153 KCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNGY 212
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
+ G I+G+ G + SL + FSYC + + T+ KF R + +++
Sbjct: 213 PYQG-INGVFGLGSGRFSLLPSFGKK----FSYC-IGNLRNTNY-KFNRLVLGDKANMQG 265
Query: 275 TPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFD-IMRDGTGGFIIDTGTPVTFIRNGPY 333
L+ + +Y++L ISIG + P F+ + D G IID+G T++ +
Sbjct: 266 DSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYGF 325
Query: 334 QTLMQRYDQILRS---LGRQRIPYNASQEFDYCYR--YDSSFKAYPSMTFHLQEADYIVQ 388
+ L + +L L +Q + + CY +P +TFH E +
Sbjct: 326 EVLSFEVENLLEGVLVLAQQ----DKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDL 381
Query: 389 PENMYFIEPDRGRFCVAI-------QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
FI+ FC+A+ D +S +G QQN + YDLN + F +C
Sbjct: 382 DVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 160/363 (44%), Gaps = 41/363 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC--- 152
Y +++ +G+P K ++ DT SSL W QC+PC + C Q P+F+P AS TY + C
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSS 179
Query: 153 ----------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVP 201
+DPLC + +G CVYT Y + G SR+ P + +P
Sbjct: 180 ECSLLKAATLNDPLCTA-----SGVCVYTASYGDASYSMGYLSRDLLTLTPSQT----LP 230
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKF 261
+GC DN G GK +GI+G LS+ +QL + FSYCL +S F
Sbjct: 231 SFTYGCGQDNEGLF--GKAAGIVGLARDKLSMLAQLSPKYGYAFSYCL--PTSTSSGGGF 286
Query: 262 GRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
+ + TP++ + P Y L L I++ V + + IID
Sbjct: 287 LSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------IID 340
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SSFKAYPSMTFH 379
+GT VT + Y L + + +I+ Q Y+ D C++ S P +
Sbjct: 341 SGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSI---LDTCFKGSLKSMSGAPEIRMI 397
Query: 380 LQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
Q AD ++ N+ IE D+G C+A + +I+G QQQ I YD++ + F
Sbjct: 398 FQGGADLSLRAPNI-LIEADKGIACLAFASSNQIAIIGNHQQQTYNIAYDVSASKIGFAP 456
Query: 439 ENC 441
C
Sbjct: 457 GGC 459
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 158/371 (42%), Gaps = 31/371 (8%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC-QPCIRCFDQTTPIFDPRAS 144
+ +P+ FY V + IGTP +P + D LVWTQC Q C RCF Q P+FD AS
Sbjct: 40 VTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNAS 99
Query: 145 TTYSEIPCDDPLCRS-PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+T+ PC +C S P + G Y + G V G RL
Sbjct: 100 STFRPEPCGAAVCESIPTRSCAGDGGGACGYEA-STSFGRTVGRIGTDAVAIGTAATARL 158
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFG 262
AFGC+ + G SG +G + LSL++Q+ FSYCL + +S + G
Sbjct: 159 AFGCAVASEMDTMWGS-SGSVGLGRTNLSLAAQMNATA---FSYCLAPPDTGKSSALFLG 214
Query: 263 RDADV--RRRDLETTPILLSDLRPH------FYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
A + + TTP + + PH + L L I G + P IM
Sbjct: 215 ASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIM---- 270
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
+ T TPVT + + Y+ L + + ++G +P Q +D C+ S+ P
Sbjct: 271 ----VSTATPVTALVDSVYRDLRK---AVADAVGAAPVPPPV-QNYDLCFPKASASGGAP 322
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYDLNV 431
+ Q + P + Y + CVAI P SILG+ QQ N+ +++DL+
Sbjct: 323 DLVLAFQGGAEMTVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDK 382
Query: 432 PALRFGSENCA 442
L F +C+
Sbjct: 383 ETLSFEPADCS 393
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 173/377 (45%), Gaps = 56/377 (14%)
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS- 159
+G ++ DTAS L W QC PC C DQ P+FDP +S +Y+ +PC+ C +
Sbjct: 155 ATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDAL 214
Query: 160 ----------PFKCQN-----GKCVYTRRYHVGDVTRGLAS--RETFAFPVRNGFTFVPR 202
CQ C YT Y G +RG+ + R + A V +GF
Sbjct: 215 QLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEVIDGFV---- 270
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKF 261
FGC N G FGG SG++G S LSL SQ ++ G+FSYCL ++E +++ +
Sbjct: 271 --FGCGTSNQGPPFGGT-SGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKESDSSGSLVI 327
Query: 262 GRDADVRRRDLETTPILLSDL------RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
G D+ V R +TPI+ + + P ++++L I++G V +
Sbjct: 328 GDDSSVYR---NSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKA- 383
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKA 372
IID+GT +T + + Y+ + Q Y + F D C+ +
Sbjct: 384 --IIDSGTVITSL-------VPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREV 434
Query: 373 -YPSMTFHLQEADYIVQPEN---MYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLI 425
PS+ + + V+ ++ +YF+ D + C+A ++ + + +I+G +QQ+N+ +
Sbjct: 435 QVPSLKL-VFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRV 493
Query: 426 IYDLNVPALRFGSENCA 442
I+D + + F E C
Sbjct: 494 IFDTSGSQVGFAQETCG 510
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 115/428 (26%), Positives = 177/428 (41%), Gaps = 29/428 (6%)
Query: 33 SLKLIPIFSPESPLYP---GNLSQSERIHK-MFEISKARANYMASMSKPNAFQELEDIHL 88
SL ++ P SPL G S +E + + + R AS +KP L +
Sbjct: 72 SLTVVHRHGPCSPLRSRGSGAPSHTEILRRDQDRVDAIRRKVTASSNKPKGGVSLL-ANW 130
Query: 89 PMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
+ Y + +GTP + DT S W QC+PC C++Q P+FDP AS+TYS
Sbjct: 131 GKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYS 190
Query: 149 EIPCDDPLCRS---------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
+PC C+ N C Y Y T G +R+T +
Sbjct: 191 AVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSP 250
Query: 200 ---VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT 256
VP FGC + N+G G++ G+LG SL SQ+ R FSYCL A
Sbjct: 251 ADTVPGFVFGCGHSNAGTF--GEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPSAA 308
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+ FG A R + + T ++ +YL+L I + ++ P AF G
Sbjct: 309 GYLSFGGAA--ARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATA----AG 362
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPS 375
IID+GT + + Y L + + +R P +S FD CY + P+
Sbjct: 363 TIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAP--SSPIFDTCYDFTGHETVRIPA 420
Query: 376 MTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
+ + + + P + + D + C+A + ILG QQ+ + +IYD+ +
Sbjct: 421 VELVFADGATVHLHPSGVLYTWNDVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRI 480
Query: 435 RFGSENCA 442
FG + CA
Sbjct: 481 GFGRKGCA 488
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 164/372 (44%), Gaps = 44/372 (11%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L+ IGTP +P + D A LVWTQC C RCF Q P+F P AS+T+ PC
Sbjct: 65 LYNVANFTIGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGT 124
Query: 155 PLCRS--PFKCQNGKCVY--TRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C+S C + C Y T +G T G+ + +TFA T L FGC
Sbjct: 125 DACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAI-----GTATASLGFGCVV- 178
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVR- 268
SG G SG++G +P SL SQ+ FSYCL + S + G A +
Sbjct: 179 ASGIDTMGGPSGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGSSAKLAG 235
Query: 269 RRDLETTPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+ TTP + + D+ ++ + L I G + PP ++ + T P
Sbjct: 236 GGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVL--------VQTLAP 287
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY-RYDSSFKAYPSMTFHLQE- 382
++F+ + YQ L + ++ +++G Q FD C+ + S + P + F Q+
Sbjct: 288 MSFLVDSAYQALKK---EVTKAVGAAPT-ATPLQPFDLCFPKAGLSNASAPDLVFTFQQG 343
Query: 383 ADYIVQPENMYFIE--PDRGRFCVAIQD---------DPKYSILGAWQQQNMLIIYDLNV 431
A + P Y I+ ++G C+AI D +ILG+ QQ+N + DL
Sbjct: 344 AAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEK 403
Query: 432 PALRFGSENCAN 443
L F +C++
Sbjct: 404 KTLSFEPADCSS 415
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 154/355 (43%), Gaps = 19/355 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + IG+P + +L DT S + W QC PC C+ Q PI+DP S++Y + C L
Sbjct: 45 YFARMGIGSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 104
Query: 157 CRSP--FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C++ CQ C Y Y + G E+F + N T + +AFGC + NSG
Sbjct: 105 CQALDYSACQGMGCSYRVVYGDSSASSGDLGIESF-YLGPNSSTAMRNIAFGCGHSNSGL 163
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE----MEATSVIKFGRDADVRRR 270
+G+LG LS SQ+ I FSYCLV +S + FGR A
Sbjct: 164 FR--GEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAA 221
Query: 271 DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
TP+L + FY +L IS+G + PP F + +GTGG I+D+GT VT +
Sbjct: 222 RF--TPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVV 279
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
Y L Y R+ R P D C+ + PS+ H +V
Sbjct: 280 PAAYAVLRDAY----RAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVL 335
Query: 389 PENMYFIEPDR-GRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P I DR G FC+A S++G QQQ I +DL + C
Sbjct: 336 PGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 154/356 (43%), Gaps = 27/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP KP ++ DT SSL W QC PC + C Q+ P+FDP+ S++Y+ + C P
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSSP 176
Query: 156 LCR-------SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C + C+Y Y + G S++T +F G VP +GC
Sbjct: 177 QCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF----GANSVPNFYYGC 232
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL +TS +
Sbjct: 233 GQDNEGLF--GRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL----PSTSSSGYLSIGSY 286
Query: 268 RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
TP++ + L Y IS+ V P A + IID+GT +T
Sbjct: 287 NPGGYSYTPMVSNTLDDSLYF----ISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITR 342
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS-FKAYPSMTFHLQEADYI 386
+ Y L + ++ ++ Y+ D C+ +S +A P+++ +
Sbjct: 343 LPTSVYTALSKAVAAAMKGSTKRAAAYSI---LDTCFEGQASKLRAVPAVSMAFSGGATL 399
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 400 KLSAGNLLVDVDGATTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 166/372 (44%), Gaps = 29/372 (7%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
LP + F +V +GTP ++ DT S +VW QC PC C+ Q+ +FDPR S +Y
Sbjct: 115 LPQGSGEYF--AQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSY 172
Query: 148 SEIPCDDPLCR--SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ + C P+CR C + C+Y Y G VT G + ET F R V R+
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF-ARG--ARVQRV 229
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-------T 256
A GC +DN G SG+LG LS +Q+ FSYCLV + +
Sbjct: 230 AIGCGHDNEGLFIAA--SGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRS 287
Query: 257 SVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD--- 312
S + FG A TP+ + + +Y+HLL S+G V+ + D+ +
Sbjct: 288 STVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTT 346
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFK 371
G GG I+D+GT VT + Y+ + + L R+ FD CY
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL---RVSPGGFSLFDTCYNLSGRRVV 403
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDL 429
P+++ HL + P Y I D G FC A+ D SI+G QQQ +++D
Sbjct: 404 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 463
Query: 430 NVPALRFGSENC 441
+ + F ++C
Sbjct: 464 DAQRVGFVPKSC 475
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 188/425 (44%), Gaps = 38/425 (8%)
Query: 34 LKLIPIFSPESPLY-PGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
L +IPI+ SP P + S + M AR Y++S++ Q+ + +
Sbjct: 32 LSVIPIYGKCSPFTAPKSESWMNTVIDMASKDPARIRYLSSLTA----QKTVAAPIASGQ 87
Query: 93 QDLF---YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
Q L Y V V +GTP + +++ DT++ W C CI C TT F + S+T++
Sbjct: 88 QVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTTT--FSAQNSSTFAT 145
Query: 150 IPCDDPLCRSP--FKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
+ C P C C N C++ + Y GD T S + G +P +
Sbjct: 146 LDCSKPECTQARGLSCPTTGNVDCLFNQTYG-GDSTF---SATLVQDSLHLGPNVIPNFS 201
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFG 262
FGC + SG + + G++G PLSL SQ + GLFSYCL + + +K G
Sbjct: 202 FGCISSASGSSIPPQ--GLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLG 259
Query: 263 RDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ + + TTP+L + RP +Y++L IS+GR +V P + G IID+
Sbjct: 260 PVG--QPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDS 317
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
GT +T Y + D+ + +G P A FD C+ ++ A P++T HL
Sbjct: 318 GTVITRFVPAIYTAV---RDEFRKQVGGSFSPLGA---FDTCFATNNEVSA-PAITLHLS 370
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDP-----KYSILGAWQQQNMLIIYDLNVPALRF 436
D + EN C+A+ P +++ QQQN I++D+N L
Sbjct: 371 GLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGI 430
Query: 437 GSENC 441
E C
Sbjct: 431 ARELC 435
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 164/391 (41%), Gaps = 26/391 (6%)
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
+ RA + + S F E++ +P Y+V V +GTP K LLFDT S L
Sbjct: 97 LRVKSIRAKHSMNSSTTGVFNEMK-TRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDL 155
Query: 121 VWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK------CQNGKCVYTRR 173
WTQC+PC CF Q FDP ST+Y + C C+S K + C+Y +
Sbjct: 156 TWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVK 215
Query: 174 YHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSL 233
Y G T G + ET + F GC N G F G +G+LG SP++L
Sbjct: 216 YGTG-YTVGFLATETLTITPSDVF---ENFVIGCGERNGG-RFSG-TAGLLGLGRSPVAL 269
Query: 234 SSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEI 293
SQ + + LFSYCL +T + FG + + TPI S + + L + I
Sbjct: 270 PSQTSSTYKNLFSYCLPASSSSTGHLSFGGGV---SQAAKFTPI-TSKIPELYGLDVSGI 325
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP 353
S+G + P F T G IID+GT +T++ + + L + +++ + +
Sbjct: 326 SVGGRKLPIDPSVFR-----TAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGT 380
Query: 354 YNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD---DP 410
+D+ + + F + + ++ C+A +D D
Sbjct: 381 SGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDT 440
Query: 411 KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+I G QQ+ ++YD+ + F C
Sbjct: 441 DVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 158/366 (43%), Gaps = 38/366 (10%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCD 153
L + V V G+P + L DT S + W QC PC C+ Q P+FDP S TYS +PC
Sbjct: 159 LEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCG 218
Query: 154 DPLCRSP-FKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
P C + KC N G C+Y Y G T G+ S ET + +P AFGC N
Sbjct: 219 HPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD---LPGFAFGCGQTN 275
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRD---ADVR 268
G G LG A LSL SQ FSYCL + G A
Sbjct: 276 LGEFGGVDGLVGLGRGA--LSLPSQAAATFGATFSYCLPSYDTTHGYLTMGSTTPAASND 333
Query: 269 RRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
D++ T ++ + P Y + ++ I IG +I+ PP F RDGT + D+GT +T+
Sbjct: 334 DDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVF--TRDGT---LFDSGTILTY 388
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS---------SFKAYPSMTF 378
+ Y +L R+ + Q P A FD CY + +FK F
Sbjct: 389 LPPEAYASLRDRFKFTM----TQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDGAVF 444
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYDLNVPALR 435
L ++ P++ P G C+A P ++I+G QQ+ +IYD+ +
Sbjct: 445 DLSPVAILIYPDD---TAPATG--CLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIG 499
Query: 436 FGSENC 441
FG C
Sbjct: 500 FGQFTC 505
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 149/365 (40%), Gaps = 25/365 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++ +GTP L DT S + W QCQPC RC+ Q+ P+FDPR ST+Y E+ D P
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREMGYDAPD 193
Query: 157 CRSPFKCQNG-----KCVYTRRY-HVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C++ + G CVY Y G T G ET F G VP ++ GC +D
Sbjct: 194 CQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTF---AGGVQVPHMSIGCGHD 250
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVR------EMEATSVIKFG 262
N G F +GILG +S SQ+ + FSYCL +S + G
Sbjct: 251 NKGL-FAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSVSSTLTIG 309
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD---GTGGFII 319
A TP + + FY L + D+ D G GG I+
Sbjct: 310 DGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLDPYTGRGGVIL 369
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
D+GT VT + Y + LG+ I S FD CY P+++ H
Sbjct: 370 DSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIG-GPSGFFDTCYTMGGRAMKVPTVSMH 428
Query: 380 LQEADYIVQPENMYFIEPDR-GRFCVAIQ--DDPKYSILGAWQQQNMLIIYDLNVPALRF 436
+ P Y I D G C A D SI+G QQQ ++Y++ + F
Sbjct: 429 FAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVVYNIGGGRVGF 488
Query: 437 GSENC 441
+C
Sbjct: 489 APNSC 493
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 114/396 (28%), Positives = 174/396 (43%), Gaps = 28/396 (7%)
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+AS + A Q + + M Y ++V +G+P K L+ DT S L W QC PC
Sbjct: 144 VASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYD 203
Query: 131 CFDQTTPIFDPRASTTYSEIPCDDPLCR--------SPFKCQNGKCVYTRRYHVGDVTRG 182
CF Q +DP+AS +Y I C+D C P K N C Y Y T G
Sbjct: 204 CFQQNGAFYDPKASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTG 263
Query: 183 LASRETFAFPV-RNGFTF----VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL 237
+ ETF + NG + V + FGC + N G +G+LG PLS SSQL
Sbjct: 264 DFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFH--GAAGLLGLGRGPLSFSSQL 321
Query: 238 RNRIQGLFSYCLVREMEATSV---IKFGRDADVRRR-DLETTPILLSD---LRPHFYLHL 290
++ FSYCLV T+V + FG D D+ +L T + + +Y+ +
Sbjct: 322 QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQI 381
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ 350
I + ++ P ++I DG GG IID+GT +++ Y+ + + + ++ G+
Sbjct: 382 KSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAE--KAKGKY 439
Query: 351 RIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD 409
+ Y D C+ P + + P FI + C+A+
Sbjct: 440 PV-YRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGT 498
Query: 410 PK--YSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
PK +SI+G +QQQN I+YD L + CA+
Sbjct: 499 PKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCAD 534
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 168/371 (45%), Gaps = 38/371 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V+V +GTP + ++ DT S L W QC PC+ CF+Q+ PIFDP AS +Y + C D
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDR 208
Query: 157 CR--------SPFKCQNGK---CVYTRRYHVGDVTRGLASRETFAFPV-RNGFTFVPRLA 204
CR +P +C+ + C Y Y T G + E F + ++G V +A
Sbjct: 209 CRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVA 268
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLVREMEAT-SVIKFG 262
FGC + N G +G+LG PLS +SQLR G FSYCLV A S I FG
Sbjct: 269 FGCGHRNRGLFH--GAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFG 326
Query: 263 R-DADVRRRDLETTPIL-LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
DA + L T +D +YL L I +G V GG IID
Sbjct: 327 HDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLS-----AGGTIID 381
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY---CYRYDSSFKA-YPSM 376
+GT +++ YQ + Q + + R Y F CY + K P +
Sbjct: 382 SGTTLSYFPEPAYQAIRQAF------IDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPEL 435
Query: 377 TFHLQEADYIVQPENMYFI--EPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVP 432
+ + P YFI EP+ G C+A+ P+ SI+G +QQQN ++YDL
Sbjct: 436 SLVFADGAAWEFPAENYFIRLEPE-GIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHN 494
Query: 433 ALRFGSENCAN 443
L F CA+
Sbjct: 495 RLGFAPRRCAD 505
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 153/363 (42%), Gaps = 36/363 (9%)
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+ M + Y V + +G+P + Q+++ D+ S +VW QCQPC +C+ Q+ P+FDP S
Sbjct: 189 DVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADS 248
Query: 145 TTYSEIPCDDPLCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+++ + C +C C G+C Y Y G T+G + ET F G T V
Sbjct: 249 ASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRS 304
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
+A GC + N G +G+LG +S QL + G FSYCLV
Sbjct: 305 VAIGCGHRNRGMFV--GAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSA---------- 352
Query: 263 RDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
P++ + P F Y+ L + +G V F + G GG ++DT
Sbjct: 353 ----------AWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDT 402
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL 380
GT VT + YQ + +L R FD CY P+++F+
Sbjct: 403 GTAVTRLPTLAYQAFRDAFLAQTANLPRA----TGVAIFDTCYDLLGFVSVRVPTVSFYF 458
Query: 381 QEADYIVQPENMYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ P + I D G FC A SILG QQ+ + I +D + FG
Sbjct: 459 SGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGP 518
Query: 439 ENC 441
C
Sbjct: 519 NIC 521
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 124/451 (27%), Positives = 204/451 (45%), Gaps = 29/451 (6%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
MA V L + F FS +S+S GFS LI I SP SP Y ++S
Sbjct: 15 MASVNLLLIICFTFIFSPCI---SAASDSKGFSTNLIHIHSPSSP-YKNVKAESLAKDTA 70
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
E + +R Y+ + + A Q + + P+ + + ++IG P +++ DT S L
Sbjct: 71 LESTLSRHAYLRARQQ-KALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDL 129
Query: 121 VWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS---PFKCQN-GKCVYTRRYHV 176
W QC+PC C+ Q PI++ S +Y+E+ C++P C S +C + G C+Y Y
Sbjct: 130 FWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTSYAD 189
Query: 177 GDVTRGLASRETFAFPVR-NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSS 235
G T GL S E AF + ++ FGC N F + G+LG +SL S
Sbjct: 190 GSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFVTSSRDGGVLGLGPGLVSLVS 249
Query: 236 QLR--NRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLL 291
QL ++ F+YC + A + FG DA D+ TP+++++ +Y++LL
Sbjct: 250 QLSAIGKVSKSFAYCFGNLSNPNAGGFLVFG-DATYLNGDM--TPMVIAEF---YYVNLL 303
Query: 292 EISIGRHIVRFP--PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
I +G R +F+ DG+GG IID+G+ ++ Y+ + L+ G
Sbjct: 304 GIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKK-GY 362
Query: 350 QRIPYNASQEFDYCY--RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ 407
P +S + C+ + +P++ +L E+ I+ F++ FC+
Sbjct: 363 NISPLTSSPD---CFEGKIGRDLPLFPTLVLYL-ESTGILNDRWSIFLQRYDELFCLGFT 418
Query: 408 DDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
SI+G QQ+ Y+L + L S
Sbjct: 419 SGEGLSIIGTLAQQSYKFGYNLELSTLSIES 449
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 171/385 (44%), Gaps = 42/385 (10%)
Query: 82 ELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
E +P++ Q L Y V + +G+ K ++ DT S L W QC+PC+ C++Q P
Sbjct: 46 EASQTQIPLSSGINLQTLNYIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQGP 103
Query: 138 IFDPRASTTYSEIPCDDPLCRS-PFKCQN---------GKCVYTRRYHVGDVTRGLASRE 187
IF P S++Y + C+ C+S F N C Y Y G T G E
Sbjct: 104 IFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVE 163
Query: 188 TFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSY 247
+F G V FGC +N G FGG +SG++G S LSL SQ G+FSY
Sbjct: 164 ALSF----GGVSVSDFVFGCGRNNKGL-FGG-VSGLMGLGRSYLSLVSQTNATFGGVFSY 217
Query: 248 CL-VREMEATSVIKFGRDADVRRRD--LETTPILLSDLRPHFY-LHLLEISIGRHIVRFP 303
CL E ++ + G ++ V + + T +L + +FY L+L I +G ++ P
Sbjct: 218 CLPTTEAGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAP 277
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYC 363
+ G GG +ID+GT +T + + Y+ L + + + G P D C
Sbjct: 278 ------LSFGNGGILIDSGTVITRLPSSVYKALKAEF--LKKFTGFPSAP--GFSILDTC 327
Query: 364 YR---YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGA 417
+ YD S+ F + Y ++ D + C+A + D +I+G
Sbjct: 328 FNLTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGN 387
Query: 418 WQQQNMLIIYDLNVPALRFGSENCA 442
+QQ+N +IYD + F E C+
Sbjct: 388 YQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 118/401 (29%), Positives = 187/401 (46%), Gaps = 41/401 (10%)
Query: 59 KMFEISKARANYMASMSKPNAFQE-LEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTA 117
+ + S++R + +A+ + NA E P+ K Y++ IGTP DT
Sbjct: 53 RAVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTG 112
Query: 118 SSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD--------PLCRSPFKCQNGKCV 169
S L+WT+C C RC + +P + P +S++ + + C D PLC + +G
Sbjct: 113 SDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGN 172
Query: 170 YTRRYHVGDV------TRGLASRETFAFPVRNGFTFVPRLAFGCS-NDNSGFAFGGKISG 222
+ Y G+ T G+ ETF F + P +AFGC+ GF G SG
Sbjct: 173 CSYHYAYGNARDTHHYTEGILMTETFTF--GDDAAAFPGIAFGCTLRSEGGFGTG---SG 227
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE---TTPIL- 278
++G LSL +QL F Y L ++ A S I FG ADV + + +TP+L
Sbjct: 228 LVGLGRGKLSLVTQLNVEA---FGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLT 284
Query: 279 ---LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD-GTGGFIIDTGTPVTFIRNGPYQ 334
+ DL P +Y+ L IS+G +V+ P G F R G GG I D+GT +T + + P
Sbjct: 285 NPVVQDL-PFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPD-PAY 342
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPEN-- 391
TL++ D++L +G Q+ P A+ + C+ SS +PSM H AD + EN
Sbjct: 343 TLVR--DELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYL 400
Query: 392 --MYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLN 430
M + R ++ +I+G Q + +++DL+
Sbjct: 401 PQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLS 441
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 155/353 (43%), Gaps = 33/353 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + IG+P Q+++ D+ S +VW QC+PC +C++QT PIF+P S ++ + C +
Sbjct: 129 YFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSSNV 188
Query: 157 CR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C+ G+C Y Y G T+G + ET G T + A GC + N G
Sbjct: 189 CNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITI----GRTVIQDTAIGCGHWNEG 244
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRDL 272
+G+LG P+S QL + G F YCLV R M ++
Sbjct: 245 MFV--GAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPVGAMW------------- 289
Query: 273 ETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
P++ + P F Y+ L +++G V F + GTGG ++DTGT +T +
Sbjct: 290 --VPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTV 347
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPE 390
Y + I ++ R P FD CY + P+++F+ + P
Sbjct: 348 AYNAFRDAF--IAQTTNLPRAP--GVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPA 403
Query: 391 NMYFIEPDR-GRFCVAIQDDPK-YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ I D G FC A P SI+G QQ+ + + D + FG C
Sbjct: 404 RNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 113/407 (27%), Positives = 176/407 (43%), Gaps = 38/407 (9%)
Query: 37 IPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK---Q 93
+PI P SQ ++F ++R +++ S A + L+D H P K +
Sbjct: 100 LPITQKYGPCSGSGHSQPPSPQEIFGRDESRVSFINSKFNQYAPENLKD-HTPNNKLFDE 158
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
D + V+V GTP + L+ DT SS+ WTQC+PC+RC + FDP AS TYS C
Sbjct: 159 DGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSC- 217
Query: 154 DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
P N Y Y + G +T + F P+ FGC +N G
Sbjct: 218 -----IPSTVGN---TYNMTYGDKSTSVGNYGCDTMTLEHSDVF---PKFQFGCGRNNEG 266
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
FG G+LG LS SQ ++ + +FSYCL E S++ FG A + L+
Sbjct: 267 -DFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLL-FGEKATSQSSSLK 324
Query: 274 TTPIL----LSDLRP--HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
T ++ S L ++++ LL+IS+G + P F + G IID+GT +T
Sbjct: 325 FTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITR 379
Query: 328 IRNGPYQTLMQRYDQILR----SLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQE 382
+ Y L + + + S GR++ D CY P + H E
Sbjct: 380 LPQRAYSALKAAFKKAMAKYPLSNGRRK----KGDILDTCYNLSGRKDVLLPEIVLHFGE 435
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDL 429
+ D R C+A + + +I+G QQ ++ ++YD+
Sbjct: 436 GADVRLNGKRVIWGNDASRLCLAFAGNSELTIIGNRQQVSLTVLYDI 482
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 118/401 (29%), Positives = 187/401 (46%), Gaps = 41/401 (10%)
Query: 59 KMFEISKARANYMASMSKPNAFQE-LEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTA 117
+ + S++R + +A+ + NA E P+ K Y++ IGTP DT
Sbjct: 53 RAVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTG 112
Query: 118 SSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD--------PLCRSPFKCQNGKCV 169
S L+WT+C C RC + +P + P +S++ + + C D PLC + +G
Sbjct: 113 SDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGN 172
Query: 170 YTRRYHVGDV------TRGLASRETFAFPVRNGFTFVPRLAFGCS-NDNSGFAFGGKISG 222
+ Y G+ T G+ ETF F + P +AFGC+ GF G SG
Sbjct: 173 CSYHYAYGNARDTHHYTEGILMTETFTF--GDDAAAFPGIAFGCTLRSEGGFGTG---SG 227
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE---TTPIL- 278
++G LSL +QL F Y L ++ A S I FG ADV + + +TP+L
Sbjct: 228 LVGLGRGKLSLVTQLNVEA---FGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLT 284
Query: 279 ---LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD-GTGGFIIDTGTPVTFIRNGPYQ 334
+ DL P +Y+ L IS+G +V+ P G F R G GG I D+GT +T + + P
Sbjct: 285 NPVVQDL-PFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPD-PAY 342
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPEN-- 391
TL++ D++L +G Q+ P A+ + C+ SS +PSM H AD + EN
Sbjct: 343 TLVR--DELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYL 400
Query: 392 --MYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLN 430
M + R ++ +I+G Q + +++DL+
Sbjct: 401 PQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLS 441
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 164/373 (43%), Gaps = 32/373 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT-TPIFDPRASTTYSEIPCDDP 155
Y V++ +GTP + L+ DT S LVW +C C C T F R STT+S C D
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDS 148
Query: 156 LCR-----SPFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGF-TFVPRLAF 205
C+ +C + + C Y Y G T G S+ET +G + +AF
Sbjct: 149 ACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAF 208
Query: 206 GCSNDNSGFAFGGK----ISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE---MEATSV 258
GC+ SG + G G++G P+SLSSQL +R FSYCL+ TS
Sbjct: 209 GCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSY 268
Query: 259 IKFG---RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFP--PGAFDIMRDG 313
+ G D +R + TP+ ++ L P FY +E S+ ++ P P + + G
Sbjct: 269 LLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIE-SVSVDGIKLPINPSVWALDELG 327
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKA 372
GG I+D+GT +TF+ Y ++ +++ R P + FD C
Sbjct: 328 NGGTIVDSGTTLTFLPEPAYLQIL----TVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPR 383
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDL 429
P ++F L P YF++ D C+A+Q +S++G QQ L+ +D
Sbjct: 384 LPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDK 443
Query: 430 NVPALRFGSENCA 442
+ L F CA
Sbjct: 444 DRTRLGFSRHGCA 456
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 123/452 (27%), Positives = 207/452 (45%), Gaps = 28/452 (6%)
Query: 1 MAHVQALPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKM 60
MA V L L F++ ++ +S+S GFS LI I SP SP Y ++S
Sbjct: 1 MASVNNLLLIICFTFIFSPCIS--AASDSKGFSTNLIHIHSPSSP-YKNVKAESLAKDTA 57
Query: 61 FEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSL 120
E + +R Y+ + + A Q + + P+ + + ++IG P +++ DT S L
Sbjct: 58 LESTLSRHAYLRARQQ-KALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDL 116
Query: 121 VWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS---PFKCQN-GKCVYTRRYHV 176
W QC+PC C+ Q PI++ S +Y+E+ C++P C S +C + G C+Y Y
Sbjct: 117 FWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGSCLYQTAYAD 176
Query: 177 GDVTRGLASRETFAFPVR-NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSS 235
G T GL S E AF + ++ FGC N F + G+LG +SL S
Sbjct: 177 GARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFITSNRDGGVLGLGPGLVSLVS 236
Query: 236 QLR--NRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLL 291
QL ++ F+YC + A + FG DA D+ TP+++++ +Y++LL
Sbjct: 237 QLSAIGKVSKSFAYCFGNISNPNAGGFLVFG-DATYLNGDM--TPMVIAEF---YYVNLL 290
Query: 292 EISIGRHIVRFP--PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
I +G R +F+ DG+GG IID+G+ ++ Y+ + L+ G
Sbjct: 291 GIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVDKLKK-GY 349
Query: 350 QRIPYNASQEFDYCY--RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ 407
P +S + C+ + + +P++ +L E+ I+ F++ FC+
Sbjct: 350 NISPLTSSPD---CFEGKIERDLPLFPTLVLYL-ESTGILNDRWSIFLQRYDELFCLGFT 405
Query: 408 DDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
SI+G QQ+ Y+L + L S
Sbjct: 406 SGEGLSIIGTLAQQSYKFGYNLELSTLSIESN 437
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 161/354 (45%), Gaps = 32/354 (9%)
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL-CRSPFK-CQNGKCV 169
L D L W QC PC C Q +P+FDP S T+S IP + + CR P++ NG C
Sbjct: 113 LALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGACG 172
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVP--RLAFGCSNDNSGFAFGGKISGILGFN 227
+ Y G +R+TF+FP N FVP + FGC++ F ++GILG
Sbjct: 173 FDIAYRDNTHASGYLARDTFSFPAGND-DFVPLSAIVFGCAHQTEHFKNQRAVAGILGLG 231
Query: 228 AS-----PLSLSSQLRNRIQGLFSYC-LVREMEATSVIKFGRD------ADVRRRDLETT 275
P + + Q+ G FSYC V M S ++FG D +V R ++T
Sbjct: 232 MGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSHPPPNVHR---QST 288
Query: 276 PILL-SDLRPHFYLHLLEISIGRH-IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
P+L + +++ L +S+G + + P F G GG ++D GT +T + Y
Sbjct: 289 PVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSAY 348
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYI-VQPEN 391
+ D +R ++R + + C + + PSMT H + ++ V PE+
Sbjct: 349 VHI----DHAVRQHLQRRGAHIVVVRGNTCVQQPAPHHDVLPSMTLHFENGAWLRVMPEH 404
Query: 392 MY--FIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLN--VPALRFGSENC 441
++ F+ C +++GA QQ N I+DL+ +P + F E+C
Sbjct: 405 VFMPFVVGGHHYQCFGFVSSTDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 110/355 (30%), Positives = 154/355 (43%), Gaps = 19/355 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + IG P + +L DT S + W QC PC C+ Q PI+DP S++Y + C L
Sbjct: 12 YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 71
Query: 157 CRSP--FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C++ CQ C Y Y + G E+F + N T + +AFGC + NSG
Sbjct: 72 CQALDYSACQGMGCSYRVVYGDSSASSGDLGIESF-YLGPNSSTAMRNIAFGCGHSNSGL 130
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV----REMEATSVIKFGRDADVRRR 270
+G+LG LS SQ+ I FSYCLV + +S + FGR A
Sbjct: 131 FR--GEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAA 188
Query: 271 DLETTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
TP+L + + +Y L IS+G + PP F + +GTGG I+D+GT VT +
Sbjct: 189 RF--TPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTRVV 246
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
Y L Y R+ R P D C+ + PS+ H +V
Sbjct: 247 PPAYAVLRDAY----RAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDMVL 302
Query: 389 PENMYFIEPDR-GRFCVAIQDDP-KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P I DR G FC+A S++G QQQ I +DL + C
Sbjct: 303 PGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 116/436 (26%), Positives = 178/436 (40%), Gaps = 46/436 (10%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGN---LSQSERIHKMFEISKARANYMA---SMSKPNAF 80
S S G ++ L P SP+ P N S ER+ + + RA Y+ S +K
Sbjct: 56 STSGGITVPLHHRHGPCSPV-PSNKMPASLEERLQR----DQLRAAYIKRKFSGAKGGDV 110
Query: 81 QELEDIHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
++ + +P + L Y + V IG+P Q + DT S + W QC+PC +C +
Sbjct: 111 EQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVD 170
Query: 137 PIFDPRASTTYSEIPCDDPLC------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFA 190
+FDP AS+TYS C C + C + +C Y Y G T G S +T
Sbjct: 171 SLFDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLT 230
Query: 191 FPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
G + FGCS SG F + G++G SL SQ FSYCL
Sbjct: 231 L----GSNAIKGFQFGCSQSESG-GFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLP 285
Query: 251 REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDI 309
++ + G R TP+L S P +Y LLE I +G + P F
Sbjct: 286 PTPGSSGFLTLGA---ASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF-- 340
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS- 368
+ G ++D+GT +T + Y L + + ++ P S D C+ +
Sbjct: 341 ----SAGSVMDSGTVITRLPPTAYSALSSAFKAGM----KKYPPAQPSGILDTCFDFSGQ 392
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLI 425
S + PS+ + N +E D +C+A DD +G QQ+ +
Sbjct: 393 SSVSIPSVALVFSGGAVVNLDFNGIMLELD--NWCLAFAANSDDSSLGFIGNVQQRTFEV 450
Query: 426 IYDLNVPALRFGSENC 441
+YD+ A+ F + C
Sbjct: 451 LYDVGGGAVGFRAGAC 466
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 173/385 (44%), Gaps = 50/385 (12%)
Query: 85 DIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
D +P++ Q L Y V V IG + ++ DT S L W QCQPC C++Q P+F+
Sbjct: 51 DSQIPLSSGVRLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFN 108
Query: 141 PRASTTYSEIPCDDPLCRSPFKCQNGK----------CVYTRRYHVGDVTRGLASRETFA 190
P S +Y I C+ C+S + G C Y Y G TRG E
Sbjct: 109 PSGSPSYQTILCNSSTCQS-LQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLN 167
Query: 191 FPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL- 249
G T V FGC +N G FGG SG++G S LSL SQ +G+FSYCL
Sbjct: 168 L----GTTHVSNFIFGCGRNNKGL-FGGA-SGLMGLGKSDLSLVSQTSAIFEGVFSYCLP 221
Query: 250 VREMEATSVIKFGRDADVRRRDLETTPILLSDL-----RPHFY-LHLLEISIGRHIVRFP 303
+A+ + G ++ V + TTPI + + P FY L+L ISIG ++ P
Sbjct: 222 TTAADASGSLILGGNSSVYK---NTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAP 278
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYC 363
G +ID+GT +T + Y+ L + + P++ D C
Sbjct: 279 -------NYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGF-PSAPPFSI---LDTC 327
Query: 364 YRYDSSFKA-YPSMTFHLQ-EADYIVQPENM-YFIEPDRGRFCVAIQD---DPKYSILGA 417
+ + + P++ + A+ V + YF++ D + C+A+ D + I+G
Sbjct: 328 FNLNGYDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGN 387
Query: 418 WQQQNMLIIYDLNVPALRFGSENCA 442
+QQ+N +IY+ L F +E C+
Sbjct: 388 YQQRNQRVIYNTKESKLGFAAEACS 412
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 150/356 (42%), Gaps = 29/356 (8%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L Y + V +G+P K Q +L DT S + W QC+PC +C Q P+FDP +S+TYS C
Sbjct: 131 LEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSS 190
Query: 155 PLC----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C + C + +C YT Y G T G S +T A G V + FGCSN
Sbjct: 191 AACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL----GSNAVRKFQFGCSNV 246
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR 270
SG F + G++G SL SQ FSYCL ++ + G
Sbjct: 247 ESG--FNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATSSSSGFLTLGAGTS---- 300
Query: 271 DLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
TP+L S P FY + + I +G + P F + G I+D+GT +T +
Sbjct: 301 GFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVF------SAGTIMDSGTVLTRLP 354
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQ 388
Y L + + +Q S D C+ + S + P++ +
Sbjct: 355 PTAYSALSSAFKAGM----KQYPSAPPSGILDTCFDFSGQSSVSIPTVALVFSGGAVVDI 410
Query: 389 PENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ ++ C+A DD I+G QQ+ ++YD+ A+ F + C
Sbjct: 411 ASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 152/347 (43%), Gaps = 19/347 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC+ C++Q +FDP S+TY+ + C P
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAP 237
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 238 ACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNEG 294
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q ++ G+F++CL T + FG +
Sbjct: 295 LF--GEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAAASARL 352
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F T G I+D+GT +T + Y
Sbjct: 353 TTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVF-----ATAGTIVDSGTVITRLPPPAY 407
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ P A D CY + S A P+++ Q + +
Sbjct: 408 SSLRYAFAAAMAARGYKKAP--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASG 465
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
+ C+A +D I+G Q + + YD+ + F
Sbjct: 466 IMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 512
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 168/385 (43%), Gaps = 36/385 (9%)
Query: 86 IHLPMAKQDLFYSVE----VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
+H P+ F S E V +GTP L+ DT S LVW QC PC RC+ Q +FDP
Sbjct: 71 LHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDP 130
Query: 142 RASTTYSEIPCDDPLCRS-------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR 194
R S+TY +PC P CR+ G C Y Y G + G + + AF
Sbjct: 131 RRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAF--- 187
Query: 195 NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL---VR 251
T+V + GC DN G +G+LG +S+S+Q+ +F YCL
Sbjct: 188 ANDTYVNNVTLGCGRDNEGLF--DSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTS 245
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISI-GRHIVRFPPGAFDI 309
+S + FGR + T +L + RP +Y+ + S+ G + F + +
Sbjct: 246 RSTRSSYLVFGRTPEPPSTAF--TALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLAL 303
Query: 310 -MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
G GG ++D+GT ++ Y L +D R+ G +R+ S FD CY
Sbjct: 304 DTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSV-FDACYDLRG 362
Query: 369 SFKA-YPSMTFHLQ-EADYIVQPENMYFIEPDRG-------RFCVAIQ-DDPKYSILGAW 418
A P + H AD + PEN YF+ D G R C+ + D S++G
Sbjct: 363 RPAASAPLIVLHFAGGADMALPPEN-YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNV 421
Query: 419 QQQNMLIIYDLNVPALRFGSENCAN 443
QQQ +++D+ + F + C +
Sbjct: 422 QQQGFRVVFDVEKERIGFAPKGCTS 446
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 168/385 (43%), Gaps = 36/385 (9%)
Query: 86 IHLPMAKQDLFYSVE----VNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
+H P+ F S E V +GTP L+ DT S LVW QC PC RC+ Q +FDP
Sbjct: 71 LHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDP 130
Query: 142 RASTTYSEIPCDDPLCRS-------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR 194
R S+TY +PC P CR+ G C Y Y G + G + + AF
Sbjct: 131 RRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAF--- 187
Query: 195 NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL---VR 251
T+V + GC DN G +G+LG +S+S+Q+ +F YCL
Sbjct: 188 ANDTYVNNVTLGCGRDNEGLF--DSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTS 245
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISI-GRHIVRFPPGAFDI 309
+S + FGR + T +L + RP +Y+ + S+ G + F + +
Sbjct: 246 RSTRSSYLVFGRTPEPPSTAF--TALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLAL 303
Query: 310 -MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
G GG ++D+GT ++ Y L +D R+ G +R+ S FD CY
Sbjct: 304 DTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSV-FDACYDLRG 362
Query: 369 SFKA-YPSMTFHLQ-EADYIVQPENMYFIEPDRG-------RFCVAIQ-DDPKYSILGAW 418
A P + H AD + PEN YF+ D G R C+ + D S++G
Sbjct: 363 RPAASAPLIVLHFAGGADMALPPEN-YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNV 421
Query: 419 QQQNMLIIYDLNVPALRFGSENCAN 443
QQQ +++D+ + F + C +
Sbjct: 422 QQQGFRVVFDVEKERIGFAPKGCTS 446
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 167/371 (45%), Gaps = 35/371 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V++ +GTP + ++ DT S L W QC PC+ CF+Q P+FDP AS +Y + C DP
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPR 211
Query: 157 C------RSPFKCQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVR--NGFTFVPRLAF 205
C +P C+ + C Y Y T G + E F + V + F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRD 264
GC + N G +G+LG LS +SQLR FSYCLV + S I FG D
Sbjct: 272 GCGHSNRGLFH--GAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFGDD 329
Query: 265 ADV----RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
+ R P + +Y+ L + +G + P +D+ +DG+GG IID
Sbjct: 330 DALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIID 389
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY---CYRYDSSFKAYPSMT 377
+GT +++ Y+ + + + + R Y +F CY S +
Sbjct: 390 SGTTLSYFAEPAYEVIRRAFVE------RMDKAYPLVADFPVLSPCYNV-SGVERVEVPE 442
Query: 378 FHLQEADYIVQ--PENMYFI--EPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNV 431
F L AD V P YF+ +PD G C+A+ P+ SI+G +QQQN ++YDL
Sbjct: 443 FSLLFADGAVWDFPAENYFVRLDPD-GIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQN 501
Query: 432 PALRFGSENCA 442
L F CA
Sbjct: 502 NRLGFAPRRCA 512
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 158/375 (42%), Gaps = 44/375 (11%)
Query: 93 QDLFYSVEVNIGTP-MKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSE 149
Q L Y + +G K ++ DT S L W QC+PC C+ Q P+FDP AS T++
Sbjct: 176 QTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAA 235
Query: 150 IPCDDPLCRSPFKCQNG--------------KCVYTRRYHVGDVTRGLASRETFAFPVRN 195
+PC P C + K G +C Y Y G +RG+ +++T
Sbjct: 236 VPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTT- 294
Query: 196 GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA 255
T + FGC N G FGG +G++G + LSL SQ R G+FSYCL +
Sbjct: 295 --TKLDGFVFGCGLSNRGL-FGG-TAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTS 350
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
T + G ++ T ++ +P FY + + F G G
Sbjct: 351 TGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGF-----GAG 405
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKA 372
++D+GT +T + Y+ + + +R Y A+ F D CY +
Sbjct: 406 NVLVDSGTVITRLAPSVYKAVRAEF--------ARRFEYPAAPGFSILDACYDLTGRDEV 457
Query: 373 -YPSMTFHLQ-EADYIVQPENMYF-IEPDRGRFCVAIQDDP---KYSILGAWQQQNMLII 426
P +T L+ A V M F + D + C+A+ P + I+G +QQ+N ++
Sbjct: 458 NVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVV 517
Query: 427 YDLNVPALRFGSENC 441
YD L F E+C
Sbjct: 518 YDTVGSRLGFADEDC 532
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 119/441 (26%), Positives = 191/441 (43%), Gaps = 51/441 (11%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI 86
+ +TG +KL + + GN + ER+ + +S+ ++ + E +
Sbjct: 29 TSNTGIRMKLTHVDAK------GNYTAPERVRRAIALSR-------QINLASTRAEGGGV 75
Query: 87 HLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRAS 144
P+ Y E +G P + L DT SSL+WTQC C+R C Q P F+ +S
Sbjct: 76 SAPVHWATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSS 135
Query: 145 TTYSEIPCDDPLCRSP---FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP 201
+++ +PC D C F +G C + Y G + G + F F G T
Sbjct: 136 GSFAPVPCQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTFQ-SGGAT--- 190
Query: 202 RLAFGCSNDNSGFAFGGKI---SGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEA 255
LAFGC + FA + SG++G LSL+SQ + FSYCL A
Sbjct: 191 -LAFGCVSFTR-FAAPDVLHGASGLIGLGRGRLSLASQTGAK---RFSYCLTPYFHNNGA 245
Query: 256 TSVIKFGRDADVRRRDLETTPILLSD------LRPHFYLHLLEISIGRHIVRFPPGAFDI 309
+S + G A + + + +YL L+ I++G + P AFD+
Sbjct: 246 SSHLFVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDL 305
Query: 310 --MRDG--TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE--FDYC 363
+ +G GG IID+G+P T + Y+ LM ++ R L +P + C
Sbjct: 306 QEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLM---GELARQLNGSLVPPPGEDDGGMALC 362
Query: 364 YRYDSSFKAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQN 422
+ P++ H AD + PEN Y+ ++ C+AI SI+G +QQQN
Sbjct: 363 VARGDLDRVVPTLVLHFSGGADMALPPEN-YWAPLEKSTACMAIVRGYLQSIIGNFQQQN 421
Query: 423 MLIIYDLNVPALRFGSENCAN 443
M I++D+ L F + +C+
Sbjct: 422 MHILFDVGGGRLSFQNADCST 442
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 179/392 (45%), Gaps = 58/392 (14%)
Query: 82 ELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
E +P++ Q L Y V + +G+ ++ DT S L W QC+PC+ C++Q P
Sbjct: 46 EASQTQIPLSSGINLQTLNYIVTMGLGS--TNMTVIIDTGSDLTWVQCEPCMSCYNQQGP 103
Query: 138 IFDPRASTTYSEIPCDDPLCRS-PFKCQN--------GKCVYTRRYHVGDVTRGLASRET 188
IF P S++Y + C+ C+S F N C Y Y G T G E
Sbjct: 104 IFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQ 163
Query: 189 FAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
+F G V FGC +N G FGG +SG++G S LSL SQ G+FSYC
Sbjct: 164 LSF----GGVSVSDFVFGCGRNNKGL-FGG-VSGLMGLGRSYLSLVSQTNATFGGVFSYC 217
Query: 249 L-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH------FYLHLLEISIGRHIVR 301
L E A+ + G ++ V + TPI + + P+ + L+L I + ++
Sbjct: 218 LPTTESGASGSLVMGNESSVFK---NVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQ 274
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF- 360
P +F G GG +ID+GT +T + + Y+ L + +Q + ++ F
Sbjct: 275 VP--SF-----GNGGVLIDSGTVITRLPSSVYKALKALFL-------KQFTGFPSAPGFS 320
Query: 361 --DYCYR---YDSSFKAYPSMTFHLQ-EADYIVQPENM-YFIEPDRGRFCVA---IQDDP 410
D C+ YD + P+++ H + A+ V Y ++ D + C+A + D
Sbjct: 321 ILDTCFNLTGYDE--VSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAY 378
Query: 411 KYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+I+G +QQ+N +IYD + F E+C+
Sbjct: 379 DTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 153/361 (42%), Gaps = 34/361 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP K L+FDT S L WTQCQPC R C++Q P+F P STTYS I C P
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSP 190
Query: 156 LCRSPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C S + G C+Y +Y + G ++ET + + FG
Sbjct: 191 DC-SQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTD---VIENFLFG 246
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
C +N G G +G++G +S+ Q + +FSYCL + +T + F
Sbjct: 247 CGQNNRGLF--GSAAGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTF--GGG 302
Query: 267 VRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
L+ TPI + +FY + ++ + +G + F T G IID+GT +
Sbjct: 303 GGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFS-----TSGAIIDSGTVI 357
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDS-SFKAYPSMTFHLQEA 383
T + Y L +++ G + P D CY S P + F +
Sbjct: 358 TRLPPDAYSALKSAFEK-----GMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGG 412
Query: 384 DYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ + + C+A QD +I+G QQ+ + ++YD+ + FG
Sbjct: 413 EELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNG 472
Query: 441 C 441
C
Sbjct: 473 C 473
>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 416
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 116/414 (28%), Positives = 185/414 (44%), Gaps = 39/414 (9%)
Query: 48 PGNLSQ-SERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF-YSVEVNIGT 105
PGN++ S +I + AN ++S + +D+ LP+ F Y V V+IGT
Sbjct: 23 PGNVTGLSFQIVALSRAPDEHANNLSSFAT-------DDMRLPILTSARFVYGVFVSIGT 75
Query: 106 P--MKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKC 163
K Q L DT++S+ W C+PC Q +F P AS T+ + +DP+C +P++
Sbjct: 76 GQGFKLQVLGLDTSTSMSWVMCEPCQPSLPQAGHLFSPAASPTFHGVHSNDPVCTAPYRP 135
Query: 164 QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG-------FTFVPRLAFGCSNDNSGFAF 216
C + + G SR+TF +RNG VP + FGC++ +GF
Sbjct: 136 TANGCSFRFPF-----ASGYLSRDTFH--LRNGGLSGGAPIESVPGIMFGCAHSVAGFHN 188
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVIKFGRDADVRRRDLET 274
G + G+L + LSL +QL R G FSYCL + + ++ G D
Sbjct: 189 DGTLGGVLSLSHLRLSLLTQLSARAGGRFSYCLPKPTQGNPHGFLRLGADVLPPLPHSHM 248
Query: 275 TPILL-SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
T + + S P +YL L+ I++ +R P F G GG I+ +T I Y
Sbjct: 249 TALTVRSGSAPDYYLSLVGITLAEKRLRIDPRVFAA---GRGGCSINPAATITAIMEPAY 305
Query: 334 QTLMQRYDQILRSLGRQRI----PYNASQEFDYCYRYDSSFKAYPSMTFHLQE-ADYIVQ 388
+ + ++ LG R+ P + FD Y+ S PSM FH ++ A+
Sbjct: 306 LVVERALVAYMKELGSDRVKKGPPGGGALFFDRMYK--SVQARLPSMAFHFKDGAELWFT 363
Query: 389 PENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
PE ++ + F + + + +++GA QQ N +D+ L F SE C
Sbjct: 364 PEQLFEVHGMVAWFMM-VGKGYRRTVIGAPQQVNTRFTFDVAAGRLSFASELCG 416
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 119/459 (25%), Positives = 183/459 (39%), Gaps = 59/459 (12%)
Query: 21 LTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMAS------- 73
++ F S+G L L P+SP P L + ARA ++AS
Sbjct: 34 ISDFPHRNSSGLHLTL---HHPQSPCSPAPLPSDLPFSTVLTHDDARAAHLASRLATTSN 90
Query: 74 ---------MSKPNAFQ-----ELED--IHLPMAKQDLF----YSVEVNIGTPMKPQHLL 113
+ KP A L+D +P+ Y E+ +GTP ++
Sbjct: 91 APSRRPTTSLRKPKAAAGASGGPLDDSLASVPLTPGTSVGVGNYVTELGLGTPATSYAMV 150
Query: 114 FDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDPLCR-------SPFKCQ- 164
DT SSL W QC PC + C Q P++DPRAS+TY+ +PC C +P C
Sbjct: 151 VDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSV 210
Query: 165 NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGIL 224
C+Y Y + G SR+T +F G P +GC DN G G+ +G++
Sbjct: 211 RNVCIYQASYGDSSFSVGYLSRDTVSF----GSGSYPNFYYGCGQDNEGLF--GRSAGLI 264
Query: 225 GFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRP 284
G + LSL QL + FSYCL +T + G TP+ S L
Sbjct: 265 GLARNKLSLLYQLAPSLGYSFSYCLPTP-ASTGYLSIG---PYTSGHYSYTPMASSSLDA 320
Query: 285 HFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQI 343
Y + L +S+G + P + + IID+GT +T + Y L +
Sbjct: 321 SLYFVTLSGMSVGGSPLAVSPAEYSSLPT-----IIDSGTVITRLPTAVYTALSKAVAAA 375
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFC 403
+ +G Q P A D C++ +S P++ + I+ D C
Sbjct: 376 M--VGVQSAP--AFSILDTCFQGQASQLRVPAVAMAFAGGATLKLATQNVLIDVDDSTTC 431
Query: 404 VAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+A +I+G QQQ ++YD+ + F + C+
Sbjct: 432 LAFAPTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 166/377 (44%), Gaps = 38/377 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ-TTPIFDPRASTTYSEIPCDDP 155
Y V++ +GTP + L+ DT S LVW +C C C + F PR S+++S C DP
Sbjct: 88 YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 147
Query: 156 LCRSPFKCQNGKCVYTR---------RYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAF 205
CR + C +TR Y G ++ G S+ET +G + L+F
Sbjct: 148 HCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSF 207
Query: 206 GCSNDNSGFAFGGK----ISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA---TSV 258
GC SG + G G++G +S SSQL R FSYCL+ + TS
Sbjct: 208 GCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSF 267
Query: 259 IKFGRDAD----VRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ G + TP+ ++ L P F Y+ + I+I + P ++I G
Sbjct: 268 LMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQG 327
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ---EFDYCYRY--DS 368
GG ++D+GT +T++ Y+++L+S+ R+ NA++ FD C +S
Sbjct: 328 NGGTVVDSGTTLTYLTK-------TAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGES 380
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLI 425
+ P + F L P YF+E + G C+AI + +S++G QQ L+
Sbjct: 381 RRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLL 440
Query: 426 IYDLNVPALRFGSENCA 442
+D L F C
Sbjct: 441 EFDKEESRLGFTRRGCG 457
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 165/353 (46%), Gaps = 26/353 (7%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCD 153
L + V V GTP + ++ DT S L W QC+PC C+ Q P FDP S++Y+ +PC
Sbjct: 135 LEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCG 194
Query: 154 DPLCRSPFKCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
P+C + NG C+Y +Y G T G+ SR+T F + FT FGC N
Sbjct: 195 TPVCAAAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFT---GFTFGCGEKNI 251
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDL 272
G G++ G+LG LSL SQ G+FSYCL + G +
Sbjct: 252 GDF--GEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGATKPTSTVPV 309
Query: 273 ETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+ T ++ P FY + L+ I+IG +I+ PP F + GT ++D+GT +T++
Sbjct: 310 QYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF--TKTGT---LLDSGTILTYLPPP 364
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPE 390
Y +L R+ ++ + PY + D CY + P+++F+ +
Sbjct: 365 AYTSLRDRFKFTMQG-NKPAPPY---EPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDF 420
Query: 391 NMYFIEPDRGR---FCVAIQDDPK---YSILGAWQQQNMLIIYDLNVPALRFG 437
I PD + C+A P +SI+G QQ+ +IYD VP+ + G
Sbjct: 421 YGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYD--VPSQKIG 471
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 150/352 (42%), Gaps = 22/352 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC + C++Q +FDP +S+TY+ + C P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 238
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 239 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNDG 295
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q + G+F++CL T + FG +
Sbjct: 296 LF--GEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPA---TT 350
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F G I+D+GT +T + Y
Sbjct: 351 TTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAY 405
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ A D CY + S A P+++ Q + +
Sbjct: 406 SSLRSAFAAAMAARGYRKAA--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 463
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+A +D I+G Q + + YD+ + F C
Sbjct: 464 IMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 166/386 (43%), Gaps = 42/386 (10%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRAST 145
L +A Q L Y V + IGTP + +LFDT S L W QC PC C+ Q P+FDP S+
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSS 172
Query: 146 TYSEIPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF----PVRNGF 197
TY ++PC P C +C C Y+ +Y T G + ETF P+
Sbjct: 173 TYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAA 232
Query: 198 TFVPRLAFGCSNDNSGF--AFGGKISGILGFNASPLSLSSQLRNRIQ---GLFSYCLVRE 252
T V FGCS++ G ++G+LG S+ SQ R I G+FSYCL
Sbjct: 233 TGV---VFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPR 289
Query: 253 MEATSVIKFGRDADVRRR---DLETTPIL--LSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
+T + G A ++ +L TP++ +S LR + ++L +S+ V P AF
Sbjct: 290 GSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAF 349
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD 367
+ G +ID+GT VT + Y L + + S + +P + + D CY
Sbjct: 350 SL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSY--KMLPEGSMKLLDTCYDVT 401
Query: 368 S-SFKAYPSMTFHLQEADYI-VQPENMYFIEPDRGR-------FCVAI--QDDPKYSILG 416
P + I V + + P C+A + I+G
Sbjct: 402 GQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVG 461
Query: 417 AWQQQNMLIIYDLNVPALRFGSENCA 442
QQ+ +++D++ + FG C+
Sbjct: 462 NMQQRAYNVVFDVDGGRIGFGPNGCS 487
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/416 (26%), Positives = 187/416 (44%), Gaps = 48/416 (11%)
Query: 53 QSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK----QDLFYSVEVNIGTPMK 108
Q + I + + A +S N+ ++ +I +P+A + L Y V + +G +
Sbjct: 85 QKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSSEIQIPLASGINLETLNYIVTIGLGN--Q 142
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS-PFKCQN-- 165
++ DT S L W QC PC+ C+ Q P+F+P S++Y+ + C+ C++ F N
Sbjct: 143 NMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTE 202
Query: 166 -------GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
C +T Y G T G E +F G V FGC +N G FGG
Sbjct: 203 ACESNNPSSCNHTVSYGDGSFTDGELGVEHLSF----GGISVSNFVFGCGRNNKGL-FGG 257
Query: 219 KISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFGRDADVRRRDLETTPI 277
+SGI+G S LS+ SQ G+FSYCL + A+ + G ++ + + TPI
Sbjct: 258 -VSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGASGSLVIGNESSLFKN---LTPI 313
Query: 278 LLSDLRPH------FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+ + + + L+L I +G V +F G GG +ID+GT +T +
Sbjct: 314 AYTSMVSNPQLSNFYVLNLTGIDVGG--VAIQDTSF-----GNGGILIDSGTVITRLAPS 366
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQ-EADYIVQP 389
Y L + + + G P A D C+ + + P+++ H + D V
Sbjct: 367 LYNALKAEF--LKQFSGYPIAP--ALSILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDA 422
Query: 390 ENMYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ ++ D + C+A + D+ +I+G +QQ+N +IYD + F E+C+
Sbjct: 423 VGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 165/394 (41%), Gaps = 57/394 (14%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP-----------IFDPRAST 145
Y V +GTP +P L+ DT S L W +C+P T F P S
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 146 TYSEIPCDDPLCRS--PFKCQ-----NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
T++ IPC C PF C Y RY G RG E+ + + +
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSS 214
Query: 199 FVPR---------LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL 249
L GC+ +G +F G+L S +S +S +R G FSYCL
Sbjct: 215 SSKNKVKKAKLQGLVLGCTGSYTGPSFEAS-DGVLSLGYSNVSFASHAASRFGGRFSYCL 273
Query: 250 VREME---ATSVIKFGRDADVR-------RRDLETTPILL-SDLRPHFYLHLLEISIGRH 298
V + ATS + FG ++ + TP++L S +RP + + + IS+
Sbjct: 274 VDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDGE 333
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ--RIPYNA 356
+++ P +++ DG GG I+D+GT +T + Y+ ++ +LG++ R P A
Sbjct: 334 LLKIPRDVWEV--DGGGGVIVDSGTSLTVLAKPAYRA-------VVAALGKKLARFPRVA 384
Query: 357 SQEFDYCYRYDSSFKA-----YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD-- 409
F+YCY + S + P + H + + P Y I+ G C+ +Q+
Sbjct: 385 MDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPW 444
Query: 410 PKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
P S++G QQ L +DL LRF C +
Sbjct: 445 PGISVIGNILQQEHLWEFDLKNRRLRFKRSRCTH 478
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 153/374 (40%), Gaps = 34/374 (9%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y ++ +GTP+ P ++ DT S +VW QC PC RC+DQ+ +FDPRAS +Y
Sbjct: 140 LAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGA 199
Query: 150 IPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
+ C PLCR + C+Y Y G VT G + ET F VPR+A
Sbjct: 200 VDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTF---ASGARVPRVAL 256
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-------REMEATSV 258
GC +DN G F + S LS SQ+ R FSYCLV +S
Sbjct: 257 GCGHDNEGL-FVAAAGLLGLGRGS-LSFPSQISRRFGRSFSYCLVDRTSSSASATSRSST 314
Query: 259 IKFGRDA--DVRRRDLETTPILLSD----LRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
+ FG A + RR L D LR + PP
Sbjct: 315 VTFGSGARGALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPP---PDPST 371
Query: 313 GTGGFIIDTGTPV-TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK 371
G GG I+D+G P + R G R L R+ FD CY S K
Sbjct: 372 GRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGL---RLSPGGFSLFDTCYDL-SGLK 427
Query: 372 A--YPSMTFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIY 427
P+++ H P Y I D RG FC A D SI+G QQQ +++
Sbjct: 428 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 487
Query: 428 DLNVPALRFGSENC 441
D + L F + C
Sbjct: 488 DGDGQRLGFVPKGC 501
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/426 (26%), Positives = 179/426 (42%), Gaps = 50/426 (11%)
Query: 34 LKLIPIFSPESPLY-PGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
L++ + SP SP P +S + K KAR Y++S++K +P+A
Sbjct: 34 LRVFHVNSPCSPFKQPNTVSWESTLLK----DKARLQYLSSLAK--------KPSVPIAS 81
Query: 93 -----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
Q Y V NIGTP +P + DT++ W C C+ C +FDP S++
Sbjct: 82 GRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV--LFDPSKSSSS 139
Query: 148 SEIPCDDPLCRSPFK--CQNGK-CVYTRRYHVGDVTRGLASRE-TFAFPVRNGFTFVPRL 203
+ CD P C+ C GK C + Y + L T A V +T
Sbjct: 140 RNLQCDAPQCKQAPNPTCTAGKSCGFNMTYGGSTIEASLTQDTLTLANDVIKSYT----- 194
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA--TSVIKF 261
FGC + +G + + G++G PLSL SQ +N FSYCL + + ++
Sbjct: 195 -FGCISKATGTSLPAQ--GLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLRL 251
Query: 262 GRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
G R ++TTP+L + R +Y++L+ I +G IV P A G I D
Sbjct: 252 GPKYQPVR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFD 309
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
+GT T + Y + + + +++ + FD CY S YPS+TF
Sbjct: 310 SGTVFTRLVEPAYVAVRNEFRRRIKNANATSL-----GGFDTCY---SGSVVYPSVTFMF 361
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALR 435
+ + P+N+ C+A+ P +++ + QQQN ++ DL L
Sbjct: 362 AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLG 421
Query: 436 FGSENC 441
E C
Sbjct: 422 ISRETC 427
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 150/352 (42%), Gaps = 22/352 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC + C++Q +FDP +S+TY+ + C P
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 242
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 243 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNDG 299
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q + G+F++CL T + FG +
Sbjct: 300 LF--GEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPA---TT 354
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F G I+D+GT +T + Y
Sbjct: 355 TTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPAAY 409
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ A D CY + S A P+++ Q + +
Sbjct: 410 SSLRSAFAAAMAARGYRKAA--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 467
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+A +D I+G Q + + YD+ + F C
Sbjct: 468 IMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/261 (36%), Positives = 127/261 (48%), Gaps = 30/261 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V + +GTP +P L DT S LVWTQC PC CFDQ P+ DP AS+TY+ +PC P
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPCGAPR 145
Query: 157 CRS-PF-KCQNGKCVYTRRYHVGD--VTRGLASRETFAF---PVRNGFTFVP---RLAFG 206
CR+ PF C CVY YH GD VT G + + F F RNG +P RL FG
Sbjct: 146 CRALPFTSCGGRSCVYV--YHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTFG 203
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVIKFGRD- 264
C + N G F +GI GF SL SQL FSYC ++ +S++ G
Sbjct: 204 CGHFNKGV-FQSNETGIAGFGRGRWSLPSQLNATS---FSYCFTSMFDSKSSIVTLGGAP 259
Query: 265 ----ADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
+ ++ TTP+ + +P Y L L IS+G+ + P F II
Sbjct: 260 AALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST-------II 312
Query: 320 DTGTPVTFIRNGPYQTLMQRY 340
D+G +T + Y+ + +
Sbjct: 313 DSGASITTLPEEVYEAVKAEF 333
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 114/445 (25%), Positives = 186/445 (41%), Gaps = 66/445 (14%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQ-ELEDIHLPMAKQDLFYSVEVNIGTPMK 108
NL+ E + + + S+ R +A P + + ++ P+ Y V++ +GTP
Sbjct: 40 NLTDHELLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQH 99
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC---------RS 159
DTAS L+WTQCQPC++C+ Q P+F+P AST+Y+ +PC+ C R
Sbjct: 100 CFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159
Query: 160 PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG- 218
C YT Y TRG+ + + A G + FGCS+ + GG
Sbjct: 160 GDSDDEDACQYTYSYGGNATTRGILAVDRLAI----GDDVFRGVVFGCSSS----SVGGP 211
Query: 219 --KISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME-ATSVIKFGRDADVRRRDLE-- 273
++SG++G LSL SQL R F YCL + + + G DA R+
Sbjct: 212 PPQVSGVVGLGRGALSLVSQLSVR---RFMYCLPPPVSRSAGRLVLGADAAATVRNASER 268
Query: 274 -TTPILLSDLRP-HFYLHLLEISIGRHIVRF-PPGAFDIMRDGTG--------------- 315
P+ P ++YL+L ISIG + F + GT
Sbjct: 269 VVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGD 328
Query: 316 ---------GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE--FDYCY 364
G IID + +TF+ Y+ ++ ++ + R+P + + D C+
Sbjct: 329 GSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEI------RLPRGSGSDLGLDLCF 382
Query: 365 RYDSSF---KAY-PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQ 420
+ Y P ++ + + E M+ + G C+ + SILG +QQ
Sbjct: 383 ILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCLMVGKTDGVSILGNYQQ 442
Query: 421 QNMLIIYDLNVPALRFGSENCANGR 445
QNM ++Y+L + F C + R
Sbjct: 443 QNMQVMYNLRRGRITFIKTACESVR 467
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 108/417 (25%), Positives = 175/417 (41%), Gaps = 31/417 (7%)
Query: 42 PESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF----Y 97
P+SP P LS AR +AS + +P+A Y
Sbjct: 49 PQSPCSPAPLSSDLPFSAFITHDAARIAGLASRLATKDKDWVAASSVPLASGASVGVGNY 108
Query: 98 SVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDPL 156
+ +GTP ++ D+ SSL W QC PC + C Q P++DPRAS+TY+ +PC P
Sbjct: 109 ITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQ 168
Query: 157 CR-------SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C +P C +G C Y Y G + G S++T + F P +GC
Sbjct: 169 CAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSF---PGFYYGCG 225
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS-VIKFGRDADV 267
DN G G+ +G++G + LSL SQL + F+YCL A++ + FG ++D
Sbjct: 226 QDNVGLF--GRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGSNSDN 283
Query: 268 RR-RDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ T ++ S L Y +S+ V P A G+ IID+GT +T
Sbjct: 284 KNPGKYSYTSMVSSSLDASLYF----VSLAGMSVAGSPLAVPSSEYGSLPTIIDSGTVIT 339
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ Y L + L + Y+ Q C++ + P++ +
Sbjct: 340 RLPTPVYTALSKAVGAALAA--PSAPAYSILQT---CFKGQVAKLPVPAVNMAFAGGATL 394
Query: 387 -VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ P N+ ++ + C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 395 RLTPGNV-LVDVNETTTCLAFAPTDSTAIIGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 166/371 (44%), Gaps = 35/371 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V++ +GTP + ++ DT S L W QC PC+ CF+Q P+FDP S +Y + C DP
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPR 211
Query: 157 C------RSPFKCQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVR--NGFTFVPRLAF 205
C +P C+ + C Y Y T G + E F + V + F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRD 264
GC + N G +G+LG LS +SQLR FSYCLV + S I FG D
Sbjct: 272 GCGHSNRGLFH--GAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFGDD 329
Query: 265 ADV----RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
+ R P + +Y+ L + +G + P +D+ +DG+GG IID
Sbjct: 330 DALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIID 389
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY---CYRYDSSFKAYPSMT 377
+GT +++ Y+ + + + + R Y +F CY S +
Sbjct: 390 SGTTLSYFAEPAYEVIRRAFVE------RMDKAYPLVADFPVLSPCYNV-SGVERVEVPE 442
Query: 378 FHLQEADYIVQ--PENMYFI--EPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNV 431
F L AD V P YF+ +PD G C+A+ P+ SI+G +QQQN ++YDL
Sbjct: 443 FSLLFADGAVWDFPAENYFVRLDPD-GIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQN 501
Query: 432 PALRFGSENCA 442
L F CA
Sbjct: 502 NRLGFAPRRCA 512
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 150/352 (42%), Gaps = 22/352 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP ++FDT S W QCQPC + C++Q +FDP +S+TY+ + C P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 239
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G + G + +T + + V FGC N G
Sbjct: 240 ACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL---SSYDAVKGFRFGCGERNDG 296
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG SL Q + G+F++CL T + FG +
Sbjct: 297 LF--GEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPA---TT 351
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
TTP+L + +Y+ + I +G ++ P F G I+D+GT +T + Y
Sbjct: 352 TTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAY 406
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
+L + + + G ++ A D CY + S A P+++ Q + +
Sbjct: 407 SSLRSAFAAAMAARGYRKAA--AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 464
Query: 393 YFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+A +D I+G Q + + YD+ + F C
Sbjct: 465 IMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 144/347 (41%), Gaps = 39/347 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDD 154
Y V V++GTP Q + DT S + W QC+PC C Q +FDP S+TYS +PC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 155 PLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C C +C Y Y G T G+ +T A N V FGC +
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGN---TVGTFLFGCGHA 259
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR 270
+G G I G+L +SL SQ G+FSYCL + A + G +
Sbjct: 260 QAGMFAG--IDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSA--S 315
Query: 271 DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
TT +L + P FY+ +L IS+G V P AF GG ++DTGT +T +
Sbjct: 316 GFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF------AGGTVVDTGTVITRLP 369
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFH-----LQ 381
Y L + + G P N D CY RY ++TF
Sbjct: 370 PTAYAALRSAFRGAIAPCGYPSAPANG--ILDTCYDFSRYGVVTLPTVALTFSGGATLAL 427
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYD 428
EA I+ + F P+ G D +ILG QQ++ + +D
Sbjct: 428 EAPGILSSGCLAF-APNGG--------DGDAAILGNVQQRSFAVRFD 465
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 147/357 (41%), Gaps = 35/357 (9%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L Y + V +G+P K Q +L D+ S + W QC+PC++C Q P+FDP S+TYS C
Sbjct: 129 LEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSS 188
Query: 155 PLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C + NG +C Y RY G T G S +T A G + FGCS+
Sbjct: 189 AACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL----GSNTISNFQFGCSH 244
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
SG F G++G SL+SQ FSYCL ++ + G
Sbjct: 245 VESG--FNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPSSSGFLTLGAGTS--- 299
Query: 270 RDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
TP+L S P FY + L I +G + P F + G ++D+GT +T +
Sbjct: 300 -GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGMVMDSGTIITRL 352
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIV 387
Y L + + +Q P D C+ + S PS+ +
Sbjct: 353 PRTAYSALSSAFKAGM----KQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVN 408
Query: 388 QPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
N + C+A DD I+G QQ+ ++YD+ A+ F + C
Sbjct: 409 LDANGIILG-----NCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 127/429 (29%), Positives = 182/429 (42%), Gaps = 60/429 (13%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
N S ER+ + E + R M S P + E + Y E IG P +
Sbjct: 36 NCSTEERMRRATERTHRRLASMGEASAPVHWAESQ------------YIAEYLIGDPPQQ 83
Query: 110 QHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKC-- 163
+ DT S+L+WTQC C CF Q +DP S T + C+D C S +C
Sbjct: 84 AEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDTACALGSETRCAR 143
Query: 164 QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC---SNDNSGFAFGGKI 220
N C Y G V G+ E F F ++ LAFGC + G G
Sbjct: 144 DNKACAVLTAYGAG-VIGGVLGTEAFTFQPQSENV---SLAFGCIAATRLTPGSLDGA-- 197
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADVRRRDLETT-- 275
SGI+G LSL SQL + FSYCL + TS + G A + T
Sbjct: 198 SGIIGLGRGNLSLVSQLGDN---KFSYCLTPYFSQSTNTSRLFVGASAGLSSGGAPATSV 254
Query: 276 PILLS-DLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGTG---GFIIDTGTPVTFI 328
P L + D+ P +YL L I++G + P AFD+ + TG G +ID+G+P T +
Sbjct: 255 PFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTLIDSGSPFTSL 314
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYC--YRYDSSFKAYPSMTFHLQE--A 383
+ YQ L D++++ LG +P A E D C + K P + H
Sbjct: 315 VDVAYQAL---RDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLVLHFGSGGG 371
Query: 384 DYIVQPENMYFIEPDRGRFCVAI--QDDP-------KYSILGAWQQQNMLIIYDLNVPAL 434
D V PEN Y+ D C+ + P + +I+G + QQ+M ++YDL L
Sbjct: 372 DVAVPPEN-YWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLYDLEKGML 430
Query: 435 RFGSENCAN 443
F +C++
Sbjct: 431 SFQPADCSS 439
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 123/428 (28%), Positives = 168/428 (39%), Gaps = 84/428 (19%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHL-PMAKQDLF----YSVEVNIG 104
L+ E + +M + SKARA ++ S + + P A D F Y V + G
Sbjct: 36 GLTHWELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAG 95
Query: 105 TPMKPQHLLFDTASSLVWTQCQ--PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK 162
TP + L DT S + WTQC+ P CF+QT P+FDP AS++++ +PC P C +
Sbjct: 96 TPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACETTPP 155
Query: 163 CQNGK------CVYTRRYHVGDVTRGLASRETFAFPVRNG---FTFVPRLAFGCSNDNSG 213
C G C Y+ Y G V+RG RE F F G VP L FGC + N G
Sbjct: 156 CGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRG 215
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADV----- 267
F +GI GF LSL SQL+ G FS+C TS + G
Sbjct: 216 V-FTSNETGIAGFGRGSLSLPSQLK---VGNFSHCFTTITGSKTSAVLLGLPGVAPPSAS 271
Query: 268 ---RRRD---LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
RRR +TP + G I PP + +R+ F
Sbjct: 272 PLGRRRGSYRCRSTP--------------RSSNSGTSITSLPPRTYRAVRE---EFAAQV 314
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD-YCYRYDSSFKAYPSMTFHL 380
PV +P NA+ F + P+M H
Sbjct: 315 KLPV--------------------------VPGNATDPFTCFSAPLRGPKPDVPTMALHF 348
Query: 381 QEADYIVQPENMYFI---EPDRGR----FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPA 433
+ A + EN F + D G C+A+ + + ILG QQQNM ++YDL
Sbjct: 349 EGATMRLPQENYVFEVVDDDDAGNSSRIICLAVIEGGEI-ILGNIQQQNMHVLYDLQNSK 407
Query: 434 LRFGSENC 441
L F C
Sbjct: 408 LSFVPAQC 415
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 107/409 (26%), Positives = 179/409 (43%), Gaps = 31/409 (7%)
Query: 35 KLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYM-ASMSKPNAFQELEDIHLPMAKQ 93
KLI S P Y N + +R+ E S AR Y+ A + + + +
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLT 97
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
V ++IG P PQ ++ DT S ++W C PC C + +FDP S+T+S
Sbjct: 98 GRTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFS----- 152
Query: 154 DPLCRSPFKCQNGKCV---YTRRYHVGDVTRGLASRETFAFPVRN-GFTFVPRLAFGCSN 209
PLC++P + KC +T Y G R+ F + G + + + GC +
Sbjct: 153 -PLCKTPCGFKGCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGH 211
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME---ATSVIKFGRDAD 266
N GF +GILG N P SL++Q+ + FSYC+ + + ++ G AD
Sbjct: 212 -NIGFNSDPGYNGILGLNNGPNSLATQIGRK----FSYCIGNLADPYYNYNQLRLGEGAD 266
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ +TP + +Y+ + IS+G + F++ R+GTGG I+D+GT +T
Sbjct: 267 LEGY---STPFEV--YHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTIT 321
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
++ + ++ L +L+ RQ I NA + Y +P +TFH + +
Sbjct: 322 YLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADL 381
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKY------SILGAWQQQNMLIIYDL 429
+F + D FC+ + S++G QQ+ + YDL
Sbjct: 382 ALDTGSFFSQRDD-IFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDL 429
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 167/364 (45%), Gaps = 47/364 (12%)
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
D +S+ V I ++P+ L+ DT S L+WTQC+ +ST +
Sbjct: 40 DQGHSLTVGI---VQPRKLIVDTGSDLIWTQCKL--------------SSSTAAAARHGS 82
Query: 154 DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
PL R+ + G +TR G+ + ETF F R + RL FGC ++G
Sbjct: 83 PPLSRTA-PARTGA--FTRTCTASAAAVGVLASETFTFGARRAVSL--RLGFGCGALSAG 137
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRR--- 269
G +GILG + LSL +QL+ IQ FSYCL + TS + FG AD+ R
Sbjct: 138 SLIGA--TGILGLSPESLSLITQLK--IQ-RFSYCLTPFADKKTSPLLFGAMADLSRHKT 192
Query: 270 -RDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
R ++TT I+ + + ++Y+ L+ IS+G + P + + DG GG I+D+G+ V +
Sbjct: 193 TRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAY 252
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-------YPSMTFHL 380
+ ++ + + ++ R + ++++ C+ A P + H
Sbjct: 253 LVEAAFEAVKEAVMDVV----RLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHF 308
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+V P + YF EP G C+A+ D SI+G QQQNM +++D+ F
Sbjct: 309 DGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFA 368
Query: 438 SENC 441
C
Sbjct: 369 PTQC 372
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 143/347 (41%), Gaps = 39/347 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDD 154
Y V V++GTP Q + DT S + W QC+PC C Q +FDP S+TYS +PC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 155 PLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C C +C Y Y G T G+ +T A N V FGC +
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGN---TVGTFLFGCGHA 259
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR 270
+G G I G+L +SL SQ G+FSYCL + A + G
Sbjct: 260 QAGMFAG--IDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLG--GPTSAS 315
Query: 271 DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
TT +L + P FY+ +L IS+G V P AF GG ++DTGT +T +
Sbjct: 316 GFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF------AGGTVVDTGTVITRLP 369
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFH-----LQ 381
Y L + + G P N D CY RY ++TF
Sbjct: 370 PTAYAALRSAFRGAIAPYGYPSAPANG--ILDTCYDFSRYGVVTLPTVALTFSGGATLAL 427
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYD 428
EA I+ + F P+ G D +ILG QQ++ + +D
Sbjct: 428 EAPGILSSGCLAF-APNGG--------DGDAAILGNVQQRSFAVRFD 465
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 167/378 (44%), Gaps = 35/378 (9%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTT 146
L +A L Y V + IGTP + +LFDT S L W QC+PC C+ Q P+FDP S+T
Sbjct: 117 LGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSST 176
Query: 147 YSEIPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
Y ++PC P C+ C C Y+ +Y VTRG ++E AF +
Sbjct: 177 YVDVPCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQE--AFTLSPSAPPAAG 234
Query: 203 LAFGCSNDNSGFAFGGK----ISGILGFNASPLSLSSQLRNRIQG-LFSYCLVREMEATS 257
+ FGCS++ S G + ++G+LG S+ SQ R G +FSYCL +
Sbjct: 235 VVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGSSAG 294
Query: 258 VIKFGRDADVRRRDLETTPILL--SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
+ G A + +L TP++ S L + ++L+ IS+ + AF I
Sbjct: 295 YLTIGAAAP-PQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI------ 347
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR-QRIPYNASQEFDYCYRYDS-SFKAY 373
G +ID+GT +T + Y L D+ R +G +P + D CY
Sbjct: 348 GTVIDSGTVITHMPAAAYYVLR---DEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTA 404
Query: 374 PSMTFHLQEADYIVQPEN---MYFIEPDRGR----FCVAI--QDDPKYSILGAWQQQNML 424
P + I + + F G+ C+A + P + I+G QQ+
Sbjct: 405 PPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYN 464
Query: 425 IIYDLNVPALRFGSENCA 442
+++D+ + FG+ C+
Sbjct: 465 VVFDVEGRRIGFGANGCS 482
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 167/373 (44%), Gaps = 34/373 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP K L+ DT S L W QC PC CF+Q P +DP+ S+++ I C DP
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPR 254
Query: 157 CR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C+ P K + C Y Y T G + ETF + + V +
Sbjct: 255 CQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENV 314
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N G +G+LG PLS ++QL++ FSYCLV +SV +
Sbjct: 315 MFGCGHWNRGLFH--GAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNSSVSSKLI 372
Query: 261 FGRDADVRRR-DLETTPILLSDLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D ++ +L T + P +Y+ + I +G +++ P + + G GG
Sbjct: 373 FGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQGGGG 432
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR-QRIPYNASQEFDYCYRYDSSFK-AYP 374
IID+GT +T+ Y+ + + + + ++ + P CY K P
Sbjct: 433 TIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFP-----PLKPCYNVSGVEKMELP 487
Query: 375 SMTFHLQEADYIVQPENMYF--IEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLN 430
+ P YF IEP+ C+AI P+ SI+G +QQQN I+YDL
Sbjct: 488 EFAILFADGAMWDFPVENYFIQIEPED-VVCLAILGTPRSALSIIGNYQQQNFHILYDLK 546
Query: 431 VPALRFGSENCAN 443
L + CA+
Sbjct: 547 KSRLGYAPMKCAD 559
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 114/402 (28%), Positives = 166/402 (41%), Gaps = 46/402 (11%)
Query: 64 SKARANYMASMSKPNAFQELEDIHLPMAKQ------DLFYSVEVNIGTPMKPQHLLFDTA 117
S+AR NY+ S + +D + + + L Y V + GTP PQ LL DT
Sbjct: 86 SRARTNYIKSRASTGMASTPDDAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTG 145
Query: 118 SSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCDDPLC-------RSPFKCQNGKC 168
S + W QC PC C+ Q P+FDP S+TY+ I C C R+ +C
Sbjct: 146 SDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQC 205
Query: 169 VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNA 228
Y Y G TRG+ S ET F G T V FGC +D G + K G+LG
Sbjct: 206 GYRVEYGDGSSTRGVYSNETITF--APGIT-VKDFHFGCGHDQRGPS--DKFDGLLGLGG 260
Query: 229 SPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGR--DADVRRRDLETTPILLSDLRPHF 286
+P SL Q + G FSYCL + G A TP+ +
Sbjct: 261 APESLVVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAATNTSAFVFTPMWHLPMDATS 320
Query: 287 YL-HLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILR 345
Y+ ++ IS+G + P AF GG +ID+GT VT + Y L +
Sbjct: 321 YMVNMTGISVGGKPLDIPRSAF------RGGMLIDSGTIVTELPETAYNALNAALRKAFA 374
Query: 346 SLGRQRIPYNASQEFDYCYRYDS-SFKAYP--SMTFHLQEADYIVQPENMYFIEPDRGRF 402
+ P AS++FD CY + S P ++TF + P + +
Sbjct: 375 AY-----PMVASEDFDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILVKD------ 423
Query: 403 CVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A ++ D I+G Q+ + ++YD + F + C
Sbjct: 424 CLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 160/365 (43%), Gaps = 30/365 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y E+ +GTP K ++ DT S L W C+ R D +F S ++ + C
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRR-VFRADESKSFKTVGCLTQT 164
Query: 157 CR-------SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFG 206
C+ S C + C Y RY G +G+ ++ET + NG +P G
Sbjct: 165 CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIG 224
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IKFGR 263
CS+ +G +F G G+LG S S +S + FSYCLV + +V + FG
Sbjct: 225 CSSSFTGQSFQGA-DGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 283
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
+ TTP+ L+ + P + ++++ IS+G ++ P +D GG I+D+GT
Sbjct: 284 SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSG--GGTILDSGT 341
Query: 324 PVTFIRNGPYQ---TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPSMTF 378
+T + + Y+ T + RY L+ + + +P +YC+ + S F P +TF
Sbjct: 342 SLTLLADAAYKQVVTGLARYLVELKRVKPEGVP------IEYCFSFTSGFNVSKLPQLTF 395
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
HL+ Y ++ G C+ P +++G QQN L +DL L F
Sbjct: 396 HLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSF 455
Query: 437 GSENC 441
C
Sbjct: 456 APSAC 460
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 160/365 (43%), Gaps = 30/365 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y E+ +GTP K ++ DT S L W C+ R D +F S ++ + C
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRR-VFRADESKSFKTVGCLTQT 142
Query: 157 CR-------SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFG 206
C+ S C + C Y RY G +G+ ++ET + NG +P G
Sbjct: 143 CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIG 202
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IKFGR 263
CS+ +G +F G G+LG S S +S + FSYCLV + +V + FG
Sbjct: 203 CSSSFTGQSFQGA-DGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 261
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
+ TTP+ L+ + P + ++++ IS+G ++ P +D GG I+D+GT
Sbjct: 262 SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSG--GGTILDSGT 319
Query: 324 PVTFIRNGPYQ---TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPSMTF 378
+T + + Y+ T + RY L+ + + +P +YC+ + S F P +TF
Sbjct: 320 SLTLLADAAYKQVVTGLARYLVELKRVKPEGVP------IEYCFSFTSGFNVSKLPQLTF 373
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
HL+ Y ++ G C+ P +++G QQN L +DL L F
Sbjct: 374 HLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSF 433
Query: 437 GSENC 441
C
Sbjct: 434 APSAC 438
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 158/374 (42%), Gaps = 37/374 (9%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC-QPCIRCFDQTTPIFDPRAS 144
+ +P+ FY V + IGTP +P + D LVWTQC Q C RCF Q P+FD AS
Sbjct: 40 VTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNAS 99
Query: 145 TTYSEIPCDDPLCRS-PFKCQNGKCVYTRRYHVGDV---TRGLASRETFAFPVRNGFTFV 200
+T+ PC +C S P + G Y T G + A G
Sbjct: 100 STFRPEPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAI----GTAAT 155
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVI 259
RLAFGC+ + G SG +G + LSL++Q+ FSYCL + +S +
Sbjct: 156 ARLAFGCAVASEMDTMWGS-SGSVGLGRTNLSLAAQMNATA---FSYCLAPPDTGKSSAL 211
Query: 260 KFGRDADV--RRRDLETTPILLSDLRPH------FYLHLLEISIGRHIVRFPPGAFDIMR 311
G A + + TTP + + P+ + L L I G + P I
Sbjct: 212 FLGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQSGNTI-- 269
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK 371
+ T TPVT + + Y+ L + + ++G +P Q +D C+ S+
Sbjct: 270 ------TVSTATPVTALVDSVYRDLRK---AVADAVGAAPVPPPV-QNYDLCFPKASASG 319
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYD 428
P + Q + P + Y + CVAI P SILG+ QQ N+ +++D
Sbjct: 320 GAPDLVLAFQGGAEMTVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFD 379
Query: 429 LNVPALRFGSENCA 442
L+ L F +C+
Sbjct: 380 LDKETLSFEPADCS 393
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 119/441 (26%), Positives = 183/441 (41%), Gaps = 40/441 (9%)
Query: 22 THFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQ 81
T S E+ F LK++ P S L G+ ++++ I + + + + +SK +
Sbjct: 74 TQVPSIENKAF-LKVVHKHGPCSDLRQGHKAEAQYI--LLQDQSRVDSIHSKLSKDSGLS 130
Query: 82 ELEDIH---LPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFD 133
+++ LP + Y V V +GTP K L+FDT S L WTQC+PC++ C++
Sbjct: 131 DVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYN 190
Query: 134 QTTPIFDPRASTTYSEIPCDDPLCRS-------PFKCQNGKCVYTRRYHVGDVTRGLASR 186
Q IF+P ST+Y+ I C LC S F C + CVY +Y + G +
Sbjct: 191 QKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGK 250
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
E + + F FGC +N G +G+LG LSL SQ R +FS
Sbjct: 251 EKLSLTATDVFN---DFYFGCGQNNKGLFG--GAAGLLGLGRDKLSLVSQTAQRYNKIFS 305
Query: 247 YCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPG 305
YCL +T + FG + TP+ FY L L IS+G + P
Sbjct: 306 YCLPSSSSSTGFLTFGGST---SKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPS 362
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
F T G IID+GT +T + Y L + R L Q A D C+
Sbjct: 363 VFS-----TAGTIIDSGTVITRLPPAAYSALSSTF----RKLMSQYPAAPALSILDTCFD 413
Query: 366 Y-DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQ 421
+ + + P + + + F D + C+A D +I G QQ+
Sbjct: 414 FSNHDTISVPKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQK 473
Query: 422 NMLIIYDLNVPALRFGSENCA 442
+ ++YD + F C+
Sbjct: 474 TLEVVYDGAAGRVGFAPAGCS 494
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 114/406 (28%), Positives = 176/406 (43%), Gaps = 32/406 (7%)
Query: 53 QSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHL 112
QS ++ + E S R Y+ + + + L ++P+ Q V ++IG+P Q L
Sbjct: 44 QSPQVSHIKEASVERLEYLKAKATGDIIAHLSP-NVPIIPQAFL--VNISIGSPPVTQLL 100
Query: 113 LFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP---FKCQNGKCV 169
DTAS L+W QC+PCI C+ Q+ PIFDP S T+ C P F + C
Sbjct: 101 HMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRSCE 160
Query: 170 YTRRYHVGDVTRGLASRETFAFPV---RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGF 226
Y+ RY G ++G+ ++E F + + + FGC +DN G G +GILG
Sbjct: 161 YSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYGEPLVG--TGILGL 218
Query: 227 NASPLSLSSQLRNRIQGLFSYCLVREMEAT---SVIKFGRDADVRRRDLETTPILLSDLR 283
SL + + FSYC + + +V+ G D D TTP+ + +
Sbjct: 219 GYGEFSLVHRFGTK----FSYCFGSLDDPSYPHNVLVLGDDGANILGD--TTPLEIYN-- 270
Query: 284 PHFYLHLLEISIGRHIVRFPPGAFD-IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
+Y+ + IS+ I+ P F+ + G GG IIDTG +T + Y+ L + +
Sbjct: 271 GFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIED 330
Query: 343 ILRSLGR-QRIPYNASQEFDY-CY----RYDSSFKAYPSMTFHLQEADYIVQPENMYFIE 396
GR N F CY D +P +TFH + + F++
Sbjct: 331 YFE--GRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMK 388
Query: 397 PDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
FC+A+ SI GA QQ+ I YDL + F +C
Sbjct: 389 LSPNVFCLAVTPGNMNSI-GATAQQSYNIGYDLEAKKISFERIDCG 433
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 180/397 (45%), Gaps = 33/397 (8%)
Query: 57 IHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK-QDLFYSVEVNIGTPMKPQHLLFD 115
+H M+ S ARA+ A +++ A + D+ +P+A+ D Y+V + IGTP + L+ D
Sbjct: 53 VHDMWRRS-ARAS-KARVARLEA-RLTGDMSVPLARISDEGYTVTIGIGTPPQLHTLIAD 109
Query: 116 TASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPFKCQNGKCVYT 171
TAS L WTQC Q P+FDP S++++ + C LC +C N C Y
Sbjct: 110 TASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCTEDNPGTKRCSNKTCRYV 169
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
Y V G+ + E+F N + FGC G G SGILG + + L
Sbjct: 170 YPY-VSVEAAGVLAYESFTLSDNNQHICM-SFGFGCGALTDGNLLGA--SGILGMSPAIL 225
Query: 232 SLSSQLRNRIQGLFSYCLVREME-ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHL 290
S+ SQL FSYCL + +S + FG AD+ R +TT + L ++Y+ L
Sbjct: 226 SMVSQLAIPK---FSYCLTPYTDRKSSPLFFGAWADLGR--YKTTGPIQKSLTFYYYVPL 280
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ 350
+ +S+G + P F + + GG ++D G V + + L + +L +L
Sbjct: 281 VGLSLGTRRLDVPAATFALKQ---GGTVVDLGCTVGQLAEPAFTALKE---AVLHTL--- 331
Query: 351 RIPYNASQEFDY--CYRYDSSFKA----YPSMTFHLQEADYIVQPENMYFIEPDRGRFCV 404
+P DY C+ S P + + +V P + YF EP G C+
Sbjct: 332 NLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYFQEPTAGLMCL 391
Query: 405 AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
A+ SI+G QQQN +++D++ F C
Sbjct: 392 ALVPGGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 123/291 (42%), Gaps = 25/291 (8%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPC 152
L Y V V++GTP Q L DT S L W QC PC C+ Q P+FDP S++Y+ +PC
Sbjct: 138 LNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPC 197
Query: 153 DDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
P+C C +C Y Y G T G+ S +T + V FGC
Sbjct: 198 GGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDA---VRGFFFGCG 254
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVR 268
+ SGF G+LG SL Q G+FSYCL T + G +
Sbjct: 255 HAQSGFT---GNDGLLGLGREEASLVEQTAGTYGGVFSYCLPTRPSTTGYLTLGGPSGAA 311
Query: 269 RRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
TT +L S +Y+ +L IS+G + P F GG ++DTGT +T
Sbjct: 312 PPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVF------AGGTVVDTGTVITR 365
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTF 378
+ Y L + + S G P A+ D CY F Y ++T
Sbjct: 366 LPPTAYAALRSAFRSGMASYGYPSAP--ATGILDTCYN----FSGYGTVTL 410
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 117/439 (26%), Positives = 179/439 (40%), Gaps = 50/439 (11%)
Query: 23 HFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQE 82
H T SLK++ P S L N + + + E + A +S + +E
Sbjct: 56 HSTKVAQNKASLKVVHKHGPCSQLNQQNGNAPNLVEILLEDQSRVDSIHAKLSDHSGVKE 115
Query: 83 LEDIHLP----MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI 138
+ LP M+ Y V + +G+P K L+FDT S L W +C
Sbjct: 116 TDAAKLPTKSGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARC--------SAAET 167
Query: 139 FDPRASTTYSEIPCDDPLCRS-------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAF 191
FDP ST+Y+ + C PLC S P +C CVY +Y G + G +E
Sbjct: 168 FDPTKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTI 227
Query: 192 PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR 251
+ F FGC D G GK +G+LG LS+ SQ + LFSYCL
Sbjct: 228 GSTDIFN---NFYFGCGQDVDGLF--GKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPS 282
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIM 310
+T + FG + + + TP LS FY L L I++G + P F
Sbjct: 283 S-SSTGFLSFGSS---QSKSAKFTP--LSSGPSSFYNLDLTGITVGGQKLAIPLSVFS-- 334
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS--LGRQRIPYNASQEFDYCYRYDS 368
T G IID+GT VT + Y L + + + S +G+ P + D CY + S
Sbjct: 335 ---TAGTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGK---PLSI---LDTCYDF-S 384
Query: 369 SFKA--YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP---KYSILGAWQQQNM 423
+K P + + + F+ + C+A + +I G QQ+N
Sbjct: 385 KYKTIKVPKIVISFSGGVDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNF 444
Query: 424 LIIYDLNVPALRFGSENCA 442
++YD++ + F +C+
Sbjct: 445 EVVYDVSGGKVGFAPASCS 463
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 112/375 (29%), Positives = 169/375 (45%), Gaps = 36/375 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP K L+ DT S L W QC PC CF Q +DP+ S ++ I C+DP
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219
Query: 157 CR------SPFKCQ--NGKCVYTRRYHVGDVTRGLASRETFAFPV---RNGFT--FVPRL 203
C P +C+ N C Y Y T G + ETF + G + V +
Sbjct: 220 CSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNM 279
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV---IK 260
FGC + N G SG+LG PLS SSQL++ FSYCLV T+V +
Sbjct: 280 MFGCGHWNRGLFS--GASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLI 337
Query: 261 FGRDAD-VRRRDLETTPIL---LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
FG D D + +L T + + + +Y+ + I +G + P ++I DG GG
Sbjct: 338 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGG 397
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRY---DSSF 370
IID+GT +++ Y+ + ++ + ++ Y ++F D C+ + +
Sbjct: 398 TIIDSGTTLSYFAEPAYEIIKNKFAEKMKE------NYPIFRDFPVLDPCFNVSGIEENN 451
Query: 371 KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYD 428
P + + P FI C+AI PK +SI+G +QQQN I+YD
Sbjct: 452 IHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYD 511
Query: 429 LNVPALRFGSENCAN 443
L F CA+
Sbjct: 512 TKRSRLGFTPTKCAD 526
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 155/364 (42%), Gaps = 37/364 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC--- 152
Y V++ +GTP K ++ DT SSL W QCQPC + C Q P++DP S TY ++ C
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASV 184
Query: 153 ----------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+DPLC + + C+YT Y + G S++ +P+
Sbjct: 185 ECSRLKAATLNDPLCET----DSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ---TLPQ 237
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
+GC DN G G+ +GI+G LS+ +QL + FSYCL +S F
Sbjct: 238 FTYGCGQDNQGLF--GRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFL 295
Query: 263 RDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ + TP+L P Y L L I++ + + + +ID+
Sbjct: 296 SIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------LIDS 349
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SSFKAYPSMTFHL 380
GT +T + Y L Q + +I+ + + Y+ D C++ S A P +
Sbjct: 350 GTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSI---LDTCFKGSLKSISAVPEIKMIF 406
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDP---KYSILGAWQQQNMLIIYDLNVPALRFG 437
Q + IE D+G C+A + +I+G QQQ I YD++ + F
Sbjct: 407 QGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFA 466
Query: 438 SENC 441
+C
Sbjct: 467 PGSC 470
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 161/391 (41%), Gaps = 54/391 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---------------CFDQTTPIFDP 141
Y V +GTP +P L+ DT S L W +C+ +F P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169
Query: 142 RASTTYSEIPCDDPLCRS--PFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVR 194
S T+S IPC C+S PF N C Y RY+ RG+ ++ +
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVALS 229
Query: 195 NGFTF---------VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLF 245
G + + GC+ ++G F G+L S +S +S+ +R G F
Sbjct: 230 GGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEAS-DGVLSLGYSNISFASRAASRFGGRF 288
Query: 246 SYCLVREM---EATSVIKFGRDADVRRRDLET----TPILL-SDLRPHFYLHLLEISIGR 297
SYCLV + ATS + FG D TP+LL + +RP + + + +S+
Sbjct: 289 SYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDG 348
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ P +D+ + GG IID+GT +T + Y+ ++ + L L P A
Sbjct: 349 VALDIPAEVWDVGSN--GGTIIDSGTSLTVLATPAYKAVVAALSEQLAGL-----PRVAM 401
Query: 358 QEFDYCYRYDSSFK-----AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--P 410
FDYCY + + A P + + + P Y I+ G C+ +Q+ P
Sbjct: 402 DPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWP 461
Query: 411 KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
S++G QQ L +DLN LRF +C
Sbjct: 462 GVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 110/353 (31%), Positives = 169/353 (47%), Gaps = 30/353 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
+ V V G+P + +FDT S L W QCQPC C+ Q P+FDP S++Y+ +PC
Sbjct: 112 FVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCGTT 171
Query: 156 LCRSP-FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C + +C CVY Y G T G+ +RET F + FT FGC N G
Sbjct: 172 ECAAAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSEFT---GFIFGCGETNLGD 228
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
G++ G+LG LSLSSQ G+FSYCL + G + ++
Sbjct: 229 F--GEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPGYLSIGATPVTGQIPVQY 286
Query: 275 TPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
T ++ P FY + L+ I+IG +++ PP F + GT ++D+GT +T++ Y
Sbjct: 287 TAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEF--TKTGT---LLDSGTILTYLPPPAY 341
Query: 334 QTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENM 392
L R+ ++ + PY+ E D CY + S P ++F+ +D V N
Sbjct: 342 TALRDRFKFTMQG-SKPAPPYD---ELDTCYDFTGQSGILIPGVSFNF--SDGAVFNLNF 395
Query: 393 YFIE--PDRGR---FCVAIQDDPK---YSILGAWQQQNMLIIYDLNVPALRFG 437
+ I PD + C+A P +S++G+ Q++ +IYD VPA + G
Sbjct: 396 FGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYD--VPAQKIG 446
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/418 (24%), Positives = 182/418 (43%), Gaps = 37/418 (8%)
Query: 46 LYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGT 105
++P + S E I + AR +++S + A + + + Y V +G+
Sbjct: 29 VHPPSSSPLESIIALAREDDARLLFLSSKA---ASTGVSSAPVASGQSPPSYVVRAGLGS 85
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR----SPF 161
P +P L DT++ W C PC C + +F P ST+Y+ +PC +C P
Sbjct: 86 PAQPILLALDTSADATWAHCSPCGTCPSSGS-LFAPANSTSYAPLPCSSTMCTVLQGQPC 144
Query: 162 KCQNG--------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
Q+ C +T+ + LAS + G +P AFGC + SG
Sbjct: 145 PAQDPYDSSAPLPMCAFTKPFADASFQASLASDW-----LHLGKDAIPNYAFGCVSAVSG 199
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRD 271
G+LG P++L SQ+ N G+FSYCL + + ++ G A + R
Sbjct: 200 PTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLG--AAGQPRG 257
Query: 272 LETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+ TP+L + R +Y+++ +S+GR V+ P G+F G ++D+GT +T
Sbjct: 258 VRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTP 317
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEA-DYIVQ 388
Y L + + + + + Y + FD C+ D + P++T H+ D +
Sbjct: 318 PVYAALREEFRRHVAAPSG----YTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLDLALP 373
Query: 389 PENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
EN C+A+ + P+ ++L QQQN+ +++D+ + F E+C
Sbjct: 374 MENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/394 (26%), Positives = 169/394 (42%), Gaps = 45/394 (11%)
Query: 83 LEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----F 132
L I LP+ Y ++ +GTP + H+ DT S ++W C CIRC
Sbjct: 66 LSAIDLPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL 125
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLC---RSPFKCQNGK-CVYTRRYHVGDVTRGLASRET 188
+ TP +D AS+T + C D C +C +G C Y Y G T G R+
Sbjct: 126 VELTP-YDADASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDV 184
Query: 189 FAFPVRNG----FTFVPRLAFGCSNDNSGFAFG---GKISGILGFNASPLSLSSQL--RN 239
+ G + + FGC + SG G + GI+GF S S SQL +
Sbjct: 185 VHLDLVTGNRQTGSTNGTIIFGCGSKQSG-QLGESQAAVDGIMGFGQSNSSFISQLASQG 243
Query: 240 RIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHI 299
+++ F++CL + G +V ++TTP+L H+ ++L I +G +
Sbjct: 244 KVKRSFAHCL-DNNNGGGIFAIG---EVVSPKVKTTPML--SKSAHYSVNLNAIEVGNSV 297
Query: 300 VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
++ AFD D G IID+GT + ++ + Y LM +QIL S Q + + Q+
Sbjct: 298 LQLSSDAFDSGDD--KGVIIDSGTTLVYLPDAVYNPLM---NQILAS--HQELNLHTVQD 350
Query: 360 FDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD-------DPKY 412
C+ Y +P++TF ++ + Y + +C Q+
Sbjct: 351 SFTCFHYIDRLDRFPTVTFQFDKSVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASL 410
Query: 413 SILGAWQQQNMLIIYDLNVPALRFGSENCANGRQ 446
+ILG N L++YD+ + + + NC+ G Q
Sbjct: 411 TILGDMALSNKLVVYDIENQVIGWTNHNCSGGIQ 444
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/434 (25%), Positives = 183/434 (42%), Gaps = 40/434 (9%)
Query: 26 SSESTGFSLKLIPIFSPESPLYPGNL-SQSERIHKMFEISKARANYMASM-SKPNAFQEL 83
SSES G L +I ++ SP S + M AR Y++S+ + P A
Sbjct: 27 SSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINMASKDPARVTYLSSLVASPKA---- 82
Query: 84 EDIHLPMAKQDLF---YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
+ + +Q L Y V V +GTP + ++ DT+ W C C C ++P F
Sbjct: 83 TSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGC---SSPTFS 139
Query: 141 PRASTTYSEIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRN 195
P S+TY+ + C P C C C + + Y + S+++ V
Sbjct: 140 PNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDT 199
Query: 196 GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREM 253
+P +FGC N SG + G+LG P+SL SQ + G+FSYC +
Sbjct: 200 ----LPSYSFGCVNAVSGSTLPPQ--GLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSY 253
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRD 312
+ ++ G + +++ TTP+L + RP +Y++L +S+GR +V P +
Sbjct: 254 YFSGSLRLGPLG--QPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPN 311
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
G IID+GT +T Y + + + ++ P+ FD C+ + A
Sbjct: 312 TGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG------PFATIGAFDTCFAATNEDIA 365
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIY 427
P +TFH D + EN C+A+ P +++ QQQN+ I++
Sbjct: 366 -PPVTFHFTGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMF 424
Query: 428 DLNVPALRFGSENC 441
D+ L E C
Sbjct: 425 DVTNSRLGIARELC 438
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 166/365 (45%), Gaps = 39/365 (10%)
Query: 99 VEVNIGTPM-KPQHLLFDTASSLVWTQCQPCIRCFDQTTP---IFDPRASTTYSEIPCDD 154
+ + +GTP+ + L D S VW QC PC P F P S T+S +PC
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149
Query: 155 PLCRSPFK------------CQNGKCVYTRRYHVGDV--TRGLASRETFAFPVRNGFTFV 200
+C + +C + G T G + +TF F G T V
Sbjct: 150 DMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTF----GATAV 205
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-----EMEA 255
P + FGCS+ + G F G SG++G LSL SQL+ G FSY L+ + A
Sbjct: 206 PGVVFGCSDASYG-DFAGA-SGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSA 260
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISI-GRHIVRFPPGAFDIMRDG 313
SVI+FG DA + + ++TP+L S L P FY ++L + + G + P G FD+ +G
Sbjct: 261 DSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANG 320
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA- 372
TGG I+ + TPVT++ Y + + +G + +A+ E D CY S K
Sbjct: 321 TGGVILSSTTPVTYLEQAAYDVVRA---AVASRIGLPAVNGSAALELDLCYNASSMAKVK 377
Query: 373 YPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNV 431
P +T AD + N ++I+ D G C+ + S+LG Q +IYD++
Sbjct: 378 VPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDA 437
Query: 432 PALRF 436
L F
Sbjct: 438 GRLTF 442
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 120/428 (28%), Positives = 184/428 (42%), Gaps = 49/428 (11%)
Query: 56 RIHKMFE-ISKARANYMASMSKPN-AFQE--LEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
RI M+ +++ M + S P A E + + +A Y ++V +GTP +
Sbjct: 106 RIETMYRRAARSGGGRMPASSSPRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFR 165
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC-------------- 157
++ DT S L W QC PC+ CF+Q P+FDP AS++Y + C D C
Sbjct: 166 MIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSP 225
Query: 158 RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR--NGFTFVPRLAFGCSNDNSGFA 215
R+ + C Y Y T G + E+F + V + FGC + N G
Sbjct: 226 RTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLF 285
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE-MEATSVIKFGRDAD-------- 266
+G+LG PLS +SQLR FSYCLV + S + FG D D
Sbjct: 286 H--GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKVVFGEDDDALALAAHP 343
Query: 267 -VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
++ S +Y+ L + +G ++ +D+ +DG+GG IID+GT +
Sbjct: 344 QLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTL 403
Query: 326 TFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDY---CYRYDSSFK-AYPSMTFHL 380
++ YQ + + D++ RS Y EF CY + P ++
Sbjct: 404 SYFVEPAYQVIRHAFMDRMSRS-------YPLVPEFPVLSPCYNVSGVERPEVPELSLLF 456
Query: 381 QEADYIVQPENMYFI--EPDRGR-FCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALR 435
+ P YFI +PD G C+A+ P+ SI+G +QQQN ++YDL L
Sbjct: 457 ADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSIIGNFQQQNFHVVYDLQNNRLG 516
Query: 436 FGSENCAN 443
F CA
Sbjct: 517 FAPRRCAE 524
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 124/419 (29%), Positives = 182/419 (43%), Gaps = 47/419 (11%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGT 105
L + ER + E SKA+ +A K A D++ P+ L+ Y V + +GT
Sbjct: 9 TLQRDERRVRWIE-SKAK---LAGKKKDEA--SSTDLNGPVTSGLLYGSGEYFVRLGLGT 62
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR------- 158
P + ++ DT S L W QCQPC C+ Q PIFDPR S+++ IPC PLC+
Sbjct: 63 PARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEVHSC 122
Query: 159 SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
S + +C Y Y G + G S + F + V AFGC DN G
Sbjct: 123 SGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSV---AFGCGFDNEGLFA-- 177
Query: 219 KISGILGFNASPLSLSSQL-----RNRIQGLFSYCLVRE----MEATSVIKFGRDADVRR 269
+G+LG A LS SQ+ + FSYCLV ++S + FG A
Sbjct: 178 GAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIPST 237
Query: 270 RDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
L +P+L + L +Y ++ +S+G + + + + G+GG IID+GT VT
Sbjct: 238 AAL--SPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRF 295
Query: 329 RNGPYQTLMQRYDQI---LRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQE-A 383
Y T+ + L S R + FD CY + P++ H + A
Sbjct: 296 PTSVYATIRDAFRNATINLPSAPRYSL-------FDTCYNFSGKASVDVPALVLHFENGA 348
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D + P N G FC+A + I+G QQQ+ I +DL L F + C
Sbjct: 349 DLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 155/354 (43%), Gaps = 22/354 (6%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V + +GTP ++FDT S W QC+PC + C+ Q +FDP S+TY+ + C DP
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 156 LCR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C C G C+Y +Y G T G +++T A + FGC N G
Sbjct: 223 ACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQDA----IKGFKFGCGEKNRG 278
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G+ +G+LG P S++ Q + G FSYCL AT ++FG +
Sbjct: 279 LF--GQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEFGPLSPSSSGSNA 336
Query: 274 TTPILLSDLRPHF-YLHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
T +L+D P F Y+ L I + G+ + P F G ++D+GT +T + +
Sbjct: 337 KTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFS-----NSGTLVDSGTVITRLPDT 391
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPE 390
Y L + + + G ++ A D CY + S + P+++ Q +
Sbjct: 392 AYAALSSAFAAAMAASGYKKAA--AYSILDTCYDFTGLSQVSLPTVSLVFQGGACLDLDA 449
Query: 391 NMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + + C+ DD I+G QQ+ ++YD++ + F C
Sbjct: 450 SGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 118/448 (26%), Positives = 189/448 (42%), Gaps = 49/448 (10%)
Query: 19 LFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPN 78
L L SS + GFSL L+P + + + + ++ A + +P
Sbjct: 21 LALDATNSSGAAGFSLPLVPYYRTTAGV----------LEELLPEGDAEGGVNITSIRPK 70
Query: 79 AFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH---LLFDTASSLVWTQCQPCIRCFDQT 135
I YSV V IG+ QH L D L W QC+PC+ Q
Sbjct: 71 MIPYSGGI----------YSVRVGIGSG-GTQHFYKLALDLVRPLTWMQCKPCVPEKRQD 119
Query: 136 TPIFDPRASTTYSEIPCDDPLCRSPF-KCQNGKCVYTRRYHVGDV-TRGLASRETFAFPV 193
+F+ AS Y I DP C +P+ + G+C + ++ GD RG+ + F F
Sbjct: 120 GSVFNTAASPHYHHIASTDPRCMAPYTRAGQGRCTFDVKFQYGDSRARGVLGSDDFVFDG 179
Query: 194 R---NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL----FS 246
+ + V L FGC+++ F +G++ N P S QL R GL FS
Sbjct: 180 SGPGSPISSVNGLVFGCAHNTHDFYNHDLWAGVMSLNRHPTSFIRQLSAR--GLAAPRFS 237
Query: 247 YCLV--REMEATSVIKFGRDADVRRRDLETTPILLSDLRP----HFYLHLLEISIGRHIV 300
YCL + + ++FG D + +TP+L DL ++ + GR +
Sbjct: 238 YCLASRQHRDRRGFLRFGADIP-DQSHARSTPLLHGDLAQGGGMYYVGVVGVSLGGRRLT 296
Query: 301 RFPPGAFDIMRDGT-GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
P F++ R GG IID GT +T + PY L+ +RS G Q ++ Q+
Sbjct: 297 AITPVMFELNRRSLRGGCIIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQHAIFSPGQK 356
Query: 360 FDYCYRYDSSFKAYPSMTFHLQ----EADYIVQPENMYF-IEPDRGRF-CVAIQDDPKYS 413
+ +++S + PS+T H Q ++PE ++ + +R + C+AI + +
Sbjct: 357 HCFRGKWESIHRHLPSVTLHFQFHPESVALFIRPELLFVAMTGERTDYVCLAIVPYAERT 416
Query: 414 ILGAWQQQNMLIIYDLNVPALRFGSENC 441
I+GA Q + +DL L F E C
Sbjct: 417 IIGAGQMLDTRFTFDLQQNRLFFAPEQC 444
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/381 (29%), Positives = 168/381 (44%), Gaps = 35/381 (9%)
Query: 85 DIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
D++ P+ L+ Y V + +GTP + ++ DT S L W QCQPC C+ Q PIFD
Sbjct: 113 DLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFD 172
Query: 141 PRASTTYSEIPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV 193
PR S+++ IPC PLC+ S + +C Y Y G + G S + F
Sbjct: 173 PRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGT 232
Query: 194 RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL-----RNRIQGLFSYC 248
+ V AFGC DN G +G+LG A LS SQ+ + FSYC
Sbjct: 233 GSKAMSV---AFGCGFDNEGLFA--GAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYC 287
Query: 249 LVRE----MEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFP 303
LV ++S + FG A L +P+L + L +Y ++ +S+G +
Sbjct: 288 LVDRSNPMTRSSSSLIFGAAAIPSTAAL--SPLLKNPKLDTFYYAAMIGVSVGGAQLPIS 345
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYC 363
+ + + G+GG IID+GT VT Y T+ + +L P + FD C
Sbjct: 346 LKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNL--PSAPRYS--LFDTC 401
Query: 364 YRYDSSFKA-YPSMTFHLQE-ADYIVQPENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQ 420
Y + P++ H + AD + P N G FC+A + I+G QQ
Sbjct: 402 YNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQ 461
Query: 421 QNMLIIYDLNVPALRFGSENC 441
Q+ I +DL L F + C
Sbjct: 462 QSFRIGFDLQKSHLAFAPQQC 482
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 148/354 (41%), Gaps = 27/354 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP KP ++ DT SSL W QC PC + C Q+ P+FDP+ S++Y+ + C P
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196
Query: 156 LCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C + C+Y Y + G S++T +F G VP +GC
Sbjct: 197 QCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF----GSNSVPNFYYGC 252
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL + + +
Sbjct: 253 GQDNEGLF--GRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSYNPG- 309
Query: 268 RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
TP++ S L Y I + V P A + IID+GT +T
Sbjct: 310 ---QYSYTPMVSSTLDDSLYF----IKLSGMTVAGKPLAVSSSEYSSLPTIIDSGTVITR 362
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
+ Y L + ++ R +A D C+ +S P+++ +
Sbjct: 363 LPTTVYDALSKAVAGAMKGTKRA----DAYSILDTCFVGQASSLRVPAVSMAFSGGAALK 418
Query: 388 QPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
++ D C+A +I+G QQQ ++YD+ + F + C
Sbjct: 419 LSAQNLLVDVDSSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAGGC 472
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/414 (27%), Positives = 165/414 (39%), Gaps = 37/414 (8%)
Query: 37 IPIFSPESPLYPGNLSQSERIH--KMFEISKARANYMASMSKPNAFQELEDIHLPMA--- 91
+P+ P P + + + R +M +AR N++ + K + + + +P +
Sbjct: 58 MPLMYRHGPCAPASAAATNRPSPAEMLRRDRARRNHI--LRKASGRRITLGVSIPTSLGA 115
Query: 92 -KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYS 148
L Y V + GTP PQ LL DT S L W QCQPC C+ Q P+FDP AS+TY+
Sbjct: 116 FVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYA 175
Query: 149 EIPCDDPLCRS------PFKCQNGK-----CVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
+PC CR C N C Y +Y GD T G+ S ET
Sbjct: 176 PVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLS-PEAA 234
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
T V +FGC G G+LG +P SL SQ G FSYCL
Sbjct: 235 TVVNNFSFGCGLVQKGVFD--LFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAG 292
Query: 258 VIKFGRDADVRRR--DLETTPILLSDLRPHFYL-HLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G A + TP+ + + FYL L IS+G + P F
Sbjct: 293 FLALGAPATGGNNTAGFQFTPLQV--VETTFYLVKLTGISVGGKQLDIEPTVF------A 344
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
GG IID+GT VT + Y L + + + +P N ++ D CY + +
Sbjct: 345 GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAY--PLLPPNDDEDLDTCYDFTGNTNVTV 402
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYD 428
E + + + D VA D I+G Q+ ++YD
Sbjct: 403 PTVALTFEGGVTIDLDVPSGVLLDGCLAFVAGASDGDTGIIGNVNQRTFEVLYD 456
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 126/462 (27%), Positives = 188/462 (40%), Gaps = 69/462 (14%)
Query: 19 LFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPN 78
L T + G L+L + + E + + ER+ + E + R M ++ P
Sbjct: 10 LLCTSLAFTTCAGIRLELTHVDAKE------HYTVEERVRRATERTHRRLASMGGVTAP- 62
Query: 79 AFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTP 137
IH Q Y E IG P + + DT S+L+WTQC C CF Q P
Sbjct: 63 -------IHWGGQSQ---YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLP 112
Query: 138 IFDPRASTTYSEIPCDDPLCR--SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPV 193
+DP S + C+D C S +C N C Y G++ LA+ E F
Sbjct: 113 YYDPSRSRAARAVGCNDAACALGSETQCLSDNKTCAVVTGYGAGNIAGTLAT-ENLTFQS 171
Query: 194 RNGFTFVPRLAFGC---SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
L FGC + + G G SGI+G LSL SQL + FSYCL
Sbjct: 172 ET-----VSLVFGCIVVTKLSPGSLNGA--SGIIGLGRGKLSLPSQLGDT---RFSYCLT 221
Query: 251 REMEAT---SVIKFGRDADVRRRDLETTPILL-------SD--LRPHFYLHLLEISIGRH 298
E T S + G A + +TP+ SD +YL L I+ G+
Sbjct: 222 PYFEDTIEPSHMVVGASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKV 281
Query: 299 IVRFPPGAFDIMRDGTG---GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI-PY 354
+ P AFD+ + G G ID+G P+T + + YQ L ++ R LG + P
Sbjct: 282 KLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRA---ELARQLGAALVQPL 338
Query: 355 NASQEFDYCYRYDSSFKAYPSMTFHL-----QEADYIVQPENMYFIEPDRGRFCVAI--- 406
+ FD C + + P + H D +V P N Y+ D C+ +
Sbjct: 339 AGTTGFDLCVALKDAERLVPPLVLHFGGGSGTGTDLVVPPAN-YWAPVDSATACMVVFSS 397
Query: 407 ---QDDP--KYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ P + +++G + QQNM ++YDL L F +C++
Sbjct: 398 VDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADCSS 439
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 115/409 (28%), Positives = 175/409 (42%), Gaps = 37/409 (9%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
N S ++ + + + RA ++ + + A E + + A Y ++ +GTP +
Sbjct: 79 NASAADLLARRLQRDMRRAAWIITKAATPADPENGTV-VTGAPTSGEYIAKITVGTPYEN 137
Query: 110 QH-----LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPF 161
L D S + W QC PC RC+ Q P+++ S++ S++ C P CR S
Sbjct: 138 DSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCYAPACRALGSSG 197
Query: 162 KCQN--GKCVYTRRYHVGDVTRGLASRETFAFP--VRNGFTFVPRLAFGCSNDNSGFAFG 217
C +C Y Y G + G ET FP VR VP +A GC +DN G F
Sbjct: 198 GCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPGVR-----VPGVAIGCGSDNQGL-FP 251
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA--TSVIKFGRDADV--RRRDLE 273
+GILG LS SQ+ R FSYCL + +S + FG A
Sbjct: 252 APAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATTTTTTPP 311
Query: 274 TTPILLSDLRPH--FYLHLLEISIGRHIVRFPPGAFDIMRD---GTGGFIIDTGTPVTFI 328
+ +L++ R + +Y+ L+ IS+G VR + D+ D G GG I+D+GT VT +
Sbjct: 312 SFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTES-DLRLDPSTGHGGVIVDSGTAVTRL 370
Query: 329 RNGPYQTLMQRYD-QILRSLGRQRIPYNASQEFDYCYRY--DSSFKAYPSMTFHLQEADY 385
Y + ++ LG P FD CY K P+++ H
Sbjct: 371 SGPAYAAFRDAFRVAAVKELGWPS-PGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVE 429
Query: 386 IVQPENMYFIEPD--RGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLN 430
+ P Y I D +G C A D SI+G Q Q ++YD++
Sbjct: 430 VKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFRVVYDVD 478
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/337 (28%), Positives = 152/337 (45%), Gaps = 32/337 (9%)
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS----PFKCQNGK 167
LL DT S + W QC PC +C+ Q +F P S TY +PC+ +C+ C N
Sbjct: 3 LLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSS 62
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGF 226
C Y Y TRG + ET + VP AFGC + N G G +G++G
Sbjct: 63 CNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGA--AGLMGL 120
Query: 227 NASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRP 284
S + +Q +FSYCL V + ++ FG +A + D+ TP++ S P
Sbjct: 121 GKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFG-EAAMLDYDVRFTPLVDSSSGP 179
Query: 285 -HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQI 343
+++ + I++G ++ P + +M +D+GT ++ Y+ L + QI
Sbjct: 180 SQYFVSMTGINVGDELL---PISATVM--------VDSGTVISRFEQSAYERLRDAFTQI 228
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQE-ADYIVQPENMYFIEPDRGR 401
L L + FD C+R + P +T H ++ A+ + P ++ + D G
Sbjct: 229 LPGLQTAV----SVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILY-PVDDGV 283
Query: 402 FCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
C A S+LG +QQQN+ +YD +P R G
Sbjct: 284 MCFAFAPSSSGRSVLGNFQQQNLRFVYD--IPKSRLG 318
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 150/356 (42%), Gaps = 28/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y ++ +GTP ++ DT SSL W QC PC + C Q P+FDPRAS+TY+ + C
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSAS 193
Query: 156 LCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C C+Y Y + G S +T +F G T P +GC
Sbjct: 194 QCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSF----GSTSYPSFYYGC 249
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL +T + G
Sbjct: 250 GQDNEGLF--GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-PTAASTGYLSIGPYNTG 306
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TP+ S L Y + L +S+G + P + + IID+GT +T
Sbjct: 307 HYYSY--TPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----IIDSGTVIT 359
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ + L + Q + G QR P A D C+ +S P++ +
Sbjct: 360 RLPTAVHTALSKAVAQAMA--GAQRAP--AFSILDTCFEGQASQLRVPTVVMAFAGGASM 415
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
I+ D C+A +I+G QQQ +IYD+ + F + C+
Sbjct: 416 KLTTRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 165/365 (45%), Gaps = 39/365 (10%)
Query: 99 VEVNIGTPM-KPQHLLFDTASSLVWTQCQPCIRCFDQTTP---IFDPRASTTYSEIPCDD 154
+ + +GTP+ + L D S VW QC PC P F P S T+S +PC
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149
Query: 155 PLCRSPFK------------CQNGKCVYTRRYHVGDV--TRGLASRETFAFPVRNGFTFV 200
+C + +C + G T G + +TF F G T V
Sbjct: 150 DMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTF----GATAV 205
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-----EMEA 255
P + FGCS+ + G F G SG++G LSL SQL+ G FSY L+ + A
Sbjct: 206 PGVVFGCSDASYG-DFAGA-SGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSA 260
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISI-GRHIVRFPPGAFDIMRDG 313
SVI+FG DA + + +TP+L S L P FY ++L + + G + P G FD+ +G
Sbjct: 261 DSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANG 320
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA- 372
TGG I+ + TPVT++ Y + + +G + +A+ E D CY S K
Sbjct: 321 TGGVILSSTTPVTYLEQAAYDVVRA---AVASRIGLPAVNGSAALELDLCYNASSMAKVK 377
Query: 373 YPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNV 431
P +T AD + N ++I+ D G C+ + S+LG Q +IYD++
Sbjct: 378 VPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDA 437
Query: 432 PALRF 436
L F
Sbjct: 438 GRLTF 442
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 159/375 (42%), Gaps = 38/375 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP-----IFDPRASTTYSEIP 151
Y V + +GTP +P L+ DT S L W +C +F P S ++S +P
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLP 163
Query: 152 CDDPLCRS--PFKCQNGK-----CVYTRRYHVGDVTRGLASRETFAFPVR-NGFTFVPRL 203
CD C+S PF N C Y RY RG+ ++ + N T +L
Sbjct: 164 CDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAKL 223
Query: 204 ---AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATS 257
GC+ G +F G+L S +S +S+ +R G FSYCLV + ATS
Sbjct: 224 QEVVLGCTTSYDGQSFKSS-DGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATS 282
Query: 258 VIKFGRDADVRRRDL---ETTPILLSD--LRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
+ FG D T +LL D RP +++ + +++ + P +D ++
Sbjct: 283 FLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWDFRKN 342
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ--RIPYNASQEFDYCYRYDSSF 370
GG I+D+GT +T + YD +++++ +Q +P F+YCY +
Sbjct: 343 --GGAILDSGTSLTIL-------ATPAYDAVVKAISKQFAGVPRVNMDPFEYCYNWTGVS 393
Query: 371 KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYD 428
P M A + P Y I+ G C+ + + P S++G QQ L +D
Sbjct: 394 AEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFD 453
Query: 429 LNVPALRFGSENCAN 443
L LRF CA+
Sbjct: 454 LANRWLRFKQSRCAH 468
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 161/374 (43%), Gaps = 37/374 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRA-----STTYSEIP 151
Y V+ +GTP +P L+ DT S L W +C+ +P+ PR S +++ IP
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIP 169
Query: 152 CDDPLCRS--PFKCQN--------GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT--- 198
C C+S PF N C Y RY RG+ + + +
Sbjct: 170 CSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDRK 229
Query: 199 -FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---E 254
+ + GC+ G +F G+L S +S +S+ R G FSYCLV +
Sbjct: 230 AKLQEVVLGCTTSYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 288
Query: 255 ATSVIKFGRDADVRRRDLETTPILL-SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
ATS + FG TP+LL + + P + + + +S+ + P +D+ ++
Sbjct: 289 ATSYLTFGPVGAA--HSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKKN- 345
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-- 371
GG I+D+GT +T + Y+ ++ + L R+P F+YCY + ++ +
Sbjct: 346 -GGAILDSGTSLTILATPAYKAVVAALSKQL-----ARVPRVTMDPFEYCYNWTATRRPP 399
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDL 429
A P + + + P Y I+ G C+ +Q+ P S++G QQ L +DL
Sbjct: 400 AVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDL 459
Query: 430 NVPALRFGSENCAN 443
LRF CA+
Sbjct: 460 ANRWLRFQESRCAH 473
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 159/372 (42%), Gaps = 48/372 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y IGTP +P + D LVWTQC PC CF+Q P+FDP S+T+ +PC
Sbjct: 56 LYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSH 115
Query: 156 LC----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS--N 209
LC S C + C+Y GD T G+A +TFA L FGC
Sbjct: 116 LCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAIGAAK-----ETLGFGCVVMT 169
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD-VR 268
D GG SGI+G +P SL +Q+ FSYCL +++ + G A +
Sbjct: 170 DKRLKTIGGP-SGIVGLGRTPWSLVTQMNVTA---FSYCLAG--KSSGALFLGATAKQLA 223
Query: 269 RRDLETTPILL--------SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
+TP ++ + P++ + L I G ++ + + ++D
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTV-------LLD 276
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
T + +++ +G Y+ L + + ++G Q + + + +D C+ + A P + F
Sbjct: 277 TVSRASYLADGAYKALKK---ALTAAVGVQPV-ASPPKPYDLCFSKAVAGDA-PELVFTF 331
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPKY---------SILGAWQQQNMLIIYDLNV 431
+ P Y + G C+ I SILG+ QQ+N+ +++DL
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391
Query: 432 PALRFGSENCAN 443
L F +C++
Sbjct: 392 ETLSFKPADCSS 403
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 159/365 (43%), Gaps = 54/365 (14%)
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNG----- 166
++ DT S L W QC+PC C+ Q P+FDP S +Y+ +PC+ C + K G
Sbjct: 179 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 238
Query: 167 -------------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
+C Y+ Y G +RG+ + +T A G V FGC N G
Sbjct: 239 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL----GGASVDGFVFGCGLSNRG 294
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRD 271
FGG +G++G + LSL SQ R G+FSYCL +A + G D R
Sbjct: 295 L-FGG-TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYR-- 350
Query: 272 LETTPI----LLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TP+ +++D +P FY + ++ A + ++D+GT +T
Sbjct: 351 -NATPVSYTRMIADPAQPPFY--FMNVTGASVGGAAVAAAGLGAAN----VLLDSGTVIT 403
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKA-YPSMTFHLQ- 381
+ Y+ + + R G +R P A+ F D CY + P +T L+
Sbjct: 404 RLAPSVYRAVRAEF---ARQFGAERYP--AAPPFSLLDACYNLTGHDEVKVPLLTLRLEG 458
Query: 382 EADYIVQPENMYFI-EPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFG 437
AD V M F+ D + C+A+ + + I+G +QQ+N ++YD L F
Sbjct: 459 GADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFA 518
Query: 438 SENCA 442
E+C+
Sbjct: 519 DEDCS 523
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 145/349 (41%), Gaps = 39/349 (11%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPC 152
L Y V V++GTP Q L DT S + W QC+PC C+ Q P+FDP S++YS +PC
Sbjct: 140 LQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPC 199
Query: 153 DDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C C G+C Y Y G T G+ S +T G + FGC
Sbjct: 200 AAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTL---TGSNALKGFLFGCG 256
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVR 268
+ G G + G+LG SL SQ + G+FSYCL + I G +
Sbjct: 257 HAQQGLFAG--VDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSST- 313
Query: 269 RRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
TTP+L + P +Y+ +L IS+G + F G ++DTGT VT
Sbjct: 314 -AGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF------ASGAVVDTGTVVTR 366
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFHLQEA- 383
+ Y L + + G P A+ D CY RY + S+ F A
Sbjct: 367 LPPTAYSALRSAFRAAMAPYGYPSAP--ATGILDTCYDFTRYGTVTLPTISIAFGGGAAM 424
Query: 384 ----DYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYD 428
I+ + F P G D + SILG QQ++ + +D
Sbjct: 425 DLGTSGILTSGCLAF-APTGG--------DSQASILGNVQQRSFEVRFD 464
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 159/365 (43%), Gaps = 54/365 (14%)
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNG----- 166
++ DT S L W QC+PC C+ Q P+FDP S +Y+ +PC+ C + K G
Sbjct: 178 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237
Query: 167 -------------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
+C Y+ Y G +RG+ + +T A G V FGC N G
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL----GGASVDGFVFGCGLSNRG 293
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRD 271
FGG +G++G + LSL SQ R G+FSYCL +A + G D R
Sbjct: 294 L-FGG-TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYR-- 349
Query: 272 LETTPI----LLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TP+ +++D +P FY + ++ A + ++D+GT +T
Sbjct: 350 -NATPVSYTRMIADPAQPPFY--FMNVTGASVGGAAVAAAGLGAAN----VLLDSGTVIT 402
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKA-YPSMTFHLQ- 381
+ Y+ + + R G +R P A+ F D CY + P +T L+
Sbjct: 403 RLAPSVYRAVRAEF---ARQFGAERYP--AAPPFSLLDACYNLTGHDEVKVPLLTLRLEG 457
Query: 382 EADYIVQPENMYFI-EPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFG 437
AD V M F+ D + C+A+ + + I+G +QQ+N ++YD L F
Sbjct: 458 GADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFA 517
Query: 438 SENCA 442
E+C+
Sbjct: 518 DEDCS 522
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 150/356 (42%), Gaps = 28/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y ++ +GTP ++ DT SSL W QC PC + C Q P+FDPRAS+TY+ + C
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSAS 193
Query: 156 LCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C C+Y Y + G S +T +F G T P +GC
Sbjct: 194 QCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSF----GSTRYPSFYYGC 249
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL +T + G
Sbjct: 250 GQDNEGLF--GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-PTAASTGYLSIGPYNTG 306
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TP+ S L Y + L +S+G + P + + IID+GT +T
Sbjct: 307 HYYSY--TPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----IIDSGTVIT 359
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ + L + Q + G QR P A D C+ +S P++ +
Sbjct: 360 RLPTAVHTALSKAVAQAMA--GAQRAP--AFSILDTCFEGQASQLRVPTVAMAFAGGASM 415
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
I+ D C+A +I+G QQQ +IYD+ + F + C+
Sbjct: 416 KLTTRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 159/389 (40%), Gaps = 50/389 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP---------IFDPRASTTY 147
Y V +GTP +P L+ DT S L W +C+ +P F P S T+
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 148 SEIPCDDPLCRS--PFKCQ-----NGKCVYTRRYHVGDVTRGLASRE--TFAFPVRNGFT 198
+ I C C PF C Y RY G RG E T A R
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERK 216
Query: 199 F-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME--- 254
+ L GCS+ +G +F G+L S +S +S +R G FSYCLV +
Sbjct: 217 AKLKGLVLGCSSSYTGPSFEAS-DGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRN 275
Query: 255 ATSVIKFGRDADVRR------------RDLETTPILLS-DLRPHFYLHLLEISIGRHIVR 301
ATS + FG + V TP+LL +RP + + L IS+ ++
Sbjct: 276 ATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLK 335
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
P +D+ + GG I+D+GT +T + Y+ ++ + L L R + F+
Sbjct: 336 IPRAVWDV--EAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTM-----DPFE 388
Query: 362 YCYRYDS-----SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSI 414
YCY + S + A P M H A + P Y I+ G C+ +Q+ P S+
Sbjct: 389 YCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISV 448
Query: 415 LGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+G QQ L +D+ L+F C +
Sbjct: 449 IGNILQQEHLWEFDIKNRRLKFQRSRCTH 477
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 167/367 (45%), Gaps = 39/367 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT--TPIFDPRASTTYSEIPCDD 154
Y + VN+GTP + DT S LVW C + +F P STTYS + C
Sbjct: 100 YLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQS 159
Query: 155 PLCR--SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF----VPRLAFGC 207
C+ S C + +C Y Y G T G+ S ETF+F G VPR++FGC
Sbjct: 160 AACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVSFGC 219
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLV---REMEATSVIKFG 262
S ++G +F + G++G A LSL SQL RI FSYCLV ++S + FG
Sbjct: 220 STGSAG-SF--RSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSSSTLSFG 276
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
A V +TP++ S++ ++ + L +++ D+ + I+D+G
Sbjct: 277 ARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ---------DVASANSSRIIVDSG 327
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----YPSMTF 378
T +TF+ + L+ ++ +R L R + P Q CY +A P +T
Sbjct: 328 TTLTFLDPALLRPLVAELERRIR-LPRAQPP---EQLLQLCYDVQGKSQAEDFGIPDVTL 383
Query: 379 HL-QEADYIVQPENMYFIEPDRGRFC---VAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
A ++PEN + + + G C V + + SILG QQN + YDL+ +
Sbjct: 384 RFGGGASVTLRPENTFSLL-EEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTV 442
Query: 435 RFGSENC 441
F + +C
Sbjct: 443 TFAAVDC 449
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 172/390 (44%), Gaps = 45/390 (11%)
Query: 65 KARANYMASMSKPNAFQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASS 119
KAR Y++S++ + +P+A Q Y V NIGTP +P + DT++
Sbjct: 57 KARFLYLSSLAG------VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSND 110
Query: 120 LVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHV 176
W C C+ C ++ +FDP S++ + C+ P C+ +P + C + Y
Sbjct: 111 AAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG- 167
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQ 236
G +++T +P FGC N SG + + G++G PLSL SQ
Sbjct: 168 GSTIEAYLTQDTLTL----ASDVIPNYTFGCINKASGTSLPAQ--GLMGLGRGPLSLISQ 221
Query: 237 LRNRIQGLFSYCLVREMEA--TSVIKFG-RDADVRRRDLETTPILLSDLRPH-FYLHLLE 292
+N Q FSYCL + + ++ G ++ +R ++TTP+L + R +Y++L+
Sbjct: 222 SQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR---IKTTPLLKNPRRSSLYYVNLVG 278
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
I +G IV P A G I D+GT T + Y + + + +++ +
Sbjct: 279 IRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSL 338
Query: 353 PYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP-- 410
FD CY S +PS+TF + + P+N+ C+A+ P
Sbjct: 339 -----GGFDTCY---SGSVVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVN 390
Query: 411 ---KYSILGAWQQQNMLIIYDLNVPALRFG 437
+++ + QQQN ++ D VP R G
Sbjct: 391 VNSVLNVIASMQQQNHRVLID--VPNSRLG 418
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 160/362 (44%), Gaps = 38/362 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC--- 152
Y V+V +G+P + ++ DT SSL W QC+PC + C Q P+FDP AS TY + C
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 72
Query: 153 ----------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
++PLC + + CVYT Y + G S++ +P
Sbjct: 73 QCSSLVDATLNNPLCET----SSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ---TLPG 125
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
+GC D+ G G+ +GILG + LS+ Q+ ++ FSYCL + G
Sbjct: 126 FVYGCGQDSEGLF--GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR-GGGGFLSIG 182
Query: 263 RDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ A + + TP+ P Y L L I++G + + + IID+
Sbjct: 183 K-ASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------IIDS 235
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SSFKAYPSMTFHL 380
GT +T + Y Q + +I+ S R P D C++ + ++ P +
Sbjct: 236 GTVITRLPMSVYTPFQQAFVKIMSSK-YARAP--GFSILDTCFKGNLKDMQSVPEVRLIF 292
Query: 381 Q-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
Q AD ++P N+ ++ D G C+A + +I+G QQQ + +D++ + F +
Sbjct: 293 QGGADLNLRPVNV-LLQVDEGLTCLAFAGNNGVAIIGNHQQQTFKVAHDISTARIGFATG 351
Query: 440 NC 441
C
Sbjct: 352 GC 353
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 172/390 (44%), Gaps = 45/390 (11%)
Query: 65 KARANYMASMSKPNAFQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASS 119
KAR Y++S++ + +P+A Q Y V NIGTP +P + DT++
Sbjct: 57 KARFLYLSSLAG------VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSND 110
Query: 120 LVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHV 176
W C C+ C ++ +FDP S++ + C+ P C+ +P + C + Y
Sbjct: 111 AAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG- 167
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQ 236
G +++T +P FGC N SG + + G++G PLSL SQ
Sbjct: 168 GSTIEAYLTQDTLTL----ASDVIPNYTFGCINKASGTSLPAQ--GLMGLGRGPLSLISQ 221
Query: 237 LRNRIQGLFSYCLVREMEA--TSVIKFG-RDADVRRRDLETTPILLSDLRPH-FYLHLLE 292
+N Q FSYCL + + ++ G ++ +R ++TTP+L + R +Y++L+
Sbjct: 222 SQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR---IKTTPLLKNPRRSSLYYVNLVG 278
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
I +G IV P A G I D+GT T + Y + + + +++ +
Sbjct: 279 IRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSL 338
Query: 353 PYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP-- 410
FD CY S +PS+TF + + P+N+ C+A+ P
Sbjct: 339 -----GGFDTCY---SGSVVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVN 390
Query: 411 ---KYSILGAWQQQNMLIIYDLNVPALRFG 437
+++ + QQQN ++ D VP R G
Sbjct: 391 VNSVLNVIASMQQQNHRVLID--VPNSRLG 418
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 121/433 (27%), Positives = 196/433 (45%), Gaps = 46/433 (10%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISK--ARANYMASMSKPNAFQELEDIHL 88
G +L++ F P SPL PG + S + S+ +R Y+ S+ A + +
Sbjct: 41 GNTLQVSHAFGPCSPLGPGTTAPSWAGFLADQASRDASRLLYLDSL----AARGKARAYA 96
Query: 89 PMAK-----QDLFYSVEVNIGTPMKPQHLLF--DTASSLVWTQCQPCIRCFDQTTPIFDP 141
P+A Q Y V +GTP PQ LL DT++ W C C C + P FDP
Sbjct: 97 PIASGRQLLQTPTYVVRARLGTP--PQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDP 154
Query: 142 RASTTYSEIPCDDPLC-RSP-FKCQNG--KCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
AST+Y +PC PLC ++P C G C ++ Y + L S+++ A
Sbjct: 155 AASTSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAAL-SQDSLAVAGDAVK 213
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEA 255
T+ FGC +G A + LG S SQ R+ QG FSYCL + +
Sbjct: 214 TYT----FGCLQKATGTAAPPQGLLGLGRGPL--SFLSQTRDMYQGTFSYCLPSFKSLNF 267
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ ++ GR+ R ++TTP+L + R +Y+++ I +GR +V PP A
Sbjct: 268 SGTLRLGRNGQPPR--IKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATG 325
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
G ++D+GT T + Y + D++ R +G P ++ FD C ++++ A+P
Sbjct: 326 AGTVLDSGTMFTRLVAPAYVAV---RDEVRRRVG---APVSSLGGFDTC--FNTTAVAWP 377
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRF-CVAIQDDPK-----YSILGAWQQQNMLIIYD 428
+T L + + PE I G C+A+ P +++ + QQQN +++D
Sbjct: 378 PVTL-LFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFD 436
Query: 429 LNVPALRFGSENC 441
+ + F E C
Sbjct: 437 VPNGRVGFARERC 449
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 145/349 (41%), Gaps = 39/349 (11%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPC 152
L Y V V++GTP Q L DT S + W QC+PC C+ Q P+FDP S++YS +PC
Sbjct: 129 LQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPC 188
Query: 153 DDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C C G+C Y Y G T G+ S +T G + FGC
Sbjct: 189 AAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTL---TGSNALKGFLFGCG 245
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVR 268
+ G G + G+LG SL SQ + G+FSYCL + I G +
Sbjct: 246 HAQQGLFAG--VDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSST- 302
Query: 269 RRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
TTP+L + P +Y+ +L IS+G + F G ++DTGT VT
Sbjct: 303 -AGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF------ASGAVVDTGTVVTR 355
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFHLQEA- 383
+ Y L + + G P A+ D CY RY + S+ F A
Sbjct: 356 LPPTAYSALRSAFRAAMAPYGYPSAP--ATGILDTCYDFTRYGTVTLPTISIAFGGGAAM 413
Query: 384 ----DYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYD 428
I+ + F P G D + SILG QQ++ + +D
Sbjct: 414 DLGTSGILTSGCLAF-APTGG--------DSQASILGNVQQRSFEVRFD 453
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 162/375 (43%), Gaps = 44/375 (11%)
Query: 97 YSVEVNIGTPMKPQH--LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
Y V V IGT L+ DTASSL W +C C+ Q +P+FDP S++Y +
Sbjct: 74 YGVAVTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTS 133
Query: 155 PLCRSPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
PLCR+P KC +H+ G +T + N + +AFGC+
Sbjct: 134 PLCRAPNPVLPAGDKC----SFHLPGEAHGYVGTDTII--LGNPTLPIHSVAFGCAQSTE 187
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA---TSVIKFGRD----- 264
GF G +G LG P SL Q+++R+ FSYCL+ + I+FG D
Sbjct: 188 GFDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPT 247
Query: 265 --ADVRRRDLETTPILLSDLRPH------FYLHLLEISI-GRHIVRFPPGAFDIMRDGTG 315
R + L T P L PH +Y+ LL IS+ G I F+ DG+G
Sbjct: 248 LLVHHRIKILPTPPHL-----PHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSG 302
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR-YDSSFKAYP 374
G +D GT VT + Y + + +++ G +R+ F C+R + + P
Sbjct: 303 GCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRV---RDPNFSLCFREHPGIWSHIP 359
Query: 375 SMTFHLQE------ADYIVQPENMYFIEPDRGRFCVAIQDDPKYS--ILGAWQQQNMLII 426
+T + A + N++ ++ C + + S ++GA QQ + I
Sbjct: 360 KLTLDFEGPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFI 419
Query: 427 YDLNVPALRFGSENC 441
+DL+ + F E+C
Sbjct: 420 FDLHANTITFHRESC 434
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 113/444 (25%), Positives = 191/444 (43%), Gaps = 57/444 (12%)
Query: 16 FSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASM 74
F L L ++ G ++K+ ++SP+SP P +S + + +M +AR +++S+
Sbjct: 10 FLFLSLVQGLNTRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQFLSSL 69
Query: 75 SKPNAFQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI 129
++ +P+A Q Y V+ N+GTP + + DT++ W C C+
Sbjct: 70 VGRKSW-------VPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCV 122
Query: 130 RCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASR 186
C ++ +F+ STT+ + CD P C+ +P C C + Y + L +R
Sbjct: 123 GC---SSTVFNSVTSTTFKTLGCDAPQCKQVPNP-TCGGSTCTWNTTYGGSTILSNL-TR 177
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
+T A VP FGC +G + + LG S SQ ++ + FS
Sbjct: 178 DTIALSTD----IVPGYTFGCIQKTTGSSVPPQGLLGLGRGPL--SFLSQTQDLYKSTFS 231
Query: 247 YCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFP 303
YCL R + + ++ G R ++TTP+L + R +Y++L+ I +GR IV P
Sbjct: 232 YCLPSFRTLNFSGTLRLGPAGQPLR--IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIP 289
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY-----DQILRSLGRQRIPYNASQ 358
A G I D+GT T + Y + + + I+ SLG
Sbjct: 290 ASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLG---------- 339
Query: 359 EFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YS 413
FD CY + P+MTF + + P+N+ C+A+ P +
Sbjct: 340 GFDTCY---TGPIVAPTMTFMFSGMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLN 396
Query: 414 ILGAWQQQNMLIIYDLNVPALRFG 437
++ QQQN I++D VP R G
Sbjct: 397 VIANMQQQNHRILFD--VPNSRIG 418
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 172/397 (43%), Gaps = 32/397 (8%)
Query: 56 RIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFD 115
++ + E S R Y+ + + + L ++P+ Q V ++IG+P Q L D
Sbjct: 47 HVYHIKEASVERLEYLKAKTTGDIIAHLSP-NVPIIPQAFL--VNISIGSPPITQLLHMD 103
Query: 116 TASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP---FKCQNGKCVYTR 172
TAS L+W QC PCI C+ Q+ PIFDP S T+ C P F C Y+
Sbjct: 104 TASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSM 163
Query: 173 RYHVGDVTRGLASRETFAFPV---RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
RY ++G+ +RE F + + + FGC +DN G G +GILG
Sbjct: 164 RYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGEPLVG--TGILGLGYG 221
Query: 230 PLSLSSQLRNRIQGLFSYCLVREMEAT---SVIKFGRDADVRRRDLETTPILLSDLRPHF 286
SL + + FSYC + + +V+ G D D TTP+ + + +
Sbjct: 222 EFSLVHRFGKK----FSYCFGSLDDPSYPHNVLVLGDDGANILGD--TTPLEIHN--GFY 273
Query: 287 YLHLLEISIGRHIVRFPPGAFD-IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILR 345
Y+ + IS+ I+ P F+ + G GG IIDTG +T + Y+ L R + I
Sbjct: 274 YVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFE 333
Query: 346 SLGRQRIPYNASQEFDYCYRYDSSFK------AYPSMTFHLQEADYIVQPENMYFIEPDR 399
GR + + Y+ +F+ +P +TFH E + F++
Sbjct: 334 --GRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSP 391
Query: 400 GRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
FC+A+ SI GA QQ+ I YDL + F
Sbjct: 392 NVFCLAVTPGNLNSI-GATAQQSYNIGYDLEAMEVSF 427
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 120/455 (26%), Positives = 194/455 (42%), Gaps = 47/455 (10%)
Query: 10 AAFFSYFSVLFLTHFT----SSESTGFSLKLIPIFSPESPLYP-GNLSQSERIHKMFEIS 64
AA F F++LF T +++S L +IPI+S SP P S + M
Sbjct: 7 AATFFLFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKD 66
Query: 65 KARANYMASMSKPNAFQELEDIHLPMAKQDL---FYSVEVNIGTPMKPQHLLFDTASSLV 121
R Y++++ A Q+ + + +Q L Y V V +GTP + ++ DT++
Sbjct: 67 PERLKYLSTL----ADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAA 122
Query: 122 WTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC---QNGKCVYTRRYHV 176
W C C C T F P ASTT + C C F C + C++ + Y
Sbjct: 123 WVPCSGCTGCSSTT---FLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY-- 177
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI--SGILGFNASPLSLS 234
G + A+ A + N +P FGC N S GG I G+LG P+SL
Sbjct: 178 GGDSSLTATLVQDAITLAN--DVIPGFTFGCINAVS----GGSIPPQGLLGLGRGPISLI 231
Query: 235 SQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLL 291
SQ G+FSYCL + + +K G + + + TTP+L + RP +Y++L
Sbjct: 232 SQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVG--QPKSIRTTPLLRNPHRPSLYYVNLT 289
Query: 292 EISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR 351
+S+GR V P + G IID+GT +T Y + + + +
Sbjct: 290 GVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG----- 344
Query: 352 IPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK 411
P ++ FD C+ + +A P++T H + + ++ EN C+++ P
Sbjct: 345 -PISSLGAFDTCFAATNEAEA-PAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPN 402
Query: 412 -----YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+++ QQQN+ I++D L E C
Sbjct: 403 NVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/356 (26%), Positives = 146/356 (41%), Gaps = 27/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP ++ DT SSL W QC PC + C Q P+FDPRAS TY+ + C
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSS 190
Query: 156 LCR-------SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C + C+Y Y + G S++T +F G P +GC
Sbjct: 191 ECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSF----GSGSFPGFYYGC 246
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL A + G
Sbjct: 247 GQDNEGLF--GRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTSSAAAGYLSIG---SY 301
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TP+ S L Y + L IS+ + PP + + IID+GT +T
Sbjct: 302 NPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPT-----IIDSGTVIT 356
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ Y L + + S + Y+ D C+R ++ P + +
Sbjct: 357 RLPPNVYTALSRAVAAAMASAAPRAPTYSI---LDTCFRGSAAGLRVPRVDMAFAGGATL 413
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
I+ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 414 ALSPGNVLIDVDDSTTCLAFAPTGGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 128/425 (30%), Positives = 187/425 (44%), Gaps = 30/425 (7%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF+ L SP SPL+ +LS+ + + F S +R+ A++ I P+
Sbjct: 27 GFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRS---ATLLTHLTSVSTACIRSPI 83
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ + + IGTP + DT S L WTQC PC CF+Q+ PIF+PR S++Y ++
Sbjct: 84 IPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKV 143
Query: 151 PCDDPLCRSPFKCQNG----KCVYTRRYHVGDVTRG-LASRETFAFPVRNGFTFVPRLAF 205
C CRS G C Y Y T G LAS + + G +P+
Sbjct: 144 SCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQ-----ITIGSFKLPKTVI 198
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL---FSYCL---VREMEATSVI 259
GC + N G FGG SGI+G LSL SQ+R I G+ FSYCL T I
Sbjct: 199 GCGHQNGG-TFGGVTSGIIGLGGGSLSLVSQMRT-IAGVKPRFSYCLPTFFSNANITGTI 256
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
FGR A V R + +TP++ ++L L IS+G+ RF G II
Sbjct: 257 SFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKK--RFKAANGISAMTNHGNIII 314
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTF 378
D+GT +T + Y + +++++ +R+ + S + CY P +T
Sbjct: 315 DSGTTLTLLPRSLYYGVFSTLARVIKA---KRVD-DPSGILELCYSAGQVDDLNIPIITA 370
Query: 379 HLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
H AD + P N + D C+ + +I G Q N + YDL L F
Sbjct: 371 HFAGGADVKLLPVNTFAPVADN-VTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFE 429
Query: 438 SENCA 442
+ CA
Sbjct: 430 PKLCA 434
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 179/429 (41%), Gaps = 53/429 (12%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI--HLPMAKQDLFYSVEVNIGTPM 107
NL+ E I + + S R +A A + + + P+ Y V++ GTP
Sbjct: 43 NLTDQELIRRAVQRSLDRPGIVARSGGGAADEAGKAVASEAPLVPGGGEYLVKLGTGTPQ 102
Query: 108 KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQ- 164
DTAS LVW QCQPC+ C+ Q P+F+P+ S++Y+ +PC C +C
Sbjct: 103 HFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHE 162
Query: 165 --NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
+G C YT +Y VT+G + + A G + FGCS+ + G + SG
Sbjct: 163 DDDGACQYTYKYSGHGVTKGTLAIDKLAI----GGDVFHAVVFGCSDSSVG-GPAAQASG 217
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS---VIKFGRDADVRRRDLETTPILL 279
++G PLSL SQL F YCL M TS V+ G DA VR T +
Sbjct: 218 LVGLGRGPLSLVSQLSVH---RFMYCLPPPMSRTSGKLVLGAGADA-VRNMSDRVTVTMS 273
Query: 280 SDLR--PHFYLHLLEISIGRHI------VRFPPGAFDIMRDGTG-------------GFI 318
S R ++YL+L +++G PP G G G I
Sbjct: 274 SSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMI 333
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY------RYDSSFKA 372
+D + ++F+ Y L ++ +R R + D C+ D +
Sbjct: 334 VDVASTISFLETSLYDELADDLEEEIR---LPRATPSLRLGLDLCFILPEGVGMDRVYVP 390
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
S++F + ++ + F+ R C+ I SILG +Q QNM ++++L
Sbjct: 391 TVSLSF---DGRWLELDRDRLFVTDGR-MMCLMIGRTSGVSILGNFQLQNMRVLFNLRRG 446
Query: 433 ALRFGSENC 441
+ F +C
Sbjct: 447 KITFAKASC 455
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 158/372 (42%), Gaps = 48/372 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y IGTP +P + D LVWTQC PC CF+Q P+FDP S+T+ +PC
Sbjct: 56 LYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSH 115
Query: 156 LC----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS--N 209
LC S C + C+Y GD T G A +TFA L FGC
Sbjct: 116 LCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGAAK-----ETLGFGCVVMT 169
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD-VR 268
D GG SGI+G +P SL +Q+ FSYCL +++ + G A +
Sbjct: 170 DKRLKTIGGP-SGIVGLGRTPWSLVTQMNVTA---FSYCLAG--KSSGALFLGATAKQLA 223
Query: 269 RRDLETTPILL--------SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
+TP ++ + P++ + L I G ++ + + ++D
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTV-------LLD 276
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
T + +++ +G Y+ L + + ++G Q + + + +D C+ + A P + F
Sbjct: 277 TVSRASYLADGAYKALKK---ALTAAVGVQPV-ASPPKPYDLCFPKAVAGDA-PELVFTF 331
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPKY---------SILGAWQQQNMLIIYDLNV 431
+ P Y + G C+ I SILG+ QQ+N+ +++DL
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391
Query: 432 PALRFGSENCAN 443
L F +C++
Sbjct: 392 ETLSFKPADCSS 403
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 162/376 (43%), Gaps = 40/376 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y ++ +GTP + H+ DT S ++W C CIRC + TP +D AS+T +
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDVDASSTAKSV 142
Query: 151 PCDDPLC---RSPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFVPR 202
C D C +C +G C Y Y G T G ++ + G +
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGT 202
Query: 203 LAFGCSNDNSGFAFG---GKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
+ FGC + SG G + GI+GF S S SQL + +++ F++CL
Sbjct: 203 IIFGCGSKQSG-QLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL-DNNNGGG 260
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ G +V ++TTP+L H+ ++L I +G ++ AFD D G
Sbjct: 261 IFAIG---EVVSPKVKTTPML--SKSAHYSVNLNAIEVGNSVLELSSNAFDSGDD--KGV 313
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT 377
IID+GT + ++ + Y L+ ++IL S + + QE C+ Y +P++T
Sbjct: 314 IIDSGTTLVYLPDAVYNPLL---NEILAS--HPELTLHTVQESFTCFHYTDKLDRFPTVT 368
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQD-------DPKYSILGAWQQQNMLIIYDLN 430
F ++ + Y + +C Q+ +ILG N L++YD+
Sbjct: 369 FQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIE 428
Query: 431 VPALRFGSENCANGRQ 446
+ + + NC+ G Q
Sbjct: 429 NQVIGWTNHNCSGGIQ 444
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 149/354 (42%), Gaps = 25/354 (7%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L Y + V +G+P Q +L DT S + W QC+PC +C Q P+FDP +S+TYS C
Sbjct: 126 LEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 185
Query: 155 PLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C + NG +C Y Y G T G S +T A G + V FGCSN
Sbjct: 186 AACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVKSFQFGCSN 241
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
SG F + G++G SL SQ + FSYCL ++ + G
Sbjct: 242 VESG--FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGT 299
Query: 270 RDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
TP+L S P FY + L I +G + P F + G ++D+GT +T +
Sbjct: 300 SGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVITRL 353
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIV 387
Y L + + +Q P S D C+ + S + PS+ +V
Sbjct: 354 PPTAYSALSSAFKAGM----KQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF-SGGAVV 408
Query: 388 QPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ I + F A DD I+G QQ+ ++YD+ + F + C
Sbjct: 409 SLDASGIILSNCLAF-AANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 100/390 (25%), Positives = 171/390 (43%), Gaps = 45/390 (11%)
Query: 65 KARANYMASMSKPNAFQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASS 119
KAR Y++S++ + +P+A Q Y V NIGTP + + DT++
Sbjct: 57 KARFLYLSSLAG------VTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSND 110
Query: 120 LVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHV 176
W C C+ C ++ +FDP S++ + C+ P C+ +P + C + Y
Sbjct: 111 AAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG- 167
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQ 236
G +++T +P FGC N SG + + G++G PLSL SQ
Sbjct: 168 GSAIEAYLTQDTLTLATD----VIPNYTFGCINKASGTSLPAQ--GLMGLGRGPLSLISQ 221
Query: 237 LRNRIQGLFSYCLVREMEA--TSVIKFG-RDADVRRRDLETTPILLSDLRPH-FYLHLLE 292
+N Q FSYCL + + ++ G ++ +R ++TTP+L + R +Y++L+
Sbjct: 222 SQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR---IKTTPLLKNPRRSSLYYVNLVG 278
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
I +G IV P A G I D+GT T + Y + + + +++ +
Sbjct: 279 IRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSL 338
Query: 353 PYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK- 411
FD CY S +PS+TF + + P+N+ C+A+ P
Sbjct: 339 -----GGFDTCY---SGSVVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTN 390
Query: 412 ----YSILGAWQQQNMLIIYDLNVPALRFG 437
+++ + QQQN ++ D VP R G
Sbjct: 391 VNSVLNVIASMQQQNHRVLID--VPNSRLG 418
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 118/440 (26%), Positives = 186/440 (42%), Gaps = 53/440 (12%)
Query: 34 LKLIPIFSPESP------LYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIH 87
LKL P+ S +SP L+ ++ E + F S+ N A+ S +L I
Sbjct: 33 LKLYPMTSLKSPPNSTSLLFAYMFAKDEERIRYFH-SRLAKNSDANASFKKVGPKLAGIP 91
Query: 88 LP--MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRAS 144
L ++ Y V++ +G+P K ++ DT SS W QCQPC I C Q P+F+P AS
Sbjct: 92 LKSGLSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSAS 151
Query: 145 TTYSEIPC-------------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF 191
TY +PC ++P C Q+ CVY Y + G S++
Sbjct: 152 KTYKTVPCSSSQCSSLKSATLNEPTCSK----QSNACVYKASYGDSSFSLGYLSQDVLTL 207
Query: 192 -PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
P + +FV +GC DN G G+ GI+G + LS+ SQL + FSYCL
Sbjct: 208 TPSQTLSSFV----YGCGQDNQGLF--GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLP 261
Query: 251 REMEA-----TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPP 304
+ G + + TP+L + P Y LE I++ +
Sbjct: 262 TSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAA 321
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
++ + IID+GT +T + Y TL Y IL S Q+ P D C+
Sbjct: 322 SSYKVPT------IIDSGTVITRLPTPVYTTLKNAYVTIL-SKKYQQAP--GISLLDTCF 372
Query: 365 RYDSS--FKAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQ 421
+ + + P + + AD ++ N +E + G C+A+ +I+G +QQQ
Sbjct: 373 KGSLAGISEVAPDIRIIFKGGADLQLKGHNS-LVELETGITCLAMAGSSSIAIIGNYQQQ 431
Query: 422 NMLIIYDLNVPALRFGSENC 441
+ + YD+ + F C
Sbjct: 432 TVKVAYDVGNSRVGFAPGGC 451
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 154/352 (43%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + +++ DT S + W QC PC +C+ Q PIF+P S+++ + C +
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 73
Query: 157 C---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + + KC+Y Y G T G S ET +F G V +A GC +N G
Sbjct: 74 CGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMGCGRNNQG 129
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV-IKFGRDADVRRRDL 272
+G+LG PLS SQ +FSYCL R A + + FG A V +
Sbjct: 130 LFH--GAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSA-VPEKAR 186
Query: 273 ETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T + L ++Y+ L I + V PP AF + GTGG I+D+GT ++ +
Sbjct: 187 FTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPA 246
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
Y L + ++ I FD CY S A P++ + P +
Sbjct: 247 YTALRDAFRSLVTFPSAPGISL-----FDTCYDLSSMKTATLPAVVLDFDGGASMPLPAD 301
Query: 392 MYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ D G +C+A ++ +SI+G QQQ I D + + C
Sbjct: 302 GILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 158/369 (42%), Gaps = 44/369 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC--- 152
Y V++ +G+P K ++ DT SS W QCQPC I C Q P+F+P AS TY +PC
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 153 ----------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVP 201
++P C Q+ CVY Y + G S++ P + +FV
Sbjct: 163 QCSSLKSATLNEPTCSK----QSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFV- 217
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-----T 256
+GC DN G G+ GI+G + LS+ SQL + FSYCL
Sbjct: 218 ---YGCGQDNQGLF--GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKE 272
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTG 315
+ G + + TP+L + P Y LE I++ + ++ +
Sbjct: 273 GFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT---- 328
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS--FKAY 373
IID+GT +T + Y TL Y IL S Q+ P D C++ + +
Sbjct: 329 --IIDSGTVITRLPTPVYTTLKNAYVTIL-SKKYQQAP--GISLLDTCFKGSLAGISEVA 383
Query: 374 PSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
P + + AD ++ N +E + G C+A+ +I+G +QQQ + + YD+
Sbjct: 384 PDIRIIFKGGADLQLKGHNS-LVELETGITCLAMAGSSSIAIIGNYQQQTVKVAYDVGNS 442
Query: 433 ALRFGSENC 441
+ F C
Sbjct: 443 RVGFAPGGC 451
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 120/427 (28%), Positives = 179/427 (41%), Gaps = 59/427 (13%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
NL+ E I + + S R A K + P+ + Y V++ IGTP
Sbjct: 47 NLTDHELIRRAVQRSLDRPGVAARNRKAVVGEA------PLVPRGGEYLVKLGIGTPQHY 100
Query: 110 QHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKC---Q 164
DTAS LVW QCQPC+ C+ Q PIF+PR S++Y+ +PC C +C
Sbjct: 101 FSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDD 160
Query: 165 NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG---KIS 221
+ C Y +Y VT G + + A G + GCS+ + GG + S
Sbjct: 161 DQACRYNYKYSGNAVTNGTLAIDKLAV----GGNVFHAVVLGCSDS----SVGGPPPQAS 212
Query: 222 GILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS---VIKFGRDAD-VRRRDLETTPI 277
G++G PLSL SQL R F YCL M T V+ G AD VR T
Sbjct: 213 GLVGLARGPLSLLSQLSVR---RFMYCLPPPMSRTPGKLVLGAGAGADAVRNVSDRVTVT 269
Query: 278 LLSDLR--PHFYLHLLEISIGRH---IVRFP--PGAFDIMRDGTG----------GFIID 320
+ S R ++YL+ +++G +R P P A G G G I+D
Sbjct: 270 MSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIVD 329
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY------RYDSSFKAYP 374
+ ++F+ Y L ++ +R R + D C+ D +
Sbjct: 330 VASTISFLEASLYDELADDLEEEIR---LPRATPSTRLGLDLCFILPEGVGIDRVYVPTV 386
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
SM+F + ++ + F+E R C+ I SILG +QQQNM ++Y+L +
Sbjct: 387 SMSF---DGRWLELERDRLFLEDGR-MMCLMIGRTSGVSILGNYQQQNMHVLYNLRRGKI 442
Query: 435 RFGSENC 441
F +C
Sbjct: 443 TFAKASC 449
>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 410
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 164/367 (44%), Gaps = 20/367 (5%)
Query: 84 EDIHLPMAKQDLF-YSVEVNIGTP--MKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
ED++LP++ F Y V V+IGT + + L DT +S W C+PC Q +F
Sbjct: 54 EDLNLPISTSARFIYGVFVSIGTGEGTRRKVLALDTGASTSWLMCEPCQPPLPQVGHLFS 113
Query: 141 PRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTF 199
P AS T+ + D P+C P++ + C + + G SR+TF R+G
Sbjct: 114 PAASPTFQGVRGDGPVCTVPYRHTDKGCSFRFPF-----AAGYLSRDTFHLRSGRSGTVM 168
Query: 200 --VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT- 256
VP + FGC++ +GF G +SG+L + SPLS + L R G FSYCL +
Sbjct: 169 ESVPGIMFGCAHSVTGFHNDGTLSGVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTHNP 228
Query: 257 -SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
S ++FG D TT ++ + + P ++L+++ IS+G + F G
Sbjct: 229 DSFLRFGADVPSLPPHAHTTTLVHAGV-PGYHLNIVGISLGNKRLHIDRHVF----AAGG 283
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPS 375
G I+ +T I Y + ++ LG R+ + + + S P
Sbjct: 284 GCSINPAVTITRIMELAYLAVEHALVAHMKELGSGRVKGMPGRSLCFDHMDRSVRVQLPG 343
Query: 376 MTFHLQE-ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
M+FH ++ A+ E ++ + F V + +++GA QQ + +D+ L
Sbjct: 344 MSFHFEDGAELRFAAEQLFDVRVMAACFLV-VGRGHHQTVIGAAQQVDTRFTFDIAAGRL 402
Query: 435 RFGSENC 441
F E C
Sbjct: 403 AFVPETC 409
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 161/374 (43%), Gaps = 28/374 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP--IFDPRASTTYSEIPCDD 154
Y V + +G+P + L+ DT S L W +C C P F R STT+S C
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFS 142
Query: 155 PLCR-----SPFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-VPRLA 204
LC+ +P C + + C Y Y G T G S+ET +G + +A
Sbjct: 143 SLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSIA 202
Query: 205 FGCSNDNSGFAFGGK----ISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA---TS 257
FGC SG + G SG++G P+S +SQL R FSYCL+ + TS
Sbjct: 203 FGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPTS 262
Query: 258 VIKFGRDADVRRRD---LETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ G ++ + + TP+L++ P F Y+ + + + + P + + G
Sbjct: 263 YLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDELG 322
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKA 372
GG +ID+GT +TF+ Y+ ++ + + ++ + FD C S
Sbjct: 323 NGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCVNVTGVSRPR 382
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ----DDPKYSILGAWQQQNMLIIYD 428
+P ++ L P YFI+ G C+AIQ + ++S++G QQ L+ +D
Sbjct: 383 FPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFLLEFD 442
Query: 429 LNVPALRFGSENCA 442
L F CA
Sbjct: 443 RGKSRLGFSRRGCA 456
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 163/376 (43%), Gaps = 39/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP + ++ DT S L W QC PC+ CF+Q P+FDP AS++Y + C DP
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPR 205
Query: 157 CRSPFKCQNGKCVYTRR---------YHVGDVTRGLASRETFAFPVR----NGFTFVPRL 203
C + RR Y GD + +F V + V +
Sbjct: 206 CGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVDGV 265
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLVRE-MEATSVIKF 261
FGC + N G +G+LG PLS +SQLR G FSYCLV + S + F
Sbjct: 266 VFGCGHRNRGLFH--GAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVDHGSDVASKVVF 323
Query: 262 GRD------ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
G D A R + P S +Y+ L + +G ++ +D G+G
Sbjct: 324 GEDDALALAAHPRLKYTAFAPA-SSPADTFYYVRLTGVLVGGELLNISSDTWDASEGGSG 382
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY---CYRYDSSFK- 371
G IID+GT +++ YQ + + + + R Y +F CY +
Sbjct: 383 GTIIDSGTTLSYFVEPAYQVIRRAF------IDRMSGSYPPVPDFPVLSPCYNVSGVERP 436
Query: 372 AYPSMTFHLQEADYIVQPENMYFI--EPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIY 427
P ++ + P YFI +PD G C+A+ P+ SI+G +QQQN + Y
Sbjct: 437 EVPELSLLFADGAVWDFPAENYFIRLDPD-GIMCLAVLGTPRTGMSIIGNFQQQNFHVAY 495
Query: 428 DLNVPALRFGSENCAN 443
DL+ L F CA
Sbjct: 496 DLHNNRLGFAPRRCAE 511
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 104/311 (33%), Positives = 149/311 (47%), Gaps = 31/311 (9%)
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPF--KCQNGK------CVYTRRYHVGDVTRGLASRET 188
P FD S+T CD LC+ C N K CVYT Y+ VT GL +
Sbjct: 23 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82
Query: 189 FAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
F F VP +AFGC N+G F +GI GF PLSL SQL+ G FS+C
Sbjct: 83 FTF---GAGASVPGVAFGCGLFNNG-VFKSNETGIAGFGRGPLSLPSQLK---VGNFSHC 135
Query: 249 L--VREMEATSVIKFGRDADV---RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRF 302
V ++ ++V+ AD+ R +++TP++ + P FY L L I++G +
Sbjct: 136 FTAVNGLKQSTVL-LDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPV 194
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
P AF + +GTGG IID+GT +T + P Q D+ + +P NA+ +
Sbjct: 195 PESAF-ALTNGTGGTIIDSGTSITSL---PPQVYQVVRDEFAAQIKLPVVPGNATGPYT- 249
Query: 363 CYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGR---FCVAIQDDPKYSILGAW 418
C+ S K P + H + A + EN F PD C+AI + +I+G +
Sbjct: 250 CFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNF 309
Query: 419 QQQNMLIIYDL 429
QQQNM ++YDL
Sbjct: 310 QQQNMHVLYDL 320
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 117/446 (26%), Positives = 171/446 (38%), Gaps = 64/446 (14%)
Query: 37 IPIFSPESPLYP-----GNLSQSERIHKMFEISKARANYMASMSKPN--AFQELEDIH-- 87
+P+ P P G S +ER+ + +AR NY+ + + A L D
Sbjct: 19 VPLVHRHGPCAPSAASGGKPSLAERLRR----DRARTNYIVTKATGGRTAATALSDAAGG 74
Query: 88 -------LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPI 138
L + L Y V + IGTP Q +L DT S L W QC+PC C+ Q P+
Sbjct: 75 GTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPL 134
Query: 139 FDPRASTTYSEIPCDDPLCRSPFKCQNGK------------CVYTRRYHVGDVTRGLASR 186
FDP +S++Y+ +PCD CR G C Y Y T G+ S
Sbjct: 135 FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYST 194
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
ET + G V FGC + G K G+LG +P SL SQ ++ G FS
Sbjct: 195 ETLTL--KPGV-VVADFGFGCGDHQHGPYE--KFDGLLGLGGAPESLVSQTSSQFGGPFS 249
Query: 247 YCLVREMEATSVIKFG----RDADVRRRDLETTPILLSDLRPHFYL-HLLEISIGRHIVR 301
YCL + G + L TP+ P FY+ L IS+G +
Sbjct: 250 YCLPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLA 309
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
PP AF + G +ID+GT +T + Y L + + + +P + D
Sbjct: 310 IPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY--RLLPPSNGGVLD 361
Query: 362 YCYRYDSSFKAYP---SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSIL 415
CY + S+TF + P + C+A D I+
Sbjct: 362 TCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLV------DGCLAFAGAGTDNAIGII 415
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENC 441
G Q+ ++YD + F + C
Sbjct: 416 GNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 154/352 (43%), Gaps = 19/352 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + +++ DT S + W QC PC +C+ Q PIF+P S+++ + C +
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 140
Query: 157 C---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + + +C+Y Y G T G S ET +F G V +A GC +N G
Sbjct: 141 CGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMGCGRNNQG 196
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV-IKFGRDADVRRRDL 272
+G+LG PLS SQ +FSYCL R A + + FG A V +
Sbjct: 197 LFH--GAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSA-VPEKAR 253
Query: 273 ETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
T + L ++Y+ L I + V PP AF + GTGG I+D+GT ++ +
Sbjct: 254 FTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPA 313
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
Y L + ++ I FD CY S A P++ + P +
Sbjct: 314 YTALRDAFRSLVTFPSAPGISL-----FDTCYDLSSMKTATLPAVVLDFDGGASMPLPAD 368
Query: 392 MYFIE-PDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ D G +C+A ++ +SI+G QQQ I D + + C
Sbjct: 369 GILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 98/332 (29%), Positives = 153/332 (46%), Gaps = 34/332 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++ +IG P DT S L+W +C PC C +P++DP S + ++PC L
Sbjct: 87 YIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQL 146
Query: 157 CRS-------PFKCQNGK--CVYTRRY-HVGD-VTRGLASRETFAFPVRNGFTFVPRLAF 205
C++ +C + C Y Y H GD T+G+ ETF F +G+ ++F
Sbjct: 147 CQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTF--GDGY-VANNVSF 203
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDA 265
G S+ G FGG +G++G LSL SQL G F+YCL + S I FG A
Sbjct: 204 GRSDTIDGSQFGGT-AGLVGLGRGHLSLVSQLG---AGRFAYCLAADPNVYSTILFGSLA 259
Query: 266 --DVRRRDLETTPILLS---DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
D D+ +TP++ + D H+Y++L IS+G + G F I DG+GG D
Sbjct: 260 ALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFD 319
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
+G T +++ YQ + Q + QR+ Y+A + + + P + H
Sbjct: 320 SGAIDTSLKDAAYQVVRQAITSEI-----QRLGYDAGDDTCFVAANQQAVAQMPPLVLHF 374
Query: 381 QE-ADYIVQPENMYFIEPDRG----RFCVAIQ 407
+ AD + N Y +G C+AI+
Sbjct: 375 DDGADMSLNGRN-YLKTSTKGPSEVLVCMAIK 405
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 172/387 (44%), Gaps = 37/387 (9%)
Query: 81 QELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQT 135
Q D +L +K + Y +V +G+P ++ DT S ++W C C C
Sbjct: 89 QGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGID 148
Query: 136 TPIFDPRASTTYSEIPCDDPLCRSPFKC------QNGKCVYTRRYHVGDVTRGLASRETF 189
FD S T + C DP+C S F+ +N +C Y+ RY G T G +TF
Sbjct: 149 LHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTF 208
Query: 190 AFPVRNGFTFVPR----LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--I 241
F G + V + FGCS SG + GI GF LS+ SQL +R
Sbjct: 209 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 268
Query: 242 QGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVR 301
+FS+CL + V G ++ + +P++ S +PH+ L+LL I + ++
Sbjct: 269 PPVFSHCLKGDGSGGGVFVLG---EILVPGMVYSPLVPS--QPHYNLNLLSIGVNGQMLP 323
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
F+ T G I+DTGT +T++ Y + + I S+ + P ++ E
Sbjct: 324 LDAAVFE--ASNTRGTIVDTGTTLTYLVKEAYDLFL---NAISNSVSQLVTPIISNGE-- 376
Query: 362 YCYRYDSSFK-AYPSMTFHLQ-EADYIVQPENMYF---IEPDRGRFCVAIQDDP-KYSIL 415
CY +S +PS++ + A +++P++ F I +C+ Q P + +IL
Sbjct: 377 QCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTIL 436
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENCA 442
G ++ + +YDL + + S +C+
Sbjct: 437 GDLVLKDKVFVYDLARQRIGWASYDCS 463
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 170/398 (42%), Gaps = 47/398 (11%)
Query: 71 MASMSKPNAFQELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
+ +M+ Q + + +P+ + L Y V V +G K L+ DT S L W QCQ
Sbjct: 57 IKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ 114
Query: 127 PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---------SPFKCQNGK----CVYTRR 173
PC C++Q P++DP S++Y + C+ C+ P NG C Y
Sbjct: 115 PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVS 174
Query: 174 YHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSL 233
Y G TRG + E+ G T + FGC +N G SG++G S +SL
Sbjct: 175 YGDGSYTRGDLASESILL----GDTKLENFVFGCGRNNKGLFG--GSSGLMGLGRSSVSL 228
Query: 234 SSQLRNRIQGLFSYCL-VREMEATSVIKFGRDADV--RRRDLETTPILLS-DLRPHFYLH 289
SQ G+FSYCL E A+ + FG D+ V + TP++ + LR + L+
Sbjct: 229 VSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILN 288
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L SIG V +F G +ID+GT +T + Y+ + + + + + G
Sbjct: 289 LTGASIGG--VELKSSSFG------RGILIDSGTVITRLPPSIYKAV--KIEFLKQFSGF 338
Query: 350 QRIPYNASQEFDYCYR---YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
P D C+ Y+ M F + YF++PD C+A+
Sbjct: 339 PTAP--GYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLAL 396
Query: 407 QD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + I+G +QQ+N +IYD L ENC
Sbjct: 397 ASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 170/398 (42%), Gaps = 47/398 (11%)
Query: 71 MASMSKPNAFQELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
+ +M+ Q + + +P+ + L Y V V +G K L+ DT S L W QCQ
Sbjct: 105 IKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ 162
Query: 127 PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---------SPFKCQNGK----CVYTRR 173
PC C++Q P++DP S++Y + C+ C+ P NG C Y
Sbjct: 163 PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVS 222
Query: 174 YHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSL 233
Y G TRG + E+ G T + FGC +N G SG++G S +SL
Sbjct: 223 YGDGSYTRGDLASESILL----GDTKLENFVFGCGRNNKGLFG--GSSGLMGLGRSSVSL 276
Query: 234 SSQLRNRIQGLFSYCL-VREMEATSVIKFGRDADV--RRRDLETTPILLS-DLRPHFYLH 289
SQ G+FSYCL E A+ + FG D+ V + TP++ + LR + L+
Sbjct: 277 VSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILN 336
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L SIG V +F G +ID+GT +T + Y+ + + + + + G
Sbjct: 337 LTGASIGG--VELKSSSFG------RGILIDSGTVITRLPPSIYKAV--KIEFLKQFSGF 386
Query: 350 QRIPYNASQEFDYCYR---YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
P D C+ Y+ M F + YF++PD C+A+
Sbjct: 387 PTAP--GYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLAL 444
Query: 407 QD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + I+G +QQ+N +IYD L ENC
Sbjct: 445 ASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 150/354 (42%), Gaps = 25/354 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +GTP K ++FDT S L W QC+PC C++Q P+FDP S+TY+ + C P
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 157 CRS--PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C+ C + +C Y +Y T G R+T + +P FGC + N+G
Sbjct: 209 CQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFGCGDQNAG 265
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G++ G+ G +SL SQ F+YCL + G + +
Sbjct: 266 LF--GQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLG---GAPPANAQ 320
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
T + +Y+ L+ I +G +R P AF +ID+GT +T + Y
Sbjct: 321 FTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVITRLPPRAY 376
Query: 334 QTLMQRYDQILRSLGR-QRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
L + RS+ + ++ P A D CY + A P++ +
Sbjct: 377 APLRAAF---ARSMAQYKKAP--ALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFT 431
Query: 392 MYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ C+A DD +ILG QQ+ + YD+ + FG++ C+
Sbjct: 432 GVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 83/257 (32%), Positives = 113/257 (43%), Gaps = 18/257 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDD 154
Y V ++GTP Q L DT S L W QC+PC C+ Q P+FDP S++Y+ +PC
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGR 196
Query: 155 PLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C C +C Y Y G T G+ S +T V FGC +
Sbjct: 197 SACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAAN---ATVQGFLFGCGHA 253
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR 270
SG F G I G+LGF SL Q G+FSYCL + T + G + V
Sbjct: 254 QSGGLFTG-IDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGV-AP 311
Query: 271 DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
TT +L S P +Y+ +L IS+G + P AF G ++DTGT +T +
Sbjct: 312 GFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAF------AAGTVVDTGTVITRLP 365
Query: 330 NGPYQTLMQRYDQILRS 346
Y L + + S
Sbjct: 366 PAAYAALRSAFRSGMAS 382
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 105/349 (30%), Positives = 149/349 (42%), Gaps = 44/349 (12%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y V +GTP P L+ DT S +VW QC PC +C+ Q+ +FDPR S +Y+
Sbjct: 135 LAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAA 194
Query: 150 IPCDDPLC-------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+ C P C + G C+Y Y G VT G + ET F R VPR
Sbjct: 195 VRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF-ARG--ARVPR 251
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
+A GC +DN G +G+LG LSL +Q R FSYC G
Sbjct: 252 VAVGCGHDNEGLFV--AAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQ-----------G 298
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
D D R ++ + H + +G +R P G GG I+D+G
Sbjct: 299 SDLDHR--------TIIRTVHQHVGGARVR-GVGERSLRLDPST------GRGGVILDSG 343
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQ 381
T VT + Y + + + + G R+ FD CY P+++ HL
Sbjct: 344 TSVTRLARPVYVAVREAFRA---AAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLA 400
Query: 382 EADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYD 428
+ P Y I D RG FC+A+ D SI+G QQQ +++D
Sbjct: 401 GGAEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFD 449
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 150/354 (42%), Gaps = 25/354 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +GTP K ++FDT S L W QC+PC C++Q P+FDP S+TY+ + C P
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 157 CRS--PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C+ C + +C Y +Y T G R+T + +P FGC + N+G
Sbjct: 209 CQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFGCGDQNAG 265
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G++ G+ G +SL SQ F+YCL + G + +
Sbjct: 266 LF--GQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLG---GAPPANAQ 320
Query: 274 TTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
T + +Y+ L+ I +G +R P AF +ID+GT +T + Y
Sbjct: 321 FTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVITRLPPRAY 376
Query: 334 QTLMQRYDQILRSLGR-QRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPEN 391
L + RS+ + ++ P A D CY + A P++ +
Sbjct: 377 APLRAAF---ARSMAQYKKAP--ALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFT 431
Query: 392 MYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ C+A DD +ILG QQ+ + YD+ + FG++ C+
Sbjct: 432 GVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 111/379 (29%), Positives = 167/379 (44%), Gaps = 54/379 (14%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRA 143
I L M Y +E ++GTP + L DT S L+W +C C Q +P + P A
Sbjct: 80 IPLRMDDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNA 139
Query: 144 STTYSEIPCDDPLCR-----SPFKCQ--NGKCVYTRRYHVGD----VTRGLASRETFAFP 192
S+T++++PC D LC S C +C Y Y +GD T+G +RETF
Sbjct: 140 SSTFAKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTL- 198
Query: 193 VRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE 252
G VP + FGC+ + G SG++G PLSL SQL F YCL +
Sbjct: 199 ---GADAVPSVRFGCTT--ASEGGYGSGSGLVGLGRGPLSLVSQLNAST---FMYCLTSD 250
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
S + FG A + +++T +L S + ++L ISIG PG +
Sbjct: 251 ASKASPLLFGSLASLTGAQVQSTGLLAS--TTFYAVNLRSISIGSATT---PGVGE---- 301
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRY------DQILRSLGRQRIPYNASQEFDYCYRY 366
G + D+GT +T++ Y + DQ+ + G F+ C++
Sbjct: 302 -PEGVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDG-----------FEACFQK 349
Query: 367 DS----SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQN 422
+ S A P+M H AD + P Y +E + G C +Q P SI+G Q N
Sbjct: 350 PANGRLSNAAVPTMVLHFDGAD-MALPVANYVVEVEDGVVCWIVQRSPSLSIIGNIMQVN 408
Query: 423 MLIIYDLNVPALRFGSENC 441
L+++D++ L F NC
Sbjct: 409 YLVLHDVHRSVLSFQPANC 427
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 112/444 (25%), Positives = 190/444 (42%), Gaps = 57/444 (12%)
Query: 16 FSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASM 74
F L L ++ G ++K+ ++SP+SP P +S + + +M +AR +++S+
Sbjct: 10 FLFLSLVQGLNTRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQFLSSL 69
Query: 75 SKPNAFQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI 129
++ +P+A Q Y V+ N+GTP + + DT++ W C C+
Sbjct: 70 VGRKSW-------VPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCV 122
Query: 130 RCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASR 186
C ++ +F+ STT+ + CD P C+ +P C C + Y + L +R
Sbjct: 123 GC---SSTVFNSVTSTTFKTLGCDAPQCKQVPNP-TCGGSTCTWNTTYGGSTILSNL-TR 177
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
+T A VP FGC +G + + LG S SQ ++ + FS
Sbjct: 178 DTIALSTD----IVPGYTFGCIQKTTGSSVPPQGLLGLGRGPL--SFLSQTQDLYKSTFS 231
Query: 247 YCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFP 303
YCL R + + ++ G R ++TTP+L + R +Y++L+ I +GR IV P
Sbjct: 232 YCLPSFRTLNFSGTLRLGPAGQPLR--IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIP 289
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY-----DQILRSLGRQRIPYNASQ 358
A G I D+GT T + Y + + + I+ SLG
Sbjct: 290 ASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLG---------- 339
Query: 359 EFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YS 413
FD CY + P+MTF + + +N+ C+A+ P +
Sbjct: 340 GFDTCY---TGPIVAPTMTFMFSGMNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLN 396
Query: 414 ILGAWQQQNMLIIYDLNVPALRFG 437
++ QQQN I++D VP R G
Sbjct: 397 VIANMQQQNHRILFD--VPNSRIG 418
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 157/365 (43%), Gaps = 36/365 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP--IFDPRASTTYSEIPCDD 154
Y V++ +GTP++ L+ DT S L W ++C + P +F P+ S +++ IPC
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTW------VKCAGASPPGRVFRPKTSRSWAPIPCSS 169
Query: 155 PLCR--SPFKCQN-----GKCVYTRRYHVGDV-TRGLASRETFAFPVRNG-FTFVPRLAF 205
C+ PF N C Y RY G RG+ E+ + G + +
Sbjct: 170 DTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVL 229
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSVIKFG 262
GCS+ + G +F G+L + +S ++Q R G FSYCLV + AT + FG
Sbjct: 230 GCSSSHDGQSF-RSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFG 288
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
V R T + L P + + + I + + P +D +GG I+D+G
Sbjct: 289 -PGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAK---SGGVILDSG 344
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS----SFKAYPSMTF 378
+T + Y+ ++ + L + + P F++CY + + + + P +
Sbjct: 345 NTLTVLAAPAYKAVVAALSKHLDGVPKVSFP-----PFEHCYNWTARRPGAPEIIPKLAV 399
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDLNVPALRF 436
+ + P Y I+ G C+ +Q+ P S++G QQ L +DL +RF
Sbjct: 400 QFAGSARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRF 459
Query: 437 GSENC 441
NC
Sbjct: 460 KQSNC 464
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 170/398 (42%), Gaps = 47/398 (11%)
Query: 71 MASMSKPNAFQELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
+ +M+ Q + + +P+ + L Y V V +G K L+ DT S L W QCQ
Sbjct: 105 IKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ 162
Query: 127 PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---------SPFKCQNGK----CVYTRR 173
PC C++Q P++DP S++Y + C+ C+ P NG C Y
Sbjct: 163 PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVS 222
Query: 174 YHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSL 233
Y G TRG + E+ G T + FGC +N G SG++G S +SL
Sbjct: 223 YGDGSYTRGDLASESILL----GDTKLENFVFGCGRNNKGLFG--GSSGLMGLGRSSVSL 276
Query: 234 SSQLRNRIQGLFSYCL-VREMEATSVIKFGRDADV--RRRDLETTPILLS-DLRPHFYLH 289
SQ G+FSYCL E A+ + FG D+ V + TP++ + LR + L+
Sbjct: 277 VSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILN 336
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L SIG V +F G +ID+GT +T + Y+ + + + + + G
Sbjct: 337 LTGASIGG--VELKSSSFG------RGILIDSGTVITRLPPSIYKAV--KIEFLKQFSGF 386
Query: 350 QRIPYNASQEFDYCYR---YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
P D C+ Y+ M F + YF++PD C+A+
Sbjct: 387 PTAP--GYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLAL 444
Query: 407 QD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + I+G +QQ+N +IYD L ENC
Sbjct: 445 ASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/208 (38%), Positives = 115/208 (55%), Gaps = 19/208 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y++ ++IGTP +L DT SSL+WTQC PC C + P F P +S+T+S++PC L
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 157 CR---SPFK-CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
C+ SP++ C CVY Y +G T G + ET G +F P + FGCS +N
Sbjct: 150 CQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHV---GGASF-PGVTFGCSTEN- 203
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRD 271
G SGI+G SPLSL SQ+ FSYCL +A S I FG A V +
Sbjct: 204 --GVGNSSSGIVGLGRSPLSLVSQVG---VARFSYCLRSNADAGDSPILFGSLAKVTGGN 258
Query: 272 LETTPILLSDLRP---HFYLHLLEISIG 296
+++TP+L + P ++Y++L I++G
Sbjct: 259 VQSTPLLENPEMPSSSYYYVNLTGITVG 286
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 151/357 (42%), Gaps = 31/357 (8%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L Y + V +G+P Q +L DT S + W QC+PC +C Q P+FDP +S+TYS C
Sbjct: 196 LEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 255
Query: 155 PLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C + NG +C Y Y G T G S +T A G + V FGCSN
Sbjct: 256 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSN 311
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
SG F + G++G SL SQ + FSYCL ++ + G
Sbjct: 312 VESG--FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGT 369
Query: 270 RDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
TP+L S P FY + L I +G + P F + G ++D+GT +T +
Sbjct: 370 SGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVITRL 423
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIV 387
Y L + + +Q P S D C+ + S + PS+ + +V
Sbjct: 424 PPTAYSALSSAFKAGM----KQYPPAQPSGILDTCFDFSGQSSVSIPSVAL-VFSGGAVV 478
Query: 388 QPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ I + C+A DD I+G QQ+ ++YD+ + F + C
Sbjct: 479 SLDASGIILSN----CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/411 (26%), Positives = 180/411 (43%), Gaps = 45/411 (10%)
Query: 62 EISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLV 121
IS RA+ + A +L L + Y EV +GTP K ++ DT S ++
Sbjct: 53 NISALRAHDGTRHGRLLATADLPLGGLGLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDIL 112
Query: 122 WTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRSPF-----KCQ-NGKCVY 170
W C C +C ++ ++DP+AS+T S + CD C F KC N C Y
Sbjct: 113 WVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVMCDQGFCADTFGGRLPKCSANVPCEY 172
Query: 171 TRRYHVGDVTRGLASRETFAFPVRNGFTFV----PRLAFGCSNDNSG--FAFGGKISGIL 224
+ Y G T G + F G + FGC G + + GIL
Sbjct: 173 SVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGIL 232
Query: 225 GFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDL 282
GF + S+ SQL +++ +F++CL ++ + G DV + ++TTP++
Sbjct: 233 GFGEANTSMLSQLATAGKVKKIFAHCL-DTIKGGGIFAIG---DVVQPKVKTTPLVAD-- 286
Query: 283 RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG-GFIIDTGTPVTFIRNGPYQTLMQRYD 341
+PH+ ++L I +G + P DI + G G IID+GT +T++ ++ +M
Sbjct: 287 KPHYNVNLKTIDVGGTTLELPA---DIFKPGEKRGTIIDSGTTLTYLPELVFKKVM---- 339
Query: 342 QILRSLGR-QRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQEADYIVQPENMYFIEPDR 399
L + Q I ++ Q+F C+ Y S +P++TFH ++ + + YF
Sbjct: 340 --LAVFNKHQDITFHDVQDF-LCFEYSGSVDDGFPTLTFHFEDDLALHVYPHEYFFPNGN 396
Query: 400 GRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+CV Q D ++G N L++YDL + + NC++
Sbjct: 397 DVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDYNCSS 447
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 31/357 (8%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L Y + V +G+P Q +L DT S + W QC+PC +C Q P+FDP +S+TYS C
Sbjct: 126 LEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 185
Query: 155 PLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C + NG +C Y Y G T G S +T A G + V FGCSN
Sbjct: 186 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSN 241
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
SG F + G++G SL SQ + FSYCL ++ + G
Sbjct: 242 VESG--FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGT 299
Query: 270 RDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
TP+L S P FY + L I +G + P F + G ++D+GT +T +
Sbjct: 300 SGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVITRL 353
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYIV 387
Y L + + +Q P S D C+ + S + PS+ +
Sbjct: 354 PPTAYSALSSAFKAGM----KQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVS 409
Query: 388 QPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ + C+A DD I+G QQ+ ++YD+ + F + C
Sbjct: 410 LDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 160/357 (44%), Gaps = 36/357 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y IGTP + D +S LVWT C T F+P STT +++PC D
Sbjct: 99 MYVFSYGIGTPPQQVSGALDISSDLVWTACG--------ATAPFNPVRSTTVADVPCTDD 150
Query: 156 LCR--SPFKCQNG--KCVYTRRYHVGDV-TRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C+ +P C G +C YT Y G T GL E F F G T + + FGC
Sbjct: 151 ACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTF----GDTRIDGVVFGCGLK 206
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCLVRE--MEATSVIKFGRDADV 267
N G F G +SG++G LSL SQL+ +R FSY + ++ S I FG DA
Sbjct: 207 NVG-DFSG-VSGVIGLGRGNLSLVSQLQVDR----FSYHFAPDDSVDTQSFILFGDDATP 260
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDTGTPV 325
+ +T +L SD P +Y+ L I + + P G FD+ +DG+GG + V
Sbjct: 261 QTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLV 320
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEAD 384
T + Y+ L Q + +G + ++ D CY +S KA PSM
Sbjct: 321 TVLEEAAYKPLRQ---AVASKIGLPAV-NGSALGLDLCYTGESLAKAKVPSMALVFAGGA 376
Query: 385 YI-VQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ ++ N ++++ G C+ I S+LG+ Q ++YD+N L F S
Sbjct: 377 VMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVFES 433
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/410 (26%), Positives = 184/410 (44%), Gaps = 43/410 (10%)
Query: 62 EISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLV 121
IS RA+ + A +L L + Y E+ +GTP K ++ DT S ++
Sbjct: 51 NISALRAHDGTRHGRLLAAADLPLGGLGLPTDTGLYYTEIKLGTPPKHYYVQVDTGSDIL 110
Query: 122 WTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRSPF-----KC-QNGKCVY 170
W C C +C ++ ++DP+AS+T S + CD C + F KC N C Y
Sbjct: 111 WVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFCAATFGGKLPKCGANVPCEY 170
Query: 171 TRRYHVGDVTRGLASRETFAFP--VRNGFT--FVPRLAFGCSNDNSG--FAFGGKISGIL 224
+ Y G T G + F R+G T + FGC G + + GIL
Sbjct: 171 SVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGIL 230
Query: 225 GFNASPLSLSSQLR--NRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDL 282
GF + S+ SQL +++ +F++CL ++ + G DV + ++TTP++
Sbjct: 231 GFGEANTSMLSQLTTAGKVKKIFAHCL-DTIKGGGIFSIG---DVVQPKVKTTPLVAD-- 284
Query: 283 RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ 342
+PH+ ++L I +G ++ P F+ G IID+GT +T++ ++ +M
Sbjct: 285 KPHYNVNLKTIDVGGTTLQLPAHIFEPGEK--KGTIIDSGTTLTYLPELVFKEVM----- 337
Query: 343 ILRSLGR-QRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQEADYIVQPENMYFIEPDRG 400
L + Q I ++ Q F C++Y S +P++TFH ++ + + YF
Sbjct: 338 -LAVFNKHQDITFHDVQGF-LCFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFANGND 395
Query: 401 RFCVAIQDDPKYS-------ILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+CV Q+ S ++G N L+IYDL + + NC++
Sbjct: 396 VYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCSS 445
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/456 (25%), Positives = 196/456 (42%), Gaps = 48/456 (10%)
Query: 4 VQALPLAAFFSYFSVLFLTHFTSSEST------GFSLKLIPIFSPESPLYPGN-LSQSER 56
++A PL F F++ H ++T G +L++ +FSP SP P +S E
Sbjct: 1 MKATPLVLFL-LFTIAKGLHNPKCDATHQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEES 59
Query: 57 IHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDT 116
+ K+ +AR Y++S+ + + Q Y V+ IGTP + L DT
Sbjct: 60 VLKLQAKDQARMQYLSSLVARRSIVPIASGR--QITQSPTYIVKAKIGTPAQTLLLAMDT 117
Query: 117 ASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC---RSPFKCQNGKCVYTRR 173
++ W C C+ C TTP F P STT+ ++ C C R+P C C +
Sbjct: 118 SNDASWVPCTACVGC-STTTP-FAPAKSTTFKKVGCGASQCKQVRNP-TCDGSACAFNFT 174
Query: 174 YHVGDVTRGLASRETFAFPVRNGFTF----VPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y V L V++ T VP AFGC +G + + LG
Sbjct: 175 YGTSSVAASL---------VQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPL 225
Query: 230 PLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-F 286
SL +Q + Q FSYCL + + + ++ G A +R ++ TP+L + R +
Sbjct: 226 --SLLAQTQKLYQSTFSYCLPSFKTLNFSGSLRLGPVAQPKR--IKFTPLLKNPRRSSLY 281
Query: 287 YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS 346
Y++L+ I +GR IV PP A + G + D+GT T + Y + + + R
Sbjct: 282 YVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRR--RI 339
Query: 347 LGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI 406
+++ + FD CY ++ P++TF + + P+N+ C+A+
Sbjct: 340 AVHKKLTVTSLGGFDTCY---TAPIVAPTITFMFSGMNVTLPPDNILIHSTAGSVTCLAM 396
Query: 407 QDDPK-----YSILGAWQQQNMLIIYDLNVPALRFG 437
P +++ QQQN +++D VP R G
Sbjct: 397 APAPDNVNSVLNVIANMQQQNHRVLFD--VPNSRLG 430
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 150/364 (41%), Gaps = 31/364 (8%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
L + L Y + V +G+P Q +L DT S + W QC+PC +C Q P+FDP +S+TY
Sbjct: 43 LGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTY 102
Query: 148 SEIPCDDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
S C C + NG +C Y Y G T G S +T A G + V
Sbjct: 103 SPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRS 158
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
FGCSN SG F + G++G SL SQ + FSYCL ++ + G
Sbjct: 159 FQFGCSNVESG--FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLG 216
Query: 263 RDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
TP+L S P FY + L I +G + P F + G ++D+
Sbjct: 217 AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDS 270
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHL 380
GT +T + Y L + + +Q P S D C+ + S + PS+
Sbjct: 271 GTVITRLPPTAYSALSSAFKAGM----KQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 326
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+ + + C+A DD I+G QQ+ ++YD+ + F
Sbjct: 327 SGGAVVSLDASGIILSN-----CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 381
Query: 438 SENC 441
+ C
Sbjct: 382 AGAC 385
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 124/453 (27%), Positives = 201/453 (44%), Gaps = 44/453 (9%)
Query: 9 LAAFFSYFSVLFLTHFTSSEST------GFSLKLIPIFSPESPLYPGNLSQSERIHKMFE 62
+ A S F L L + S++T GF+ L S SPL +LS +R+ F
Sbjct: 1 MVATISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60
Query: 63 ISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVW 122
S +R+ + + + N +L+ P+ Y + V+IGTP + DT S L+W
Sbjct: 61 RSLSRSATLLNRAATNGALDLQ---APLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMW 117
Query: 123 TQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC-QNGKCVYTRRYHVGDV 179
QC PC++C+ Q+ PIFDP ST++S +PC+ C++ C G C Y+ Y
Sbjct: 118 AQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTY 177
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR- 238
T+G E G + V + GC +++ G SG++G LSL SQ+
Sbjct: 178 TKGDLGFEKITI----GSSSV-KSVIGCGHESGGGFG--FASGVIGLGGGQLSLVSQMSQ 230
Query: 239 -NRIQGLFSYCLVREM-EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIG 296
+ I FSYCL + A I FG++A V + +TP++ + ++Y+ L ISIG
Sbjct: 231 TSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIG 290
Query: 297 --RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY 354
RH+ G IID+GT ++F+ Y ++ +++++ R + P
Sbjct: 291 NERHMASAK----------QGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKA-KRVKDPG 339
Query: 355 NASQEFDYCYRYD---SSFKAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDP 410
N +D C+ ++ P +T A+ + P N + + P
Sbjct: 340 NF---WDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASP 396
Query: 411 --KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
++ I+G N LI YDL L F C
Sbjct: 397 TDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 172/400 (43%), Gaps = 60/400 (15%)
Query: 83 LEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L I LP+ L Y + IGTP K ++ DT S ++W C C C ++
Sbjct: 71 LAAIDLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNL 130
Query: 137 ----PIFDPRASTTYSEIPCDDPLCRS------PFKCQNGKCVYTRRYHVGDVTRGLASR 186
++DPR S + + CD C + P C Y+ Y G T G
Sbjct: 131 GIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVT 190
Query: 187 ETFAFPVRNGFTFV----PRLAFGCSNDNSGFAFGGKIS-------GILGFNASPLSLSS 235
+ + +G ++FGC G GG + GILGF S S+ S
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGC-----GAKLGGDLGSSNLALDGILGFGQSNSSMLS 245
Query: 236 QL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEI 293
QL +++ +F++CL + + G +V + ++TTP L+SD+ PH+ + L I
Sbjct: 246 QLAAAGKVRKMFAHCL-DTVNGGGIFAIG---NVVQPKVKTTP-LVSDM-PHYNVILKGI 299
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR-YDQILRSLGRQRI 352
+G + P FD + G IID+GT + ++ G Y+ L +D+ Q I
Sbjct: 300 DVGGTALGLPTNIFD--SGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK------HQDI 351
Query: 353 PYNASQEFDYCYRYDSSF-KAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQ--- 407
Q+F C++Y S +P +TFH + + IV P + Y + + +C+ Q
Sbjct: 352 SVQTLQDFS-CFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD-YLFQNGKNLYCMGFQNGG 409
Query: 408 ----DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D +LG N L++YDL A+ + NC++
Sbjct: 410 VQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 172/395 (43%), Gaps = 49/395 (12%)
Query: 77 PNAFQELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCF 132
P +L D +P++ Q L Y V V IG + L+ DT S L W QC PC C+
Sbjct: 42 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCY 99
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLC-------RSPFKCQNGK---CVYTRRYHVGDVTRG 182
+Q P+F+P S+++ +PC+ P C S C N C Y Y G +RG
Sbjct: 100 NQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRG 159
Query: 183 LASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ 242
E G T + FGC +N G FGG SG++G S LSL SQ +
Sbjct: 160 ELGFEKLTL----GKTEIDNFIFGCGRNNKGL-FGGA-SGLMGLARSELSLVSQTSSLFG 213
Query: 243 GLFSYCL-VREMEATSVIKFGRDADVRRRDLE----TTPILLSDLRPHFYLHLLEISIGR 297
+FSYCL + ++ + G +++ T I + ++L+L ISIG
Sbjct: 214 SVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGG 273
Query: 298 HIVRFPPGAFDIMRDGTGGF-IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+ P + G ++D+GT +T + Y+ ++ +Q Y
Sbjct: 274 VNLNVPR-----LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFE-------KQFSGYRT 321
Query: 357 SQEF---DYCYRYDSSFKA-YPSMTFHLQ-EADYIVQPENM-YFIEPDRGRFCVAIQD-- 408
+ F + C+ + P++ F + A+ IV E + YF++ D + C+A
Sbjct: 322 TPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLG 381
Query: 409 -DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ + I+G +QQ+N +IY+ + F E C+
Sbjct: 382 YEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 172/395 (43%), Gaps = 49/395 (12%)
Query: 77 PNAFQELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCF 132
P +L D +P++ Q L Y V V IG + L+ DT S L W QC PC C+
Sbjct: 121 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCY 178
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLC-------RSPFKCQNGK---CVYTRRYHVGDVTRG 182
+Q P+F+P S+++ +PC+ P C S C N C Y Y G +RG
Sbjct: 179 NQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRG 238
Query: 183 LASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ 242
E G T + FGC +N G FGG SG++G S LSL SQ +
Sbjct: 239 ELGFEKLTL----GKTEIDNFIFGCGRNNKGL-FGGA-SGLMGLARSELSLVSQTSSLFG 292
Query: 243 GLFSYCL-VREMEATSVIKFGRDADVRRRDLE----TTPILLSDLRPHFYLHLLEISIGR 297
+FSYCL + ++ + G +++ T I + ++L+L ISIG
Sbjct: 293 SVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGG 352
Query: 298 HIVRFPPGAFDIMRDGTGGF-IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+ P + G ++D+GT +T + Y+ ++ +Q Y
Sbjct: 353 VNLNVPR-----LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFE-------KQFSGYRT 400
Query: 357 SQEF---DYCYRYDSSFKA-YPSMTFHLQ-EADYIVQPENM-YFIEPDRGRFCVAIQD-- 408
+ F + C+ + P++ F + A+ IV E + YF++ D + C+A
Sbjct: 401 TPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLG 460
Query: 409 -DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ + I+G +QQ+N +IY+ + F E C+
Sbjct: 461 YEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 117/446 (26%), Positives = 171/446 (38%), Gaps = 64/446 (14%)
Query: 37 IPIFSPESPLYP-----GNLSQSERIHKMFEISKARANYMASMSKPN--AFQELEDIH-- 87
+P+ P P G S +ER+ + +AR NY+ + + A L D
Sbjct: 99 VPLVHRHGPCAPSAASGGKPSLAERLRR----DRARTNYIVTKATGGRTAATALSDAAGG 154
Query: 88 -------LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPI 138
L + L Y V + IGTP Q +L DT S L W QC+PC C+ Q P+
Sbjct: 155 GTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPL 214
Query: 139 FDPRASTTYSEIPCDDPLCRSPFKCQNGK------------CVYTRRYHVGDVTRGLASR 186
FDP +S++Y+ +PCD CR G C Y Y T G+ S
Sbjct: 215 FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYST 274
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
ET + G V FGC + G K G+LG +P SL SQ ++ G FS
Sbjct: 275 ETLTL--KPGV-VVADFGFGCGDHQHGPYE--KFDGLLGLGGAPESLVSQTSSQFGGPFS 329
Query: 247 YCLVREMEATSVIKFG----RDADVRRRDLETTPILLSDLRPHFYL-HLLEISIGRHIVR 301
YCL + G + L TP+ P FY+ L IS+G +
Sbjct: 330 YCLPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLA 389
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
PP AF + G +ID+GT +T + Y L + + + +P + D
Sbjct: 390 IPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY--RLLPPSNGGVLD 441
Query: 362 YCYRYDSSFKAYP---SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSIL 415
CY + S+TF + P + C+A D I+
Sbjct: 442 TCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLV------DGCLAFAGAGTDNAIGII 495
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENC 441
G Q+ ++YD + F + C
Sbjct: 496 GNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 163/369 (44%), Gaps = 36/369 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V IG+P QHL+ DT S ++W QC PC C+ Q P+FDP S ++S +PC+ +
Sbjct: 123 YLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVPCNSGV 182
Query: 157 CRSPFK-------CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
CR+ + G+C Y Y T G+ + ET +G T V +A GC +
Sbjct: 183 CRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL---DGGTEVQGVAMGCGH 239
Query: 210 DNSG-FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV----REMEATSVIKFGRD 264
+N G FA + +G+LG P+SL QL G FSYCL E + + GR+
Sbjct: 240 ENRGLFA---EAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVLGRE 296
Query: 265 ADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
D P++ + P F Y+ + + + ++ G FD+ DG GG ++DTGT
Sbjct: 297 -DAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVMDTGT 355
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL-- 380
VT + Y L + G R P FD CY P++ +
Sbjct: 356 AVTRLPAEAYAALRGAFAGAFEE-GAPRAP--GVSLFDTCYDLSGYASVRVPTVALYFGG 412
Query: 381 -----QEADYIVQPENMYFIEPDRGRFCV---AIQDDPKYSILGAWQQQNMLIIYDLNVP 432
+ A + N+ D G +C+ A+ P SILG QQQ + I D
Sbjct: 413 GGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGP--SILGNIQQQGIEITVDSASG 470
Query: 433 ALRFGSENC 441
+ FG C
Sbjct: 471 YVGFGPATC 479
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 123/464 (26%), Positives = 201/464 (43%), Gaps = 58/464 (12%)
Query: 9 LAAFFSYFSV----LFLTH----FTSSESTGFSLKLIPIFSPESPLY-PGNLSQSERIHK 59
L F FS+ L L H T ++ G +L++ I SP SP P LS R+ +
Sbjct: 4 LVLFLQLFSIVPLALGLNHPNCDLTKNQDQGSTLRIFHIDSPCSPFKSPSPLSWEARVLQ 63
Query: 60 MFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASS 119
+AR Y++S+ + + + Q Y V+V IGTP +P L DT+S
Sbjct: 64 TLAQDQARLQYLSSLVAGRSVVPIASGRQML--QSTTYIVKVLIGTPAQPLLLAMDTSSD 121
Query: 120 LVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHV 176
+ W C C+ C T F P ST++ + C P C+ +P C C + Y
Sbjct: 122 VAWIPCSGCVGCPSNTA--FSPAKSTSFKNVSCSAPQCKQVPNP-ACGARACSFNLTYGS 178
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI---SGILGFNASPLSL 233
+ L S++T +R + FGC N +G GG I G+LG PLSL
Sbjct: 179 SSIAANL-SQDT----IRLAADPIKAFTFGCVNKVAG---GGTIPPPQGLLGLGRGPLSL 230
Query: 234 SSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHL 290
SQ ++ + FSYCL R + + ++ G + +R ++ T +L + R +Y++L
Sbjct: 231 MSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQR--VKYTQLLRNPRRSSLYYVNL 288
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ-------I 343
+ I +GR +V PP A G I D+GT T + Y+ + + + +
Sbjct: 289 VAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAV 348
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFC 403
+ SLG FD CY S P++TF + + + +N+ C
Sbjct: 349 VTSLG----------GFDTCY---SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSC 395
Query: 404 VAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+A+ P+ +++ + QQQN ++ D+ L E C+
Sbjct: 396 LAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 122/445 (27%), Positives = 189/445 (42%), Gaps = 37/445 (8%)
Query: 16 FSVLFLT------HFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARAN 69
S++FLT +E F+ +LI SP SPL+ + + R+ E S R N
Sbjct: 15 LSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNSPLFNASETTDIRLANAVERSADRVN 74
Query: 70 YMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFD--TASSLVWTQC-- 125
+ N+ E P + + ++++IG P P LL + T S LVW C
Sbjct: 75 RFNDLIS-NSITAAE---FPSILDNGDFLMKISIGIP--PTELLVNVATGSDLVWIPCLS 128
Query: 126 -QPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQNGKCVYT-RRYHVGDVTR 181
+PC D FDP S+TY +PCD C+ + CQ C Y+ H
Sbjct: 129 FKPCTHNCDLR--FFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDSCPD 186
Query: 182 GLASRETFAFPVRNGFTF-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR 240
G + +T G +F +P F C N G G GILG LSL +++ +
Sbjct: 187 GDLAMDTLTLNSTTGKSFMLPNTGFICGNRIGGDYPG---VGILGLGHGSLSLLNRISHL 243
Query: 241 IQGLFSYCLV-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHI 299
I G FS+C+V TS + FG A V + +T + ++ + L IS+G
Sbjct: 244 IDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKS 303
Query: 300 VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
+ D +G G +D+GT T+ Y L YD + ++ ++ + + ++
Sbjct: 304 ISAGGIGSDYYMNGLG---MDSGTMFTYFPEYFYSQL--EYD-VRYAIQQEPLYPDPTRR 357
Query: 360 FDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGA 417
CYRY F P++T H + + N FI C+A + ++ G
Sbjct: 358 LRLCYRYSPDFSP-PTITMHFEGGSVELSSSNS-FIRMTEDIVCLAFATSSSEQDAVFGY 415
Query: 418 WQQQNMLIIYDLNVPALRFGSENCA 442
WQQ N+LI YDL+ L F +C
Sbjct: 416 WQQTNLLIGYDLDAGFLSFLKTDCT 440
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/423 (23%), Positives = 178/423 (42%), Gaps = 43/423 (10%)
Query: 46 LYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGT 105
++P + S E I + AR +++S + A + + + Y V +G+
Sbjct: 33 VHPSSPSPLESIIALARDDDARLLFLSSKA---ATAGVSSAPVASGQAPPSYVVRAGLGS 89
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC-------- 157
P + L DT++ W C PC C ++ +F P S++Y+ +PC C
Sbjct: 90 PSQQLLLALDTSADATWAHCSPCGTC--PSSSLFAPANSSSYASLPCSSSWCPLFQGQAC 147
Query: 158 --------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
+P C +++ + LAS +R G +P FGC +
Sbjct: 148 PAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDT-----LRLGKDAIPNYTFGCVS 202
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
+G G+LG P++L SQ + G+FSYCL R + ++ G
Sbjct: 203 SVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGG- 261
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ R + TP+L + R +Y+++ +S+GR V+ P G+F G ++D+GT +T
Sbjct: 262 QPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDSGTVIT 321
Query: 327 FIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEAD 384
Y L + + Q+ G Y + FD C+ D + P++T H+
Sbjct: 322 RWTAPVYAALREEFRRQVAAPSG-----YTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGV 376
Query: 385 YIVQP-ENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGS 438
+ P EN C+A+ + P+ +++ QQQN+ +++D+ + F
Sbjct: 377 DLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRIGFAK 436
Query: 439 ENC 441
E+C
Sbjct: 437 ESC 439
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 108/438 (24%), Positives = 180/438 (41%), Gaps = 41/438 (9%)
Query: 28 ESTGFSLKLIPIFSPESPL-YPGNL-------SQSERIH----KMFEISKARANYMASMS 75
STG L+L SP SP P +L RI ++ + ARA + + +
Sbjct: 39 NSTGLHLELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADA 98
Query: 76 KPNAFQELEDIHL-PMAKQDLF-YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCF 132
L + L P A + Y + +GTP ++ DT SSL W QC PC + C
Sbjct: 99 DAGLAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCH 158
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLA 184
Q+ P+F+P++S+TY+ + C C +P C + C+Y Y + G
Sbjct: 159 RQSGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYL 218
Query: 185 SRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
S++T +F G T +P +GC DN G G+ +G++G + LSL QL +
Sbjct: 219 SKDTVSF----GSTSLPNFYYGCGQDNEGLF--GRSAGLIGLARNKLSLLYQLAPSLGYS 272
Query: 245 FSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPP 304
F+YCL + + + TP++ S L Y I + V P
Sbjct: 273 FTYCLPSSSSSGYLSLGSYNPG----QYSYTPMVSSSLDDSLYF----IKLSGMTVAGNP 324
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
+ + IID+GT +T + Y L + ++ R +A D C+
Sbjct: 325 LSVSSSAYSSLPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRA----SAYSILDTCF 380
Query: 365 RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNML 424
+ +S + P++T + ++ D C+A +I+G QQQ
Sbjct: 381 KGQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDDSTTCLAFAPARSAAIIGNTQQQTFS 440
Query: 425 IIYDLNVPALRFGSENCA 442
++YD+ + F + C+
Sbjct: 441 VVYDVKSSRIGFAAGGCS 458
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/454 (25%), Positives = 182/454 (40%), Gaps = 71/454 (15%)
Query: 51 LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTP 106
L+ + + + + + R +++S + A + +P++ Y V +GTP
Sbjct: 37 LAPAASLADLARMDRERMAFISSRGRRRAAETASAFAMPLSSGAYTGTGQYFVRFRVGTP 96
Query: 107 MKPQHLLFDTASSLVWTQCQ-------------PCIRCFDQTTP--IFDPRASTTYSEIP 151
+P L+ DT S L W +C + +P F P S T++ IP
Sbjct: 97 AQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDKSRTWAPIP 156
Query: 152 CDDPLCRS--PFK---CQN--GKCVYTRRYHVGDVTRGLASRE--TFAFPVRNGFTFVPR 202
C CR PF C C Y RY G RG + T A R R
Sbjct: 157 CSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLR 216
Query: 203 -LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSV 258
+ GC+ +G +F G+L S +S +S+ +R G FSYCLV + ATS
Sbjct: 217 GVVLGCTTSYNGQSFLAS-DGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSY 275
Query: 259 IKFGRD-ADVRRRDLE----------------------TTPILLSD-LRPHFYLHLLEIS 294
+ FG + A RR E TP++L RP + + + +S
Sbjct: 276 LTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVS 335
Query: 295 IGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY 354
+ +++ P +D+ + GG I+D+GT +T + Y+ ++ + L L R +
Sbjct: 336 VAGELLKIPRAVWDV--EQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTM-- 391
Query: 355 NASQEFDYCYRYDSSFKA-----YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD 409
FDYCY + S + P + H + + P Y I+ G C+ +Q+
Sbjct: 392 ---DPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEG 448
Query: 410 --PKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P S++G QQ L YDL LRF C
Sbjct: 449 PWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 156/372 (41%), Gaps = 41/372 (11%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L+ IGTP +P + D A LVWTQC C RCF Q P+F P AS+T+ PC
Sbjct: 41 LYNVANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGT 100
Query: 155 PLCRS--PFKCQNGKCVYTRRYHV---GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C+S C C Y ++ T G+ ETFA T LAFGC
Sbjct: 101 DACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIG-----TATASLAFGCVV 155
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVR 268
+ G SG +G +P SL +Q++ FSYCL R +S + G A +
Sbjct: 156 ASDIDTMDG-TSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAKLA 211
Query: 269 RRDLETTPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG-FIIDTGT 323
+ +T + D H+YL L+ +R G I +GG ++ T +
Sbjct: 212 GGESTSTAPFIKTSPDDDSHHYYLLSLD------AIR--AGNTTIATAQSGGILVMHTVS 263
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--AYPSMTFHLQ 381
P + + + Y+ + + + Q + Q FD C++ + F P + F Q
Sbjct: 264 PFSLLVDSAYRAFKKAVTEAVGGAAEQPM-ATPPQPFDLCFKKAAGFSRATAPDLVFTFQ 322
Query: 382 EADYIVQPENMYFIE--PDRGRFCVAI--------QDDPKYSILGAWQQQNMLIIYDLNV 431
A + P Y I+ ++ C AI S+LG+ QQ+++ +YDL
Sbjct: 323 GAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKK 382
Query: 432 PALRFGSENCAN 443
L F +C++
Sbjct: 383 ETLSFEPADCSS 394
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/412 (25%), Positives = 177/412 (42%), Gaps = 35/412 (8%)
Query: 35 KLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELE-DIHLPMAKQ 93
KLI S P Y N + +R+ + S AR Y+ + + + E + +
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARVSPSLT 97
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
++IG P PQ ++ DT S ++W C PC C + +FDP S+T+S
Sbjct: 98 GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFS----- 152
Query: 154 DPLCRSPFKCQNGKCV------YTRRYHVGDVTRGLASRETFAFPVRN-GFTFVPRLAFG 206
PLC++P C C +T Y G+ R+T F + G + +P + FG
Sbjct: 153 -PLCKTP--CDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFG 209
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC---LVREMEATSVIKFGR 263
C + N G +GILG N P SL++++ + FSYC L + G
Sbjct: 210 CGH-NIGQDTDPGHNGILGLNNGPDSLATKIGQK----FSYCIGDLADPYYNYHQLILGE 264
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
AD+ +TP + + +Y+ + IS+G + P F++ ++ TGG IIDTG+
Sbjct: 265 GADLEGY---STPFEVHN--GFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGS 319
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEA 383
+TF+ + ++ L + +L RQ + + +P +TFH +
Sbjct: 320 TITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADG 379
Query: 384 DYIVQPENMYFIEPDRGRFCVAI------QDDPKYSILGAWQQQNMLIIYDL 429
+ +F + + FC+ + K S++G QQ+ + YDL
Sbjct: 380 ADLALDSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDL 431
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 118/451 (26%), Positives = 185/451 (41%), Gaps = 66/451 (14%)
Query: 44 SPLYPGNLSQSERIHKMFEIS--KARANYMASMSKPNAFQELEDIHLPMAKQDLF----Y 97
+P +L++S+R F S + RA A+ S AF+ +P+ Y
Sbjct: 41 APASLADLARSDRQRMAFIASHGRRRARETAAGSSAAAFE------MPLTSGAYTGIGQY 94
Query: 98 SVEVNIGTPMKPQHLLFDTASSLVWTQC-QPCIRCFDQTTP---IFDPRASTTYSEIPCD 153
V +GTP +P L+ DT S L W +C +P + + F P S T++ I C
Sbjct: 95 FVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCA 154
Query: 154 DPLCRS--PFKCQ-----NGKCVYTRRYHVGDVTRGLASRE--TFAFPVRNGFTFVPR-- 202
C PF C Y RY G RG E T A R +
Sbjct: 155 SDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLK 214
Query: 203 -LAFGCSNDNSGFAFGGKIS-GILGFNASPLSLSSQLRNRIQGLFSYCLVREME---ATS 257
L GC++ +G +F ++S G+L S +S +S +R G FSYCLV + ATS
Sbjct: 215 GLVLGCTSSYTGPSF--EVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATS 272
Query: 258 VIKFGRDADVRRRDLET--------------------TPILLS-DLRPHFYLHLLEISIG 296
+ FG + V + TP+LL +RP + + + +S+
Sbjct: 273 YLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVA 332
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
++ P +D+ D GG I+D+GT +T + Y+ ++ + L L R +
Sbjct: 333 GQFLKIPRAVWDV--DAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTM---- 386
Query: 357 SQEFDYCYRYDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKY 412
F+YCY + S P M H A + P Y I+ G C+ +Q+ P
Sbjct: 387 -DPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGI 445
Query: 413 SILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
S++G QQ L +D+ L+F C +
Sbjct: 446 SVIGNILQQEHLWEFDIKNRRLKFQRSRCTH 476
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 166/379 (43%), Gaps = 49/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y E+ IGTP K ++ DT S ++W C C RC ++DP+ S+T S++
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62
Query: 151 PCDDPLCRSPF-----KCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV---- 200
CD C + + C C Y+ Y G T G + F +G
Sbjct: 63 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122
Query: 201 PRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLR--NRIQGLFSYCLVREMEAT 256
+ FGC + G + + GI+GF S S+ SQL +++ +F++CL +
Sbjct: 123 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL-DTINGG 181
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+ G +V + ++TTP++ + PH+ ++L I +G ++ P FD G
Sbjct: 182 GIFAIG---NVVQPKVKTTPLVPN--MPHYNVNLKSIDVGGTALKLPSHMFDTGE--KKG 234
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR-IPYNASQE---FDYCYRYDSSFKA 372
IID+GT +T++ Y+ +M L + + I ++ QE F Y R D F
Sbjct: 235 TIIDSGTTLTYLPEIVYKEIM------LAVFAKHKDITFHNVQEFLCFQYVGRVDDDF-- 286
Query: 373 YPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNML 424
P +TFH + + V P + YF E +CV Q D +LG N L
Sbjct: 287 -PKITFHFENDLPLNVYPHD-YFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKL 344
Query: 425 IIYDLNVPALRFGSENCAN 443
++YDL + + NC++
Sbjct: 345 VVYDLENQVIGWTEYNCSS 363
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 162/378 (42%), Gaps = 26/378 (6%)
Query: 73 SMSKPNAFQELEDIHLPMAKQDLF-----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
S + N E + LP AK + Y V + IGTP L+FDT S L WTQC+P
Sbjct: 104 SKNSANEVSEAKSTELP-AKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEP 162
Query: 128 CI-RCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASR 186
C+ C+ Q P F+P +S+TY + C P+C C CVY+ Y T+G ++
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCVYSIGYGDKSFTQGFLAK 222
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
E F + + + FGC +N G ++G+LG LSL +Q +FS
Sbjct: 223 EKFTLTNSD---VLEDVYFGCGENNQGLF--DGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277
Query: 247 YCLVR-EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG 305
YCL +T + FG ++ TPI ++ + ++ IS+G + P
Sbjct: 278 YCLPSFTSNSTGHLTFGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPN 335
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
+F T G IID+GT T + Y L + + + S + Y FD CY
Sbjct: 336 SFS-----TEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSY-KSTSGYGL---FDTCYD 386
Query: 366 YDS-SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNM 423
+ YP++ F + + + + C+A +D +I G QQ +
Sbjct: 387 FTGLDTVTYPTIAFSFAGGTVVELDGSGISLPIKISQVCLAFAGNDDLPAIFGNVQQTTL 446
Query: 424 LIIYDLNVPALRFGSENC 441
++YD+ + F C
Sbjct: 447 DVVYDVAGGRVGFAPNGC 464
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/361 (30%), Positives = 161/361 (44%), Gaps = 40/361 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y IGTP + D +S LVWT C T P F+P STT +++PC D
Sbjct: 99 MYVFSYGIGTPPQQVSGALDISSDLVWTAC-------GATAP-FNPVRSTTVADVPCTDD 150
Query: 156 LCR--SPFKCQNG------KCVYTRRYHVGDV-TRGLASRETFAFPVRNGFTFVPRLAFG 206
C+ +P C G +C YT Y G T GL E F F G T + + FG
Sbjct: 151 ACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTF----GDTRIDGVVFG 206
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCLVRE--MEATSVIKFGR 263
C N G F G +SG++G LSL SQL+ +R FSY + ++ S I FG
Sbjct: 207 CGLQNVG-DFSG-VSGVIGLGRGNLSLVSQLQVDR----FSYHFAPDDSVDTQSFILFGD 260
Query: 264 DADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDT 321
DA + +T +L SD P +Y+ L I + + P G FD+ +DG+GG +
Sbjct: 261 DATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSI 320
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL 380
VT + Y+ L Q + +G + ++ D CY +S KA PSM
Sbjct: 321 TDLVTVLEEAAYKPLRQ---AVASKIGLPAV-NGSALGLDLCYTGESLAKAKVPSMALVF 376
Query: 381 QEADYI-VQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+ ++ N ++++ G C+ I S+LG+ Q ++YD+N L F
Sbjct: 377 AGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVFE 436
Query: 438 S 438
S
Sbjct: 437 S 437
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 156/376 (41%), Gaps = 41/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP----IFDPRASTTYSEIPC 152
Y V +GTP +P L+ DT S L W +C+ +F AS +++ I C
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIAC 160
Query: 153 DDPLCRS--PFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF------ 199
C S PF N C Y RY G RG+ ++ + +G
Sbjct: 161 SSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSS 220
Query: 200 ------VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM 253
+ + GC+ G +F G+L S +S +S+ R G FSYCLV +
Sbjct: 221 GGRRAKLQGVVLGCAATYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHL 279
Query: 254 ---EATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDI 309
ATS + FG A TP+LL + P + + + + + + P +D+
Sbjct: 280 APRNATSYLTFGPGATA---PAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDV 336
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DS 368
R+ GG I+D+GT +T + Y+ ++ + L L R + F+YCY + D+
Sbjct: 337 DRN--GGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM-----DPFEYCYNWTDA 389
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLII 426
P M H + + P Y I+ G C+ +Q+ P S++G QQ L
Sbjct: 390 GALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWE 449
Query: 427 YDLNVPALRFGSENCA 442
+DL LRF CA
Sbjct: 450 FDLRDRWLRFKHTRCA 465
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 163/378 (43%), Gaps = 26/378 (6%)
Query: 73 SMSKPNAFQELEDIHLPMAKQDLF-----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
S + N E + LP AK + Y V + IGTP L+FDT S L WTQC+P
Sbjct: 104 SKNSANEVSEAKSTELP-AKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEP 162
Query: 128 CI-RCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASR 186
C+ C+ Q P F+P +S+TY + C P+C C CVY+ Y T+G ++
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCVYSIVYGDKSFTQGFLAK 222
Query: 187 ETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
E F + + + FGC +N G ++G+LG LSL +Q +FS
Sbjct: 223 EKFTLTNSD---VLEDVYFGCGENNQGLF--DGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277
Query: 247 YCLVR-EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG 305
YCL +T + FG ++ TPI ++ + ++ IS+G + P
Sbjct: 278 YCLPSFTSNSTGHLTFGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPN 335
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
+F T G IID+GT T + Y L + + + S + Y FD CY
Sbjct: 336 SFS-----TEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSY-KSTSGYGL---FDTCYD 386
Query: 366 YDS-SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNM 423
+ YP++ F + + + + + C+A +D +I G QQ +
Sbjct: 387 FTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKISQVCLAFAGNDDLPAIFGNVQQTTL 446
Query: 424 LIIYDLNVPALRFGSENC 441
++YD+ + F C
Sbjct: 447 DVVYDVAGGRVGFAPNGC 464
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 120/455 (26%), Positives = 194/455 (42%), Gaps = 47/455 (10%)
Query: 10 AAFFSYFSVLFLTHFT----SSESTGFSLKLIPIFSPESPLYP-GNLSQSERIHKMFEIS 64
AA F ++LF T +++S L +IPI+S SP P S + M
Sbjct: 7 AATFFLVALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKD 66
Query: 65 KARANYMASMSKPNAFQELEDIHLPMAKQDL---FYSVEVNIGTPMKPQHLLFDTASSLV 121
R Y++++ A Q+ + + +Q L Y V V +GTP + ++ DT++
Sbjct: 67 PERLKYLSTL----ADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAA 122
Query: 122 WTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC---QNGKCVYTRRYHV 176
W C C F TT F P ASTT + C C F C + C++ + Y
Sbjct: 123 WVPCSGCTG-FSSTT--FLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY-- 177
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI--SGILGFNASPLSLS 234
G + A+ A + N +P FGC N SG G I G+LG P+SL
Sbjct: 178 GGDSSLTATLVQDAITLAN--DVIPGFTFGCINAVSG----GSIPPQGLLGLGRGPISLI 231
Query: 235 SQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLL 291
SQ G+FSYCL + + +K G + + + TTP+L + RP +Y++L
Sbjct: 232 SQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVG--QPKSIRTTPLLRNPHRPSLYYVNLT 289
Query: 292 EISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR 351
+S+GR V P + G IID+GT +T Y + + + +
Sbjct: 290 GVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG----- 344
Query: 352 IPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK 411
P ++ FD C+ + +A P++T H + + ++ EN C+++ P
Sbjct: 345 -PISSLGAFDTCFAATNEAEA-PAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPN 402
Query: 412 -----YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+++ QQQN+ I++D L E C
Sbjct: 403 NVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/435 (25%), Positives = 179/435 (41%), Gaps = 56/435 (12%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI--HLPMAKQDLFYSVEVNIGTPM 107
NL++ E + + + S+ R + M++ A + + P+ Y V++ IGTP
Sbjct: 41 NLTEHELLRRAIQRSRYRLAGIG-MARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 108 KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC-- 163
DTAS L+WTQCQPC C+ Q P+F+PR S+TY+ +PC C +C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
+ C YT Y T G + + G +AFGCS ++G A + SG
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPPQASG 215
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRDLETTPI-LLS 280
++G PLSL SQL R F+YCL + G DAD R + +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272
Query: 281 DLR--PHFYLHLLEISIGRHIVRF---------------------PPGAFDIMRDGTG-- 315
D R ++YL+L + IG + P A +
Sbjct: 273 DPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRY 332
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY------RYDSS 369
G IID + +TF+ Y L+ + +R L R +S D C+ +D
Sbjct: 333 GMIIDIASTITFLEASLYDELVNDLEVEIR-LPRG---TGSSLGLDLCFILPDGVAFDRV 388
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIY 427
+ P++ + ++ + + G C+ + + SILG +QQQNM ++Y
Sbjct: 389 Y--VPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLY 446
Query: 428 DLNVPALRFGSENCA 442
+L + F C
Sbjct: 447 NLRRGRVTFVQSPCG 461
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/435 (25%), Positives = 179/435 (41%), Gaps = 56/435 (12%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI--HLPMAKQDLFYSVEVNIGTPM 107
NL++ E + + + S+ R + M++ A + + P+ Y V++ IGTP
Sbjct: 41 NLTEHELLRRAIQRSRYRLAGIG-MARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 108 KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC-- 163
DTAS L+WTQCQPC C+ Q P+F+PR S+TY+ +PC C +C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
+ C YT Y T G + + G +AFGCS ++G A + SG
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPPQASG 215
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRDLETTPI-LLS 280
++G PLSL SQL R F+YCL + G DAD R + +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272
Query: 281 DLR--PHFYLHLLEISIGRHIVRF---------------------PPGAFDIMRDGTG-- 315
D R ++YL+L + IG + P A +
Sbjct: 273 DPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRY 332
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY------RYDSS 369
G IID + +TF+ Y L+ + +R L R +S D C+ +D
Sbjct: 333 GMIIDIASTITFLEASLYDELVNDLEVEIR-LPRG---TGSSLGLDLCFILPDGVAFDRV 388
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIY 427
+ P++ + ++ + + G C+ + + SILG +QQQNM ++Y
Sbjct: 389 Y--VPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLY 446
Query: 428 DLNVPALRFGSENCA 442
+L + F C
Sbjct: 447 NLRRGRVTFVQSPCG 461
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 166/379 (43%), Gaps = 49/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y E+ IGTP K ++ DT S ++W C C RC ++DP+ S+T S++
Sbjct: 88 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 147
Query: 151 PCDDPLCRSPF-----KCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV---- 200
CD C + + C C Y+ Y G T G + F +G
Sbjct: 148 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 207
Query: 201 PRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLR--NRIQGLFSYCLVREMEAT 256
+ FGC + G + + GI+GF S S+ SQL +++ +F++CL +
Sbjct: 208 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL-DTINGG 266
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+ G +V + ++TTP++ + PH+ ++L I +G ++ P FD G
Sbjct: 267 GIFAIG---NVVQPKVKTTPLVPN--MPHYNVNLKSIDVGGTALKLPSHMFDTGE--KKG 319
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR-IPYNASQE---FDYCYRYDSSFKA 372
IID+GT +T++ Y+ +M L + + I ++ QE F Y R D F
Sbjct: 320 TIIDSGTTLTYLPEIVYKEIM------LAVFAKHKDITFHNVQEFLCFQYVGRVDDDF-- 371
Query: 373 YPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNML 424
P +TFH + + V P + YF E +CV Q D +LG N L
Sbjct: 372 -PKITFHFENDLPLNVYPHD-YFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKL 429
Query: 425 IIYDLNVPALRFGSENCAN 443
++YDL + + NC++
Sbjct: 430 VVYDLENQVIGWTEYNCSS 448
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 106/329 (32%), Positives = 153/329 (46%), Gaps = 42/329 (12%)
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPF--KCQNGK------CVYTRRYHVGDVTRGLASRET 188
P FD S+T CD LC+ C N K CVYT Y+ VT GL +
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDK 234
Query: 189 FAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
F F VP +AFGC N+G F +GI GF PLSL SQL+ G FS+C
Sbjct: 235 FTF---GAGASVPGVAFGCGLFNNG-VFKSNETGIAGFGRGPLSLPSQLK---VGNFSHC 287
Query: 249 L--VREMEATSVI--------KFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGR 297
V ++ ++V+ K GR A +++TP++ + P +YL L I++G
Sbjct: 288 FTAVNGLKQSTVLLDLLADLYKNGRGA------VQSTPLIQNSANPTLYYLSLKGITVGS 341
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ P AF + +GTGG IID+GT +T + P Q D+ + +P NA+
Sbjct: 342 TRLPVPESAF-ALTNGTGGTIIDSGTSITSL---PPQVYQVVRDEFAAQIKLPVVPGNAT 397
Query: 358 QEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPD---RGRFCVAIQD-DPKY 412
+ C+ S K P + H + A + EN F PD C+AI + +
Sbjct: 398 GPYT-CFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDER 456
Query: 413 SILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ +G +QQQNM ++YDL L F + C
Sbjct: 457 ATIGNFQQQNMHVLYDLQNNMLSFVAAQC 485
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 64/135 (47%), Gaps = 9/135 (6%)
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
I++G + P AF + +GTGG IID+GT +T + P Q D+ + +
Sbjct: 42 ITVGSTRLPVPESAF-ALTNGTGGTIIDSGTSITSL---PPQVYQVVRDEFAAQIKLPVV 97
Query: 353 PYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGR---FCVAIQD 408
P NA+ + C+ S K P + H + A + EN F PD C+AI
Sbjct: 98 PGNATGPYT-CFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINK 156
Query: 409 DPKYSILGAWQQQNM 423
+ +I+G +QQQNM
Sbjct: 157 GDETTIIGNFQQQNM 171
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 164/359 (45%), Gaps = 32/359 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP + L+FDT S + WTQCQPC+ C+ Q FDP ST+Y+ + C
Sbjct: 135 YVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSA 194
Query: 156 LCR----SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C S C N C+Y Y ++G + ET + FT FGC
Sbjct: 195 SCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSDVFT---NFLFGCGQ 251
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
N+G G+ +G+LG ++S +SL SQ + Q FSYCL +T + FG
Sbjct: 252 SNNGLF--GQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNFGGKV---S 306
Query: 270 RDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
+ TPI S FY + ++ IS+ + P F T G IID+GT +T +
Sbjct: 307 QTAGFTPI--SPAFSSFYGIDIVGISVAGSQLPIDPSIFT-----TSGAIIDSGTVITRL 359
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYI- 386
Y+ L + +D+ + + + N + D CY + + + ++P ++ + +
Sbjct: 360 PPTAYKALKEAFDEKMSNYPKT----NGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVD 415
Query: 387 VQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ + ++ C+A +DD ++ I G QQ+ ++YD + F + C+
Sbjct: 416 IDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGACS 474
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 159/373 (42%), Gaps = 35/373 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTT------PIFDPRASTTY 147
YSV +GTP + L+ DT S L W C+ R C ++ +F S+++
Sbjct: 12 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 71
Query: 148 SEIPCDDPLCR----SPFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
IPC +C+ F N C Y RY G G + ET ++ G
Sbjct: 72 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRK 131
Query: 199 F-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
+ + GCS G +F G++G S S + + + G FSYCLV + +
Sbjct: 132 MKLHNVLIGCSESFQGQSFQAA-DGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKN 190
Query: 258 V---IKFG--RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
V + FG R + ++ T ++L + + ++++ ISIG +++ P +D+
Sbjct: 191 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV--K 248
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
G GG I+D+G+ +TF+ YQ +M + SL + R +YC+ + F+
Sbjct: 249 GAGGTILDSGSSLTFLTEPAYQPVMA---ALRVSLLKFRKVEMDIGPLEYCFN-STGFEE 304
Query: 373 --YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYD 428
P + FH + P Y I G C+ P S++G QQN L +D
Sbjct: 305 SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFD 364
Query: 429 LNVPALRFGSENC 441
L + L F +C
Sbjct: 365 LGLKKLGFAPSSC 377
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 123/475 (25%), Positives = 185/475 (38%), Gaps = 61/475 (12%)
Query: 9 LAAFFSYFSVLFLTHF-------TSSESTGFSLKLIPIFSPESPLYP-----GNLSQSER 56
LA + F+V+ + F TSS ++ + +P+ P P G S +ER
Sbjct: 10 LAVNLNNFAVVPASSFEPEAACSTSSANSDPNRASVPLVHRHGPCAPSAASGGKPSLAER 69
Query: 57 IHKMFEISKARANYMASMSKPNAFQELE--------DIHLPMAKQD----LFYSVEVNIG 104
+ + +ARANY+ + + +P D L Y V + IG
Sbjct: 70 LRR----DRARANYIVTKAAGGRTAATAVSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIG 125
Query: 105 TPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--- 159
TP Q +L DT S L W QC+PC C+ Q P+FDP +S++Y+ +PCD CR
Sbjct: 126 TPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAA 185
Query: 160 ---PFKCQNGK---CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C +G C Y Y T G+ S ET + G V FGC + G
Sbjct: 186 GAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTL--KPGV-VVADFGFGCGDHQHG 242
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
K G+LG +P SL SQ ++ G FSYCL + G +
Sbjct: 243 PYE--KFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGFLALGAP-NSSSSSTA 299
Query: 274 TTPILLSDLR-----PHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
L + +R P FY + L IS+G + PP AF + G +ID+GT +T
Sbjct: 300 AAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAF------SSGMVIDSGTVITG 353
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYI 386
+ Y L + + + +P + D CY + P++ I
Sbjct: 354 LPATAYAALRSAFRSAMSEY--RLLPPSNGAVLDTCYDFTGHTNVTVPTIALTFSGGATI 411
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ F A DD I+G Q+ ++YD + F + C
Sbjct: 412 DLATPAGVLVDGCLAFAGAGTDD-TIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 166/370 (44%), Gaps = 45/370 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP----IFDPRASTTYSEIPC 152
Y + VN+GTP + DT S LVW C +F P S+TYS++ C
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSC 162
Query: 153 DDPLCR--SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAF--PVRNGFTFVPRLAFGC 207
C+ S C + +C Y Y G T G+ S ETF+F G VPR+ FGC
Sbjct: 163 QSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFGC 222
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEA--TSVIKFGR 263
S ++G F + G++G A SL SQL I SYCL+ +A +S + FG
Sbjct: 223 STASAG-TF--RSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLNFGS 279
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
A V +TP++ SD+ ++ + L +++G V A R I+D+GT
Sbjct: 280 RAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEV-----ATHDSR-----IIVDSGT 329
Query: 324 PVTFIRN---GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY----RYDSSFKAYPSM 376
+TF+ GP T ++R ++ QR+ Q CY + ++ P +
Sbjct: 330 TLTFLDPALLGPLVTELERRIKL------QRV-QPPEQLLQLCYDVQGKSETDNFGIPDV 382
Query: 377 TFHL-QEADYIVQPENMYFIEPDRGRFC---VAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
T A ++PEN + + + G C V + + SILG QQN + YDL+
Sbjct: 383 TLRFGGGAAVTLRPENTFSLLQE-GTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDAR 441
Query: 433 ALRFGSENCA 442
+ F + +CA
Sbjct: 442 TVTFAAADCA 451
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 159/373 (42%), Gaps = 35/373 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTT------PIFDPRASTTY 147
YSV +GTP + L+ DT S L W C+ R C ++ +F S+++
Sbjct: 83 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 148 SEIPCDDPLCR----SPFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
IPC +C+ F N C Y RY G G + ET ++ G
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRK 202
Query: 199 F-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
+ + GCS G +F G++G S S + + + G FSYCLV + +
Sbjct: 203 MKLHNVLIGCSESFQGQSFQAA-DGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKN 261
Query: 258 V---IKFG--RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
V + FG R + ++ T ++L + + ++++ ISIG +++ P +D+
Sbjct: 262 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV--K 319
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
G GG I+D+G+ +TF+ YQ +M + SL + R +YC+ + F+
Sbjct: 320 GAGGTILDSGSSLTFLTEPAYQPVMA---ALRVSLLKFRKVEMDIGPLEYCFN-STGFEE 375
Query: 373 --YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYD 428
P + FH + P Y I G C+ P S++G QQN L +D
Sbjct: 376 SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFD 435
Query: 429 LNVPALRFGSENC 441
L + L F +C
Sbjct: 436 LGLKKLGFAPSSC 448
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 149/351 (42%), Gaps = 27/351 (7%)
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDPLCR- 158
+ +GTP ++ DT SSL W QC PC + C Q+ P+F+P++S+TY+ + C C
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 159 ------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
+P C + C+Y Y + G S++T +F G T +P +GC DN
Sbjct: 61 LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF----GSTSLPNFYYGCGQDN 116
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
G G+ +G++G + LSL QL + F+YCL + + +
Sbjct: 117 EGLF--GRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNPG----Q 170
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
TP++ S L Y I + V P + + IID+GT +T +
Sbjct: 171 YSYTPMVSSSLDDSLYF----IKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTS 226
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPEN 391
Y L + ++ R +A D C++ +S + P++T +
Sbjct: 227 VYSALSKAVAAAMKGTSRA----SAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQ 282
Query: 392 MYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 283 NLLVDVDDSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 97/423 (22%), Positives = 177/423 (41%), Gaps = 43/423 (10%)
Query: 46 LYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGT 105
++P + S E I + AR +++S + A + + + Y V +G+
Sbjct: 31 VHPSSPSPLESIIALARDDDARLLFLSSKA---ATAGVSSAPVASGQAPPSYVVRAGLGS 87
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC-------- 157
P + L DT++ W C PC C ++ +F P S++Y+ +PC C
Sbjct: 88 PSQQLLLALDTSADATWAHCSPCGTC--PSSSLFAPANSSSYASLPCSSSWCPLFQGQAC 145
Query: 158 --------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
+P C +++ + LAS +R G +P FGC +
Sbjct: 146 PAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDT-----LRLGKDAIPNYTFGCVS 200
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
+G G+LG P++L SQ + G+FSYCL R + ++ G
Sbjct: 201 SVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGG- 259
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ R + TP+L + R +Y+++ +S+G V+ P G+F G ++D+GT +T
Sbjct: 260 QPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVIT 319
Query: 327 FIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEAD 384
Y L + + Q+ G Y + FD C+ D + P++T H+
Sbjct: 320 RWTAPVYAALREEFRRQVAAPSG-----YTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGV 374
Query: 385 YIVQP-ENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGS 438
+ P EN C+A+ + P+ +++ QQQN+ +++D+ + F
Sbjct: 375 DLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAK 434
Query: 439 ENC 441
E+C
Sbjct: 435 ESC 437
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 105/400 (26%), Positives = 171/400 (42%), Gaps = 60/400 (15%)
Query: 83 LEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L I LP+ L Y + IGTP K ++ DT S ++W C C C ++
Sbjct: 71 LAAIDLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNL 130
Query: 137 ----PIFDPRASTTYSEIPCDDPLCRS------PFKCQNGKCVYTRRYHVGDVTRGLASR 186
++DPR S + + CD C + P C Y+ Y G T G
Sbjct: 131 GIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVT 190
Query: 187 ETFAFPVRNGFTFV----PRLAFGCSNDNSGFAFGGKIS-------GILGFNASPLSLSS 235
+ + +G ++FGC G GG + GILGF S S+ S
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGC-----GAKLGGDLGSSNLALDGILGFGQSNSSMLS 245
Query: 236 QL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEI 293
QL +++ +F++CL + + G +V + ++TTP L+ D+ PH+ + L I
Sbjct: 246 QLAAAGKVRKMFAHCL-DTVNGGGIFAIG---NVVQPKVKTTP-LVPDM-PHYNVILKGI 299
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR-YDQILRSLGRQRI 352
+G + P FD + G IID+GT + ++ G Y+ L +D+ Q I
Sbjct: 300 DVGGTALGLPTNIFD--SGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK------HQDI 351
Query: 353 PYNASQEFDYCYRYDSSF-KAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQ--- 407
Q+F C++Y S +P +TFH + + IV P + Y + + +C+ Q
Sbjct: 352 SVQTLQDFS-CFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD-YLFQNGKNLYCMGFQNGG 409
Query: 408 ----DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D +LG N L++YDL A+ + NC++
Sbjct: 410 VQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 114/437 (26%), Positives = 178/437 (40%), Gaps = 52/437 (11%)
Query: 23 HFTSSESTGFSLKLIPIFSPESPLYPG-NLSQSER-IHKMFEISKARANYMASMSKPNAF 80
F E G S +P+ P P +LS R +F S+AR +Y+ K +
Sbjct: 43 EFVKPEQNG-STVYVPLVHRHGPCAPAPSLSTDTRSFADIFRRSRARPSYIVRGKKVSV- 100
Query: 81 QELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPI 138
HL + L Y V V+ GTP PQ ++ DT S + W QC+PC +CF Q P+
Sbjct: 101 ----PAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPL 156
Query: 139 FDPRASTTYSEIPCDDPLCRS------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAF 191
+DP S+TYS +PC +C+ C +GK C + Y G T G S++
Sbjct: 157 YDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL 216
Query: 192 PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR 251
V FGC + A G G+LG L L R G+FSYCL
Sbjct: 217 APG---AIVQNFYFGCGHGK--HAVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPS 267
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIM 310
+ G A TP+ +P F + L I++G + P AF
Sbjct: 268 VSSKPGFLALG--AGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF--- 322
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR---YD 367
+GG I+D+GT +T +++ Y+ L + + + + R+ N + D CY Y
Sbjct: 323 ---SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAY---RLLPNG--DLDTCYNLTGYK 374
Query: 368 SSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQQNML 424
+ ++TF + P + C+A + D +LG Q+
Sbjct: 375 NVVVPKIALTFTGGATINLDVPNGILV------NGCLAFAESGPDGSAGVLGNVNQRAFE 428
Query: 425 IIYDLNVPALRFGSENC 441
+++D + F ++ C
Sbjct: 429 VLFDTSTSKFGFRAKAC 445
>gi|326532334|dbj|BAK05096.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 437
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 155/367 (42%), Gaps = 27/367 (7%)
Query: 96 FYSVEVNIGTPMKPQ--HLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
Y V V +G+ L D +L W QCQPC+ Q +FD S Y +
Sbjct: 67 LYGVLVGVGSGQTRHFYKLGLDLVGNLTWMQCQPCVPEVRQEGAVFDSAESPRYKHMKAT 126
Query: 154 DPLCRSPFKCQNG-KC-VYTRRYHVGDVTRGLASRETFAFPVRNG---FTFVPRLAFGCS 208
DP+C P+ G +C YT ++V G + FAF T V +L FGC+
Sbjct: 127 DPMCTPPYTPSVGNRCSFYTTTWNV--AAHGYLGSDMFAFAGTGAGGHSTDVDQLIFGCA 184
Query: 209 NDNSGFA--FGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVRE----MEATSVIK 260
+ G G ++G L + P+S SQL R FSYCL E + ++
Sbjct: 185 HTTDGLERLSHGVLAGALSLSRHPMSFLSQLTARGLADSRFSYCLFPEQSHPIAKHGFLR 244
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI---GRHIVRFPPGAFD-IMRDGTGG 316
FGRD R +T +L + H+ + I GR I+R P F ++ GG
Sbjct: 245 FGRDIP-RHDHAHSTSLLFTGPGSGGMYHIRVVGISLNGRRIMRLQPAMFTRNLQTRRGG 303
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
++D GTP+T + Y + ++ G +R Q C+ PS+
Sbjct: 304 SVVDPGTPLTRLVRQAYDIVEAEVVANMQKQGARRAKAQV-QGHRLCF-VSWGHVHLPSL 361
Query: 377 TFHLQE--ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
T ++ E A ++PE + F + C + D + ++LGA QQ + +DL+ L
Sbjct: 362 TINMYEDTAKLFIKPE-LLFRKVTARLLCFTVMPDEEMTVLGAAQQMDTRFTFDLHANRL 420
Query: 435 RFGSENC 441
F ENC
Sbjct: 421 YFAQENC 427
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 154/364 (42%), Gaps = 23/364 (6%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
+ FY+ + +GTP + ++ DT S++ + C+ C C T FDP STT ++
Sbjct: 9 RHSYFYTT-LKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLA 67
Query: 152 CDDPLCRS---PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
C DPLC C N +C Y+R Y + G +TF FP + RL FGC
Sbjct: 68 CGDPLCNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---RLVFGCE 124
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDAD 266
N +G + GI+G + + SQL R I+ +FS C + ++ G
Sbjct: 125 NGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDG--ILLLGDVTL 182
Query: 267 VRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
+ TP LL+ L H+Y + + I++ + F FD G ++D+GT
Sbjct: 183 PEGANTVYTP-LLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGY----GTVLDSGTTF 237
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR-----YDSSFKAYPSMTFHL 380
T++ ++ + + + G Q P Q D C++ + K +P F
Sbjct: 238 TYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFVF 297
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-SILGAWQQQNMLIIYDLNVPALRFGSE 439
+ P Y +C+ I D+ +++G +++++ YD + F +
Sbjct: 298 GGGAKLTLPPLRYLFLSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTTM 357
Query: 440 NCAN 443
CA+
Sbjct: 358 ACAD 361
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 110/423 (26%), Positives = 174/423 (41%), Gaps = 51/423 (12%)
Query: 37 IPIFSPESPLYPG-NLSQSER-IHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQD 94
+P+ P P +LS R +F S+AR +Y+ K + HL +
Sbjct: 22 VPLVHRHGPCAPAPSLSTDTRSFADIFRRSRARPSYIVRGKKVSV-----PAHLGTSVMS 76
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPC 152
L Y V V+ GTP PQ ++ DT S + W QC+PC +CF Q P++DP S+TYS +PC
Sbjct: 77 LEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPC 136
Query: 153 DDPLCRS------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
+C+ C +GK C + Y G T G S++ V F
Sbjct: 137 ASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPG---AIVQNFYF 193
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDA 265
GC + A G G+LG L L R G+FSYCL + G A
Sbjct: 194 GCGHGK--HAVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSVSSKPGFLALG--A 245
Query: 266 DVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
TP+ +P F + L I++G + P AF +GG I+D+GT
Sbjct: 246 GKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------SGGMIVDSGTV 299
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR---YDSSFKAYPSMTFHLQ 381
+T +++ Y+ L + + + + R+ N + D CY Y + ++TF
Sbjct: 300 ITGLQSTAYRALRSAFRKAMEAY---RLLPNG--DLDTCYNLTGYKNVVVPKIALTFTGG 354
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ P + C+A + D +LG Q+ +++D + F +
Sbjct: 355 ATINLDVPNGILV------NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRA 408
Query: 439 ENC 441
+ C
Sbjct: 409 KAC 411
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 123/433 (28%), Positives = 179/433 (41%), Gaps = 57/433 (13%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKP 109
N + ER+ + E + R +ASM+ + P+ + Y E IG P +
Sbjct: 45 NCTTKERMRRATERTHRR---LASMAGGGG-----EASAPIHWNETQYIAEYLIGDPPQQ 96
Query: 110 QHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPCDDPLC--RSPFKC-Q 164
+ DT S+L+WTQC C CF Q +DP S T + C+D C S +C +
Sbjct: 97 AAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACLLGSETRCAR 156
Query: 165 NGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC---SNDNSGFAFGGKI 220
+GK C Y G + G E F F LAFGC S G G
Sbjct: 157 DGKACAVLTAYGAGAI-GGFLGTEVFTFGHGQSSENNVSLAFGCITASRLTPGSLDGA-- 213
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCLV-----REMEATSVIKFGRDADVRRRDLETT 275
SGI+G LSL SQL + FSYCL +T + +
Sbjct: 214 SGIIGLGRGKLSLPSQLGDN---KFSYCLTPYFSDAANTSTLFVGASAGLSGGGAPATSV 270
Query: 276 PILLS-DLRP---HFYLHLLEISIGRHIVRFPPGAFDIMRDGT---GGFIIDTGTPVTFI 328
P L + D P +YL L I++G + P AFD+ GG +ID+G+P T +
Sbjct: 271 PFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKWGGTLIDSGSPFTSL 330
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRY---DSSFKAYPSMTFHL---- 380
+ YQ L D+++R LG +P A E D C + K P + H
Sbjct: 331 IDVAYQALR---DELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLVPPLVLHFGSGG 387
Query: 381 -QEADYIVQPENMYFIEPDRGRFCVAI--QDDP-------KYSILGAWQQQNMLIIYDLN 430
D +V PEN Y+ D C+ + P + +I+G + QQ+M ++YDL
Sbjct: 388 GGGGDVVVPPEN-YWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQQDMHLLYDLG 446
Query: 431 VPALRFGSENCAN 443
L F +C++
Sbjct: 447 QGVLSFQPADCSS 459
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 145/370 (39%), Gaps = 43/370 (11%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPC 152
L Y V + IGTP Q +L DT S L W QC+PC C+ Q P+FDP S+T++ IPC
Sbjct: 123 LEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPC 182
Query: 153 DDPLCRS-PFK-----CQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV 200
C+ P C N +C Y Y G +T G+ S ET A V
Sbjct: 183 ASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL---GSSAVV 239
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK 260
FGC +D G K G+LG +P SL SQ + G FSYCL +
Sbjct: 240 KSFRFGCGSDQHGPY--DKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNSGAGFLT 297
Query: 261 FGRDADVRRRDLETTPILLSDLRPH----FYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
G + + P + + L IS+G + PP F G
Sbjct: 298 LGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVF------AKG 351
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY--- 373
I+D+GT +T I Y+ L + ++ + A D CY +
Sbjct: 352 NIVDSGTVITGIPTTAYKALRTAFRS---AMAEYPLLPPADSALDTCYNFTGHGTVTVPK 408
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDLNV 431
++TF + P + + C+A D D + I+G + + ++YD
Sbjct: 409 VALTFVGGATVDLDVPSGVLVED------CLAFADAGDGSFGIIGNVNTRTIEVLYDSGK 462
Query: 432 PALRFGSENC 441
L F + C
Sbjct: 463 GHLGFRAGAC 472
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 112/428 (26%), Positives = 185/428 (43%), Gaps = 43/428 (10%)
Query: 33 SLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMA 91
+L++ IFSP SP P LS ++ + +M +AR +++S+ +F +
Sbjct: 40 TLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASAR--QL 97
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
Q + V IGTP + L DT++ W C CI C +T +F S+++ +P
Sbjct: 98 IQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLP 155
Query: 152 CDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF----VPRLA 204
C P C +P C C + Y V L V++ T VP
Sbjct: 156 CQSPQCNQVPNP-SCSGSACGFNLTYGSSTVAADL---------VQDNLTLATDSVPSYT 205
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFG 262
FGC +G + + LG L SQ + Q FSYCL + + + ++ G
Sbjct: 206 FGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQ--SLYQSTFSYCLPSFKSVNFSGSLRLG 263
Query: 263 RDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
A R ++ TP+L + R +Y++L+ I +GR IV PP A G +ID+
Sbjct: 264 PVAQPIR--IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDS 321
Query: 322 GTPVTFIR-NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
GT TF R P T ++ D+ R +GR + ++ FD CY P++TF
Sbjct: 322 GT--TFTRLVAPAYTAVR--DEFRRRVGRN-VTVSSLGGFDTCYTVP---IISPTITFMF 373
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALR 435
+ + P+N C+A+ P +++ + QQQN I++D+ +
Sbjct: 374 AGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVG 433
Query: 436 FGSENCAN 443
E+C++
Sbjct: 434 VARESCSS 441
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 150/348 (43%), Gaps = 65/348 (18%)
Query: 99 VEVNIGTPM-KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC 157
+ + +GTP+ + L D S VW QC P TY +
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPL-----------------TYGGSAAN---- 128
Query: 158 RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
T G + +TF F G T VP + FGCS+ + G F
Sbjct: 129 ----------------------TSGYLATDTFTF----GATAVPGVVFGCSDASYG-DFA 161
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-----EMEATSVIKFGRDADVRRRDL 272
G SG++G LSL SQL+ G FSY L+ + A SVI+FG DA + +
Sbjct: 162 GA-SGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRG 217
Query: 273 ETTPILLSDLRPHFY-LHLLEISI-GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+TP+L S L P FY ++L + + G + P G FD+ +GTGG I+ + TPVT++
Sbjct: 218 RSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQ 277
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQ-EADYIVQ 388
Y + + +G + +A+ E D CY S K P +T AD +
Sbjct: 278 AAYDVVRA---AVASRIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLS 334
Query: 389 PENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
N ++I+ D G C+ + S+LG Q +IYD++ L F
Sbjct: 335 AANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRLTF 382
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 176/396 (44%), Gaps = 54/396 (13%)
Query: 83 LEDIHLPM-----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L +I LP+ A Y ++ +G+P K ++ DT S ++W C PC +C +T
Sbjct: 58 LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 117
Query: 137 ----PIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTR--RYHV----GDVTRGLASR 186
++D +AS+T + C+D C F Q+ C + YHV G + G +
Sbjct: 118 GIPLSLYDSKASSTSKNVGCEDAFCS--FIMQSETCGAKKPCSYHVVYGDGSTSDGDFVK 175
Query: 187 ETFAFPVRNGFTFVPRLA----FGCSNDNSGFAFG---GKISGILGFNASPLSLSSQLR- 238
+ G LA FGC + SG G + GI+GF S S+ SQL
Sbjct: 176 DNITLDQVTGNLRTAPLAQEVVFGCGKNQSG-QLGQTESAVDGIMGFGQSNTSVISQLAA 234
Query: 239 -NRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
++ +FS+CL M + G +V ++TTP++ + + H+ + L + +
Sbjct: 235 GGSVKRIFSHCL-DNMNGGGIFAIG---EVESPVVKTTPLVPNQV--HYNVILKGMDVDG 288
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ PP +G GG IID+GT + ++ Y +L+++ +Q++ +
Sbjct: 289 EPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKI------TAKQQVKLHMV 340
Query: 358 QEFDYCYRYDSSF-KAYPSMTFHLQEA--------DYIVQ-PENMYFIEPDRGRFCVAIQ 407
QE C+ + S+ KA+P + H +++ DY+ E+MY G + Q
Sbjct: 341 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG--MTTQ 398
Query: 408 DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D +LG N L++YDL + + NC++
Sbjct: 399 DGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 434
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 124/432 (28%), Positives = 174/432 (40%), Gaps = 78/432 (18%)
Query: 69 NYMASMSKPNAFQ-ELEDIHLPMAKQDLF------YSVEVNIGTPMKPQHLLFDTASSLV 121
N++AS+S A + + + K LF YS+ +N GTP + + DT SSLV
Sbjct: 48 NHLASLSLSRAHHIKSPKTNFSLIKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLV 107
Query: 122 WTQCQP---CIRC----FDQT-TPIFDPRASTTYSEIPCDDPLCRSPF------KCQN-- 165
W C C C +T P F P+ S++ I C +P C F KCQ
Sbjct: 108 WFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECD 167
Query: 166 ---GKCV-----YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
C Y +Y G T GL ET FP + +P GCS F
Sbjct: 168 STAQNCTQTCPPYVIQYGSGS-TAGLLLSETLDFPNKKT---IPDFLVGCS------IFS 217
Query: 218 GKI-SGILGFNASPLSLSSQLRNRIQGLFSYCLVRE------MEATSVIKFGRDADVRRR 270
K GI GF SP SL SQL + FSYCLV + V+ G + V +
Sbjct: 218 IKQPEGIAGFGRSPESLPSQLGLK---KFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKT 274
Query: 271 -DLETTPIL---LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
L TP L + R ++Y+ L I IG V+ P DG GG I+D+GT T
Sbjct: 275 AGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFT 334
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY------CYRYDSSFK-AYPSMTFH 379
F+ N Y+ + + ++ +Q Y + E CY + P + F
Sbjct: 335 FMENPVYELVAKEFE-------KQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQ 387
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS---------ILGAWQQQNMLIIYDLN 430
+ + P + YF D G C+ I D ILG +QQ+N + +DL
Sbjct: 388 FKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLE 447
Query: 431 VPALRFGSENCA 442
F ++CA
Sbjct: 448 NEKFGFKQQSCA 459
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 159/380 (41%), Gaps = 41/380 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP-------CIRCFDQTTPIFDPRASTTYSE 149
Y V + GTP + L+ DT S L+W QC C + P F S T S
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 113
Query: 150 IPCDDPLCR-SPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAFP-VRNGFT 198
+PC C P +G C Y Y G T G +R+T +G
Sbjct: 114 VPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGA 173
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-----REM 253
V +AFGC N G +F G G++G LS +Q + FSYCL+ R
Sbjct: 174 AVRGVAFGCGTRNQGGSFSGT-GGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 232
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRD 312
++S + GR RR TP++ + L P FY + ++ I +G ++ P + I
Sbjct: 233 RSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVL 290
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS--QEFDYCYRYDSSF 370
G GG +ID+G+ +T++R G Y L+ + S+ RIP +A+ Q + CY SS
Sbjct: 291 GNGGTVIDSGSTLTYLRLGAYLHLVSAFAA---SVHLPRIPSSATFFQGLELCYNVSSSS 347
Query: 371 K------AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQ 421
+P +T + + P Y ++ C+AI+ +++LG QQ
Sbjct: 348 SLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQ 407
Query: 422 NMLIIYDLNVPALRFGSENC 441
+ +D + F C
Sbjct: 408 GYHVEFDRASARIGFARTEC 427
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 125/455 (27%), Positives = 197/455 (43%), Gaps = 60/455 (13%)
Query: 9 LAAFFSYF--SVLFLTHFTSSE----STGFSLKLIPIFSPESPLYPGNLSQSERIHKMFE 62
+AA S F +LFL F+ + GF+ L S SPL +LS +R+ F
Sbjct: 1 MAATISLFFHLILFLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAFR 60
Query: 63 ISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVW 122
S +R+ + + + + L+ S+ IGTP + DT S L W
Sbjct: 61 RSLSRSAALLNRAATSGAVGLQS------------SI---IGTPPVDYLGIADTGSDLTW 105
Query: 123 TQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKC----VYTRRYHVGD 178
QC PC++C+ Q PIF+P ST++S +PC+ C + +G C V Y GD
Sbjct: 106 AQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHA---VDDGHCGVQGVCDYSYTYGD 162
Query: 179 VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS-GFAFGGKISGILGFNASPLSLSSQL 237
T S+ F + + GC + +S GF F SG++G LSL SQ+
Sbjct: 163 RTY---SKGDLGFEKITIGSSSVKSVIGCGHASSGGFGFA---SGVIGLGGGQLSLVSQM 216
Query: 238 R--NRIQGLFSYCLVREM-EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEIS 294
+ I FSYCL + A I FG++A V + +TP++ + ++Y+ L IS
Sbjct: 217 SQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEAIS 276
Query: 295 IG--RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI 352
IG RH+ AF G IID+GT ++F+ Y ++ +++++ R +
Sbjct: 277 IGNERHM------AFAKQ----GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKA-KRVKD 325
Query: 353 PYNASQEFDYCYRYD---SSFKAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQD 408
P N +D C+ ++ P +T A+ + P N + +
Sbjct: 326 PGNF---WDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPA 382
Query: 409 DP--KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P ++ I+G N LI YDL L F C
Sbjct: 383 SPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 184/418 (44%), Gaps = 34/418 (8%)
Query: 31 GFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLP 89
G +L+++ ++SP SP P LS E + +M KAR +++S+ + +
Sbjct: 36 GSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSVVPIASGR-- 93
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
Q+ Y V IGTP + + DT+S + W C C+ C ++ +F+ ASTTY
Sbjct: 94 QIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKS 150
Query: 150 IPCDDPLCRSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
+ C C+ K C G C + Y + L S++T VP +FGC
Sbjct: 151 LGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLAANL-SQDTITLATDA----VPGYSFGC 205
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDA 265
+G + + LG SL SQ +N Q FSYCL + + + ++ G
Sbjct: 206 IQKATGGSLPAQGLLGLGRGPL--SLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVG 263
Query: 266 DVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+R ++ TP+L + RP Y ++L+ + +GR +V PPG+F G I D+GT
Sbjct: 264 QPKR--IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTV 321
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEAD 384
T + Y + D +GR + + FD CY A P++TF +
Sbjct: 322 FTRLVTPAY---IAVRDAFRNRVGRN-LTVTSLGGFDTCYTVP---IAAPTITFMFTGMN 374
Query: 385 YIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFG 437
+ P+N+ C+A+ P +++ QQQN ++YD VP R G
Sbjct: 375 VTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYD--VPNSRLG 430
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 161/377 (42%), Gaps = 39/377 (10%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
+P+ +Y IGTP +P + D A LVWTQC C RCF Q P+F P AS+T+
Sbjct: 53 VPIRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTF 112
Query: 148 SEIPCDDPLCRS-PFK-CQNGKCVYTR-RYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
PC +C S P + C C Y + T G A+ +TFA T RLA
Sbjct: 113 KPEPCGTAVCESIPTRSCSGDVCSYKGPPTQLRGNTSGFAATDTFAI-----GTATVRLA 167
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGR 263
FGC + G SG +G +P SL +Q++ FSYCL R +S + G
Sbjct: 168 FGCVVASDIDTMDGP-SGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLFLGS 223
Query: 264 DADVRRRDLETTPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG-FI 318
A + + +T + D H+YL L+ +R G I +GG +
Sbjct: 224 SAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLD------AIR--AGNTTIATAQSGGILV 275
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--AYPSM 376
+ T +P + + + Y+ + + + + Q FD C++ + F P +
Sbjct: 276 MHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPM-ATPPQPFDLCFKKAAGFSRATAPDL 334
Query: 377 TFHLQEADYIVQPENMYFIE--PDRGRFCVAI--------QDDPKYSILGAWQQQNMLII 426
F Q A + P Y I+ ++ C AI S+LG+ QQ+++ +
Sbjct: 335 VFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFL 394
Query: 427 YDLNVPALRFGSENCAN 443
YDL L F +C++
Sbjct: 395 YDLKKETLSFEPADCSS 411
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 167/376 (44%), Gaps = 43/376 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEIP 151
Y ++ IGTP KP H+ DT S ++W C C +C ++ ++DP+ S++ S +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 152 CDDPLCRSPF-------KCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV--- 200
CD+ C + + C GK C Y Y G T G ++ + +G
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRHA 206
Query: 201 -PRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREMEA 255
+ FGC G + + GI+GF S S SQL + ++ +FS+CL ++
Sbjct: 207 KANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL-DTIKG 265
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
+ G +V + +++TP+L + H+ ++L I + + ++ PP F+
Sbjct: 266 GGIFAIG---EVVQPKVKSTPLLPN--MSHYNVNLQSIDVAGNALQLPPHIFETSE--KR 318
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYP 374
G IID+GT +T++ Y+ ++ Q Q I + Q F C+ Y S +P
Sbjct: 319 GTIIDSGTTLTYLPELVYKDILAAVFQK-----HQDITFRTIQGF-LCFEYSESVDDGFP 372
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIY 427
+TFH ++ + + YF + +C+ Q D +LG N +++Y
Sbjct: 373 KITFHFEDDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVY 432
Query: 428 DLNVPALRFGSENCAN 443
DL + + NC++
Sbjct: 433 DLEKQVIGWTDYNCSS 448
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/409 (24%), Positives = 171/409 (41%), Gaps = 30/409 (7%)
Query: 35 KLIPIFSPESPLYPGNLSQSERIHKMFEISKAR-ANYMASMSKPNAFQELEDIHLPMAKQ 93
KLI S P Y N + +R+ + S AR AN A + + +
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQARIEGSLVSNNDYKARVSPSLT 97
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
++IG P PQ ++ DT S ++W C PC C + +FDP S+T+S
Sbjct: 98 GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFS----- 152
Query: 154 DPLCRSPFKCQNGKCV---YTRRYHVGDVTRGLASRETFAFPVRN-GFTFVPRLAFGCSN 209
PLC++P + +C +T Y G R+T F + G + + + FGC +
Sbjct: 153 -PLCKTPCDFEGCRCDPIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGH 211
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME---ATSVIKFGRDAD 266
N G +GILG N P SL ++L + FSYC+ + + G AD
Sbjct: 212 -NIGHDTDPGHNGILGLNNGPDSLVTKLGQK----FSYCIGNLADPYYNYHQLILGEGAD 266
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ +TP + +Y+ + IS+G + P F++ + GG IIDTG+ +T
Sbjct: 267 LEGY---STPFEV--YNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTIT 321
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
F+ + ++ L + +L RQ + + +P +TFH + +
Sbjct: 322 FLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADL 381
Query: 387 VQPENMYFIEPDRGRFCVAI------QDDPKYSILGAWQQQNMLIIYDL 429
+F + + FC+ + K S++G QQ+ + YDL
Sbjct: 382 ALDSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDL 430
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 171/378 (45%), Gaps = 47/378 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-------FDQTTPIFDPRASTTYS 148
Y E+ +GTP K ++ DT S ++W C C +C D T +DP+AS++ S
Sbjct: 83 LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLT--FYDPKASSSGS 140
Query: 149 EIPCDDPLCRSPFKCQ------NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV-- 200
+ CD C + + + N C Y+ Y G T G + F G
Sbjct: 141 TVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQP 200
Query: 201 --PRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREME 254
+ FGC G + + GILGF + S+ SQL +++ +F++CL ++
Sbjct: 201 GNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCL-DTIK 259
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI-MRDG 313
+ G +V + ++TTP L++D+ PH+ ++L I +G ++ P F+ R G
Sbjct: 260 GGGIFAIG---NVVQPKVKTTP-LVADM-PHYNVNLKSIDVGGTTLQLPAHVFETGERKG 314
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KA 372
T IID+GT +T++ ++ +M I Q I ++ Q+F C++Y S
Sbjct: 315 T---IIDSGTTLTYLPELVFKEVMA---AIFNK--HQDIVFHNVQDF-MCFQYPGSVDDG 365
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLI 425
+P++TFH ++ + + YF +CV Q D ++G N L+
Sbjct: 366 FPTITFHFEDDLALHVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLV 425
Query: 426 IYDLNVPALRFGSENCAN 443
IYDL + + NC++
Sbjct: 426 IYDLENQVIGWTDYNCSS 443
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 126/458 (27%), Positives = 186/458 (40%), Gaps = 58/458 (12%)
Query: 22 THFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMAS----MSKP 77
T + S GFS++ I S SP + +L+ R+ + S RA ++ + P
Sbjct: 25 TAYVGSGGDGFSVEFIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAP 84
Query: 78 NAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ--------PCI 129
+A + + + Y + VNIGTP + DT S L+W C
Sbjct: 85 SADGFVSE----LTSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAA 140
Query: 130 RCFDQTTP--IFDPRASTTYSEIPCDDPLCRSPFKCQNG---KCVYTRRYHVGDVTRGLA 184
R D P FDP STT+ + CD C + G KC Y+ Y G T G+
Sbjct: 141 RDADAQPPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVL 200
Query: 185 SRETFAF---PVRNGFTFVPRLA---FGCSNDNSGFAFGGKISGILGFNASPLSLSSQL- 237
S ETF F P G R+A FGCS F G++G LSL SQL
Sbjct: 201 STETFTFADAPGARGDGTTTRVANVNFGCSTT---FVGSSVGDGLVGLGGGDLSLVSQLG 257
Query: 238 -RNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI 295
+ FSYCLV ++A+S + FG A V TTP++ S ++ ++ + L + +
Sbjct: 258 ADTSLGRRFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKV 317
Query: 296 GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN 355
G P I+D+GT +TF+ L++ GR ++P
Sbjct: 318 GNKTFEAP---------DRSPLIVDSGTTLTFLPEALVDPLVKEL------TGRIKLPPA 362
Query: 356 ASQE------FDYCYRYDSSFKAY-PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD 408
S E FD + A P +T L + F+E G C+A+
Sbjct: 363 QSPERLLPLCFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSA 422
Query: 409 DPK---YSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ SI+G QQNM + YDL+ + F CA+
Sbjct: 423 MSEQFPASIIGNIAQQNMHVGYDLDKGTVTFAPAACAS 460
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/431 (25%), Positives = 181/431 (41%), Gaps = 41/431 (9%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQE---LEDIH 87
GF+ +LI SP SP Y + + R + A +Y A + + N +
Sbjct: 36 GFTAELIRRDSPNSPFYNALEAAATRS------TNASQHYDAQIGRFNLMSDSYYASQSE 89
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
L +K + Y +++++GTP L D L W C+ C C F P S+TY
Sbjct: 90 LNFSKGN--YLIKISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFF-PSESSTY 146
Query: 148 SEIPCDDPLCR--SPFKCQNGKCVY----TRRYHVGDVTRGLASRETFAFPVRNGFTF-V 200
+ C+ C+ + CQ C+Y + +GL + +T +F +G
Sbjct: 147 TSAACESYQCQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQALSY 206
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVI 259
P F C + + G +GI+G S++SQ+++ I G FS CLV + +S I
Sbjct: 207 PNTNFICGTFIDNWHYIG--AGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQSSKI 264
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
FG V + +TPI ++L L +S+G + V A + I
Sbjct: 265 NFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRV-----ANNFYSAPKSNIYI 319
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS--SFKAYPSMT 377
D T T + + Y+ + ++ +++ I YN ++ CY+ +S F A P +T
Sbjct: 320 DWRTTFTSLPHDFYENVEA---EVRKAINLTPINYNNERKLSLCYKSESDHDFDA-PPIT 375
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP-------KYSILGAWQQQNMLIIYDLN 430
H AD + P N F+ D C A D +++ G+WQQ N ++ YDL
Sbjct: 376 MHFTNADVQLSPLNT-FVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVGYDLK 434
Query: 431 VPALRFGSENC 441
+ F +C
Sbjct: 435 SSTVSFKQADC 445
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 169/381 (44%), Gaps = 43/381 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
+S+++ IG+ K + DT S V QC ++ P+FDP AS +Y ++PC
Sbjct: 99 LFSMQLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQ 152
Query: 156 LCRS-----------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-- 202
LC + P + C Y+ Y + G S++ N +
Sbjct: 153 LCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFR 212
Query: 203 -LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLVR---EMEATS 257
+AFGC++ GF GI+GFN LSL SQL++R+ G FSYC + AT
Sbjct: 213 DVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATG 272
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPH----FYLHLLEISIGRHIVRFPPGAFDIM-RD 312
VI G D+ + + + TP+L + + P +Y+ L IS+ + P AF +
Sbjct: 273 VIFLG-DSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPST 331
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD--SSF 370
G GG ++D+GT T + + Y + RS R+++ A+ FD CY SS
Sbjct: 332 GDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKV--GAAAGFDDCYNISAGSSL 389
Query: 371 KAYPSMTFHLQEADYI-VQPENMYFIEPDRGR---FCVAIQDDP-----KYSILGAWQQQ 421
P + LQ + ++ E+++ G C+AI K ++LG +QQ
Sbjct: 390 PGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQS 449
Query: 422 NMLIIYDLNVPALRFGSENCA 442
N L+ YD + F +C+
Sbjct: 450 NYLVEYDNERSRVGFERADCS 470
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 107/413 (25%), Positives = 160/413 (38%), Gaps = 76/413 (18%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ----------------------------PC 128
Y V +GTP +P L+ DT S L W +C+
Sbjct: 55 YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114
Query: 129 IRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKCQ-----NGKCVYTRRYHVGDVTR 181
+F P S T++ IPC C + PF C Y RY G R
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAAR 174
Query: 182 GLASRE--TFAFPVRNGFTFVPR-----LAFGCSNDNSGFAFGGKISGILGFNASPLSLS 234
G + T A R R + GC+ +G +F G+L S +S +
Sbjct: 175 GTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLAS-DGVLSLGYSNVSFA 233
Query: 235 SQLRNRIQGLFSYCLVREM---EATSVIKFGRDADVRRRD--------------LETTPI 277
S+ R G FSYCLV + ATS + FG + V TP+
Sbjct: 234 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPL 293
Query: 278 LLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTL 336
LL +RP + + + +S+ ++R P +D+ + GG I+D+GT +T + + Y+ +
Sbjct: 294 LLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKG--GGAILDSGTSLTVLVSPAYRAV 351
Query: 337 MQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK------AYPSMTFHLQEADYIVQPE 390
+ + L L P A FDYCY + S A P++ H + + P
Sbjct: 352 VAALGKKLVGL-----PRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPP 406
Query: 391 NMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
Y I+ G C+ +Q D P S++G QQ L +DL LRF C
Sbjct: 407 KSYVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/425 (25%), Positives = 182/425 (42%), Gaps = 29/425 (6%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GF + L+ S ESP Y NL+ +E S AR + + S+ N ++ M
Sbjct: 35 GFKVPLLHWLSTESPFYEPNLTLAELTQASIRTSGARGDSIRSIMSGNITSSMKYPISRM 94
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQP--CIRCFDQTTPIFDPRASTTYS 148
+ D Y ++ +IG+P + + D+ SSLVW QC C C+ Q P+F+P S TY
Sbjct: 95 SYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYM 154
Query: 149 EIPCDDPLCRSP-----FKCQ--NGKCVYTRRYHVGDVTRGLASRETFAFPVR-NGF-TF 199
+ C+ CR ++C+ N C Y Y T G+ S + F FP +GF +
Sbjct: 155 KRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNY 214
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV----REMEA 255
R+ FGC +NS G++G + SL Q+ FSYC+ + ++
Sbjct: 215 TLRIIFGCGYNNSDPQHFYP-PGLVGLTNNKASLVGQMD---VDQFSYCVSIDTEQNLKG 270
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVR-FPPGAFDIMRDGT 314
+ I+FG A + + P + + + ++ I + V +P F G
Sbjct: 271 SMEIRFGLAASISGHSTQLVP---NSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQ 327
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY- 373
GG +DTGT T + N L++ ++ + + + ++ F+ CY D A
Sbjct: 328 GGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDY---SNSGFELCYFSDDFLGATL 384
Query: 374 --PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNV 431
+ F + Y + R + C+A+ SI+G Q +++ I YDL+
Sbjct: 385 PDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMFRTNGMSIIGMHQLRDIKIGYDLHH 444
Query: 432 PALRF 436
+ F
Sbjct: 445 NIVSF 449
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/349 (26%), Positives = 146/349 (41%), Gaps = 37/349 (10%)
Query: 112 LLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC-------------DDPLC 157
++ DT SSL W QCQPC + C Q P++DP S TY ++ C +DPLC
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 158 RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
+ + C+YT Y + G S++ +P+ +GC DN G
Sbjct: 61 ET----DSNACLYTASYGDTSFSIGYLSQDLLTLTSSQT---LPQFTYGCGQDNQGLF-- 111
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPI 277
G+ +GI+G LS+ +QL + FSYCL +S F + + TP+
Sbjct: 112 GRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPM 171
Query: 278 LLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTL 336
L P Y L L I++ + + + +ID+GT +T + Y L
Sbjct: 172 LTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------LIDSGTVITRLPMSMYAAL 225
Query: 337 MQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SSFKAYPSMTFHLQEADYIVQPENMYFI 395
Q + +I+ + + Y+ D C++ S A P + Q + I
Sbjct: 226 RQAFVKIMSTKYAKAPAYSI---LDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILI 282
Query: 396 EPDRGRFCVAIQDDP---KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
E D+G C+A + +I+G QQQ I YD++ + F +C
Sbjct: 283 EADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 164/345 (47%), Gaps = 36/345 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP-RASTTYSEIPCDDP 155
Y +++ +GTP + L DT S LVW QC PC C+ Q P+FDP + ++ + C
Sbjct: 31 YLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKECNSFFDHSC--- 87
Query: 156 LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFA 215
SP K C Y Y T+G+ ++E F +G V + FGC ++N+G
Sbjct: 88 ---SPEKA----CDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIFGCGHNNTG-V 139
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGL-FSYCLV---REMEATSVIKFGRDADVRRRD 271
F G++G PLSL SQ+ N FS CLV + + I G +DV
Sbjct: 140 FNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGEASDVSGEG 199
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+ TTP++ + + + + L IS+G V F + +++ G +ID+GTP T++
Sbjct: 200 VVTTPLVSEEGQTPYLVTLEGISVGDTFVPF--NSSEMLSKGN--IMIDSGTPETYLPQ- 254
Query: 332 PYQTLMQRYDQILRSLGRQ----RIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
+ YD+++ L Q I + CY+ +++ + P +T H + AD +
Sbjct: 255 ------EFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLEG-PILTAHFEGADVKL 307
Query: 388 QPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQQNMLIIYDLN 430
P FI P G FC A+ D Y I G + Q N+LI +DL+
Sbjct: 308 LPLQT-FIPPKDGVFCFAMTGTTDGLY-IFGNFAQSNVLIGFDLD 350
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 159/380 (41%), Gaps = 40/380 (10%)
Query: 86 IHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP---I 138
+ LPM+ Y V+V +GTP + L+ DT S L W ++C +P +
Sbjct: 76 VSLPMSSGAYAGTGQYFVKVLVGTPAQEFTLVADTGSELTW------VKCAGGASPPGLV 129
Query: 139 FDPRASTTYSEIPCDDPLCR--SPFKCQN-----GKCVYTRRYHVGDV-TRGLASRETFA 190
F P AS +++ +PC C+ PF N C Y RY G G+ ++
Sbjct: 130 FRPEASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSAT 189
Query: 191 FPVRNG-FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL 249
+ G + + GCS+ + G +F + G+L + +S +S+ R G FSYCL
Sbjct: 190 IALPGGKVAQLQDVVLGCSSTHDGQSF-KSVDGVLSLGNAKISFASRAAARFGGSFSYCL 248
Query: 250 VREM---EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGA 306
V + AT + FG V R T + L P + + + + + + P
Sbjct: 249 VDHLAPRNATGYLAFG-PGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEV 307
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY 366
+D +GG I+D+GT +T + Y+ ++ ++L + + P F++CY +
Sbjct: 308 WDPK---SGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP-----PFEHCYNW 359
Query: 367 DS---SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQ 421
+ P + + P Y I+ G C+ +Q+ P S++G QQ
Sbjct: 360 TAPRPGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQ 419
Query: 422 NMLIIYDLNVPALRFGSENC 441
L +DL +RF C
Sbjct: 420 EHLWEFDLKNMEVRFMPSTC 439
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 120/434 (27%), Positives = 198/434 (45%), Gaps = 52/434 (11%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISK--ARANYMASMSKPNAFQELEDIHL 88
G +L++ F P SPL PG + S + S+ +R Y+ S+ A + +
Sbjct: 43 GNTLQVSHAFGPCSPLGPGTAAPSWAGFLADQASRDASRLLYLDSL----AVRGRARAYA 98
Query: 89 PMAK-----QDLFYSVEVNIGTPMKPQHLLF--DTASSLVWTQCQPCIRCFDQTTPIFDP 141
P+A Q L Y V ++GTP PQ LL DT++ W C C C + FDP
Sbjct: 99 PIASGRQLLQTLTYVVRASLGTP--PQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDP 156
Query: 142 RASTTYSEIPCDDPLC-RSP-FKCQNG--KCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
AS +Y +PC PLC ++P C G C ++ Y + L S+++ A
Sbjct: 157 AASASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAAL-SQDSLAV----AG 211
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEA 255
V FGC +G A + LG S SQ ++ + FSYCL + +
Sbjct: 212 NAVKAYTFGCLQRATGTAAPPQGLLGLGRGPL--SFLSQTKDMYEATFSYCLPSFKSLNF 269
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ ++ GR+ +R ++TTP+L + R +Y+++ + +GR +V P AFD T
Sbjct: 270 SGTLRLGRNGQPQR--IKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIP--AFD---PAT 322
Query: 315 G-GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
G G ++D+GT T + Y + D++ R +G P ++ FD C ++++ A+
Sbjct: 323 GAGTVLDSGTMFTRLVAPAYVAV---RDEVRRRVG---APVSSLGGFDTC--FNTTAVAW 374
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRF-CVAIQDDPK-----YSILGAWQQQNMLIIY 427
P MT L + + PE I G C+A+ P +++ + QQQN +++
Sbjct: 375 PPMTL-LFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLF 433
Query: 428 DLNVPALRFGSENC 441
D+ + F E C
Sbjct: 434 DVPNGRVGFARERC 447
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 120/464 (25%), Positives = 199/464 (42%), Gaps = 58/464 (12%)
Query: 9 LAAFFSYFSVLFLT--------HFTSSESTGFSLKLIPIFSPESPLYPGN-LSQSERIHK 59
L F FS+L L T ++ G +L++ I SP SP + LS R+ +
Sbjct: 4 LVLFLQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQ 63
Query: 60 MFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASS 119
+AR Y++S+ + + + Q Y V+ IGTP +P L DT+S
Sbjct: 64 TLAQDQARLQYLSSLVAGRSVVPIASGRQML--QSTTYIVKALIGTPAQPLLLAMDTSSD 121
Query: 120 LVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHV 176
+ W C C+ C T F P ST++ + C P C+ +P C C + Y
Sbjct: 122 VAWIPCSGCVGCPSNTA--FSPAKSTSFKNVSCSAPQCKQVPNP-TCGARACSFNLTYGS 178
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI---SGILGFNASPLSL 233
+ L S++T +R + FGC N +G GG I G+LG PLSL
Sbjct: 179 SSIAANL-SQDT----IRLAADPIKAFTFGCVNKVAG---GGTIPPPQGLLGLGRGPLSL 230
Query: 234 SSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHL 290
SQ ++ + FSYCL R + + ++ G + +R ++ T +L + R +Y++L
Sbjct: 231 MSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQR--VKYTQLLRNPRRSSLYYVNL 288
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ-------I 343
+ I +GR +V PP A G I D+GT T + Y+ + + + +
Sbjct: 289 VAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAV 348
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFC 403
+ SLG FD CY S P++TF + + + +N+ C
Sbjct: 349 VTSLG----------GFDTCY---SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSC 395
Query: 404 VAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+A+ P+ +++ + QQQN ++ D+ L E C+
Sbjct: 396 LAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 153/367 (41%), Gaps = 39/367 (10%)
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
+L+ IGTP + D LVWTQC CI CF Q P+F P AS+T+ PC
Sbjct: 51 ELYNVANFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCG 110
Query: 154 DPLCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
+C+S KC + C Y +G T G+ + +TFA G L FGC +
Sbjct: 111 TDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAI----GTAAPASLGFGCVVAS 166
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRR 270
GG SG +G +P SL +Q++ FSYCL + S + G A +
Sbjct: 167 DIDTMGGP-SGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASAKLAGG 222
Query: 271 D-----LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
++T+P +D +Y + L EI G + P G ++ +
Sbjct: 223 GAWTPFVKTSP---NDGMSQYYPIELEEIKAGDATITMPRGRNTVL-------VQTAVVR 272
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEAD 384
V+ + + YQ + ++ S+G F+ C+ + P + F Q
Sbjct: 273 VSLLVDSVYQEFKK---AVMASVGAAPTATPVGAPFEVCFP-KAGVSGAPDLVFTFQAGA 328
Query: 385 YIVQPENMYFIEPDRGRFCVAIQDDP--------KYSILGAWQQQNMLIIYDLNVPALRF 436
+ P Y + C+++ +ILG++QQ+N+ +++DL+ L F
Sbjct: 329 ALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSF 388
Query: 437 GSENCAN 443
+C++
Sbjct: 389 EPADCSS 395
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 114/416 (27%), Positives = 168/416 (40%), Gaps = 37/416 (8%)
Query: 49 GNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
GN + E + + K R ++ + P+ L Y E IG P +
Sbjct: 44 GNYTAEELVRRAVAAGKQRLAFLDAAMAGGGDGGGV--GAPVRWATLQYVAEYLIGDPPQ 101
Query: 109 PQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPCDDPLCRSP-----F 161
L DT S LVWTQC C+R C Q P ++ AS+T++ +PC +C + F
Sbjct: 102 RAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAARICAANDDIIHF 161
Query: 162 KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC---SNDNSGFAFGG 218
C Y G V G E FAF + LAFGC + G G
Sbjct: 162 CDLAAGCSVIAGYGAG-VVAGTLGTEAFAFQ-----SGTAELAFGCVTFTRIVQGALHGA 215
Query: 219 KISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REMEATSVIKFGRDADV-RRRDLET 274
SG++G LSL SQ FSYCL AT + G A + D+ T
Sbjct: 216 --SGLIGLGRGRLSLVSQTGAT---KFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMT 270
Query: 275 TPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG----TGGFIIDTGTPVTFIR 329
T + P +YL L+ +++G + P FD+ +GG IID+G+P T +
Sbjct: 271 TQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLV 330
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQP 389
+ Y L L G P + + C + P++ FH + + P
Sbjct: 331 HDAYDALASELAARLN--GSLVAPPPDADDGALCVARRDVGRVVPAVVFHFRGGADMAVP 388
Query: 390 ENMYFIEPDRGRFCVAIQDDPKY---SILGAWQQQNMLIIYDLNVPALRFGSENCA 442
Y+ D+ C+AI Y S++G +QQQNM ++YDL F +C+
Sbjct: 389 AESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCS 444
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 162/391 (41%), Gaps = 40/391 (10%)
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
A +S F E + LP+ Y V V +GTP K L+FDT S + WTQC+P
Sbjct: 90 ARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEP 149
Query: 128 CIR-CFDQTTPIFDPRASTTYSEIPCDDPLCR-----SPF--KCQNGKCVYTRRYHVGDV 179
C++ C+ Q P +P ST+Y I C LC+ F C + C+Y +Y G
Sbjct: 150 CVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY 209
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
+ G + ET N F FGC N+G +G+LG + L+L SQ
Sbjct: 210 SIGFFATETLTLSSSNVF---KNFLFGCGQQNNGLFG--GAAGLLGLGRTKLALPSQTAK 264
Query: 240 RIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRH 298
+ LFSYCL + + G + ++ TP+ D P + L + +S+G
Sbjct: 265 TYKKLFSYCLPASSSSKGYLSLGGQVS---KSVKFTPLSADFDSTPFYGLDITGLSVGGR 321
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
+ AF + G +ID+GT +T + Y L + ++ P +
Sbjct: 322 KLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY-----PSTSGY 370
Query: 359 E-FDYCY---RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPK 411
FD CY +YD+ +TF + + + + + C+A DD
Sbjct: 371 SIFDTCYDFSKYDTVRIPKVGVTFK-GGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSD 429
Query: 412 YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
SI G QQ+ ++YD + F C+
Sbjct: 430 TSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/421 (24%), Positives = 178/421 (42%), Gaps = 45/421 (10%)
Query: 46 LYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGT 105
++P + S E I + AR +++S K + + + + Y V +GT
Sbjct: 30 VHPPSPSPLESIIALARADDARLLFLSS--KAASSGGITSAPVASGQTPPSYVVRAGLGT 87
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC-------- 157
P++ L DT++ W+ C PC C + F P +S++Y+ +PC C
Sbjct: 88 PVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPC 145
Query: 158 ------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
+P C +++ + L S +R G + AFGC
Sbjct: 146 PANQDASAPLP----ACAFSKPFADTSFQASLGSDT-----LRLGKDAIAGYAFGCVGAV 196
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRR 269
+G G+LG P+SL SQ +R G+FSYCL R + ++ G A +
Sbjct: 197 AGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLG--AAGQP 254
Query: 270 RDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
R++ TP+L + RP +Y+++ +S+GR V+ P G+F G +ID+GT +T
Sbjct: 255 RNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRW 314
Query: 329 RNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEA-DY 385
Y L + + Q+ G Y + FD C+ D + P +T H+ D
Sbjct: 315 TAPVYAALREEFRRQVAAPSG-----YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDL 369
Query: 386 IVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ EN C+A+ + P+ +++ QQQN+ ++ D+ + F E
Sbjct: 370 TLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREP 429
Query: 441 C 441
C
Sbjct: 430 C 430
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 112/438 (25%), Positives = 181/438 (41%), Gaps = 50/438 (11%)
Query: 21 LTHFTSSESTGFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNA 79
LT ++ G +L++ +FSP SP P LS +E + ++ +AR ++ASM +
Sbjct: 22 LTPKCDTQDHGSTLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASMVAGRS 81
Query: 80 FQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ 134
+P+A Q Y V IGTP + L DT++ W C C C
Sbjct: 82 I-------VPIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGC--- 131
Query: 135 TTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF 191
T+ +F P STT+ + C P C SP C C + Y + +
Sbjct: 132 TSTLFAPEKSTTFKNVSCGSPECNKVPSP-SCGTSACTFNLTYGSSSIAANV-------- 182
Query: 192 PVRNGFTF----VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSY 247
V++ T +P FGC +G + + LG SL SQ +N Q FSY
Sbjct: 183 -VQDTVTLATDPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPL--SLLSQTQNLYQSTFSY 239
Query: 248 CL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPP 304
CL + + + ++ G A R ++ TP+L + R +Y++L I +GR IV PP
Sbjct: 240 CLPSFKSLNFSGSLRLGPVAQPIR--IKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPP 297
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
A G + D+GT T + Y + + + + + + + FD CY
Sbjct: 298 AALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCY 357
Query: 365 RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQ 419
P++TF + + +N+ C+A+ P +++ Q
Sbjct: 358 TVP---IVAPTITFMFSGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQ 414
Query: 420 QQNMLIIYDLNVPALRFG 437
QQN ++YD VP R G
Sbjct: 415 QQNHRVLYD--VPNSRLG 430
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 151/364 (41%), Gaps = 33/364 (9%)
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
+L+ IGTP + D LVWTQC CI CF Q P+F P AS+T+ PC
Sbjct: 21 ELYNVANFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCG 80
Query: 154 DPLCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
+C+S KC + C + +G T G+ + +TFA G L FGC +
Sbjct: 81 TDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAI----GTAAPASLGFGCVVAS 136
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRR 270
GG SG +G +P SL +Q++ FSYCL + S + G A +
Sbjct: 137 DIDTMGGP-SGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASAKLAGG 192
Query: 271 DLETTPILLS---DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
T + S + ++ + L EI G + P G ++ + V+
Sbjct: 193 GAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVL-------VQTAVVRVSL 245
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
+ + YQ + ++ S+G + F+ C+ + P + F Q +
Sbjct: 246 LVDSVYQEFKK---AVMASVGAAPTATPVGEPFEVCFP-KAGVSGAPDLVFTFQAGAALT 301
Query: 388 QPENMYFIEPDRGRFCVAIQDDP--------KYSILGAWQQQNMLIIYDLNVPALRFGSE 439
P Y + C+++ +ILG++QQ+N+ +++DL+ L F
Sbjct: 302 VPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPA 361
Query: 440 NCAN 443
+C++
Sbjct: 362 DCSS 365
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 165/372 (44%), Gaps = 37/372 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +G+P ++ DT S ++W C C C FD S T +
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 151 PCDDPLCRSPFKC------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-- 202
C DP+C S F+ +N +C Y+ RY G T G +TF F G + V
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSS 218
Query: 203 --LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEAT 256
+ FGCS SG + GI GF LS+ SQL +R +FS+CL +
Sbjct: 219 APIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGG 278
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
V G ++ + +P++ S +PH+ L+LL I + ++ F+ T G
Sbjct: 279 GVFVLG---EILVPGMVYSPLVPS--QPHYNLNLLSIGVNGQMLPLDAAVFE--ASNTRG 331
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPS 375
I+DTGT +T++ Y + + I S+ + P ++ E CY +S +PS
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFL---NAISNSVSQLVTPIISNGE--QCYLVSTSISDMFPS 386
Query: 376 MTFHLQ-EADYIVQPENMYF---IEPDRGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLN 430
++ + A +++P++ F I +C+ Q P + +ILG ++ + +YDL
Sbjct: 387 VSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLA 446
Query: 431 VPALRFGSENCA 442
+ + S +C+
Sbjct: 447 RQRIGWASYDCS 458
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 120/464 (25%), Positives = 199/464 (42%), Gaps = 58/464 (12%)
Query: 9 LAAFFSYFSVLFLT--------HFTSSESTGFSLKLIPIFSPESPLYPGN-LSQSERIHK 59
L F FS+L L T ++ G +L++ I SP SP + LS R+ +
Sbjct: 20 LVLFLQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQ 79
Query: 60 MFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASS 119
+AR Y++S+ + + + Q Y V+ IGTP +P L DT+S
Sbjct: 80 TLAQDQARLQYLSSLVAGRSVVPIASGRQML--QSTTYIVKALIGTPAQPLLLAMDTSSD 137
Query: 120 LVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHV 176
+ W C C+ C T F P ST++ + C P C+ +P C C + Y
Sbjct: 138 VAWIPCSGCVGCPSNTA--FSPAKSTSFKNVSCSAPQCKQVPNP-TCGARACSFNLTYGS 194
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI---SGILGFNASPLSL 233
+ L S++T +R + FGC N +G GG I G+LG PLSL
Sbjct: 195 SSIAANL-SQDT----IRLAADPIKAFTFGCVNKVAG---GGTIPPPQGLLGLGRGPLSL 246
Query: 234 SSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHL 290
SQ ++ + FSYCL R + + ++ G + +R ++ T +L + R +Y++L
Sbjct: 247 MSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQR--VKYTQLLRNPRRSSLYYVNL 304
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQ-------I 343
+ I +GR +V PP A G I D+GT T + Y+ + + + +
Sbjct: 305 VAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAV 364
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFC 403
+ SLG FD CY S P++TF + + + +N+ C
Sbjct: 365 VTSLG----------GFDTCY---SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSC 411
Query: 404 VAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+A+ P+ +++ + QQQN ++ D+ L E C+
Sbjct: 412 LAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/421 (24%), Positives = 178/421 (42%), Gaps = 45/421 (10%)
Query: 46 LYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGT 105
++P + S E I + AR +++S K + + + + Y V +GT
Sbjct: 30 VHPPSPSPLESIIALARADDARLLFLSS--KAASSGGVTSAPVASGQTPPSYVVRAGLGT 87
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC-------- 157
P++ L DT++ W+ C PC C + F P +S++Y+ +PC C
Sbjct: 88 PVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPC 145
Query: 158 ------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
+P C +++ + L S +R G + AFGC
Sbjct: 146 PANQDASAPLP----ACAFSKPFADTSFQASLGSDT-----LRLGKDAIAGYAFGCVGAV 196
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRR 269
+G G+LG P+SL SQ +R G+FSYCL R + ++ G A +
Sbjct: 197 AGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLG--AAGQP 254
Query: 270 RDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
R++ TP+L + RP +Y+++ +S+GR V+ P G+F G +ID+GT +T
Sbjct: 255 RNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRW 314
Query: 329 RNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEA-DY 385
Y L + + Q+ G Y + FD C+ D + P +T H+ D
Sbjct: 315 TAPVYAALREEFRRQVAAPSG-----YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDL 369
Query: 386 IVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ EN C+A+ + P+ +++ QQQN+ ++ D+ + F E
Sbjct: 370 TLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREP 429
Query: 441 C 441
C
Sbjct: 430 C 430
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 141/347 (40%), Gaps = 30/347 (8%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI---RCFDQTTPIFDPRASTTYSEIP 151
L Y V ++GTP Q + DT S L W QC+PC C+ Q P+FDP S++Y+ +P
Sbjct: 138 LNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVP 197
Query: 152 CDDPLCRS-----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C P+C C +C Y Y G T G+ S +T + V FG
Sbjct: 198 CGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFG 254
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
C + SG G + G+LG SL Q G+FSYCL + + G
Sbjct: 255 CGHAQSGLFNG--VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGP 312
Query: 267 VRRR-DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
TT +L S P +Y+ +L IS+G + P AF GG ++DTGT
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF------AGGTVVDTGTV 366
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEAD 384
+T + Y L + + S G P N D CY F Y ++T
Sbjct: 367 ITRLPPTAYAALRSAFRSGMASYGYPTAPSNG--ILDTCYN----FAGYGTVTLPNVALT 420
Query: 385 YIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYD 428
+ M + C+A D +ILG QQ++ + D
Sbjct: 421 FGSGATVMLGADGILSFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 162/391 (41%), Gaps = 40/391 (10%)
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
A +S F E + LP+ Y V V +GTP K L+FDT S + WTQC+P
Sbjct: 102 ARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEP 161
Query: 128 CIR-CFDQTTPIFDPRASTTYSEIPCDDPLCR-----SPF--KCQNGKCVYTRRYHVGDV 179
C++ C+ Q P +P ST+Y I C LC+ F C + C+Y +Y G
Sbjct: 162 CVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY 221
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
+ G + ET N F FGC N+G +G+LG + L+L SQ
Sbjct: 222 SIGFFATETLTLSSSNVF---KNFLFGCGQQNNGLFG--GAAGLLGLGRTKLALPSQTAK 276
Query: 240 RIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRH 298
+ LFSYCL + + G + ++ TP+ D P + L + +S+G
Sbjct: 277 TYKKLFSYCLPASSSSKGYLSLGGQVS---KSVKFTPLSADFDSTPFYGLDITGLSVGGR 333
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
+ AF + G +ID+GT +T + Y L + ++ P +
Sbjct: 334 KLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY-----PSTSGY 382
Query: 359 E-FDYCY---RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPK 411
FD CY +YD+ +TF + + + + + C+A DD
Sbjct: 383 SIFDTCYDFSKYDTVRIPKVGVTFK-GGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSD 441
Query: 412 YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
SI G QQ+ ++YD + F C+
Sbjct: 442 TSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 472
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 162/391 (41%), Gaps = 40/391 (10%)
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP 127
A +S F E + LP+ Y V V +GTP K L+FDT S + WTQC+P
Sbjct: 42 ARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEP 101
Query: 128 CIR-CFDQTTPIFDPRASTTYSEIPCDDPLCR-----SPF--KCQNGKCVYTRRYHVGDV 179
C++ C+ Q P +P ST+Y I C LC+ F C + C+Y +Y G
Sbjct: 102 CVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY 161
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
+ G + ET N F FGC N+G +G+LG + L+L SQ
Sbjct: 162 SIGFFATETLTLSSSNVF---KNFLFGCGQQNNGLFG--GAAGLLGLGRTKLALPSQTAK 216
Query: 240 RIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRH 298
+ LFSYCL + + G + ++ TP+ D P + L + +S+G
Sbjct: 217 TYKKLFSYCLPASSSSKGYLSLGGQV---SKSVKFTPLSADFDSTPFYGLDITGLSVGGR 273
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
+ AF + G +ID+GT +T + Y L + ++ P +
Sbjct: 274 QLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY-----PSTSGY 322
Query: 359 E-FDYCY---RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI---QDDPK 411
FD CY +YD+ +TF + + + + + C+A DD
Sbjct: 323 SIFDTCYDFSKYDTVRIPKVGVTFK-GGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSD 381
Query: 412 YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
SI G QQ+ ++YD + F C+
Sbjct: 382 TSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 112/429 (26%), Positives = 174/429 (40%), Gaps = 41/429 (9%)
Query: 42 PESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF----Y 97
P SPL + + S+ + E +AR + + M +D+ LP + Y
Sbjct: 28 PCSPLQTPDDAPSDA--DLLEHDQARVDSIHRMIANETAVVGQDVSLPAERGISVGTGNY 85
Query: 98 SVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPCDDP 155
V V +GTP + ++FDT S L W QC PC C+ Q P+F P +S+T+S + C +P
Sbjct: 86 VVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEP 145
Query: 156 LC-RSPFKCQNG----KCVYTRRYHVGDVTRGLASRETFAFPV-------RNGFTFVPRL 203
C R+ C + +C Y Y T G +T N +P
Sbjct: 146 ECPRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGF 205
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFG 262
FGC +N+G GK G+ G +SLSSQ + FSYCL A + G
Sbjct: 206 VFGCGENNTGLF--GKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHGYLSLG 263
Query: 263 RDADVRRRDLETTPILLSDLRPHF-YLHLLEISI-GRHI-VRFPPGAFDIMRDGTGGFII 319
A TP+L P F Y+ L+ I + GR I V P + G I+
Sbjct: 264 TPAPAPAH-ARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALW------PAGLIV 316
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA---YPSM 376
D+GT +T + Y L + + G +R P + D CY + + A P++
Sbjct: 317 DSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLS--ILDTCYDFTAHANATVSIPAV 374
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPA 433
I + + C+A + ILG QQ+ + ++YD+
Sbjct: 375 ALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQK 434
Query: 434 LRFGSENCA 442
+ F ++ C+
Sbjct: 435 IGFAAKGCS 443
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 105/418 (25%), Positives = 175/418 (41%), Gaps = 36/418 (8%)
Query: 37 IPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI--HLPMAKQD 94
+PI P SQ ++F ++R +++ S L++ + + +D
Sbjct: 66 LPITQKYGPCSGSGHSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHNNNLFDED 125
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+ V+V GTP L+ DT SS+ WTQC+ C+ C + FD AS+TYS C
Sbjct: 126 GNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSC-- 183
Query: 155 PLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
P +N Y Y + G +T + F + FGC +N G
Sbjct: 184 ----IPSTVENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVFQ---KFQFGCGRNNKG- 232
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
FG + G+LG LS SQ ++ +FSYCL E S++ FG A + L+
Sbjct: 233 DFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLL-FGEKATSQSSSLKF 291
Query: 275 TPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
T ++ +++++L +IS+G + P F + G IID+ T +T +
Sbjct: 292 TSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQ 346
Query: 331 GPYQTLMQRYDQILR----SLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL-QEAD 384
Y L + + + S GR++ D CY P + H AD
Sbjct: 347 RAYSALKAAFKKAMAKYPLSNGRRK----KGDILDTCYNLSGRKDVLLPEIVLHFGGGAD 402
Query: 385 YIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ N+ + D R C+A + +I+G QQ ++ ++YD+ + FG C+
Sbjct: 403 VRLNGTNIVW-GSDASRLCLAFAGTSELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCS 459
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 156/373 (41%), Gaps = 35/373 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---CFDQTT------PIFDPRASTTY 147
Y V +GTP + L+ DT S L W C+ R C ++ +F S+++
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 148 SEIPCDDPLCR----SPFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
IPC +C+ F N C Y RY G G + ET ++ G
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRK 202
Query: 199 F-VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
+ + GCS G +F G++G S S + + + G FSYCLV + +
Sbjct: 203 MKLHNVLIGCSESFQGQSFQAA-DGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKN 261
Query: 258 V---IKFG--RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
V + FG R + ++ T ++L + + ++++ ISIG +++ P +D+
Sbjct: 262 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV--K 319
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
G GG I+D+G+ +TF+ YQ +M L + + +YC+ + F+
Sbjct: 320 GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGP---LEYCFN-STGFEE 375
Query: 373 --YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYD 428
P + FH + P Y I G C+ P S++G QQN L +D
Sbjct: 376 SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFD 435
Query: 429 LNVPALRFGSENC 441
L + L F +C
Sbjct: 436 LGLKKLGFAPSSC 448
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 150/367 (40%), Gaps = 48/367 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCDD 154
Y V +GTP PQ L+ DT SSL W QC+PC +C+ Q P+FDP S++YS +PCD
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDS 188
Query: 155 PLCRSPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
CR+ +G C Y Y G G S + V R F
Sbjct: 189 QECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTL---GPGAIVKRFHF 245
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCLVREMEATSVIKFGRD 264
GC + F G+LG P SL+ Q R G+FS+CL +T + G
Sbjct: 246 GCGHHQQRGKF-DMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGVSTGFLALGAP 304
Query: 265 ADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
D TP+L D +P FY L IS+ ++ PP F R+ G I D+GT
Sbjct: 305 HDTSA--FVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVF---RE---GVITDSGT 356
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR---YDSSFKAYPSMTF-- 378
++ ++ Y L + RS + D C+ YD+ S+TF
Sbjct: 357 VLSALQETAYTALRTAF----RSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRG 412
Query: 379 ----HLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
HL + ++ + F D ++G+ Q+ + ++YD+ +
Sbjct: 413 GATVHLDASSGVLMDGCLAFWS----------SGDEYTGLIGSVSQRTIEVLYDMPGRKV 462
Query: 435 RFGSENC 441
F + C
Sbjct: 463 GFRTGAC 469
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 147/360 (40%), Gaps = 78/360 (21%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +G+P + +FDT S L WTQC+PC+ C+ Q IFDP S +YS + CD P
Sbjct: 89 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 148
Query: 156 LCR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C SP C + C+Y RY G + G +RE + + F FGC
Sbjct: 149 SCEKLESATGNSP-GCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFN---NFQFGC 204
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
+N G FGG +G+LG +PLSL SQ + +FSYCL +T + FG D
Sbjct: 205 GQNNRGL-FGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSSTGYLSFG-SGDG 261
Query: 268 RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ ++ TP R PP + ++
Sbjct: 262 DSKAVKFTP------------------------RLPPTVYSSVQK--------------- 282
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPSMTFHLQ-EAD 384
++ LM Y ++ D CY S +K P + + A+
Sbjct: 283 ----VFRELMSDYPRV-----------KGVSILDTCYDL-SKYKTVKVPKIILYFSGGAE 326
Query: 385 YIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ PE + ++ + C+A DD + +I+G QQ+ + ++YD + F C
Sbjct: 327 MDLAPEGIIYVL-KVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 167/377 (44%), Gaps = 43/377 (11%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
+++ IG+ K + DT S V QC ++ P+FDP AS +Y ++PC LC
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCL 54
Query: 159 S-----------PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR---LA 204
+ P + C Y+ Y + G S++ N + + +A
Sbjct: 55 AVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVA 114
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLVR---EMEATSVIK 260
FGC++ GF GI+GFN LSL SQL++R+ G FSYC + AT VI
Sbjct: 115 FGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIF 174
Query: 261 FGRDADVRRRDLETTPILLSDLRPH----FYLHLLEISIGRHIVRFPPGAFDIM-RDGTG 315
G D+ + + + TP+L + + P +Y+ L IS+ + P AF + G G
Sbjct: 175 LG-DSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDG 233
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD--SSFKAY 373
G ++D+GT T + + Y + RS R+++ A+ FD CY SS
Sbjct: 234 GTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKV--GAAAGFDDCYNISAGSSLPGV 291
Query: 374 PSMTFHLQEADYI-VQPENMYFIEPDRGR---FCVAIQDDP-----KYSILGAWQQQNML 424
P + LQ + ++ E+++ G C+AI K ++LG +QQ N L
Sbjct: 292 PEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYL 351
Query: 425 IIYDLNVPALRFGSENC 441
+ YD + F +C
Sbjct: 352 VEYDNERSRVGFERADC 368
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 164/371 (44%), Gaps = 37/371 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +G+P ++ DT S ++W C C C FD S T +
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 151 PCDDPLCRSPFKC------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-- 202
C DP+C S F+ +N +C Y+ RY G T G +TF F G + V
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSS 218
Query: 203 --LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEAT 256
+ FGCS SG + GI GF LS+ SQL +R +FS+CL +
Sbjct: 219 APIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGG 278
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
V G ++ + +P++ S +PH+ L+LL I + ++ F+ T G
Sbjct: 279 GVFVLG---EILVPGMVYSPLVPS--QPHYNLNLLSIGVNGQMLPLDAAVFE--ASNTRG 331
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPS 375
I+DTGT +T++ Y + + I S+ + P ++ E CY +S +PS
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFL---NAISNSVSQLVTPIISNGE--QCYLVSTSISDMFPS 386
Query: 376 MTFHLQ-EADYIVQPENMYF---IEPDRGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLN 430
++ + A +++P++ F I +C+ Q P + +ILG ++ + +YDL
Sbjct: 387 VSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLA 446
Query: 431 VPALRFGSENC 441
+ + S +C
Sbjct: 447 RQRIGWASYDC 457
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 110/417 (26%), Positives = 168/417 (40%), Gaps = 37/417 (8%)
Query: 37 IPIFSPESPLYPGNLSQSERIHKMFEISKARANYMA-SMSKPNAFQELEDIHLPM----A 91
+P+ P P ++ I ++ E + RA Y+ +S + Q L D+ +P A
Sbjct: 65 VPLNHRYGPCSPAPSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQPL-DLTVPTTLGSA 123
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
+ Y + V IG+P Q ++ DT S + W +C D T +FDP STTY+
Sbjct: 124 LDTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNST----DGLT-LFDPSKSTTYAPFS 178
Query: 152 CDDPLC----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C C + C N C Y +Y G T G S +T A + V FGC
Sbjct: 179 CSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASD---TVTDFHFGC 235
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
S+ F G KI G++G SL SQ FSYCL + + FG +
Sbjct: 236 SHHEEDFD-GEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNRTSGFLTFGA-PNG 293
Query: 268 RRRDLETTPILLSDLRPHFYLHLL-EISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TTP+L P Y LL +IS+G + P + G ++D+GT +T
Sbjct: 294 TSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVL------SNGSVMDSGTVIT 347
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQR-IPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEAD 384
++ Y L + + L QR P D CY + + P+++ L
Sbjct: 348 WLPRRAYSALSSAFRSSMTRLRHQRAAPLGI---LDTCYDFTGLVNVSIPAVSLVLDGGA 404
Query: 385 YIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ N I+ C+A SI+G QQ+ +++D+ F S C
Sbjct: 405 VVDLDGNGIMIQD-----CLAFAATSGDSIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 156/368 (42%), Gaps = 40/368 (10%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
+D + V+V GTP + L+ DT SS+ WTQC+ C+ C + FD AS+TYS C
Sbjct: 123 EDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSC 182
Query: 153 DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
P N Y Y + G +T + F + FGC +N
Sbjct: 183 ------IPSTVGN---TYNMTYGDKSTSVGNYGCDTMTLEPSDVFQ---KFQFGCGRNNE 230
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDL 272
G FG G+LG LS SQ ++ + +FSYCL E S++ FG A + L
Sbjct: 231 G-DFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLL-FGEKATSQSSSL 288
Query: 273 ETTPIL----LSDLRP--HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ T ++ S L ++++ LL+IS+G + P F + G IID+GT +T
Sbjct: 289 KFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVIT 343
Query: 327 FIRNGPYQTLMQRYDQILR----SLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQ 381
+ Y L + + + S GR++ + D CY P H
Sbjct: 344 RLPQRAYSALKAAFKKAMAKYPLSNGRRK----ENDMLDTCYNLSGRKDVLLPEXVLHFG 399
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQD------DPKYSILGAWQQQNMLIIYDLNVPALR 435
+ + D R C+A +P+ +I+G QQ ++ ++YD+ +
Sbjct: 400 DGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIG 459
Query: 436 FGSENCAN 443
FG C+N
Sbjct: 460 FGGNGCSN 467
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 75/241 (31%), Positives = 115/241 (47%), Gaps = 31/241 (12%)
Query: 82 ELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
E+ I +P+A Q L Y V + +G + ++ DT S L W QC+PC+ C++Q P
Sbjct: 126 EVSQIQIPLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSDLTWVQCEPCMSCYNQQGP 183
Query: 138 IFDPRASTTYSEIPCDDPLCRS-------PFKCQNG--KCVYTRRYHVGDVTRGLASRET 188
+F P S++Y IPC+ C+S C++ C Y Y G T G E
Sbjct: 184 VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEH 243
Query: 189 FAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
+F G V FGC +N G FGG +SG++G S LSL SQ + G+FSYC
Sbjct: 244 LSF----GGISVSNFVFGCGKNNKGL-FGG-VSGLMGLGRSNLSLISQTNSTFGGVFSYC 297
Query: 249 L-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH------FYLHLLEISIGRHIVR 301
L + A+ + G ++ V + TPI + + P+ + L+L I +G + +
Sbjct: 298 LPPTDAGASGSLAMGNESSVFKN---LTPIAYTRMVPNPQLSNFYMLNLTGIDVGVWLFK 354
Query: 302 F 302
Sbjct: 355 L 355
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 173/396 (43%), Gaps = 54/396 (13%)
Query: 83 LEDIHLPM-----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L +I LP+ A Y ++ +G+P K ++ DT S ++W C PC +C +T
Sbjct: 55 LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 114
Query: 137 ----PIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTR--RYHV---------GDVTR 181
++D + S+T + C+D C F Q+ C + YHV GD +
Sbjct: 115 GIPLSLYDSKTSSTSKNVGCEDDFCS--FIMQSETCGAKKPCSYHVVYGDGSTSDGDFIK 172
Query: 182 GLASRETFAFPVRNGFTFVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLR- 238
+ E +R + FGC + SG + GI+GF S S+ SQL
Sbjct: 173 DNITLEQVTGNLRTA-PLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 231
Query: 239 -NRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
+ +FS+CL M + G +V ++TTPI+ + + H+ + L + +
Sbjct: 232 GGSTKRIFSHCL-DNMNGGGIFAVG---EVESPVVKTTPIVPNQV--HYNVILKGMDVDG 285
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ PP +G GG IID+GT + ++ Y +L+++ +Q++ +
Sbjct: 286 DPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKI------TAKQQVKLHMV 337
Query: 358 QEFDYCYRYDSSF-KAYPSMTFHLQEA--------DYIVQ-PENMYFIEPDRGRFCVAIQ 407
QE C+ + S+ KA+P + H +++ DY+ E+MY G + Q
Sbjct: 338 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG--MTTQ 395
Query: 408 DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D +LG N L++YDL + + NC++
Sbjct: 396 DGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 431
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 153/369 (41%), Gaps = 39/369 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
+Y IGTP +P + D A LVWTQC C RCF Q P+F P AS+T+ PC
Sbjct: 44 YYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 103
Query: 156 LCRS-PFK-CQNGKCVYTR-RYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
+C S P + C C Y + T G A+ +TFA T RLAFGC +
Sbjct: 104 VCESIPTRSCSGDVCSYKGPPTQLRGNTSGFAATDTFAI-----GTATVRLAFGCVVASD 158
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRRRD 271
G SG +G +P SL +Q++ FSYCL R +S + G A + +
Sbjct: 159 IDTMDGP-SGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLFLGSSAKLAGSE 214
Query: 272 LETTPILLS-----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+T + D ++ L L I G + G ++ T +P +
Sbjct: 215 STSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFS 266
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--AYPSMTFHLQEAD 384
+ + Y+ + + + + Q FD C++ + F P + F Q A
Sbjct: 267 LLVDSAYKAFKKAVTEAVGGAAAPPM-ATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAA 325
Query: 385 YIVQPENMYFIE--PDRGRFCVAI--------QDDPKYSILGAWQQQNMLIIYDLNVPAL 434
+ P Y I+ ++ C AI S+LG+ QQ+++ +YDL L
Sbjct: 326 ALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETL 385
Query: 435 RFGSENCAN 443
F +C++
Sbjct: 386 SFEPADCSS 394
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 173/396 (43%), Gaps = 54/396 (13%)
Query: 83 LEDIHLPM-----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L +I LP+ A Y ++ +G+P K ++ DT S ++W C PC +C +T
Sbjct: 59 LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 118
Query: 137 ----PIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTR--RYHV---------GDVTR 181
++D + S+T + C+D C F Q+ C + YHV GD +
Sbjct: 119 GIPLSLYDSKTSSTSKNVGCEDDFCS--FIMQSETCGAKKPCSYHVVYGDGSTSDGDFIK 176
Query: 182 GLASRETFAFPVRNGFTFVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLR- 238
+ E +R + FGC + SG + GI+GF S S+ SQL
Sbjct: 177 DNITLEQVTGNLRTA-PLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 235
Query: 239 -NRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
+ +FS+CL M + G +V ++TTPI+ + + H+ + L + +
Sbjct: 236 GGSTKRIFSHCL-DNMNGGGIFAVG---EVESPVVKTTPIVPNQV--HYNVILKGMDVDG 289
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ PP +G GG IID+GT + ++ Y +L+++ +Q++ +
Sbjct: 290 DPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKI------TAKQQVKLHMV 341
Query: 358 QEFDYCYRYDSSF-KAYPSMTFHLQEA--------DYIVQ-PENMYFIEPDRGRFCVAIQ 407
QE C+ + S+ KA+P + H +++ DY+ E+MY G + Q
Sbjct: 342 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG--MTTQ 399
Query: 408 DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
D +LG N L++YDL + + NC++
Sbjct: 400 DGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 435
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 157/376 (41%), Gaps = 38/376 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V +GTP +P L+ DT S L W +C D +F AS +++ I C
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIACSSD 171
Query: 156 LCRS--PFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPV-----RNGFTFVPRL 203
C S PF N C Y RY+ G RG+ ++ + R+G +L
Sbjct: 172 TCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRAKL 231
Query: 204 ---AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATS 257
GC+ G +F G+L S +S +S+ R G FSYCLV + ATS
Sbjct: 232 QGVVLGCTASYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNATS 290
Query: 258 VIKFG--------RDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFD 308
+ FG + TP+LL + P + + + + + + P +D
Sbjct: 291 YLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADVWD 350
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
+ R GG I+D+GT +T + Y+ ++ + L L P + F+YCY + +
Sbjct: 351 VARG--GGAILDSGTSLTVLATPAYRAVVAALSERLAGL-----PRVSMDPFEYCYNWTA 403
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLII 426
+ P + + + P Y ++ G C+ +Q+ P S++G QQ+ L
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQEGAWPGVSVIGNILQQDHLWE 463
Query: 427 YDLNVPALRFGSENCA 442
+DL LRF CA
Sbjct: 464 FDLRDRWLRFKHTRCA 479
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 164/386 (42%), Gaps = 51/386 (13%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRAST 145
+P+ Y IGTP + + D + LVWTQC C CF Q P+FDP AS
Sbjct: 53 VPLHWSGACYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASN 112
Query: 146 TYSEIPCDDPLCRS-PFK-CQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
TY C PLC+S P + C +G+C Y GD T G+AS + A G R
Sbjct: 113 TYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSMFGD-TFGIASTDAIAIGNAEG-----R 166
Query: 203 LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVI 259
LAFGC + G G SG +G +P SL Q FSYCL S +
Sbjct: 167 LAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTA---FSYCLAPHGPGKKSAL 223
Query: 260 KFGRDADVRRRDLET--TPIL------LSD--LRPHFYLHLLEISIGRHIVRFPP---GA 306
G A + TP+L SD P++ + L I G V GA
Sbjct: 224 FLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGA 283
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY 366
I++ ++T P++++ + YQ L + + +LG + N + FD C++
Sbjct: 284 ITILQ-------LETFRPLSYLPDAAYQALEKV---VTAALGSPSM-ANPPEPFDLCFQ- 331
Query: 367 DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR--FCVAI-------QDDPKYSILGA 417
+++ P + F Q + P + Y + G C++I D SILG+
Sbjct: 332 NAAVSGVPDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGS 391
Query: 418 WQQQNMLIIYDLNVPALRFGSENCAN 443
Q+N+ ++DL L F +C++
Sbjct: 392 LLQENVHFLFDLEKETLSFEPADCSS 417
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 166/377 (44%), Gaps = 41/377 (10%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
K + V++ IGTP + Q ++ DT S L W QC T FDP S+T+S +P
Sbjct: 92 KYSMALIVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLP 151
Query: 152 CDDPLCRS-------PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
C P+C+ P C QN C Y+ Y G G RE F F R+ FT P L
Sbjct: 152 CTHPVCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFS-RSLFT--PPL 208
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM-----EATSV 258
GC+ +++ GILG N LS +SQ ++I FSYC+ + T
Sbjct: 209 ILGCATEST------DPRGILGMNRGRLSFASQ--SKITK-FSYCVPTRVTRPGYTPTGS 259
Query: 259 IKFGRDADVRR-RDLETTPILLSDLRPH-----FYLHLLEISIGRHIVRFPPGAFDIMRD 312
G + + R +E S P+ + + L I IG + P F
Sbjct: 260 FYLGHNPNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAG 319
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYDS--S 369
G+G ++D+G+ T++ N Y + +++R++G R + Y D C+ ++
Sbjct: 320 GSGQTMLDSGSEFTYLVNEAYDKVRA---EVVRAVGPRMKKGYVYGGVADMCFDGNAIEI 376
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY----SILGAWQQQNMLI 425
+ M F ++ IV P+ + G C+ I + K +I+G + QQN+ +
Sbjct: 377 GRLIGDMVFEFEKGVQIVVPKERVLATVEGGVHCIGIANSDKLGAASNIIGNFHQQNLWV 436
Query: 426 IYDLNVPALRFGSENCA 442
+DL + FG+ +C+
Sbjct: 437 EFDLVNRRMGFGTADCS 453
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/416 (24%), Positives = 178/416 (42%), Gaps = 64/416 (15%)
Query: 70 YMASMSKPNAFQELEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFDTASSLVWTQ 124
++A++ K + + L + LP+ + Y ++ IGTP K ++ DT S ++W
Sbjct: 57 HLAALRKHDGRRLLTAVDLPLGGNGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVN 116
Query: 125 CQPCIRC-------FDQTTPIFDPRASTTYSEIPCDDPLCRS-------PFKCQNGKCVY 170
C C C D T ++DP AS + + C C + P N C Y
Sbjct: 117 CISCDSCPRKSGLGIDLT--LYDPTASASSKTVTCGQEFCATATNGGVPPSCAANSPCQY 174
Query: 171 TRRYHVGDVTRGLASRETFAFPVRNGFTFV----PRLAFGCSNDNSGFAFGGK---ISGI 223
+ Y G T G + + +G + FGC G A G + GI
Sbjct: 175 SITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANASVTFGCGAKIGG-ALGSSNVALDGI 233
Query: 224 LGFNASPLSLSSQLRN--RIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSD 281
LGF + S+ SQL + ++ +FS+CL + + G +V + ++TTP++
Sbjct: 234 LGFGQANSSMLSQLTSAGKVTKIFSHCL-DTVNGGGIFAIG---NVVQPKVKTTPLVPG- 288
Query: 282 LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY- 340
PH+ + L I +G ++ P FDI G+ G IID+GT + ++ Y+ ++
Sbjct: 289 -MPHYNVVLKTIDVGGSTLQLPTNIFDI-GGGSRGTIIDSGTTLAYLPEVVYKAVLSAVF 346
Query: 341 ----DQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQ-EADYIVQPENMYF 394
D L+++ Q+F C++Y S +P +TFH + +V P + Y
Sbjct: 347 SNHPDVTLKNV----------QDF-LCFQYSGSVDNGFPEVTFHFDGDLPLVVYPHD-YL 394
Query: 395 IEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ +CV Q D +LG N L++YDL + + + NC++
Sbjct: 395 FQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTNYNCSS 450
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 166/383 (43%), Gaps = 53/383 (13%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC- 157
++ IGTP + LL DTAS L W Q C C P F+P S+++ PC +C
Sbjct: 1 MQTKIGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCL 60
Query: 158 -RSPFKCQN------GKCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFGCSN 209
RS Q+ G C + Y G G+ +RE F+ +G + + + FGC++
Sbjct: 61 GRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCAS 120
Query: 210 DN----SGFAFGGKISGILGFNASPLSLSSQLRNRIQ-GL---FSYCL---VREMEATSV 258
+ F+ SG LG N S +Q+ +R + GL FSYC + ++ V
Sbjct: 121 KDLQRPVDFS-----SGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGV 175
Query: 259 IKFGRDA----DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
I FG + LE P + S + +Y+ L IS+G ++ P AF I R G
Sbjct: 176 IIFGDSGIPAHHFQYLSLEQEPPIAS-IVDFYYVGLQGISVGGELLHIPRSAFKIDRLGN 234
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY----CYRY---D 367
GG D+GT V+F+ + L++ + GR+ + N + D+ CY D
Sbjct: 235 GGTYFDSGTTVSFLVEPAHTALVEAF-------GRRVLHLNRTSGSDFTKELCYDVAAGD 287
Query: 368 SSFKAYPSMTFHLQEADYIVQPENMYFI----EPDRGRFCVAIQDDPKYS-----ILGAW 418
+ P +T H + + E ++ P C+A + + ++G +
Sbjct: 288 ARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNY 347
Query: 419 QQQNMLIIYDLNVPALRFGSENC 441
QQQ+ LI +DL + F NC
Sbjct: 348 QQQDYLIEHDLERSRIGFAPANC 370
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 120/250 (48%), Gaps = 13/250 (5%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
I P++ Y +E++IGTP + DT S L+W QC PC C+ Q P+FD ++S+
Sbjct: 48 IQSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSS 107
Query: 146 TYSEIPCDDPLCRSPFKCQNG----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP 201
T+S I C C + C Y Y G T+G+ ++ET G
Sbjct: 108 TFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAF 167
Query: 202 R-LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLV---REMEAT 256
+ + FGC ++N+G AF K GI+G PLSL SQ+ + + G +FS CLV +
Sbjct: 168 KGVIFGCGHNNNG-AFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSIS 226
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTG 315
S + FG+ ++V + +TP++ FY + LL IS+ + P A +
Sbjct: 227 SPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISV--EDINLPFNAGSSLEPAAK 284
Query: 316 GFIIDTGTPV 325
G +I PV
Sbjct: 285 GNVIPQIWPV 294
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 109/438 (24%), Positives = 182/438 (41%), Gaps = 50/438 (11%)
Query: 21 LTHFTSSESTGFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNA 79
LT ++ G +L++ +FSP SP P LS +E + ++ +AR ++ASM +
Sbjct: 23 LTPKCDTQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVLQLQAKDQARLQFLASMVAGRS 82
Query: 80 FQELEDIHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQ 134
+P+A Q Y V IG+P + L DT++ W C C C
Sbjct: 83 V-------VPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGC--- 132
Query: 135 TTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF 191
T+ +F P STT+ + C P C +P C C + Y + +
Sbjct: 133 TSTLFAPEKSTTFKNVSCGSPQCNQVPNP-SCGTSACTFNLTYGSSSIAANV-------- 183
Query: 192 PVRNGFTF----VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSY 247
V++ T +P FGC +G + + LG SL SQ +N Q FSY
Sbjct: 184 -VQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPL--SLLSQTQNLYQSTFSY 240
Query: 248 CL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPP 304
CL + + + ++ G A R ++ TP+L + R +Y++L+ I +GR +V PP
Sbjct: 241 CLPSFKSLNFSGSLRLGPVAQPIR--IKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPP 298
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
A G + D+GT T + Y + + + + + + + FD CY
Sbjct: 299 EALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCY 358
Query: 365 RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQ 419
P++TF + + +N+ C+A+ P +++ Q
Sbjct: 359 TVP---IVAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQ 415
Query: 420 QQNMLIIYDLNVPALRFG 437
QQN ++YD VP R G
Sbjct: 416 QQNHRVLYD--VPNSRLG 431
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 166/374 (44%), Gaps = 39/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ IGTP K ++ DT S ++W C C RC ++ ++D +ASTT +
Sbjct: 73 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 132
Query: 151 PCDDPLCR---SPF-KCQNG-KCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP--- 201
CDD C P C+ G +C+Y+ Y G T G ++ + + F P
Sbjct: 133 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 192
Query: 202 RLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
+ FGC N SG + + GILGF + S+ SQL +++ +FS+CL ++
Sbjct: 193 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGG 251
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM-RDGTGG 316
+ G +V + TP++ + + H+ + + EI +G + P AF+ R GT
Sbjct: 252 IFAIG---EVVEPKVNITPLVQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT-- 304
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
IID+GT + + Y L+++ L R A FDY D F P++
Sbjct: 305 -IIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDDGF---PTV 359
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDL 429
T H ++ + + Y + +C+ Q D ++LG N L++YDL
Sbjct: 360 TLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 419
Query: 430 NVPALRFGSENCAN 443
+ + NC++
Sbjct: 420 EKQGIGWVEYNCSS 433
>gi|326518194|dbj|BAK07349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 435
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 158/369 (42%), Gaps = 30/369 (8%)
Query: 96 FYSVEVNIGTPMKPQ--HLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
Y V V +G+ L D +L W QCQPC+ Q +F S Y +
Sbjct: 66 LYGVLVGVGSGQTRHFYKLGLDLVGNLTWIQCQPCVPEVRQEGAVFKSAVSPRYKDTKAT 125
Query: 154 DPLCRSPFKCQNG-KC-VYTRRYHVGDVTRGLASRETFAF---PVRNGF-TFVPRLAFGC 207
DP C P+ G +C YT ++V G + F F P G T V +L FGC
Sbjct: 126 DPKCTPPYTPSVGNRCSFYTTSWNV--AAHGYLGSDMFGFAGSPGTGGHGTDVDKLTFGC 183
Query: 208 SNDNSGFA--FGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSV----I 259
++ GF G ++G L + P S SQL R FSYCL + +
Sbjct: 184 AHTTDGFERLNHGVLAGALSLSRHPTSFLSQLTARRLADSRFSYCLFPGQSHPNARHGFL 243
Query: 260 KFGRDADVRRRDLETTPILLSDLR---PHFYLHLLEISI-GRHIVRFPPGAFDIM-RDGT 314
+FGR D+ R D + LL R +Y+ + IS+ G+ I+ P F +
Sbjct: 244 RFGR--DIPRHDHAHSTSLLFTGRGSGSMYYIGVTSISLNGKRIIGLQPAFFRRNPQTRR 301
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
GG ++D GTP+T + Y + +++ G +R P Q C+ P
Sbjct: 302 GGSVVDPGTPLTRLVREAYNIVEAELVAYMQTQGSRRAPAPV-QGHRLCF-VSWGHAHLP 359
Query: 375 SMTFHLQE--ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
SMT ++ E A ++PE + F++ C + D + ++LGA QQ + +DL+
Sbjct: 360 SMTINMNEDRAKLFIKPE-LLFLKVTHEHLCFLVVPDEEMTVLGAAQQVDTRFTFDLHAN 418
Query: 433 ALRFGSENC 441
L F E+C
Sbjct: 419 RLYFAQEHC 427
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 160/367 (43%), Gaps = 44/367 (11%)
Query: 93 QDLFYSVEVNIGTPMKPQHLL--FDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
Q Y V+ +GTP PQ LL D + W C+ C+ C ++ +F+ STT+ +
Sbjct: 31 QSPSYIVKAKVGTP--PQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVKSTTFKTL 85
Query: 151 PCDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C P C+ +P C C + Y + L +R+T A + VP AFGC
Sbjct: 86 GCGAPQCKQVPNPI-CGGSTCTWNTTYGSSTILSNL-TRDTIALSMDP----VPYYAFGC 139
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDA 265
+G + + G+LGF PLS SQ +N + FSYCL R + + ++ G
Sbjct: 140 IQKATGSSVPPQ--GLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVG 197
Query: 266 DVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
R ++TTP+L + R +Y+ L I +GR IV P A G I D+GT
Sbjct: 198 QPPR--IKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTV 255
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE----FDYCYRYDSSFKAYPSMTFHL 380
T + Y + + R+R+ NA+ FD CY S P++TF
Sbjct: 256 FTRLVAPAYIAVRNEF--------RKRV-GNATVSSLGGFDTCY---SVPIVPPTITFMF 303
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALR 435
+ + PEN+ C+A+ P +++ + QQQN I++D+ L
Sbjct: 304 SGMNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLG 363
Query: 436 FGSENCA 442
E C+
Sbjct: 364 VAREQCS 370
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/424 (25%), Positives = 174/424 (41%), Gaps = 45/424 (10%)
Query: 37 IPIFSPESPLYPGNLSQSERIHKMFEISKARANYM----ASMSKPNAFQELEDIHLPM-- 90
+P+ P + + + M + RA Y+ + ++ E D+ +P
Sbjct: 59 VPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEGSDVTVPTTL 118
Query: 91 --AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
+ L Y + V +G+P Q +L DT S + W QC+PC +C Q +FDP +S+TYS
Sbjct: 119 GTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYS 178
Query: 149 EIPCDDPLCRSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C C + C + +C YT +Y G G S +T A G + V FG
Sbjct: 179 AFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL----GSSTVENFQFG 234
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
CS SG + +G++G SL++Q FSYCL ++ + G
Sbjct: 235 CSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPGSSGFLTLGASTS 294
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
+ TP+L S P +Y LL+ I +G + P AF + G I+D+GT +
Sbjct: 295 ---GFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAF------SAGSIMDSGTII 345
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQ--- 381
T + Y L + + +Q P FD C+ + S + P++
Sbjct: 346 TRLPRTAYSALSSAFKAGM----KQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGA 401
Query: 382 ----EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+D I+ + F A DD I+G QQ+ ++YD+ A+ F
Sbjct: 402 VVDLASDGIILGSCLAF---------AANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFK 452
Query: 438 SENC 441
+ C
Sbjct: 453 AGAC 456
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 151/362 (41%), Gaps = 37/362 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +GTP + + D ++ W C C C ++P F P S+TY +PC P
Sbjct: 83 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSPQ 141
Query: 157 CR---SPFKCQNG---KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C SP C G C + Y + + +++ A N V FGC
Sbjct: 142 CAQVPSP-SCPAGVGSSCGFNLTY-AASTFQAVLGQDSLAL--EN--NVVVSYTFGCLRV 195
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVR 268
SG + G++GF PLS SQ ++ +FSYCL R + +K G +
Sbjct: 196 VSGNSV--PPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPK 253
Query: 269 RRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
R ++TTP+L + RP +Y++++ I +G +V+ P A G IID GT T
Sbjct: 254 R--IKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 311
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ Y + + GR R P FD CY S P++TF A +
Sbjct: 312 LAAPVYAAVRDAFR------GRVRTPVAPPLGGFDTCYNVTVSV---PTVTFMFAGAVAV 362
Query: 387 VQPENMYFIEPDRGRF-CVAIQDDPK------YSILGAWQQQNMLIIYDLNVPALRFGSE 439
PE I G C+A+ P ++L + QQQN +++D+ + F E
Sbjct: 363 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 422
Query: 440 NC 441
C
Sbjct: 423 LC 424
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 159/380 (41%), Gaps = 41/380 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP-------CIRCFDQTTPIFDPRASTTYSE 149
Y V + GTP + L+ DT S L+W QC C + P F S T S
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 112
Query: 150 IPCDDPLCR-SPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAFP-VRNGFT 198
+PC C P +G C Y Y G T G +R+T +G
Sbjct: 113 VPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGA 172
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-----REM 253
V +AFGC N G +F G G++G LS +Q + FSYCL+ R
Sbjct: 173 AVRGVAFGCGTRNQGGSFSGT-GGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 231
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRD 312
++S + GR RR TP++ + L P FY + ++ I +G ++ P + I
Sbjct: 232 RSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVL 289
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS--QEFDYCYRYDS-- 368
G GG +ID+G+ +T++R G Y L+ + S+ RIP +A+ Q + CY S
Sbjct: 290 GNGGTVIDSGSTLTYLRLGAYLHLVSAFAA---SVHLPRIPSSATFFQGLELCYNVSSSS 346
Query: 369 ----SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQ 421
+ +P +T + + P Y ++ C+AI+ +++LG QQ
Sbjct: 347 SSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQ 406
Query: 422 NMLIIYDLNVPALRFGSENC 441
+ +D + F C
Sbjct: 407 GYHVEFDRASARIGFARTEC 426
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/400 (25%), Positives = 170/400 (42%), Gaps = 60/400 (15%)
Query: 83 LEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L I LP+ L Y + IGTP K ++ DT S ++W C C C ++
Sbjct: 71 LAAIDLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNL 130
Query: 137 ----PIFDPRASTTYSEIPCDDPLCRS------PFKCQNGKCVYTRRYHVGDVTRGLASR 186
++DPR S + + CD C + P C Y+ Y G T G
Sbjct: 131 GIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVT 190
Query: 187 ETFAFPVRNGFTFV----PRLAFGCSNDNSGFAFGGKIS-------GILGFNASPLSLSS 235
+ + +G ++FGC G GG + GILGF S S+ S
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGC-----GAKLGGDLGSSNLALDGILGFGQSNSSMLS 245
Query: 236 QL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEI 293
QL +++ +F++CL + + G +V + ++TTP L+ D+ PH+ + L I
Sbjct: 246 QLAAAGKVRKMFAHCL-DTVNGGGIFAIG---NVVQPKVKTTP-LVPDM-PHYNVILKGI 299
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR-YDQILRSLGRQRI 352
+G + P FD + G IID+GT + ++ G Y+ L +D+ Q I
Sbjct: 300 DVGGTALGLPTNIFD--SGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK------HQDI 351
Query: 353 PYNASQEFDYCYRYDSSF-KAYPSMTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDP 410
Q+F C++Y S +P +TFH + + IV P + Y + + +C+ Q+
Sbjct: 352 SVQTLQDFS-CFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD-YLFQNGKNLYCMGFQNGG 409
Query: 411 KYSILG-------AWQQQNMLIIYDLNVPALRFGSENCAN 443
+ G N L++YDL A+ + NC++
Sbjct: 410 GKTKDGKDLGLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/421 (24%), Positives = 177/421 (42%), Gaps = 45/421 (10%)
Query: 46 LYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGT 105
++P + S E I + AR +++S K + + + + Y V +GT
Sbjct: 30 VHPPSPSPLESIIALARADDARLLFLSS--KAASSGGVTSAPVASGQTPPSYVVRAGLGT 87
Query: 106 PMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC-------- 157
P++ L DT++ W+ C PC C + F P +S++Y+ +PC C
Sbjct: 88 PVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPC 145
Query: 158 ------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
+P C +++ + L S +R G + AFGC
Sbjct: 146 PANQDASAPLP----ACAFSKPFADTSFQASLGSDT-----LRLGKDAIAGYAFGCVGAV 196
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRR 269
+G G+LG P+SL SQ + G+FSYCL R + ++ G A +
Sbjct: 197 AGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLG--AAGQP 254
Query: 270 RDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
R++ TP+L + RP +Y+++ +S+GR V+ P G+F G +ID+GT +T
Sbjct: 255 RNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRW 314
Query: 329 RNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEA-DY 385
Y L + + Q+ G Y + FD C+ D + P +T H+ D
Sbjct: 315 TAPVYAALREEFRRQVAAPSG-----YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDL 369
Query: 386 IVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ EN C+A+ + P+ +++ QQQN+ ++ D+ + F E
Sbjct: 370 TLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREP 429
Query: 441 C 441
C
Sbjct: 430 C 430
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 144/351 (41%), Gaps = 36/351 (10%)
Query: 37 IPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNA-FQELEDIHLPM----A 91
+P+ P P S + + +AR +++ +K + L D+ +P A
Sbjct: 62 MPLAHRHGPCAPATTSSWPSLAERLRRDRARRDHITRKAKASGRTTTLSDVSIPTSLGAA 121
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSE 149
L Y V + IGTP Q +L DT S L W QC+PC C+ Q P++DP AS+TY+
Sbjct: 122 VDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAP 181
Query: 150 IPCDDPLCRS------PFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
+PCD C+ C N C Y Y D T G+ S ET +
Sbjct: 182 VPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQ---VS 238
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVI 259
V FGC G G+LG +P SL SQ G FSYCL T +
Sbjct: 239 VKDFGFGCGLVQQGTFD--LFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTTGFL 296
Query: 260 KFGRDADVRRRDLET---TPILLSDLRPHFYL-HLLEISIGRHIVRFPPGAFDIMRDGTG 315
G A D TP+ + FYL +L +S+G + PP +G
Sbjct: 297 ALG--APTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVL------SG 348
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY 366
G IID+GT +T + + Y L + + + +P N D CY +
Sbjct: 349 GMIIDSGTIITGLPDTAYSALRTAFRTAMSA--YPLLPPNNDDVLDTCYNF 397
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 166/374 (44%), Gaps = 39/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ IGTP K ++ DT S ++W C C RC ++ ++D +ASTT +
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213
Query: 151 PCDDPLCR---SPF-KCQNG-KCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP--- 201
CDD C P C+ G +C+Y+ Y G T G ++ + + F P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 273
Query: 202 RLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
+ FGC N SG + + GILGF + S+ SQL +++ +FS+CL ++
Sbjct: 274 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGG 332
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM-RDGTGG 316
+ G +V + TP++ + + H+ + + EI +G + P AF+ R GT
Sbjct: 333 IFAIG---EVVEPKVNITPLVQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT-- 385
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
IID+GT + + Y L+++ L R A FDY D F P++
Sbjct: 386 -IIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDDGF---PTV 440
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDL 429
T H ++ + + Y + +C+ Q D ++LG N L++YDL
Sbjct: 441 TLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 500
Query: 430 NVPALRFGSENCAN 443
+ + NC++
Sbjct: 501 EKQGIGWVEYNCSS 514
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 103/207 (49%), Gaps = 17/207 (8%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSP-- 160
+G P + + DT S L+W QC PC C++QT PIFDP S TY + D P+C +
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 161 FKCQNG--KCVYTRRYHVGDVTRGLASRETFAF--PVRNGFTFVPRLAFGCSNDNSGFAF 216
C+ G C Y Y G T+G S + FAF P R V L FGCS+D
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRT-IVEVGYLTFGCSHDTKA-RL 180
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV--REMEATSVIKFGRDADVRRRDLET 274
G +G++G N P SL SQL+ + FSYC+V + + S + FG A +
Sbjct: 181 KGHQAGVVGLNRHPNSLVSQLKVK---KFSYCMVIPDDHGSGSRMYFGSRAVILGGK--- 234
Query: 275 TPILLSDLRPHFYLHLLEISIGRHIVR 301
TP+L D H+++ L IS+G R
Sbjct: 235 TPLLKGDYS-HYFVTLKGISVGEEKGR 260
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 57/118 (48%), Gaps = 7/118 (5%)
Query: 124 QCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC--RSPFKCQ--NGKCVYTRRYHVGDV 179
+ Q +CF+QT PIFDP S+TYS +P D P C + C C Y Y G
Sbjct: 327 EAQEVAQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGST 386
Query: 180 -TRGLASRETFAFP-VRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSS 235
T G S + FAF R V L FGCS+ +G F G GI+G N LSL S
Sbjct: 387 STEGTISIDAFAFEDNRQNMVDVXHLVFGCSDYTTG-TFKGYEVGIVGLNQDSLSLVS 443
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 152/362 (41%), Gaps = 37/362 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +GTP + + D ++ W C C C ++P F P S+TY +PC P
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSPQ 160
Query: 157 CR---SPFKCQNG---KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C SP C G C + Y + + +++ A N V FGC
Sbjct: 161 CAQVPSP-SCPAGVGSSCGFNLTY-AASTFQAVLGQDSLAL--EN--NVVVSYTFGCLRV 214
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVR 268
SG + + G++GF PLS SQ ++ +FSYCL R + +K G +
Sbjct: 215 VSGNSVPPQ--GLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPK 272
Query: 269 RRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
R ++TTP+L + RP +Y++++ I +G +V+ P A G IID GT T
Sbjct: 273 R--IKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 330
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ Y + + GR R P FD CY S P++TF A +
Sbjct: 331 LAAPVYAAVRDAFR------GRVRTPVAPPLGGFDTCYNVTVSV---PTVTFMFAGAVAV 381
Query: 387 VQPENMYFIEPDRGRF-CVAIQDDPK------YSILGAWQQQNMLIIYDLNVPALRFGSE 439
PE I G C+A+ P ++L + QQQN +++D+ + F E
Sbjct: 382 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 441
Query: 440 NC 441
C
Sbjct: 442 LC 443
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/295 (31%), Positives = 135/295 (45%), Gaps = 33/295 (11%)
Query: 168 CVYTRRYHVGDVTRGLASRETFAFP----VRNGFTFVPRLAFGCSNDNSGFAFGGKISGI 223
C Y Y G +T G+ + E F F T VP L FGC + N G G SGI
Sbjct: 22 CTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LGFGCGSVNVGSLNNG--SGI 78
Query: 224 LGFNASPLSLSSQLRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRD----LETTPIL 278
+GF +PLSL SQL R FSYCL S + FG +D D ++TTP+L
Sbjct: 79 VGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLL 135
Query: 279 LSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLM 337
S P FY +H +++G +R P AF + DG+GG I+D+GT +T + ++
Sbjct: 136 QSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVV 195
Query: 338 QRYDQILRSLGRQRIPY--NASQEFDYCYRYDSSFK--------AYPSMTFHLQEADYIV 387
+ + Q L R+P+ + E C+ ++++ P M H Q AD +
Sbjct: 196 RAFRQQL------RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDL 249
Query: 388 QPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
N + RGR C+ + D S +G QQ+M ++YDL L C
Sbjct: 250 PRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 304
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 159/360 (44%), Gaps = 40/360 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + ++GTP + L DT S L+W +C C RC + + + P S+++S++PC L
Sbjct: 81 YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140
Query: 157 CRSPFKCQNGKCVYTR--------RYHVG------DVTRGLASRETFAFPVRNGFTFVPR 202
CR+ C TR RY G T+G ETF G V
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTL----GSDAVQG 196
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
+ FGC+ + G SG++G LSL QL+ G FSYCL + +S + FG
Sbjct: 197 IGFGCTTMSEGGYG--SGSGLVGLGRGKLSLVRQLK---VGAFSYCLTSDPSTSSPLLFG 251
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
A + +++TP++ + ++L ISIG PG G G I D+G
Sbjct: 252 AGA-LTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKT---PGT------GRHGIIFDSG 301
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQE 382
T +TF+ Y + + ++ R+P + ++ C++ S +PSM H
Sbjct: 302 TTLTFLAEPAYT--LAEAGLLSQTTNLTRVP--GTDGYEVCFQ-TSGGAVFPSMVLHFDG 356
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D ++ EN YF + C +Q P + SI+G Q + I YDL+ L F NC
Sbjct: 357 GDMALKTEN-YFGAVNDSVSCWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 114/432 (26%), Positives = 185/432 (42%), Gaps = 48/432 (11%)
Query: 31 GFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLP 89
G +L+++ ++SP SP P LS E + +M KAR +++S+ + +
Sbjct: 36 GSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSVVPIASGR-- 93
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
Q+ Y V IGTP + + DT+S + W C C+ C ++ +F+ ASTTY
Sbjct: 94 QIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKS 150
Query: 150 IPCDDPLCR------SPF----------KCQNGKCVYTRRYHVGDVTRGLASRETFAFPV 193
+ C C+ SP C G C + Y + L S++T
Sbjct: 151 LGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANL-SQDTITLAT 209
Query: 194 RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VR 251
VP +FGC +G + + LG SL SQ +N Q FSYCL +
Sbjct: 210 DA----VPGYSFGCIQKATGGSLPAQGLLGLGRGPL--SLLSQTQNLYQSTFSYCLPSFK 263
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIM 310
+ + ++ G +R ++ TP+L + RP Y ++L+ + +GR +V PPG+F
Sbjct: 264 SLNFSGSLRLGPVGQPKR--IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFN 321
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF 370
G I D+GT T + Y + D +GR + + FD CY
Sbjct: 322 PSTGAGTIFDSGTVFTRLVTPAY---IAVRDAFRNRVGRN-LTVTSLGGFDTCYTVP--- 374
Query: 371 KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLI 425
A P++TF + + P+N+ C+A+ P +++ QQQN +
Sbjct: 375 IAAPTITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRL 434
Query: 426 IYDLNVPALRFG 437
+YD VP R G
Sbjct: 435 LYD--VPNSRLG 444
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 107/423 (25%), Positives = 161/423 (38%), Gaps = 90/423 (21%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP------------------- 137
Y V +GTP +P L+ DT S L W +C D P
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHR----HDHDAPAPGYGYAAPASNDSSTSSL 162
Query: 138 ------------IFDPRASTTYSEIPCDDPLCRS--PFKCQ-----NGKCVYTRRYHVGD 178
+F P S T++ IPC C + PF C Y RY G
Sbjct: 163 SAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGS 222
Query: 179 VTRGLASRE--TFAFPVRNGF-----TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
RG + T A R + + GC+ +G +F G+L S +
Sbjct: 223 AARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLAS-DGVLSLGYSNI 281
Query: 232 SLSSQLRNRIQGLFSYCLVREM---EATSVIKFGRDADVRRR------------------ 270
S +S+ R G FSYCLV + ATS + FG + V
Sbjct: 282 SFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPP 341
Query: 271 ---DLETTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TP+LL +RP + + + IS+ ++R P +D+ + GG I+D+GT +T
Sbjct: 342 GPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKG--GGAILDSGTSLT 399
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK------AYPSMTFHL 380
+ + Y+ ++ ++ L L R + FDYCY + S A P + H
Sbjct: 400 VLVSPAYRAVVAALNKKLAGLPRVTM-----DPFDYCYNWTSPSTGEDLTVAMPELAVHF 454
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ + P Y I+ G C+ +Q+ P S++G QQ L +DL LRF
Sbjct: 455 AGSARLQPPAKSYVIDAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKR 514
Query: 439 ENC 441
C
Sbjct: 515 SRC 517
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 126/485 (25%), Positives = 189/485 (38%), Gaps = 76/485 (15%)
Query: 9 LAAFFSYFSVLFLTHFTSSESTGFSLKLIPIF-----SPESPLYPGNLSQSERIHKMFEI 63
+ F S+L FTSS +L L P+ S P + + S + + +
Sbjct: 10 IITVFLLLSLLSHIAFTSSNPNTITLPLSPLLIKPHSSDSDPFHSLKFAASASLTRAHHL 69
Query: 64 SKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
K R N S++ A+ K YS+++N+GTP + + DT SSLVW
Sbjct: 70 -KHRNNNSPSVATTPAY----------PKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWF 118
Query: 124 QCQP---CIRC----FDQTT-PIFDPRASTTYSEIPCDDPLC------RSPFKCQNGK-- 167
C C C D T P F P+ S+T + C +P C F+C K
Sbjct: 119 PCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQCKPE 178
Query: 168 ---C-----VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGK 219
C Y +Y +G T G + FP + VP+ GCS +
Sbjct: 179 SQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFPGKT----VPQFLVGCS-----ILSIRQ 228
Query: 220 ISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE------MEATSVIKFGRDADVRRRDLE 273
SGI GF SL SQ+ + FSYCLV + V++ D + L
Sbjct: 229 PSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLS 285
Query: 274 TTPILLS------DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
TP + + ++YL L ++ +G V+ P + DG GG I+D+G+ TF
Sbjct: 286 YTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTF 345
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADYI 386
+ Y + Q + + L + C+ +P +TF + +
Sbjct: 346 MERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKM 405
Query: 387 VQPENMYF-IEPDRGRFCVAIQDD-----PKYS----ILGAWQQQNMLIIYDLNVPALRF 436
QP YF + D C+ + D PK + ILG +QQQN I YDL F
Sbjct: 406 TQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGF 465
Query: 437 GSENC 441
G +C
Sbjct: 466 GPRSC 470
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 168/385 (43%), Gaps = 45/385 (11%)
Query: 84 EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC---IRCFDQTTPIFD 140
+D+ + + Y + VN+G+P + + DT S LVW +C+ T FD
Sbjct: 88 DDVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFD 147
Query: 141 PRASTTYSEIPCDDPLCRSPFK--CQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
P S+TY + C C + + C +G C Y Y G T G+ S ETF F G
Sbjct: 148 PSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFD-DGGS 206
Query: 198 TFVPR------LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCL 249
PR + FGCS +G +F G++G +SL +QL + FSYCL
Sbjct: 207 GRSPRQVRVGGVKFGCSTATAG-SF--PADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL 263
Query: 250 V-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFD 308
V + A+S + FG ADV +TP++ D+ ++ + L + +G V
Sbjct: 264 VPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTV-------- 315
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE--FDYCY-- 364
+ I+D+GT +TF+ L D++ R R +P S + CY
Sbjct: 316 -ASAASSRIIVDSGTTLTFLDP---SLLGPIVDELSR---RITLPPVQSPDGLLQLCYNV 368
Query: 365 --RYDSSFKAYPSMTFHL-QEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAW 418
R + ++ P +T A ++PEN F+ G C+AI + SILG
Sbjct: 369 AGREVEAGESIPDLTLEFGGGAAVALKPENA-FVAVQEGTLCLAIVATTEQQPVSILGNL 427
Query: 419 QQQNMLIIYDLNVPALRFGSENCAN 443
QQN+ + YDL+ + F +CA
Sbjct: 428 AQQNIHVGYDLDAGTVTFAGADCAG 452
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 40/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ IGTP + ++ DT S ++W C C C +++ ++D + S T +
Sbjct: 97 LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156
Query: 151 PCDDPLCRS-----PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFV 200
CD C + P C N C YT Y G + G R+ + +G +
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSAN 216
Query: 201 PRLAFGCSNDNSG-FAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREMEATS 257
+ FGCS SG + + GILGF S S+ SQL + +++ +F++CL +
Sbjct: 217 GSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL-DGLNGGG 275
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ G + + + TTP++ + + H+ +++ + +G + + P FD+ G
Sbjct: 276 IFAIGH---IVQPKVNTTPLVPN--QTHYNVNMKAVEVGGYFLNLPTDVFDV--GDKKGT 328
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSM 376
IID+GT + ++ Y L+ + L I +F C++Y S +P++
Sbjct: 329 IIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTI----HDQFT-CFQYSESLDDGFPAV 383
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDL 429
TFH + + Y+ + Y D G +C+ Q D ++LG N L++YDL
Sbjct: 384 TFHFENSLYLKVHPHEYLFSYD-GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDL 442
Query: 430 NVPALRFGSENCAN 443
+ + NC++
Sbjct: 443 ENQVIGWTEYNCSS 456
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 163/371 (43%), Gaps = 36/371 (9%)
Query: 97 YSVEVNIGTPMKPQH--LLFDTASSLVWTQCQPCIRCFDQTTP----IFDPRASTTYSEI 150
Y V + IGTP +PQ L+ DT S L W C+ + + P +F S+++ I
Sbjct: 119 YFVSIRIGTP-RPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTI 177
Query: 151 PCDDPLCR-------SPFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-V 200
PC C+ S +C N C++ RY G G+ + ET + + +
Sbjct: 178 PCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRL 237
Query: 201 PRLAFGCS---NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
+ GC+ N+ +GF G++G SL+ +L FSYCLV + +++
Sbjct: 238 FDVLIGCTESFNETNGFP-----DGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSN 292
Query: 258 ---VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ FG +++ ++ T +LL + + +++ IS+G ++ +++ G
Sbjct: 293 HKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVT--GV 350
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--A 372
GG I+D+GT +T + Y ++ I ++ +P + ++C+ D F A
Sbjct: 351 GGMIVDSGTSLTMLAGEAYDKVVDALKPIFDK-HKKVVPIELPELNNFCFE-DKGFDRAA 408
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLN 430
P + H + P Y I+ G C+ I D P SILG QQN L YDL
Sbjct: 409 VPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLWEYDLG 468
Query: 431 VPALRFGSENC 441
L FG +C
Sbjct: 469 RGKLGFGPSSC 479
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 122/397 (30%), Positives = 171/397 (43%), Gaps = 61/397 (15%)
Query: 91 AKQDLFYSVEVNIGTPMKPQH--LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
A D Y + ++IGTP +PQ L DT S LVWTQC C CF Q P FD AS T
Sbjct: 94 ADIDSEYLIHLSIGTP-RPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTL 151
Query: 149 EIPCDDPLCRS---PFK-C--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG------ 196
+PC DP+C S P C + C Y Y +T G +TF F G
Sbjct: 152 AVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKA 211
Query: 197 --FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME 254
VP + FGC N G F SGI GF+ P+SL SQL+ FS+C +
Sbjct: 212 HAGVAVPNVRFGCGQYNKGI-FKSNESGIAGFSRGPMSLPSQLK---VARFSHCFTAIAD 267
Query: 255 A-TSVIKFGRDADVRRRD------LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
A TS + G +++TP S+ +YL L I++G+ R P A
Sbjct: 268 ARTSPVFLGGAPGPDNLGAHATGPVQSTPFANSN-GSLYYLTLKGITVGK--TRLPLNAL 324
Query: 308 DIM----RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY----NASQE 359
G+GG IID+GT + + Y++L + + R ++P A E
Sbjct: 325 AFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAF------VARVKLPVANESAADAE 378
Query: 360 FDYCYRYDSS--------FKAYPSMTFHLQEADYIVQPEN--MYFIEPDRGR---FCVAI 406
C+ S A P + H+ AD+ + E+ + +E + G C+ +
Sbjct: 379 STLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVM 438
Query: 407 QD--DPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
D +I+G +QQQNM + YDL L F C
Sbjct: 439 NSAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARC 475
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 163/386 (42%), Gaps = 51/386 (13%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRAST 145
+P+ Y IGTP + + D + LVWTQC C CF Q P+FDP AS
Sbjct: 53 VPLHWSGAHYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASN 112
Query: 146 TYSEIPCDDPLCRS-PFK-CQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
TY C PLC+S P + C +G+C Y GD T G+AS + A G R
Sbjct: 113 TYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSMFGD-TFGIASTDAIAIGNAEG-----R 166
Query: 203 LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA-TSVI 259
LAFGC + G G SG +G +P SL Q FSYCL S +
Sbjct: 167 LAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTA---FSYCLALHGPGKKSAL 223
Query: 260 KFGRDADVRRRDLET--TPIL------LSD--LRPHFYLHLLEISIGRHIVRFPP---GA 306
G A + TP+L SD P++ + L I G V GA
Sbjct: 224 FLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGA 283
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY 366
+++ ++T P++++ + YQ L + + +LG + N + FD C++
Sbjct: 284 ITVLQ-------LETFRPLSYLPDAAYQALEKV---VTAALGSPSM-ANPPEPFDLCFQ- 331
Query: 367 DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR--FCVAI-------QDDPKYSILGA 417
+++ P + F Q + + Y + G C++I D SILG+
Sbjct: 332 NAAVSGVPDLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGS 391
Query: 418 WQQQNMLIIYDLNVPALRFGSENCAN 443
Q+N+ ++DL L F +C++
Sbjct: 392 LLQENVHFLFDLEKETLSFEPADCSS 417
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 154/373 (41%), Gaps = 42/373 (11%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L+ IGTP +P + D A LVWTQC C RCF Q P+F P AS+T+ PC
Sbjct: 41 LYNVANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGT 100
Query: 155 PLCRS--PFKCQNGKCVYTRRYHV---GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C+S C C Y ++ T G+ ETFA T LAFGC
Sbjct: 101 DACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIG-----TATASLAFGCVV 155
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVR 268
+ G SG +G +P SL +Q++ FSYCL R +S + G A +
Sbjct: 156 ASDIDTMDG-TSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAKLA 211
Query: 269 RRDLETTPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG-FIIDTGT 323
+ +T + D H+YL L+ +R G I +GG ++ T +
Sbjct: 212 GGESTSTAPFIKTSPDDDSHHYYLLSLD------AIR--AGNTTIATAQSGGILVMHTVS 263
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK--AYPSMTFHLQ 381
P + + + Y+ + + + Q FD C++ + F P + F Q
Sbjct: 264 PFSLLVDSAYRAFKKAVTEAVGGA-AAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQ 322
Query: 382 EAD-YIVQPENMYFIE--PDRGRFCVAIQDDPK--------YSILGAWQQQNMLIIYDLN 430
+ P Y I+ ++ C AI + S+LG+ QQ+N+ +YDL
Sbjct: 323 GGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLK 382
Query: 431 VPALRFGSENCAN 443
L F +C++
Sbjct: 383 KETLSFEPADCSS 395
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 115/458 (25%), Positives = 191/458 (41%), Gaps = 43/458 (9%)
Query: 7 LPLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKA 66
PL++ FS L L ++ GF LI SPESP Y NL+ E + S+A
Sbjct: 22 FPLSSSFS----LPLKELAKGKAYGFKAPLIHWSSPESPFYEPNLTPGELMRASVRTSRA 77
Query: 67 RANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
R + + + + ++ D Y ++ NIG+P + + DT S++VW QC
Sbjct: 78 RGDRIRKIRSSGISNSRKYPVSRISIIDKVYVMKFNIGSPPVETYAIPDTGSNIVWIQCG 137
Query: 127 P--CIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPF---------KCQNGKCVYTRRYH 175
C C+ Q P+F+P S+TY+ C C+ K C Y Y
Sbjct: 138 SPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKSSVQVCRYHISYE 197
Query: 176 VGDVTRGLASRETFAFP--VRNGFTFVPRLAFGCSNDNSGFAFGGKIS----GILGFNAS 229
+ G S + FP + + R+ FGC +NS S G++G
Sbjct: 198 DHSFSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNE 257
Query: 230 PLSLSSQLRNRIQGLFSYCL----VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH 285
SL QL G FSYC+ V++ T I+FG A + + L ++L
Sbjct: 258 MASLVGQLT---LGQFSYCISTPDVQKPNGTIEIRFGLAASISGH----STALANNLEGW 310
Query: 286 FYLHLLE-ISIGRHIVR-FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLM-QRYDQ 342
+ ++ I + V+ +P F G GG I+D+GT T + L+ + +Q
Sbjct: 311 YIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGLIMDSGTTYTELYFSALDALIGELKEQ 370
Query: 343 ILRSLGRQRIPYNASQEFDYCYRYDSSFKAY-PSMTFHL---QEADYIVQPENMYFIEPD 398
I + Q +++ + CY + Y P++ +EA + N + I+
Sbjct: 371 IELAPDTQD---HSNSNYSLCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAW-IDNG 426
Query: 399 RGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
++C+A+ SI+G +Q +++ I YDL + F
Sbjct: 427 NDQYCLAMFGTSGISIIGIYQHRDIKIGYDLKYNLVSF 464
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 164/372 (44%), Gaps = 40/372 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ IGTP + ++ DT S ++W C C C +++ ++D + S T +
Sbjct: 97 LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156
Query: 151 PCDDPLCRS-----PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFV 200
CD C + P C N C YT Y G + G R+ + +G +
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSAN 216
Query: 201 PRLAFGCSNDNSG-FAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREMEATS 257
+ FGCS SG + + GILGF S S+ SQL + +++ +F++CL +
Sbjct: 217 GSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL-DGLNGGG 275
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ G + + + TTP++ + + H+ +++ + +G + + P FD+ G
Sbjct: 276 IFAIGH---IVQPKVNTTPLVPN--QTHYNVNMKAVEVGGYFLNLPTDVFDV--GDKKGT 328
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSM 376
IID+GT + ++ Y L+ + L I +F C++Y S +P++
Sbjct: 329 IIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTI----HDQFT-CFQYSESLDDGFPAV 383
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDL 429
TFH + + Y+ + Y D G +C+ Q D ++LG N L++YDL
Sbjct: 384 TFHFENSLYLKVHPHEYLFSYD-GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDL 442
Query: 430 NVPALRFGSENC 441
+ + NC
Sbjct: 443 ENQVIGWTEYNC 454
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 107/419 (25%), Positives = 174/419 (41%), Gaps = 35/419 (8%)
Query: 31 GFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLP 89
G +LK+ IFS SP P +S E + + +AR Y +S+ + +
Sbjct: 32 GSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSVVPIASAR-- 89
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
Q Y V+ GTP + L DT+S W C C+ C T+ F P ST++
Sbjct: 90 QIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC--STSKPFAPIKSTSFRN 147
Query: 150 IPCDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
+ C P C+ +P C C + Y + + ++T +P FG
Sbjct: 148 VSCGSPHCKQVPNP-TCGGSACAFNFTYGSSSIAASVV-QDTLTLATDP----IPGYTFG 201
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRD 264
C N +G + + LG SL SQ +N + FSYCL + + + ++ G
Sbjct: 202 CVNKTTGSSAPQQGLLGLGRGPL--SLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259
Query: 265 ADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
+R ++ TP+L + R +Y++L+ I +GR IV PP A G I D+GT
Sbjct: 260 YQPKR--IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGT 317
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEA 383
T + Y + + R +G ++P FD CY P++TF
Sbjct: 318 VFTRLAEPVYTAVRNEFR---RRVG-PKLPVTTLGGFDTCYNVP---IVVPTITFLFSGM 370
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFG 437
+ + P+N+ C+A+ P +++ QQQN +++D VP R G
Sbjct: 371 NVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFD--VPNSRIG 427
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 149/373 (39%), Gaps = 51/373 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC--- 152
Y V++ +GTP K ++ DT SSL W QCQPC I C Q PIF P S TY +PC
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSS 172
Query: 153 ----------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+ P C + G CVY Y + G S++ T P
Sbjct: 173 QCSSLKSSTLNAPGCSN----ATGACVYKASYGDTSFSIGYLSQDV--------LTLTPS 220
Query: 203 LA------FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT 256
A +GC DN G G+ SGI+G +S+ QL + FSYCL A
Sbjct: 221 EAPSSGFVYGCGQDNQGLF--GRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAP 278
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPH------FYLHLLEISIGRHIVRFPPGAFDIM 310
+ + L ++P + L + ++L L I++ + ++++
Sbjct: 279 NSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVP 338
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SS 369
IID+GT +T + Y L + + I+ Q ++ D C++
Sbjct: 339 T------IIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSI---LDTCFKGSVKE 389
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYD 428
P + + + + +E ++G C+AI SI+G +QQQ + YD
Sbjct: 390 MSTVPEIQIIFRGGAGLELKAHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYD 449
Query: 429 LNVPALRFGSENC 441
+ + F C
Sbjct: 450 VANFKIGFAPGGC 462
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 96/421 (22%), Positives = 181/421 (42%), Gaps = 53/421 (12%)
Query: 56 RIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSV-----EVNIGTPMKPQ 110
++ F + + + S + L I LP+ SV ++ +G+P K
Sbjct: 28 KVQHKFAGKEKKLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEY 87
Query: 111 HLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRSPFKCQN 165
H+ DT S ++W C+PC C +T +FD AS+T ++ CDD C F Q+
Sbjct: 88 HVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCS--FISQS 145
Query: 166 GKC--VYTRRYHV---------GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG- 213
C YH+ G+ R + E ++ G + FGC +D SG
Sbjct: 146 DSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTG-PLGQEVVFGCGSDQSGQ 204
Query: 214 -FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRR 270
+ G++GF S S+ SQL + +FS+CL ++ + G V
Sbjct: 205 LGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DNVKGGGIFAVGV---VDSP 260
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++TTP++ + + H+ + L+ + + + PP IMR+ GG I+D+GT + +
Sbjct: 261 KVKTTPMVPNQM--HYNVMLMGMDVDGTALDLPP---SIMRN--GGTIVDSGTTLAYFPK 313
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQP 389
Y +L++ L RQ + + ++ C+ + + A+P ++F +++ +
Sbjct: 314 VLYDSLIETI------LARQPVKLHIVEDTFQCFSFSENVDVAFPPVSFEFEDSVKLTVY 367
Query: 390 ENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ Y ++ +C Q + + +LG N L++YDL + + NC+
Sbjct: 368 PHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNCS 427
Query: 443 N 443
+
Sbjct: 428 S 428
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 107/419 (25%), Positives = 174/419 (41%), Gaps = 35/419 (8%)
Query: 31 GFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLP 89
G +LK+ IFS SP P +S E + + +AR Y +S+ + +
Sbjct: 32 GSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSVVPIASAR-- 89
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
Q Y V+ GTP + L DT+S W C C+ C T+ F P ST++
Sbjct: 90 QIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC--STSKPFAPIKSTSFRN 147
Query: 150 IPCDDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
+ C P C+ +P C C + Y + + ++T +P FG
Sbjct: 148 VSCGSPHCKQVPNP-TCGGSACAFNFTYGSSSIAASVV-QDTLTLAADP----IPGYTFG 201
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRD 264
C N +G + + LG SL SQ +N + FSYCL + + + ++ G
Sbjct: 202 CVNKTTGSSAPQQGLLGLGRGPL--SLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259
Query: 265 ADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
+R ++ TP+L + R +Y++L+ I +GR IV PP A G I D+GT
Sbjct: 260 YQPKR--IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGT 317
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEA 383
T + Y + + R +G ++P FD CY P++TF
Sbjct: 318 VFTRLAEPVYTAVRNEFR---RRVG-PKLPVTTLGGFDTCYNVP---IVVPTITFLFSGM 370
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFG 437
+ + P+N+ C+A+ P +++ QQQN +++D VP R G
Sbjct: 371 NVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFD--VPNSRIG 427
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 118/434 (27%), Positives = 197/434 (45%), Gaps = 52/434 (11%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISK--ARANYMASMSKPNAFQELEDIHL 88
G +L++ F P SPL PG + S + S+ +R Y+ S+ A + +
Sbjct: 43 GNTLQVSHAFGPCSPLGPGTAAPSWAGFLADQASRDASRLLYLDSL----AVRGRARAYA 98
Query: 89 PMAK-----QDLFYSVEVNIGTPMKPQHLLF--DTASSLVWTQCQPCIRCFDQTTPIFDP 141
P+A Q Y V ++GTP PQ LL DT++ W C C C + FDP
Sbjct: 99 PIASGRQLLQTPTYVVRASLGTP--PQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDP 156
Query: 142 RASTTYSEIPCDDPLC-RSP-FKCQNG--KCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
+S +Y +PC PLC ++P C G C ++ Y + L S+++ A
Sbjct: 157 ASSASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAAL-SQDSLAV----AG 211
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEA 255
V FGC +G A + LG S SQ ++ + FSYCL + +
Sbjct: 212 NAVKAYTFGCLQRATGTAAPPQGLLGLGRGPL--SFLSQTKDMYEATFSYCLPSFKSLNF 269
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ ++ GR+ +R ++TTP+L + R +Y+++ I +GR +V P AFD T
Sbjct: 270 SGTLRLGRNGQPQR--IKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP--AFD---PAT 322
Query: 315 G-GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
G G ++D+GT T + Y + D++ R +G P ++ FD C ++++ A+
Sbjct: 323 GAGTVLDSGTMFTRLVAPAYVAV---RDEVRRRVG---APVSSLGGFDTC--FNTTAVAW 374
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRF-CVAIQDDPK-----YSILGAWQQQNMLIIY 427
P +T L + + PE I G C+A+ P +++ + QQQN +++
Sbjct: 375 PPVTL-LFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLF 433
Query: 428 DLNVPALRFGSENC 441
D+ + F E C
Sbjct: 434 DVPNGRVGFARERC 447
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 159/372 (42%), Gaps = 37/372 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +G+P ++ DT S ++W C C C FD S T +
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSV 158
Query: 151 PCDDPLCRSPFKC------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-- 202
C DP+C S F+ +N +C Y+ RY G T G +TF F G + V
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSS 218
Query: 203 --LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEAT 256
+ FGCS SG + GI GF LS+ SQL +R +FS+CL +
Sbjct: 219 APIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGG 278
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
V G ++ + +P+L S +PH+ L+LL I + I+ F+ T G
Sbjct: 279 GVFVLG---EILVPGMVYSPLLPS--QPHYNLNLLSIGVNGQILPIDAAVFE--ASNTRG 331
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPS 375
I+DTGT +T++ Y + + L I N Q CY +S +P
Sbjct: 332 TIVDTGTTLTYLVKEAYDPFLNAISNSVSQL-VTLIISNGEQ----CYLVSTSISDMFPP 386
Query: 376 MTFHLQ-EADYIVQPENMYF---IEPDRGRFCVAIQDDP-KYSILGAWQQQNMLIIYDLN 430
++ + A +++P++ F +C+ Q P + +ILG ++ + +YDL
Sbjct: 387 VSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLA 446
Query: 431 VPALRFGSENCA 442
+ + + +C+
Sbjct: 447 RQRIGWANYDCS 458
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 156/361 (43%), Gaps = 34/361 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y + ++GTP + + D S VW QC C C + P F S+T E+
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREV 155
Query: 151 PCDDPLCRS--PFKCQ--NGKCVYTRRYHVG--DVTRGLASRETFAFPVRNGFTFVPRLA 204
C + C+ P C + C Y+ Y G + T GL + + FAF +
Sbjct: 156 RCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI---- 211
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE--MEATSVIKFG 262
FGC+ A G I G++G LSL SQL+ G FSY L + ++ S I F
Sbjct: 212 FGCA-----VATEGDIGGVIGLGRGELSLVSQLQ---IGRFSYYLAPDDAVDVGSFILFL 263
Query: 263 RDADVRRRDLETTPILLSDL-RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
DA R +TP++ + R +Y+ L I + + P G FD+ DG+GG ++
Sbjct: 264 DDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSI 323
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL 380
PVTF+ G Y+ + Q + S R + D CY +S A PSM
Sbjct: 324 TIPVTFLDAGAYKVVR----QAMASKIGLRAADGSELGLDLCYTSESLATAKVPSMALVF 379
Query: 381 QEADYI-VQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFG 437
+ ++ N ++++ G C+ I P S+LG+ Q +IYD++ L F
Sbjct: 380 AGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVFE 439
Query: 438 S 438
S
Sbjct: 440 S 440
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 163/366 (44%), Gaps = 51/366 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +G+P K L+ DT S L W +C PC + FD AS TY + C D L
Sbjct: 124 YYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCS---PDCSSTFDRLASNTYKALTCADDL 180
Query: 157 CRSPFKCQNGKCVYTRRYHVGDVTR------GLASRETFAFPVRNGFTFVPRLAFGCSND 210
R P + ++ R +H G R G AS E FP GF FGC +
Sbjct: 181 -RLPVLLR----LWRRLFHSGRSLRDTLKMAGAASDELEEFP---GFV------FGCGSL 226
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK----FGRDA- 265
G G++ GIL + LS SQ+ + FSYCL+R+ S+ K FG A
Sbjct: 227 LKGL-ISGEV-GILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAV 284
Query: 266 ------DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
+ ++L+ TPI S + ++ + L IS+G + P F +D I
Sbjct: 285 ELKEPGSGKPQELQYTPIGESSI--YYTVRLDGISVGNQRLDLSPSTFLNGQDKPT--IF 340
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTF 378
D+GT +T + +G ++ Q ++ + A + D C+R SS + P +TF
Sbjct: 341 DSGTTLTMLPSGVCDSIKQSLASMVSG-----AEFVAIKGLDACFRVPPSSGQGLPDITF 395
Query: 379 HLQ-EADYIVQPENMYFIEPDRGRF-CVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
H AD++ +P N Y I D G C+ + SI G QQQ+ +++D++ + F
Sbjct: 396 HFNGGADFVTRPSN-YVI--DLGSLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRRIGF 452
Query: 437 GSENCA 442
+C
Sbjct: 453 KETDCG 458
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 114/437 (26%), Positives = 183/437 (41%), Gaps = 47/437 (10%)
Query: 16 FSVLFLTHFT-----SSESTGFSLKLIPIFSPESPLYP--GNLSQSERIHKMFEISKARA 68
FSV++L +S++ L +IPI+S SP P + S RI M R
Sbjct: 13 FSVIWLMRVNGIDPCASQADNSDLNVIPIYSKCSPFKPPKSDSSWDNRIINMASKDPLRF 72
Query: 69 NYMASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQ 124
Y++++ P+A F Y V V +GTP + ++ DT++ +
Sbjct: 73 KYLSTLVGQKTVSTA-----PIASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVP 127
Query: 125 CQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDV 179
C C C D T F P+AST+Y + C P C C G C + + Y
Sbjct: 128 CSGCTGCSDTT---FSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSF 184
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
+ L +R +P +FGC N +G + + L PLSL SQ +
Sbjct: 185 SATLVQDS-----LRLATDVIPNYSFGCVNAITGASVPAQGLLGL--GRGPLSLLSQSGS 237
Query: 240 RIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIG 296
G+FSYCL + + +K G + + + TTP+L S RP +Y++ IS+G
Sbjct: 238 NYSGIFSYCLPSFKSYYFSGSLKLGPVG--QPKSIRTTPLLRSPHRPSLYYVNFTGISVG 295
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
R +V FP + G IID+GT +T Y + + + + +G + +
Sbjct: 296 RVLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFR---KQVGGTT--FTS 350
Query: 357 SQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK----- 411
FD C+ A P +T H + D + EN C+A+ P
Sbjct: 351 IGAFDTCFVKTYETLA-PPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSV 409
Query: 412 YSILGAWQQQNMLIIYD 428
+++ +QQQN+ I++D
Sbjct: 410 LNVIANFQQQNLRILFD 426
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/430 (25%), Positives = 178/430 (41%), Gaps = 40/430 (9%)
Query: 29 STGFSLKLIPIFSPESPLYPGNL-SQSERIHKMFEISKARANYMA-SMSKPNAFQELEDI 86
STG ++ L + P SP+ + + ER+ + + RA Y+ S ++ +
Sbjct: 52 STGVTVPLHHRYDPCSPVPSKKVPTLEERLRR----DQLRAAYIKRKFSGAGDIEQSDAA 107
Query: 87 HLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPR 142
+P + L Y + V IG+P Q + DT S + W QC+PC +C + +FDP
Sbjct: 108 TVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPS 167
Query: 143 ASTTYSEIPCDDPLCRSPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNG 196
+S+TYS C C + Q G +C Y Y T G S +T G
Sbjct: 168 SSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTL----G 223
Query: 197 FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT 256
+ + FGCS SG F + G++G SL+SQ FSYCL ++
Sbjct: 224 SSAMTDFQFGCSQSESG-GFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSS 282
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTG 315
+ G + TP+L S P +Y+ LLE I +G + P F +
Sbjct: 283 GFLTLGTGSS----GFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVF------SA 332
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYP 374
G ++D+GT +T + Y L + + +Q P S D C+ + S + P
Sbjct: 333 GSLMDSGTIITRLPPTAYSALSSAFKAGM----QQYPPATPSGILDTCFDFSGQSSISIP 388
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNV 431
++T + + +E C+A DD I+G QQ+ ++YD+
Sbjct: 389 TVTLVFSGGAAVDLAFDGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGG 448
Query: 432 PALRFGSENC 441
A+ F + C
Sbjct: 449 GAVGFKAGAC 458
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 155/361 (42%), Gaps = 34/361 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y + ++GTP + + D S VW QC C C + P F S+T E+
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREV 155
Query: 151 PCDDPLCRS--PFKCQ--NGKCVYTRRYHVG--DVTRGLASRETFAFPVRNGFTFVPRLA 204
C + C+ P C + C Y+ Y G + T GL + + FAF +
Sbjct: 156 RCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI---- 211
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE--MEATSVIKFG 262
FGC+ A G I G++G LS SQL+ G FSY L + ++ S I F
Sbjct: 212 FGCA-----VATEGDIGGVIGLGRGELSPVSQLQ---IGRFSYYLAPDDAVDVGSFILFL 263
Query: 263 RDADVRRRDLETTPILLSDL-RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
DA R +TP++ S R +Y+ L I + + P G FD+ DG+GG ++
Sbjct: 264 DDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSI 323
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL 380
PVTF+ G Y+ + Q + S R + D CY +S A PSM
Sbjct: 324 TIPVTFLDAGAYKVVRQA----MASKIELRAADGSELGLDLCYTSESLATAKVPSMALVF 379
Query: 381 QEADYI-VQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFG 437
+ ++ N ++++ G C+ I P S+LG+ Q +IYD++ L F
Sbjct: 380 AGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVFE 439
Query: 438 S 438
S
Sbjct: 440 S 440
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/394 (23%), Positives = 176/394 (44%), Gaps = 53/394 (13%)
Query: 83 LEDIHLPMAKQDLFYSV-----EVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L I LP+ SV ++ +G+P K H+ DT S ++W C+PC +C +T
Sbjct: 55 LASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNL 114
Query: 137 ----PIFDPRASTTYSEIPCDDPLCRSPFKCQNGKC--VYTRRYHV---------GDVTR 181
+FD AS+T ++ CDD C F Q+ C YH+ G R
Sbjct: 115 NFRLSLFDMNASSTSKKVGCDDDFCS--FISQSDSCQPALGCSYHIVYADESTSDGKFIR 172
Query: 182 GLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQL-- 237
+ + E ++ G + FGC +D SG G + G++GF S S+ SQL
Sbjct: 173 DMLTLEQVTGDLKTG-PLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAA 231
Query: 238 RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
+ +FS+CL ++ + G V ++TTP++ + + H+ + L+ + +
Sbjct: 232 TGDAKRVFSHCL-DNVKGGGIFAVGV---VDSPKVKTTPMVPNQM--HYNVMLMGMDVDG 285
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ P I+R+ GG I+D+GT + + Y +L++ L RQ + +
Sbjct: 286 TSLDLPRS---IVRN--GGTIVDSGTTLAYFPKVLYDSLIETI------LARQPVKLHIV 334
Query: 358 QEFDYCYRYDSSF-KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DD 409
+E C+ + ++ +A+P ++F +++ + + Y + +C Q +
Sbjct: 335 EETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDER 394
Query: 410 PKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ +LG N L++YDL+ + + NC++
Sbjct: 395 SEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSS 428
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 114/449 (25%), Positives = 184/449 (40%), Gaps = 46/449 (10%)
Query: 16 FSVLFLTHFTS-----SESTGFSLKLIPIFSPESPLYPGNL-SQSERIHKMFEISKARAN 69
FSV++L + S+ L +IPI+S SP P + RI M R
Sbjct: 13 FSVMWLMRVNAIDPCASQPDNSDLNVIPIYSKCSPFKPPKADTWDNRIINMASKDPVRVK 72
Query: 70 YMASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
Y++++ P+A F Y V V +GTP + ++ DT++ + C
Sbjct: 73 YLSTLVSQKTVSTA-----PIASGQAFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPC 127
Query: 126 QPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDVT 180
C C D T F P+AST+Y + C P C C G C + + Y +
Sbjct: 128 SGCTGCSDTT---FSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS 184
Query: 181 RGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR 240
L +R +P +FGC N +G + + LG SL SQ +
Sbjct: 185 ATLVQDA-----LRLATDVIPYYSFGCVNAITGASVPAQGLLGLGRGPL--SLLSQSGSN 237
Query: 241 IQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGR 297
G+FSYCL + + +K G + + + TTP+L S RP +Y++ IS+GR
Sbjct: 238 YSGIFSYCLPSFKSYYFSGSLKLGPVG--QPKSIRTTPLLRSPHRPSLYYVNFTGISVGR 295
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+V FP + G IID+GT +T Y + + + + +G + +
Sbjct: 296 VLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFR---KQVGGTT--FTSI 350
Query: 358 QEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----Y 412
FD C+ A P +T H + D + EN C+A+ P
Sbjct: 351 GAFDTCFVKTYETLA-PPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVL 409
Query: 413 SILGAWQQQNMLIIYDLNVPALRFGSENC 441
+++ +QQQN+ I++D+ + E C
Sbjct: 410 NVIANFQQQNLRILFDIVNNKVGIAREVC 438
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/418 (24%), Positives = 174/418 (41%), Gaps = 59/418 (14%)
Query: 64 SKARANYMASMSKPNAFQELEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFDTAS 118
K AN A ++ + + LP+ L Y ++ IGTP K ++ DT S
Sbjct: 43 GKHLANLRAHDARRHGRSLAAAVDLPLGGNGLPTETGLYFTQIGIGTPAKSYYVQVDTGS 102
Query: 119 SLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRS------PFKCQNGK 167
++W C C C ++ ++DP S++ + + C C + P
Sbjct: 103 DILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAAP 162
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV----PRLAFGCSNDNSGFAFGGKIS-- 221
C Y+ Y G T G + + +G + + FGC G GG +
Sbjct: 163 CQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSITFGC-----GAKIGGDLGSS 217
Query: 222 -----GILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
GILGF S S+ SQL +++ +F++CL + + G DV + + T
Sbjct: 218 SQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCL-DTINGGGIFAIG---DVVQPKVST 273
Query: 275 TPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQ 334
TP++ PH+ ++L I +G ++ P FDI + G IID+GT + ++ Y
Sbjct: 274 TPLVPG--MPHYNVNLEAIDVGGVKLQLPTNIFDIGE--SKGTIIDSGTTLAYLPGVVYN 329
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQEA--------DY 385
+M + + G +P Q+F C+RY S +P +TFH + DY
Sbjct: 330 AIMSK---VFAQYG--DMPLKNDQDFQ-CFRYSGSVDDGFPIITFHFEGGLPLNIHPHDY 383
Query: 386 IVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ Q +Y + G + +D +LG N L++YDL + + NC++
Sbjct: 384 LFQNGELYCMGFQTGG--LQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNCSS 439
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 157/374 (41%), Gaps = 41/374 (10%)
Query: 67 RANYMASM-SKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
R +A + + P A + + ++ Q L Y IGTP +P + D LVWTQC
Sbjct: 27 RGRLLAGVDATPPAAGGAVAVPIYLSSQGL-YVANFTIGTPPQPVSAVVDLTGELVWTQC 85
Query: 126 QPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC----RSPFKCQNGKCVYTRRYHVGDVTR 181
PC CF+Q P+FDP S+T+ +PC LC S C + C+Y GD T
Sbjct: 86 TPCQPCFEQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTSDVCIYEAPTKAGD-TG 144
Query: 182 GLASRETFAFPVRNGFTFVPRLAFGCS--NDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
G A +TFA L FGC D GG SGI+G +P SL +Q+
Sbjct: 145 GKAGTDTFAIGAAK-----ETLGFGCVVMTDKRLKTIGGP-SGIVGLGRTPWSLVTQMNV 198
Query: 240 RIQGLFSYCLVREMEATSVIKFGRDAD-VRRRDLETTPILL--------SDLRPHFYLHL 290
FSYCL +++ + G A + +TP ++ + P++ + L
Sbjct: 199 TA---FSYCLAG--KSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKL 253
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ 350
I G ++ + + ++DT + +++ +G Y+ L + + ++G Q
Sbjct: 254 AGIKTGGAPLQAASSSGSTV-------LLDTVSRASYLADGAYKALKK---ALTAAVGVQ 303
Query: 351 RIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP 410
+ + + +D C+ + A P + F + P Y + G C+ I
Sbjct: 304 PV-ASPPKPYDLCFPKAVAGDA-PELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSSA 361
Query: 411 KYSILGAWQQQNML 424
++ G + ++L
Sbjct: 362 SLNLTGELEGASIL 375
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 151/356 (42%), Gaps = 28/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP K ++ DT SSL W QC PC + C Q+ P+F+PR+S++Y+ + C P
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAP 180
Query: 156 LCRS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C + P C C+Y Y + G S++T +F G T VP +GC
Sbjct: 181 QCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSF----GSTSVPNFYYGC 236
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL ++ +
Sbjct: 237 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSI---GSY 291
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
TP+ S L Y + + I++ + A+ + IID+GT +T
Sbjct: 292 NPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPT-----IIDSGTVIT 346
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ Y L + ++ R +A D C++ +S P ++ +
Sbjct: 347 RLPTDVYSALSKAVAGAMKGTPRA----SAFSILDTCFQGQASRLRVPQVSMAFAGGAAL 402
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 403 KLKATNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/399 (26%), Positives = 164/399 (41%), Gaps = 40/399 (10%)
Query: 71 MASMSKPNAFQELEDIHLPMA--KQDL---FYSVEVNIGTPMKPQHLL--FDTASSLVWT 123
+A+ KP +P+A +Q L Y +GTP PQ LL D ++ W
Sbjct: 69 LAAKPKPKPKGHSRHTFVPIAAGRQILRTPSYVARARLGTP--PQTLLVAIDPSNDAAWV 126
Query: 124 QCQPCIRCF-DQTTPIFDPRASTTYSEIPCDDPLCR----SPFKCQNG---KCVYTRRYH 175
C C+ C ++P FDP S+TY + C P C + C G C + Y
Sbjct: 127 PCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSY- 185
Query: 176 VGDVTRGLASRETFAFPVRNGFTFVP--RLAFGCSNDNSGFAFGGKISGILGFNASPLSL 233
+ ++ + NG VP FGC +G G++GF PLS
Sbjct: 186 ASSTLHAVLGQDALSLSDSNGAA-VPDDHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSF 244
Query: 234 SSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHL 290
SQ + +FSYCL + + ++ G RR ++TTP+L + RP +Y+ +
Sbjct: 245 LSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPAGQPRR--IKTTPLLSNPHRPSLYYVAM 302
Query: 291 LEISIGRHIVRFPPGAFDI-MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
+ + + V P A + G GG I+D GT T + Y L + R +
Sbjct: 303 VGVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFR---RGVSA 359
Query: 350 QRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQP-ENMYFIEPDRGRFCVAIQD 408
P A FD CY Y + K+ P++ F + P EN+ G C+A+
Sbjct: 360 PAAP--ALGGFDTCY-YVNGTKSVPAVAFVFAGGARVTLPEENVVISSTSGGVACLAMAA 416
Query: 409 DPK------YSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P ++L + QQQN +++D+ + F E C
Sbjct: 417 GPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELC 455
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 179/395 (45%), Gaps = 49/395 (12%)
Query: 83 LEDIHLPM------AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC----- 131
L + LP+ A+ L+++ ++ +G P K ++ DT S ++W C C +C
Sbjct: 63 LSAVDLPLGGNGHPAEAGLYFA-KIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSD 121
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCRSPFK------CQNGKCVYTRRYHVGDVTRGLAS 185
++DP++ST+ + I CDD C + + ++ C Y+ Y G T G
Sbjct: 122 LGVKLTLYDPQSSTSATRIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFV 181
Query: 186 RETFAFPVRNG----FTFVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL-- 237
++ F G + + FGC SG + GILGF + S+ SQL
Sbjct: 182 KDNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAA 241
Query: 238 RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
+++ +F++CL ++ + G +V + TTP++ + +PH+ + + EI +G
Sbjct: 242 AGKVKRVFAHCL-DNVKGGGIFAIG---EVVSPKVNTTPMVPN--QPHYNVVMKEIEVGG 295
Query: 298 HIVRFPPGAFDIM-RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+++ P FD R GT IID+GT + ++ Y+++M + I+ + + +
Sbjct: 296 NVLELPTDIFDTGDRRGT---IIDSGTTLAYLPEVVYESMMTK---IVSE--QPGLKLHT 347
Query: 357 SQEFDYCYRYDSSF-KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------D 408
+E C++Y + + +P + FH + + + Y + +C Q D
Sbjct: 348 VEEQFTCFQYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKD 407
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
++LG N L++YDL A+ + NC++
Sbjct: 408 GRDMTLLGDLVLSNKLVLYDLENQAIGWTDYNCSS 442
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 167/374 (44%), Gaps = 40/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ IGTP K ++ DT S ++W C C RC ++ ++D +ASTT +
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213
Query: 151 PCDDPLCR---SPF-KCQNG-KCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP--- 201
CDD C P C+ G +C+Y+ Y G T G ++ + + F P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 273
Query: 202 RLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
+ FGC N SG + + GILGF + S+ SQL +++ +FS+CL ++
Sbjct: 274 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGG 332
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM-RDGTGG 316
+ G +V + TP++ + + H+ + + EI +G + P AF+ R GT
Sbjct: 333 IFAIG---EVVEPKVNITPLVQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT-- 385
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
IID+GT + + Y L+++ L R A FDY D F P++
Sbjct: 386 -IIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDDGF---PTV 440
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDL 429
T H ++ + + Y + + +C+ Q D ++LG N L++YDL
Sbjct: 441 TLHFDKSISLTVYPHEYLFQHEF-EWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 499
Query: 430 NVPALRFGSENCAN 443
+ + NC++
Sbjct: 500 EKQGIGWVEYNCSS 513
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/407 (24%), Positives = 178/407 (43%), Gaps = 48/407 (11%)
Query: 64 SKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
S R +A+ P L + LP L+Y+ E+ IGTP K H+ DT S ++W
Sbjct: 57 SNRRGRLLAAADVP-----LGGLGLP-TDTGLYYT-EIEIGTPPKQYHVQVDTGSDILWV 109
Query: 124 QCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRS------PFKCQNGKCVYTR 172
C C +C ++ ++DP+ S++ S + CD C + P +N C Y+
Sbjct: 110 NCISCNKCPRKSDLGIDLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSV 169
Query: 173 RYHVGDVTRGLASRETFAFPVRNGFTFV----PRLAFGCSNDNSG--FAFGGKISGILGF 226
Y G T G ++ + +G + FGC G + + GI+GF
Sbjct: 170 MYGDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGF 229
Query: 227 NASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRP 284
S S+ SQL ++ +FS+CL ++ + G DV + +++TP L+ D+ P
Sbjct: 230 GQSNTSMLSQLAAAGEVKKIFSHCL-DTIKGGGIFAIG---DVVQPKVKSTP-LVPDM-P 283
Query: 285 HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQIL 344
H+ ++L I++G ++ P F+ G IID+GT +T++ Y+ D +
Sbjct: 284 HYNVNLESINVGGTTLQLPSHMFETGE--KKGTIIDSGTTLTYLPELVYK------DVLA 335
Query: 345 RSLGRQ-RIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFC 403
+ +++ Q+F + S +P +TFH ++ + + YF + +C
Sbjct: 336 AVFAKHPDTTFHSVQDFLCIQYFQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYC 395
Query: 404 VAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
Q D +LG N +++YDL + + NC++
Sbjct: 396 FGFQNGGLQSKDGKDMVLLGDLVLSNKVVVYDLENQVVGWTDYNCSS 442
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 111/441 (25%), Positives = 179/441 (40%), Gaps = 42/441 (9%)
Query: 26 SSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELED 85
S+E T LKL + L+P LS RI + + R + ++ K ++ D
Sbjct: 25 STEDTAVRLKL----AHRDTLWPNPLS---RIEDIIGADQKRHSLISRKRKFKGGVKM-D 76
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP--IFDPRA 143
+ + Y EV +GTP K ++ DT S L W C+ R + +F
Sbjct: 77 LGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEE 136
Query: 144 STTYSEIPCDDPLCR---------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR 194
S ++ + C C+ S + C Y RY G +G+ ++ET +
Sbjct: 137 SKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLT 196
Query: 195 NGFTFVPR-LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM 253
NG R L GCS+ S G+LG S S +S + SYCLV +
Sbjct: 197 NGRKARLRGLLVGCSSSFS-GQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHL 255
Query: 254 EATSV---IKFG---RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
++ + FG + TTP+ L+ + P + ++++ ISIG ++ P +
Sbjct: 256 SNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVW 315
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQ---TLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
D GG I+D+GT +T + Y+ T + RY L+ + + IP +YC+
Sbjct: 316 DATTG--GGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIP------IEYCF 367
Query: 365 RYDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD--DPKYSILGAWQQ 420
S F P +TFHL+ Y ++ G C+ P +++G Q
Sbjct: 368 SSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQ 427
Query: 421 QNMLIIYDLNVPALRFGSENC 441
QN L +DL L F C
Sbjct: 428 QNYLWEFDLMASTLSFAPSTC 448
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 123/432 (28%), Positives = 176/432 (40%), Gaps = 76/432 (17%)
Query: 69 NYMASMSKPNAFQ-ELEDIHLPMAKQDLF------YSVEVNIGTPMKPQHLLFDTASSLV 121
N++AS+S A + + K LF YS+ +N GTP + + DT SSLV
Sbjct: 57 NHLASLSLSRAHHIKSPKTKFSLLKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLV 116
Query: 122 WTQCQP---CIRC----FDQT-TPIFDPRASTTYSEIPCDDPLCRSPF------KCQ--- 164
W C C RC + T P F P+ S++ + I C + C F KCQ
Sbjct: 117 WFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQECD 176
Query: 165 --NGKCV-----YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
C Y +Y +G T GL ET FP + +P GCS F+
Sbjct: 177 PTTQNCTQSCPPYVIQYGLGS-TAGLLLSETLDFPHKKT---IPGFLVGCSL----FSI- 227
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR----EMEATS--VIKFGRDA-DVRRR 270
+ GI GF SP SL SQL + FSYCLV + A+S V+ G + D +
Sbjct: 228 RQPEGIAGFGRSPESLPSQLGLK---KFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTP 284
Query: 271 DLETTPIL---LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
L TP + R ++Y+ L I IG V+ P DG GG I+D+GT TF
Sbjct: 285 GLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTF 344
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY------CYRYDSSFK-AYPSMTFHL 380
+ Y+ + + ++ +Q Y + E C+ + P FH
Sbjct: 345 MEKPVYELVAKEFE-------KQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHF 397
Query: 381 QEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS---------ILGAWQQQNMLIIYDLNV 431
+ + P YF D G C+ I D ILG +QQ+N + +DL
Sbjct: 398 KGGAKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKN 457
Query: 432 PALRFGSENCAN 443
F +NC +
Sbjct: 458 ERFGFKQQNCVS 469
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 168/376 (44%), Gaps = 43/376 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y +V +GTP K ++ DT S ++W C C C Q++ + FD S+T +
Sbjct: 77 LYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNC-PQSSQLGIELNFFDTVGSSTAAL 135
Query: 150 IPCDDPLCRSPFKCQNG-------KCVYTRRYHVGDVTRGLASRETFAFPVRNG----FT 198
IPC DP+C S + +C YT +Y G T G + F + G
Sbjct: 136 IPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVN 195
Query: 199 FVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREME 254
+ FGCS SG + GI GF PLS+ SQL +R +FS+CL + +
Sbjct: 196 SSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGD 255
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
V+ ++ + +P++ S +PH+ L+L I++ ++ P F I +
Sbjct: 256 GGGVLVL---GEILEPSIVYSPLVPS--QPHYNLNLQSIAVNGQLLPINPAVFSI-SNNR 309
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAY 373
GG I+D GT + ++ Y L+ + + RQ + + + CY +S +
Sbjct: 310 GGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQ-----TNSKGNQCYLVSTSIGDIF 364
Query: 374 PSMTFHLQ-EADYIVQPE-----NMYFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLII 426
PS++ + + A +++PE N Y + +C+ Q SILG ++ +++
Sbjct: 365 PSVSLNFEGGASMVLKPEQYLMHNGYLDGAE--MWCIGFQKFQEGASILGDLVLKDKIVV 422
Query: 427 YDLNVPALRFGSENCA 442
YD+ + + + +C+
Sbjct: 423 YDIAQQRIGWANYDCS 438
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 162/383 (42%), Gaps = 50/383 (13%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
++ +V + GTP++ ++ DT S L W C+ IF+P AS TY++IPC
Sbjct: 63 HNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPC 118
Query: 153 DDPLCRS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
P C + P C K C + Y G + ETF R G P
Sbjct: 119 SSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETF----RVGSVTGPATV 174
Query: 205 FGCSNDNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK 260
FGC +SGF+ K +G++G N LS +Q+ R FSYC + + +++ V+
Sbjct: 175 FGCM--DSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRK---FSYC-ISDRDSSGVLL 228
Query: 261 FGRDADVRRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRDGT 314
G + + L TP++ +S P+F + L I + ++ P F G
Sbjct: 229 LGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGA 288
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRY----DQILRSLGRQRIPYNASQEFDYCYRYDSSF 370
G ++D+GT TF+ Y L Q + +LR L R + + D CY + +
Sbjct: 289 GQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGA--MDLCYLIEPTR 346
Query: 371 KAYPSM---TFHLQEADYIVQPENMYFIEPD--RGR---FCVAIQDDPKYSI----LGAW 418
A P++ + A+ V + + + P RG+ +C + I +G
Sbjct: 347 AALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHH 406
Query: 419 QQQNMLIIYDLNVPALRFGSENC 441
QQQN+ + YDL + F C
Sbjct: 407 QQQNVWMEYDLEKSRIGFAEVRC 429
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 160/363 (44%), Gaps = 48/363 (13%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
++YS + +G+P K L+ DT S L W +C PC + FD AS TY + C D
Sbjct: 2 VYYST-ITLGSPPKDFSLVMDTGSDLTWVRCDPCSP---DCSSTFDRLASNTYKALTCAD 57
Query: 155 PLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETF--AFPVRNGFTFVPRLAFGCSNDNS 212
Y+ Y G T+G S +T A + P FGC +
Sbjct: 58 D--------------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSLLK 103
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK----FGRDA--- 265
G G++ GIL + LS SQ+ + FSYCL+R+ S+ K FG A
Sbjct: 104 GL-ISGEV-GILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVEL 161
Query: 266 ----DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ ++L+ TPI S + ++ + L IS+G + P AF +D I D+
Sbjct: 162 KEPGSGKLQELQYTPIGESSI--YYTVRLDGISVGNQRLDLSPSAFLNGQDKP--TIFDS 217
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTFHL 380
GT +T + G ++ Q ++ + A + D C+R SS + P +TFH
Sbjct: 218 GTTLTMLPPGVCDSIKQSLASMVSG-----AEFVAIKGLDACFRVPPSSGQGLPDITFHF 272
Query: 381 Q-EADYIVQPENMYFIEPDRGRF-CVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
AD++ +P N Y I D G C+ + SI G QQQ+ +++D++ + F
Sbjct: 273 NGGADFVTRPSN-YVI--DLGSLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRRIGFKE 329
Query: 439 ENC 441
+C
Sbjct: 330 TDC 332
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 109/444 (24%), Positives = 168/444 (37%), Gaps = 45/444 (10%)
Query: 25 TSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMA-SMSKPNAFQEL 83
+SS+ T S+ L + P SP P + + ++ + RA+Y+ S N
Sbjct: 54 SSSDGTS-SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAG 112
Query: 84 ED---------IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---C 131
ED L + L Y + V +G+P Q ++ DT S + W QC+PC C
Sbjct: 113 EDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPC 172
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCRSPFKC--QNG-----KCVYTRRYHVGDVTRGLA 184
+FDP AS+TY+ C C NG +C Y +Y G T G
Sbjct: 173 HAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTY 232
Query: 185 SRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
S + +G V FGCS+ G K G++G SL SQ R
Sbjct: 233 SSDVLTL---SGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKS 289
Query: 245 FSYCLVREMEATSVIKFGRDADVRRRD---LETTPILLSDLRPHFYLHLLE-ISIGRHIV 300
FSYCL ++ + G A TTP+L S P +Y LE I++G +
Sbjct: 290 FSYCLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL 349
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF 360
P F G ++D+GT +T + Y L + + R P
Sbjct: 350 GLSPSVF------AAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAE-PLGI---L 399
Query: 361 DYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGA 417
D C+ + K + +V + + C+A +DD + +G
Sbjct: 400 DTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGN 455
Query: 418 WQQQNMLIIYDLNVPALRFGSENC 441
QQ+ ++YD+ F + C
Sbjct: 456 VQQRTFEVLYDVGGGVFGFRAGAC 479
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 112/411 (27%), Positives = 158/411 (38%), Gaps = 74/411 (18%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC----------IRCFDQTTPIFDPRASTT 146
Y IG P +P + DT S LVWTQC C CF Q P ++ S T
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 147 YSEIPCDD---PLCR---SPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVR 194
+PCDD LC C G CV Y G V G+ + F FP
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSS 196
Query: 195 NGFTFVPRLAFGCSNDN--SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--- 249
+ T LAFGC + S A G SGI+G LSL SQL N + FSYCL
Sbjct: 197 SSVT----LAFGCVSQTRISPGALNGA-SGIIGLGRGALSLVSQL-NATE--FSYCLTPY 248
Query: 250 -----------VREMEATSVIKFGRDADVRRRDLETTPILL----SDLRPHFYLHLLEIS 294
V + E + + T P S +YL L+ ++
Sbjct: 249 FRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLA 308
Query: 295 IGRHIVRFPPGAFDIMRDG----TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ 350
G V P GAFD+ GG +ID+G+P T + + ++ L + + LR G
Sbjct: 309 AGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSL 368
Query: 351 RIP-YNASQEFDYCYRY----DS-SFKAYPSMTFHLQEA----DYIVQPENMYFIEPDRG 400
P + C DS + A P + + +V P Y+ +
Sbjct: 369 VPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEAS 428
Query: 401 RFCVAIQDDP---------KYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+C+A+ + +I+G + QQ+M ++YDL L F NC+
Sbjct: 429 TWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/421 (23%), Positives = 170/421 (40%), Gaps = 49/421 (11%)
Query: 62 EISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLV 121
IS RA+ + A +L L + Y E+ +GTP K ++ DT S ++
Sbjct: 52 NISALRAHDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDIL 111
Query: 122 WTQCQPCIRC-----FDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQ------NGKCVY 170
W C C +C +DP+AS++ S + CD C + + + N C Y
Sbjct: 112 WVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEY 171
Query: 171 TRRYHVGDVTRGLASRETFAFPVRNGFTFV----PRLAFGCSNDNSGFAFGGK---ISGI 223
+ Y G T G + F G + FGC G G + GI
Sbjct: 172 SVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATITFGCGAQQGG-DLGNSNQALDGI 230
Query: 224 LGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVR-------RRDLET 274
LGF + S+ SQL + + +F++CL ++ + G + L
Sbjct: 231 LGFGQANTSMLSQLAAAGKAKKIFAHCL-DTIKGGGIFAIGNVVQPKCYFVFFFAHGLLN 289
Query: 275 TPILLSDL----RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
P+ L + RPH+ ++L I +G ++ P F+ G IID+GT +T++
Sbjct: 290 IPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVFETGE--KKGTIIDSGTTLTYL-- 345
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQEADYIVQP 389
P Q D + + I ++ Q+F C++Y S +P++TFH ++ +
Sbjct: 346 -PELVFKQVMDVVFSK--HRDIAFHNLQDF-LCFQYSGSVDDGFPTITFHFEDDLALHVY 401
Query: 390 ENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ YF +CV Q D ++G N L++YDL + + NC+
Sbjct: 402 PHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENQVIGWTDYNCS 461
Query: 443 N 443
+
Sbjct: 462 S 462
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 157/362 (43%), Gaps = 54/362 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC---FDQTTPI--FDPRASTTYSEI 150
Y +V +GTP + +L DT S L+W C PCI C D PI +D +AS + S++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 151 PCDDPLCR-----SPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
PC DP C S C + +C Y+ +Y G T G + + V T +
Sbjct: 95 PCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVI---- 150
Query: 205 FGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIK 260
FGC SG + + GI+GF AS LS +SQL + + +F++CL ++
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILV 210
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF--DIMRDGTGGFI 318
G +V D++ TP++ H+ + L IS+ + P F D+M+ G I
Sbjct: 211 LG---NVIEPDIQYTPLV--PYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQ----GTI 261
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF--KAYPSM 376
D+GT + ++ + YQ Q ++ F C S F K +P++
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQAVSLVV-------------APFLLCDTRLSRFIYKLFPNV 308
Query: 377 TFHLQEADYIVQPENMYFIEPDRGR---FCVAIQ------DDPKYSILGAWQQQNMLIIY 427
+ + A + P + +C+ Q + +Y+I G +N L++Y
Sbjct: 309 VLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVY 368
Query: 428 DL 429
DL
Sbjct: 369 DL 370
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 157/362 (43%), Gaps = 54/362 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC---FDQTTPI--FDPRASTTYSEI 150
Y +V +GTP + +L DT S L+W C PCI C D PI +D +AS + S++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 151 PCDDPLCR-----SPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
PC DP C S C + +C Y+ +Y G T G + + V T +
Sbjct: 95 PCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVI---- 150
Query: 205 FGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIK 260
FGC SG + + GI+GF AS LS +SQL + + +F++CL ++
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILV 210
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF--DIMRDGTGGFI 318
G +V D++ TP++ H+ + L IS+ + P F D+M+ G I
Sbjct: 211 LG---NVIEPDIQYTPLV--PYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQ----GTI 261
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF--KAYPSM 376
D+GT + ++ + YQ Q ++ F C S F K +P++
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQAVSLVV-------------APFLLCDTRLSRFIYKLFPNV 308
Query: 377 TFHLQEADYIVQPENMYFIEPDRGR---FCVAIQ------DDPKYSILGAWQQQNMLIIY 427
+ + A + P + +C+ Q + +Y+I G +N L++Y
Sbjct: 309 VLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVY 368
Query: 428 DL 429
DL
Sbjct: 369 DL 370
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 152/367 (41%), Gaps = 31/367 (8%)
Query: 95 LFYSVEVNIGTP-MKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPC 152
L Y + V +G+P K Q +L DT S + W +C+PC + C Q P+FDP S+TYS C
Sbjct: 138 LEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSC 197
Query: 153 DDPLCRSPFKCQN-------GKCVYTRRYHVGDV-TRGLASRETFAFPVRNGFTFVPRLA 204
C F+ N G+C Y Y G V T G S +T A + V +
Sbjct: 198 SSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFR 257
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI-QGLFSYCLVREMEATSVIKFGR 263
FGCS+ +G +G++G SL SQ FSYCL ++ + G
Sbjct: 258 FGCSHAETGITG--LTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSSSGFLTLGA 315
Query: 264 DADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
A TP+L S P FY + L I +G + P F + G I+D+G
Sbjct: 316 -AGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVF------SAGMIMDSG 368
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCYRYDS-SFKAYPSMTFHL 380
T VT + Y +L + ++ P +A F D C+ S + P++
Sbjct: 369 TVVTRLPPTAYSSLSSAFKAGMKQY--PPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVF 426
Query: 381 QEADYIV---QPENMYFIEPDRGRFC---VAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
A V + FC VA DD I+G QQ+ ++YD+ A+
Sbjct: 427 SGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAV 486
Query: 435 RFGSENC 441
F + C
Sbjct: 487 GFKAGAC 493
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 142/361 (39%), Gaps = 36/361 (9%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPC 152
L Y + V +GTP Q + DT S + W QC PC C QT +FDP S+TY + C
Sbjct: 125 LEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSC 184
Query: 153 DDPLCRSPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C + NG +C Y +Y G T G SR+T V FG
Sbjct: 185 AAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTL--SGASDAVKGFQFG 242
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
CS+ SGF+ + G++G SL SQ FSYCL + S
Sbjct: 243 CSHLESGFS--DQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL--PPTSGSSGFLTLGGG 298
Query: 267 VRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
TT +L S P FY L +I++G + P F G ++D+GT +
Sbjct: 299 GGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVF------AAGSVVDSGTII 352
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEAD 384
T + Y L + + +Q A D C+ + + + P++
Sbjct: 353 TRLPPTAYSALSSAFKAGM----KQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGA 408
Query: 385 YI-VQPENMYFIEPDRGRFCV---AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
I + P + + C+ A DD I+G QQ+ ++YD+ L F S
Sbjct: 409 AIDLDPNGIMYGN------CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462
Query: 441 C 441
C
Sbjct: 463 C 463
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 164/376 (43%), Gaps = 42/376 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y+ +V +GTP + + DT S ++W C C C FD S+T + +
Sbjct: 83 LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALV 142
Query: 151 PCDDPLCRSPF-----KC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
PC DP+C S +C Q +C YT +Y G T G+ + F + G + +
Sbjct: 143 PCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANV 202
Query: 204 A------FGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREM 253
A FGCS SG + GILGF LS+ SQL +R +FS+CL +
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDG 262
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
++ G ++ + +P++ S +PH+ L+L I++ ++ P F
Sbjct: 263 NGGGILVLG---EILEPSIVYSPLVPS--QPHYNLNLQSIAVNGQVLSINPAVF--ATSD 315
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
G IID+GT ++++ Y L+ D + I SQ + D SF
Sbjct: 316 KRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFIS-KGSQCYLVLTSIDDSF--- 371
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRG------RFCVAIQD-DPKYSILGAWQQQNMLII 426
P+++F+ + + + Y + +RG +C+ Q +ILG ++ +++
Sbjct: 372 PTVSFNFEGGASMDLKPSQYLL--NRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVV 429
Query: 427 YDLNVPALRFGSENCA 442
YDL + + + +C+
Sbjct: 430 YDLARQQIGWTNYDCS 445
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 149/368 (40%), Gaps = 34/368 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPCDD 154
Y V V +GTP + ++FDT S L W QC PC C+ Q P+F P S+T+S + C
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGA 213
Query: 155 PLCRSPFKC----QNGKCVYTRRY--------HVGDVTRGLASRETFAFPVRNGFTFVPR 202
CR+ C + +C Y Y H+G+ T L + N +P
Sbjct: 214 RECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAEND-NKLPG 272
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKF 261
FGC +N+G G+ G+ G +SLSSQ + FSYCL A +
Sbjct: 273 FVFGCGENNTGLF--GQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYLSL 330
Query: 262 GRDADVRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
G + TP+L P F Y+ L+ I + +R + I+D
Sbjct: 331 GTPVPAPAH-AQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP------LIVD 383
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA---YPSMT 377
+GT +T + Y+ L + + G +R P + D CY + + A P++
Sbjct: 384 SGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLS--ILDTCYDFTAHANATVSIPAVA 441
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPAL 434
I + + C+A D ILG QQ+ + ++YD+ +
Sbjct: 442 LVFAGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKI 501
Query: 435 RFGSENCA 442
F ++ C+
Sbjct: 502 GFAAKGCS 509
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 144/360 (40%), Gaps = 32/360 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYS------- 148
Y V V +GTP K L+FDT S + WTQCQPC R C+ Q IFDP ST+Y+
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSS 208
Query: 149 -EIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
+P C + CVY +Y + G E + F + FGC
Sbjct: 209 ICNSLTSATGNTP-GCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAFN---NIYFGC 264
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
+N G G LG + LS+ SQ + +FSYCL +T + FG A
Sbjct: 265 GQNNQGLFGGSAGLLGLGRD--KLSVVSQTAQKYNKIFSYCLPSSSSSTGFLTFGGSA-- 320
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
++ + TP+ P FY L IS+G + F T G IID+GT +T
Sbjct: 321 -SKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFS-----TAGAIIDSGTVIT 374
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEADY 385
+ Y L + R+L + A D CY + S + + P + F
Sbjct: 375 RLPPAAYSALRASF----RNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGIE 430
Query: 386 IVQPENMYFIEPDRGRFCVAIQDDPKYS---ILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ + C+A + + I G QQ+ + + YD + + F C+
Sbjct: 431 VDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGCS 490
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 143/342 (41%), Gaps = 42/342 (12%)
Query: 106 PMKPQHLLFD-TASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQ 164
P PQ +L + S+ WTQC+PC+RC + FDP AS TYS C P
Sbjct: 83 PPSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSC------IPSTVG 136
Query: 165 NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGIL 224
N Y Y + G +T + F P+ FGC +N G FG G+L
Sbjct: 137 N---TYNMTYGDKSTSVGNYGCDTMTLEPSDVF---PKFQFGCGRNNEG-DFGSGADGML 189
Query: 225 GFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPIL----LS 280
G LS SQ ++ + +FSYCL E S++ FG A + L+ T ++ S
Sbjct: 190 GLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLL-FGEKA-TSQSSLKFTSLVNGPGTS 247
Query: 281 DLRP--HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQ 338
L ++++ LL+IS+G + P F + G IID+GT +T + Y L
Sbjct: 248 GLEESGYYFVKLLDISVGNKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYSALTA 302
Query: 339 RYDQILR----SLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMY 393
+ + + S GR++ D CY P + H E +
Sbjct: 303 AFKKAMAKYPLSNGRRK----KGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRV 358
Query: 394 FIEPDRGRFCVAIQDDPK------YSILGAWQQQNMLIIYDL 429
D R C+A + K +I+G QQ ++ ++YD+
Sbjct: 359 IWGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDI 400
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 143/361 (39%), Gaps = 36/361 (9%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--CFDQTTPIFDPRASTTYSEIPC 152
L Y + V +GTP Q + DT S + W QC PC C+ QT +FDP S+TY + C
Sbjct: 125 LEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSC 184
Query: 153 DDPLCRSPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C + NG +C Y +Y G T G SR+T V FG
Sbjct: 185 AAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTL--SGASDAVKGFQFG 242
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
CS+ SGF+ + G++G SL SQ FSYCL + S
Sbjct: 243 CSHVESGFS--DQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL--PPTSGSSGFLTLGGG 298
Query: 267 VRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
TT +L S P FY L +I++G + P F G ++D+GT +
Sbjct: 299 GGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVF------AAGSVVDSGTII 352
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEAD 384
T + Y L + + +Q A D C+ + + + P++
Sbjct: 353 TRLPPTAYSALSSAFKAGM----KQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGA 408
Query: 385 YI-VQPENMYFIEPDRGRFCV---AIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
I + P + + C+ A DD I+G QQ+ ++YD+ L F S
Sbjct: 409 AIDLDPNGIMYGN------CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462
Query: 441 C 441
C
Sbjct: 463 C 463
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 160/395 (40%), Gaps = 66/395 (16%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP---CIRC----FDQTT-PIFDPRASTTYS 148
YS+++N+GTP + + DT SSLVW C C C D T P F P+ S+T
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147
Query: 149 EIPCDDPLCRSPF------KCQNGK------C-----VYTRRYHVGDVTRGLASRETFAF 191
+ C +P C F +C K C Y +Y +G T G + F
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLG-ATAGFLLLDNLNF 206
Query: 192 PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR 251
P + VP+ GCS + SGI GF SL SQ+ + FSYCLV
Sbjct: 207 PGKT----VPQFLVGCS-----ILSIRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVS 254
Query: 252 E------MEATSVIKFGRDADVRRRDLETTPILL-----SDLRPHFYLHLLEISIGRHIV 300
+ V++ D + L TP S R ++Y+ L ++ +G V
Sbjct: 255 HRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDV 314
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ---RIPYNAS 357
+ P + DG GG I+D+G+ TF+ Y + Q + LR LG++ A
Sbjct: 315 KIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEF---LRQLGKKYSREENVEAQ 371
Query: 358 QEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMYF-IEPDRGRFCVAIQDD-----P 410
C+ ++P TF + + QP YF D C + D P
Sbjct: 372 SGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQP 431
Query: 411 KYS----ILGAWQQQNMLIIYDLNVPALRFGSENC 441
K + ILG +QQQN + YDL FG NC
Sbjct: 432 KTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|357116104|ref|XP_003559824.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 489
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 109/416 (26%), Positives = 157/416 (37%), Gaps = 76/416 (18%)
Query: 97 YSVEVNIGTPMKPQH---LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
YSV V +G+ QH L D +L W QC P Q PIFDP+ S Y + D
Sbjct: 74 YSVRVGVGS-GDTQHFYRLAVDMVGNLTWMQCLPSNPKLKQDAPIFDPKTSHRYKNVGHD 132
Query: 154 DPLCRSPF--KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG--FTFVPRLAFGCSN 209
DPLC++PF + +C + R+ + G ++ FAF +G T V L FGC++
Sbjct: 133 DPLCKAPFTPRPTEHRCGFNIRFRAEAMATGYLGKDEFAFGAGSGSRTTNVDGLVFGCAH 192
Query: 210 DNSGF--------------------------AFGGKI--------------------SGI 223
+G+ GG + +GI
Sbjct: 193 RINGWNNKDVLAGIPSLNRRPTSFVRQLSTHGGGGAVDGLVFGCAHAINGWKNQDVLAGI 252
Query: 224 LGFNASPLSLSSQLRNRIQGL---FSYCLVREME---ATSVIKFGRDADVRRRDLETTPI 277
L N P S QL G FSYCLV + ++FG D ++T +
Sbjct: 253 LSLNRRPTSFVRQLSVHGGGTTPRFSYCLVDHKKYPNKHGFLRFGADVP-DHSHAQSTAL 311
Query: 278 LLSDLRPHF---YLHLLEISI-GRHIVRFPPGAFD-IMRDGTGGFIIDTGTPVTFIRNGP 332
L + F Y+ L+ +S+ GR + P F R GG +D G P T P
Sbjct: 312 LYGEPDGGFGMYYVRLVGVSVAGRKLTGITPKMFQRDRRSRLGGCYVDVGNPTTRFAEAP 371
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSS--FKAYPSMTFHLQE---ADYIV 387
Y L + S G R P + C R S PS+T H E A +
Sbjct: 372 YDILEAGVAAHMASHGLHRTPVPGHR---LCVRGTSPEVMPKLPSITLHFAEDEAAGLEI 428
Query: 388 QPENMYFIEPDRGR--FCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ ++ G C +Q P +++G QQ + +DL L F E+C
Sbjct: 429 KSRLLFATVKHAGADYVCFIVQRAPVTTVIGGHQQVDTRFTFDLEENRLFFAPEDC 484
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 70/229 (30%), Positives = 106/229 (46%), Gaps = 31/229 (13%)
Query: 93 QDLFYSVEVNIG----TPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
Q L Y +++G +P ++ DT S L W QC+PC C+ Q P+FDP S TY+
Sbjct: 88 QTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYA 147
Query: 149 EIPCDDPLCRSPFKCQNG-------------KCVYTRRYHVGDVTRGLASRETFAFPVRN 195
+ C+ C + G KC Y Y G +RG+ + +T A
Sbjct: 148 AVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---- 203
Query: 196 GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREM 253
G + FGC N G FGG +G++G + LSL SQ +R G+FSYCL
Sbjct: 204 GGASLGGFVFGCGLSNRGL-FGG-TAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSG 261
Query: 254 EATSVIKFGRDADVRRRDLETTPI----LLSD-LRPHFY-LHLLEISIG 296
+A+ + G D TTP+ +++D +P FY L++ ++G
Sbjct: 262 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVG 310
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 51/373 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSE------ 149
Y V++ +GTP K ++ DT SSL W QCQPC I C Q PIF P S TY
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSS 166
Query: 150 -------IPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+ P C + G CVY Y + G S++ T P
Sbjct: 167 QCSSLKSSTLNAPGCSN----ATGACVYKASYGDTSFSIGYLSQDV--------LTLTPS 214
Query: 203 LA------FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA- 255
A +GC DN G G+ +GI+G LS+ QL N+ FSYCL A
Sbjct: 215 AAPSSGFVYGCGQDNQGLF--GRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQ 272
Query: 256 --TSVIKFGRDADVRRRDL--ETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIM 310
+SV F + TP++ + P Y L L I++ + ++++
Sbjct: 273 PNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVP 332
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD-SS 369
IID+GT +T + Y L + + I+ Q ++ D C++
Sbjct: 333 T------IIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSI---LDTCFKGSVKE 383
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-YSILGAWQQQNMLIIYD 428
P + + + + +E ++G C+AI SI+G +QQQ + YD
Sbjct: 384 MSTVPEIRIIFRGGAGLELKVHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYD 443
Query: 429 LNVPALRFGSENC 441
+ + F C
Sbjct: 444 VANSKIGFAPGGC 456
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 165/381 (43%), Gaps = 53/381 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y ++ +GTP + ++ DT S ++W C C C QT+ + FDP +S T S
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGC-PQTSGLQIQLNFFDPGSSVTASP 138
Query: 150 IPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
I C D C S QN C YT +Y G T G + F + G + VP
Sbjct: 139 ISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPN 198
Query: 203 ----LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYCLVRE 252
+ FGCS +G + GI GF +S+ SQL + QG+ FS+CL E
Sbjct: 199 STAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLAS--QGIAPRVFSHCLKGE 256
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ ++ TP++ S +PH+ ++LL IS+ + P F +
Sbjct: 257 NGGGGILVLG---EIVEPNMVFTPLVPS--QPHYNVNLLSISVNGQALPINPSVFS-TSN 310
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-K 371
G G IIDTGT + ++ Y ++ + R + + + CY +S
Sbjct: 311 GQ-GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV-----SKGNQCYVITTSVGD 364
Query: 372 AYPSMTFH--------LQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQ 421
+P ++ + L DY++Q N+ +C+ Q + +ILG +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNV----GGTAVWCIGFQRIQNQGITILGDLVLK 420
Query: 422 NMLIIYDLNVPALRFGSENCA 442
+ + +YDL + + + +C+
Sbjct: 421 DKIFVYDLVGQRIGWANYDCS 441
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 163/381 (42%), Gaps = 52/381 (13%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI--FDPRASTTYSE 149
K + V + IGTP +PQ ++ DT S L W QC ++T P FDP S+++
Sbjct: 83 KYSMALVVTLPIGTPPQPQQMVLDTGSQLSWIQCH------NKTPPTASFDPSLSSSFYV 136
Query: 150 IPCDDPLCRS-------PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFV 200
+PC PLC+ P C QN C Y+ Y G G RE AF P +
Sbjct: 137 LPCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQT----T 192
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA----- 255
P L GCS+++ GILG N LS Q + FSYC+ A
Sbjct: 193 PPLILGCSSESR------DARGILGMNLGRLSFPFQAK---VTKFSYCVPTRQPANNNNF 243
Query: 256 -TSVIKFGRD---ADVRRRDLETTP--ILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFD 308
T G + A R + T P + +L P Y + + I IG + PP F
Sbjct: 244 PTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFR 303
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYD 367
G+G ++D+G+ TF+ + Y + + +I+R LG R + Y D C+ +
Sbjct: 304 PNAGGSGQTMVDSGSEFTFLVDVAYDRVRE---EIIRVLGPRVKKGYVYGGVADMCFDGN 360
Query: 368 SS--FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY----SILGAWQQQ 421
+ + + F ++ IV P+ + G CV I + +I+G + QQ
Sbjct: 361 AMEIGRLLGDVAFEFEKGVEIVVPKERVLADVGGGVHCVGIGRSERLGAASNIIGNFHQQ 420
Query: 422 NMLIIYDLNVPALRFGSENCA 442
N+ + +DL + FG +C+
Sbjct: 421 NLWVEFDLANRRIGFGVADCS 441
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 169/383 (44%), Gaps = 33/383 (8%)
Query: 83 LEDIHLPMAKQ--DL-FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT--- 136
L+ I P+ DL Y E+ +G P++ ++ DT S ++W +C PC C +
Sbjct: 66 LQGISFPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIP 125
Query: 137 --PIFDPRASTTYSEIPCDDPLCRS-PFKC----QNGKCVYTRRYHVGDVTRGLASRETF 189
I++ AS+T S C DPLC C N C Y Y + G R+
Sbjct: 126 PLSIYNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDM 185
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSY 247
+ + G R+ FGC+ + +G + GI+GF ++ +Q+ + + +FS+
Sbjct: 186 HYVLHGGNATTSRIFFGCATNITG---SWPVDGIMGFGLISKTVPNQIATQRNMSRVFSH 242
Query: 248 CLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
CL E +++FG + ++ TP+L ++ H+ + LL IS+ ++ P F
Sbjct: 243 CLGGEKHGGGILEFGEAPNT--TEMVFTPLL--NVTTHYNVDLLSISVNSKVLPIDPKEF 298
Query: 308 DIMRDGTG--GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI-PYNASQEFDYCY 364
+R+ T G IID+GT + + L Q ++SL ++ P E Y
Sbjct: 299 SYVRNSTNNTGVIIDSGTTFVLLTTKANRMLFQE----IKSLTTAKLGPKLEGLECFYLK 354
Query: 365 RYDSSFKAYPSMTFHLQEADYI-VQPEN---MYFIEPDRGRFCVAIQDDPKYSILGAWQQ 420
+ ++P++T + ++P+N M + R +C A +I G
Sbjct: 355 SGLTMETSFPNVTLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADGLTIFGEIVL 414
Query: 421 QNMLIIYDLNVPALRFGSENCAN 443
++ L+ YD+ + + +NC++
Sbjct: 415 KDKLVFYDVENRRIGWKGQNCSS 437
>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
Length = 392
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/162 (40%), Positives = 78/162 (48%), Gaps = 14/162 (8%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q + Y IGTP +P + D A LVWTQC+ C RCF+Q TP+FDP AS TY PC
Sbjct: 47 QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPC 106
Query: 153 DDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC S C C Y + GD T G +TFA T LAFGC
Sbjct: 107 GTPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAV-----GTAKASLAFGCV 160
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
+ GG SGI+G +P SL +Q FSYCL
Sbjct: 161 VASDIDTMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLA 198
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 158/370 (42%), Gaps = 45/370 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-FD-QTTPIFDPRASTTYSEIPCDD 154
Y +E++IGTP + + DT S LVW +C C C D IF AS++Y ++PC+
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 155 PLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGF----TFVPRLAF 205
C G C Y Y G T G + +F +F F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 206 GCSNDNSG-FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSVIKF 261
GC+ G + F G++G SL QL +++ FSYCLV A S +
Sbjct: 125 GCARKLKGDWNF---TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFL 181
Query: 262 GRDADVRRRDLETTPILLSDL--RPHFYLHLLEISIGRHIVRFPPGAFDI---MRDGTGG 316
G A +R D+ +TPIL D + +Y+ L I+IG P +D G
Sbjct: 182 GSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIG----GVPVVVYDKESGHNTSVGP 237
Query: 317 F-----IIDTGTPVTFIRNGPYQTLMQRYDQ--ILRSLGRQRIPYNASQEFDYCYRY--D 367
F +ID+GT T + Y+ + + ++ IL +LG S D C+ D
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGN-------SAGLDLCFNSSGD 290
Query: 368 SSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLII 426
+S+ +PS+TF+ +V P F R C+++ SI+G QQQN I+
Sbjct: 291 TSY-GFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHIL 349
Query: 427 YDLNVPALRF 436
YDL + F
Sbjct: 350 YDLVASQISF 359
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 166/379 (43%), Gaps = 46/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y ++ IGTP K +L DT + ++W C C C +++ + S++ +
Sbjct: 72 LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLV 131
Query: 151 PCDDPLCRSP-----FKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FT 198
PCD LC+ C N C Y Y G T G ++ F +G +
Sbjct: 132 PCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTAS 191
Query: 199 FVPRLAFGCSNDNSG---FAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREM 253
+ FGC SG ++ + GILGF + S+ SQL + +++ +F++CL +
Sbjct: 192 ANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL-NGV 250
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ G V + + TTP+L +PH+ +++ I +G + A + RD
Sbjct: 251 NGGGIFAIGH---VVQPTVNTTPLLPD--QPHYSVNMTAIQVGHTFLNLSTDASE-QRD- 303
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KA 372
+ G IID+GT + ++ +G YQ L+ + +L Q + E+ C++Y S
Sbjct: 304 SKGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTL----HDEYT-CFQYSGSVDDG 358
Query: 373 YPSMTFHLQEA--------DYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNML 424
+P++TF+ + DY+ EN++ I +D ++LG N L
Sbjct: 359 FPNVTFYFENGLSLKVYPHDYLFLSENLWCIGWQNSG--AQSRDSKNMTLLGDLVLSNKL 416
Query: 425 IIYDLNVPALRFGSENCAN 443
+ YDL + + NC++
Sbjct: 417 VFYDLENQVIGWTEYNCSS 435
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/423 (26%), Positives = 176/423 (41%), Gaps = 57/423 (13%)
Query: 49 GNLSQSERIHKMFEISKARAN--YMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTP 106
G + ER+ + +S+ + MA +D+ + + Y IG+P
Sbjct: 44 GGYTTEERVLRAVAVSRQQQQQRLMAGAE--------DDVSAQVHRATRQYIASYLIGSP 95
Query: 107 MKPQHLLFDTASSLVWTQCQP-CI--RCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKC 163
+ L DT S L+WTQC C+ C Q P ++ S+T+ +PC D ++ F
Sbjct: 96 PQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCAD---KAGFCA 152
Query: 164 QNG--------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC---SNDNS 212
NG C + Y G V L + E+FAF +G T LAFGC + S
Sbjct: 153 ANGVHLCGLDGSCTFIASYGAGRVIGSLGT-ESFAF--ESGTT---SLAFGCVSLTRITS 206
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK--FGRDADVRRR 270
G SG++G LSL SQ+ FSYCL ++ F +
Sbjct: 207 GAL--NDASGLIGLGRGRLSLVSQIGAT---RFSYCLTPYFHSSGASSHLFVGASASLGG 261
Query: 271 DLETTPILLSD----LRPHFYLHLLEISIGRHIVRFPP---GAFDIMR--DG--TGGFII 319
+ P + S +YL L I++G+ R P F + + G GG II
Sbjct: 262 GGASMPFVKSPKDYPYSTFYYLPLEGITVGK--TRLPAVNSTTFQLRQLFKGYWAGGVII 319
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR-IPYNASQEFDYCYRYDSSFKAYPSMTF 378
DTG+P+T + + Y+ L +++ LG +P + C + K P++ F
Sbjct: 320 DTGSPLTQLASHAYEALK---EEVAAQLGNGSLVPAPEDSGLELCVAREGFQKVVPALVF 376
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
H + P Y+ D+ C+ I + SI+G +QQQ+M ++YDL F +
Sbjct: 377 HFGGGADMAVPAASYWAPVDKAAACMMILEGGYDSIIGNFQQQDMHLLYDLRRGRFSFQT 436
Query: 439 ENC 441
+C
Sbjct: 437 ADC 439
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 163/373 (43%), Gaps = 39/373 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +G+P + ++ DT S ++W C C C FD +S+T ++
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQV 124
Query: 151 PCDDPLCRSPFKC-------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR- 202
C DP+C S + Q +C YT +Y G T G +T F G + +
Sbjct: 125 RCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNS 184
Query: 203 ---LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEA 255
+ FGCS SG + GI GF LS+ SQL R +FS+CL +
Sbjct: 185 SALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSG 244
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
++ G ++ + +P++ S +PH+ L+LL I++ ++ P AF +
Sbjct: 245 GGILVLG---EILEPGIVYSPLVPS--QPHYNLNLLSIAVNGQLLPIDPAAF--ATSNSQ 297
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYP 374
G I+D+GT + ++ Y + + I+ S I +Q CY +S + +P
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNAIV-SPSVTPITSKGNQ----CYLVSTSVSQMFP 352
Query: 375 SMTFHLQ-EADYIVQPENMYFI----EPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDL 429
+F+ A +++PE+ Y I +C+ Q +ILG ++ + +YDL
Sbjct: 353 LASFNFAGGASMVLKPED-YLIPFGSSGGSAMWCIGFQKVQGVTILGDLVLKDKIFVYDL 411
Query: 430 NVPALRFGSENCA 442
+ + + +C+
Sbjct: 412 VRQRIGWANYDCS 424
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 151/372 (40%), Gaps = 34/372 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP--CDD 154
Y V +GTP +P L+ DT S L W +C+ P + RAS + S P C
Sbjct: 14 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSS 73
Query: 155 PLCRS--PFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-------- 199
C S PF N C Y RY G RG+ + + +
Sbjct: 74 DTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGR 133
Query: 200 ---VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM--- 253
+ + GC+ G +F G+L S +S +S+ R G FSYCLV +
Sbjct: 134 RAKLQGVVLGCTATYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPR 192
Query: 254 EATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
A+S + FG + TP++L + P + + + + + + P +D+ R
Sbjct: 193 NASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGRG 252
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
GG I+D+GT +T + Y+ ++ L +L P A F+YCY + +
Sbjct: 253 --GGAILDSGTSLTVLATPAYRAVVAALGGRLAAL-----PRVAMDPFEYCYNWTAGAPE 305
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDLN 430
P + + + P Y I+ G C+ +Q+ P S++G QQ L +DL
Sbjct: 306 IPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLR 365
Query: 431 VPALRFGSENCA 442
LRF CA
Sbjct: 366 DRWLRFKHTRCA 377
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 165/381 (43%), Gaps = 53/381 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y ++ +GTP + ++ DT S ++W C C C QT+ + FDP +S T S
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGC-PQTSGLQIQLNFFDPGSSVTASP 138
Query: 150 IPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
I C D C S QN C YT +Y G T G + F + G + VP
Sbjct: 139 ISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPN 198
Query: 203 ----LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYCLVRE 252
+ FGCS +G + GI GF +S+ SQL + QG+ FS+CL E
Sbjct: 199 STAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLAS--QGIAPRVFSHCLKGE 256
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ ++ TP++ S +PH+ ++LL IS+ + P F +
Sbjct: 257 NGGGGILVLG---EIVEPNMVFTPLVPS--QPHYNVNLLSISVNGQALPINPSVFS-TSN 310
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-K 371
G G IIDTGT + ++ Y ++ + R + + + CY +S
Sbjct: 311 GQ-GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV-----SKGNQCYVITTSVGD 364
Query: 372 AYPSMTFH--------LQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQ 421
+P ++ + L DY++Q N+ +C+ Q + +ILG +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNV----GGTAVWCIGFQRIQNQGITILGDLVLK 420
Query: 422 NMLIIYDLNVPALRFGSENCA 442
+ + +YDL + + + +C+
Sbjct: 421 DKIFVYDLVGQRIGWANYDCS 441
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 27/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP K ++ DT SSL W QC PC + C Q+ P+F+P+AS++Y+ + C
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 156 LCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C SP C C+Y Y + G S++T +F G T VP +GC
Sbjct: 189 QCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF----GSTSVPNFYYGC 244
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL ++S +
Sbjct: 245 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 302
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ TP+ S L Y + + I + + A+ + IID+GT +T
Sbjct: 303 GQYSY--TPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPT-----IIDSGTVIT 355
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ G Y L + ++ R +A D C++ ++ P +T +
Sbjct: 356 RLPTGVYSALSKAVAGAMKGTPRA----SAFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 161/373 (43%), Gaps = 40/373 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y V +GTP + ++ DT S ++W C C C QT+ + FD +S+T
Sbjct: 80 LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNC-PQTSGLGIQLNYFDTTSSSTARL 138
Query: 150 IPCDDPLCRSPFKC-------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+PC P+C S + Q+ +C Y +Y G T G +TF F G + +
Sbjct: 139 VPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIAN 198
Query: 203 ----LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREME 254
+ FGCS SG + GI GF LS+ SQL + +FS+CL E
Sbjct: 199 SSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDS 258
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
++ G ++ + +P++ S +PH+ L L I++ ++ P AF +
Sbjct: 259 GGGILVLG---EILEPGIVYSPLVPS--QPHYNLDLQSIAVSGQLLPIDPAAFATSSN-- 311
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAY 373
G IIDTGT + ++ Y + + L I + + CY +S + +
Sbjct: 312 RGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTI-----NKGNQCYLVSNSVSEVF 366
Query: 374 PSMTFHLQ-EADYIVQPEN--MYFIE-PDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYD 428
P ++F+ A +++PE MY +C+ Q +ILG ++ + +YD
Sbjct: 367 PPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYD 426
Query: 429 LNVPALRFGSENC 441
L + + + +C
Sbjct: 427 LAHQRIGWANYDC 439
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 126/484 (26%), Positives = 198/484 (40%), Gaps = 79/484 (16%)
Query: 9 LAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFS--PESPLYPGNLSQSERIHKMFEISKA 66
L + S+ S++ +T F+SS +L L P+F+ P S +P + S
Sbjct: 7 LFSLLSFLSII-ITTFSSSTPNTITLHLSPLFTNHPSSSSHP-----FHTLKLAVSTSIT 60
Query: 67 RANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
RA+++ + KPN + LE P K YS+++ GTP + + DT S+LVW C
Sbjct: 61 RAHHLKNH-KPN--KSLETPVHP--KTYGGYSIDLEFGTPSQTFPFVLDTGSTLVWLPCS 115
Query: 127 P---CIRCFD-QTTPIFDPRASTTYSEIPCDDPLCRSPF-----------------KCQN 165
C +C TP F P+ S++ + C +P C F C
Sbjct: 116 SHYLCSKCNSFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQ 175
Query: 166 GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILG 225
YT +Y +G T G E FP + F+ GCS + +GI G
Sbjct: 176 TCPAYTVQYGLGS-TAGFLLSENLNFPTKKYSDFL----LGCS-----VVSVYQPAGIAG 225
Query: 226 FNASPLSLSSQLRNRIQGLFSYCLVRE-------MEATSVIKFGRDADVRRRDLETTPIL 278
F SL SQ+ FSYCL+ + + V++ D + + TP L
Sbjct: 226 FGRGEESLPSQMN---LTRFSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFL 282
Query: 279 LS-------DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+ ++Y+ L I +G VR P + DG GGFI+D+G+ TF+
Sbjct: 283 KNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERP 342
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQEFD----YCYRYDSSFKAYPSMTFHLQEADYIV 387
+ + Q + + + S R R A ++F + + ++P + F + +
Sbjct: 343 IFDLVAQEFAKQV-SYTRAR---EAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMR 398
Query: 388 QPENMYFIEPDRGRF-CVAI-QDDPKYS--------ILGAWQQQNMLIIYDLNVPALRFG 437
P YF +G C+ I DD S ILG +QQQN + YDL F
Sbjct: 399 LPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFR 458
Query: 438 SENC 441
S++C
Sbjct: 459 SQSC 462
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 151/372 (40%), Gaps = 34/372 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP--CDD 154
Y V +GTP +P L+ DT S L W +C+ P + RAS + S P C
Sbjct: 105 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSS 164
Query: 155 PLCRS--PFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF-------- 199
C S PF N C Y RY G RG+ + + +
Sbjct: 165 DTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGR 224
Query: 200 ---VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM--- 253
+ + GC+ G +F G+L S +S +S+ R G FSYCLV +
Sbjct: 225 RAKLQGVVLGCTATYDGQSFQSS-DGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPR 283
Query: 254 EATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
A+S + FG + TP++L + P + + + + + + P +D+ R
Sbjct: 284 NASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGRG 343
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
GG I+D+GT +T + Y+ ++ L +L P A F+YCY + +
Sbjct: 344 --GGAILDSGTSLTVLATPAYRAVVAALGGRLAAL-----PRVAMDPFEYCYNWTAGAPE 396
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDLN 430
P + + + P Y I+ G C+ +Q+ P S++G QQ L +DL
Sbjct: 397 IPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLR 456
Query: 431 VPALRFGSENCA 442
LRF CA
Sbjct: 457 DRWLRFKHTRCA 468
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 157/370 (42%), Gaps = 45/370 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-FD-QTTPIFDPRASTTYSEIPCDD 154
Y +E++IGTP + + DT S LVW +C C C D IF AS++Y ++PC+
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 155 PLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGF----TFVPRLAF 205
C G C Y Y G T G + +F +F F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 206 GCSNDNSG-FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSVIKF 261
GC G + F G++G SL QL +++ FSYCLV A S +
Sbjct: 125 GCGRKLKGDWNF---TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFL 181
Query: 262 GRDADVRRRDLETTPILLSDL--RPHFYLHLLEISIGRHIVRFPPGAFDI---MRDGTGG 316
G A +R D+ +TPIL D + +Y+ L I++G P +D G
Sbjct: 182 GSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVG----GVPVVVYDKESGHNTSVGP 237
Query: 317 F-----IIDTGTPVTFIRNGPYQTLMQRYDQ--ILRSLGRQRIPYNASQEFDYCYRY--D 367
F +ID+GT T + Y+ + + ++ IL +LG S D C+ D
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGN-------SAGLDLCFNSSGD 290
Query: 368 SSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD-PKYSILGAWQQQNMLII 426
+S+ +PS+TF+ +V P F R C+++ SI+G QQQN I+
Sbjct: 291 TSY-GFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHIL 349
Query: 427 YDLNVPALRF 436
YDL + F
Sbjct: 350 YDLVASQISF 359
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 181/421 (42%), Gaps = 48/421 (11%)
Query: 28 ESTGFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNAFQELEDI 86
+ G +L++I +FSP SP P LS E + +M R ++ S+ +
Sbjct: 25 QDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSI------ 78
Query: 87 HLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
+P+A Q Y V IGTP + L DT++ W C C C + +F P
Sbjct: 79 -VPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGC---ASTLFAP 134
Query: 142 RASTTYSEIPCDDPLCRSPFKCQNGKC-VYTRRYHVGDVTRGLASRETFAFPVRNGFTF- 199
STT+ + C P C+ + N C V +R +++ + +A+ V++ T
Sbjct: 135 EKSTTFKNVSCAAPECK---QVPNPGCGVSSRNFNLTYGSSSIAANL-----VQDTITLA 186
Query: 200 ---VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREME 254
VP FGC + +G + + LG SL SQ +N Q FSYCL + +
Sbjct: 187 TDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPL--SLLSQTQNLYQSTFSYCLPSFKSLN 244
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ ++ G A +R ++ TP+L + R +Y++L I +GR +V PP A
Sbjct: 245 FSGSLRLGPVAQPKR--IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTT 302
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
G I D+GT T + Y + D+ R +G ++ + FD CY
Sbjct: 303 GAGTIFDSGTVFTRLVAPVYVAVR---DEFRRRVG-PKLTVTSLGGFDTCYNVP---IVV 355
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYD 428
P++TF + + +N+ C+A+ P +++ QQQN ++YD
Sbjct: 356 PTITFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYD 415
Query: 429 L 429
+
Sbjct: 416 V 416
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 165/381 (43%), Gaps = 53/381 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y ++ +G+P + ++ DT S ++W C C C QT+ + FDP +S T +
Sbjct: 80 LYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGC-PQTSGLQIQLNFFDPGSSVTATP 138
Query: 150 IPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+ C D C S QN C YT +Y G T G + F + G + VP
Sbjct: 139 VSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPN 198
Query: 203 ----LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYCLVRE 252
+ FGCS +G + GI GF +S+ SQL + QGL FS+CL E
Sbjct: 199 STAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLAS--QGLAPRVFSHCLKGE 256
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ ++ TP++ S +PH+ ++LL IS+ + P F +
Sbjct: 257 NGGGGILVLG---EIVEPNMVFTPLVPS--QPHYNVNLLSISVNGQALPINPSVFS-TSN 310
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK- 371
G G IIDTGT + ++ Y ++ + R + + + CY +S
Sbjct: 311 GQ-GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV-----SKGNQCYVIATSVAD 364
Query: 372 AYPSMTFH--------LQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQ 421
+P ++ + L DY++Q N+ +C+ Q + +ILG +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNV----GGTAVWCIGFQRIQNQGITILGDLVLK 420
Query: 422 NMLIIYDLNVPALRFGSENCA 442
+ + +YDL + + + +C+
Sbjct: 421 DKIFVYDLVGQRIGWANYDCS 441
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 165/386 (42%), Gaps = 45/386 (11%)
Query: 77 PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
PNA +L D L + +Y+ + IGTP + L+ DT S++ + C C +C
Sbjct: 60 PNAHMKLYDDLL----SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQD 115
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPFKCQN-GK-CVYTRRYHVGDVTRGLASRETFAFPVR 194
P F P ST+Y + C +P C C + GK CVY RRY + G+ S + +F
Sbjct: 116 PKFQPELSTSYQALKC-NPDC----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISF--G 168
Query: 195 NGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVR 251
N P R FGC N+ +G F + GI+G LS+ QL ++ I+ +FS C
Sbjct: 169 NESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGG 228
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILL---SD--LRPHFYLHLLEISIGRHIVRFPPGA 306
+ G+ + P ++ SD P++ + L ++ + ++ P
Sbjct: 229 MEVGGGAMVLGK--------ISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKV 280
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSL-GRQRIPYNASQEFDYCYR 365
F +G G ++D+GT + P + + D +++ + +RI D C+
Sbjct: 281 F----NGKHGTVLDSGTTYAYF---PKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFS 333
Query: 366 YDSSFKA-----YPSMTFHLQEA-DYIVQPENMYFIEPD-RGRFCVAI-QDDPKYSILGA 417
A +P + I+ PEN F RG +C+ I D ++LG
Sbjct: 334 GAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGG 393
Query: 418 WQQQNMLIIYDLNVPALRFGSENCAN 443
+N L+ YD L F NC++
Sbjct: 394 IVVRNTLVTYDRENDKLGFLKTNCSD 419
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 166/378 (43%), Gaps = 35/378 (9%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP-----IFD 140
+H + FY+ + +GTP K ++ DT S++ + PC C P FD
Sbjct: 68 LHGAVKDYGYFYAT-LYLGTPAKKFAVIVDTGSTMTYV---PCSSCGSGCGPNHQDAAFD 123
Query: 141 PRASTTYSEIPCDDPLCR--SP-FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
P AS+T S I C P C SP C +C YTR Y + G+ + A + +G
Sbjct: 124 PEASSTASRISCTSPKCSCGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLA--LHDGL 181
Query: 198 TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEA 255
P + FGC +G F + G+ G S S+ +QL I +FS C +E
Sbjct: 182 PGAP-IIFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGM-VEG 239
Query: 256 TSVIKFGRDADVRRR-DLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDG 313
+ G DA+V L+ TP+L S P +Y + +L +++ ++ FD
Sbjct: 240 DGALLLG-DAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGY-- 296
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF--- 370
G ++D+GT T++ + ++ ++ S G +R+P Q D C+ S
Sbjct: 297 --GTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDL 354
Query: 371 ----KAYPSMTFHL-QEADYIVQPENMYFIEP-DRGRFCVAIQDDPKY-SILGAWQQQNM 423
+PSM Q ++ P N F+ + G++C+ + D+ + ++LG +N+
Sbjct: 355 EALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSGKYCLGVFDNGRAGTLLGGITFRNV 414
Query: 424 LIIYDLNVPALRFGSENC 441
L+ YD + FG C
Sbjct: 415 LVRYDRANQRVGFGPALC 432
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 154/392 (39%), Gaps = 70/392 (17%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ------PCIRC----FDQTT-PIFDPRAST 145
YSV ++GTP + L+ DT SSLVWT C C C D T PI+ S+
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 146 TYSEIPCDDPLCR----SPFKCQNGK-C-VYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
T +PC P C S C K C Y Y +G T L S + N
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVS-DVLGLSKLN---R 189
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVRE----- 252
+P FGCS + GI GF S+ +QL GL FSYCLV
Sbjct: 190 IPDFLFGCS-----LVSNRQPEGIAGFGRGLASIPAQL-----GLTKFSYCLVSHRFDDT 239
Query: 253 -MEATSVIKFG-RDADVRRRDLETTPIL----LSDLRPHFYLHLLEISIGRHIVRFPPGA 306
V+ G R AD + P LS ++Y+ L +I +G V PP
Sbjct: 240 PQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRY 299
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR--YDQILRSLGRQRIPYNASQEFDY-- 362
++G GG I+D+G+ TF M+R +D + R L + Y ++E +
Sbjct: 300 LVPSKEGDGGMIVDSGSTFTF---------MERIIFDPVARELEKHMTKYKRAKEIEDSS 350
Query: 363 ----CYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS---- 413
CY + P +TF + + P YF G C+ + DP
Sbjct: 351 GLGPCYNITGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTT 410
Query: 414 ----ILGAWQQQNMLIIYDLNVPALRFGSENC 441
ILG +QQQN I YDL F + C
Sbjct: 411 GPAIILGNYQQQNFYIEYDLKKQRFGFKPQQC 442
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 165/386 (42%), Gaps = 45/386 (11%)
Query: 77 PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
PNA +L D L + +Y+ + IGTP + L+ DT S++ + C C +C
Sbjct: 60 PNAHMKLYDDLL----SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQD 115
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPFKCQN-GK-CVYTRRYHVGDVTRGLASRETFAFPVR 194
P F P ST+Y + C +P C C + GK CVY RRY + G+ S + +F
Sbjct: 116 PKFQPELSTSYQALKC-NPDC----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISF--G 168
Query: 195 NGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVR 251
N P R FGC N+ +G F + GI+G LS+ QL ++ I+ +FS C
Sbjct: 169 NESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGG 228
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILL---SD--LRPHFYLHLLEISIGRHIVRFPPGA 306
+ G+ + P ++ SD P++ + L ++ + ++ P
Sbjct: 229 MEVGGGAMVLGK--------ISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKV 280
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSL-GRQRIPYNASQEFDYCYR 365
F +G G ++D+GT + P + + D +++ + +RI D C+
Sbjct: 281 F----NGKHGTVLDSGTTYAYF---PKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFS 333
Query: 366 YDSSFKA-----YPSMTFHLQEA-DYIVQPENMYFIEPD-RGRFCVAI-QDDPKYSILGA 417
A +P + I+ PEN F RG +C+ I D ++LG
Sbjct: 334 GAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGG 393
Query: 418 WQQQNMLIIYDLNVPALRFGSENCAN 443
+N L+ YD L F NC++
Sbjct: 394 IVVRNTLVTYDRENDKLGFLKTNCSD 419
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 80/258 (31%), Positives = 119/258 (46%), Gaps = 19/258 (7%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI--HLPMAKQDLFYSVEVNIGTPM 107
NL++ E + + + S+ R + M++ A + + P+ Y V++ IGTP
Sbjct: 41 NLTEHELLRRAIQRSRYRLAGIG-MARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 108 KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC-- 163
DTAS L+WTQCQPC C+ Q P+F+PR S+TY+ +PC C +C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
+ C YT Y T G + + G +AFGCS ++G A + SG
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPPQASG 215
Query: 223 ILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT-SVIKFGRDADVRRRDLETTPI-LLS 280
++G PLSL SQL R F+YCL + G DAD R + +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272
Query: 281 DLR--PHFYLHLLEISIG 296
D R ++YL+L + IG
Sbjct: 273 DPRYPSYYYLNLDGLLIG 290
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 114/401 (28%), Positives = 164/401 (40%), Gaps = 71/401 (17%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP---CIRC-FDQT----TPIFDPRASTTYS 148
YS+ +++GTP + L+ DT SSLVW C C C F T P F PR S++
Sbjct: 84 YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143
Query: 149 EIPCDDPLCRSPF------KCQN-----GKCV-----YTRRYHVGDVTRGLASRETFAFP 192
I C +P C F KC N C Y +Y +G T GL ET FP
Sbjct: 144 LIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFP 202
Query: 193 VRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE 252
+ F+ GCS + GI GF S SL QL + FSYCLV
Sbjct: 203 NKTISDFLA----GCS-----LLSTRQPEGIAGFGRSQESLPLQLGLK---KFSYCLVSR 250
Query: 253 ------MEATSVIKFG-RDADVRRRDLETTPI---LLSDLRPHF----YLHLLEISIGRH 298
+ + ++ G +D + L TP L S P F Y+ L +I +G+
Sbjct: 251 RFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKT 310
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
V+ P DG GG I+D+G+ TF+ ++ L + ++ + + + N +
Sbjct: 311 HVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFE---KQMANYTVATNVQK 367
Query: 359 EFDYCYRYDSSFK---AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS-- 413
+D S + P +TF + + P + YF D G C+ I D +
Sbjct: 368 LTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSDNAAALG 427
Query: 414 ------------ILGAWQQQNMLIIYDLNVPALRFGSENCA 442
ILG +QQQN I YDL F ++CA
Sbjct: 428 GDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 162/383 (42%), Gaps = 54/383 (14%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSE 149
L+Y+ IG K ++ DT S +W C C C ++DP S T
Sbjct: 75 LYYT---KIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKA 131
Query: 150 IPCDDPLCRSPFK-----CQNG-KCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP- 201
+PCDD C S + C G C Y+ Y G T G ++ F V VP
Sbjct: 132 VPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 191
Query: 202 --RLAFGCSNDNSGF---AFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREME 254
+ FGC + SG + GI+GF + S+ SQL +++ +FS+CL +
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCL-DSIS 250
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G +V + ++TTP+L H+ + L +I + ++ P DI+ +
Sbjct: 251 GGGIFAIG---EVVQPKVKTTPLLQG--MAHYNVVLKDIEVAGDPIQLPS---DILDSSS 302
Query: 315 G-GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNASQEFDYCYRY---D 367
G G IID+GT + ++ + YDQ+L + QR Y +F C+ Y +
Sbjct: 303 GRGTIIDSGTTLAYLP-------VSIYDQLLEKILAQRSGMKLYLVEDQF-TCFHYSDEE 354
Query: 368 SSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQ 420
S +P++ F +E + Y +CV Q D + +LG
Sbjct: 355 SVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVL 414
Query: 421 QNMLIIYDLNVPALRFGSENCAN 443
N L++YDL+ A+ + NC++
Sbjct: 415 ANKLVVYDLDNMAIGWADYNCSS 437
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 161/381 (42%), Gaps = 46/381 (12%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
++ + + IGTP + ++ DT S L W +C+ T IF+P AS TY++IPC
Sbjct: 63 HNVTLTASLTIGTPPQNITMVLDTGSELSWLRCKK----EPNFTSIFNPLASKTYTKIPC 118
Query: 153 DDPLCRS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
C++ P C K C + Y G + ETF F G P
Sbjct: 119 SSQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRF----GSLTRPATV 174
Query: 205 FGC--SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
FGC S +S K +G++G N LS +Q+ R FSYC + +++T + G
Sbjct: 175 FGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRK---FSYC-ISGLDSTGFLLLG 230
Query: 263 RDADVRRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+ L TP++ +S P+F + L I + ++ P F G G
Sbjct: 231 EARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQ 290
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI----PYNASQEFDYCYRYDSSFKA 372
++D+GT TF+ Y L + + +L++ G R+ Y D CY DS+
Sbjct: 291 TMVDSGTQFTFLLGPVYSALRKEF--LLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSST 348
Query: 373 YPSM---TFHLQEADYIVQPENMYFIEPD--RGR---FCVAIQDDPKYSI----LGAWQQ 420
P++ + A+ V + + + P RG+ +C + + I +G QQ
Sbjct: 349 LPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQ 408
Query: 421 QNMLIIYDLNVPALRFGSENC 441
QN+ + YDL + F C
Sbjct: 409 QNVWMEYDLENSRIGFAELRC 429
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 162/377 (42%), Gaps = 41/377 (10%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
K + V + IGTP + Q ++ DT S L W QC + + +FDP S+++S +P
Sbjct: 77 KYSMILLVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLP 136
Query: 152 CDDPLCRS-------PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
C+ PLC+ P C QN C Y+ Y G + G RE F P L
Sbjct: 137 CNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQS---TPPL 193
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREME----ATSV 258
GC+ ++S GILG N LS +SQ + FSYC+ R++ T
Sbjct: 194 ILGCAEESS------DAKGILGMNLGRLSFASQAK---LTKFSYCVPTRQVRPGFTPTGS 244
Query: 259 IKFGRDAD---VRRRDLET--TPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRD 312
G + + R +L T + +L P Y ++ I IG + P AF
Sbjct: 245 FYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPS 304
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYDS--S 369
G G +ID+G+ T++ + Y + ++++R +G R + Y D C+ ++
Sbjct: 305 GAGQTMIDSGSEFTYLVDEAYNKVR---EEVVRLVGARLKKGYVYGGVSDMCFNGNAIEI 361
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD----PKYSILGAWQQQNMLI 425
+ +M F + IV + + G CV I +I+G + QQN+ +
Sbjct: 362 GRLIGNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNIWV 421
Query: 426 IYDLNVPALRFGSENCA 442
+DL + FG +C+
Sbjct: 422 EFDLANRRVGFGKADCS 438
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 168/384 (43%), Gaps = 53/384 (13%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
++ +V + +G+P + ++ DT S L W C+ T +F+P +S++YS IPC
Sbjct: 36 HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKS----PNLTSVFNPLSSSSYSPIPC 91
Query: 153 DDPLCRS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
P+CR+ P C K C Y G + + F R G + +P
Sbjct: 92 SSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNF----RIGSSALPGTL 147
Query: 205 FGCSNDNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVREMEATSV 258
FGC +SGF+ K +G++G N LS +QL GL FSYC + +++ V
Sbjct: 148 FGCM--DSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL-----GLPKFSYC-ISGRDSSGV 199
Query: 259 IKFGRDADVRRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRD 312
+ FG +L TP++ +S P+F + L I +G I+ P F
Sbjct: 200 LLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHT 259
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTL----MQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
G G ++D+GT TF+ Y L +++ +L LG + + D CYR +
Sbjct: 260 GAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGA--MDLCYRVPA 317
Query: 369 SFK--AYPSMTFHLQEADYIVQPENMYFIEPD--RGR---FCVAIQDDPKYSI----LGA 417
K P+++ + A+ +V E + + P +G+ +C+ + I +G
Sbjct: 318 GGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIGH 377
Query: 418 WQQQNMLIIYDLNVPALRFGSENC 441
QQN+ + +DL + F C
Sbjct: 378 HHQQNVWMEFDLVKSRVGFVETRC 401
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 98/416 (23%), Positives = 180/416 (43%), Gaps = 54/416 (12%)
Query: 58 HKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSV-----EVNIGTPMKPQHL 112
HK F K + S + L I LP+ SV ++ +G+P K H+
Sbjct: 31 HK-FAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHV 89
Query: 113 LFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRSPFKCQNGK 167
DT S ++W C+PC +C +T +FD AS+T ++ CDD C F Q+
Sbjct: 90 QVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFC--SFISQSDS 147
Query: 168 C--VYTRRYHV---------GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
C YH+ G R + + E ++ G + FGC +D SG
Sbjct: 148 CQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTG-PLGQEVVFGCGSDQSGQLG 206
Query: 217 GGK--ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDL 272
G + G++GF S S+ SQL + +FS+CL ++ + G V +
Sbjct: 207 NGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DNVKGGGIFAVGV---VDSPKV 262
Query: 273 ETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
+TTP++ + + H+ + L+ + + + P I+R+ GG I+D+GT + +
Sbjct: 263 KTTPMVPNQM--HYNVMLMGMDVDGTSLDLPRS---IVRN--GGTIVDSGTTLAYFPKVL 315
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQEADYIVQPEN 391
Y +L++ L RQ + + +E C+ + ++ +A+P ++F +++ + +
Sbjct: 316 YDSLIETI------LARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPH 369
Query: 392 MYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
Y + +C Q + + +LG N L++YDL+ + + N
Sbjct: 370 DYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHN 425
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 168/379 (44%), Gaps = 46/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ +G+P K ++ DT S ++W C C RC ++ ++DP+ S T +
Sbjct: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127
Query: 151 PCDDPLCRSPF-------KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTF 199
C+ C S + K +N C Y+ Y G T G ++ F NG T
Sbjct: 128 SCEHNFCSSTYEGRILGCKAEN-PCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQ 186
Query: 200 VPRLAFGCSNDNSG-FAFGGK--ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREME 254
+ FGC SG FA + + GI+GF + S+ SQL +++ +FS+CL +
Sbjct: 187 NSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNV- 245
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G +V ++TTP++ + H+ + L I + I++ P FD +
Sbjct: 246 GGGIFSIG---EVVEPKVKTTPLVPN--MAHYNVILKNIEVDGDILQLPSDTFD--SENG 298
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPYNASQEFDYCYRYDSSFKA- 372
G +ID+GT + ++ Y LM + L +Q R+ +E C++Y + +
Sbjct: 299 KGTVIDSGTTLAYLPRIVYDQLMSKV------LAKQPRLKVYLVEEQYSCFQYTGNVDSG 352
Query: 373 YPSMTFHLQEA-DYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNML 424
+P + H +++ V P + F +C+ Q + ++LG + N L
Sbjct: 353 FPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKL 412
Query: 425 IIYDLNVPALRFGSENCAN 443
++YDL + + NC++
Sbjct: 413 VVYDLENMTIGWTDYNCSS 431
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 171/383 (44%), Gaps = 52/383 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y + +GTP +P ++ DT S ++W C+PC C FDPR S+T S +
Sbjct: 40 LYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPL 99
Query: 151 PCDDPLCRSPFKCQNGKCV------YTRRYHVGDVTRGLASRETFAFP------VRNGFT 198
C D C S + C Y+ Y G T G + F + V N +
Sbjct: 100 SCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNAS 159
Query: 199 FVPRLAFGCSNDNSGFAF--GGKISGILGFNASPLSLSSQLRNRIQGL----FSYCLVRE 252
++ FGCS + SG + GI GF + LS+ SQL + QGL FS+CL
Sbjct: 160 --AKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNS--QGLAPKIFSHCLEGA 215
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ + TPI+ S +PH+ L+L I++ + P F
Sbjct: 216 DPGGGILVLG---EITEPGMVYTPIVPS--QPHYNLNLQGIAVNGQQLSIDPQVFATTN- 269
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-K 371
T G IID GT + ++ Y+ + + I+ ++ + P+ + + C+ S +
Sbjct: 270 -TRGTIIDCGTTLAYLAEEAYEPFV---NTIIAAVSQSTQPFML--KGNPCFLTVHSIDE 323
Query: 372 AYPSMTFHLQEADYIVQPENMYFIE---PDRG-RFCVAIQ-------DDPKYSILGAWQQ 420
+PS+T + + A ++P++ Y I+ PD +C+ Q D K +ILG
Sbjct: 324 IFPSVTLYFEGAPMDLKPKD-YLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVL 382
Query: 421 QNMLIIYDLNVPALRFGSENCAN 443
++ + +YDL + + S +C++
Sbjct: 383 KDKVFVYDLENQRIGWTSFDCSS 405
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 152/356 (42%), Gaps = 27/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP K ++ DT SSL W QC PC + C Q+ P+F+P+AS++Y+ + C
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 156 LCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C C+Y Y + G S++T +F G T VP +GC
Sbjct: 189 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF----GSTSVPNFYYGC 244
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL ++S +
Sbjct: 245 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 302
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ TP+ S L Y + + I + + A+ + IID+GT +T
Sbjct: 303 GQYSY--TPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPT-----IIDSGTVIT 355
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ G Y L + ++ R +A D C++ ++ P +T +
Sbjct: 356 RLPTGVYSALSKAVAGAMKGTPRA----SAFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 155/373 (41%), Gaps = 36/373 (9%)
Query: 85 DIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAS 144
D+H + + +Y+ V IGTP L+ DT S++ + C C C + P F P S
Sbjct: 24 DLHDDLLTKG-YYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALS 82
Query: 145 TTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
++Y + C C + F C +G Y R+Y + G+ ++ F + RL
Sbjct: 83 SSYKPLECGSE-CSTGF-C-DGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGG-QRLV 138
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFG 262
FGC +G + GI+G PLS+ QL +N ++ +FS C E + G
Sbjct: 139 FGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILG 198
Query: 263 -----RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+D D + P++ L L I +G +R P F DG G
Sbjct: 199 GFQPPKDMVFTASDPHRS--------PYYNLMLKGIRVGGSPLRLKPEVF----DGKYGT 246
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----- 372
++D+GT + +Q + + SL + +P + D CY + +
Sbjct: 247 VLDSGTTYAYFPGAAFQAFKSAVKEQVGSL--KEVPGPDEKFKDICYAGAGTNVSNLSQF 304
Query: 373 YPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQQNMLIIYD 428
+PS+ F + + + PEN F G +C+ + DP ++LG +NML+ Y+
Sbjct: 305 FPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPT-TLLGGIIVRNMLVTYN 363
Query: 429 LNVPALRFGSENC 441
++ F C
Sbjct: 364 RGKASIGFLKTKC 376
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 161/363 (44%), Gaps = 35/363 (9%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLF--DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
Q Y V +GTP PQ LL DT++ W C C C TT F+P AS +Y +
Sbjct: 104 QTPTYVVRARLGTP--PQQLLLAVDTSNDAAWIPCSGCAGC--PTTTPFNPAASKSYRAV 159
Query: 151 PCDDPLC-RSP---FKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
PC P C R+P C ++ Y + L S+++ A V N V FG
Sbjct: 160 PCGSPACSRAPNPSCSLNTKSCGFSLTYADSSLEAAL-SQDSLA--VAN--DVVKSYTFG 214
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRD 264
C +G A + LG S SQ ++ +G FSYCL + + + ++ GR
Sbjct: 215 CLQKATGTATPPQGLLGLGRGPL--SFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRK 272
Query: 265 ADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
R ++TTP+L++ R +Y+ + I +G+ +V PP A G ++D+GT
Sbjct: 273 GQPLR--IKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGT 330
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEA 383
T + Y + D++ R + + P ++ FD C Y+++ K +P +TF
Sbjct: 331 MFTRLVAPAYVAVR---DEVRRRI--RGAPLSSLGGFDTC--YNTTVK-WPPVTFMFTGM 382
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGS 438
+ +N+ C+A+ P +++ + QQQN I++D+ + F
Sbjct: 383 QVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAR 442
Query: 439 ENC 441
E C
Sbjct: 443 EQC 445
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 148/368 (40%), Gaps = 74/368 (20%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + IGTP +P + A VWTQC PC RCF Q P+F+
Sbjct: 27 LYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFN--------------- 71
Query: 156 LCRSPFKCQNGKCVYTRRYHV----GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
RY V GD T G+ +TFA T LAFGC+ D+
Sbjct: 72 -----------------RYEVETMFGD-TSGIGGTDTFAIG-----TATASLAFGCAMDS 108
Query: 212 SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA--TSVIKFGRDADVRR 269
+ G SG++G +P SL Q+ FSYCL A S + G A +
Sbjct: 109 NIKQLLGA-SGVVGLGRTPWSLVGQMNATA---FSYCLAPHGAAGKKSALLLGASAKLAG 164
Query: 270 -RDLETTPIL-LSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ TTP++ SD + +HL I G I+ PP ++ +DT V+F
Sbjct: 165 GKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPNGSVVL--------VDTIFGVSF 216
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY------RYDSSFKAYPSMTFHLQ 381
+ + + + + + ++G + ++ FD C+ +S P + Q
Sbjct: 217 LVDAAFHAIKK---AVTVAVGAAPM-ATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQ 272
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDP------KYSILGAWQQQNMLIIYDLNVPALR 435
A + P + Y + G C+A+ + SILG Q+N+ ++DL+ L
Sbjct: 273 GAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLS 332
Query: 436 FGSENCAN 443
F +C++
Sbjct: 333 FEPADCSS 340
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 111/418 (26%), Positives = 179/418 (42%), Gaps = 51/418 (12%)
Query: 34 LKLIPIFSPESPLYPGNL-SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
L +IP++ SP P S R+ M AR +Y++S+ P+A
Sbjct: 35 LNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLSSLVAQKTVSSA-----PIAS 89
Query: 93 QDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
F Y V V IGTP + ++ DT++ + CI C T F P AST+Y
Sbjct: 90 GQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATT---FSPNASTSYV 146
Query: 149 EIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ C P C C +G C + + Y + L +R +P
Sbjct: 147 PLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDS-----LRLATDVIPSY 201
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKF 261
+FG N SG + + LG SL SQ + G+FSYCL + + +K
Sbjct: 202 SFGSINAISGSSIPAQGLLGLGRGPL--SLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKL 259
Query: 262 GRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPG--AFDIMRDGTGGFI 318
G + + + TTP+L + RP Y ++L I++G+ V FP AFD+ + G I
Sbjct: 260 GPVG--QPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDV--NTGSGTI 315
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI--PYNASQEFDYCYRYDSSFKAYPSM 376
ID+GT +T Y + + R+++ P+++ FD C+ + A P++
Sbjct: 316 IDSGTVITRFVEPVYNAVRDEF--------RKQVTGPFSSLGAFDTCFVKNYETLA-PAI 366
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGA---WQQQNMLIIYD 428
T H + D + EN C+A+ PK Y++L +QQQN+ +++D
Sbjct: 367 TLHFTDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFD 424
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 111/418 (26%), Positives = 179/418 (42%), Gaps = 51/418 (12%)
Query: 34 LKLIPIFSPESPLYPGNL-SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
L +IP++ SP P S R+ M AR +Y++S+ P+A
Sbjct: 35 LNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLSSLVAQKTVSSA-----PIAS 89
Query: 93 QDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
F Y V V IGTP + ++ DT++ + CI C T F P AST+Y
Sbjct: 90 GQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATT---FSPNASTSYV 146
Query: 149 EIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ C P C C +G C + + Y + L +R +P
Sbjct: 147 PLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDS-----LRLATDVIPSY 201
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKF 261
+FG N SG + + LG SL SQ + G+FSYCL + + +K
Sbjct: 202 SFGSINAISGSSIPAQGLLGLGRGPL--SLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKL 259
Query: 262 GRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPG--AFDIMRDGTGGFI 318
G + + + TTP+L + RP Y ++L I++G+ V FP AFD+ + G I
Sbjct: 260 GPVG--QPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDV--NTGSGTI 315
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI--PYNASQEFDYCYRYDSSFKAYPSM 376
ID+GT +T Y + + R+++ P+++ FD C+ + A P++
Sbjct: 316 IDSGTVITRFVEPVYNAVRDEF--------RKQVTGPFSSLGAFDTCFVKNYETLA-PAI 366
Query: 377 TFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGA---WQQQNMLIIYD 428
T H + D + EN C+A+ PK Y++L +QQQN+ +++D
Sbjct: 367 TLHFTDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFD 424
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 95/386 (24%), Positives = 172/386 (44%), Gaps = 51/386 (13%)
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRAST 145
A+ L+Y+ + IG+P H+ DT S ++W C C C ++ +++P++S+
Sbjct: 68 AETGLYYA-RIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSS 126
Query: 146 TYSEIPCDDPLCRSPFK-----CQ-NGKCVYTRRYHVGDVTRGLASRETFAF--PVRNGF 197
T + I CD P C + + C+ + C Y Y G T G + V N
Sbjct: 127 TSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHK 186
Query: 198 TFVPR--LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVR 251
T + FGC SG + + GILGF + S+ SQL +++ +F++CL
Sbjct: 187 TSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL-D 245
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMR 311
+ + G +V L+TTP++ + + H+ + L + +G + P G F+
Sbjct: 246 SISGGGIFAIG---EVVEPKLKTTPVVPN--QAHYNVVLNGVKVGDTALDLPLGLFETSY 300
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRY-----DQILRSLGRQRIPYNASQEFDYCYRY 366
G IID+GT + ++ + Y LM++ D LR++ Q C+ +
Sbjct: 301 --KRGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDDQFT----------CFVF 348
Query: 367 DSSF-KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAW 418
D + +P++TF +E+ + + Y + +CV Q D + ++LG
Sbjct: 349 DKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDL 408
Query: 419 QQQNMLIIYDLNVPALRFGSENCANG 444
QN L+ Y+L + + NC++G
Sbjct: 409 VLQNKLVYYNLENQTIGWTEYNCSSG 434
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 149/322 (46%), Gaps = 40/322 (12%)
Query: 137 PIFDPRASTTYSEIPCDD--------PLCRSPFKCQNGKCVYTRRYHVGDV------TRG 182
P+ P +S++ + + C D PLC + +G + Y G+ T G
Sbjct: 13 PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72
Query: 183 LASRETFAFPVRNGFTFVPRLAFGCS-NDNSGFAFGGKISGILGFNASPLSLSSQLRNRI 241
+ ETF F + P +AFGC+ GF G SG++G LSL +QL
Sbjct: 73 ILMTETFTF--GDDAAAFPGIAFGCTLRSEGGFGTG---SGLVGLGRGKLSLVTQLNVEA 127
Query: 242 QGLFSYCLVREMEATSVIKFGRDADVRRRDLET---TPIL----LSDLRPHFYLHLLEIS 294
F Y L ++ A S I FG ADV + ++ TP+L + DL P +Y+ L IS
Sbjct: 128 ---FGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDL-PFYYVGLTGIS 183
Query: 295 IGRHIVRFPPGAFDIMRD-GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP 353
+G +V+ P G F R G GG I D+GT +T + + P TL++ D++L +G Q+ P
Sbjct: 184 VGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPD-PAYTLVR--DELLSQMGFQKPP 240
Query: 354 YNASQEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPEN----MYFIEPDRGRFCVAIQD 408
A+ + C+ SS +PSM H AD + EN M + R ++
Sbjct: 241 PAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKS 300
Query: 409 DPKYSILGAWQQQNMLIIYDLN 430
+I+G Q + +++DL+
Sbjct: 301 SQALTIIGNIMQMDFHVVFDLS 322
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 158/371 (42%), Gaps = 31/371 (8%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y ++V +GTP + ++ DT S L W QC PC+ CFDQ P+FDP AS++Y + C D
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQR 210
Query: 157 C------RSPFKCQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVR--NGFTFVPRLAF 205
C P C+ C Y Y T G + E+F + V + F
Sbjct: 211 CGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVF 270
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE-MEATSVIKFGRD 264
GC + N G +G+LG PLS +SQLR FSYCLV + S + FG D
Sbjct: 271 GCGHWNRGLFH--GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVASKVVFGED 328
Query: 265 ADVRRR----DLETTPI--LLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF- 317
+ L T S +Y+ L + +G ++ + + G
Sbjct: 329 DALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGG 388
Query: 318 -IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPS 375
IID+GT +++ YQ + Q + + +GR CY + P
Sbjct: 389 TIIDSGTTLSYFVEPAYQVIRQAF---IDRMGRSYPLIPDFPVLSPCYNVSGVDRPEVPE 445
Query: 376 MTFHLQEADYIVQPENMYFI--EPDRGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNV 431
++ + P YFI +PD G C+A+ P+ SI+G +QQQN ++YDL
Sbjct: 446 LSLLFADGAVWDFPAENYFIRLDPD-GIMCLAVLGTPRTGMSIIGNFQQQNFHVVYDLKN 504
Query: 432 PALRFGSENCA 442
L F CA
Sbjct: 505 NRLGFAPRRCA 515
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 164/384 (42%), Gaps = 41/384 (10%)
Query: 77 PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
PNA +L D L + +Y+ + IGTP + L+ DT S++ + C C +C
Sbjct: 64 PNAHMKLYDDLL----SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQD 119
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPFKCQN-GK-CVYTRRYHVGDVTRGLASRETFAFPVR 194
P F P S++Y + C +P C C + GK CVY RRY + G+ S + +F
Sbjct: 120 PKFQPELSSSYKALKC-NPDC----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNE 174
Query: 195 NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVRE 252
+ T R FGC N +G F + GI+G LS+ QL ++ I+ +FS C
Sbjct: 175 SQLT-PQRAVFGCENVETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY--- 230
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSD----LRPHFYLHLLEISIGRHIVRFPPGAFD 308
++ G A V + ++ S P++ + L ++ + ++ P F
Sbjct: 231 ----GGMEVGGGAMVLGKISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF- 285
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSL-GRQRIPYNASQEFDYCYRYD 367
+G G ++D+GT + P + + D I++ + +RI D C+
Sbjct: 286 ---NGKHGTVLDSGTTYAYF---PKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGA 339
Query: 368 SSFKA-----YPSMTFHLQEA-DYIVQPENMYFIEPD-RGRFCVAI-QDDPKYSILGAWQ 419
A +P + I+ PEN F RG +C+ I D ++LG
Sbjct: 340 GRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIV 399
Query: 420 QQNMLIIYDLNVPALRFGSENCAN 443
+N L+ YD L F NC++
Sbjct: 400 VRNTLVTYDRENDKLGFLKTNCSD 423
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 76/153 (49%), Gaps = 6/153 (3%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V + IGTP L+FDT S L WTQC+PC+ C+ Q P F+P +S++Y + C P
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSSP 193
Query: 156 LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFA 215
+C +P C C+Y Y G VT G ++E F + + + FGC +N G
Sbjct: 194 MCGNPESCSASNCLYGIGYGDGSVTVGFLAKEKFTLTNSD---VLDDIYFGCGENNKGVF 250
Query: 216 FGGKISGILGFNASPLSLSSQLRNRIQGLFSYC 248
G +GILG S Q +FSYC
Sbjct: 251 IGS--AGILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 158/362 (43%), Gaps = 31/362 (8%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q Y V +GTP + L DT++ W C C C T+ F+P AS +Y +PC
Sbjct: 103 QTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGC--PTSSPFNPAASASYRPVPC 160
Query: 153 DDPLCR---SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
P C +P N K C ++ Y + L S++T A V FGC
Sbjct: 161 GSPQCVLAPNPSCSPNAKSCGFSLSYADSSLQAAL-SQDTLAV----AGDVVKAYTFGCL 215
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDAD 266
+G A + LG S SQ ++ FSYCL + + + ++ GR+
Sbjct: 216 QRATGTAAPPQGLLGLGRGPL--SFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQ 273
Query: 267 VRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
RR ++TTP+L + R +Y+++ I +G+ +V P A G ++D+GT
Sbjct: 274 PRR--IKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMF 331
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADY 385
T + Y L D++ R +G ++ FD CY ++ A+P +T L +
Sbjct: 332 TRLVAPVYLAL---RDEVRRRVGAGAAAVSSLGGFDTCY---NTTVAWPPVTL-LFDGMQ 384
Query: 386 IVQPENMYFIEPDRGRF-CVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGSE 439
+ PE I G C+A+ P +++ + QQQN +++D+ + F E
Sbjct: 385 VTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARE 444
Query: 440 NC 441
+C
Sbjct: 445 SC 446
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 159/363 (43%), Gaps = 44/363 (12%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y +V +GTP K ++L DTASSL W C+PCI C P F+P AS+TY + C
Sbjct: 126 YVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCINACL---IPTFNPNASSTYKVVGCGSA 182
Query: 156 LC---------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
LC R C Y + YH ++ G+ S +T + + + + FG
Sbjct: 183 LCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDTLTYGLGS-----QKFIFG 237
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLR--NRIQGLFSYCLVREMEATSVIKFGRD 264
C N G GG+ SGILG + + SL SQ+ +R + + SYC ++FGR
Sbjct: 238 CCNLFRG--VGGRYSGILGMSVNKFSLFSQMTVGHRYRAM-SYCFPHPRNQ-GFLQFGR- 292
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
D + L TP+ + ++++H+ + + + MR DTGTP
Sbjct: 293 YDEHKSLLRFTPLYIDG--NNYFVHVSNVMVETMSLDVQSSGNQTMR-----CFFDTGTP 345
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----YPSMTFHL 380
T + + +L ++ G R+ + Q C++ D ++ P++
Sbjct: 346 YTMLPQSLFVSLSDTVGNLVE--GYYRVGASTGQT---CFQADGNWIEGDLYMPTVKIEF 400
Query: 381 QEADYI-VQPENMYFIEPDRGRFCVAIQ-DDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
Q I + E++ F+E + FC+A + +D +LG+ + + DL + +
Sbjct: 401 QNGARITLNSEDLMFME-EPNVFCLAFKMNDGGDIVLGSRHLMGVHTVVDLEMMTMGLRG 459
Query: 439 ENC 441
+ C
Sbjct: 460 QGC 462
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 106/422 (25%), Positives = 170/422 (40%), Gaps = 49/422 (11%)
Query: 37 IPIFSPESPLYPG-NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDL 95
+P+ P P + + +MF S AR +Y+ S K + HL + + L
Sbjct: 56 VPLLHRHGPCAPSLSTDTPPSMSEMFRRSHARLSYIVSGKKVSV-----PAHLGTSVKSL 110
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCD 153
Y V+ GTP PQ ++ DT S L W QC+PC +C Q P+FDP S+TYS +PC
Sbjct: 111 EYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCA 170
Query: 154 DPLCRS------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C+ C NG+ C + Y G T G+ ++ V FG
Sbjct: 171 SGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLAPG---AIVKDFYFG 227
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
C + S LG + SL +Q FSYCL + FG A
Sbjct: 228 CGHSKSSLPGLFDGLLGLGRLSE--SLGAQYGGGGG--FSYCLPAVNSKPGFLAFG--AG 281
Query: 267 VRRRDLETTPILLSDLRPHF-YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
TP+ +P F + L I++G + P AF +GG I+D+GT V
Sbjct: 282 RNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAF------SGGMIVDSGTVV 335
Query: 326 TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY---RYDSSFKAYPSMTFHLQE 382
T +++ Y+ L + + +++ R+ + + D CY Y + ++TF
Sbjct: 336 TVLQSTVYRALRAAFREAMKAY---RLVHG---DLDTCYDLTGYKNVVVPKIALTFSGGA 389
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
+ P + C+A + D +LG Q+ +++D + F ++
Sbjct: 390 TINLDVPNGILV------NGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAK 443
Query: 440 NC 441
C
Sbjct: 444 AC 445
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 156/340 (45%), Gaps = 39/340 (11%)
Query: 130 RCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPF-KCQNGKCVYTRRYHVGDVTRGLAS 185
C + P F P +S+T+S++PC LC+ SP+ C CVY Y +G T G +
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLA 145
Query: 186 RETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLF 245
ET G +F P +AFGCS +N G SGI+G SPLSL SQ+ G F
Sbjct: 146 TETLHV---GGASF-PGVAFGCSTEN---GVGNSSSGIVGLGRSPLSLVSQVG---VGRF 195
Query: 246 SYCLVREMEAT-SVIKFGRDADVRRRDLETTPILLSD----LRPHFYLHLLEISIGRHIV 300
SYCL + +A S I FG A V +++P +L + ++Y++L I++G +
Sbjct: 196 SYCLRSDADAGDSPILFGSLAKVTGG--KSSPAILENPEMPSSSYYYVNLTGITVGATDL 253
Query: 301 RFPPGAFDIMRDG----TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
F R GG I+D+GT +T++ Y + + + + +
Sbjct: 254 PVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGT 313
Query: 357 SQEFDYCYRYDSS--FKAYPSMTFHLQ---EADYIVQPEN-MYFIEPD-RGRFCVA---- 405
FD C+ +++ P T L+ A+Y V+ + + +E D +GR V
Sbjct: 314 RFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLV 373
Query: 406 --IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ SI+G Q ++ ++YDL+ F +CAN
Sbjct: 374 LPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 413
>gi|326531368|dbj|BAK05035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/423 (25%), Positives = 184/423 (43%), Gaps = 29/423 (6%)
Query: 32 FSLKLIPIFSPESPLYPGNLSQSERIHKMFEI-SKARANYMASMSKPNAFQELEDIHLPM 90
S L+P+F + N S + + F+I + RA + + + +ED+ LP+
Sbjct: 5 LSTLLLPVFFVSFAIAWANPSNTSGL--SFQIVALNRAVHPNGHTNNGSTYTIEDLRLPI 62
Query: 91 AKQDLF-YSVEVNIGTP--MKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
+ + Y V V++GT + + L DT +S W C+PC Q +F P AS T+
Sbjct: 63 STSAQYAYGVFVSLGTGEGTRLKVLALDTEASTSWVMCKPCHPSPPQVGNLFSPGASPTF 122
Query: 148 SEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETF------AFPVRNGFTFVP 201
+ +DP+C P++ C +H +T G SR+TF A VR +P
Sbjct: 123 HGVHSNDPVCTVPYRKTANGC----SFHFSSIT-GYLSRDTFHLRTGRAGAVRES---IP 174
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEAT--SVI 259
R+ FGC++ ++GF + G+L + PLSL +QL G FSYCL + +
Sbjct: 175 RVVFGCAHSSTGFHNDNTLGGVLSLSHLPLSLLTQLGAHASGRFSYCLPKSTGHNPHGSL 234
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
G D TT +++ ++L+L+ I+ G ++ D + I
Sbjct: 235 FLGADVPSPPPHSHTTNLVIHPGVSGYHLNLIGITRGYKRLK-----IDKRVLVSHSCSI 289
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
+ +T I Y + + ++ LG R+ + Y S + P+M FH
Sbjct: 290 NPAETITHIAEPIYLVVEKALVARMKELGSDRVKGPPGGPLWFDRMYQSVKEQLPNMAFH 349
Query: 380 LQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ A+ + ++ + RF VA + + +++GA QQ N +D+ L F S
Sbjct: 350 FEGGAELWFTSDRLFEVHGMNARFMVAGRGY-RRTVIGAAQQVNTRFTFDVARGKLSFVS 408
Query: 439 ENC 441
E C
Sbjct: 409 EVC 411
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 168/379 (44%), Gaps = 47/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-------FDQTTPIFDPRASTTYS 148
Y ++ +G+P K ++ DT S ++W C C RC D T ++DP+ S T
Sbjct: 69 LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLT--LYDPKGSETSE 126
Query: 149 EIPCDDPLCRSPFK-----CQNG-KCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP 201
I CD C + + C++ C Y+ Y G T G ++ + V + P
Sbjct: 127 LISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAP 186
Query: 202 R---LAFGCSNDNSGFAFGGK---ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREM 253
+ + FGC SG + GI+GF S S+ SQL +++ +FS+CL +
Sbjct: 187 QNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL-DNI 245
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ G +V + TTP++ H+ + L I + I++ P FD G
Sbjct: 246 RGGGIFAIG---EVVEPKVSTTPLV--PRMAHYNVVLKSIEVDTDILQLPSDIFD---SG 297
Query: 314 TG-GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-K 371
G G IID+GT + ++ Y L+ + ++ R ++ Y Q+F C++Y + +
Sbjct: 298 NGKGTIIDSGTTLAYLPAIVYDELIPK---VMARQPRLKL-YLVEQQFS-CFQYTGNVDR 352
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNML 424
+P + H +++ + + Y + G +C+ Q + ++LG N L
Sbjct: 353 GFPVVKLHFEDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKL 412
Query: 425 IIYDLNVPALRFGSENCAN 443
+IYDL A+ + NC++
Sbjct: 413 VIYDLENMAIGWTDYNCSS 431
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 168/386 (43%), Gaps = 37/386 (9%)
Query: 75 SKPNAFQELED-IHLPMAK-----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC 128
+ P+A L+ + P+A Q Y V +GTP + L DT++ W C C
Sbjct: 26 TPPDAGATLQGRAYAPIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGC 85
Query: 129 IRCFDQTTPIFDPRASTTYSEIPCDDPLCR---SPFKCQNGK-CVYTRRYHVGDVTRGLA 184
C T+ F+P AS +Y +PC P C +P N K C ++ Y + L
Sbjct: 86 AGC--PTSSPFNPAASASYRPVPCGSPQCVLAPNPSCSPNAKSCGFSLSYADSSLQAAL- 142
Query: 185 SRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
S++T A V FGC +G A + LG S SQ ++
Sbjct: 143 SQDTLAV----AGDVVKAYTFGCLQRATGTAAPPQGLLGLGRGPL--SFLSQTKDMYGAT 196
Query: 245 FSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVR 301
FSYCL + + + ++ GR+ RR ++TTP+L + R +Y+++ I +G+ +V
Sbjct: 197 FSYCLPSFKSLNFSGTLRLGRNGQPRR--IKTTPLLANPHRSSLYYVNMTGIRVGKKVVS 254
Query: 302 FPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
P A G ++D+GT T + Y L D++ R +G ++ FD
Sbjct: 255 IPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALR---DEVRRRVGAGAAAVSSLGGFD 311
Query: 362 YCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRF-CVAIQDDPK-----YSIL 415
CY ++ A+P +T L + + PE I G C+A+ P +++
Sbjct: 312 TCY---NTTVAWPPVTL-LFDGMQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVI 367
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENC 441
+ QQQN +++D+ + F E+C
Sbjct: 368 ASMQQQNHRVLFDVPNGRVGFARESC 393
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 158/377 (41%), Gaps = 41/377 (10%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
K + V + IGTP + Q ++ DT S L W QC + + +FDP S+++S +P
Sbjct: 72 KYSMILLVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLP 131
Query: 152 CDDPLCRS-------PFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
C+ PLC+ P C N C Y+ Y G + G RE F P L
Sbjct: 132 CNHPLCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQS---TPPL 188
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEA--TSVIK 260
GC+ D S GILG N LS +SQ + FSYC+ R++ T
Sbjct: 189 ILGCAEDAS------DDKGILGMNLGRLSFASQAKIT---KFSYCVPTRQVRPGFTPTGS 239
Query: 261 FGRDADVRRRDLETTPIL-------LSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRD 312
F + + +L + +L P + L+ I IG + P AF
Sbjct: 240 FYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPS 299
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYDSS-- 369
G G +ID+G+ T++ + Y + + +++R G R + Y S D C+ ++
Sbjct: 300 GAGQSMIDSGSEFTYLVDVAYNKVRE---EVVRLAGPRLKKGYVYSGVSDMCFDGNAMEI 356
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD----PKYSILGAWQQQNMLI 425
+ +M F + IV + + G CV I +I+G + QQN+ +
Sbjct: 357 GRLIGNMVFEFDKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNLWV 416
Query: 426 IYDLNVPALRFGSENCA 442
+D+ + FG +C+
Sbjct: 417 EFDIANRRVGFGKADCS 433
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 150/355 (42%), Gaps = 25/355 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP K ++ DT SSL W QC PC + C Q+ P+F+P+AS++Y+ + C
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 156 LCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C C+Y Y + G S++T +F G T VP +GC
Sbjct: 187 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF----GSTSVPNFYYGC 242
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL ++S +
Sbjct: 243 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 300
Query: 268 RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ TP+ S L Y I + V P + + IID+GT +T
Sbjct: 301 GQYSY--TPMASSSLDDSLYF----IKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 354
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
+ G Y L + ++ R +A D C++ ++ P +T +
Sbjct: 355 LPTGVYSALSKAVAGAMKGTPRA----SAFSILDTCFQGQAARLRVPEVTMAFAGGAALK 410
Query: 388 QPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 411 LAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 175/395 (44%), Gaps = 49/395 (12%)
Query: 83 LEDIHLPMA---KQDL--FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L I LP+ + D+ Y ++ IGTP K ++ DT S ++W C C +C ++T
Sbjct: 61 LAGIDLPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTL 120
Query: 137 ----PIFDPRASTTYSEIPCDDPLC----RSPFK-CQ-NGKCVYTRRYHVGDVTRGLASR 186
+++ S + + CDD C P C+ N C Y Y G T G +
Sbjct: 121 GIELTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVK 180
Query: 187 ETFAFPVRNGF----TFVPRLAFGCSNDNSGFAFGGK---ISGILGFNASPLSLSSQL-- 237
+ + G T + FGC SG + GILGF + S+ SQL
Sbjct: 181 DVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLAS 240
Query: 238 RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
R++ +F++CL + GR V + + TP++ + +PH+ +++ + +G+
Sbjct: 241 SGRVKKIFAHCL-DGRNGGGIFAIGR---VVQPKVNMTPLVPN--QPHYNVNMTAVQVGQ 294
Query: 298 HIVRFPPGAFDIMRDGT-GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+ P D+ + G G IID+GT + ++ Y+ L+++ +L + I
Sbjct: 295 EFLTIPA---DLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPAL-KVHIVDKD 350
Query: 357 SQEFDYCYRYDSSFKAYPSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQ-------D 408
+ F Y R D F P++TFH + + ++ V P + F P G +C+ Q D
Sbjct: 351 YKCFQYSGRVDEGF---PNVTFHFENSVFLRVYPHDYLF--PHEGMWCIGWQNSAMQSRD 405
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
++LG N L++YDL + + NC++
Sbjct: 406 RRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSS 440
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 157/367 (42%), Gaps = 43/367 (11%)
Query: 89 PMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-PIFDPRASTTY 147
P +++ L + E++ G+P K Q L DT SSL WTQC PC C+ Q P + P AS TY
Sbjct: 50 PHSQRGLAFMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITY 109
Query: 148 SEIPCDDPLCRS----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRN-GFTFVPR 202
+ C+D +S F C Y + Y +G ++E + GF V
Sbjct: 110 RDAMCEDSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHG 169
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIK 260
+ FGC+ + G F G +GILG S+ + ++ FS+CL + E +A+ +
Sbjct: 170 VYFGCNTLSDGSYFTG--TGILGLGVGKYSIIGEFGSK----FSFCLGEISEPKASHNLI 223
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
G A+V+ P +++ H L I +G I P +D
Sbjct: 224 LGDGANVQGH-----PTVINITEGHTIFQLESIIVGEEITLDDPVQ----------VFVD 268
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
TG+ ++ + Y + +D + +G + + Y E CY+ D + + M
Sbjct: 269 TGSTLSHLSTNLYYKFVDAFDDL---IGSRPLSY----EPTLCYKAD-TIERLEKMDVGF 320
Query: 381 Q---EADYIVQPENMYFIEPDRGRFCVAIQDDPK---YSILGAWQQQNMLIIYDLNVPAL 434
+ A+ V N++ + C+AIQ++ + + I+G Q + YDL+
Sbjct: 321 KFDVGAELSVNIHNIFIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTA 380
Query: 435 RFGSENC 441
++C
Sbjct: 381 YINKQDC 387
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/344 (26%), Positives = 146/344 (42%), Gaps = 47/344 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y ++ +GTP + ++ DT S ++W C C C QT+ + FDP +S T S
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGC-PQTSGLQIQLNFFDPGSSVTASP 138
Query: 150 IPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
I C D C S QN C YT +Y G T G + F + G + VP
Sbjct: 139 ISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPN 198
Query: 203 ----LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYCLVRE 252
+ FGCS +G + GI GF +S+ SQL + QG+ FS+CL E
Sbjct: 199 STAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLAS--QGIAPRVFSHCLKGE 256
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ ++ TP++ S +PH+ ++LL IS+ + P F
Sbjct: 257 NGGGGILVLG---EIVEPNMVFTPLVPS--QPHYNVNLLSISVNGQALPINPSVFSTSNG 311
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-K 371
G IIDTGT + ++ Y ++ + R + + + CY +S
Sbjct: 312 --QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV-----SKGNQCYVITTSVGD 364
Query: 372 AYPSMTFH--------LQEADYIVQPENMYFIEPDRGRFCVAIQ 407
+P ++ + L DY++Q N+ GR+C +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNVASALCFLGRYCSVVH 408
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 149/335 (44%), Gaps = 30/335 (8%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ IGTP K ++ DT S ++W C C RC ++ ++D +ASTT +
Sbjct: 77 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 136
Query: 151 PCDDPLCR---SPF-KCQNG-KCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP--- 201
CDD C P C+ G +C+Y+ Y G T G ++ + + F P
Sbjct: 137 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNG 196
Query: 202 RLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
+ FGC N SG + + GILGF + S+ SQL +++ +FS+CL ++
Sbjct: 197 TVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGG 255
Query: 258 VIKFGRDADVRRRDLETTPILLSDL---RPHFYLHLLEISIGRHIVRFPPGAFDIM-RDG 313
+ G + + R L +++ L R H+ + + EI +G + P AF+ R G
Sbjct: 256 IFAIGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG 315
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
T IID+GT + + Y L+++ L R A FDY D F
Sbjct: 316 T---IIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDDGF--- 368
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD 408
P++T H ++ + + Y + +C+ Q+
Sbjct: 369 PTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQN 403
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 107/430 (24%), Positives = 183/430 (42%), Gaps = 52/430 (12%)
Query: 34 LKLIPIFSPESPLYPGNLSQS--ERIHKMFEISKARANYMASM--SKPNAFQELEDIHLP 89
L +IPI + SP P ++S S + + M R Y++S+ KP + +P
Sbjct: 39 LSIIPINAKCSPFAPTHVSASVIDTVLHMASSDSHRLTYLSSLVAGKP------KPTSVP 92
Query: 90 MAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
+A + Y V +GTP + ++ DT++ VW C C C + +T F+ +S+
Sbjct: 93 VASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNAST-SFNTNSSS 151
Query: 146 TYSEIPCDDPLCRSP--FKC-----QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFT 198
TYS + C C C Q C + + Y ++T
Sbjct: 152 TYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL----APD 207
Query: 199 FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEAT 256
+P +FGC N SG + + G++G P+SL SQ + G+FSYCL R +
Sbjct: 208 VIPNFSFGCINSASGNSLPPQ--GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFS 265
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
+K G + + + TP+L + RP +Y++L +S+G V P +
Sbjct: 266 GSLKLGLLG--QPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGA 323
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI---PYNASQEFDYCYRYDSSFKA 372
G IID+GT +T Y+ + + R+++ ++ FD C+ D+ A
Sbjct: 324 GTIIDSGTVITRFAQPVYEAIRDEF--------RKQVNVSSFSTLGAFDTCFSADNENVA 375
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI-----QDDPKYSILGAWQQQNMLIIY 427
P +T H+ D + EN C+++ + +++ QQQN+ I++
Sbjct: 376 -PKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILF 434
Query: 428 DLNVPALRFG 437
D VP R G
Sbjct: 435 D--VPNSRIG 442
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 113/425 (26%), Positives = 167/425 (39%), Gaps = 85/425 (20%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQH--LLFDTASSLVWTQCQP--CIRCFDQTTP---- 137
+ LP+A Y++ +++G P L DT S LVW C P C+ C + TP
Sbjct: 78 LSLPLAPGS-DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNH 136
Query: 138 --------------IFDPRASTTYSEIPCDDPLC---RSPF------KCQNGKC-----V 169
P S +S P D LC R P C + C
Sbjct: 137 SSPLPPPIDSRRISCASPLCSAAHSSAPTSD-LCAAARCPLDAIETDSCASHACPPLYYA 195
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y V ++ RG + V N FTF C++ A + G+ GF
Sbjct: 196 YGDGSLVANLRRGRVGLAA-SMAVEN-FTFA------CAHT----ALAEPV-GVAGFGRG 242
Query: 230 PLSLSSQLRNRIQGLFSYCLVRE------MEATSVIKFGRDAD-----VRRRDLETTPIL 278
PLSL +QL + G FSYCLV + +S + GR D D TP+L
Sbjct: 243 PLSLPAQLAPSLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLL 302
Query: 279 LSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLM 337
+ P+FY LE +S+G ++ P D+ RDG GG ++D+GT T + P T
Sbjct: 303 HNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTML---PSDTFA 359
Query: 338 QRYDQILRSLGRQRIPYNASQEFDY----CYRYDSSFKAYPSMTFHLQEADYIVQPENMY 393
+ D+ R++ R E CY Y S +A P + H + + P Y
Sbjct: 360 RVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNY 419
Query: 394 FI----EPDRGRFCVAI------QDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGS 438
F+ E R C+ + DD + LG +QQQ ++YD++ + F
Sbjct: 420 FMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFAR 479
Query: 439 ENCAN 443
C +
Sbjct: 480 RRCTD 484
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 152/361 (42%), Gaps = 38/361 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G+P K + DT S ++W C PC C + F+P S+T S+I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 151 PCDDPLCRSPFK-----CQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FT 198
PC D C + + CQ N C YT Y G T G +T F G
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209
Query: 199 FVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNRIQG--LFSYCLVREME 254
+ FGCSN SG + GI GF LS+ SQL + +FS+CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 269
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
++ G ++ L TP++ S +PH+ L+L I + + P + T
Sbjct: 270 GGGILVLG---EIVEPGLVYTPLVPS--QPHYNLNLESIVVNGQ--KLPIDSSLFTTSNT 322
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
G I+D+GT + ++ +G Y + + R + +Q F DSSF P
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVS-KGNQCFVTSSSVDSSF---P 378
Query: 375 SMTFH-LQEADYIVQPENMYFIEP---DRGRFCVAIQDD--PKYSILGAWQQQNMLIIYD 428
+++ + + V+PEN + + +C+ Q + + +ILG ++ + +YD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438
Query: 429 L 429
L
Sbjct: 439 L 439
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 152/356 (42%), Gaps = 27/356 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +GTP K ++ DT SSL W QC PC + C Q+ P+F+P+AS++Y+ + C
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 156 LCR-------SPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C +P C C+Y Y + G S++T +F G T VP +GC
Sbjct: 187 QCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF----GSTSVPNFYYGC 242
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
DN G G+ +G++G + LSL QL + FSYCL ++S +
Sbjct: 243 GQDNEGLF--GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNP 300
Query: 268 RRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ TP+ S L Y + + I + + A+ + IID+GT +T
Sbjct: 301 GQYSY--TPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPT-----IIDSGTVIT 353
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
+ G Y L + ++ R +A D C++ ++ P +T +
Sbjct: 354 RLPTGVYSALSKAVAGAMKGTPRA----SAFSILDTCFQGQAARLRVPEVTMAFAGGAAL 409
Query: 387 VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ D C+A +I+G QQQ ++YD+ + F + C+
Sbjct: 410 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 151/317 (47%), Gaps = 36/317 (11%)
Query: 145 TTYSEIPCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG- 196
T + CD PLC SP K +C YT Y +T+G+ +++T F G
Sbjct: 14 TVLAHNSCDSPLCHKLDTGVCSPEK----RCNYTYGYGDNSLTKGVLAQDTATFTSNTGK 69
Query: 197 FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQG-LFSYCLV---RE 252
+ R FGC ++N+G F G++G P SL SQ+ G FS CLV +
Sbjct: 70 LVSLSRFLFGCGHNNTG-GFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTD 128
Query: 253 MEATSVIKFGRDADVRRRDLETTPILL--SDLRPHFYLHLLEISIGRHIVRFPPGAFDIM 310
++ +S + FG+ + V + TTP++ D+ +F + LL IS+ + P I
Sbjct: 129 IKISSRMSFGKGSQVLGDGVVTTPLVQREQDMTSYF-VTLLGISVED---TYLPMNSTIE 184
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF 370
+ G ++D+GTP + P Q + Y ++ ++ + I + S CYR ++
Sbjct: 185 K---GNMLVDSGTPPNIL---PQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNL 238
Query: 371 KAYPSMTFHLQEADYIVQPENMYFIEP---DRGRFCVAIQD--DPKYSILGAWQQQNMLI 425
K P++T+H + A+ ++ P FI P +G FC+AI + + + G + Q N LI
Sbjct: 239 KG-PTLTYHFEGANLLLTPIQT-FIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLI 296
Query: 426 IYDLNVPALRFGSENCA 442
+DL+ + F + +C
Sbjct: 297 GFDLDRQVVSFKATDCT 313
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 131/273 (47%), Gaps = 35/273 (12%)
Query: 84 EDIHLPMA-KQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC---FDQT 135
E + P++ D+F Y +++GTP + ++ DT S++ W +C PC C D
Sbjct: 23 EVVSFPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVP 82
Query: 136 TPI--FDPRASTTYSEIPCDDPLC---RSPFKC--QNGKCVYTRRYHVGDVTRGLASRET 188
P+ FDPR STT I C D C +C + C Y+ Y G T G +
Sbjct: 83 VPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDV 142
Query: 189 FAF---PVRNGF--TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRI 241
F F P N + RL FGC +G + G+LGF + +SL +QL +N
Sbjct: 143 FTFNQVPSDNSTAKSGTARLVFGCGGTQTG---SWSVDGLLGFGPTTVSLPNQLAQQNIS 199
Query: 242 QGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI-GRHIV 300
+F++CL ++ + G +R DL TP++ + H+ + LL I I GR++
Sbjct: 200 VNIFAHCLQGDVSGRGSLVIGT---IREPDLVYTPMVFGE--DHYNVQLLNIGISGRNVT 254
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPY 333
P +FD+ + TGG IID+GT +T++ Y
Sbjct: 255 T--PASFDL--EYTGGVIIDSGTTLTYLVQPAY 283
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 155/355 (43%), Gaps = 31/355 (8%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q+ Y V IGTP + + DT+S + W C C+ C ++ +F+ ASTTY + C
Sbjct: 32 QNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGC 88
Query: 153 DDPLCRSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C+ K C G C + Y + L S++T VP +FGC
Sbjct: 89 QAAQCKQVPKPTCGGGVCSFNLTYGGSSLAANL-SQDTITLATDA----VPGYSFGCIQK 143
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVR 268
+G + + LG SL SQ +N Q FSYCL + + + ++ G +
Sbjct: 144 ATGGSLPAQGLLGLGRGPL--SLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPK 201
Query: 269 RRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
R ++ TP+L + RP Y ++L+ + +GR +V PPG+F G I D+GT T
Sbjct: 202 R--IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTR 259
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
+ Y + D +GR + + FD CY A P++TF + +
Sbjct: 260 LVTPAY---IAVRDAFRNRVGRN-LTVTSLGGFDTCYTVP---IAAPTITFMFTGMNVTL 312
Query: 388 QPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFG 437
P+N+ C+A+ P +++ QQQN ++YD VP R G
Sbjct: 313 PPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYD--VPNSRLG 365
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/431 (25%), Positives = 162/431 (37%), Gaps = 45/431 (10%)
Query: 33 SLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMA------SMSKPNAFQELEDI 86
S +P+ P P + + ++ + RA Y+ S S + Q+ I
Sbjct: 51 SGTTVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAI 110
Query: 87 HLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPR 142
LP A L Y + V+IGTP Q ++ DT S + W C R ++ FDP
Sbjct: 111 TLPTTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSLFFDPG 168
Query: 143 ASTTYSEIPCDDPLC-----RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
S+TY+ C C R N C YT RY G T G +T A N
Sbjct: 169 KSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLAL---NST 225
Query: 198 TFVPRLAFGCS--NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA 255
V FGCS +D + G++G SL SQ FSYCL +
Sbjct: 226 EKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRS 285
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGT 314
+ + G A TTP+ S P FY +L+ I++G V P F
Sbjct: 286 SGFLTLG--ASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVF------A 337
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-- 372
G I+D+GT +T + Y L + +R R R FD+ + + S A
Sbjct: 338 AGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVE 397
Query: 373 --YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLN 430
+ +AD I+ + F P G SI+G QQ+ +++D+
Sbjct: 398 LVFSGGAVVDLDADGIMYGSCLAF-APATGGI---------GSIIGNVQQRTFEVLHDVG 447
Query: 431 VPALRFGSENC 441
L F C
Sbjct: 448 QSVLGFRPGAC 458
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 155/362 (42%), Gaps = 32/362 (8%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-D 154
+Y+ + IGTP + L+ DT S++ + C C +C P FDP +S+TY I C+ D
Sbjct: 82 YYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNID 141
Query: 155 PLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP-RLAFGCSNDNSG 213
+C S +CVY R+Y + G+ + +F N +P R FGC N +G
Sbjct: 142 CICDS----DGVQCVYERQYAEMSTSSGVLGEDVISF--GNQSELIPQRAVFGCENMETG 195
Query: 214 FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCL-VREMEATSVIKFGRDADVRRR 270
F + GI+G LSL QL + I FS C ++ +++ G
Sbjct: 196 DLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMI 255
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+ P+ P++ + L EI + + G F DG G ++D+GT ++
Sbjct: 256 FTYSDPV----RSPYYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYL-- 305
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCYRYDSSFKA-----YPSMTFHLQEAD 384
P + D I+ + + F D C+ S A +P++ +
Sbjct: 306 -PAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364
Query: 385 YI-VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ + PEN +F G +C+ I + + ++LG +N L++YD + F N
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTN 424
Query: 441 CA 442
C+
Sbjct: 425 CS 426
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 155/362 (42%), Gaps = 32/362 (8%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-D 154
+Y+ + IGTP + L+ DT S++ + C C +C P FDP +S+TY I C+ D
Sbjct: 82 YYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNID 141
Query: 155 PLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP-RLAFGCSNDNSG 213
+C S +CVY R+Y + G+ + +F N +P R FGC N +G
Sbjct: 142 CICDS----DGVQCVYERQYAEMSTSSGVLGEDVISF--GNQSELIPQRAVFGCENMETG 195
Query: 214 FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCL-VREMEATSVIKFGRDADVRRR 270
F + GI+G LSL QL + I FS C ++ +++ G
Sbjct: 196 DLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMI 255
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+ P+ P++ + L EI + + G F DG G ++D+GT ++
Sbjct: 256 FTYSDPV----RSPYYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYL-- 305
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCYRYDSSFKA-----YPSMTFHLQEAD 384
P + D I+ + + F D C+ S A +P++ +
Sbjct: 306 -PAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364
Query: 385 YI-VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ + PEN +F G +C+ I + + ++LG +N L++YD + F N
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTN 424
Query: 441 CA 442
C+
Sbjct: 425 CS 426
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 166/384 (43%), Gaps = 53/384 (13%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
++ +V + +G+P + ++ DT S L W C+ T +F+P +S++YS IPC
Sbjct: 996 HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKS----PNLTSVFNPLSSSSYSPIPC 1051
Query: 153 DDPLCRS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
P+CR+ P C K C Y G + + F R G + +P
Sbjct: 1052 SSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNF----RIGSSALPGTL 1107
Query: 205 FGCSNDNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVREMEATSV 258
FGC +SGF+ K +G++G N LS +QL GL FSYC + +++ V
Sbjct: 1108 FGCM--DSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL-----GLPKFSYC-ISGRDSSGV 1159
Query: 259 IKFGRDADVRRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRD 312
+ FG +L TP++ +S P+F + L I +G I+ P F
Sbjct: 1160 LLFGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHT 1219
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTL----MQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
G G ++D+GT TF+ Y L +++ +L LG + + + Y
Sbjct: 1220 GAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGG 1279
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR-----FCVAIQDDPKYSI----LGAWQ 419
PS++ + A+ +V E + + P+ + +C+ + I +G
Sbjct: 1280 KLPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHH 1339
Query: 420 QQNMLIIYDLNVPALRFGSENCAN 443
QQN+ + +DL + F ++ C +
Sbjct: 1340 QQNVWMEFDL----VAFAADLCGS 1359
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 113/425 (26%), Positives = 167/425 (39%), Gaps = 85/425 (20%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQH--LLFDTASSLVWTQCQP--CIRCFDQTTP---- 137
+ LP+A Y++ +++G P L DT S LVW C P C+ C + TP
Sbjct: 78 LSLPLAPGS-DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNH 136
Query: 138 --------------IFDPRASTTYSEIPCDDPLC---RSPF------KCQNGKC-----V 169
P S +S P D LC R P C + C
Sbjct: 137 SSPLPPPIDSRRISCASPLCSAAHSSAPTSD-LCAAARCPLDAIETDSCASHACPPLYYA 195
Query: 170 YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNAS 229
Y V ++ RG + V N FTF C++ A + G+ GF
Sbjct: 196 YGDGSLVANLRRGRVGLAA-SMAVEN-FTFA------CAHT----ALAEPV-GVAGFGRG 242
Query: 230 PLSLSSQLRNRIQGLFSYCLVRE------MEATSVIKFGRDAD-----VRRRDLETTPIL 278
PLSL +QL + G FSYCLV + +S + GR D D TP+L
Sbjct: 243 PLSLPAQLAPSLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLL 302
Query: 279 LSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLM 337
+ P+FY LE +S+G ++ P D+ RDG GG ++D+GT T + P T
Sbjct: 303 HNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTML---PSDTFA 359
Query: 338 QRYDQILRSLGRQRIPYNASQEFDY----CYRYDSSFKAYPSMTFHLQEADYIVQPENMY 393
+ D+ R++ R E CY Y S +A P + H + + P Y
Sbjct: 360 RVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNY 419
Query: 394 FI----EPDRGRFCVAI------QDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGS 438
F+ E R C+ + DD + LG +QQQ ++YD++ + F
Sbjct: 420 FMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFAR 479
Query: 439 ENCAN 443
C +
Sbjct: 480 RRCTD 484
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 90/169 (53%), Gaps = 9/169 (5%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + +GTP + Q+++ DT S + W QC+PC C+ Q PIF+P S ++S + CD +
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAV 216
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C + C +G C+Y Y G + G + ET F G T V +A GC + N G
Sbjct: 217 CSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTF----GTTSVANVAIGCGHKNVGL 272
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFG 262
+G+LG A LS +Q+ + FSYCLV RE +++ ++FG
Sbjct: 273 FI--GAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFG 319
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 156/367 (42%), Gaps = 40/367 (10%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q + V IGTP + L DT++ W C CI C +T +F S+++ +PC
Sbjct: 22 QSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPC 79
Query: 153 DDPLCR---SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF----VPRLAF 205
P C +P C C + Y V L V++ T VP F
Sbjct: 80 QSPQCNQVPNP-SCSGSACGFNLTYGSSTVAADL---------VQDNLTLATDSVPSYTF 129
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGR 263
GC +G + + LG L SQ + Q FSYCL + + + ++ G
Sbjct: 130 GCIRKATGSSVPPQGLLGLGRGPLSLLGQSQ--SLYQSTFSYCLPSFKSVNFSGSLRLGP 187
Query: 264 DADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
A R ++ TP+L + R +Y++L+ I +GR IV PP A G +ID+G
Sbjct: 188 VAQPIR--IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSG 245
Query: 323 TPVTFIR-NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
T TF R P T ++ D+ R +GR + ++ FD CY P++TF
Sbjct: 246 T--TFTRLVAPAYTAVR--DEFRRRVGRN-VTVSSLGGFDTCYTVP---IISPTITFMFA 297
Query: 382 EADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRF 436
+ + P+N C+A+ P +++ + QQQN I++D+ +
Sbjct: 298 GMNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGV 357
Query: 437 GSENCAN 443
E+C++
Sbjct: 358 ARESCSS 364
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 152/361 (42%), Gaps = 38/361 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G+P K + DT S ++W C PC C + F+P S+T S+I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 151 PCDDPLCRSPFK-----CQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FT 198
PC D C + + CQ N C YT Y G T G +T F G
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTAN 209
Query: 199 FVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNRIQG--LFSYCLVREME 254
+ FGCSN SG + GI GF LS+ SQL + +FS+CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 269
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
++ G ++ L TP++ S +PH+ L+L I + + P + T
Sbjct: 270 GGGILVLG---EIVEPGLVYTPLVPS--QPHYNLNLESIVVNGQ--KLPIDSSLFTTSNT 322
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
G I+D+GT + ++ +G Y + + R + +Q F DSSF P
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVS-KGNQCFVTSSSVDSSF---P 378
Query: 375 SMTFH-LQEADYIVQPENMYFIEP---DRGRFCVAIQDD--PKYSILGAWQQQNMLIIYD 428
+++ + + V+PEN + + +C+ Q + + +ILG ++ + +YD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438
Query: 429 L 429
L
Sbjct: 439 L 439
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/397 (23%), Positives = 166/397 (41%), Gaps = 53/397 (13%)
Query: 83 LEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----F 132
L I +P+ L Y +V +G+P K ++ DT S ++W C C C
Sbjct: 53 LAAIDVPLGGNGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGL 112
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLCRSPFK-----C-QNGKCVYTRRYHVGDVTRGLASR 186
++DP S T + +PC D C + C Q+ C Y+ Y G T G
Sbjct: 113 GMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVN 172
Query: 187 ETFAFPVRNGFTFVP----RLAFGCSNDNSGFAFGGK---ISGILGFNASPLSLSSQL-- 237
++ F +G + FGC SG + GI+GF + S+ SQL
Sbjct: 173 DSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAA 232
Query: 238 RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
+++ +FS+CL + G+ V TTP++ H+ + L ++ +
Sbjct: 233 SGKVKRIFSHCL-DSHHGGGIFSIGQ---VMEPKFNTTPLV--PRMAHYNVILKDMDVDG 286
Query: 298 HIVRFPPGAFDIMRDGTG-GFIIDTGTPVTFIRNGPYQTLMQRYDQIL-RSLGRQ-RIPY 354
+ P FD G+G G IID+GT + ++ + Y+Q+L + LGRQ +
Sbjct: 287 EPILLPLYLFD---SGSGRGTIIDSGTTLAYLP-------LSIYNQLLPKVLGRQPGLKL 336
Query: 355 NASQEFDYCYRY-DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS 413
++ C+ Y D + +P + FH + V P + F+ + +C+ Q +
Sbjct: 337 MIVEDQFTCFHYSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKED-IYCIGWQKSSTQT 395
Query: 414 -------ILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
++G N L++YDL + + + NC++
Sbjct: 396 KEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSS 432
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 175/395 (44%), Gaps = 49/395 (12%)
Query: 83 LEDIHLPMA---KQDL--FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
L I LP+ + D+ Y ++ IGTP K ++ DT S ++W C C +C ++T
Sbjct: 61 LAGIDLPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTL 120
Query: 137 ----PIFDPRASTTYSEIPCDDPLC----RSPFK-CQ-NGKCVYTRRYHVGDVTRGLASR 186
+++ S + + CDD C P C+ N C Y Y G T G +
Sbjct: 121 GIELTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVK 180
Query: 187 ETFAFPVRNGF----TFVPRLAFGCSNDNSGFAFGGK---ISGILGFNASPLSLSSQL-- 237
+ + G T + FGC SG + GILGF + S+ SQL
Sbjct: 181 DVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLAS 240
Query: 238 RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
R++ +F++CL + GR V + + TP++ + +PH+ +++ + +G+
Sbjct: 241 SGRVKKIFAHCL-DGRNGGGIFAIGR---VVQPKVNMTPLVPN--QPHYNVNMTAVQVGQ 294
Query: 298 HIVRFPPGAFDIMRDGT-GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+ P D+ + G G IID+GT + ++ Y+ L+++ +L + I
Sbjct: 295 EFLNIPA---DLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPAL-KVHIVDKD 350
Query: 357 SQEFDYCYRYDSSFKAYPSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQ-------D 408
+ F Y R D F P++TFH + + ++ V P + F P G +C+ Q D
Sbjct: 351 YKCFQYSGRVDEGF---PNVTFHFENSVFLRVYPHDYLF--PYEGMWCIGWQNSAMQSRD 405
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
++LG N L++YDL + + NC++
Sbjct: 406 RRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSS 440
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 157/373 (42%), Gaps = 38/373 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEIP 151
Y V +G+P K + DT S ++W C PC C + F+P S+T S+IP
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 152 CDDPLCRSPFK-----CQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTF 199
C D C + + CQ N C YT Y G T G +T F G
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 200 VPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNRIQG--LFSYCLVREMEA 255
+ FGCSN SG + GI GF LS+ SQL + +FS+CL
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNG 296
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
++ G ++ L TP++ S +PH+ L+L I + + P + T
Sbjct: 297 GGILVLG---EIVEPGLVYTPLVPS--QPHYNLNLESIVVNGQ--KLPIDSSLFTTSNTQ 349
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPS 375
G I+D+GT + ++ +G Y + + R + +Q F DSSF P+
Sbjct: 350 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVS-KGNQCFVTSSSVDSSF---PT 405
Query: 376 MTFH-LQEADYIVQPENMYFIEP---DRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDL 429
++ + + V+PEN + + +C+ Q + + +ILG ++ + +YDL
Sbjct: 406 VSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDL 465
Query: 430 NVPALRFGSENCA 442
+ + +C+
Sbjct: 466 ANMRMGWTDYDCS 478
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 155/376 (41%), Gaps = 40/376 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G P K + DT S ++W C PC C + F+P +S+T S I
Sbjct: 4 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 63
Query: 151 PCDDPLCRSPFK-----CQNGK-----CVYTRRYHVGDVTRGLASRETFAFPVRNG---- 196
C D C + F+ CQ C YT Y G T G +T F G
Sbjct: 64 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 123
Query: 197 FTFVPRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQG--LFSYCLVRE 252
+ FGCSN SG + GI GF LS+ SQL + +FS+CL
Sbjct: 124 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGS 183
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ L TP++ S +PH+ L+L I++ + P +
Sbjct: 184 DNGGGILVLG---EIVEPGLVYTPLVPS--QPHYNLNLESIAVNGQ--KLPIDSSLFTTS 236
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
T G I+D+GT + ++ +G Y + + R + SQ F DSSF
Sbjct: 237 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVS-KGSQCFITSSSVDSSF-- 293
Query: 373 YPSMTFHLQEADYI-VQPENMYFIEPDRGR---FCVAIQDD--PKYSILGAWQQQNMLII 426
P++T + + V+PEN + +C+ Q + + +ILG ++ + +
Sbjct: 294 -PTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 352
Query: 427 YDLNVPALRFGSENCA 442
YDL + + +C+
Sbjct: 353 YDLANMRMGWADYDCS 368
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/417 (23%), Positives = 159/417 (38%), Gaps = 36/417 (8%)
Query: 38 PIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF- 96
P+ S E P + L + +++ + +K + Y N +EL+ + + +
Sbjct: 71 PVISKEKPSHEETLRR-DQLRAAYIQAKVSSRYN------NVAKELQQSAVTIPTSSGYS 123
Query: 97 -----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSE 149
Y + V IGTP Q + DT S + W QC PC C Q +FDP S TYS
Sbjct: 124 LGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSA 183
Query: 150 IPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
C C C +C Y +Y G T G +T + + V F
Sbjct: 184 FSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDA---VKSFQF 240
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFGRD 264
GCS+ +GF G++ G++G SL SQ FSYCL + G
Sbjct: 241 GCSHRAAGFV--GELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAA 298
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
TP++ + + + L I++ ++ P F +G ++D+GT
Sbjct: 299 GGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVF------SGASVVDSGTV 352
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEAD 384
+T + YQ L + + +++ P + D C+ + S F T L +
Sbjct: 353 ITQLPPTAYQALRTAFKKEMKAY-PSAAPVGS---LDTCFDF-SGFNTITVPTVTLTFSR 407
Query: 385 YIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
++ I A D ILG QQ+ +++D+ + F S C
Sbjct: 408 GAAMDLDISGILYAGCLAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 108/426 (25%), Positives = 161/426 (37%), Gaps = 81/426 (19%)
Query: 86 IHLPMA-KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQP--CIRCFDQTTPIFDPR 142
+ LP++ D S+ V + P L DT S LVW C P C+ C + TP
Sbjct: 84 LSLPLSPGSDYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHS 143
Query: 143 ASTTYS--------EIPCDDPLCRSPFK-------CQNGKCVYTRRYHVGDVTRGLASRE 187
+S +PC PLC + C C D+ G
Sbjct: 144 SSAPLPLPPPPDSRRVPCASPLCSAAHASAPPSDLCAAAGCPLE------DIETGSCRGA 197
Query: 188 TFAFP--------------VRNGFTFVPRLAFGCSNDNSGFAFG------GKISGILGFN 227
+ A P +R G R+ G S F F G+ G+ GF
Sbjct: 198 SHACPPLYYAYGDGSLVAHLRRG-----RVGLGASVAVDNFTFACAHTALGEPVGVAGFG 252
Query: 228 ASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIK-----FGR--DADVRRRDLETTPILL 279
PLSL QL ++ G FSYCLV A +I+ GR DA TP+L
Sbjct: 253 RGPLSLPGQLAPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDAAAETGGFVYTPLLH 312
Query: 280 SDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQ 338
+ P+FY LE +S+G ++ P + R G GG ++D+GT T + N Y + +
Sbjct: 313 NPKHPYFYSVALEAVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTMLPNETYARVAE 372
Query: 339 RYDQILRSLGRQRIPYNASQE-FDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFI-- 395
+ + + + G R Q CY Y +S + P + H + + P YF+
Sbjct: 373 AFARAMAAAGFARAERAEEQTGLTPCYHYAASDRGVPPLALHFRGNATVALPRRNYFMGF 432
Query: 396 --EPDRGRF-------CVAIQ-----------DDPKYSILGAWQQQNMLIIYDLNVPALR 435
E + G C+ + DD LG +QQQ ++YD++ +
Sbjct: 433 KSEEEAGGAGRKDDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVG 492
Query: 436 FGSENC 441
F C
Sbjct: 493 FARRRC 498
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 154/375 (41%), Gaps = 40/375 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G P K + DT S ++W C PC C + F+P +S+T S I
Sbjct: 88 LYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRI 147
Query: 151 PCDDPLCRSPFK-----CQNGK-----CVYTRRYHVGDVTRGLASRETFAFPVRNG---- 196
PC D C + + CQ+ C YT Y G T G +T F G
Sbjct: 148 PCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQT 207
Query: 197 FTFVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNRIQG--LFSYCLVRE 252
+ FGCSN SG + GI GF LS+ SQL + FS+CL
Sbjct: 208 ANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKGS 267
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ L TP++ S +PH+ L+L I++ + P +
Sbjct: 268 DNGGGILVLG---EIVEPGLVFTPLVPS--QPHYNLNLESIAVSGQ--KLPIDSSLFATS 320
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
T G I+D+GT + ++ +G Y + + + + Q F DSSF
Sbjct: 321 NTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPS-VRSVVSKGIQCFVTTSSVDSSF-- 377
Query: 373 YPSMTFHLQEA-DYIVQPENMYFIE----PDRGRFCVAIQDDPKYSILGAWQQQNMLIIY 427
P+ T + + V+PEN Y ++ + +C+ Q +ILG ++ + +Y
Sbjct: 378 -PTATLYFKGGVSMTVKPEN-YLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVY 435
Query: 428 DLNVPALRFGSENCA 442
DL + + +C+
Sbjct: 436 DLANMRMGWADYDCS 450
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 119/498 (23%), Positives = 200/498 (40%), Gaps = 89/498 (17%)
Query: 10 AAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARAN 69
++F F +FLTH+ S S ++ L+P+ +LS+S+ F +
Sbjct: 3 SSFLFLFMTIFLTHYVFSCS---AIVLLPLTH--------SLSKSQ-----FNSTPHLLK 46
Query: 70 YMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGT-PMKPQHLLFDTASSLVWTQCQP- 127
+ ++ S I LP++ Y++ N+G+ P +P L DT S LVW C P
Sbjct: 47 FTSARSATRFHHRHRQISLPLSPGS-DYTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPF 105
Query: 128 -CIRCFDQ----TTPIFDPRASTTYSEIPCDDPLC----------------RSPFK---- 162
CI C + T P T+ + + C P C R P +
Sbjct: 106 ECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIET 165
Query: 163 --CQNGKCV-YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGK 219
C + C + Y G + L R++ + P + + FGC++ A G
Sbjct: 166 SDCSSFSCPPFYYAYGDGSLVARLY-RDSLSMPASSPLV-LHNFTFGCAHT----ALGEP 219
Query: 220 ISGILGFNASPLSLSSQLRN---RIQGLFSYCLVRE------MEATSVIKFGR---DADV 267
+ G+ GF LSL +QL + + FSYCLV + S + GR D +
Sbjct: 220 V-GVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEK 278
Query: 268 RRR------DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIID 320
++R + T +L + P+FY LE I++G + P + R G GG ++D
Sbjct: 279 KKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVD 338
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
+GT T + G Y++L+ ++ + + ++ CY D S P++ H
Sbjct: 339 SGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLGPCYYSDDSAAKVPAVALHF 398
Query: 381 QEADYIVQPENMYFIEPDRGR---------FCVAIQDDPK-------YSILGAWQQQNML 424
++ P N Y+ E GR C+ + + + LG +QQQ
Sbjct: 399 VGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFE 458
Query: 425 IIYDLNVPALRFGSENCA 442
++YDL + F CA
Sbjct: 459 VVYDLEKHRVGFARRKCA 476
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 114/428 (26%), Positives = 168/428 (39%), Gaps = 82/428 (19%)
Query: 86 IHLPMA-KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQP--CIRCFDQTTPIFDPR 142
+ LP++ D S+ V + P L DT S LVW C P C+ C + TP
Sbjct: 80 LSLPLSPGSDYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRLGP 139
Query: 143 ASTTYSE--IPCDDPLC----------------RSPFK-CQNGKCVYTRR-----YHVGD 178
IPC PLC R P + + G C + Y GD
Sbjct: 140 LPPPPDSRRIPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGD 199
Query: 179 -----------VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
V G +R + A V N FTF C++ A G + G+ GF
Sbjct: 200 GSLVAHLRRGRVALGAGARASVAVAVDN-FTFA------CAHT----ALGEPV-GVAGFG 247
Query: 228 ASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIK-----FGRDADVRRRDLET-----TP 276
PLSL QL ++ G FSYCLV A +I+ GR D ET TP
Sbjct: 248 RGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTP 307
Query: 277 ILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQT 335
+L + P+FY LE +S+G ++ P + R G GG ++D+GT T + N Y
Sbjct: 308 LLHNPKHPYFYSVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYAR 367
Query: 336 LMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYF 394
+ + + + + + G R Q CYRY +S + P + H + + P YF
Sbjct: 368 VAEAFARAMAAAGFARAERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPRRNYF 427
Query: 395 I---EPDRGR-------FCVAI---------QDDPKYSILGAWQQQNMLIIYDLNVPALR 435
+ D G C+ + + D LG +QQQ ++YD++ +
Sbjct: 428 MGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVG 487
Query: 436 FGSENCAN 443
F C +
Sbjct: 488 FARRRCTD 495
>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 155/368 (42%), Gaps = 29/368 (7%)
Query: 88 LPMAKQDLFYSVEVNIGTPM--KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRAST 145
L + D + V V+IGT + + L DTA+S W C+PC Q +F P S
Sbjct: 57 LALTPSDYVHGVFVSIGTGQGGRRKILALDTAASTSWVMCEPCRPPLHQLGRLFSPAESP 116
Query: 146 TYSEIPCDDPLCRSPF---KCQNGKCVYTRRYHVGDVTRGLASRETFAF--PVRNGFTFV 200
T+ + DDP+C P+ NG C + +G + +R+TF R+ +
Sbjct: 117 TFRGVRRDDPVCVPPYHRLHSTNG-CSFAFPSAIGYL-----ARDTFHLRHSERSVVKSI 170
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL---VREMEATS 257
+AFGC++ +GF + G+L + SPLS +Q +R G FSYCL +
Sbjct: 171 SGVAFGCAHTTTGFYNEDILGGVLSLSPSPLSFLTQFGSRAGGRFSYCLPDPTTSHNPSG 230
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD--GTG 315
I+FG + R TT + +S ++L L+ IS+G DI R +
Sbjct: 231 FIQFGIEVPSLPRHAHTTTLTVSA--SGYHLSLIGISLGNK-------RLDIDRHILTSH 281
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPS 375
G I+ +T I Y + + + LG +++ S + P+
Sbjct: 282 GCSINPAETITKIAEPAYIIVARELMAQMNELGSKQVKGPPSSPLVFNKISRRVRARLPN 341
Query: 376 MTFHLQE-ADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPAL 434
M FH + D ++ + RF V + +++GA QQ N I+++ L
Sbjct: 342 MVFHFADGGDMWFTAGKLFQVIGTTARFLVEGHGSHR-TVIGAAQQVNARFIFNVAAGRL 400
Query: 435 RFGSENCA 442
F E C+
Sbjct: 401 TFAEELCS 408
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 155/366 (42%), Gaps = 46/366 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ +G P K ++ DT S ++W C C +C ++ ++DP +S + + +
Sbjct: 26 LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRV 85
Query: 151 PCDDPLCRSPFK-----CQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
CDD C S + C+ C Y Y G T G + F G L
Sbjct: 86 SCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTG-----NLQ 140
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRD 264
G SN F G + SG LG + L + I G F++CL + + G
Sbjct: 141 TGLSNGTVTFGCGAQQSGGLGTSGEAL-------DGILGAFAHCL-DNVNGGGIFAIG-- 190
Query: 265 ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM-RDGTGGFIIDTGT 323
++ + TTP++ + + H+ +++ EI +G ++ P FD R GT IID+GT
Sbjct: 191 -ELVSPKVNTTPMVPN--QAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGT---IIDSGT 244
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQE 382
+ ++ Y ++M +RS + ++F C++Y + +P + FH ++
Sbjct: 245 TLAYLPEVVYDSMMNE----IRSQQPGLSLHTVEEQF-ICFKYSGNVDDGFPDIKFHFKD 299
Query: 383 ADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALR 435
+ + + Y + +C Q D ++LG N L++YD+ A+
Sbjct: 300 SLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIG 359
Query: 436 FGSENC 441
+ NC
Sbjct: 360 WTEYNC 365
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 156/393 (39%), Gaps = 62/393 (15%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD--------PRASTTYS 148
YSV + GTP + +FDT SSLVW C RC + P D P+ S++
Sbjct: 132 YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVK 191
Query: 149 EIPCDDPLCRSPF----------------KCQNGKCVYTRRYHVGDVTRGLASRETFAFP 192
+ C +P C F KC + Y +Y G T G+ ET
Sbjct: 192 VVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSG-ATAGILLSETLDLE 250
Query: 193 VRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE 252
+ VP GCS + + +GI GF P SL SQ+R + FS+CLV
Sbjct: 251 NKR----VPDFLVGCSVMSV-----HQPAGIAGFGRGPESLPSQMRLK---RFSHCLVSR 298
Query: 253 ------MEATSVIKFGRDADVRRRD-------LETTPILLSDLRPHFYLHLLEISIGRHI 299
+ + V+ G ++D + E + + R ++YL L I IG
Sbjct: 299 GFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKP 358
Query: 300 VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
V+FP G GG IID+G+ TF+ ++ + ++ L R + A
Sbjct: 359 VKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK-DVEAQSG 417
Query: 360 FDYCYRY--DSSFKAYPSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQDDPKYS--- 413
C+ + +P + + + + EN + D G C+ + D
Sbjct: 418 LRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGG 477
Query: 414 -----ILGAWQQQNMLIIYDLNVPALRFGSENC 441
ILGA+QQQN+L+ YDL + F + C
Sbjct: 478 GGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/429 (23%), Positives = 181/429 (42%), Gaps = 55/429 (12%)
Query: 58 HKMFEISKARANYMASMSKPNAFQE------LEDIHLPMA---KQDLF--YSVEVNIGTP 106
+ +F + A S+S A + L + LP+ + D+ Y ++ IGTP
Sbjct: 28 NGVFSVKYKYAGLQRSLSDLKAHDDQRQLRILAGVDLPLGGIGRPDILGLYYAKIGIGTP 87
Query: 107 MKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLC---- 157
K ++ DT S ++W C C C ++ +++ S T +PCD C
Sbjct: 88 TKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVPCDQEFCYEIN 147
Query: 158 --RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFVPRLAFGCSNDN 211
+ P N C Y Y G T G ++ + +G + FGC
Sbjct: 148 GGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGCGARQ 207
Query: 212 SGFAFGGK---ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDAD 266
SG + GILGF S S+ SQL +++ +F++CL + G
Sbjct: 208 SGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL-DGTNGGGIFVIGH--- 263
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
V + + TP++ + +PH+ +++ + +G + P F+ G IID+GT +
Sbjct: 264 VVQPKVNMTPLIPN--QPHYNVNMTAVQVGHEFLSLPTDVFE--AGDRKGAIIDSGTTLA 319
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-DSSFKAYPSMTFHLQEADY 385
++ Y+ L+ + I+ ++ + E+ C++Y DS +P++TFH + +
Sbjct: 320 YLPEMVYKPLVSK---IISQQPDLKV-HTVRDEYT-CFQYSDSLDDGFPNVTFHFENSVI 374
Query: 386 I-VQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
+ V P F P G +C+ Q D ++LG N L++YDL A+ +
Sbjct: 375 LKVYPHEYLF--PFEGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWT 432
Query: 438 SENCANGRQ 446
NC++ Q
Sbjct: 433 EYNCSSSIQ 441
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 166/379 (43%), Gaps = 44/379 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y +V +G P+K + DT S ++W C+PC C ++ ++DPR S+T S +
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 151 PCDDPLCR-----SPFKCQNG--KCVYTRRYHVGDVTRGLASRETFAFPV--RNGF-TFV 200
C DPLC + +C C Y Y G + G R+ + V NG
Sbjct: 61 SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTT 120
Query: 201 PRLAFGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEAT 256
++ FGCS +G + + GI+GF LS+ +QL + I +FS+CL E
Sbjct: 121 SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGG 180
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
++ G A+ + TP++ + H+ + L IS+ + R P A D G
Sbjct: 181 GILVIGGIAE---PGMTYTPLVPDSV--HYNVVLRGISVNSN--RLPIDAEDFSSTNDTG 233
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
I+D+GT + + +G Y +Q + S R+ +Q F R F P++
Sbjct: 234 VIMDSGTTLAYFPSGAYNVFVQAIREA-TSATPVRVQGMDTQCFLVSGRLSDLF---PNV 289
Query: 377 TFHLQEADYIVQPEN--MYFIEPDRGR---FCVAIQ---------DDPKYSILGAWQQQN 422
T + + +QP+N M+ G +C+ Q D + +ILG ++
Sbjct: 290 TLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKD 349
Query: 423 MLIIYDLNVPALRFGSENC 441
L++YDL+ + + S NC
Sbjct: 350 KLVVYDLDNSRIGWMSYNC 368
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 148/359 (41%), Gaps = 40/359 (11%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK 162
IGTP + L+ DT S++ + C C +C + P F P S TY + C +P C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKC-NPDCTC--D 58
Query: 163 CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP-RLAFGCSNDNSGFAFGGKIS 221
+N +C Y R+Y + G+ + +F N P R FGC N +G F
Sbjct: 59 TENDQCTYERQYAEMSSSSGILGEDLVSF--GNMSELKPQRAVFGCENAETGDLFSQHAD 116
Query: 222 GILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILL 279
GI+G LS+ QL + I FS C ++ G A V + + ++
Sbjct: 117 GIMGLGRGDLSIVDQLVEKGVINDSFSLCY-------GGMEVGGGAMVLGQISPPSDMVF 169
Query: 280 S----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQT 335
S D P++ + L + + + P F DG G I+D+GT ++ +
Sbjct: 170 SHSDPDRSPYYNIELRGLHVAGKKLDINPQVF----DGKHGTILDSGTTYAYLPEAAFLP 225
Query: 336 LMQRYDQILRSLGRQRIP---YNASQEFDYCYRYDSS-----FKAYPSMTFHLQEAD-YI 386
+Q L L + R P YN D C+ S +K +PS+ + Y
Sbjct: 226 FIQAITSELHGLKQIRGPDPNYN-----DVCFSGAGSEIPELYKTFPSVDMVFDNGEKYS 280
Query: 387 VQPENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ PEN F G +C+ + + K ++LG +N L+ YD + F NC+
Sbjct: 281 LSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 161/377 (42%), Gaps = 55/377 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P +P +L DT S L W QC PC+ C + P++ P IPC+D
Sbjct: 56 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHCLEAPHPLYQPSNDL----IPCND 111
Query: 155 PLCRS-----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC++ +C+ +C Y Y G + G+ R+ F+ G PRLA GC
Sbjct: 112 PLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSLNYTKGLRLTPRLALGCG 171
Query: 209 NDNSGFAFGGK-ISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDA 265
D A G + G+LG +S+ SQL ++ ++ + +CL ++ FG D
Sbjct: 172 YDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCL--SSLGGGILFFGNDL 229
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
R + TP+ + + + E+ G G +++ + D+G+
Sbjct: 230 YDSSR-VSWTPMARENSKHYSPAMGGELLFGGRTT----GLKNLLT------VFDSGSSY 278
Query: 326 TFIRNGPYQTLM--------------QRYDQILRSLGRQRIPYNASQEFDYCYR-YDSSF 370
T+ + YQ + R D L + R P+ + +E ++ SF
Sbjct: 279 TYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSF 338
Query: 371 K-AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNML 424
K + S T + + PE Y I +G C+ I + + +++G Q+ +
Sbjct: 339 KTGWRSKTL------FEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQM 391
Query: 425 IIYDLNVPALRFGSENC 441
IIYD ++ + +C
Sbjct: 392 IIYDNEKQSIGWIPADC 408
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/144 (38%), Positives = 75/144 (52%), Gaps = 9/144 (6%)
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
+A+ Y + +GTP K +++ DT S +VW QC PC +C+ QT P+FDP+ S ++S
Sbjct: 167 LAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSS 226
Query: 150 IPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
I C PLC SP C+Y Y G T G S ET F T VP++A G
Sbjct: 227 ISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRG----TRVPKVALG 282
Query: 207 CSNDNSGFAFGGKISGILGFNASP 230
C +DN G G +G+LG P
Sbjct: 283 CGHDNEGLFVG--AAGLLGLGRQP 304
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 155/376 (41%), Gaps = 45/376 (11%)
Query: 97 YSVEVNIGTPMKPQHLLF--DTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD- 153
Y V ++GTP PQ LL DT++ W C C C T P F+P +S T+ +PC
Sbjct: 94 YLVRASLGTP--PQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPCGA 150
Query: 154 -------DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
+P C S K +N C ++ Y + L S++ A G + FG
Sbjct: 151 PPCSQAPNPSCTSLAKSKN-SCGFSLSYGDSSLDATL-SQDNLAVTANGGV--IKGYTFG 206
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS----VIKFG 262
C ++G A + LG +Q + +G FSYCL + + + G
Sbjct: 207 CLTKSNGSAAPAQGLLGLGRGPL--GFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLG 264
Query: 263 RDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
R ++TTP+L S RP +Y+ + + IG+ V PP A G ++D+
Sbjct: 265 RKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDS 324
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLG---------RQRIPYNASQEFDYCYRYDSSFKA 372
GT + Y + D++ R + + ++ FD C Y+ S A
Sbjct: 325 GTMFARLAQPAYAAVR---DEVRRRVAGSLRRRGGGGASVSVSSLGGFDTC--YNVSTVA 379
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRF-CVAIQDDP------KYSILGAWQQQNMLI 425
+P++T + PE I G C+A+ P +++G+ QQQN +
Sbjct: 380 WPAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRV 439
Query: 426 IYDLNVPALRFGSENC 441
++D+ + F E C
Sbjct: 440 LFDVPNARVGFARERC 455
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 166/377 (44%), Gaps = 44/377 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD-----PRASTTYSEI 150
Y ++ +GTP++ ++ DT S ++W C C C ++ + P +S+T + +
Sbjct: 73 LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRV 132
Query: 151 PCDDPLCRSPFK-----CQ-NGKCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP-- 201
C+ C S + C C Y Y G T G R+ V F
Sbjct: 133 TCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTN 192
Query: 202 -RLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEAT 256
+ FGC SG A + GILGF + S+ SQL +++ +F++CL +
Sbjct: 193 GSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCL-DNINGG 251
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI-MRDGTG 315
+ G +V + + TTP++ + H+ + + I + ++ P FD +R GT
Sbjct: 252 GIFAIG---EVVQPKVRTTPLVPQ--QAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGT- 305
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR-IPYNASQEFDYCYRYDSSF-KAY 373
IID+GT + + + Y+ L+ + RQ + + +E C+ YD + +
Sbjct: 306 --IIDSGTTLAYFPDVIYEPLISKI------FARQSTLKLHTVEEQFTCFEYDGNVDDGF 357
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS-------ILGAWQQQNMLII 426
P++TFH +++ + + Y + D ++CV Q+ S +LG QN L++
Sbjct: 358 PTVTFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVM 417
Query: 427 YDLNVPALRFGSENCAN 443
YDL + + NC++
Sbjct: 418 YDLENQTIGWTEYNCSS 434
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 155/376 (41%), Gaps = 40/376 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G P K + DT S ++W C PC C + F+P +S+T S I
Sbjct: 90 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 149
Query: 151 PCDDPLCRSPFK-----CQNGK-----CVYTRRYHVGDVTRGLASRETFAFPVRNG---- 196
C D C + F+ CQ C YT Y G T G +T F G
Sbjct: 150 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 209
Query: 197 FTFVPRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQG--LFSYCLVRE 252
+ FGCSN SG + GI GF LS+ SQL + +FS+CL
Sbjct: 210 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGS 269
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ L TP++ S +PH+ L+L I++ + P +
Sbjct: 270 DNGGGILVLG---EIVEPGLVYTPLVPS--QPHYNLNLESIAVNGQ--KLPIDSSLFTTS 322
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
T G I+D+GT + ++ +G Y + + R + SQ F DSSF
Sbjct: 323 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVS-KGSQCFITSSSVDSSF-- 379
Query: 373 YPSMTFHLQEADYI-VQPENMYFIEPDRGR---FCVAIQDD--PKYSILGAWQQQNMLII 426
P++T + + V+PEN + +C+ Q + + +ILG ++ + +
Sbjct: 380 -PTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 438
Query: 427 YDLNVPALRFGSENCA 442
YDL + + +C+
Sbjct: 439 YDLANMRMGWADYDCS 454
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 155/376 (41%), Gaps = 40/376 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G P K + DT S ++W C PC C + F+P +S+T S I
Sbjct: 88 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 147
Query: 151 PCDDPLCRSPFK-----CQNGK-----CVYTRRYHVGDVTRGLASRETFAFPVRNG---- 196
C D C + F+ CQ C YT Y G T G +T F G
Sbjct: 148 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 207
Query: 197 FTFVPRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQG--LFSYCLVRE 252
+ FGCSN SG + GI GF LS+ SQL + +FS+CL
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGS 267
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ L TP++ S +PH+ L+L I++ + P +
Sbjct: 268 DNGGGILVLG---EIVEPGLVYTPLVPS--QPHYNLNLESIAVNGQ--KLPIDSSLFTTS 320
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
T G I+D+GT + ++ +G Y + + R + SQ F DSSF
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVS-KGSQCFITSSSVDSSF-- 377
Query: 373 YPSMTFHLQEADYI-VQPENMYFIEPDRGR---FCVAIQDD--PKYSILGAWQQQNMLII 426
P++T + + V+PEN + +C+ Q + + +ILG ++ + +
Sbjct: 378 -PTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 436
Query: 427 YDLNVPALRFGSENCA 442
YDL + + +C+
Sbjct: 437 YDLANMRMGWADYDCS 452
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 163/383 (42%), Gaps = 67/383 (17%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P +P +L DT S L W QC PC+RC + P++ P + IPC+D
Sbjct: 59 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDL----IPCND 114
Query: 155 PLCR-----SPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC+ S +C+ +C Y Y G + G+ R+ F+ G PRLA GC
Sbjct: 115 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTKGLRLTPRLALGCG 174
Query: 209 NDN-SGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDA 265
D G + + G+LG +S+ SQL ++ ++ + +CL ++ FG D
Sbjct: 175 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL--SSLGGGILFFGDDL 232
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGF-----II 319
R + TP + + H+ P +++ G T G +
Sbjct: 233 YDSSR-VSWTP-MSREYSKHYS---------------PAMGGELLFGGRTTGLKNLLTVF 275
Query: 320 DTGTPVTFIRNGPYQTLM--------------QRYDQILRSLGRQRIPYNASQEFDYCYR 365
D+G+ T+ + YQ + R D L + R P+ + +E ++
Sbjct: 276 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 335
Query: 366 -YDSSFK-AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAW 418
SFK + S T + + PE Y I +G C+ I + + +++G
Sbjct: 336 PLALSFKTGWRSKTL------FEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 388
Query: 419 QQQNMLIIYDLNVPALRFGSENC 441
Q+ +IIYD ++ + +C
Sbjct: 389 SMQDQMIIYDNEKQSIGWMPADC 411
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 148/359 (41%), Gaps = 40/359 (11%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK 162
IGTP + L+ DT S++ + C C +C + P F P S TY + C +P C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKC-NPDCTC--D 58
Query: 163 CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP-RLAFGCSNDNSGFAFGGKIS 221
+N +C Y R+Y + G+ + +F N P R FGC N +G F
Sbjct: 59 TENDQCTYERQYAEMSSSSGILGEDLVSF--GNMSELKPQRAVFGCENAETGDLFSQHAD 116
Query: 222 GILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILL 279
GI+G LS+ QL + I FS C ++ G A V + + ++
Sbjct: 117 GIMGLGRGDLSIVDQLVEKGVINDSFSLCY-------GGMEVGGGAMVLGQISPPSDMVF 169
Query: 280 S----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQT 335
S D P++ + L + + + P F DG G I+D+GT ++ +
Sbjct: 170 SHSDPDRSPYYNIELRGLHVAGKKLDINPQVF----DGKHGTILDSGTTYAYLPEAAFLP 225
Query: 336 LMQRYDQILRSLGRQRIP---YNASQEFDYCYRYDSS-----FKAYPSMTFHLQEAD-YI 386
+Q L L + R P YN D C+ S +K +PS+ + Y
Sbjct: 226 FIQAITSELHGLKQIRGPDPNYN-----DVCFSGAGSEIPELYKTFPSVDMVFDNGEKYS 280
Query: 387 VQPENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ PEN F G +C+ + + K ++LG +N L+ YD + F NC+
Sbjct: 281 LSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 97/203 (47%), Gaps = 12/203 (5%)
Query: 50 NLSQSERIHKMFEISKARANYMASMSKPNAFQELEDI--HLPMAKQDLFYSVEVNIGTPM 107
NL++ E + + + S+ R + M++ A + + P+ Y V++ IGTP
Sbjct: 41 NLTEHELLRRAIQRSRYRLAGIG-MARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 108 KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC-- 163
DTAS L+WTQCQPC C+ Q P+F+PR S+TY+ +PC C +C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISG 222
+ C YT Y T G + + G +AFGCS ++G A + SG
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPPQASG 215
Query: 223 ILGFNASPLSLSSQLRNRIQGLF 245
++G PLSL SQL R G+
Sbjct: 216 VVGLGRGPLSLVSQLSVRRYGMI 238
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 136/305 (44%), Gaps = 30/305 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +GTP + ++ DT++ W C C C T F P ASTT + C +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSEAQ 101
Query: 157 CRS--PFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C F C + C++ + Y GD + A+ A + N +P FGC N
Sbjct: 102 CSQVRGFSCPATGSSACLFNQSYG-GDSSLA-ATLVQDAITLAN--DVIPGFTFGCINAV 157
Query: 212 SGFAFGGKI--SGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
S GG I G+LG P+SL SQ G+FSYCL + + +K G
Sbjct: 158 S----GGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVG-- 211
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ + + TTP+L + RP +Y++L +S+GR V P + G IID+GT +T
Sbjct: 212 QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 271
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
Y + + + + P ++ FD C+ + +A P++T H + + +
Sbjct: 272 RFVQPVYFAIRDEFRKQVNG------PISSLGAFDTCFAATNEAEA-PAVTLHFEGLNLV 324
Query: 387 VQPEN 391
+ EN
Sbjct: 325 LPMEN 329
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP- 155
Y + +G+P + L+ DT S L W QC PC C I+D S +Y + C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQ 159
Query: 156 LCRSPFK-----CQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGF--TFVPRLAFGC 207
LC + + C G +C + Y G + G S +T G V AFGC
Sbjct: 160 LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGC 219
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL---VREMEATSVIKFGRD 264
+ + G SGILG NA ++L QL R FS+C + +T V+ FG +
Sbjct: 220 AQGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFG-N 277
Query: 265 ADVRRRDLETTPILL--SDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
A++ ++ T + L S+L+ FY + L +SI H + F P ++ D F
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVVILDSGSSF---- 333
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-----DSSFKAYPSM 376
+F+R P+ + ++ R + + ++ + C++ D + PS+
Sbjct: 334 ---SSFVR--PFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSL 388
Query: 377 TFHLQEADYIVQPENMYFIEPDR----GRFCVAIQDDP--KYSILGAWQQQNMLIIYDLN 430
+ ++ I P + R + C A +D +++G +QQQN+ + YD+
Sbjct: 389 SLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQ 448
Query: 431 VPALRFGSENC 441
+ F +C
Sbjct: 449 RSRVGFARASC 459
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 156/385 (40%), Gaps = 43/385 (11%)
Query: 94 DLFYSVEVNIGTPMKPQ--HLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTY--S 148
D Y + +G P Q HL DT S L W QC PC C ++ PR S
Sbjct: 200 DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSS 259
Query: 149 EIPCDDPLCRSPF--KCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
E C + + R+ C+N +C Y Y + G+ +++ F + NG + F
Sbjct: 260 EAFCVE-VQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVF 318
Query: 206 GCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKF 261
GC D G K GILG + + +SL SQL +R I + +CL ++ I
Sbjct: 319 GCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFM 378
Query: 262 GRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
G D V + P+L + + + ++S G+ ++ + D G + DT
Sbjct: 379 GSDL-VPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGML-----SLDGENGRVGKVLFDT 432
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF---------KA 372
G+ T+ N Y L+ ++ G + ++ + C+R ++F K
Sbjct: 433 GSSYTYFPNQAYSQLVTSLQEV---SGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKF 489
Query: 373 YPSMTFHLQEA------DYIVQPENMYFIEPDRGRFCVAIQD-----DPKYSILGAWQQQ 421
+ +T + ++QPE+ Y I ++G C+ I D D ILG +
Sbjct: 490 FRPITLQIGSKWLIISRKLLIQPED-YLIISNKGNVCLGILDGSSVHDGSTIILGDISMR 548
Query: 422 NMLIIYDLNVPALRFGSENCANGRQ 446
LI+YD + + +C R+
Sbjct: 549 GHLIVYDNVKRRIGWMKSDCVRPRE 573
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 169/386 (43%), Gaps = 51/386 (13%)
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRAST 145
A+ L+Y+ + IG+P H+ DT S ++W C C C ++ +++P++S+
Sbjct: 68 AETGLYYA-RIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSS 126
Query: 146 TYSEIPCDDPLCRSPFK-----CQ-NGKCVYTRRYHVGDVTRGLASRETFAF--PVRNGF 197
T + I CD P C + + C+ + C Y Y G T G + V N
Sbjct: 127 TSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHK 186
Query: 198 TFVPR--LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVR 251
T + FGC SG + + GILGF + S+ SQL +++ +F++CL
Sbjct: 187 TSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL-D 245
Query: 252 EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMR 311
+ + G +V L TP++ + + H+ + L + +G + P G F+
Sbjct: 246 SISGGGIFAIG---EVVEPKLXNTPVVPN--QAHYNVVLNGVKVGDTALDLPLGLFETSY 300
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRY-----DQILRSLGRQRIPYNASQEFDYCYRY 366
G IID+GT + ++ Y LM++ D LR++ Q C+ +
Sbjct: 301 --KRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDDQFT----------CFVF 348
Query: 367 DSSF-KAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAW 418
D + +P++TF +E+ + + Y + +CV Q D + ++LG
Sbjct: 349 DKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDL 408
Query: 419 QQQNMLIIYDLNVPALRFGSENCANG 444
QN L+ Y+L + + NC++G
Sbjct: 409 VLQNKLVYYNLENQTIGWTEYNCSSG 434
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 111/442 (25%), Positives = 170/442 (38%), Gaps = 57/442 (12%)
Query: 27 SESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASM------SKPNAF 80
S S+G ++ L P SP+ P + ++ + RANY+ +
Sbjct: 53 SSSSGATVPLNHRHGPCSPV-PSGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGGL 111
Query: 81 QELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
Q+ E +P+A L Y + V+IG+P + DT S + W +C+ +
Sbjct: 112 QQSEAT-VPIALGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------S 161
Query: 137 PIFDPRASTTYSEIPCDDPLC----RSPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAF 191
++DP S+TY+ C P C R C +G CVY+ +Y G T G +T
Sbjct: 162 RLYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTL 221
Query: 192 -----PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
P+ +GF F GCS GF G++G S SQ FS
Sbjct: 222 AGTSEPLISGFQF------GCSAVEHGFEED-NTDGLMGLGGDAQSFVSQTAATYGSAFS 274
Query: 247 YCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPG 305
YCL ++ + G + TTP+L S FY LL IS+G + P
Sbjct: 275 YCLPPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSS 334
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCY 364
F + G I+D+GT +T + Y L + D + R Q P D C+
Sbjct: 335 VF------SAGSIVDSGTVITRLPPTAYGALSAAFRDGMAR---YQYQPAAPRGLLDTCF 385
Query: 365 RYDSSFKA----YPSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQ 419
+ + PS+ L + + P I D A DD + I+G Q
Sbjct: 386 DFTGHGEGNNFTVPSVALVLDGGAVVDLHPNG---IVQDGCLAFAATDDDGRTGIIGNVQ 442
Query: 420 QQNMLIIYDLNVPALRFGSENC 441
Q+ ++YD+ F C
Sbjct: 443 QRTFEVLYDVGQSVFGFRPGAC 464
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 90/179 (50%), Gaps = 16/179 (8%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P +P +L DT S L W QC PC+RC + P++ P + IPC+D
Sbjct: 37 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDL----IPCND 92
Query: 155 PLCR-----SPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC+ S +C+ +C Y Y G + G+ R+ F+ G PRLA GC
Sbjct: 93 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 152
Query: 209 NDN-SGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRD 264
D G + + G+LG +S+ SQL ++ ++ + +CL ++ FG D
Sbjct: 153 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL--SSLGGGILFFGDD 209
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 114/449 (25%), Positives = 181/449 (40%), Gaps = 61/449 (13%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GFS++LI S +SP + L++ +R S+ARA + + D+ +
Sbjct: 26 GFSVELIHRDSIKSPFHDPKLTRHDRFLAAARRSRARAAALLAS----------DVSSDL 75
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-------------------IRC 131
D Y VN+GTP + DT S LVW +C
Sbjct: 76 FYGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPP 135
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLC---RSPFKCQNGK--CVYTRRYHVGDVTRGLASR 186
+ F+P S++YS + CD P C + C C + Y G GL +
Sbjct: 136 PPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAA 195
Query: 187 ETFAFP--VRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
+TF F + N T + FGC+ +G F + G++G A PLSL+SQL +
Sbjct: 196 DTFTFGGNINNDTTSTASIDFGCATGTAGREF--QADGMVGLGAGPLSLASQLGRK---- 249
Query: 245 FSYCLVRE--MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRF 302
FS+CL +A+S++ FG A V TTP++ S Y + + I + +
Sbjct: 250 FSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAY-YAISIDSLKVAGQP 308
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFI-RNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD 361
PG + + I+DTGT +TF+ R L + +++ G R P + +
Sbjct: 309 VPGTTSVSK-----VIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAP-PPDETLE 362
Query: 362 YCY---RYDSSFKAYPSMTFHLQEADY--IVQPENMYFIEPDRGRFCVA-IQDDPK---Y 412
CY R P +T L + F+ G C+A + P+
Sbjct: 363 LCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTTSPELQPL 422
Query: 413 SILGAWQQQNMLIIYDLNVPALRFGSENC 441
S+LG Q++ + DL+ F + NC
Sbjct: 423 SVLGNVALQDLHVGIDLDARTATFATANC 451
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 165/377 (43%), Gaps = 43/377 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y ++ +G+P + ++ DT S ++W C C RC ++DP+ S T +
Sbjct: 69 LYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVV 128
Query: 151 PCDDPLCRSPFK-----CQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPR- 202
CD C + F C++ C Y+ Y G T G ++ + NG P+
Sbjct: 129 SCDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQN 188
Query: 203 --LAFGCSNDNSGFAFGGK----ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREME 254
+ FGC SG G + GI+GF + S+ SQL +++ +FS+CL +
Sbjct: 189 SSIIFGCGAVQSG-TLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-DNVR 246
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G +V + TTP++ H+ + L I + I++ P FD +
Sbjct: 247 GGGIFAIG---EVVEPKVSTTPLVPR--MAHYNVVLKSIEVDTDILQLPSDIFDSVNG-- 299
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAY 373
G +ID+GT + ++ + Y L+Q+ + R G + Y Q+F C+ Y + + +
Sbjct: 300 KGTVIDSGTTLAYLPDIVYDELIQKV--LARQPGLKL--YLVEQQF-RCFLYTGNVDRGF 354
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLII 426
P + H +++ + + Y + G +C+ Q + ++LG N L+I
Sbjct: 355 PVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVI 414
Query: 427 YDLNVPALRFGSENCAN 443
YDL + + NC++
Sbjct: 415 YDLENMVIGWTDYNCSS 431
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 136/305 (44%), Gaps = 30/305 (9%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V V +GTP + ++ DT++ W C C C T F P ASTT + C +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSEAQ 101
Query: 157 CRS--PFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C F C + C++ + Y GD + A+ A + N +P FGC N
Sbjct: 102 CSQVRGFSCPATGSSACLFNQSYG-GDSSLA-ATLVQDAITLAN--DVIPGFTFGCINAV 157
Query: 212 SGFAFGGKI--SGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
S GG I G+LG P+SL SQ G+FSYCL + + +K G
Sbjct: 158 S----GGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVG-- 211
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ + + TTP+L + RP +Y++L +S+GR V P + G IID+GT +T
Sbjct: 212 QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 271
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYI 386
Y + + + + P ++ FD C+ + +A P++T H + + +
Sbjct: 272 RFVQPVYFAIRDEFRKQVNG------PISSLGAFDTCFAETNEAEA-PAVTLHFEGLNLV 324
Query: 387 VQPEN 391
+ EN
Sbjct: 325 LPMEN 329
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 106/425 (24%), Positives = 172/425 (40%), Gaps = 46/425 (10%)
Query: 31 GFSLKLIPIFSPESPLYPGN-LSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLP 89
G +L++ +FSP SP P +S E + ++ +AR Y++++ + +
Sbjct: 41 GSTLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYLSNLVARRSIVPIASGR-- 98
Query: 90 MAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE 149
Q Y V GTP + L DT++ W C C+ C TTP F P STT+ +
Sbjct: 99 QITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGC-STTTP-FAPPKSTTFKK 156
Query: 150 IPCDDPLC---RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF----VPR 202
+ C C R+P C C + Y V L V++ T VP
Sbjct: 157 VGCGASQCKQVRNP-TCDGSACAFNFTYGTSSVAASL---------VQDTVTLATDPVPA 206
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
FGC +G + + LG SL +Q + Q FSYCL + + F
Sbjct: 207 YTFGCIQKATGSSLPPQGLLGLGRGPL--SLLAQTQKLYQSTFSYCL----PSFKTLNFS 260
Query: 263 RDADV----RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
D+ + RD + P + R +Y++L+ I +GR IV PP A G
Sbjct: 261 GHXDLXPVAQPRD-QVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGT 319
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT 377
+ D+GT T + Y + + + R +++ + FD CY P++T
Sbjct: 320 VFDSGTVFTRLVEPAYTAVRNEFRR--RVSVHKKLTVTSLGGFDTCYTVP---IVAPTIT 374
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVP 432
F + + P+N+ C+A+ P +++ QQQN +++D VP
Sbjct: 375 FMFSGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFD--VP 432
Query: 433 ALRFG 437
R G
Sbjct: 433 NSRLG 437
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 158/370 (42%), Gaps = 67/370 (18%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P +P +L DT S L W QC PC+RC + P++ P + IPC+D
Sbjct: 59 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDL----IPCND 114
Query: 155 PLCR-----SPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC+ S +C+ +C Y Y G + G+ R+ F+ G PRLA GC
Sbjct: 115 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 174
Query: 209 NDN-SGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDA 265
D G + + G+LG +S+ SQL ++ ++ + +CL ++ FG D
Sbjct: 175 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL--SSLGGGILFFGDDL 232
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGF-----II 319
R + TP + + H+ P +++ G T G +
Sbjct: 233 YDSSR-VSWTP-MSREYSKHYS---------------PAMGGELLFGGRTTGLKNLLTVF 275
Query: 320 DTGTPVTFIRNGPYQTLM--------------QRYDQILRSLGRQRIPYNASQEFDYCYR 365
D+G+ T+ + YQ + R D L + R P+ + +E ++
Sbjct: 276 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 335
Query: 366 -YDSSFK-AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAW 418
SFK + S T + + PE Y I +G C+ I + + +++G
Sbjct: 336 PLALSFKTGWRSKTL------FEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 388
Query: 419 QQQNMLIIYD 428
Q+ +IIYD
Sbjct: 389 SMQDQMIIYD 398
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 162/377 (42%), Gaps = 55/377 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P +P +L DT S L W QC PC+RC + P++ P + IPC+D
Sbjct: 47 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDL----IPCND 102
Query: 155 PLCR-----SPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC+ S +C+ +C Y Y G + G+ R+ F+ G PRLA GC
Sbjct: 103 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 162
Query: 209 NDN-SGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDA 265
D G + + G+LG +S+ SQL ++ ++ + +CL ++ FG D
Sbjct: 163 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL--SSLGGGILFFGDDL 220
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPV 325
R + TP+ + + E+ G G +++ + D+G+
Sbjct: 221 YDSSR-VSWTPMSREYSKHYSPAMGGELLFGGRTT----GLKNLLT------VFDSGSSY 269
Query: 326 TFIRNGPYQTLM--------------QRYDQILRSLGRQRIPYNASQEFDYCYR-YDSSF 370
T+ + YQ + R D L + R P+ + +E ++ SF
Sbjct: 270 TYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSF 329
Query: 371 K-AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNML 424
K + S T + + PE Y I +G C+ I + + +++G Q+ +
Sbjct: 330 KTGWRSKTL------FEIPPE-AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQM 382
Query: 425 IIYDLNVPALRFGSENC 441
IIYD ++ + +C
Sbjct: 383 IIYDNEKQSIGWMPVDC 399
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 166/379 (43%), Gaps = 44/379 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y +V +G P+K + DT S ++W C+PC C ++ ++DPR S+T S +
Sbjct: 28 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 87
Query: 151 PCDDPLCR-----SPFKCQ--NGKCVYTRRYHVGDVTRGLASRETFAFPV--RNGF-TFV 200
C DPLC + +C C Y Y G + G R+ + V NG
Sbjct: 88 SCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTT 147
Query: 201 PRLAFGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEAT 256
++ FGCS +G + + GI+GF LS+ +QL + I +FS+CL E
Sbjct: 148 SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGG 207
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
++ G A+ + TP++ + H+ + L IS+ + R P A D G
Sbjct: 208 GILVIGGIAE---PGMTYTPLVPDSV--HYNVVLRGISVNSN--RLPIDAEDFSSTNDTG 260
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSM 376
I+D+GT + + +G Y +Q + S R+ +Q F R F P++
Sbjct: 261 VIMDSGTTLAYFPSGAYNVFVQAIREA-TSATPVRVQGMDTQCFLVSGRLSDLF---PNV 316
Query: 377 TFHLQEADYIVQPEN--MYFIEPDRGR---FCVAIQ---------DDPKYSILGAWQQQN 422
T + + +QP+N M+ G +C+ Q D + +ILG ++
Sbjct: 317 TLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKD 376
Query: 423 MLIIYDLNVPALRFGSENC 441
L++YDL+ + + S NC
Sbjct: 377 KLVVYDLDNSRIGWMSYNC 395
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 105/418 (25%), Positives = 171/418 (40%), Gaps = 46/418 (11%)
Query: 34 LKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQ 93
+ +IPI+ SP + S I M R Y++S+ + + P+A
Sbjct: 43 ITMIPIYGNCSPFKNYSTSWENIIIDMASKDPERVVYLSSLDA--SLRRKPISAAPIASG 100
Query: 94 DLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY-S 148
F Y V V +G+P + ++ DT++ W C C C +T + P+ASTTY
Sbjct: 101 QAFGIGSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTGCTGCSSSST-YYSPQASTTYGG 159
Query: 149 EIPCDDPLC---RSPFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
+ C P C R C + C + + Y + L +R G +P
Sbjct: 160 AVACYAPRCAQARGALPCPYTGSKACTFNQSYAGSTFSATLVQDS-----LRLGIDTLPS 214
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEA--TSVIK 260
AFGC N SG+ + L PLSL SQ G+FSYCL + + +K
Sbjct: 215 YAFGCVNSASGWTLPAQGLLGL--GRGPLSLPSQSSKLYSGIFSYCLPSFQSSYFSGSLK 272
Query: 261 FGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
G RR + TTP+L + RP +Y++L +++GR V P + G I+
Sbjct: 273 LGPTGQPRR--IRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLAFDPNKGSGTIL 330
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT-- 377
D+GT +T Y + + ++ P+ + FD C+ K Y ++T
Sbjct: 331 DSGTVITRFVGPVYSAIRDEFRNQVKG------PFFSRGGFDTCF-----VKTYENLTPL 379
Query: 378 --FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYD 428
D + EN G C+A+ P +++ +QQQN+ +++D
Sbjct: 380 IKLRFTGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRVLFD 437
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 164/379 (43%), Gaps = 48/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V IGTP K ++ DT S ++W C C C +++ + S + +
Sbjct: 85 LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144
Query: 151 PCDDPLC----RSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFV 200
PCD+ C P N C Y Y G T G ++ + +G +
Sbjct: 145 PCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSN 204
Query: 201 PRLAFGCSNDNSGFAFG----GKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREME 254
+ FGC SG G + GILGF S S+ SQL +++ +F++CL +
Sbjct: 205 GSVIFGCGARQSG-DLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL-DGIN 262
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G V + + TP++ + +PH+ +++ + +G + P F+
Sbjct: 263 GGGIFAIGH---VVQPKVNMTPLIPN--QPHYNVNMTAVQVGEDFLHLPTEEFE--AGDR 315
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPYNASQEFDYCYRYDSSF-KA 372
G IID+GT + ++ Y+ L+ + + +Q + + ++ C++Y S
Sbjct: 316 KGAIIDSGTTLAYLPEIVYEPLVSKI------ISQQPDLKVHIVRDEYTCFQYSGSVDDG 369
Query: 373 YPSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNML 424
+P++TFH + + ++ V P F P G +C+ Q D ++LG N L
Sbjct: 370 FPNVTFHFENSVFLKVHPHEYLF--PFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKL 427
Query: 425 IIYDLNVPALRFGSENCAN 443
++YDL A+ + NC++
Sbjct: 428 VLYDLENQAIGWTEYNCSS 446
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 162/388 (41%), Gaps = 50/388 (12%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
++ +V + +G+P + ++ DT S L W C+ +FDP S++YS IPC
Sbjct: 52 HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPC 107
Query: 153 DDPLCRS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
P CR+ P C K C Y G + +TF G + +P
Sbjct: 108 TSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI----GNSAIPATI 163
Query: 205 FGCSNDNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVREMEATSV 258
FGC +SGF+ K +G++G N LS +Q+ GL FSYC+ + +++ +
Sbjct: 164 FGCM--DSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM-----GLQKFSYCISGQ-DSSGI 215
Query: 259 IKFGRDADVRRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRD 312
+ FG + + L+ TP++ +S P+F + L I + +++ P +
Sbjct: 216 LLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHT 275
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQ-EFDYCYRY---D 367
G G ++D+GT TF+ Y L + Q SL P Q D CYR
Sbjct: 276 GAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTR 335
Query: 368 SSFKAYPSMTFHLQEADYIVQPENMYFIEPD--RGR---FCVAIQDDPKYS----ILGAW 418
+ P++T + A+ V E + + P RG +C + I+G
Sbjct: 336 RTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHH 395
Query: 419 QQQNMLIIYDLNVPALRFGSENCANGRQ 446
QQN+ + +DL + F C Q
Sbjct: 396 HQQNVWMEFDLAKSRVGFAEVRCXLAGQ 423
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 163/373 (43%), Gaps = 41/373 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP- 155
Y + +G+P + L+ DT S L W +C PC C I+D S +Y + C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQ 159
Query: 156 LCRSPFK-----CQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGF--TFVPRLAFGC 207
LC + + C G +C + Y G + G S +T G V AFGC
Sbjct: 160 LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGC 219
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL---VREMEATSVIKFGRD 264
+ + G SGILG NA ++L QL R FS+C + +T V+ FG +
Sbjct: 220 AQGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFG-N 277
Query: 265 ADVRRRDLETTPILL--SDLRPHFY-LHLLEISIGRH-IVRFPPGAFDIMRDGTGGFIID 320
A++ ++ T + L S+L+ FY + L +SI H +V P G+ I+D
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSV---------VILD 328
Query: 321 TGTPV-TFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY-----DSSFKAYP 374
+G+ +F+R P+ + ++ R + + ++ + C++ D + P
Sbjct: 329 SGSSFSSFVR--PFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLP 386
Query: 375 SMTFHLQEADYIVQPENMYFIEPDR----GRFCVAIQDDP--KYSILGAWQQQNMLIIYD 428
S++ ++ I P + R + C A +D +++G +QQQN+ + YD
Sbjct: 387 SLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYD 446
Query: 429 LNVPALRFGSENC 441
+ + F +C
Sbjct: 447 IQRSRVGFARASC 459
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 108/429 (25%), Positives = 182/429 (42%), Gaps = 51/429 (11%)
Query: 34 LKLIPIFSPESPLYPGNLSQS--ERIHKMFEISKARANYMASM----SKPNAFQELEDIH 87
L +IPI + SP ++S S + + M R Y++S+ SKP +
Sbjct: 40 LSIIPINAKCSPFAHTHVSASVIDTVLHMASSDSHRFTYLSSLVAGKSKPTSVPVASGNQ 99
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTY 147
L + Y V +GTP + ++ DT++ VW C C C + +T F+ +S+TY
Sbjct: 100 LHIGN----YVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNAST-SFNTNSSSTY 154
Query: 148 SEIPCDDPLCRSP--FKC-----QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF- 199
S + C C C Q C + + Y GD + A V++ T
Sbjct: 155 STVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYG-GDSSFS-------ANLVQDTLTLS 206
Query: 200 ---VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREME 254
+P +FGC N SG + + G++G P+SL SQ + G+FSYCL R
Sbjct: 207 PDVIPNFSFGCINSASGNSLPPQ--GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY 264
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ +K G + + + TP+L + RP +Y++L +S+G V P +
Sbjct: 265 FSGSLKLGLLG--QPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNS 322
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
G IID+GT +T Y+ + + + + ++ FD C+ D+
Sbjct: 323 GAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNG------SFSTLGAFDTCFSADNE-NVT 375
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAI-----QDDPKYSILGAWQQQNMLIIYD 428
P +T H+ D + EN C+++ + +++ QQQN+ I++D
Sbjct: 376 PKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFD 435
Query: 429 LNVPALRFG 437
VP R G
Sbjct: 436 --VPNSRIG 442
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 156/384 (40%), Gaps = 43/384 (11%)
Query: 95 LFYSVEVNIGTPMKPQ--HLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTY--SE 149
+ Y + +G P Q HL DT S L W QC PC C ++ PR SE
Sbjct: 28 MLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSE 87
Query: 150 IPCDDPLCRSPF--KCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C + + R+ C+N +C Y Y + G+ +++ F + NG + FG
Sbjct: 88 AFCVE-VQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFG 146
Query: 207 CSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFG 262
C D G K GILG + + +SL SQL +R I + +CL ++ I G
Sbjct: 147 CGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMG 206
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
D V + P+L + + + ++S G+ ++ + D G + DTG
Sbjct: 207 SDL-VPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGML-----SLDGENGRVGKVLFDTG 260
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF---------KAY 373
+ T+ N Y L+ ++ G + ++ + C+R ++F K +
Sbjct: 261 SSYTYFPNQAYSQLVTSLQEV---SGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFF 317
Query: 374 PSMTFHLQEA------DYIVQPENMYFIEPDRGRFCVAIQD-----DPKYSILGAWQQQN 422
+T + ++QPE+ Y I ++G C+ I D D ILG +
Sbjct: 318 RPITLQIGSKWLIISRKLLIQPED-YLIISNKGNVCLGILDGSSVHDGSTIILGDISMRG 376
Query: 423 MLIIYDLNVPALRFGSENCANGRQ 446
LI+YD + + +C R+
Sbjct: 377 HLIVYDNVKRRIGWMKSDCVRPRE 400
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 161/383 (42%), Gaps = 50/383 (13%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
++ +V + +G+P + ++ DT S L W C+ +FDP S++YS IPC
Sbjct: 59 HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPC 114
Query: 153 DDPLCRS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
P CR+ P C K C Y G + +TF G + +P
Sbjct: 115 TSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI----GNSAIPATI 170
Query: 205 FGCSNDNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGL--FSYCLVREMEATSV 258
FGC +SGF+ K +G++G N LS +Q+ GL FSYC+ + +++ +
Sbjct: 171 FGCM--DSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM-----GLQKFSYCISGQ-DSSGI 222
Query: 259 IKFGRDADVRRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRD 312
+ FG + + L+ TP++ +S P+F + L I + +++ P +
Sbjct: 223 LLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHT 282
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQ-EFDYCYRY---D 367
G G ++D+GT TF+ Y L + Q SL P Q D CYR
Sbjct: 283 GAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTR 342
Query: 368 SSFKAYPSMTFHLQEADYIVQPENMYFIEPD--RGR---FCVAIQDDPKYS----ILGAW 418
+ P++T + A+ V E + + P RG +C + I+G
Sbjct: 343 RTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHH 402
Query: 419 QQQNMLIIYDLNVPALRFGSENC 441
QQN+ + +DL + F C
Sbjct: 403 HQQNVWMEFDLAKSRVGFAEVRC 425
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 159/374 (42%), Gaps = 45/374 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI-----FDPRASTTYSEI 150
Y ++ IGTP ++ DT S W C +C ++ + +DPR+S + E+
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 151 PCDDPLCRSPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAF----------PVRNGFTF 199
CDD +C S C +C Y Y G +T G+ + + P TF
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTF 201
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
L S +NS A I GI+GF S + SQL + + +FS+CL
Sbjct: 202 GCGLQQSGSLNNSAVA----IDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-DSTNGGG 256
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ G +V ++TTPI+ ++ H ++L I++ ++ P F + T G
Sbjct: 257 IFAIG---EVVEPKVKTTPIVKNNEVYHL-VNLKSINVAGTTLQLPANIFGTTK--TKGT 310
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPYNASQEFDYCYRYDSSF-KAYPS 375
ID+G+ + ++ Y L IL + I A F C+ + S +P
Sbjct: 311 FIDSGSTLVYLPEIIYSEL------ILAVFAKHPDITMGAMYNFQ-CFHFLGSVDDKFPK 363
Query: 376 MTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYS-----ILGAWQQQNMLIIYDL 429
+TFH + + V P + Y +E + ++C QD + ILG N +++YD+
Sbjct: 364 ITFHFENDLTLDVYPYD-YLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDM 422
Query: 430 NVPALRFGSENCAN 443
A+ + NC++
Sbjct: 423 EKQAIGWTEHNCSS 436
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/387 (25%), Positives = 161/387 (41%), Gaps = 40/387 (10%)
Query: 73 SMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCF 132
S PNA L D L + +Y+ + IGTP + L+ DT S++ + C C +C
Sbjct: 64 SKRHPNARMRLHDDLL----LNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCG 119
Query: 133 DQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF 191
P F P +S+TY + C D C S +CVY R+Y + G+ + +F
Sbjct: 120 RHQDPKFQPESSSTYQPVKCTIDCNCDS----DRMQCVYERQYAEMSTSSGVLGEDLISF 175
Query: 192 PVRNGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYC 248
N P R FGC N +G + GI+G LS+ QL +N I FS C
Sbjct: 176 --GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC 233
Query: 249 L-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
++ +++ G + P+ P++ + L EI + + F
Sbjct: 234 YGGMDVGGGAMVLGGISPPSDMAFAYSDPV----RSPYYNIDLKEIHVAGKRLPLNANVF 289
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNASQEFDYCY 364
DG G ++D+GT ++ + + L+SL + P YN D C+
Sbjct: 290 ----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYN-----DICF 340
Query: 365 R---YDSS--FKAYPSMTFHLQEAD-YIVQPENMYFIEPD-RGRFCVAI--QDDPKYSIL 415
D S K++P + + Y + PEN F RG +C+ + + + ++L
Sbjct: 341 SGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLL 400
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENCA 442
G +N L++YD + F NCA
Sbjct: 401 GGIIVRNTLVVYDREQTKIGFWKTNCA 427
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 147/378 (38%), Gaps = 44/378 (11%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT--QCQPCIRCFDQTTPIFDPRASTTYSE 149
++L+ IGTP +P D LVWT CF+Q P FDP S+TY
Sbjct: 19 SRELYNVASFTIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRP 78
Query: 150 IPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
PC LC S C C Y + + T G + A G +AF
Sbjct: 79 EPCGTALCEFFPASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAI----GTATAASVAF 134
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS------VI 259
GC + G SG +G +PLSL +Q+ FS+CL +
Sbjct: 135 GCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTA---FSHCLAPHDGGGGKNSRLFLG 191
Query: 260 KFGRDADVRRRDLETTPILLS---DLRPHFYLHLLE-ISIG-RHIVRFPPGAFDIMRDGT 314
+ A + TTP + S D++ +YL LE I G I+ P ++
Sbjct: 192 AAAKLAGGGKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAIITVPQSGRTVL---- 247
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSSFKAY 373
+ T +PV+F+ +G YQ L + + G P Q FD C++
Sbjct: 248 ----LQTFSPVSFLVDGVYQDLKKAVTAAVG--GPTATPPEQFQSIFDLCFKR-GGVSGA 300
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY--------SILGAWQQQNMLI 425
P + Q A + P Y ++ CVAI + SILG QQQN+
Sbjct: 301 PDVVLTFQGAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHF 360
Query: 426 IYDLNVPALRFGSENCAN 443
+YDL L F + +C++
Sbjct: 361 LYDLEKETLSFEAADCSS 378
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 145/359 (40%), Gaps = 45/359 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
Y V +NIG P KP L DT S L W QC PC C P++ P T +PC D
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRP---TKNKLVPCVD 121
Query: 155 PLCRS-------PFKCQN--GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
LC S KC + +C Y +Y + G+ ++FA + NG P LAF
Sbjct: 122 QLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLANGSVVRPSLAF 181
Query: 206 GCSNDNSGFAFGGKIS---GILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIK 260
GC D G++S G+LG +SL SQ + + + +CL + +
Sbjct: 182 GCGYDQQ--VSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCL--SLRGGGFLF 237
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
FG D V + + TP++ S LR ++ + G +R + D
Sbjct: 238 FGDDL-VPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVK----------LTEVVFD 286
Query: 321 TGTPVTFIRNGPYQTLMQR----YDQILRSLGRQRIP--YNASQEFDYCYRYDSSFKAYP 374
+G+ T+ PYQ L+ + L+ + +P + + F FK+
Sbjct: 287 SGSSFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPLCWKGKKPFKSVLDVKKEFKSL- 345
Query: 375 SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQNMLIIYD 428
+ F ++ P Y I G C+ I + + SILG Q+ ++IYD
Sbjct: 346 VLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDITMQDQMVIYD 404
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 36/286 (12%)
Query: 84 EDIHLPMAKQD------LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT- 136
E + P++ D L+Y+ + +GTP + ++ DT S + W C PC C +
Sbjct: 30 EVVAFPISGDDDTFTTGLYYT-RIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNV 88
Query: 137 ----PIFDPRASTTYSEIPCDDPLC--RSPFKC--QNGKCVYTRRYHVGDVTRGLASRET 188
IFDP ST+ + I C D C S KC + C Y+ Y G T G +
Sbjct: 89 ALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDV 148
Query: 189 FAF---PVRN--GFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRI 241
+F P N + RL FGC ++ +G G++GF + +SL SQL +N
Sbjct: 149 LSFNQVPSGNSTATSGTARLTFGCGSNQTGTWL---TDGLVGFGQAEVSLPSQLSKQNVS 205
Query: 242 QGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI-GRHIV 300
+F++CL + + + + G +R L TPI+ + H+ + LL I + G ++
Sbjct: 206 VNIFAHCLQGDNKGSGTLVIGH---IREPGLVYTPIVPK--QSHYNVELLNIGVSGTNVT 260
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS 346
P AFD+ +GG I+D+GT +T++ Y + +RS
Sbjct: 261 T--PTAFDL--SNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCMRS 302
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 105/431 (24%), Positives = 164/431 (38%), Gaps = 45/431 (10%)
Query: 25 TSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMA-SMSKPNAFQEL 83
+SS+ T S+ L + P SP P + + ++ + RA+Y+ S N
Sbjct: 27 SSSDGTS-SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAG 85
Query: 84 ED-----IHLPM----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR---C 131
ED + +P + L Y + V +G+P Q ++ DT S + W QC+PC C
Sbjct: 86 EDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPC 145
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCRSPFKC--QNG-----KCVYTRRYHVGDVTRGLA 184
+FDP AS+TY+ C C NG +C Y +Y G T G
Sbjct: 146 HAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTY 205
Query: 185 SRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
S + +G V FGCS+ G K G++G S SQ R
Sbjct: 206 SSDVLTL---SGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKS 262
Query: 245 FSYCLVREMEATSVIKFGRDADVRRRD---LETTPILLSDLRPHFYLHLLE-ISIGRHIV 300
F YCL ++ + G A TTP+L S P +Y LE I++G +
Sbjct: 263 FFYCLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL 322
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF 360
P F G ++D+GT +T + Y L + + R P
Sbjct: 323 GLSPSVF------AAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAE-PLGI---L 372
Query: 361 DYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGA 417
D C+ + K + +V + + C+A +DD + +G
Sbjct: 373 DTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGN 428
Query: 418 WQQQNMLIIYD 428
QQ+ ++YD
Sbjct: 429 VQQRTFEVLYD 439
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 159/386 (41%), Gaps = 57/386 (14%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP---IFDPRASTTYSEIPCD 153
Y + + +GTP + DT S LVW +C+ + T P F P AS+TY + CD
Sbjct: 110 YLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCD 169
Query: 154 DPLCR---SPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF---------- 199
CR S C +G C Y Y G G S ETF F +
Sbjct: 170 TKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNNN 229
Query: 200 --------VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCL 249
+ +L FGCS +G F + G++G P+SL+SQL + FSYCL
Sbjct: 230 SSSHGQVEIAKLDFGCSTTTTG-TF--RADGLVGLGGGPVSLASQLGATTSLGRKFSYCL 286
Query: 250 V--REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
A+S + FG A V +TP++ ++ ++ + L I++ + P A
Sbjct: 287 APYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAG--TKRPTTAA 344
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE--FDYCY- 364
I+D+GT +T++ + L++ + R ++P S E D CY
Sbjct: 345 QAH------IIVDSGTTLTYLDSALLTPLVKDLTR------RIKLPRAESPEKILDLCYD 392
Query: 365 ----RYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFC---VAIQDDPKYSILGA 417
R + + P +T L + + F+ G C VA + SILG
Sbjct: 393 ISGVRGEDAL-GIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILGN 451
Query: 418 WQQQNMLIIYDLNVPALRFGSENCAN 443
QQN+ + YDL + F + +CA
Sbjct: 452 IAQQNLHVGYDLEKGTVTFAAADCAK 477
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 159/388 (40%), Gaps = 48/388 (12%)
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+AS + A Q + + M Y ++V +G+P K L+ DT S L W QC PC
Sbjct: 144 VASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYD 203
Query: 131 CFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFA 190
CF Q N C Y Y T G + ETF
Sbjct: 204 CFQQN----------------------------DNQSCPYYYWYGDSSNTTGDFAVETFT 235
Query: 191 FPVR-NGFTF----VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLF 245
+ NG + V + FGC + N G +G+LG PLS SSQL++ F
Sbjct: 236 VNLTTNGGSSELYNVENMMFGCGHWNRGLFH--GAAGLLGLGRGPLSFSSQLQSLYGHSF 293
Query: 246 SYCLVREMEATSV---IKFGRDADVRRR-DLETTPILLSD---LRPHFYLHLLEISIGRH 298
SYCLV T+V + FG D D+ +L T + + +Y+ + I +
Sbjct: 294 SYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGE 353
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
++ P ++I DG GG IID+GT +++ Y+ + + + ++ G+ + Y
Sbjct: 354 VLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAE--KAKGKYPV-YRDFP 410
Query: 359 EFDYCYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK--YSIL 415
D C+ P + + P FI + C+A+ PK +SI+
Sbjct: 411 ILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSII 470
Query: 416 GAWQQQNMLIIYDLNVPALRFGSENCAN 443
G +QQQN I+YD L + CA+
Sbjct: 471 GNYQQQNFHILYDTKRSRLGYAPTKCAD 498
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 152/395 (38%), Gaps = 74/395 (18%)
Query: 113 LFDTASSLVWTQCQPC----------IRCFDQTTPIFDPRASTTYSEIPCDD---PLCR- 158
+ DT S LVWTQC C CF Q P ++ S T +PCDD LC
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 159 --SPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C G CV Y G V G+ + F FP + T LAFGC +
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVT----LAFGCVSQ 191
Query: 211 N--SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--------------VREME 254
S A G SGI+G LSL SQL N + FSYCL V + E
Sbjct: 192 TRISPGALNGA-SGIIGLGRGALSLVSQL-NATE--FSYCLTPYFRDTVSPSHLFVGDGE 247
Query: 255 ATSVIKFGRDADVRRRDLETTPILL----SDLRPHFYLHLLEISIGRHIVRFPPGAFDIM 310
+ + T P S +YL L+ ++ G V P GAFD+
Sbjct: 248 LAGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLR 307
Query: 311 RDG----TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP-YNASQEFDYCYR 365
GG +ID+G+P T + + ++ L + + LR G P + C
Sbjct: 308 EAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVE 367
Query: 366 Y----DS-SFKAYPSMTFHLQEA----DYIVQPENMYFIEPDRGRFCVAIQDDP------ 410
DS + A P + + +V P Y+ + +C+A+
Sbjct: 368 AGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATL 427
Query: 411 ---KYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ +I+G + QQ+M ++YDL L F NC+
Sbjct: 428 PTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 462
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/391 (24%), Positives = 152/391 (38%), Gaps = 50/391 (12%)
Query: 80 FQELEDIHLPMAKQDLF--------YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC 131
Q + H P A+ LF Y+ + IGTP + L+ DT S++ + C C C
Sbjct: 68 LQGSQSEHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHC 127
Query: 132 FDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQ----NGKCVYTRRYHVGDVTRGLASRE 187
P F P AS TY + C ++C +C Y RRY + G+ +
Sbjct: 128 GSHQDPKFRPEASETYQPVKCT-------WQCNCDDDRKQCTYERRYAEMSTSSGVLGED 180
Query: 188 TFAFPVRNGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGL 244
+F N P R FGC ND +G + + GI+G LS+ QL + I
Sbjct: 181 VVSF--GNQSELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDA 238
Query: 245 FSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPP 304
FS C + G + + + S P++ + L EI + + P
Sbjct: 239 FSLCYGGMGVGGGAMVLGGISPPADMVFTHSDPVRS---PYYNIDLKEIHVAGKRLHLNP 295
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNASQEFD 361
F DG G ++D+GT ++ + + SL R P YN D
Sbjct: 296 KVF----DGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYN-----D 346
Query: 362 YCY-----RYDSSFKAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAI---QDDPK 411
C+ K++P + + + PEN F RG +C+ + +DP
Sbjct: 347 ICFSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPT 406
Query: 412 YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++LG +N L++YD + F NC+
Sbjct: 407 -TLLGGIVVRNTLVMYDREHSKIGFWKTNCS 436
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 109/419 (26%), Positives = 169/419 (40%), Gaps = 68/419 (16%)
Query: 86 IHLPMA-KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQP--CIRCFDQTTPIFDPR 142
+ LP+A D S+ V + P L DT S LVW C P C+ C + TP +
Sbjct: 73 LSLPLAPGSDYTLSLSVGPLSTANPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNN 132
Query: 143 AS------TTYSEIPCDDPLCRSPFK-------CQNGKCVYTRRYHVGDVTRGLASRETF 189
+S T IPC P C + C +C + D+ G +
Sbjct: 133 SSNPLPPPTDSRRIPCASPFCSAAHSSAPPADLCAAARCP------LDDIETGSCAASHA 186
Query: 190 AFPVRNGF---TFVPRL-------AFGCSNDNSGFAFG----GKISGILGFNASPLSLSS 235
P+ + + V RL A + +N FA G+ G+ GF PLSL +
Sbjct: 187 CPPLYYAYGDGSLVARLRRGRVGIAASVAVENFTFACAHTALGEPVGVAGFGRGPLSLPA 246
Query: 236 QLR-NRIQGLFSYCLV-------REMEATSVIKFGR---DADVRRRDLETTPILLSDLRP 284
QL + G FSYCLV R + + +I GR + + TP+L + P
Sbjct: 247 QLAPAALSGRFSYCLVAHSFRADRPIRPSPLI-LGRSPGEDPASETGIVYTPLLHNPKHP 305
Query: 285 HFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQI 343
+FY LE +S+G + P + R G GG ++D+GT T + N Y + + + +
Sbjct: 306 YFYSVALEAVSVGGTRIPARPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRA 365
Query: 344 LRSLGRQRIPYNASQE-FDYCYRYD--------SSFKAYPSMTFHLQEADYIVQPENMYF 394
+ + +R Q CY YD S +A P + H + +V P YF
Sbjct: 366 MAAARFERAEAAEDQTGLAPCYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYF 425
Query: 395 I----EPDRGRFCVAI----QDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
+ E R C+ + +DD LG +QQQ ++YD++ + F C +
Sbjct: 426 MGFRSEERRRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 116/432 (26%), Positives = 171/432 (39%), Gaps = 89/432 (20%)
Query: 86 IHLPMA-KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQP--CIRCFDQTTP----- 137
+ LP++ D S+ V + P L DT S LVW C P C+ C + TP
Sbjct: 80 LSLPLSPGSDYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGP 139
Query: 138 IFDPRASTTYSEIPCDDPLC----------------RSPFK-CQNGKCVYTRR-----YH 175
+ P S IPC PLC R P + + G C + Y
Sbjct: 140 LPPPPDS---RRIPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYA 196
Query: 176 VGD-----------VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGIL 224
GD V G +R + A V N FTF C++ A G + G+
Sbjct: 197 YGDGSLVAHLRRGRVALGAGARASVAVAVDN-FTFA------CAHT----ALGEPV-GVA 244
Query: 225 GFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIK-----FGRD-ADVRRRDLET--- 274
GF PLSL QL ++ G FSYCLV A +I+ GR D ET
Sbjct: 245 GFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGF 304
Query: 275 --TPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
TP+L + P+FY LE +S+G ++ P + R G GG ++D+GT T + N
Sbjct: 305 VYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNE 364
Query: 332 PYQTLMQRYDQILRSLGRQRIPYNASQE-FDYCYRYDSSFKAYPSMTFHLQEADYIVQPE 390
Y + + + + + + G R Q CYRY +S + P + H + + P
Sbjct: 365 MYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPR 424
Query: 391 NMYFI---EPDRGR-------FCVAI---------QDDPKYSILGAWQQQNMLIIYDLNV 431
YF+ D G C+ + + D LG +QQQ ++YD++
Sbjct: 425 RNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDA 484
Query: 432 PALRFGSENCAN 443
+ F C +
Sbjct: 485 GRVGFARRRCTD 496
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 160/376 (42%), Gaps = 57/376 (15%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQC---QPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
+ + IGTP + Q ++ DT S L W QC QP T FDP S+T+S +PC P
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWIQCHKKQP-------PTASFDPSLSSTFSILPCTHP 129
Query: 156 LCRS-------PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
LC+ P C QN C Y+ Y G G RE F F + P L GC
Sbjct: 130 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF---SRSVSTPPLILGC 186
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEA--TSVIKFGRD 264
+ +++ GILG N LS + Q ++I FSYC+ R+ T F
Sbjct: 187 ATEST------DPRGILGMNLGRLSFAKQ--SKITK-FSYCVPPRQTRPGFTPTGSFYLG 237
Query: 265 ADVRRRDLETTPILLSDLR--PHF-----YLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ + + ++ S + P+F + ++ I I + P F G+G
Sbjct: 238 NNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQT 297
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYDSSFKAYP-- 374
+ID+G+ T++ + Y + Q++R++G R + Y D C+ S KA
Sbjct: 298 MIDSGSEFTYLVSEAYDKVRA---QVVRAVGPRLKKGYVYGGVADMCF---DSVKAVEIG 351
Query: 375 ----SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY----SILGAWQQQNMLII 426
M F + +V P+ + G CV I K +I+G + QQN+ +
Sbjct: 352 RLIGEMVFEFERGVEVVIPKERVLADVGGGVHCVGIGSSDKLGAASNIIGNFHQQNLWVE 411
Query: 427 YDLNVPALRFGSENCA 442
+DL + FG +C+
Sbjct: 412 FDLVRRRVGFGKADCS 427
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 90/179 (50%), Gaps = 16/179 (8%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P +P +L DT S L W QC PC+RC + P++ P + IPC+D
Sbjct: 56 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDL----IPCND 111
Query: 155 PLCR-----SPFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC+ S +C+ +C Y Y G + G+ R+ F+ G PRLA GC
Sbjct: 112 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 171
Query: 209 NDN-SGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRD 264
D G + + G+LG +S+ SQL ++ ++ + +CL ++ FG D
Sbjct: 172 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL--SSLGGGILFFGDD 228
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 160/377 (42%), Gaps = 47/377 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT------PIFDPRASTTYSE 149
Y ++ +GTP ++ DT S + W C PC C +T +DP S+T
Sbjct: 36 LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGA 95
Query: 150 IPCDDPLCRSPF-----KCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ C D C + C + G C Y+ Y G T+G ++ F + T V
Sbjct: 96 LSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGT 155
Query: 204 A---FGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREMEAT 256
A FGC SG + G++GF + +S+ SQL + ++ F++CL + +
Sbjct: 156 ASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGG 215
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI-GRHIVRFPPGAFDIMRDGTG 315
I G V ++ TPI+ R H+ + + I++ GR++ P +FD G
Sbjct: 216 GTIVIGS---VSEPNISYTPIV---SRNHYAVGMQNIAVNGRNVTT--PASFDTTSTSAG 267
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YP 374
G I+D+GT + ++ + Y + S+ ++ + +C S +A +P
Sbjct: 268 GVIMDSGTTLAYLVDPAYTQFVNAVSTFESSMFSS---HSQCLQLAWC-----SLQADFP 319
Query: 375 SMTFHLQEADYI-VQPENMYFIEP---DRGRFCVAIQDDP------KYSILGAWQQQNML 424
++ + + P N + +P + +C+ Q YSILG ++ L
Sbjct: 320 TVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHL 379
Query: 425 IIYDLNVPALRFGSENC 441
++YD + + + S +C
Sbjct: 380 VVYDNDNRVVGWKSFDC 396
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 165/399 (41%), Gaps = 75/399 (18%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP---CIRC----FDQT-TPIFDPRASTTYS 148
YS ++ GTP + HL+FDT SSLVW C C C D T P F P+ S++
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 149 EIPCDDPLCRSPF----KCQNGKC------------VYTRRYHVGDVTRGLASRETFAFP 192
+ C +P C F K Q C Y +Y G T GL ET FP
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFP 199
Query: 193 VRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-R 251
+ +P GCS F + SGI GF SL SQ+ + F+YCL R
Sbjct: 200 DKK----IPNFVVGCS-----FLSIHQPSGIAGFGRGSESLPSQMGLK---KFAYCLASR 247
Query: 252 EMEAT--SVIKFGRDADVRRRDLETTP------ILLSDLRPHFYLHLLEISIGRHIVRFP 303
+ + + S V+ L TP + + + ++YL++ +I +G V+ P
Sbjct: 248 KFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVP 307
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ---------RIPY 354
DG GG IID+G+ TF+ + + + +++ L + R R +
Sbjct: 308 YKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCF 367
Query: 355 NASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYF-IEPDRGRFCVAI----QDD 409
+ S+E S K +P + F + P N YF + G C+ + +D
Sbjct: 368 DISKE--------KSVK-FPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED 418
Query: 410 PKYS------ILGAWQQQNMLIIYDLNVPALRFGSENCA 442
ILGA+QQQN + YDL L F + C+
Sbjct: 419 GGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 159/388 (40%), Gaps = 42/388 (10%)
Query: 73 SMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCF 132
S PNA L D L + +Y+ + IGTP + L+ DT S++ + C C +C
Sbjct: 61 SKRHPNARMRLHDDLL----LNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCG 116
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGK--CVYTRRYHVGDVTRGLASRETFA 190
P F P S+TY + C C N + CVY R+Y + G+ + +
Sbjct: 117 RHQDPKFQPDLSSTYQPVK-----CTLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVS 171
Query: 191 FPVRNGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSY 247
F N P R FGC N +G + GI+G LS+ QL +N + FS
Sbjct: 172 F--GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSL 229
Query: 248 CL-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGA 306
C ++ +++ G ++ P+ P++ + L EI + + P
Sbjct: 230 CYGGMDVGGGAMVLGGISPPSDMVFAQSDPV----RSPYYNIDLKEIHVAGKRLPLNPSV 285
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNASQEFDYC 363
F DG G ++D+GT ++ + + + L+S + P YN D C
Sbjct: 286 F----DGKHGSVLDSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYN-----DLC 336
Query: 364 YRYDSSFKAYPSMTFHLQEA------DYIVQPENMYFIEPD-RGRFCVAIQDDPK--YSI 414
+ + S TF + + Y + PEN F RG +C+ I + K ++
Sbjct: 337 FSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTL 396
Query: 415 LGAWQQQNMLIIYDLNVPALRFGSENCA 442
LG +N L++YD + F NCA
Sbjct: 397 LGGIVVRNTLVLYDREQTKIGFWKTNCA 424
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 165/399 (41%), Gaps = 75/399 (18%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP---CIRC----FDQT-TPIFDPRASTTYS 148
YS ++ GTP + HL+FDT SSLVW C C C D T P F P+ S++
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 149 EIPCDDPLCRSPF----KCQNGKC------------VYTRRYHVGDVTRGLASRETFAFP 192
+ C +P C F K Q C Y +Y G T GL ET FP
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFP 199
Query: 193 VRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-R 251
+ +P GCS F + SGI GF SL SQ+ + F+YCL R
Sbjct: 200 DKX----IPNFVVGCS-----FLSIHQPSGIAGFGRGSESLPSQMGLK---KFAYCLASR 247
Query: 252 EMEAT--SVIKFGRDADVRRRDLETTP------ILLSDLRPHFYLHLLEISIGRHIVRFP 303
+ + + S V+ L TP + + + ++YL++ +I +G V+ P
Sbjct: 248 KFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVP 307
Query: 304 PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ---------RIPY 354
DG GG IID+G+ TF+ + + + +++ L + R R +
Sbjct: 308 YKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCF 367
Query: 355 NASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYF-IEPDRGRFCVAI----QDD 409
+ S+E S K +P + F + P N YF + G C+ + +D
Sbjct: 368 DISKE--------KSVK-FPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED 418
Query: 410 PKYS------ILGAWQQQNMLIIYDLNVPALRFGSENCA 442
ILGA+QQQN + YDL L F + C+
Sbjct: 419 GGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 152/381 (39%), Gaps = 43/381 (11%)
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPC 152
D Y + + IG P K +L DT S L W QC PC C ++DP+ + + C
Sbjct: 28 DGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRARV---VDC 84
Query: 153 DDPLCR-----SPFKCQNG--KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
P C F C +C Y Y G T G+ +T + NG F R
Sbjct: 85 RRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVI 144
Query: 206 GCSNDNSG-FAFGGKIS-GILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKF 261
GC D G A ++ G++G ++S +SL SQL + + +CL + F
Sbjct: 145 GCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGGGYLFF 204
Query: 262 GRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
G D V + TP++ L + L I G ++ D+ GG + D+
Sbjct: 205 G-DTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDV-----GGAMFDS 258
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-------YP 374
GT T++ Y ++ + + G +RI + + F C+R S F++ +
Sbjct: 259 GTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPF--CWRGPSPFESVADVSAYFK 316
Query: 375 SMTFHLQEADYI-------VQPENMYFIEPDRGRFCVAIQDDPKYS-----ILGAWQQQN 422
++T + + + PE Y I +G C+ + D S ILG +
Sbjct: 317 TVTLDFGGSTWWSSGKLLELSPEG-YLIVSTQGNVCLGVLDASVASLEVTNILGDISMRG 375
Query: 423 MLIIYDLNVPALRFGSENCAN 443
L++YD + + NC N
Sbjct: 376 YLVVYDNMREQIGWVRRNCYN 396
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 148/353 (41%), Gaps = 51/353 (14%)
Query: 124 QCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPFKCQ---NGKCVYTRRYHVGD 178
QCQPC+ C+ Q P+F+P+ S++Y+ +PC C +C +G C YT +Y
Sbjct: 2 QCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHG 61
Query: 179 VTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR 238
VT+G + + A G + FGCS+ + G + SG++G PLSL SQL
Sbjct: 62 VTKGTLAIDKLAI----GGDVFHAVVFGCSDSSVG-GPAAQASGLVGLGRGPLSLVSQLS 116
Query: 239 NRIQGLFSYCLVREMEATS---VIKFGRDADVRRRDLETTPILLSDLR--PHFYLHLLEI 293
F YCL M TS V+ G DA VR T + S R ++YL+L +
Sbjct: 117 VH---RFMYCLPPPMSRTSGKLVLGAGADA-VRNMSDRVTVTMSSSTRYPSYYYLNLDGL 172
Query: 294 SIGRHI------VRFPPGAFDIMRDGTG-------------GFIIDTGTPVTFIRNGPYQ 334
++G PP G G G I+D + ++F+ Y
Sbjct: 173 AVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYD 232
Query: 335 TLMQRYDQILRSLGRQRIPYNASQEFDYCY------RYDSSFKAYPSMTFHLQEADYIVQ 388
L ++ +R R + D C+ D + S++F + ++
Sbjct: 233 ELADDLEEEIR---LPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF---DGRWLEL 286
Query: 389 PENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ F+ R C+ I SILG +Q QNM ++++L + F +C
Sbjct: 287 DRDRLFVTDGR-MMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 113/441 (25%), Positives = 182/441 (41%), Gaps = 68/441 (15%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM-AKQDLFYSVEVNIGTPMKPQ 110
S R HK+ + + + A S A + + P+ AK YSV ++ GTP +
Sbjct: 46 SSIARAHKLKHGTSIKPDEDALSSTTTASATV--VKSPLSAKSYGGYSVSLSFGTPSQTI 103
Query: 111 HLLFDTASSLVWTQCQP---CIRC----FDQT-TPIFDPRASTTYSEIPCDDPLCR---- 158
+FDT SSLVW C C C D T P F P+ S++ I C P C+
Sbjct: 104 PFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYG 163
Query: 159 ----------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
+ C G Y +Y +G T G+ E FP VP GCS
Sbjct: 164 PNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDFPDLT----VPDFVVGCS 218
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIK------- 260
++ + +GI GF P+SL SQ+ + FS+CLV R + T+V
Sbjct: 219 IIST-----RQPAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTNVTTDLDLDTG 270
Query: 261 FGRDADVRRRDLETTP------ILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
G ++ + L TP + ++YL+L I +GR V+ P +G
Sbjct: 271 SGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGD 330
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-- 372
GG I+D+G+ TF+ ++ + + + + + R++ + +E ++ S K
Sbjct: 331 GGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREK---DLEKETGLGPCFNISGKGDV 387
Query: 373 -YPSMTFHLQEADYIVQPENMYFI-EPDRGRFCVAIQDDPKYS---------ILGAWQQQ 421
P + F + + P + YF + C+ + D + ILG++QQQ
Sbjct: 388 TVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQ 447
Query: 422 NMLIIYDLNVPALRFGSENCA 442
N L+ YDL F + C+
Sbjct: 448 NYLVEYDLENDRFGFAKKKCS 468
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 144/363 (39%), Gaps = 53/363 (14%)
Query: 104 GTPMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCDDPLCR--S 159
GT Q ++ D+ S + W QCQPC + C Q P+FDP STTY+ +PC C
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 160 PFK---CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
P++ N +C + Y G G S + + V FGC++ + G F
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTL---GPYDVVRGFLFGCAHADQGSTF 191
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG----RDADVRRRDL 272
++G L S Q ++ +FSYC+ + I FG R A V
Sbjct: 192 SYDVAGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPT--F 249
Query: 273 ETTPILLSD-LRPHFYLHLL-EISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+TP+L S + P FY LL I + + PP F + +ID+ T ++ I
Sbjct: 250 VSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVF------SASSVIDSATVISRIPP 303
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSM--------TFHLQ 381
YQ L + RS P D CY + PS+ T +L
Sbjct: 304 TAYQALRAAF----RSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLD 359
Query: 382 EADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
A ++Q C+A D +G QQ+ + ++YD+ A+RF S
Sbjct: 360 AAGILLQ-------------GCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRS 406
Query: 439 ENC 441
C
Sbjct: 407 AAC 409
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 150/363 (41%), Gaps = 52/363 (14%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD-- 154
Y V +GTP++ L DT++ W+ C PC C + F P +S++Y+ +PC
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDW 136
Query: 155 -PLCRSPF----KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
PL R P + G R T R R G+ P A
Sbjct: 137 CPLFRRPAVPGEPGRVGAAADVRLLQAASRT----PRSGVLAATRCGWARTPSPAT---- 188
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
+ P+SL SQ +R G+FSYCL R + ++ G A
Sbjct: 189 -----------------RSGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLG--AAG 229
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ R++ TP+L + RP +Y+++ +S+GR +V+ P G+F G +ID+GT +T
Sbjct: 230 QPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVIT 289
Query: 327 FIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSMTFHLQEA- 383
Y L + Q+ G Y + FD C+ D + P +T H+
Sbjct: 290 RWTAPVYAALRDEFRRQVAAPSG-----YTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGV 344
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFGS 438
D + EN C+A+ + P+ +++ QQQN+ ++ D+ + F
Sbjct: 345 DLTLPMENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAR 404
Query: 439 ENC 441
E C
Sbjct: 405 EPC 407
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 142/364 (39%), Gaps = 41/364 (11%)
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPC 152
D Y + +G P +P +L DTAS L W QC PC C ++ PR +
Sbjct: 205 DGLYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIVTP--- 261
Query: 153 DDPLCRSPFKCQNG-------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
D LC + Q +C Y Y + G+ +R+ + NG + + F
Sbjct: 262 KDSLCVELHRNQKAGYCETCQQCDYEIEYADHSSSMGVLARDELHLTMANGSSTNLKFNF 321
Query: 206 GCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKF 261
GC+ D G K GILG + + +SL SQL NR I + +CL ++ +
Sbjct: 322 GCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFL 381
Query: 262 GRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
G D V R + P+L S + +++++ G + + R + D+
Sbjct: 382 GDDF-VPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRR-----IVFDS 435
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-------YP 374
G+ T+ Y L+ Q+ G I + +C+R ++ +
Sbjct: 436 GSSYTYFTKEAYSELVASLKQV---SGEALIQDTSDPTLPFCWRAKFPIRSVIDVKQYFK 492
Query: 375 SMTFHLQEADYIVQ-----PENMYFIEPDRGRFCVAIQD-----DPKYSILGAWQQQNML 424
++T +I+ P Y I ++G C+ I D D ILG + L
Sbjct: 493 TLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGSDVHDGSSIILGDISLRGQL 552
Query: 425 IIYD 428
IIYD
Sbjct: 553 IIYD 556
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 141/353 (39%), Gaps = 34/353 (9%)
Query: 104 GTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDDPLCR--S 159
GT Q ++ D+ S + W QC+PC C Q P+FDP STTY+ +PC C
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 160 PFK---CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
P++ N +C + Y G G S + + + FGC++ + G AF
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTL---GPYDVIRGFRFGCAHADRGSAF 278
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR--DLET 274
++G L SL Q R +FSYCL + + G + + +
Sbjct: 279 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS 338
Query: 275 TPILLSDLRPHFYLHLLE--ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
TP+L S + P FY LL I GR + PP F + +ID+ T ++ +
Sbjct: 339 TPLLSSSMAPTFYRVLLRAIIVAGRPLA-VPPAVF------SASSVIDSSTIISRLPPTA 391
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPEN 391
YQ L + + ++ R P + D CY + PS+ +
Sbjct: 392 YQALRAAFRSAM-TMYRAAPPVSI---LDTCYDFTGVRSITLPSIALVFDGGATVNLDAA 447
Query: 392 MYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+ C+A D +G QQ+ + ++YD+ A+RF + C
Sbjct: 448 GILLG-----SCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 159/396 (40%), Gaps = 52/396 (13%)
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+ +P+A L D L + +Y+ + IGTP + L+ D+ S++ + C C +
Sbjct: 63 LGDGGRPSARMRLHDDLL----TNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQ 118
Query: 131 CFDQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETF 189
C + P F P S+TYS + C D C S +C Y R+Y + G+ +
Sbjct: 119 CGNHQDPRFQPDLSSTYSPVKCSADCTCDS----DKSQCTYERQYAEMSSSSGVLGEDIV 174
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSY 247
+F + R FGC N +G F GI+G LS+ QL ++ I FS
Sbjct: 175 SFGTESELK-PQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSM 233
Query: 248 C-----------LVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIG 296
C ++ M A + F R VR P++ + L EI +
Sbjct: 234 CYGGMDIGGGAMVLGAMPAPPDMVFSRSDPVR--------------SPYYNIELKEIHVA 279
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+R P FD G ++D+GT ++ + +R L + R P
Sbjct: 280 GKALRLDPRIFDSKH----GTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPN 335
Query: 357 SQEFDYCY----RYDSSF-KAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAIQDD 409
+ D C+ R S +A+P + + + + PEN F G +C+ + +
Sbjct: 336 YK--DICFAGAGRNVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQN 393
Query: 410 PK--YSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
K ++LG +N L+ YD + + F NC+
Sbjct: 394 GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSE 429
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 151/375 (40%), Gaps = 54/375 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P KP L DT S L W QC PC++C + P + PR + +PC D
Sbjct: 33 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNNL----VPCMD 88
Query: 155 PLCRS-----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
P+C+S +C+N G+C Y Y G + G+ +TF + P LA GC
Sbjct: 89 PICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPLLALGCG 148
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREMEATSVIKFGRDAD 266
D I G+LG S+ SQL + ++ + +CL
Sbjct: 149 YDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLF-------- 200
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGF-----IID 320
+++ + + + P +H + PG ++ DG T GF D
Sbjct: 201 FGDDLYDSSRVAWTPMSPD----------AKH---YSPGLAELTFDGKTTGFKNLLTTFD 247
Query: 321 TGTPVTFIRNGPYQTLMQRYDQIL------RSLGRQRIP--YNASQEFDYCYRYDSSFKA 372
+G T++ + YQ L+ + L +L Q +P + + F FK
Sbjct: 248 SGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKT 307
Query: 373 YP-SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQNMLII 426
+ S T + + P Y I +G C+ I + + +++G Q+ ++I
Sbjct: 308 FALSFTNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVI 367
Query: 427 YDLNVPALRFGSENC 441
YD + + NC
Sbjct: 368 YDNEKERIGWAPGNC 382
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 97/223 (43%), Gaps = 15/223 (6%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI---RCFDQTTPIFDPRASTTYSEIP 151
L Y V ++GTP Q + DT S L W QC+PC C+ Q P+FDP S++Y+ +P
Sbjct: 46 LNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVP 105
Query: 152 CDDPLCRS-----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C P+C C +C Y Y G T G+ S +T + V FG
Sbjct: 106 CGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFG 162
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
C + SG G + G+LG SL Q G+FSYCL + + G
Sbjct: 163 CGHAQSGLFNG--VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGP 220
Query: 267 VRRR-DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAF 307
TT +L S P +Y+ +L IS+G + P AF
Sbjct: 221 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF 263
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 103/419 (24%), Positives = 161/419 (38%), Gaps = 113/419 (26%)
Query: 31 GFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM 90
GFS+ LI SP SP Y +L+ SERI A++S N + E I +P
Sbjct: 28 GFSIDLIHRDSPLSPFYNPSLTPSERITD------------AALSS-NENKLPESILIPN 74
Query: 91 AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEI 150
+ Y + + IGTP + ++ DT S +W QC PC
Sbjct: 75 NGE---YLMRLYIGTPPVERLVIADTGSDFIWVQCSPC---------------------- 109
Query: 151 PCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV--PRLAFGC- 207
QN +CVY Y T + ET +F G V P FGC
Sbjct: 110 -------------QNCQCVYLNIYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCG 156
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADV 267
+N+N F K +G++G A LSL SQL +I FSY +KFG +A +
Sbjct: 157 ANNNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGYKFSY-----------LKFGSEAII 205
Query: 268 RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+ +TP+++ P ++L+L ++IG+ +V
Sbjct: 206 TTNGVVSTPLIIKPSLPLYFLNLEVVTIGQKVV--------------------------- 238
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
P +TL Q +P+ F +C+ Y + P++ F A +
Sbjct: 239 ----PTETLGVE--------SVQDLPF----PFKFCFPYRDNMTV-PAIAFQFTGASVAL 281
Query: 388 QPENMYFIEPDRGRFCVAIQDDPK----YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+P+N+ DR +A+ SI G Q + ++YDL+ + +C
Sbjct: 282 RPKNLLIKLQDRNMLXLAVVPSASSLSVISIFGIIAQFDFQVLYDLDGKKVSVAPTDCT 340
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 118/450 (26%), Positives = 183/450 (40%), Gaps = 98/450 (21%)
Query: 72 ASMSKPNAFQELEDIHLPMAKQDLF----------YSVEVNIGTPMKPQHLLFDTASSLV 121
S+ P + Q E I P++ D+ Y + +NIGTP + + DT S L
Sbjct: 49 VSLPTPKS-QTQERIKKPLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLT 107
Query: 122 WTQCQ----PCIRCFD------QTTPIFDPRASTTYSEIPCDDPLC------RSPFK-CQ 164
W C CI C+D ++ +F P S+T C C +PF C
Sbjct: 108 WVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCA 167
Query: 165 NGKC---------------VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C + Y G + G+ +R+ R+ VPR +FGC
Sbjct: 168 VAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRD----VPRFSFGCVT 223
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYC-----LVREMEATSVIKFGRD 264
+ I GI GF LSL SQL +G FS+C V +S + G
Sbjct: 224 S----TYREPI-GIAGFGRGLLSLPSQLGFLEKG-FSHCFLPFKFVNNPNISSPLILGAS 277
Query: 265 A-DVRRRD-LETTPILLSDLRPH-FYLHLLEISIGRHI--VRFPPGAFDIMRDGTGGFII 319
A + D L+ TP+L + + P+ +Y+ L I+IG +I + P G GG ++
Sbjct: 278 ALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLV 337
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE------FDYCYRY------- 366
D+GT T + P+ Y Q+L +L + I Y + E FD CY+
Sbjct: 338 DSGTTYTHLPE-PF------YSQLLTTL-QSTITYPRATETESRTGFDLCYKVPCPNNNL 389
Query: 367 ----DSSFKAYPSMTFHLQEADYIVQPENMYFI---EPDRGRF--CVAIQ--DDPKY--- 412
+ +PS+TFH ++ P+ F P G C+ Q +D Y
Sbjct: 390 TSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPA 449
Query: 413 SILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ G++QQQN+ ++YDL + F + +C
Sbjct: 450 GVFGSFQQQNVKVVYDLEKERIGFQAMDCV 479
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 154/393 (39%), Gaps = 52/393 (13%)
Query: 73 SMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCF 132
S PNA L D L ++ +Y+ + IGTP + L+ DT S++ + C C C
Sbjct: 73 SEHHPNARMRLYDDLL----RNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCG 128
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGK--CVYTRRYHVGDVTRGLASRETFA 190
P F P S TY + C C N + C Y RRY + G + +
Sbjct: 129 SHQDPKFRPEDSETYQPVK-----CTWQCNCDNDRKQCTYERRYAEMSTSSGALGEDVVS 183
Query: 191 FPVRNGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSY 247
F N P R FGC ND +G + + GI+G LS+ QL + I FS
Sbjct: 184 F--GNQTELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSL 241
Query: 248 CLVREMEATSVIKFG-----RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRF 302
C + G D R D P+ P++ + L EI + +
Sbjct: 242 CYGGMGVGGGAMVLGGISPPADMVFTRSD----PV----RSPYYNIDLKEIHVAGKRLHL 293
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNASQE 359
P F DG G ++D+GT ++ + + SL R P YN
Sbjct: 294 NPKVF----DGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYN---- 345
Query: 360 FDYCY---RYDSS--FKAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAI---QDD 409
D C+ D S K++P + + + PEN F RG +C+ + +D
Sbjct: 346 -DICFSGAEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGND 404
Query: 410 PKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
P ++LG +N L++YD + F NC+
Sbjct: 405 PT-TLLGGIVVRNTLVMYDREHTKIGFWKTNCS 436
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 148/363 (40%), Gaps = 34/363 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-D 154
+Y+ + IGTP + L+ DT SS+ + C C +C P F P S+TY + C+ D
Sbjct: 12 YYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNID 71
Query: 155 PLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP-RLAFGCSNDNSG 213
C + +CVY R+Y + G+ + +F N P R FGC N +G
Sbjct: 72 CNCDD----EKQQCVYERQYAEMSTSSGVLGEDIISF--GNLSALAPQRAVFGCENMETG 125
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDADVRRRD 271
+ GI+G LS+ L ++ I FS C + G +
Sbjct: 126 DLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPPSNMV 185
Query: 272 LETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNG 331
+ + S P++ + L EI + + P F DG G I+D+GT ++
Sbjct: 186 FSQSDPVRS---PYYNIDLKEIHVAGKPLPLNPTVF----DGKHGTILDSGTTYAYLPEA 238
Query: 332 PYQTLMQRYDQILRSLGRQRIP---YNASQEFDYCYRYDSSFKAYPSMTFHLQEADY--- 385
+ + + L SL R P YN D C+ S + S +F E +
Sbjct: 239 AFVSFKDAIMKELHSLKPIRGPDPNYN-----DICFSGAGSDISQLSSSFPAVEMVFGNG 293
Query: 386 ---IVQPENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSE 439
++ PEN F G +C+ I + K ++LG +N L++YD + F
Sbjct: 294 QKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKIGFWKT 353
Query: 440 NCA 442
NC+
Sbjct: 354 NCS 356
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 92/374 (24%), Positives = 154/374 (41%), Gaps = 39/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G P K ++ DT S ++W C C C FDP +STT S +
Sbjct: 82 LYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLV 141
Query: 151 PCDDPLC-----RSPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPV----RNGFTF 199
C D +C S C Q+ +C Y +Y G T G + V
Sbjct: 142 SCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNS 201
Query: 200 VPRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEA 255
+ FGCS +G + GI GF LS+ SQL +R +FS+CL +
Sbjct: 202 SASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSG 261
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
++ G ++ ++ TP++ S +PH+ L+L IS+ ++ P F +
Sbjct: 262 GGILVLG---EIVEPNVVYTPLVPS--QPHYNLNLQSISVNGQVLPISPAVF--ATSSSQ 314
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYP 374
G IID+GT + ++ Y + I+ S Q + ++ CY SS +P
Sbjct: 315 GTIIDSGTTLAYLAEEAYNAFVVAVTNIV-SQSTQSVVLKGNR----CYVTSSSVSDIFP 369
Query: 375 SMTFHLQEADYIVQPENMYFIEPDR----GRFCVAIQDDP--KYSILGAWQQQNMLIIYD 428
++ + +V Y I+ + +C+ Q P +ILG ++ + IYD
Sbjct: 370 QVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYD 429
Query: 429 LNVPALRFGSENCA 442
L + + + +C+
Sbjct: 430 LANQRIGWTNYDCS 443
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 153/371 (41%), Gaps = 46/371 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y V ++IG P KP L DT S L W QC PC+RC P++ P + + C D
Sbjct: 66 YYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNL----VICKD 121
Query: 155 PLCRS----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
P+C S +KC++ +C Y Y G + G+ ++ F NG PRLA GC
Sbjct: 122 PMCASLHPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGY 181
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDADV 267
D + G+LG S+ SQL ++ I+ + +C+ + FG D
Sbjct: 182 DQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCV--SSRGGGFLFFGDDLYD 239
Query: 268 RRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
R + T +L D H+ E+ +G F +++ D+G+ T+
Sbjct: 240 SSRVVWTP--MLRDQHTHYSSGYAELILGGKTTVFK----NLL------VTFDSGSSYTY 287
Query: 328 IRNGPYQTLMQRYDQIL------RSLGRQRIP--YNASQEFDYCYRYDSSFK----AYPS 375
+ + YQ L+ + L +L Q +P + + F FK ++P
Sbjct: 288 LNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPG 347
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLN 430
+ D P Y I +G C+ I + + ++++G Q+ +++YD
Sbjct: 348 GGRTKTQYDI---PLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNE 404
Query: 431 VPALRFGSENC 441
+ + NC
Sbjct: 405 KNQIGWAPTNC 415
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 153/365 (41%), Gaps = 39/365 (10%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q Y + V +GTP K Q + DT SS W C+ C C R STT +++ C
Sbjct: 78 QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSR-STTCAKVSC 135
Query: 153 DDPLC---RSPFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
+C S CQ+ + C + Y G + G+ ++T F + +P F
Sbjct: 136 GTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF---SDVQKIPSFTF 192
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM-------EATSV 258
GC+ D+ G G + G+LG A P+S+ Q R G FSYCL + + T
Sbjct: 193 GCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGY 251
Query: 259 IKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
G+ A R D+ T ++ F++ L IS+ + P F G
Sbjct: 252 FSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----RKGV 304
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY-CYRYDSSFKA-YPS 375
+ D+G+ +++I + L QR ++L G A +E + CY S + P+
Sbjct: 305 VFDSGSELSYIPDRALSVLSQRIRELLLRRG------AAEEESERNCYDMRSVDEGDMPA 358
Query: 376 MTFHLQEADYIVQPENMYFIE---PDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
++ H + + F+E ++ +C+A SI+G+ Q + ++YDL
Sbjct: 359 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQ 418
Query: 433 ALRFG 437
+ G
Sbjct: 419 LIGIG 423
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/258 (31%), Positives = 125/258 (48%), Gaps = 26/258 (10%)
Query: 187 ETFAFPVRNGFTFVPRLAFGCS-NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLF 245
ETF F + P +AFGC+ GF G SG++G LSL +QL F
Sbjct: 3 ETFTF--GDDAAAFPGIAFGCTLRSEGGFGTG---SGLVGLGRGKLSLVTQLNVEA---F 54
Query: 246 SYCLVREMEATSVIKFGRDADVRRRDLET---TPIL----LSDLRPHFYLHLLEISIGRH 298
Y L ++ A S I FG ADV + ++ TP+L + DL P +Y+ L IS+G
Sbjct: 55 GYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDL-PFYYVGLTGISVGGK 113
Query: 299 IVRFPPGAFDIMRD-GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+V+ P G F R G GG I D+GT +T + + P TL++ D++L +G Q+ P A+
Sbjct: 114 LVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPD-PAYTLVR--DELLSQMGFQKPPPAAN 170
Query: 358 QEFDYCYRYDSSFKAYPSMTFHLQ-EADYIVQPEN----MYFIEPDRGRFCVAIQDDPKY 412
+ C+ SS +PSM H AD + EN M + R ++
Sbjct: 171 DDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQAL 230
Query: 413 SILGAWQQQNMLIIYDLN 430
+I+G Q + +++DL+
Sbjct: 231 TIIGNIMQMDFHVVFDLS 248
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/433 (25%), Positives = 178/433 (41%), Gaps = 67/433 (15%)
Query: 30 TGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELED---- 85
+GFS++ I S +S + L+ R+ + S AR + A ++ A
Sbjct: 2 SGFSVEFIHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDS 61
Query: 86 ---IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPR 142
+ PM Q+ Y + +++ TP L DT SSLVW +C+ P
Sbjct: 62 DADVVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK---------LPAAHTP 112
Query: 143 ASTTYSEIPCDDPLCRS---PFKCQ-----NGKCVYTRRYHVGDVTRGLASRETFAFPVR 194
AS++Y+ +PCD C++ C+ N CVY + G T G PV
Sbjct: 113 ASSSYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAG---------PVT 163
Query: 195 -NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI--QGLFSYCLV- 250
+ FTF RL FGC+ G + G++G P+SL SQL + FSYCLV
Sbjct: 164 VDAFTFSTRLDFGCATRTEGLSVPDD--GLVGLANGPISLVSQLSAKTPFAHKFSYCLVP 221
Query: 251 --REMEATSVIKFGRDADVRRR-DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
+S + FG A V TTP++ + + + L I + V
Sbjct: 222 YSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVP------ 275
Query: 308 DIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYD 367
++ T I+D+GT +T++ L+ + ++P S E Y YD
Sbjct: 276 --LQTTTTKLIVDSGTMLTYLPKAVLDPLVAALTAAI------KLPRVKSPETLYAVCYD 327
Query: 368 -------SSFKAYPSMTFHLQEADYIVQPENMYFIEPDRG-RFCVAIQDD--PKYSILGA 417
K+ P +T L + P F+ ++G C+A+ + P++ ILG
Sbjct: 328 VRRRAPEDVGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEF-ILGN 386
Query: 418 WQQQNMLIIYDLN 430
QQN+ + +DL
Sbjct: 387 VAQQNLHVGFDLE 399
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 125/257 (48%), Gaps = 29/257 (11%)
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME-ATSVIKF 261
L FGC ++G G SG++G + +SL SQL FSYCL E TS + F
Sbjct: 94 LGFGCGALSAGSLVGA--SGLMGLSPGTMSLISQLSVP---RFSYCLTPFAERKTSPMLF 148
Query: 262 GRDADVRRRD----LETTPILLSDLRPHFYLH--LLEISIGRHIVRFPPGAFDIMRDGTG 315
G AD+R+ + ++TT IL + FY + L+ +S+G +R P + I DGTG
Sbjct: 149 GAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPDGTG 208
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP-YNAS-QEFDYCYRYDSSFK-- 371
G I+D+G+ + + + + + +L ++ ++P +N + ++++ C+ S
Sbjct: 209 GTIVDSGSTMAHLAGKAFDAVKK---AVLEAV---KLPVFNGTVEDYELCFAVPSGVAMA 262
Query: 372 --AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNML 424
P + H + P + YF EP G C+A+ P+ SI+G QQQNM
Sbjct: 263 AVKTPPLVLHFDGGAAMALPRDNYFQEPRAGLMCLAVARSPEDLGAPISIIGNVQQQNMH 322
Query: 425 IIYDLNVPALRFGSENC 441
+++D++ F C
Sbjct: 323 VLFDVHNQKFSFAPTKC 339
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 158/369 (42%), Gaps = 42/369 (11%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
+D+ Y E+ IG + Q+LL DT SSLVWTQC C C P + S T+ E+ C
Sbjct: 78 EDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHCHIGDVPPYGRSQSRTFQEVSC 137
Query: 153 DDP---------LCRSPFK-------CQNGKCVYTRRYHV---GDVTRGLASRETFAFPV 193
D P K C NG+C++ Y++ G+ +G S +TF F
Sbjct: 138 GDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTFHFID 197
Query: 194 RNGFTFVP--RLAFGCSN-DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
F + R+ FGC++ +N + +GILG S LR FSYC+
Sbjct: 198 DRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASF---LRQTGITKFSYCVP 254
Query: 251 REMEA-----TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG 305
M S ++FG A + + + P+++ + + L + + + P
Sbjct: 255 PRMPGYSYRRHSWLRFGSHAQISGKKV---PLVMRWGKYYLPLTAITYTYNELMSPVPII 311
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
A+ D ++DTGT + + + L++ + I++S + I A++ +CY+
Sbjct: 312 AYKSQEDYL-HMMVDTGTSLLSLPTSLHDDLIKEMEAIIKS---ENIMEGATRWPKHCYK 367
Query: 366 YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGR---FCVAIQ--DDPKYSILGAWQQ 420
++T I + FI+ + + C+A+ DD +ILG + Q
Sbjct: 368 RTMDEVKDITVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMFAQ 427
Query: 421 QNMLIIYDL 429
N+ + YDL
Sbjct: 428 TNINVGYDL 436
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 153/378 (40%), Gaps = 58/378 (15%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
FY+V +NIG P +P L DT S L W QC PC +C + P++ P + IPC D
Sbjct: 73 FYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHPLYKP----SNDFIPCKD 128
Query: 155 PLCRS-----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC S + C++ +C Y +Y T G+ + + NG R+A GC
Sbjct: 129 PLCASLQPTDDYTCEDPNQCDYEIKYADQYSTLGVLLNDVYLLNFTNGVQLKVRMALGCG 188
Query: 209 NDN----SGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFG 262
D S + + GILG SL SQL ++ ++ + +CL I FG
Sbjct: 189 YDQIFSPSTYH---PLDGILGLGRGKASLISQLNSQGLVRNVMGHCL--SSRGGGYIFFG 243
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
D R + TPI D H+ E+ G G+ +I I DTG
Sbjct: 244 NVYDSSR--MSWTPISSIDSGKHYSAGPAELVFGGRKTGV--GSLNI--------IFDTG 291
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS---QEFDYCYRYDSSFKA------- 372
+ T+ + YQ ++ ++ L R P A+ Q C+ F++
Sbjct: 292 SSYTYFNSQAYQAMISLLNKEL-----HRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKY 346
Query: 373 YPSMTFHLQEADYIVQ----PENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQNM 423
+ +T + P Y I + G C+ I + P+ +++G +
Sbjct: 347 FKPLTLSFTNGGRVKPQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLIGDISMLDK 406
Query: 424 LIIYDLNVPALRFGSENC 441
++++D + +G +C
Sbjct: 407 VMVFDNEKQLIGWGPADC 424
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/439 (25%), Positives = 173/439 (39%), Gaps = 64/439 (14%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQH 111
S R HK+ + + + A S A + HL K YSV ++ GTP +
Sbjct: 46 SSIARAHKLKHGTSIKPDEEALSSTATASATVVKSHL-SPKSYGGYSVSLSFGTPSQTIP 104
Query: 112 LLFDTASSLVWTQCQPCIRCFD--------QTTPIFDPRASTTYSEIPCDDPLCRSPFKC 163
+FDT SSLVW C C D P F P+ S++ I C +P C+ F
Sbjct: 105 FVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSSSRVIGCQNPKCQFLFG- 163
Query: 164 QNGKC---------------VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
N +C Y +Y +G T G+ E FP VP GCS
Sbjct: 164 ANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKLDFPDLT----VPDFVVGCS 218
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIK-FGRDAD 266
++ +GI GF P SL SQ++ + FS+CLV R + T+V G D
Sbjct: 219 VISTRTP-----AGIAGFGRGPESLPSQMKLKS---FSHCLVSRRFDDTNVTTDLGLDTG 270
Query: 267 VRRRDLETTPILL------------SDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ TP L + ++YL+L I +G V+ P +G
Sbjct: 271 SGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVKIPYKFLAPGTNGN 330
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AY 373
GG I+D+G+ TF+ ++ + + + + + R++ S C+
Sbjct: 331 GGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSG-IAPCFNISGKGDVTV 389
Query: 374 PSMTFHLQEADYIVQPENMYF-IEPDRGRFCVAIQDDPKYS---------ILGAWQQQNM 423
P + F + + P + YF + C+ + D + ILG++QQQN
Sbjct: 390 PELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGGGTGPAIILGSFQQQNY 449
Query: 424 LIIYDLNVPALRFGSENCA 442
L+ YDL F + C+
Sbjct: 450 LVEYDLENDRFGFAKKKCS 468
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 122/249 (48%), Gaps = 15/249 (6%)
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME-ATSV 258
+PR+ FGC +N + +G+LG LSL SQL + FSYCL E TS
Sbjct: 139 IPRIGFGCGVNNRATGMD-QTAGLLGLGRGVLSLVSQLGTQ---KFSYCLTSIHENKTSS 194
Query: 259 IKFGRDA--DVRRRDLETTPILLSDLRP-HFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
+ FG A + + TP++ + P ++YL L I++G ++ P AF + +DG+G
Sbjct: 195 LLFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLPIPEFAFQLGKDGSG 254
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY---DSSFKA 372
G I+D+GT +T+++ + L + S ++ +++ D C+ +++
Sbjct: 255 GMILDSGTTITYLQEDAFDVLKNAF----ISQTELQVANSSTTGLDLCFHLPVKNAAEVK 310
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
P + FH + D + EN +P+ G C+AI SI G QQQNML+++DL
Sbjct: 311 VPKLIFHFKGLDLALPVENYMVSDPEMGLICLAIDATGSLSIFGNIQQQNMLVLHDLKKS 370
Query: 433 ALRFGSENC 441
L C
Sbjct: 371 TLSLVPTQC 379
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 156/360 (43%), Gaps = 47/360 (13%)
Query: 114 FDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSEIPCDDPLCRSPFKCQNG- 166
DT S ++W C C C Q++ + FD S+T + IPC D +C S +
Sbjct: 85 IDTGSDILWVNCNTCSNC-PQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAE 143
Query: 167 ------KCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFVPRLAFGCSNDNSG--F 214
+C YT +Y G T G + F + G + FGCS SG
Sbjct: 144 CSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGDLT 203
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGL----FSYCLVREMEATSVIKFGRDADVRRR 270
+ GI GF PLS+ SQL + QG+ FS+CL + ++ G ++
Sbjct: 204 KTDKAVDGIFGFGPGPLSVVSQLSS--QGITPKVFSHCLKGDGNGGGILVLG---EILEP 258
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
+ +P++ S +PH+ L+L I++ + P F I + GG I+D GT + ++
Sbjct: 259 SIVYSPLVPS--QPHYNLNLQSIAVNGQPLPINPAVFSI-SNNRGGTIVDCGTTLAYLIQ 315
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQ-EADYIVQ 388
Y L+ + + RQ + + + CY +S +P ++ + + A +++
Sbjct: 316 EAYDPLVTAINTAVSQSARQ-----TNSKGNQCYLVSTSIGDIFPLVSLNFEGGASMVLK 370
Query: 389 PE-----NMYFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
PE N Y + +CV Q SILG ++ +++YD+ + + + +C+
Sbjct: 371 PEQYLMHNGYLDGAE--MWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCS 428
>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
Length = 278
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 86/165 (52%), Gaps = 27/165 (16%)
Query: 49 GNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
GN ++ ER+ + + K R +++ K +F+ + P+ + + +++ IGTP +
Sbjct: 46 GNYTKFERLQRAMKRGKLRLQRLSA--KTASFES--SVEAPVHAGNGEFLMKLAIGTPAE 101
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKC 168
+ DT S L+WTQC+PC CFDQ TPIFDP+ S+++S++PC L S
Sbjct: 102 TYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLYYSS-------- 153
Query: 169 VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
T+G+ + ETFAF G V ++ FGC DN G
Sbjct: 154 -----------TQGVLATETFAF----GDASVSKIGFGCGEDNDG 183
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 108/245 (44%), Gaps = 32/245 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC--- 152
Y V+V G+P + ++ DT SSL W QC+PC + C Q P+FDP AS TY + C
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 153 ----------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
++PLC + + CVYT Y + G S++ +P
Sbjct: 178 QCSSLVDATLNNPLCET----SSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ---TLPG 230
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG 262
+GC D+ G G+ +GILG + LS+ Q+ ++ FSYCL + G
Sbjct: 231 FVYGCGQDSDGLF--GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR-GGGGFLSIG 287
Query: 263 RDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
+ A + + TP+ P Y L L I++G + + + IID+
Sbjct: 288 K-ASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------IIDS 340
Query: 322 GTPVT 326
GT +T
Sbjct: 341 GTVIT 345
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/430 (23%), Positives = 161/430 (37%), Gaps = 30/430 (6%)
Query: 26 SSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELED 85
+S G +L L+ P SP+ E ++ A + S + ++ +EL+
Sbjct: 53 TSSKNGATLPLVHRHGPCSPVMSKEKPSHEETLGRDQLRAANIHAKLSSPRNSSAKELQQ 112
Query: 86 IHLPMAKQDLF------YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTP 137
+ + + Y + V++GTP Q + DT S + W QC PC C Q
Sbjct: 113 SGVTIPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDK 172
Query: 138 IFDPRASTTYSEIPCDDPLCR----SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPV 193
+FDP S TYS C C C N C Y +Y T G +T
Sbjct: 173 LFDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTT 232
Query: 194 RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VRE 252
+ V FGCS+ +GF G++ G++G SL SQ FSYCL
Sbjct: 233 SDA---VKNFQFGCSHRANGFV--GQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSS 287
Query: 253 MEATSVIKFGRDA-DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMR 311
A + G A TP++ ++ + + L I++ + P F
Sbjct: 288 SSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVF---- 343
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK 371
+G ++D+GT +T + YQ L + + +++ P D C+ + S K
Sbjct: 344 --SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAY-PSAAPVGI---LDTCFDF-SGIK 396
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNV 431
L + V ++ I A D ILG QQ+ +++D+
Sbjct: 397 TVRVPVVTLTFSRGAVMDLDVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGG 456
Query: 432 PALRFGSENC 441
L F C
Sbjct: 457 STLGFRPGAC 466
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 146/392 (37%), Gaps = 93/392 (23%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC----------IRCFDQTTPIFDPRASTT 146
Y IG P +P + DT S LVWTQC C CF Q P ++ S T
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 147 YSEIPCDD---PLCR---SPFKCQNG------KCVYTRRYHVGDVTRGLASRETFAFPVR 194
+PCDD LC C G CV Y G V G+ + F FP
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSS 196
Query: 195 NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREME 254
+ T LAFGC + SP +L+
Sbjct: 197 SSVT----LAFGCVSQT---------------RISPGALTG------------------- 218
Query: 255 ATSVIKFGRDA-DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
A+ +I GR A + +D S +YL L+ ++ G V P GAFD+
Sbjct: 219 ASGIIGLGRGALSLNPKD--------SPFSTFYYLPLVGLAAGNATVALPAGAFDLREAA 270
Query: 314 ----TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR-IPYNASQEFDYCYRY-- 366
GG +ID+G+P T + + ++ L + + LR G P + C
Sbjct: 271 PKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGD 330
Query: 367 --DS-SFKAYPSMTFHLQE----ADYIVQPENMYFIEPDRGRFCVAIQDDP--------- 410
DS + A PS+ + +V P Y+ + +C+A+
Sbjct: 331 DGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATLPTN 390
Query: 411 KYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ +I+G + QQ+M ++YDL L F NC+
Sbjct: 391 ETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 422
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 97/223 (43%), Gaps = 15/223 (6%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI---RCFDQTTPIFDPRASTTYSEIP 151
L Y V ++GTP Q + DT S L W QC+PC C+ Q P+FDP S++Y+ +P
Sbjct: 138 LNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVP 197
Query: 152 CDDPLCRS-----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C P+C C +C Y Y G T G+ S +T + V FG
Sbjct: 198 CGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFG 254
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
C + SG G + G+LG SL Q G+FSYCL + + G
Sbjct: 255 CGHAQSGLFNG--VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGP 312
Query: 267 VRRR-DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAF 307
TT +L S P +Y+ +L IS+G + P AF
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF 355
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/394 (24%), Positives = 169/394 (42%), Gaps = 45/394 (11%)
Query: 72 ASMSKPNA---FQELE-DI----HLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
A S NA F+E++ DI +P+ + Y V++ +GTP L DT S + WT
Sbjct: 14 ARFSNKNAGSHFKEMQADIPVQSGIPLGAGN--YLVKMALGTPKLSLSLALDTGSDITWT 71
Query: 124 QCQPCI-RCFDQTTPIFDPRASTTYSEIPCDDPLCR------SPFKCQNGKCVYTRRYHV 176
QC+PC+ C+ Q FDPR S++Y + C CR C + C+Y +Y
Sbjct: 72 QCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVSSTCIYKVQYGD 131
Query: 177 GDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQ 236
G + G + E + + FGC N+G G+I+G+LG LSL+ Q
Sbjct: 132 GSYSVGFFATEKLTISPSD---VISNFLFGCGQQNAGRF--GRIAGLLGLGRGKLSLALQ 186
Query: 237 LRNRIQGLFSYCLVR-EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISI 295
+ LF+YCL +T + G + +P + P + + + +S+
Sbjct: 187 TSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKN--TPFYGIDIKGLSV 244
Query: 296 GRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYN 355
G H++ F G IID+GT +T ++ Y L ++ Q+++ + +
Sbjct: 245 GGHVLPIDASVFS-----NAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKT----D 295
Query: 356 ASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIVQPENMYF----IEPDRGRFCVAI---Q 407
D CY + + + P ++F + V+ + +F + + C+A
Sbjct: 296 GFSILDTCYDFSGNESISVPRISFFFKGG---VEVDIKFFGILTVINAWDKVCLAFAPND 352
Query: 408 DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
DD + + G QQQ +++DL + F C
Sbjct: 353 DDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/443 (22%), Positives = 167/443 (37%), Gaps = 62/443 (13%)
Query: 37 IPIFSPESPLYP-GNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDL 95
+P+ P P P + + + +M + R Y+ + A ED+ P + L
Sbjct: 57 VPLHRPFGPCSPSAGRAPAPSLLEMLRWDQVRTEYV----RRKASGGAEDVLNPAKPRVL 112
Query: 96 FYSVEVNIGTP---------------------MKPQHLLFDTASSLVWTQCQPCI--RCF 132
+ + +P + Q + DT + W QC PC +C+
Sbjct: 113 MSQTDFAVRSPFGVGSGSGSSAWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCY 172
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLCRS--PF------KCQNGKCVYTRRYHVGDVTRGLA 184
Q P+FDP S+T + + C P CRS P+ + N +C Y Y T G
Sbjct: 173 PQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSANAECRYLIEYSDDRATAGTY 232
Query: 185 SRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
+T +G T V FGCS+ G F +G + SL +Q +
Sbjct: 233 MTDTLTI---SGTTAVRNFRFGCSHAVRG-RFSDLTAGTMSLGGGAQSLLAQTARSLGNA 288
Query: 245 FSYCLVREMEATSVIKFGRDADVRRRDL-ETTPILLSDLRPHFYLHLLE-ISIGRHIVRF 302
FSYC V + A+ + G A + TTP++ S + P YL L+ I + +
Sbjct: 289 FSYC-VPQASASGFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGI 347
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
PP AF + G ++D+ +T + Y+ L + + +R+ R A+ D
Sbjct: 348 PPVAF------SAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRS----GATGTLDT 397
Query: 363 CYRYDSSFKA-YPSMTFHLQEADYIVQPENMYFIEPDRGRFCV---AIQDDPKYSILGAW 418
CY + P+++ +V I C+ A D +G
Sbjct: 398 CYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMIG-----GCLAFTATSSDLALGFIGNV 452
Query: 419 QQQNMLIIYDLNVPALRFGSENC 441
QQQ ++YD+ + F C
Sbjct: 453 QQQTHEVLYDVAAGGVGFRRGAC 475
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 165/380 (43%), Gaps = 52/380 (13%)
Query: 97 YSVEVNIGTP---MKPQHLLFDTASSLVWTQCQPCIRCFDQTT-PIFDPRASTTYSEIPC 152
Y V++ IGTP + P+++LFDT S L WTQC+PC C T P DP S T+ + C
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 182
Query: 153 DDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVR---NGFTFVPRLA 204
DP+C +G C++ RRY G G + F F G+ +A
Sbjct: 183 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 242
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCL-----------VRE 252
FGC++ A G +GIL S +QL +R FSYC+ E
Sbjct: 243 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR----FSYCIPASEITDDDDDDDE 298
Query: 253 MEATSVIKFGRDADVR------RRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGA 306
+ S ++FG A + ++D + L + Y H GR + P
Sbjct: 299 ERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV---VYQHG-----GRLNQQQPVPV 350
Query: 307 FDIMRDGTGG--FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
+ + ++D+GT + ++ + L +R ++ + SL R+ Y+ + YCY
Sbjct: 351 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDI-SLTRR---YDLTHPSLYCY 406
Query: 365 RYDSSFKAYPSMTFHL-QEADYIVQPENMYFIEPD--RGRFCVAIQDDPKYSILGAWQQQ 421
+ + S+T AD + +++F + + C+A+ + +ILG + Q+
Sbjct: 407 LGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYPQR 465
Query: 422 NMLIIYDLNVPALRFGSENC 441
N+ + YDL+ + F + C
Sbjct: 466 NINVGYDLSTMEIAFDRDQC 485
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 113/454 (24%), Positives = 181/454 (39%), Gaps = 46/454 (10%)
Query: 8 PLAAFFSYFSVLFLTHFTSSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKAR 67
PL A YF ++ T + T L+ S S L P LS + R
Sbjct: 29 PLQALLQYFPIILFLIATVAGDTAL-LRNRHHGSRPSMLLPLYLSAPNSSTSALD---PR 84
Query: 68 ANYMASMSK--PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
S SK PNA L D L + +Y+ + IGTP + L+ DT S++ + C
Sbjct: 85 RQLTGSESKRHPNARMRLHDDLL----LNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPC 140
Query: 126 QPCIRCFDQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLA 184
C +C P F P +S+TY + C D C +CVY R+Y + G+
Sbjct: 141 STCEQCGRHQDPKFQPESSSTYQPVKCTIDCNCDG----DRMQCVYERQYAEMSTSSGVL 196
Query: 185 SRETFAFPVRNGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--I 241
+ +F N P R FGC N +G + GI+G LS+ QL ++ I
Sbjct: 197 GEDVISF--GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVI 254
Query: 242 QGLFSYCL-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIV 300
FS C ++ +++ G + P D P++ + L E+ + +
Sbjct: 255 SDSFSLCYGGMDVGGGAMVLGGISPPSDMTFAYSDP----DRSPYYNIDLKEMHVAGKRL 310
Query: 301 RFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNAS 357
F DG G ++D+GT ++ + + L+SL + P YN
Sbjct: 311 PLNANVF----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYN-- 364
Query: 358 QEFDYCYRYDSSFKAYPSMTFHLQEA------DYIVQPENMYFIEPD-RGRFCVAI--QD 408
D C+ + + S +F + + Y + PEN F RG +C+ I
Sbjct: 365 ---DICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNG 421
Query: 409 DPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ + ++LG +N L++YD + F NCA
Sbjct: 422 NDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCA 455
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 161/374 (43%), Gaps = 40/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +G+P + ++ DT S ++W C C C FD +S+T +
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLV 124
Query: 151 PCDDPLCRSPFKC-------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR- 202
C DP+C S + Q +C YT +Y G T G +T F G + V
Sbjct: 125 HCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNS 184
Query: 203 ---LAFGCSNDNSG-FAFGGK-ISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEA 255
+ FGCS SG K + GI GF LS+ SQL +FS+CL E
Sbjct: 185 SALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIG 244
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG 315
++ ++ + +P++ S +PH+ L+L I++ ++ P F +
Sbjct: 245 GGILV---LGEILEPGMVYSPLVPS--QPHYNLNLQSIAVNGKLLPIDPSVF--ATSNSQ 297
Query: 316 GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYP 374
G I+D+GT + ++ Y + + I+ I + + CY +S + +P
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPII-----SKGNQCYLVSTSVSQMFP 352
Query: 375 SMTFHLQ-EADYIVQPENMYFI--EPDRG---RFCVAIQDDPKYSILGAWQQQNMLIIYD 428
+F+ A +++PE+ Y I P +G +C+ Q +ILG ++ + +YD
Sbjct: 353 LASFNFAGGASMVLKPED-YLIPFGPSQGGSVMWCIGFQKVQGVTILGDLVLKDKIFVYD 411
Query: 429 LNVPALRFGSENCA 442
L + + + +C+
Sbjct: 412 LVRQRIGWANYDCS 425
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 114/417 (27%), Positives = 170/417 (40%), Gaps = 58/417 (13%)
Query: 67 RANYMASMSKPNAFQELEDIHLPMAKQDLF-YSVEVNIGTPMKPQHLLFDTASSLVWTQC 125
+ NY+ S S A P+ YS+ ++ GTP + + DT SS VW C
Sbjct: 46 KLNYLVSTSLARAHHLKNPQTTPVFSHSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPC 105
Query: 126 QP---CIRC-FDQTTPIFDPRASTTYSEIPCDDPLC----RSPFKCQNG-----KCV--- 169
C C F F P+ S++ I C +P C ++ +C + C
Sbjct: 106 TLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQIC 165
Query: 170 --YTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
Y Y G T G+A ET +G VP GCS +S + +GI GF
Sbjct: 166 PPYLILYGSG-TTGGVALSETLHL---HGL-IVPNFLVGCSVFSSR-----QPAGIAGFG 215
Query: 228 ASPLSLSSQLRNRIQGL--FSYCLVRE-----MEATS-VIKFGRDADVRRRDLETTPILL 279
P SL SQL GL FSYCL+ E++S V+ D+D + L TP++
Sbjct: 216 RGPSSLPSQL-----GLTKFSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVK 270
Query: 280 S-------DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
+ ++Y+ L ISIG V+ P +DG GG IID+GT T++
Sbjct: 271 NPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEA 330
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQ-EADYIVQPE 390
++ L + +++ R + A C+ + + P + H + AD + E
Sbjct: 331 FEILSNEFISQVKNYERALM-VEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLE 389
Query: 391 NMYFIEPDRGRFCVAIQDDPKYS------ILGAWQQQNMLIIYDLNVPALRFGSENC 441
N + R C + D ILG +Q QN + YDL L F E+C
Sbjct: 390 NYFAFLGSREVACFTVVTDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 123/275 (44%), Gaps = 24/275 (8%)
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
+ G+ + ETF F F+ L FGC +G G SGI+G + PLS+ QL
Sbjct: 3 STGVLATETFTFGAHQNFS--ANLTFGCGKLTNGTIAGA--SGIMGVSPGPLSVLKQLSI 58
Query: 240 RIQGLFSYCLVREME-ATSVIKFGRDADVRR----RDLETTPILLSDLRP-HFYLHLLEI 293
FSYCL + TS + FG AD+ + ++T P+L + + ++Y+ ++ I
Sbjct: 59 T---KFSYCLTPFTDHKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGI 115
Query: 294 SIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP 353
SIG + P + DGTGG ++D+ T + ++ ++ L + + ++ R
Sbjct: 116 SIGSKRLDVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANR-- 173
Query: 354 YNASQEFDYCYRYDSSFKA----YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD 409
+ ++ C+ P + H + P + YF EP G C+A+
Sbjct: 174 --SIDDYPVCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCLAVMQA 231
Query: 410 P---KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
P +++G QQQNM ++YDL + C
Sbjct: 232 PFEGAPNVIGNVQQQNMHVLYDLGNRKFSYAPTKC 266
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 98/223 (43%), Gaps = 15/223 (6%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI---RCFDQTTPIFDPRASTTYSEIP 151
L Y V ++GTP Q + DT S L W QC+PC C+ Q P+FDP S++Y+ +P
Sbjct: 138 LNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVP 197
Query: 152 CDDPLCRS-----PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG 206
C P+C C +C Y Y G T G+ S +T + V FG
Sbjct: 198 CGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA---VQGFFFG 254
Query: 207 CSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDAD 266
C + SG F G + G+LG SL Q G+FSYCL + + G
Sbjct: 255 CGHAQSGL-FNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGP 312
Query: 267 VRRR-DLETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAF 307
TT +L S P +Y+ +L IS+G + P AF
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF 355
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 149/370 (40%), Gaps = 66/370 (17%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
Y V +NIG P KP L D+ S L W QC PC C + P++ P S +PC
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKL---VPCVH 121
Query: 155 PLCRSPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
LC S GK C Y +Y + G+ ++FA + NG P +AF
Sbjct: 122 RLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAF 181
Query: 206 GCSNDNSGFAFGGKIS----GILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVI 259
GC D G +S G+LG +SL SQL+ R + + +CL + +
Sbjct: 182 GCGYDQQ--VRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL--SLRGGGFL 237
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRDGT 314
FG D V + TP+ S R ++ L+ + S+G + +
Sbjct: 238 FFGDDL-VPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAK------------- 283
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDSSFKA- 372
+ D+G+ T+ PYQ L+ D + R+L + C++ FK+
Sbjct: 284 --VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEE-----PDTSLPLCWKGQEPFKSV 336
Query: 373 ------YPSMTFHLQEADYI---VQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAW 418
+ S+ + + PEN Y I + G C+ I + + SI+G
Sbjct: 337 LDVRKEFKSLVLNFASGKKTLMEIPPEN-YLIVTENGNACLGILNGSEIGLKDLSIIGDI 395
Query: 419 QQQNMLIIYD 428
Q+ ++IYD
Sbjct: 396 TMQDHMVIYD 405
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 165/380 (43%), Gaps = 52/380 (13%)
Query: 97 YSVEVNIGTP---MKPQHLLFDTASSLVWTQCQPCIRCFDQTT-PIFDPRASTTYSEIPC 152
Y V++ IGTP + P+++LFDT S L WTQC+PC C T P DP S T+ + C
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 161
Query: 153 DDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVR---NGFTFVPRLA 204
DP+C +G C++ RRY G G + F F G+ +A
Sbjct: 162 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 221
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCL-----------VRE 252
FGC++ A G +GIL S +QL +R FSYC+ E
Sbjct: 222 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR----FSYCIPASEITDDDDDDDE 277
Query: 253 MEATSVIKFGRDADVR------RRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGA 306
+ S ++FG A + ++D + L + Y H GR + P
Sbjct: 278 ERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV---VYQHG-----GRLNQQQPVPV 329
Query: 307 FDIMRDGTGG--FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
+ + ++D+GT + ++ + L +R ++ + SL R+ Y+ + YCY
Sbjct: 330 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDI-SLTRR---YDLTHPSLYCY 385
Query: 365 RYDSSFKAYPSMTFHL-QEADYIVQPENMYFIEPD--RGRFCVAIQDDPKYSILGAWQQQ 421
+ + S+T AD + +++F + + C+A+ + +ILG + Q+
Sbjct: 386 LGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYPQR 444
Query: 422 NMLIIYDLNVPALRFGSENC 441
N+ + YDL+ + F + C
Sbjct: 445 NINVGYDLSTMEIAFDRDQC 464
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 149/370 (40%), Gaps = 66/370 (17%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
Y V +NIG P KP L D+ S L W QC PC C + P++ P S +PC
Sbjct: 56 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKL---VPCVH 112
Query: 155 PLCRSPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
LC S GK C Y +Y + G+ ++FA + NG P +AF
Sbjct: 113 RLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAF 172
Query: 206 GCSNDNSGFAFGGKIS----GILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVI 259
GC D G +S G+LG +SL SQL+ R + + +CL + +
Sbjct: 173 GCGYDQQ--VRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL--SLRGGGFL 228
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRDGT 314
FG D V + TP+ S R ++ L+ + S+G + +
Sbjct: 229 FFGDDL-VPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAK------------- 274
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDSSFKA- 372
+ D+G+ T+ PYQ L+ D + R+L + C++ FK+
Sbjct: 275 --VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEE-----PDTSLPLCWKGQEPFKSV 327
Query: 373 ------YPSMTFHLQEADYI---VQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAW 418
+ S+ + + PEN Y I + G C+ I + + SI+G
Sbjct: 328 LDVRKEFKSLVLNFASGKKTLMEIPPEN-YLIVTENGNACLGILNGSEIGLKDLSIIGDI 386
Query: 419 QQQNMLIIYD 428
Q+ ++IYD
Sbjct: 387 TMQDHMVIYD 396
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 132/291 (45%), Gaps = 35/291 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y E+ IGTP K ++ DT S ++W C C RC ++DP+ S+T S++
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 91
Query: 151 PCDDPLCRSPF-----KCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV---- 200
CD C + + C C Y+ Y G T G + F +G
Sbjct: 92 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 151
Query: 201 PRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLR--NRIQGLFSYCLVREMEAT 256
+ FGC + G + + GI+GF S S+ SQL +++ +F++CL +
Sbjct: 152 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL-DTINGG 210
Query: 257 SVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG 316
+ G +V + ++TTP++ + PH+ ++L I +G ++ P FD G
Sbjct: 211 GIFAIG---NVVQPKVKTTPLVPN--MPHYNVNLKSIDVGGTALKLPSHMFDTGEK--KG 263
Query: 317 FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQR-IPYNASQEFDYCYRY 366
IID+GT +T++ Y+ +M L + + I ++ QEF C++Y
Sbjct: 264 TIIDSGTTLTYLPEIVYKEIM------LAVFAKHKDITFHNVQEF-LCFQY 307
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 153/369 (41%), Gaps = 44/369 (11%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQ--PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS- 159
IGTP + Q L+ DT S L W QC + T FDP S+++S++PC PLC+
Sbjct: 87 IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 146
Query: 160 ------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
P C + + C Y+ Y G G +E F F N T P L GC+ +++
Sbjct: 147 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTF--SNSQT-TPPLILGCAKEST 203
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE-----MEATSVIKFGRDADV 267
+ GILG N LS SQ + FSYC+ + +T G + +
Sbjct: 204 ------DVKGILGMNLGRLSFISQAK---ISKFSYCIPTRSNRPGLASTGSFYLGENPNS 254
Query: 268 RR---RDLETTP--ILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
R L T P + +L P Y + LL I IG+ + P F G+G ++D+
Sbjct: 255 RGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDS 314
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYDSSF---KAYPSMT 377
G+ T + + Y + + +I+R +G R + Y D C+ + + +
Sbjct: 315 GSEFTHLVDVAYDKVKE---EIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLV 371
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDD----PKYSILGAWQQQNMLIIYDLNVPA 433
F I+ + + G CV I +I+G QQN+ + +D+
Sbjct: 372 FEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRR 431
Query: 434 LRFGSENCA 442
+ F C+
Sbjct: 432 VGFSKAECS 440
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 159/373 (42%), Gaps = 40/373 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +GTP + ++ DT S ++W C C C FDP S++ S +
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 151 PCDDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR--- 202
C D C S F+ ++G C Y+ +Y G T G + +F T
Sbjct: 143 SCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSA 202
Query: 203 -LAFGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQLRNRIQGL----FSYCLVREMEA 255
FGCSN SG + + GI G LS+ SQL +QGL FS+CL +
Sbjct: 203 PFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQL--AVQGLAPRVFSHCLKGDKSG 260
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMR-DGT 314
++ G+ ++R D TP++ S +PH+ ++L I++ I+ P F I DGT
Sbjct: 261 GGIMVLGQ---IKRPDTVYTPLVPS--QPHYNVNLQSIAVNGQILPIDPSVFTIATGDGT 315
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
IIDTGT + ++ + Y +Q + GR I Y + Q F+ +P
Sbjct: 316 ---IIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRP-ITYESYQCFEITA---GDVDVFP 368
Query: 375 SMTFHLQEADYIVQPENMY---FIEPDRGRFCVAIQ--DDPKYSILGAWQQQNMLIIYDL 429
++ +V Y F +C+ Q + +ILG ++ +++YDL
Sbjct: 369 QVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDL 428
Query: 430 NVPALRFGSENCA 442
+ + +C+
Sbjct: 429 VRQRIGWAEYDCS 441
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 154/378 (40%), Gaps = 50/378 (13%)
Query: 98 SVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC 157
+V + +G+P + ++ DT S L W C+ +F+P +S TYS++PC P C
Sbjct: 70 TVSLTVGSPPQNVTMVLDTGSELSWLHCKKT----QFLNSVFNPLSSKTYSKVPCLSPTC 125
Query: 158 RS-------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
++ P C K C Y G + ETF R G P FGC
Sbjct: 126 KTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETF----RLGSLTKPATIFGCM- 180
Query: 210 DNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDA 265
+SGF+ K +G++G N LS +Q+ FSYC + ++ V+ G +
Sbjct: 181 -DSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMG---YPKFSYC-ISGFDSAGVLLLGNAS 235
Query: 266 DVRRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
+ L TP++ +S P+F + L I + ++ P F G G ++
Sbjct: 236 FPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMV 295
Query: 320 DTGTPVTFIRNGPYQTLMQRY----DQILRSLGRQRIPYNASQEFDYCYRYDSS---FKA 372
D+GT TF+ Y L + IL+ L + + D CY DSS +
Sbjct: 296 DSGTQFTFLLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGA--MDLCYLLDSSRPNLQN 353
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPD--RGR---FCVAIQDDPKYS----ILGAWQQQNM 423
P ++ Q A+ V E + + P RGR +C + ++G QQN+
Sbjct: 354 LPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIGHHHQQNV 413
Query: 424 LIIYDLNVPALRFGSENC 441
+ +DL + C
Sbjct: 414 WMEFDLEKSRIGLADVRC 431
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 155/378 (41%), Gaps = 43/378 (11%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
+ ++ ++ + IG+P + ++ DT S L W C+ F+P S++Y+ P
Sbjct: 54 QHNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKK----LPNLNSTFNPLLSSSYTPTP 109
Query: 152 CDDPLCRS-------PFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
C+ +C + P C N C Y G + ETF+ P
Sbjct: 110 CNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSL----AGAAQPG 165
Query: 203 LAFGCSNDNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV 258
FGC D++G+ K +G++G N LSL +Q+ + FSYC+ E +A V
Sbjct: 166 TLFGCM-DSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCISGE-DAFGV 220
Query: 259 IKFGRDADVRRRDLETTPILLSDL------RPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
+ G D L+ TP++ + R + + L I + +++ P F
Sbjct: 221 LLLG-DGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHT 279
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIP-YNASQEFDYCYRYDSSF 370
G G ++D+GT TF+ Y +L + +Q L R P + D CY +S
Sbjct: 280 GAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASL 339
Query: 371 KAYPSMTFHLQEADYIVQPENMYFIEPDRGR---FCVAIQDDPKYSI----LGAWQQQNM 423
A P++T A+ V E + + +GR +C + I +G QQN+
Sbjct: 340 AAVPAVTLVFSGAEMRVSGERLLY-RVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNV 398
Query: 424 LIIYDLNVPALRFGSENC 441
+ +DL + F C
Sbjct: 399 WMEFDLVKSRVGFTETTC 416
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 153/377 (40%), Gaps = 43/377 (11%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
++ +V + +G+P + ++ DT S L W C+ F+P S++Y+ PC
Sbjct: 56 HNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKK----LPNLNSTFNPLLSSSYTPTPC 111
Query: 153 DDPLCRS-------PFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ +C + P C N C Y G + ETF+ P
Sbjct: 112 NSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSL----AGAAQPGT 167
Query: 204 AFGCSNDNSGFAFG----GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVI 259
FGC D++G+ K +G++G N LSL +Q+ FSYC+ E +A V+
Sbjct: 168 LFGCM-DSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMS---LPKFSYCISGE-DALGVL 222
Query: 260 KFGRDADVRRRDLETTPILLSDL------RPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
G D L+ TP++ + R + + L I + +++ P F G
Sbjct: 223 LLGDGTDAPS-PLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 281
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIP-YNASQEFDYCYRYDSSFK 371
G ++D+GT TF+ Y +L + +Q L R P + D CY +SF
Sbjct: 282 AGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFA 341
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGR---FCVAIQDDPKYSI----LGAWQQQNML 424
A P++T A+ V E + + +G +C + I +G QQN+
Sbjct: 342 AVPAVTLVFSGAEMRVSGERLLY-RVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVW 400
Query: 425 IIYDLNVPALRFGSENC 441
+ +DL + F C
Sbjct: 401 MEFDLLKSRVGFTQTTC 417
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 159/395 (40%), Gaps = 52/395 (13%)
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+A +P+A L D L + +Y+ ++IGTP + L+ D+ S++ + C C +
Sbjct: 66 LAEGGRPSARMRLHDDLL----TNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQ 121
Query: 131 CFDQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETF 189
C + P F P S+TYS + C+ D C S +C Y R+Y + G+ +
Sbjct: 122 CGNHQDPRFQPDLSSTYSPVKCNVDCTCDS----DKNQCTYERQYAEMSSSSGVLGEDIV 177
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSY 247
+F + R FGC N +G F GI+G LS+ QL ++ I FS
Sbjct: 178 SFGTESELK-PQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSM 236
Query: 248 C-----------LVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIG 296
C ++ M A + + VR P++ + L E+ +
Sbjct: 237 CYGGMDIGGGAMVLGAMPAPPGMIYTHSNAVR--------------SPYYNIELKEMHVA 282
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+R P F DG G ++D+GT ++ + + L + R P
Sbjct: 283 GKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGP--D 336
Query: 357 SQEFDYCY----RYDSSF-KAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAIQDD 409
S D C+ R S + +P + + + PEN F G +C+ + +
Sbjct: 337 SNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQN 396
Query: 410 PK--YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
K ++LG +N L+ YD + + F NC+
Sbjct: 397 GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 162/373 (43%), Gaps = 40/373 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +GTP + ++ DT S ++W C C C FDP S++ S +
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 151 PCDDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR--- 202
C D C S F+ ++G C Y+ +Y G T G + +F T
Sbjct: 143 SCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSA 202
Query: 203 -LAFGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQLRNRIQGL----FSYCLVREMEA 255
FGCSN +G + + GI G LS+ SQL +QGL FS+CL +
Sbjct: 203 PFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQL--AVQGLAPRVFSHCLKGDKSG 260
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMR-DGT 314
++ G+ ++R D TP++ S +PH+ ++L I++ I+ P F I DGT
Sbjct: 261 GGIMVLGQ---IKRPDTVYTPLVPS--QPHYNVNLQSIAVNGQILPIDPSVFTIATGDGT 315
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
IIDTGT + ++ + Y +Q + GR I Y + Q F+ +P
Sbjct: 316 ---IIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRP-ITYESYQCFEIT---AGDVDVFP 368
Query: 375 SMTFHLQ-EADYIVQPENMYFIEPDRGR--FCVAIQ--DDPKYSILGAWQQQNMLIIYDL 429
++ A +++P I G +C+ Q + +ILG ++ +++YDL
Sbjct: 369 EVSLSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDL 428
Query: 430 NVPALRFGSENCA 442
+ + +C+
Sbjct: 429 VRQRIGWAEYDCS 441
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 156/371 (42%), Gaps = 45/371 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI-----FDPRASTTYSEI 150
Y ++ IGTP ++ DT S W C +C ++ + +DPR+S + E+
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 151 PCDDPLCRSPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAF----------PVRNGFTF 199
CDD +C S C +C Y Y G +T G+ + + P TF
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTF 201
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
L S +NS A I GI+GF S + SQL + + +FS+CL
Sbjct: 202 GCGLQQSGSLNNSAVA----IDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-DSTNGGG 256
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ G +V ++TTPI+ ++ H ++L I++ ++ P F + T G
Sbjct: 257 IFAIG---EVVEPKVKTTPIVKNNEVYHL-VNLKSINVAGTTLQLPANIFGTTK--TKGT 310
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPYNASQEFDYCYRYDSSF-KAYPS 375
ID+G+ + ++ Y L IL + I A F C+ + S +P
Sbjct: 311 FIDSGSTLVYLPEIIYSEL------ILAVFAKHPDITMGAMYNFQ-CFHFLGSVDDKFPK 363
Query: 376 MTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYS-----ILGAWQQQNMLIIYDL 429
+TFH + + V P + Y +E + ++C QD + ILG N +++YD+
Sbjct: 364 ITFHFENDLTLDVYPYD-YLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDM 422
Query: 430 NVPALRFGSEN 440
A+ + N
Sbjct: 423 EKQAIGWTEHN 433
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 156/383 (40%), Gaps = 41/383 (10%)
Query: 77 PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
PNA L D L + +Y+ + IGTP + L+ DT S++ + C C C
Sbjct: 72 PNARMRLYDDLL----SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQD 127
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG 196
P F P S+TY + C+ C N CVY RRY + G+ + +F N
Sbjct: 128 PRFQPDESSTYHPVKCNMD-CNCDHDGVN--CVYERRYAEMSSSSGVLGEDIISF--GNQ 182
Query: 197 FTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREM 253
VP R FGC N +G + + GI+G LS+ QL +N I FS C
Sbjct: 183 SEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMH 242
Query: 254 EATSVIKFGR-----DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFD 308
+ G D R D + P++ + L EI + ++ P FD
Sbjct: 243 VGGGAMVLGGIPPPPDMVFSRSDPYRS--------PYYNIELKEIHVAGKPLKLSPSTFD 294
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCY--- 364
+ GT ++D+GT ++ P + + D I++ + + + D C+
Sbjct: 295 -RKHGT---VLDSGTTYAYL---PEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGA 347
Query: 365 -RYDSSF-KAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAI-QDDPKYSILGAWQ 419
R S KA+P + + + PEN F G +C+ I ++ ++LG
Sbjct: 348 GRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGII 407
Query: 420 QQNMLIIYDLNVPALRFGSENCA 442
+N L+ YD + F NC+
Sbjct: 408 VRNTLVTYDRENEKIGFWKTNCS 430
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 147/358 (41%), Gaps = 45/358 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
FYSV + IG P KP L D+ S L W QC PC+ C P + P I C+D
Sbjct: 67 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNK----GPITCND 122
Query: 155 PLC-------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
P+C + P K + +C Y Y + G+ + F+ + NG PRLAFGC
Sbjct: 123 PMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGC 182
Query: 208 SNDNS--GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDA 265
D S G + G+LG S+ +QLR S L+R + + G
Sbjct: 183 GYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLR-------SLGLIRSIVGHCLSGRGGGF 235
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG--AFDIMRDGTGG--FIIDT 321
L TTP ++ + G P F+ G G + D+
Sbjct: 236 LFLGDGLSTTPGII--------WTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDS 287
Query: 322 GTPVTFIRNGPYQT---LMQRY-DQILRSLGRQRIP--YNASQEFDYCYRYDSSFKAYPS 375
G+ T+ Y+T L+++Y + L+ + +P + ++ F + + FK + +
Sbjct: 288 GSSYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPF-A 346
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQNMLIIYD 428
++F ++ + P Y I G C+ I + + +++G Q+ ++IYD
Sbjct: 347 LSFTKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKMVIYD 404
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 162/378 (42%), Gaps = 46/378 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC------FDQTTPIFDPRASTTYSE 149
Y ++ IGTP K ++ DT S +VW C C C + TP +D STT
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTP-YDLEESTTGKL 144
Query: 150 IPCDDPLC----RSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTF 199
+ CD+ C P N C Y + Y G T G ++ + +G
Sbjct: 145 VSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAA 204
Query: 200 VPRLAFGCSNDNSG-FAFGGK--ISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREME 254
+ FGC SG G+ + GILGF S S+ SQL + +++ +F++CL
Sbjct: 205 NGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL-DGTN 263
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM-RDG 313
+ G V + + TP++ + +PH+ +++ + +G I+ F+ R G
Sbjct: 264 GGGIFAMGH---VVQPKVNMTPLVPN--QPHYNVNMTGVQVGHIILNISADVFEAGDRKG 318
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
T IID+GT + ++ Y+ L+ + +L Q I + + F Y R D F
Sbjct: 319 T---IIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTI-HGEYKCFQYSERVDDGF--- 371
Query: 374 PSMTFHLQEA--------DYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLI 425
P + FH + + +Y+ Q EN++ I + +D ++ G N L+
Sbjct: 372 PPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSG--MQSRDRKNVTLFGDLVLSNKLV 429
Query: 426 IYDLNVPALRFGSENCAN 443
+YDL + + NC++
Sbjct: 430 LYDLENQTIGWTEYNCSS 447
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 165/382 (43%), Gaps = 54/382 (14%)
Query: 97 YSVEVNIGTP---MKPQHLLFDTASSLVWTQCQPCIRCFDQTT-PIFDPRASTTYSEIPC 152
Y V++ IGTP + P+++LFDT S L WTQC+PC C T P DP S T+ + C
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 181
Query: 153 DDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVR---NGFTFVPRLA 204
DP+C +G C++ RRY G G + F F G+ +A
Sbjct: 182 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 241
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCL-------------V 250
FGC++ A G +GIL S +QL +R FSYC+
Sbjct: 242 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR----FSYCIPASEITDDDDDDDD 297
Query: 251 REMEATSVIKFGRDADVR------RRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPP 304
E + S ++FG A + ++D + L + Y H GR + P
Sbjct: 298 DEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV---VYQHG-----GRLNQQQPV 349
Query: 305 GAFDIMRDGTGG--FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
+ + ++D+GT + ++ + L +R ++ + SL R+ Y+ + Y
Sbjct: 350 PVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDI-SLTRR---YDLTHPSLY 405
Query: 363 CYRYDSSFKAYPSMTFHL-QEADYIVQPENMYFIEPD--RGRFCVAIQDDPKYSILGAWQ 419
CY + + S+T AD + +++F + + C+A+ + +ILG +
Sbjct: 406 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYP 464
Query: 420 QQNMLIIYDLNVPALRFGSENC 441
Q+N+ + YDL+ + F + C
Sbjct: 465 QRNINVGYDLSTMEIAFDRDQC 486
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 153/371 (41%), Gaps = 46/371 (12%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI-FDPRASTTYSEIPCDDPLC 157
V + IGTP + Q ++ DT S L W QC + +T P FDP S+++S +PC+ LC
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQC----KVPPKTPPTAFDPLLSSSFSVLPCNHSLC 135
Query: 158 RS-------PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
+ P C QN C Y+ Y G G RE F F + P L GC+
Sbjct: 136 KPRVPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF---SSSQTTPPLILGCAT 192
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFG------- 262
D+S GILG N LS SS + FSYC+ + G
Sbjct: 193 DSS------DTQGILGMNLGRLSFSSLAK---ISKFSYCVPPRRSQSGSSPTGSFYLGPN 243
Query: 263 -RDADVRRRDLET--TPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
A + +L T + +L P Y L +L I I + AF G G +
Sbjct: 244 PSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTL 303
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYDSSF--KAYPS 375
ID+GT TF+ + Y + + +I++ G + + Y D C+ D+ + +
Sbjct: 304 IDSGTWFTFLVDEAYSKVKE---EIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGN 360
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDP----KYSILGAWQQQNMLIIYDLNV 431
M F + IV + G C+ I +I+G + QQ++ + +DL
Sbjct: 361 MAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVASNIIGNFHQQDLWVEFDLVG 420
Query: 432 PALRFGSENCA 442
+ FG +C+
Sbjct: 421 RRVGFGRTDCS 431
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 165/382 (43%), Gaps = 54/382 (14%)
Query: 97 YSVEVNIGTP---MKPQHLLFDTASSLVWTQCQPCIRCFDQTT-PIFDPRASTTYSEIPC 152
Y V++ IGTP + P+++LFDT S L WTQC+PC C T P DP S T+ + C
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 163
Query: 153 DDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVR---NGFTFVPRLA 204
DP+C +G C++ RRY G G + F F G+ +A
Sbjct: 164 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 223
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCL-------------V 250
FGC++ A G +GIL S +QL +R FSYC+
Sbjct: 224 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR----FSYCIPASEITDDDDDDDD 279
Query: 251 REMEATSVIKFGRDADVR------RRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPP 304
E + S ++FG A + ++D + L + Y H GR + P
Sbjct: 280 DEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV---VYQHG-----GRLNQQQPV 331
Query: 305 GAFDIMRDGTGG--FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
+ + ++D+GT + ++ + L +R ++ + SL R+ Y+ + Y
Sbjct: 332 PVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDI-SLTRR---YDLTHPSLY 387
Query: 363 CYRYDSSFKAYPSMTFHL-QEADYIVQPENMYFIEPD--RGRFCVAIQDDPKYSILGAWQ 419
CY + + S+T AD + +++F + + C+A+ + +ILG +
Sbjct: 388 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYP 446
Query: 420 QQNMLIIYDLNVPALRFGSENC 441
Q+N+ + YDL+ + F + C
Sbjct: 447 QRNINVGYDLSTMEIAFDRDQC 468
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 156/371 (42%), Gaps = 45/371 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI-----FDPRASTTYSEI 150
Y ++ IGTP ++ DT S W C +C ++ + +DPR+S + E+
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 151 PCDDPLCRSPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAF----------PVRNGFTF 199
CDD +C S C +C Y Y G +T G+ + + P TF
Sbjct: 118 KCDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTF 177
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
L S +NS A I GI+GF S + SQL + + +FS+CL
Sbjct: 178 GCGLQQSGSLNNSAVA----IDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-DSTNGGG 232
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ G +V ++TTPI+ ++ H ++L I++ ++ P F + T G
Sbjct: 233 IFAIG---EVVEPKVKTTPIVKNNEVYHL-VNLKSINVAGTTLQLPANIFGTTK--TKGT 286
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPYNASQEFDYCYRYDSSF-KAYPS 375
ID+G+ + ++ Y L IL + I A F C+ + S +P
Sbjct: 287 FIDSGSTLVYLPEIIYSEL------ILAVFAKHPDITMGAMYNFQ-CFHFLGSVDDKFPK 339
Query: 376 MTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYS-----ILGAWQQQNMLIIYDL 429
+TFH + + V P + Y +E + ++C QD + ILG N +++YD+
Sbjct: 340 ITFHFENDLTLDVYPYD-YLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDM 398
Query: 430 NVPALRFGSEN 440
A+ + N
Sbjct: 399 EKQAIGWTEHN 409
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 147/358 (41%), Gaps = 45/358 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
FYSV + IG P KP L D+ S L W QC PC+ C P + P I C+D
Sbjct: 34 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNK----GPITCND 89
Query: 155 PLC-------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
P+C + P K + +C Y Y + G+ + F+ + NG PRLAFGC
Sbjct: 90 PMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGC 149
Query: 208 SNDNS--GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDA 265
D S G + G+LG S+ +QLR S L+R + + G
Sbjct: 150 GYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLR-------SLGLIRSIVGHCLSGRGGGF 202
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG--AFDIMRDGTGG--FIIDT 321
L TTP ++ + G P F+ G G + D+
Sbjct: 203 LFLGDGLSTTPGII--------WTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDS 254
Query: 322 GTPVTFIRNGPYQT---LMQRY-DQILRSLGRQRIP--YNASQEFDYCYRYDSSFKAYPS 375
G+ T+ Y+T L+++Y + L+ + +P + ++ F + + FK + +
Sbjct: 255 GSSYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPF-A 313
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQNMLIIYD 428
++F ++ + P Y I G C+ I + + +++G Q+ ++IYD
Sbjct: 314 LSFTKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKMVIYD 371
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 155/374 (41%), Gaps = 39/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +G+P K ++ DT S ++W C C C FD S+T + +
Sbjct: 82 LYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 151 PCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPR 202
C DP+C S Q +C YT +Y G T G +T F V G + V
Sbjct: 142 SCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVAN 201
Query: 203 ----LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREME 254
+ FGCS SG + GI GF LS+ SQL +R +FS+CL
Sbjct: 202 SSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGEN 261
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
V+ G ++ + +P++ S PH+ L+L I++ ++ F +
Sbjct: 262 GGGVLVLG---EILEPSIVYSPLVPS--LPHYNLNLQSIAVNGQLLPIDSNVFATTNN-- 314
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAY 373
G I+D+GT + ++ Y + + + I + + CY +S +
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPII-----SKGNQCYLVSNSVGDIF 369
Query: 374 PSMTFH-LQEADYIVQPENM---YFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYD 428
P ++ + + A ++ PE+ Y +C+ Q + ++ILG ++ + +YD
Sbjct: 370 PQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYD 429
Query: 429 LNVPALRFGSENCA 442
L + + NC+
Sbjct: 430 LANQRIGWADYNCS 443
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 152/376 (40%), Gaps = 55/376 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V +NIG P KP L DT S L W QC PC++C + P + PR + +PC D
Sbjct: 19 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNNL----VPCMD 74
Query: 155 PLCRS-----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFG-C 207
P+C+S +C+N G+C Y Y G + G+ R+TF + P LA G C
Sbjct: 75 PICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNLNFTSEKRHSPLLALGLC 134
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREMEATSVIKFGRDA 265
D I G+LG S+ SQL + ++ + +CL
Sbjct: 135 GYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLF------- 187
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG-TGGF-----II 319
+++ + + + P +H + PG ++ DG T GF
Sbjct: 188 -FGDDLYDSSRVAWTPMSPD----------AKH---YSPGLAELTFDGKTTGFKNLLTTF 233
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQIL------RSLGRQRIP--YNASQEFDYCYRYDSSFK 371
D+G T++ + YQ L+ + L +L Q +P + + F FK
Sbjct: 234 DSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFK 293
Query: 372 AYP-SMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQNMLI 425
+ S T + + P Y I +G C+ I + + +++G Q+ ++
Sbjct: 294 TFALSFTNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVV 353
Query: 426 IYDLNVPALRFGSENC 441
IYD + + NC
Sbjct: 354 IYDNEKERIGWAPGNC 369
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 165/382 (43%), Gaps = 54/382 (14%)
Query: 97 YSVEVNIGTP---MKPQHLLFDTASSLVWTQCQPCIRCFDQTT-PIFDPRASTTYSEIPC 152
Y V++ IGTP + P+++LFDT S L WTQC+PC C T P DP S T+ + C
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 160
Query: 153 DDPLCRSPFKCQNG-----KCVYTRRYHVGDVTRGLASRETFAFPVR---NGFTFVPRLA 204
DP+C +G C++ RRY G G + F F G+ +A
Sbjct: 161 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 220
Query: 205 FGCSNDNSGFAFGGKISGILGFNASPLSLSSQLR-NRIQGLFSYCL-------------V 250
FGC++ A G +GIL S +QL +R FSYC+
Sbjct: 221 FGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDR----FSYCIPASEITDDDDDDDD 276
Query: 251 REMEATSVIKFGRDADVR------RRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPP 304
E + S ++FG A + ++D + L + Y H GR + P
Sbjct: 277 DEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV---VYQHG-----GRLNQQQPV 328
Query: 305 GAFDIMRDGTGG--FIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
+ + ++D+GT + ++ + L +R ++ + SL R+ Y+ + Y
Sbjct: 329 PVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDI-SLTRR---YDLTHPSLY 384
Query: 363 CYRYDSSFKAYPSMTFHL-QEADYIVQPENMYFIEPD--RGRFCVAIQDDPKYSILGAWQ 419
CY + + S+T AD + +++F + + C+A+ + +ILG +
Sbjct: 385 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR-AILGVYP 443
Query: 420 QQNMLIIYDLNVPALRFGSENC 441
Q+N+ + YDL+ + F + C
Sbjct: 444 QRNINVGYDLSTMEIAFDRDQC 465
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 157/374 (41%), Gaps = 39/374 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +G+P K ++ DT S ++W C C C FD S+T + +
Sbjct: 82 LYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 151 PCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAF-PVRNGFTFVPR 202
C DP+C S Q +C YT +Y G T G +T F V G + V
Sbjct: 142 SCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVAN 201
Query: 203 ----LAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREME 254
+ FGCS SG + GI GF LS+ SQL +R +FS+CL
Sbjct: 202 SSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGEN 261
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
V+ G ++ + +P++ S +PH+ L+L I++ ++ F +
Sbjct: 262 GGGVLVLG---EILEPSIVYSPLVPS--QPHYNLNLQSIAVNGQLLPIDSNVFATTNN-- 314
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAY 373
G I+D+GT + ++ Y ++ + + I + + CY +S +
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPII-----SKGNQCYLVSNSVGDIF 369
Query: 374 PSMTFH-LQEADYIVQPENM---YFIEPDRGRFCVAIQD-DPKYSILGAWQQQNMLIIYD 428
P ++ + + A ++ PE+ Y +C+ Q + ++ILG ++ + +YD
Sbjct: 370 PQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYD 429
Query: 429 LNVPALRFGSENCA 442
L + + +C+
Sbjct: 430 LANQRIGWADYDCS 443
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 82/160 (51%), Gaps = 12/160 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + + +GTP +++ DT S +VW QC PC C++QT IFDP+ S T++ +PC L
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRL 194
Query: 157 CR---SPFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
CR +C ++ C+Y Y G T G S ET F V + GC +D
Sbjct: 195 CRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR----VDHVPLGCGHD 250
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
N G +G+LG LS SQ +NR G FSYCLV
Sbjct: 251 NEGLFV--GAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLV 288
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 89/378 (23%), Positives = 155/378 (41%), Gaps = 60/378 (15%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y V+ NIG P KP L DT S L W QC PCI+C P++ P T + C D
Sbjct: 66 YYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQP----TNDLVVCKD 121
Query: 155 PLCRS----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
P+C S ++C + +C Y Y G + G+ + F + +G PRL GC
Sbjct: 122 PICASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCGY 181
Query: 210 DN-SGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDAD 266
D G A+ + G+LG S+ +QL ++ ++ + +C R + FG D
Sbjct: 182 DQLPGIAY-HPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSR--RGGGYLFFGDDI- 237
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG------FIID 320
+++ ++ + + + H + PG +++ +G + D
Sbjct: 238 -----YDSSKVIWTPMSRDYLKH------------YTPGFAELILNGRSSGLKNLLVVFD 280
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHL 380
+G+ T+ YQTL+ + L G+ C+R FK+ +
Sbjct: 281 SGSSYTYFNTQTYQTLLSFIKKDLH--GKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYF 338
Query: 381 Q------------EADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNM 423
+ ++ + +Q E+ Y I +G C+ I + + Y+I+G Q
Sbjct: 339 KPLALSFGSGWKTKSQFEIQQES-YLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEK 397
Query: 424 LIIYDLNVPALRFGSENC 441
L+IYD + + NC
Sbjct: 398 LVIYDNEKQVIGWQPSNC 415
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 149/371 (40%), Gaps = 67/371 (18%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
Y V +NIG P KP L D+ S L W QC PC C + P++ P S +PC
Sbjct: 63 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKL---VPCVH 119
Query: 155 PLCRSPFKCQNG----------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLA 204
LC S G +C Y +Y + G+ ++FA + NG P +A
Sbjct: 120 RLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLTNGSVARPSVA 179
Query: 205 FGCSNDNSGFAFGGKIS----GILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSV 258
FGC D G +S G+LG +SL SQL+ R + + +CL +
Sbjct: 180 FGCGYDQQ--VRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL--SLRGGGF 235
Query: 259 IKFGRDADVRRRDLETTPILLSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIMRDG 313
+ FG D V + TP+ S R ++ L+ + S+G + +
Sbjct: 236 LFFGDDL-VPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAK------------ 282
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNASQEFDYCYRYDSSFKA 372
+ D+G+ T+ PYQ L+ D + R+L + C++ FK+
Sbjct: 283 ---VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEE-----PDTSLPLCWKGQEPFKS 334
Query: 373 -------YPSMTFHLQEADYI---VQPENMYFIEPDRGRFCVAIQDDPK-----YSILGA 417
+ S+ + + PEN Y I + G C+ I + + SI+G
Sbjct: 335 VLDVRKEFKSLVLNFASGKKTLMEIPPEN-YLIVTENGNACLGILNGSEIGLKDLSIIGD 393
Query: 418 WQQQNMLIIYD 428
Q+ ++IYD
Sbjct: 394 ITMQDHMVIYD 404
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 160/378 (42%), Gaps = 46/378 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEI 150
Y ++ IGTP K +L DT S ++W C C C +++ ++D + S++ +
Sbjct: 82 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLV 141
Query: 151 PCDDPLCRSPFK------CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFV 200
PCD C+ N C Y Y G T G ++ + +G +
Sbjct: 142 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSAN 201
Query: 201 PRLAFGCSNDNSGFAFGGK---ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEA 255
+ FGC SG + GILGF + S+ SQL +++ +F++CL +
Sbjct: 202 GSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL-NGVNG 260
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT- 314
+ G V + + TP+L +PH+ +++ + +G + + D G
Sbjct: 261 GGIFAIGH---VVQPKVNMTPLLPD--QPHYSVNMTAVQVGHTFLSL---STDTSAQGDR 312
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAY 373
G IID+GT + ++ G Y+ L+ + L Q + E+ C++Y S +
Sbjct: 313 KGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTL----HDEYT-CFQYSESVDDGF 367
Query: 374 PSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQNMLI 425
P++TF + + V P + F P +C+ Q D ++LG N L+
Sbjct: 368 PAVTFFFENGLSLKVYPHDYLF--PSVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLV 425
Query: 426 IYDLNVPALRFGSENCAN 443
YDL A+ + NC++
Sbjct: 426 FYDLENQAIGWAEYNCSS 443
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 110/433 (25%), Positives = 186/433 (42%), Gaps = 49/433 (11%)
Query: 26 SSESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISK---ARANYMASMSKPNAFQE 82
+S++ L +IPI+S SP P Q ++ + +++ AR Y++S+ A Q
Sbjct: 26 ASQADDSDLSIIPIYSKCSPFIPPK--QEPLVNTVIDMASKDPARLKYLSSL----AAQM 79
Query: 83 LEDIHLPMAKQDLF---YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIF 139
+ + +Q L Y V V +GTP + ++ DT++ W C C C T
Sbjct: 80 TTAVPIAPGQQVLNIGNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTF--- 136
Query: 140 DPRASTTYSEIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVR 194
S+TY + C C F C + CV+ + Y GD + E V
Sbjct: 137 STNTSSTYGSLDCSMAQCTQVRGFSCPATGSSSCVFNQSYG-GDSSFSATLVEDSLRLVN 195
Query: 195 NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VRE 252
+ +P AFGC N SG + + LG SL +Q + GLFSYCL +
Sbjct: 196 D---VIPNFAFGCINSISGGSVPPQGLLGLGRGPL--SLIAQSGSLYSGLFSYCLPSFKS 250
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMR 311
+ +K G + + + TP+L + RP +Y++L +S+GR +V P
Sbjct: 251 YYFSGSLKLGPAG--QPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNP 308
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI--PYNASQEFDYCYRYDSS 369
+ G IID+GT +T Y + + R+++ P+++ FD C+ +
Sbjct: 309 NTGAGTIIDSGTVITRFVQPIYTAIRDEF--------RKQVAGPFSSLGAFDTCFAATNE 360
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNML 424
A P++T H + ++ EN C+A+ P +++ QQQN+
Sbjct: 361 AVA-PAVTLHFTGLNLVLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLR 419
Query: 425 IIYDLNVPALRFG 437
+++D VP R G
Sbjct: 420 LLFD--VPNSRLG 430
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/382 (24%), Positives = 155/382 (40%), Gaps = 34/382 (8%)
Query: 81 QELEDIHLPMAKQDLF--------YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCF 132
+ L++ LP A+ LF Y+ + IGTP + L+ DT S++ + C C +C
Sbjct: 53 RHLQNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCG 112
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFP 192
P F P S+TY + C +P C + +C Y RRY + G+ + + +F
Sbjct: 113 KHQDPRFQPDLSSTYRPVKC-NPSCNC--DDEGKQCTYERRYAEMSSSSGVIAEDVVSF- 168
Query: 193 VRNGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCL 249
N P R FGC N +G + + GI+G LS+ QL ++ I FS C
Sbjct: 169 -GNESELKPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCY 227
Query: 250 VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI 309
+ G+ + + S P++ + L E+ + ++ P FD
Sbjct: 228 GGMDVGGGAMVLGQISPPPNMVFSHSNPYRS---PYYNIELKELHVAGKPLKLKPKVFDE 284
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY----R 365
G ++D+GT + + L + +R L ++IP D C+ R
Sbjct: 285 KH----GTVLDSGTTYAYFPEAAFHALKDAIMKEIRHL--KQIPGPDPNYHDICFSGAGR 338
Query: 366 YDSSF-KAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQ 420
S K +P + + + PEN F G +C+ I + ++LG
Sbjct: 339 EVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVV 398
Query: 421 QNMLIIYDLNVPALRFGSENCA 442
+N L+ YD + F NC+
Sbjct: 399 RNTLVTYDRENDKIGFWKTNCS 420
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 156/371 (42%), Gaps = 45/371 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI-----FDPRASTTYSEI 150
Y ++ IGTP ++ DT S W C +C ++ + +DPR+S + E+
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 151 PCDDPLCRSPFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAF----------PVRNGFTF 199
CDD +C S C +C Y Y G +T G+ + + P TF
Sbjct: 118 KCDDTICTSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTF 177
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATS 257
L S +NS A I GI+GF S + SQL + + +FS+CL
Sbjct: 178 GCGLQQSGSLNNSAVA----IDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-DSTNGGG 232
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
+ G +V ++TTPI+ ++ H ++L I++ ++ P F + T G
Sbjct: 233 IFAIG---EVVEPKVKTTPIVKNNEVYHL-VNLKSINVAGTTLQLPANIFGTTK--TKGT 286
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-RIPYNASQEFDYCYRYDSSF-KAYPS 375
ID+G+ + ++ Y L IL + I A F C+ + S +P
Sbjct: 287 FIDSGSTLVYLPEIIYSEL------ILAVFAKHPDITMGAMYNFQ-CFHFLGSVDDKFPK 339
Query: 376 MTFHLQ-EADYIVQPENMYFIEPDRGRFCVAIQDDPKYS-----ILGAWQQQNMLIIYDL 429
+TFH + + V P + Y +E + ++C QD + ILG N +++YD+
Sbjct: 340 ITFHFENDLTLDVYPYD-YLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDM 398
Query: 430 NVPALRFGSEN 440
A+ + N
Sbjct: 399 EKQAIGWTEHN 409
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 143/359 (39%), Gaps = 39/359 (10%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
V + IGTP L+FDT S L+WTQCQPC+ C Q ++DP + TY+ +
Sbjct: 90 VFLGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSSS---- 145
Query: 159 SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
Y Y T G + ETFA G V + FGC N G+
Sbjct: 146 -----------YNYTYSKQSFTSGYFATETFAL----GNVTVANITFGCGTRNQGYY--D 188
Query: 219 KISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKF-GRDADV---RRRDLET 274
++G+ G S L FSYC S F G ++
Sbjct: 189 NVAGVFGVGRGGRGGVSLLNQLGIDRFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAA 248
Query: 275 TPILLSD--LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
+ +++D L+ +++ L+ +++G +V GA G +ID+ +PVT +
Sbjct: 249 STPMVADPVLKSGYFVKLVGVTVGATLVDV-AGASS-AEGGGRALVIDSTSPVTVLDEAT 306
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP-----SMTFHLQ--EADY 385
Y + + L L +A D C+ ++ A P +MT H AD
Sbjct: 307 YGPVRRALVAQLAPLKEANANASAGVGLDLCFEL-AAGGATPTPPNVTMTLHFDGGAADL 365
Query: 386 IVQPENMYFIEPDRGRFCVAIQDDPKYS--ILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++ P + + G C+ + +LG+W + L++YDL + F +CA
Sbjct: 366 VLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQPLDCA 424
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 151/391 (38%), Gaps = 69/391 (17%)
Query: 114 FDTASSLVWTQCQP--CIRCFDQTTP-IFDP-------------RASTTYSEIPCDDPLC 157
DT S +VW C P CI C + P P RA +T P LC
Sbjct: 109 MDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCKSRACSTAHNSPSTSDLC 168
Query: 158 ---RSPFK------CQNGKC-VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
+ P C N C + Y G + L N + FGC
Sbjct: 169 AIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLIMPSTSNKPFSLKDFTFGC 228
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL---FSYCLVR------EMEATSV 258
++ A G I G+ GF LSL +QL N L FSYCLV ++ S
Sbjct: 229 AHS----ALGEPI-GVAGFGFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTKLHHPSP 283
Query: 259 IKFGRDADVRRRDLET------TPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMR 311
+ G+ V+ RD + TP+L + P+FY +E IS+G VR P I R
Sbjct: 284 LILGK---VKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNALIRIDR 340
Query: 312 DGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK 371
DG GG ++D+GT T + G Y ++ D+ + + ++ + CY + +
Sbjct: 341 DGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLEGNGV 400
Query: 372 -----AYPSMTFHLQEADYIVQPENMYFIE------PDRGR--FCVAIQDDPKYS----- 413
P + FH +V P YF E +GR C+ + D S
Sbjct: 401 ERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESEGGPG 460
Query: 414 -ILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
LG +QQQ ++YDL + F CA+
Sbjct: 461 ATLGNYQQQGFQVVYDLEERRVGFAPRKCAS 491
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 161/406 (39%), Gaps = 75/406 (18%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFD------QTTPIFDPRASTTYSEI 150
Y+ ++GTP +P +L DT S L W C C + P+F P+ S++ +
Sbjct: 99 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158
Query: 151 PCDDPLCRS-------PFKCQNGKCV----------------YTRRYHVGDVTRGLASRE 187
C +P C+ KC+ C Y Y G T GL +
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIAD 217
Query: 188 TFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--F 245
T P R VP GCS + SG+ GF S+ +QL GL F
Sbjct: 218 TLRAPGRA----VPGFVLGCSL----VSVHQPPSGLAGFGRGAPSVPAQL-----GLPKF 264
Query: 246 SYCLV-REMEATSVIK--FGRDADVRRRDLETTPILLS---DLRPH---FYLHLLEISIG 296
SYCL+ R + + + ++ P++ S D P+ +YL L +++G
Sbjct: 265 SYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVG 324
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
VR P AF G+GG I+D+GT T++ +Q + + GR + +A
Sbjct: 325 GKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVG--GRYKRSKDA 382
Query: 357 SQEFDY--CYRYDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRG---RFCVAIQDD 409
C+ + A P ++FH + + P YF+ RG C+A+ D
Sbjct: 383 EDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTD 442
Query: 410 ------------PKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
ILG++QQQN L+ YDL L F ++C +
Sbjct: 443 FGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTS 488
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 148/373 (39%), Gaps = 49/373 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V +NIG P KP L DT S L W QC PC C P++ P T +PC +
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRP---TKNKLVPCANS 113
Query: 156 LC------RSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
+C SP K +C Y +Y + G+ ++F+ P+RN P L+FGC
Sbjct: 114 ICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSNVRPSLSFGC 173
Query: 208 SNDNSGFAFGGK---ISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFG 262
D G G+LG +SL SQL+ + + + +CL + FG
Sbjct: 174 GYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL--STSGGGFLFFG 231
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
D V + P++ S ++ + R + P + D+G
Sbjct: 232 DDM-VPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPME----------VVFDSG 280
Query: 323 TPVTFIRNGPYQ-TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQ 381
+ T+ PYQ T+ + +SL + P C++ +FK+ + +
Sbjct: 281 STYTYFSAQPYQATISAIKGSLSKSLKQVSDP-----SLPLCWKGQKAFKSVSDVKKDFK 335
Query: 382 EADYI--------VQPENMYFIEPDRGRFCVAIQDDP----KYSILGAWQQQNMLIIYDL 429
+I + PEN Y I G C+ I D +SI+G Q+ ++IYD
Sbjct: 336 SLQFIFGKNAVMEIPPEN-YLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDN 394
Query: 430 NVPALRFGSENCA 442
L + +C+
Sbjct: 395 EKAQLGWIRGSCS 407
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 103/415 (24%), Positives = 171/415 (41%), Gaps = 46/415 (11%)
Query: 34 LKLIPIFSPESPLYPGNL-SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK 92
L +IP++ SP P S R+ M AR +Y++++ + P+A
Sbjct: 35 LNVIPMYGKCSPFNPPKADSWDNRVINMASKDPARMSYLSTL-----VAQKTATSAPIAS 89
Query: 93 QDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYS 148
F Y V V IGTP + ++ DT++ + CI C T F P ST++
Sbjct: 90 GQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGCSATT---FYPNVSTSFV 146
Query: 149 EIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+ C P C C +G C + + Y + L +R +P
Sbjct: 147 PLDCSVPQCGQVRGLSCPATGSGACSFNQSYAGSTFSATLVQDS-----LRLATDVIPSY 201
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKF 261
+FG N SG + + LG SL SQ G+FSYCL + + +K
Sbjct: 202 SFGSINAISGSSVPAQGLLGLGRGPL--SLLSQSGAIYSGVFSYCLPSFKSYYFSGSLKL 259
Query: 262 GRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
G + + + TTP+L + RP +Y++L IS+GR V P G IID
Sbjct: 260 GPVG--QPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFNPSTGAGTIID 317
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRI--PYNASQEFDYCYRYDSSFKAYPSMTF 378
+GT +T Y + + R+++ P+++ FD C+ + A P++T
Sbjct: 318 SGTVITRFVEPIYNAVRDEF--------RKQVTGPFSSLGAFDTCFVKNYETLA-PAITL 368
Query: 379 HLQEADYIVQPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYD 428
H + D + EN C+A+ P +++ +QQQN+ +++D
Sbjct: 369 HFTDLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFD 423
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 135/343 (39%), Gaps = 31/343 (9%)
Query: 110 QHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFK--C 163
Q ++ D+AS + W QC PC C Q +DP S T + C P C + P+ C
Sbjct: 29 QTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGC 88
Query: 164 QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGI 223
N +C Y RY G T G + N V FGCS+ G +F + +GI
Sbjct: 89 ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA---VSGFKFGCSHAEQG-SFDARAAGI 144
Query: 224 LGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLR 283
+ P SL SQ +R FSYC+ + G R + T +
Sbjct: 145 MALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAA 204
Query: 284 PHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQI 343
+ + L I++G + P F G ++D+ T +T + YQ L +
Sbjct: 205 TFYGVLLRTITVGGQRLGVAPAVF------AAGSVLDSRTAITRLPPTAYQALRAAFRSS 258
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL-QEADYIVQPENMYFIEPDRGR 401
+ ++ R P D CY + P ++ + A + P + F +
Sbjct: 259 M-TMYRSAPPKG---YLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFND----- 309
Query: 402 FCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A DD +LG+ QQQ + ++YD+ A+ F C
Sbjct: 310 -CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 111/415 (26%), Positives = 171/415 (41%), Gaps = 87/415 (20%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ----PCIRCFD------QTTPIFDPRASTT 146
Y + +NIGTP + + DT S L W C CI C D +++ IF P S++
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70
Query: 147 YSEIPCDDPLCR------SPFK-CQNGKC---------------VYTRRYHVGDVTRGLA 184
C C +PF C C + Y G + G+
Sbjct: 71 SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130
Query: 185 SRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
+R+ R+ VPR +FGC + I GI GF LSL SQL +G
Sbjct: 131 TRDILKARTRD----VPRFSFGCVTS----TYHEPI-GIAGFGRGLLSLPSQLGFLEKG- 180
Query: 245 FSYC-----LVREMEATSVIKFGRDA-DVRRRD-LETTPILLSDLRPH-FYLHLLEISIG 296
FS+C V +S + G A + D L+ TP+L + + P+ +Y+ L I+IG
Sbjct: 181 FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIG 240
Query: 297 RHI--VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPY 354
+I + P G GG ++D+GT T + N P+ Y Q+L L + I Y
Sbjct: 241 TNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPN-PF------YSQLLTIL-QSTITY 292
Query: 355 NASQE------FDYCYRY-----------DSSFKAYPSMTFHLQEADYIVQPENMYFI-- 395
+ E FD CY+ + +PS+TF+ ++ P+ F
Sbjct: 293 PRATETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAM 352
Query: 396 -EPDRGRF--CVAIQ--DDPKY---SILGAWQQQNMLIIYDLNVPALRFGSENCA 442
P G C+ Q +D Y + G++QQQN+ ++YDL + F + +C
Sbjct: 353 SAPSDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 407
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 146/355 (41%), Gaps = 31/355 (8%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q Y V IGTP + L DT++ W C C C + +F P STT+ + C
Sbjct: 74 QSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGC---ASTLFAPEKSTTFKNVSC 130
Query: 153 DDPLCRSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
P C+ C C + Y + L ++T VP FGC +
Sbjct: 131 AAPECKQVPNPGCGVSSCNFNLTYGSSSIAANLV-QDTITLATDP----VPSYTFGCVSK 185
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVR 268
+G + + LG SL SQ +N Q FSYCL + + + ++ G A +
Sbjct: 186 TTGTSAPPQGLLGLGRGPL--SLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPK 243
Query: 269 RRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
R ++ TP+L + R +Y++L I +GR +V PP A G I D+GT T
Sbjct: 244 R--IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTR 301
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIV 387
+ Y + D+ R +G ++ + FD CY P++TF + +
Sbjct: 302 LVAPVYVAV---RDEFRRRVG-PKLTVTSLGGFDTCYNVP---IVVPTITFIFTGMNVTL 354
Query: 388 QPENMYFIEPDRGRFCVAIQDDPK-----YSILGAWQQQNMLIIYDLNVPALRFG 437
+N+ C+A+ P +++ QQQN ++YD VP R G
Sbjct: 355 PQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYD--VPNSRVG 407
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 164/385 (42%), Gaps = 41/385 (10%)
Query: 86 IHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CF-DQTTPIFDPRA 143
+H + FY+ +++GTP + ++ DT S++ + C C R C FDP +
Sbjct: 52 LHGAVKDYGYFYAT-LHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPAS 110
Query: 144 STTYSEIPCDDPLC---RSPFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
S++ + I CD C R P C + +C Y R Y + GL + +R+G
Sbjct: 111 SSSSAVIGCDSDKCICGRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQL--QLRDGAVE 168
Query: 200 VPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATS 257
V FGC +G + + GILG S +SL +QL I +F+ C +E
Sbjct: 169 V---VFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCF-GSVEGDG 224
Query: 258 VIKFGRDADVRRRD--LETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G D D D L+ T +L S PH+Y + L + +G + P ++ +G
Sbjct: 225 ALMLG-DVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYE---EGY 280
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYP 374
G ++D+GT T++ + +Q + G + +E + +D F P
Sbjct: 281 -GTVLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAP 339
Query: 375 SM-------------TFHLQEADYI---VQPENMYFIEP-DRGRFCVAIQDD-PKYSILG 416
F LQ AD + P N F+ + G +C+ + D+ ++LG
Sbjct: 340 HAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGASGTLLG 399
Query: 417 AWQQQNMLIIYDLNVPALRFGSENC 441
+N+L+ YD + FG+ +C
Sbjct: 400 GISFRNILVQYDRRNRRVGFGAASC 424
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 89/382 (23%), Positives = 157/382 (41%), Gaps = 54/382 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y ++ IGTP K +L DT S ++W C C C ++D + S++ +
Sbjct: 84 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFV 143
Query: 151 PCDDPLCRSPFK------CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFV 200
PCD C+ N C Y Y G T G ++ + +G +
Sbjct: 144 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSAN 203
Query: 201 PRLAFGCSNDNSGFAFGGK---ISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEA 255
+ FGC SG + GILGF + S+ SQL +++ +F++CL +
Sbjct: 204 GSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL-NGVNG 262
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT- 314
+ G V + + TP+L +PH+ +++ + +G + + D G
Sbjct: 263 GGIFAIGH---VVQPKVNMTPLLPD--QPHYSVNMTAVQVGHAFLSL---STDTSTQGDR 314
Query: 315 GGFIIDTGTPVTFIRNGPYQTLM-----QRYDQILRSLGRQRIPYNASQEFDYCYRYDSS 369
G IID+GT + ++ G Y+ L+ Q D +R+L + + S+ D
Sbjct: 315 KGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTCFQYSESVD-------- 366
Query: 370 FKAYPSMTFHLQEADYI-VQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQ 421
+P++TF+ + + V P + F P +C+ Q D ++LG
Sbjct: 367 -DGFPAVTFYFENGLSLKVYPHDYLF--PSGDFWCIGWQNSGTQSRDSKNMTLLGDLVLS 423
Query: 422 NMLIIYDLNVPALRFGSENCAN 443
N L+ YDL + + NC++
Sbjct: 424 NKLVFYDLENQVIGWTEYNCSS 445
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 163/407 (40%), Gaps = 77/407 (18%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFD------QTTPIFDPRASTTYSEI 150
Y+ ++GTP +P +L DT S L W C C + P+F P+ S++ +
Sbjct: 67 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 126
Query: 151 PCDDPLCRS-------PFKCQNGKCV----------------YTRRYHVGDVTRGLASRE 187
C +P C+ KC+ C Y Y G T GL +
Sbjct: 127 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIAD 185
Query: 188 TFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL--F 245
T P R VP GCS + SG+ GF S+ +QL GL F
Sbjct: 186 TLRAPGRA----VPGFVLGCSL----VSVHQPPSGLAGFGRGAPSVPAQL-----GLPKF 232
Query: 246 SYCLV-REMEATSVIK--FGRDADVRRRDLETTPILLS---DLRPH---FYLHLLEISIG 296
SYCL+ R + + + ++ P++ S D P+ +YL L +++G
Sbjct: 233 SYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVG 292
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
VR P AF G+GG I+D+GT T++ +Q + + GR + +A
Sbjct: 293 GKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVG--GRYKRSKDA 350
Query: 357 SQEFDY--CYRYDSSFK--AYPSMTFHLQEADYIVQPENMYFIEPDRG---RFCVAIQDD 409
E C+ + A P ++FH + + P YF+ RG C+A+ D
Sbjct: 351 EDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTD 410
Query: 410 -------------PKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
P ILG++QQQN L+ YDL L F ++C +
Sbjct: 411 FSGGSGAGNEGSGPAI-ILGSFQQQNYLVEYDLEKERLGFRRQSCTS 456
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 145/341 (42%), Gaps = 38/341 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTT---YSEIPC 152
Y + ++GTP + + D S VW QC C C P A++ Y+ +
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADA-----PAATSAPPFYAFLSF 150
Query: 153 DDPLCRSPFKCQNGKCVYTRRYHVG--DVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
D R+P C Y+ Y G + T GL + + FAF + FGC+
Sbjct: 151 HD--TRAP---TTPPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI----FGCA-- 199
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE--MEATSVIKFGRDADVR 268
A G I G++G LS SQL+ G FSY L + ++ S I F DA R
Sbjct: 200 ---VATEGDIGGVIGLGRGELSPVSQLQ---IGRFSYYLAPDDAVDVGSFILFLDDAKPR 253
Query: 269 RRDLETTPILLSDL-RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTF 327
+TP++ S R +Y+ L I + + P G FD+ DG+GG ++ PVTF
Sbjct: 254 TSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPVTF 313
Query: 328 IRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQEADYI 386
+ G Y+ + Q + S R + D CY +S A PSM +
Sbjct: 314 LDAGAYKVVR----QAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVM 369
Query: 387 -VQPENMYFIEPDRGRFCVAIQDDPK--YSILGAWQQQNML 424
++ N ++++ G C+ I P S+LG+ Q ++L
Sbjct: 370 ELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVSLL 410
>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
Length = 299
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 54/168 (32%), Positives = 86/168 (51%), Gaps = 27/168 (16%)
Query: 49 GNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMK 108
GN ++ ER+ + + + R +++ K +F+ + P+ + + + + IGTP +
Sbjct: 53 GNYTKFERLQRAVKRGRLRLQRLSA--KTASFEP--SVEAPVHAGNGEFLMNLAIGTPAE 108
Query: 109 PQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKC 168
+ DT S L+WTQC+PC CFDQ TPIFDP S+++S++PC L
Sbjct: 109 TYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDL------------ 156
Query: 169 VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
YH T+G+ + ETF F G V ++ FGC DN G A+
Sbjct: 157 -----YH--SSTQGVLATETFTF----GDASVSKIGFGCGEDNRGRAY 193
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 161/383 (42%), Gaps = 40/383 (10%)
Query: 77 PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
PNA L D L + +Y+ + IGTP + L+ DT S++ + C C +C
Sbjct: 72 PNAHMRLYDDLL----SNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQD 127
Query: 137 PIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG 196
P F P +S+TY + C +P C + +C Y RRY + GL + + +F +
Sbjct: 128 PRFQPESSSTYKPMQC-NPSCNC--DDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESE 184
Query: 197 FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREME 254
T R FGC +G F + GI+G PLS+ QL + + FS C
Sbjct: 185 LT-PQRAIFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDV 243
Query: 255 ATSVIKFGRDADVRRRDLETTPILL---SD-LRPHFY-LHLLEISIGRHIVRFPPGAFDI 309
+ G ++ P ++ SD R +Y + L E+ + ++ P F
Sbjct: 244 VGGAMVLG--------NIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVF-- 293
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCY---- 364
DG G ++D+GT ++ P + + D I++ + + + + D C+
Sbjct: 294 --DGKHGTVLDSGTTYAYL---PEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAG 348
Query: 365 RYDSSF-KAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQ 419
R S K +P + + + PEN F G +C+ I + K ++LG
Sbjct: 349 RDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIV 408
Query: 420 QQNMLIIYDLNVPALRFGSENCA 442
+N L+ YD + + F NC+
Sbjct: 409 VRNTLVTYDRDNDKIGFWKTNCS 431
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 94.4 bits (233), Expect = 1e-16, Method: Composition-based stats.
Identities = 46/134 (34%), Positives = 70/134 (52%), Gaps = 6/134 (4%)
Query: 82 ELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
+++D+ P++ + + +++ IG P + DT S L WTQC PC C+ Q TPI+DP
Sbjct: 6 QVKDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIYDP 65
Query: 142 RASTTYSEIPCDDPLCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
S+TY + C LC + C + C Y Y T+G+ S ETF ++
Sbjct: 66 SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121
Query: 200 VPRLAFGCSNDNSG 213
+P +AFGC DN G
Sbjct: 122 IPHIAFGCGQDNEG 135
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 157/382 (41%), Gaps = 52/382 (13%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSE 149
L+Y+ IG ++ DT S +W C C C ++DP +S T
Sbjct: 76 LYYT---KIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKV 132
Query: 150 IPCDDPLCRSPFK-----CQNG-KCVYTRRYHVGDVTRGLASRETFAFP-VRNGFTFVP- 201
+PCDD C S + C+ C Y+ Y G T G ++ F V VP
Sbjct: 133 VPCDDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 192
Query: 202 --RLAFGCSNDNSGF---AFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREME 254
+ FGC + SG + GI+GF + S+ SQL +++ +FS+CL +
Sbjct: 193 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCL-DTVN 251
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
+ G +V + ++TTP++ H+ + L +I + ++ P FD
Sbjct: 252 GGGIFAIG---EVVQPKVKTTPLVPR--MAHYNVVLKDIEVAGDPIQLPTDIFD--STSG 304
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNASQEFDYCYRYD---S 368
G IID+GT + ++ + YDQ+L QR Y +F C+ Y S
Sbjct: 305 RGTIIDSGTTLAYLP-------VSIYDQLLEKTLAQRSGMELYLVEDQF-TCFHYSDEKS 356
Query: 369 SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ-------DDPKYSILGAWQQQ 421
A+P++ F +E + + Y +C+ Q D +LG
Sbjct: 357 LDDAFPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLT 416
Query: 422 NMLIIYDLNVPALRFGSENCAN 443
N L IYDL+ ++ + NC++
Sbjct: 417 NKLFIYDLDNMSIGWTDYNCSS 438
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 153/359 (42%), Gaps = 38/359 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y V +GTP + ++ DT++ VW C C C + +T F+ +S+TYS + C
Sbjct: 30 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNAST-SFNTNSSSTYSTVSCSTAQ 88
Query: 157 CRSP--FKC-----QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C C Q C + + Y ++T +P +FGC N
Sbjct: 89 CTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL----APDVIPNFSFGCIN 144
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
SG + + G++G P+SL SQ + G+FSYCL R + +K G
Sbjct: 145 SASGNSLPPQ--GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLG-- 200
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+ + + TP+L + RP +Y++L +S+G V P + G IID+GT +T
Sbjct: 201 QPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVIT 260
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRI---PYNASQEFDYCYRYDSSFKAYPSMTFHLQEA 383
Y+ + + R+++ ++ FD C+ D+ A P +T H+
Sbjct: 261 RFAQPVYEAIRDEF--------RKQVNVSSFSTLGAFDTCFSADNENVA-PKITLHMTSL 311
Query: 384 DYIVQPENMYFIEPDRGRFCVAI-----QDDPKYSILGAWQQQNMLIIYDLNVPALRFG 437
D + EN C+++ + +++ QQQN+ I++D VP R G
Sbjct: 312 DLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFD--VPNSRIG 368
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 152/365 (41%), Gaps = 39/365 (10%)
Query: 93 QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPC 152
Q Y + V +GTP K Q + DT SS W C+ C C R STT +++ C
Sbjct: 78 QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSR-STTCAKVSC 135
Query: 153 DDPLC---RSPFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAF 205
+C S CQ+ + C + Y G + G+ ++T F + +P +F
Sbjct: 136 GTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF---SDVQKIPGFSF 192
Query: 206 GCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM-------EATSV 258
GC+ D+ G G + G+LG A P+S+ Q FSYCL + + T
Sbjct: 193 GCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGY 251
Query: 259 IKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
G+ A R D+ T ++ F++ L IS+ + P F G
Sbjct: 252 FSLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS-----RKGV 304
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY-CYRYDSSFKA-YPS 375
+ D+G+ +++I + L QR ++L G A +E + CY S + P+
Sbjct: 305 VFDSGSELSYIPDRALSVLSQRIRELLLKRG------AAEEESERNCYDMRSVDEGDMPA 358
Query: 376 MTFHLQEADYIVQPENMYFIE---PDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVP 432
++ H + + F+E ++ +C+A SI+G+ Q + ++YDL
Sbjct: 359 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQ 418
Query: 433 ALRFG 437
+ G
Sbjct: 419 LIGIG 423
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 160/386 (41%), Gaps = 36/386 (9%)
Query: 78 NAFQELEDIHLPMAKQDL--------FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI 129
+A + L D H P A+ L +Y+ + IGTP + L+ D+ S++ + C C
Sbjct: 64 SARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCE 123
Query: 130 RCFDQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRET 188
+C + P F P S+TYS + C+ D C + + +C Y R+Y + G+ +
Sbjct: 124 QCGNHQDPRFQPDLSSTYSPVKCNVDCTCDN----ERSQCTYERQYAEMSSSSGVLGEDI 179
Query: 189 FAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFS 246
+F + R FGC N +G F GI+G LS+ QL + I FS
Sbjct: 180 MSFGKESELK-PQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFS 238
Query: 247 YCL-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPG 305
C ++ +++ G A + P+ P++ + L EI + +R P
Sbjct: 239 LCYGGMDVGGGTMVLGGMPAPPDMVFSHSNPV----RSPYYNIELKEIHVAGKALRLDPK 294
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY- 364
F+ G ++D+GT ++ + + SL + R P + D C+
Sbjct: 295 IFNSKH----GTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYK--DICFA 348
Query: 365 ---RYDSSF-KAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAIQDDPK--YSILG 416
R S + +P + + + PEN F G +C+ + + K ++LG
Sbjct: 349 GAGRNVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLG 408
Query: 417 AWQQQNMLIIYDLNVPALRFGSENCA 442
+N L+ YD + + F NC+
Sbjct: 409 GIVVRNTLVTYDRHNEKIGFWKTNCS 434
>gi|222617032|gb|EEE53164.1| hypothetical protein OsJ_35998 [Oryza sativa Japonica Group]
Length = 384
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 136/348 (39%), Gaps = 90/348 (25%)
Query: 99 VEVNIGTPM-KPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLC 157
+ + +GTP+ + L D S VW QC P S TY +
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPY---------------SLTYGGSAAN---- 130
Query: 158 RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
T G + +TF F G T VP + FGCS+ + G F
Sbjct: 131 ----------------------TSGYLATDTFTF----GATAVPGVVFGCSDASYG-DFA 163
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR-----EMEATSVIKFGRDA--DVRRR 270
G SG++G LSL SQL+ G FSY L+ + A SVI+FG DA +R
Sbjct: 164 GA-SGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRG 219
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
L+ P G FD+ +GTGG I+ + TPVT++
Sbjct: 220 RLDA---------------------------IPAGTFDLRANGTGGVILSSTTPVTYLEQ 252
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQ-EADYIVQ 388
Y + + +G + +A+ E D CY S K P +T AD +
Sbjct: 253 AAYDVVRA---AVASRIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLS 309
Query: 389 PENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRF 436
N ++I+ D G C+ + S+LG Q +IYD++ L F
Sbjct: 310 AANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRLTF 357
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 107/456 (23%), Positives = 187/456 (41%), Gaps = 39/456 (8%)
Query: 6 ALP-LAAFFSYFSVLFLTHFTSSESTGFS--LKLIPIFSPESPL-YPGNLSQSERIHKMF 61
ALP +++ + FS+L S + G + L P P+ +P LSQ +
Sbjct: 2 ALPSISSIGATFSLLIYLSLPYSITAGENNLLHQSPTARSRRPMVFPLFLSQPNSSSRSI 61
Query: 62 EISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLV 121
I + + S S P++ L D L +Y+ + IGTP + L+ D+ S++
Sbjct: 62 SIPHRKLHKSDSKSLPHSRMRLYDDLLING----YYTTRLWIGTPPQMFALIVDSGSTVT 117
Query: 122 WTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVT 180
+ C C +C P F P S+TY + C+ D C +CVY R Y +
Sbjct: 118 YVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMDCNCDD----DREQCVYEREYAEHSSS 173
Query: 181 RGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR 240
+G+ + +F + T R FGC +G + + GI+G LSL QL ++
Sbjct: 174 KGVLGEDLISFGNESQLT-PQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDK 232
Query: 241 --IQGLFSYCL-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGR 297
I F C ++ S+I G D ++ P D P++ + L I +
Sbjct: 233 GLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDP----DRSPYYNIDLTGIRVAG 288
Query: 298 HIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNAS 357
+ F DG G ++D+GT ++ P + ++R + +
Sbjct: 289 KQLSLHSRVF----DGEHGAVLDSGTTYAYL---PDAAFAAFEEAVMREVSTLKQIDGPD 341
Query: 358 QEF-DYCYRYDSS------FKAYPSMTFHLQEA-DYIVQPENMYFIEPD-RGRFCVAIQD 408
F D C++ +S K +PS+ + +++ PEN F G +C+ +
Sbjct: 342 PNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFP 401
Query: 409 DPK--YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ K ++LG +N L++YD + F NC+
Sbjct: 402 NGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCS 437
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/385 (22%), Positives = 166/385 (43%), Gaps = 37/385 (9%)
Query: 83 LEDIHLPMAKQ--DL-FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT--- 136
L+ I P+ DL Y E+ +G P++ ++ DT S ++W +C PC C +
Sbjct: 66 LQGISFPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIP 125
Query: 137 --PIFDPRASTTYSEIPCDDPLCRSPFKC-----QNGKCVYTRRYHVGDVTRGLASRETF 189
I++ AS+T S C DPLC N C Y Y + G ++
Sbjct: 126 PLSIYNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDM 185
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSY 247
+ ++ G + FGC+ + +G GI+GF ++ +Q+ + + +FS+
Sbjct: 186 HYVLQGGNATTSHIFFGCAINITG---SWPADGIMGFGQISKTVPNQIATQRNMSRVFSH 242
Query: 248 CLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF 307
CL E +++FG + + ++ TP+L ++ H+ + LL IS+ ++ F
Sbjct: 243 CLGGEKHGGGILEFGEEPNT--TEMVFTPLL--NVTTHYNVDLLSISVNSKVLPIDSKEF 298
Query: 308 DIMRDGTG--GFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR 365
+ + T G IID+GT + + L +++L ++ E C+
Sbjct: 299 SYVSNSTNETGVIIDSGTSFALLATKANRILFSE----IKNLTTAKL--GPKLEGLQCFY 352
Query: 366 YDSSF---KAYPSMTFHLQEADYI-VQPEN---MYFIEPDRGRFCVAIQDDPKYSILGAW 418
S ++P++T + ++P+N M ++ R +C A +I G
Sbjct: 353 LKSGLTVETSFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSADGLTIFGEI 412
Query: 419 QQQNMLIIYDLNVPALRFGSENCAN 443
++ L+ YD+ + + +NC++
Sbjct: 413 VLKDKLVFYDVENRRIGWKGQNCSS 437
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/393 (23%), Positives = 158/393 (40%), Gaps = 48/393 (12%)
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+A +P+A L D L + +Y+ ++IGTP + L+ D+ S++ + C C +
Sbjct: 66 LAEGGRPSARMRLHDDLL----TNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQ 121
Query: 131 CFDQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETF 189
C + P F P S+TYS + C+ D C S +C Y R+Y + G+ +
Sbjct: 122 CGNHQDPRFQPDLSSTYSPVKCNVDCTCDS----DKNQCTYERQYAEMSSSSGVLGEDIV 177
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSY 247
+F + R FGC N +G F GI+G LS+ QL ++ I FS
Sbjct: 178 SFGTESELK-PQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSM 236
Query: 248 C-----------LVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIG 296
C ++ M A + + VR P++ + L E+ +
Sbjct: 237 CYGGMDIGGGAMVLGAMPAPPGMIYTHSNAVR--------------SPYYNIELKEMHVA 282
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
+R P F DG G ++D+GT ++ + + L + R P
Sbjct: 283 GKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPN 338
Query: 357 SQE--FDYCYRYDSSF-KAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAIQDDPK 411
++ F R S + +P + + + PEN F G +C+ + + K
Sbjct: 339 YKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGK 398
Query: 412 --YSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
++LG +N L+ YD + + F NC+
Sbjct: 399 DPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 111/259 (42%), Gaps = 28/259 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G+P K + DT S ++W C PC C + F+P S+T S+I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 151 PCDDPLCRSPFK-----CQ---NGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FT 198
PC D C + + CQ N C YT Y G T G +T F G
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209
Query: 199 FVPRLAFGCSNDNSG--FAFGGKISGILGFNASPLSLSSQLRNRIQG--LFSYCLVREME 254
+ FGCSN SG + GI GF LS+ SQL + +FS+CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 269
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
++ G ++ L TP++ S +PH+ L+L I + + P + T
Sbjct: 270 GGGILVLG---EIVEPGLVYTPLVPS--QPHYNLNLESIVVNGQ--KLPIDSSLFTTSNT 322
Query: 315 GGFIIDTGTPVTFIRNGPY 333
G I+D+GT + ++ +G Y
Sbjct: 323 QGTIVDSGTTLAYLADGAY 341
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 169/401 (42%), Gaps = 69/401 (17%)
Query: 83 LEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTP 137
L + LP A Y ++ IG+P K ++ DT S ++W C C C
Sbjct: 73 LGGVGLPTATG--LYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELT 130
Query: 138 IFDPRASTTYSEIPCDDPLCRS------PFKC--QNGKCVYTRRYHVGDVTRGLASRETF 189
+DP S T + CD C + P C + C + Y G T G ++
Sbjct: 131 QYDPAGSGT--TVGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSV 188
Query: 190 AFPVRNGFTFV----PRLAFGCSNDNSGFAFGGKIS-------GILGFNASPLSLSSQL- 237
+ +G + FGC G GG + GILGF + S+ SQL
Sbjct: 189 QYNQVSGNGQTTPSNASITFGC-----GAQLGGDLGSSSQALDGILGFGQADSSMLSQLA 243
Query: 238 -RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIG 296
+++ +F++CL + + G +V + ++TTP++ + H+ ++L IS+G
Sbjct: 244 AARKVRKIFAHCL-DTVHGGGIFAIG---NVVQPKVKTTPLVQN--VTHYNVNLQGISVG 297
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR-YDQILRSLGRQRIPYN 355
++ P FD + G IID+GT + ++ Y+TL+ +D+ Q + +
Sbjct: 298 GATLQLPSSTFD--SGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKY------QDLALH 349
Query: 356 ASQEFDYCYRYDSSF-KAYPSMTFHLQEA--------DYIVQPEN----MYFIEPDRGRF 402
Q+F C+++ S +P +TF + DY+ Q EN M F++
Sbjct: 350 NYQDF-VCFQFSGSIDDGFPVVTFSFEGEITLNVYPHDYLFQNENDLYCMGFLDGG---- 404
Query: 403 CVAIQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
V +D +LG N L++YDL + + NC++
Sbjct: 405 -VQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWADYNCSS 444
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/423 (22%), Positives = 174/423 (41%), Gaps = 43/423 (10%)
Query: 40 FSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSV 99
+ S + P +S + H+ R ++ ++ KP++ +H + +Y+
Sbjct: 33 YQQRSVILPLFISPTNSSHRRVLDRDHRLRHLQNLVKPHSSNARMRLHDDLLTNG-YYTT 91
Query: 100 EVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-DPLCR 158
+ IG+P + L+ DT S++ + C C++C + P F P S+TY + C+ D C
Sbjct: 92 RLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNADCNCD 151
Query: 159 SPFKCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP-RLAFGCSNDNSGFAF 216
+NG +C Y RRY + G+ + + +F + VP R FGC SG +
Sbjct: 152 -----ENGVQCTYERRYAEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLY 204
Query: 217 GGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
+ GI+G LS+ QL + + FS C + G + +
Sbjct: 205 TQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLG--------GISS 256
Query: 275 TPILL---SD--LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
P ++ SD P++ + L EI + ++ P F DG G I+D+GT +
Sbjct: 257 PPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYF- 311
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCY----RYDSSF-KAYPSMTFHLQEA 383
P + D I++ + + F D C+ R + K +P +
Sbjct: 312 --PEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369
Query: 384 DYI-VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
I + PEN F G +C+ I + + ++LG +N L+ Y+ + F
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429
Query: 440 NCA 442
NC+
Sbjct: 430 NCS 432
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/423 (22%), Positives = 174/423 (41%), Gaps = 43/423 (10%)
Query: 40 FSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSV 99
+ S + P +S + H+ R ++ ++ KP++ +H + +Y+
Sbjct: 33 YQQRSVILPLFISPTNSSHRRVLDRDHRLRHLQNLVKPHSSNARMRLHDDLLTNG-YYTT 91
Query: 100 EVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-DPLCR 158
+ IG+P + L+ DT S++ + C C++C + P F P S+TY + C+ D C
Sbjct: 92 RLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNADCNCD 151
Query: 159 SPFKCQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVP-RLAFGCSNDNSGFAF 216
+NG +C Y RRY + G+ + + +F + VP R FGC SG +
Sbjct: 152 -----ENGVQCTYERRYAEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLY 204
Query: 217 GGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLET 274
+ GI+G LS+ QL + + FS C + G + +
Sbjct: 205 TQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLG--------GISS 256
Query: 275 TPILL---SD--LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
P ++ SD P++ + L EI + ++ P F DG G I+D+GT +
Sbjct: 257 PPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYF- 311
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCY----RYDSSF-KAYPSMTFHLQEA 383
P + D I++ + + F D C+ R + K +P +
Sbjct: 312 --PEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369
Query: 384 DYI-VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
I + PEN F G +C+ I + + ++LG +N L+ Y+ + F
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429
Query: 440 NCA 442
NC+
Sbjct: 430 NCS 432
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 93.6 bits (231), Expect = 2e-16, Method: Composition-based stats.
Identities = 46/134 (34%), Positives = 70/134 (52%), Gaps = 6/134 (4%)
Query: 82 ELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDP 141
+++D+ P++ + + +++ IG P + DT S L WTQC PC C+ Q TPI+DP
Sbjct: 6 QVKDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIYDP 65
Query: 142 RASTTYSEIPCDDPLCRS--PFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF 199
S+TY + C LC + C + C Y Y T+G+ S ETF ++
Sbjct: 66 SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121
Query: 200 VPRLAFGCSNDNSG 213
+P +AFGC DN G
Sbjct: 122 IPHIAFGCGQDNEG 135
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/402 (25%), Positives = 160/402 (39%), Gaps = 71/402 (17%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--------CFDQTTPI--------- 138
Y V V IGTP P +L+ DTA+ L W C+ R QT +
Sbjct: 123 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAAK 182
Query: 139 -------FDPRASTTYSEIPCDDP--------LCRSPFKCQNGKCVYTRRYHVGDVTRGL 183
+ P S+++ I C C+SP K ++ C Y ++ G VT G+
Sbjct: 183 KEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAES--CSYFQKTQDGTVTIGI 240
Query: 184 ASRETFAFPVRNG-FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ 242
+E V +G +P L GCS +G + G+L +S + R
Sbjct: 241 YGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAH-DGVLSLGNGDMSFAVHAAKRFG 299
Query: 243 GLFSYCLVR---EMEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRH 298
FS+CL+ +A+S + FG + V T IL + D++P + + + +G
Sbjct: 300 QRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGGE 359
Query: 299 IVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQ 358
+ P +D R GG I+DT T VT + Y + D+ L L R +
Sbjct: 360 RLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPR----VYELE 415
Query: 359 EFDYCYRY----DSSFKAY----PSMTFHLQ-------EADYIVQPENMYFIEPDRGRFC 403
F+YCY++ D A+ PS T + EA +V PE +EP G C
Sbjct: 416 GFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPE----VEP--GVAC 469
Query: 404 VA----IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
+A ++ P ILG Q + D +RF + C
Sbjct: 470 LAFRKLLRGGP--GILGNVFMQEYIWEIDHGDGKIRFRKDKC 509
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/376 (22%), Positives = 152/376 (40%), Gaps = 64/376 (17%)
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPCI-RCFDQ---TTPIFDPRASTTYSEIPCDDPL 156
+++GTP + DT S++ W QCQ CI C+ Q P F+ +S+TY + C +
Sbjct: 27 ISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQV 86
Query: 157 CRSPFKCQN---------GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C QN C+Y+ RY G+ + G S++ N ++ + + FGC
Sbjct: 87 CHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLTLA--NSYS-IQKFIFGC 143
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ-GLFSYCLVREMEATSVIKFG---R 263
+DN + G +GI+GF S +Q+ FSYC E + G R
Sbjct: 144 GSDNR---YNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQENEGFLSIGPYVR 200
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG---------- 313
D++ ++L+ L + G H+ + FD+M +G
Sbjct: 201 DSN---------KLILTQLFDY----------GAHLPVYALQQFDMMVNGMRLQVDPPVY 241
Query: 314 -TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS---S 369
T ++D+GT TF+ + ++ L + + + + G R S + C+ +
Sbjct: 242 TTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVR----GSDSKEICFHSNGDSVD 297
Query: 370 FKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD----PKYSILGAWQQQNMLI 425
+ P + + + EN+++ E G C Q D P ILG ++ +
Sbjct: 298 WSKLPVVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATRSFRV 357
Query: 426 IYDLNVPALRFGSENC 441
++D+ F + C
Sbjct: 358 VFDIQQRNFGFEAGAC 373
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 155/393 (39%), Gaps = 43/393 (10%)
Query: 67 RANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ 126
RA M + + P +L H ++ +V + +GTP + ++ DT S L W C
Sbjct: 61 RARQMPARALPRQPSKLRFHH------NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCA 114
Query: 127 PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR-----SPFKCQNG--KCVYTRRYHVGDV 179
P + F PRAS+T++ +PC CR SP C +C + Y G
Sbjct: 115 PAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSS 174
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKIS-GILGFNASPLSLSSQLR 238
+ G + + FA V +G R AFGC + + G S G+LG N LS SQ
Sbjct: 175 SDGALATDVFA--VGSGPPL--RAAFGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQAS 230
Query: 239 NRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDL------RPHFYLHLLE 292
R FSYC + + + V+ G L TP+ L R + + LL
Sbjct: 231 TR---RFSYC-ISDRDDAGVLLLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLG 286
Query: 293 ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRS-LGRQR 351
I +G + P G G ++D+GT TF+ Y L + + R L
Sbjct: 287 IRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALD 346
Query: 352 IPYNASQE-FDYCYRYDSSFKA----YPSMTFHLQEADYIVQPENMYFIEPDR-----GR 401
P A QE FD C+R P +T A+ V + + + P G
Sbjct: 347 DPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGV 406
Query: 402 FCVAIQDDPKYSIL----GAWQQQNMLIIYDLN 430
+C+ + I+ G Q N+ + YDL
Sbjct: 407 WCLTFGNADMVPIMAYVIGHHHQMNVWVEYDLE 439
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 159/398 (39%), Gaps = 67/398 (16%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR--------------------CFDQT 135
Y V V IGTP P +L+ DTA+ L W C+ R + +
Sbjct: 124 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEAS 183
Query: 136 TPIFDPRASTTYSEIPCDDP--------LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRE 187
+ P S+++ I C C+SP K ++ C Y ++ G VT G+ +E
Sbjct: 184 KNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAES--CSYFQKTQDGTVTIGIYGKE 241
Query: 188 TFAFPVRNG-FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
V +G +P L GCS +G + G+L +S + R FS
Sbjct: 242 KATVTVSDGRMAKLPGLILGCSVLEAGGSVDAH-DGVLSLGNGDMSFAVHAAKRFGQRFS 300
Query: 247 YCLVR---EMEATSVIKFGRDADVRRRDLETTPILLS-DLRPHFYLHLLEISIGRHIVRF 302
+CL+ +A+S + FG + V T IL + D++P + + + +G +
Sbjct: 301 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERLDI 360
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
P +D R GG I+DT T VT + Y + D+ L L R + F+Y
Sbjct: 361 PDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPR----VYELEGFEY 416
Query: 363 CYRY----DSSFKAY----PSMTFHLQ-------EADYIVQPENMYFIEPDRGRFCVA-- 405
CY++ D A+ PS T + EA +V PE +EP G C+A
Sbjct: 417 CYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPE----VEP--GVACLAFR 470
Query: 406 --IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
++ P ILG Q + D +RF + C
Sbjct: 471 KLLRGGP--GILGNVFMQEYIWEIDHGDGKIRFRKDKC 506
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 144/372 (38%), Gaps = 45/372 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
+Y+V + IG P K L DT S L W QC PC C ++ P + + C D
Sbjct: 63 YYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKPNGNL----VKCGD 118
Query: 155 PLCRS-----PFKCQ--NGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
PLC++ C N +C Y Y + G+ R+ NG P LAFGC
Sbjct: 119 PLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLARPILAFGC 178
Query: 208 SNDNS--GFAFGGKISGILGFNASPLSLSSQLRN--RIQGLFSYCLVREMEATSVIKFGR 263
D G +G+LG S+ SQL + I+ + +CL E F
Sbjct: 179 GYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCL---SERGGGFLFFG 235
Query: 264 DADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
D V + + TP+L S H+ ++ R P + ++ I D+G+
Sbjct: 236 DQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRK-----PTSVKGLQ-----LIFDSGS 285
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMT------ 377
T+ + ++ L+ LR R ++S C+R FK+ +T
Sbjct: 286 SYTYFNSKAHKALVNLVTNDLRGKPLSRATEDSS--LPICWRGPKPFKSLHDVTSNFKPL 343
Query: 378 ---FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQNMLIIYDL 429
F + + P Y I G C+ I D + +I+G Q+ L+IYD
Sbjct: 344 LLSFTKSKNSLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYDN 403
Query: 430 NVPALRFGSENC 441
+ + S NC
Sbjct: 404 EKQQIGWASANC 415
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 148/367 (40%), Gaps = 38/367 (10%)
Query: 88 LPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ---PCIRCFDQTTPIFDPRAS 144
LP + F +V +GTP ++ DT S +VW + P +R Q + A
Sbjct: 115 LPQGTGEYF--AQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAP 172
Query: 145 TTYSEIPCDDPLCR--SPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV 200
T C P+CR C + C+Y Y G VT G + ET F R V
Sbjct: 173 TPRWN--CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF-ARG--ARV 227
Query: 201 PRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK 260
R+A GC +DN G SG+LG LS SQ+ FSYCLV +
Sbjct: 228 QRVAIGCGHDNEGLFI--AASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARP 285
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD---GTGGF 317
R R + +Y+HLL S+G V+ + D+ + G GG
Sbjct: 286 SRRWGGTPR------------MATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGV 332
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS-SFKAYPSM 376
I+D+GT VT + Y+ + + L R+ FD CY P++
Sbjct: 333 ILDSGTSVTRLARPVYEAVRDAFRAAAVGL---RVSPGGFSLFDTCYNLSGRRVVKVPTV 389
Query: 377 TFHLQEADYIVQPENMYFIEPD-RGRFCVAIQD-DPKYSILGAWQQQNMLIIYDLNVPAL 434
+ HL + P Y I D G FC A+ D SI+G QQQ +++D + +
Sbjct: 390 SMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRV 449
Query: 435 RFGSENC 441
F ++C
Sbjct: 450 GFVPKSC 456
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 112/441 (25%), Positives = 181/441 (41%), Gaps = 68/441 (15%)
Query: 52 SQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPM-AKQDLFYSVEVNIGTPMKPQ 110
S R HK+ + + + A S A + + P+ AK YSV ++ GTP +
Sbjct: 46 SSIARAHKLKHGTSIKPDEDALSSTTTASATV--VKSPLSAKSYGGYSVSLSFGTPSQTI 103
Query: 111 HLLFDTASSLVWTQCQP---CIRC----FDQT-TPIFDPRASTTYSEIPCDDPLCR---- 158
+FDT SSLV C C C D T P F P+ S++ I C P C+
Sbjct: 104 PFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYG 163
Query: 159 ----------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
+ C G Y +Y +G T G+ E FP VP GCS
Sbjct: 164 PNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDFPDLT----VPDFVVGCS 218
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIK------- 260
++ + +GI GF P+SL SQ+ + FS+CLV R + T+V
Sbjct: 219 IIST-----RQPAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTNVTTDLDLDTG 270
Query: 261 FGRDADVRRRDLETTP------ILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
G ++ + L TP + ++YL+L I +GR V+ P +G
Sbjct: 271 SGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGD 330
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-- 372
GG I+D+G+ TF+ ++ + + + + + R++ + +E ++ S K
Sbjct: 331 GGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREK---DLEKETGLGPCFNISGKGDV 387
Query: 373 -YPSMTFHLQEADYIVQPENMYFI-EPDRGRFCVAIQDDPKYS---------ILGAWQQQ 421
P + F + + P + YF + C+ + D + ILG++QQQ
Sbjct: 388 TVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQ 447
Query: 422 NMLIIYDLNVPALRFGSENCA 442
N L+ YDL F + C+
Sbjct: 448 NYLVEYDLENDRFGFAKKKCS 468
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 137/349 (39%), Gaps = 30/349 (8%)
Query: 110 QHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNG- 166
Q ++ DTAS + W QC PC C QT ++DP S++ + PC P CR+ NG
Sbjct: 156 QTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGC 215
Query: 167 -----KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND--NSGFAFGGK 219
+C Y +Y G + G + + + FGCS+ G +F K
Sbjct: 216 TPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPG-SFSNK 274
Query: 220 ISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILL 279
SGI+ SL +Q + +FSYCL + G R TP+L
Sbjct: 275 TSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAASR-YAVTPMLR 333
Query: 280 SDLRPHFYL-HLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQ 338
S P YL L+ I + + PP F G ++D+ T VT + Y L
Sbjct: 334 SKAAPMLYLVRLIAIEVAGKRLPVPPAVF------AAGAVMDSRTIVTRLPPTAYMALRA 387
Query: 339 RYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQ-PENMYFIEP 397
+ +R+ R P + D CY + S L + + P ++P
Sbjct: 388 AFVAEMRAY-RAAAP---KEHLDTCYDF-SGAAPGGGGGVKLPKITLVFDGPNGAVELDP 442
Query: 398 DRGRF--CVAIQ---DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A DD I+G QQQ + ++Y+++ + F C
Sbjct: 443 SGVLLDGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 143/363 (39%), Gaps = 70/363 (19%)
Query: 112 LLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNG----- 166
++ DT S L W QC+PC C+ Q P+FDP S +Y+ +PC+ C + K G
Sbjct: 124 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 183
Query: 167 -------------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
+C Y+ Y G +RG+ + +T A G V FGC N G
Sbjct: 184 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL----GGASVDGFVFGCGLSNRG 239
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
G S ASP S +A + G D R
Sbjct: 240 LRRPG--SAASSPTASPPGTSG------------------DAAGSLSLGGDTSSYR---N 276
Query: 274 TTPI----LLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFI 328
TP+ +++D +P FY + ++ A + ++D+GT +T +
Sbjct: 277 ATPVSYTRMIADPAQPPFY--FMNVTGASVGGAAVAAAGLGAAN----VLLDSGTVITRL 330
Query: 329 RNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKA-YPSMTFHLQE-A 383
Y+ + + R G +R P A+ F D CY + P +T L+ A
Sbjct: 331 APSVYRAVRAEF---ARQFGAERYP--AAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGA 385
Query: 384 DYIVQPENMYFI-EPDRGRFCVAIQD---DPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
D V M F+ D + C+A+ + + I+G +QQ+N ++YD L F E
Sbjct: 386 DMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADE 445
Query: 440 NCA 442
+C+
Sbjct: 446 DCS 448
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/343 (24%), Positives = 135/343 (39%), Gaps = 31/343 (9%)
Query: 110 QHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFK--C 163
Q ++ D+AS + W QC PC C Q +DP S + + C P C + P+ C
Sbjct: 159 QTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGC 218
Query: 164 QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGI 223
N +C Y RY G T G + N V FGCS+ G +F + +GI
Sbjct: 219 ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA---VSGFKFGCSHAEQG-SFDARAAGI 274
Query: 224 LGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLR 283
+ P SL SQ +R FSYC+ + G R + T +
Sbjct: 275 MALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAA 334
Query: 284 PHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQI 343
+ + L I++G + P F G ++D+ T +T + YQ L +
Sbjct: 335 TFYGVLLRTITVGGQRLGVAPAVF------AAGSVLDSRTAITRLPPTAYQALRSAFRSS 388
Query: 344 LRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL-QEADYIVQPENMYFIEPDRGR 401
+ ++ R P D CY + P ++ + A + P + F +
Sbjct: 389 M-TMYRSAPPKG---YLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFND----- 439
Query: 402 FCVAI---QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A DD +LG+ QQQ + ++YD+ A+ F C
Sbjct: 440 -CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 158/365 (43%), Gaps = 37/365 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTP---IFDPRASTTYSEIPC 152
Y + +++GTP + DT S+L W QC+ C I+C+DQ IF+P S+TYS++ C
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 84
Query: 153 DDPLCRS-------PFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
C + C ++ C+Y+ RY G+ + G ++ +
Sbjct: 85 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRS---IDNF 141
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ-GLFSYCLVREMEATSVIKFG 262
FGC DN + G +GI+GF S +Q+ + FSYC R+ E + G
Sbjct: 142 IFGCGEDN---LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIG 198
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
A R +L T ++ D +P + + L++ + + +R + + T I+D+G
Sbjct: 199 PYA--RDINLMWTKLIYYDHKPAYAIQQLDMMV--NGIRLEIDPYIYISKMT---IVDSG 251
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS---SFKAYPSMTFH 379
T T+I + + L + + +++ G R E C+ +S ++ +P++
Sbjct: 252 TADTYILSPVFDALDKAMTKEMQAKGYTR----GWDERRICFISNSGSANWNDFPTVEMK 307
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQDDP---KYSILGAWQQQNMLIIYDLNVPALRF 436
L + + EN ++ + + DD +LG ++ +++D+ F
Sbjct: 308 LIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGF 367
Query: 437 GSENC 441
+ C
Sbjct: 368 KARAC 372
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 153/372 (41%), Gaps = 60/372 (16%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L+ IGTP +P + D A P P AS+T+ PC
Sbjct: 65 LYNVANFTIGTPPQPASAIIDVAGP----------------APCSFPNASSTFRPEPCGT 108
Query: 155 PLCRS--PFKCQNGKCVY--TRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
C+S C + C Y T +G T G+ + +TFA T L FGC
Sbjct: 109 DACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAI-----GTATASLGFGCVV- 162
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV-REMEATSVIKFGRDADVRR 269
SG G SG++G +P SL SQ+ N + FSYCL + S + G A +
Sbjct: 163 ASGIDTMGGPSGLIGLGRAPSSLVSQM-NITK--FSYCLTPHDSGKNSRLLLGSSAKLAG 219
Query: 270 R-DLETTPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTP 324
+ TTP + + D+ ++ + L I G + PP ++ + T P
Sbjct: 220 GGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVL--------VQTLAP 271
Query: 325 VTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY-RYDSSFKAYPSMTFHLQE- 382
++F+ + YQ L + ++ +++G Q FD C+ + S + P + F Q+
Sbjct: 272 MSFLVDSAYQALKK---EVTKAVGAAPT-ATPLQPFDLCFPKAGLSNASAPDLVFTFQQG 327
Query: 383 ADYIVQPENMYFIE--PDRGRFCVAIQD---------DPKYSILGAWQQQNMLIIYDLNV 431
A + P Y I+ ++G C+AI D +ILG+ QQ+N + DL
Sbjct: 328 AAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEK 387
Query: 432 PALRFGSENCAN 443
L F +CA+
Sbjct: 388 KTLSFEPADCAH 399
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 158/365 (43%), Gaps = 37/365 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTP---IFDPRASTTYSEIPC 152
Y + +++GTP + DT S+L W QC+ C I+C+DQ IF+P S+TYS++ C
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65
Query: 153 DDPLCRS-------PFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
C + C ++ C+Y+ RY G+ + G ++ +
Sbjct: 66 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRS---IDNF 122
Query: 204 AFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ-GLFSYCLVREMEATSVIKFG 262
FGC DN + G +GI+GF S +Q+ + FSYC R+ E + G
Sbjct: 123 IFGCGEDN---LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIG 179
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTG 322
A R +L T ++ D +P + + L++ + + +R + + T I+D+G
Sbjct: 180 PYA--RDINLMWTKLIYYDHKPAYAIQQLDMMV--NGIRLEIDPYIYISKMT---IVDSG 232
Query: 323 TPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS---SFKAYPSMTFH 379
T T+I + + L + + +++ G R E C+ +S ++ +P++
Sbjct: 233 TADTYILSPVFDALDKAMTKEMQAKGYTR----GWDERRICFISNSGSANWNDFPTVEMK 288
Query: 380 LQEADYIVQPENMYFIEPDRGRFCVAIQDDP---KYSILGAWQQQNMLIIYDLNVPALRF 436
L + + EN ++ + + DD +LG ++ +++D+ F
Sbjct: 289 LIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGF 348
Query: 437 GSENC 441
+ C
Sbjct: 349 KARAC 353
>gi|413950927|gb|AFW83576.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 316
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 116/265 (43%), Gaps = 34/265 (12%)
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSVI 259
+ GC+ +G +F G+L S +S +S+ R G FSYCLV + ATS +
Sbjct: 58 VVLGCTTSYTGESFLAS-DGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYL 116
Query: 260 KFGRDADVRRRDLE--------------TTPILLSD-LRPHFYLHLLEISIGRHIVRFPP 304
FG + V TP+LL +RP + + + +S+ ++R P
Sbjct: 117 TFGPNPAVSSASASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPR 176
Query: 305 GAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY 364
+D+ + GG I+D+GT +T + + Y+ ++ + L L P A FDYCY
Sbjct: 177 LVWDVQKG--GGAILDSGTSLTVLVSPAYRAVVAALGKKLVGL-----PRVAMDPFDYCY 229
Query: 365 RYDSSFK------AYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILG 416
+ S A P++ H + + P Y I+ G C+ +Q D P S++G
Sbjct: 230 NWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDAAPGVKCIGLQEGDWPGVSVIG 289
Query: 417 AWQQQNMLIIYDLNVPALRFGSENC 441
QQ L +DL LRF C
Sbjct: 290 NILQQEHLWEFDLKNRRLRFKRSRC 314
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 150/363 (41%), Gaps = 33/363 (9%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDD 154
L Y+V V GTP + + DT + C+PC P FD STT++ +PCD
Sbjct: 147 LDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQSTTFTHVPCDS 206
Query: 155 PLCRSPFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
P C S C G C + + G ++ + + + V++ FTFV ++G
Sbjct: 207 PDCPSTANCSAGSVCPFNLFFVEGTFSQDVLTVAP-SVAVQD-FTFVCL--------DAG 256
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
+ G G L + SL S+L FSYC+ + ++ + G DA VR +
Sbjct: 257 ASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPDSPGFLSLGDDATVRGDNCT 316
Query: 274 TTPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
LLS DL +++ ++ +S+G + P G F I++ GT T +
Sbjct: 317 AHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTF----GNNASTIVEAGTTFTMLA 372
Query: 330 NGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-AYPSMTFHLQEADYIV- 387
Y L + Q + R + +FD CY + + P + F D ++
Sbjct: 373 PDAYTPLRDAFRQAMAQYNRSVPGF---YDFDTCYNFTGLQELTVPLVEFKFGNGDSLLI 429
Query: 388 -QPENMYFIEPDRGRF---CVAIQD-----DPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ +Y+ P G F C+A D +++GA+ ++YD+ + F
Sbjct: 430 DGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTVGFIP 489
Query: 439 ENC 441
E+C
Sbjct: 490 ESC 492
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 170/397 (42%), Gaps = 72/397 (18%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ---PCIRC-FDQTTP---IFDPRASTTYSE 149
YS+ ++ GTP + L+ DT S LVW C C C F + P IF P++S++
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149
Query: 150 IPCDDPLC------RSPFKCQNGK---------CVYTRRYHVGDVTRGLASRETFAFPVR 194
+ C +P C + +C++ + C ++ +T G+ ET P +
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGK 209
Query: 195 NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE-- 252
VP GCS ++ + +GI GF P SL SQL + FSYCL+
Sbjct: 210 G----VPNFIVGCSVLST-----SQPAGISGFGRGPPSLPSQLGLK---KFSYCLLSRRY 257
Query: 253 ---MEATSVIKFGR-DADVRRRDLETTPILLS-------DLRPHFYLHLLEISIGRHIVR 301
E++S++ G D+ + L TP + + ++YL L I++G V+
Sbjct: 258 DDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVK 317
Query: 302 FP-----PGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNA 356
P PGA DG GG IID+GT T+++ ++ + +++ ++S ++
Sbjct: 318 IPYKYLIPGA-----DGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQS--KRATEVEG 370
Query: 357 SQEFDYCYRYDS-SFKAYPSMTFHLQEADYIVQPENMY--FIEPDRGRFCVAIQDDPKYS 413
C+ + ++P +T + + P Y F+ D C+ I D
Sbjct: 371 ITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDD-VVCLTIVTDGAAG 429
Query: 414 ---------ILGAWQQQNMLIIYDLNVPALRFGSENC 441
ILG +QQQN + YDL L F ++C
Sbjct: 430 KEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 159/384 (41%), Gaps = 34/384 (8%)
Query: 73 SMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCF 132
S PNA L D L +Y+ + IGTP + L+ DT S++ + C C C
Sbjct: 69 SKRHPNARMRLYDDLLING----YYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCG 124
Query: 133 DQTTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFP 192
P F P S TY + C P C +C+Y R+Y + G+ + +F
Sbjct: 125 RHQDPKFQPDLSETYQPVKC-TPDCNC--DGDTNQCMYDRQYAEMSSSSGVLGEDVVSF- 180
Query: 193 VRNGFTFVP-RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCL 249
N P R FGC ND +G + + GI+G LS+ QL ++ I FS C
Sbjct: 181 -GNLSELAPQRAVFGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY 239
Query: 250 -VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFD 308
++ ++I G + P D P++ ++L E+ + ++ P F
Sbjct: 240 GGMDVGGGAMILGGISPPEDMVFTHSDP----DRSPYYNINLKEMHVAGKKLQLNPKVF- 294
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYR--- 365
DG G ++D+GT ++ + + + SL + P + D C+
Sbjct: 295 ---DGKHGTVLDSGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYK--DICFTGAG 349
Query: 366 YDSS--FKAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAI---QDDPKYSILGAW 418
D S K++P + + + + PEN F RG +C+ + DP ++LG
Sbjct: 350 IDVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPT-TLLGGI 408
Query: 419 QQQNMLIIYDLNVPALRFGSENCA 442
+N L++YD + F NC+
Sbjct: 409 FVRNTLVMYDRENSKIGFWKTNCS 432
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 114/440 (25%), Positives = 174/440 (39%), Gaps = 93/440 (21%)
Query: 64 SKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWT 123
S RA+++ + + P++ + L +H K YS+++ GTP + + DT SSLVW
Sbjct: 188 SITRAHHLKNHNNPSSLKTL--VH---PKTYGGYSIDLKFGTPPQTFPFVLDTGSSLVWL 242
Query: 124 QCQP---CIRC---FDQTTPIFDPRASTTYSEIPCDDPLCRSPFK-------CQ------ 164
C C +C + TP F P+ S + + C +P C F C+
Sbjct: 243 PCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVTSHCCKLAKAAF 302
Query: 165 --NGKC-----VYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFG 217
N C YT +Y +G T G E FP +N V GCS
Sbjct: 303 SNNNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFPAKN----VSDFLVGCS-----VVSV 352
Query: 218 GKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE------------MEATSVIKFGRDA 265
+ GI GF SL +Q+ FSYCL+ MEAT+ + +
Sbjct: 353 YQPGGIAGFGRGEESLPAQMN---LTRFSYCLLSHQFDESPENSDLVMEATNSGEGKKTN 409
Query: 266 DVRRRDLETTPILLSDLRPHF----YLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
V P S +P F Y+ L +I +G VR P + +G GGFI+D+
Sbjct: 410 GVSYTAFLKNP---STKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGGFIVDS 466
Query: 322 GTPVTFIRNGPYQTLMQR--YDQILRSLGRQRIPYNASQEFDYCYRYDSSF--------K 371
G+ +TF M+R +D + +Q + Y ++E + + F
Sbjct: 467 GSTLTF---------MERPIFDLVAEEFVKQ-VNYTRARELEKQFGLSPCFVLAGGAETA 516
Query: 372 AYPSMTFHLQEADYIVQPENMYFIEPDRGRF-CVAIQDDPKYS---------ILGAWQQQ 421
++P M F + + P YF +G C+ I D ILG +QQQ
Sbjct: 517 SFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQ 576
Query: 422 NMLIIYDLNVPALRFGSENC 441
N + DL F S++C
Sbjct: 577 NFYVECDLENERFGFRSQSC 596
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 165/379 (43%), Gaps = 48/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y +V +GTP + ++ DT S ++W C C C QT+ + FDPR+S+T S
Sbjct: 76 LYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGC-PQTSGLQIQLNYFDPRSSSTSSL 134
Query: 150 IPCDDPLCRSPFKC-------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
I C D CRS + QN +C YT +Y G T G + F T
Sbjct: 135 ISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTN 194
Query: 203 ----LAFGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQLRNRIQGL----FSYCLVRE 252
+ FGCS +G + + GI GF +S+ SQL +QG+ FS+CL +
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQL--SLQGIAPRVFSHCLKGD 252
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
V+ G ++ ++ +P++ S +PH+ L+L IS+ IV P F +
Sbjct: 253 NSGGGVLVLG---EIVEPNIVYSPLVQS--QPHYNLNLQSISVNGQIVPIAPAVFATSNN 307
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYD----QILRSL---GRQRIPYNASQEFDYCYR 365
G I+D+GT + ++ Y + Q +RS+ G Q S D +
Sbjct: 308 --RGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFPQ 365
Query: 366 YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS--ILGAWQQQNM 423
+F S+ Q DY++Q Y E +C+ Q P S ILG ++
Sbjct: 366 VSLNFAGGASLVLRPQ--DYLMQQN--YIGEGS--VWCIGFQRIPGQSITILGDLVLKDK 419
Query: 424 LIIYDLNVPALRFGSENCA 442
+ +YDL + + + +C+
Sbjct: 420 IFVYDLAGQRIGWANYDCS 438
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 150/379 (39%), Gaps = 61/379 (16%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y V +NIG P KP L DT S L W QC PC C P++ P T +PC +
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRP---TKNKLVPCANS 113
Query: 156 LC------RSPFK--CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
+C SP K +C Y +Y + G+ ++F+ P+RN P L+FGC
Sbjct: 114 ICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKSNVRPSLSFGC 173
Query: 208 SNDNSGFAFGGK---ISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFG 262
D G G+LG +SL SQL+ + + + +CL + FG
Sbjct: 174 GYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL--STSGGGFLFFG 231
Query: 263 RDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTG------G 316
D R ++ ++ + G + + PG+ + D
Sbjct: 232 DDMVPTSR--------------VTWVSMVRSTSGNY---YSPGSATLYFDRRSLSTKPME 274
Query: 317 FIIDTGTPVTFIRNGPYQ-TLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPS 375
+ D+G+ T+ PYQ T+ + +SL + P C++ +FK+
Sbjct: 275 VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDP-----SLPLCWKGQKAFKSVSD 329
Query: 376 MTFHLQEADYI--------VQPENMYFIEPDRGRFCVAIQDDP----KYSILGAWQQQNM 423
+ + +I + PEN Y I G C+ I D +SI+G Q+
Sbjct: 330 VKKDFKSLQFIFGKNAVMDIPPEN-YLIITKNGNVCLGILDGSAAKLSFSIIGDITMQDQ 388
Query: 424 LIIYDLNVPALRFGSENCA 442
++IYD L + +C+
Sbjct: 389 MVIYDNEKAQLGWIRGSCS 407
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/361 (22%), Positives = 156/361 (43%), Gaps = 37/361 (10%)
Query: 101 VNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTP---IFDPRASTTYSEIPCDDPL 156
+++GTP + DT S+L W QC+ C I+C+DQ IF+P S+TYS++ C
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEA 62
Query: 157 CRS-------PFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C + C ++ C+Y+ RY G+ + G ++ + FGC
Sbjct: 63 CNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRS---IDNFIFGC 119
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQ-GLFSYCLVREMEATSVIKFGRDAD 266
DN + G +GI+GF S +Q+ + FSYC R+ E + G A
Sbjct: 120 GEDN---LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIGPYA- 175
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
R +L T ++ D +P + + L++ + + +R + + T I+D+GT T
Sbjct: 176 -RDINLMWTKLIYYDHKPAYAIQQLDMMV--NGIRLEIDPYIYISKMT---IVDSGTADT 229
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS---SFKAYPSMTFHLQEA 383
+I + + L + + +++ G R E C+ +S ++ +P++ L +
Sbjct: 230 YILSPVFDALDKAMTKEMQAKGYTR----GWDERRICFISNSGSANWNDFPTVEMKLIRS 285
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQDDP---KYSILGAWQQQNMLIIYDLNVPALRFGSEN 440
+ EN ++ + + DD +LG ++ +++D+ F +
Sbjct: 286 TLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARA 345
Query: 441 C 441
C
Sbjct: 346 C 346
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 168/383 (43%), Gaps = 57/383 (14%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y +V +GTP ++ DT S ++W C C C QT+ + FDP +S+T S
Sbjct: 74 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGC-PQTSGLQIQLNFFDPGSSSTSSM 132
Query: 150 IPCDDPLCRSPFK-------CQNGKCVYTRRYHVGDVTRG------LASRETFAFPVRNG 196
I C D C + + QN +C YT +Y G T G + F V
Sbjct: 133 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 192
Query: 197 FTFVPRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYCLV 250
T + FGCSN +G + GI GF +S+ SQL + QG+ FS+CL
Sbjct: 193 ST--APVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSS--QGIAPRVFSHCLK 248
Query: 251 REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIM 310
+ ++ G ++ ++ T ++ + +PH+ L+L I++ ++ F
Sbjct: 249 GDSSGGGILVLG---EIVEPNIVYTSLVPA--QPHYNLNLQSIAVNGQTLQIDSSVF--A 301
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIP---YNASQEFDYCYRYD 367
+ G I+D+GT + ++ + YD + ++ IP + + CY
Sbjct: 302 TSNSRGTIVDSGTTLAYLAE-------EAYDPFVSAI-TASIPQSVHTVVSRGNQCYLIT 353
Query: 368 SSF-KAYPSMTFHLQ-EADYIVQPENMYFIEPDR----GRFCVAIQ--DDPKYSILGAWQ 419
SS + +P ++ + A I++P++ Y I+ + +C+ Q +ILG
Sbjct: 354 SSVTEVFPQVSLNFAGGASMILRPQD-YLIQQNSIGGAAVWCIGFQKIQGQGITILGDLV 412
Query: 420 QQNMLIIYDLNVPALRFGSENCA 442
++ +++YDL + + + +C+
Sbjct: 413 LKDKIVVYDLAGQRIGWANYDCS 435
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 104/407 (25%), Positives = 163/407 (40%), Gaps = 59/407 (14%)
Query: 66 ARANYMA-SMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQ 124
+RA MA S S A +L+ P Y V +NIG P KP L DT S L W Q
Sbjct: 25 SRAATMARSPSSSTAVFQLQGDVYPTGH----YYVTMNIGNPAKPYFLDVDTGSDLTWLQ 80
Query: 125 CQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR-------SPFKCQNGK-CVYTRRYH 175
C PC C P++ P A+ +PC + LC S KC + K C Y +Y
Sbjct: 81 CDAPCRSCNKVPHPLYRPTANRL---VPCANALCTALHSGQGSNNKCPSPKQCDYQIKYT 137
Query: 176 VGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND---NSGFAFGGKISGILGFNASPLS 232
++G+ ++F+ P+R+ P L FGC D A I G+LG +S
Sbjct: 138 DSASSQGVLINDSFSLPMRSS-NIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVS 196
Query: 233 LSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHL 290
L SQL+ + + + +CL + FG D R ++ +
Sbjct: 197 LVSQLKQQGITKNVVGHCL--STNGGGFLFFGDDVVPSSRVT--------------WVPM 240
Query: 291 LEISIGRHIVRFPPGA----FDIMRDGTG--GFIIDTGTPVTFIRNGPYQTLMQRYDQIL 344
+ + G + + PG+ FD G + D+G+ T+ PYQ ++ L
Sbjct: 241 AQRTSGNY---YSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSALKGGL 297
Query: 345 -RSLGRQRIP-----YNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPD 398
+SL + P + + F + + FK+ + A + PEN Y I
Sbjct: 298 SKSLKQVSDPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKNAAMEIPPEN-YLIVTK 356
Query: 399 RGRFCVAIQDDP----KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+ I D ++++G Q+ ++IYD L + C
Sbjct: 357 NGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGAC 403
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 74/149 (49%), Gaps = 15/149 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR-CFDQTTPIFDPRASTTYSEIPCDDP 155
Y V V +GTP + +FDT S L WTQC+PC R C+ Q PIF+P ST+Y+ I C P
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 156 LC--------RSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGC 207
C SP C CVY +Y + G +++ A + F FGC
Sbjct: 198 TCDELKSGTGNSP-SCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDVFN---NFLFGC 253
Query: 208 SNDNSGFAFGGKISGILGFNASPLSLSSQ 236
+N G G ++G++G + LSL S+
Sbjct: 254 GQNNRGLFVG--VAGLIGLGRNALSLMSK 280
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/421 (23%), Positives = 167/421 (39%), Gaps = 41/421 (9%)
Query: 40 FSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSV 99
F P + + P LS + S+ S S A L D +P +Y+
Sbjct: 40 FGPSAMVLPLTLSAPNS-SRTLSHSRRHLQRSESHSTATARMPLYDDLIPYG----YYTT 94
Query: 100 EVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-DPLCR 158
+ IGTP + L+ DT S+L + C C +C P F P S+TY + C + C
Sbjct: 95 RIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMECTCD 154
Query: 159 SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
S CVY R+Y + G+ + +F ++ R FGC N +G +
Sbjct: 155 SEMM----HCVYDRQYAEMSSSSGVLGEDIVSFGKQSELK-PQRTVFGCENVETGDIYSQ 209
Query: 219 KISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTP 276
+ GI+G LS+ QL + I FS C + G A V
Sbjct: 210 RADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY-------GGMDVGGGAMVLGGISPPAG 262
Query: 277 ILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
++ + P ++ + L EI I + P F DG G I+D+GT ++
Sbjct: 263 MVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVF----DGKYGTILDSGTTYAYLPEPA 318
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEF-DYCYRYDSS-----FKAYPSMTFHLQEADYI 386
++ D I++ L ++ + + D C+ S K +P++ + +
Sbjct: 319 FKAFK---DAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRL 375
Query: 387 -VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ PEN F G +C+ I ++ + ++LG +N L++YD + F NC+
Sbjct: 376 SLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435
Query: 443 N 443
Sbjct: 436 E 436
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 109/452 (24%), Positives = 176/452 (38%), Gaps = 64/452 (14%)
Query: 37 IPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLF 96
IP+ P++ P Q ++++ + S ARA ++ + P +
Sbjct: 11 IPLQHPQTNQIPFQ-DQYQKLNHLVTTSLARARHLKN---PQTTPATTTTAPLFSHSYGG 66
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQP---CIRCFDQTTPI------FDPRASTTY 147
YSV ++ GTP + + DT S +VW C C C ++ F P+ S++
Sbjct: 67 YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSS 126
Query: 148 SEIPCDDPLCR-------------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVR 194
+ C +P C S C N C ++ T G+A ET
Sbjct: 127 KLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSETLHLHSL 186
Query: 195 NGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVR--- 251
+ P GCS +S + +GI GF SL SQL G FSYCL+
Sbjct: 187 SK----PNFLVGCSVFSSH-----QPAGIAGFGRGLSSLPSQLG---LGKFSYCLLSHRF 234
Query: 252 ----EMEATSVIKFGR-DADVRRRDLETTPILL-------SDLRPHFYLHLLEISIGRHI 299
+ ++ V+ + D+D + L TP + S ++YL L I++G H
Sbjct: 235 DDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHH 294
Query: 300 VRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE 359
V+ P DG GG IID+GT TF+ ++ L + + ++ R + +A
Sbjct: 295 VKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIG- 353
Query: 360 FDYCYRY-DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYS----- 413
C+ D+ ++P + + + + P YF C+ + D
Sbjct: 354 LRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVG 413
Query: 414 ----ILGAWQQQNMLIIYDLNVPALRFGSENC 441
ILG +Q QN + YDL L F E C
Sbjct: 414 GPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 99/421 (23%), Positives = 167/421 (39%), Gaps = 41/421 (9%)
Query: 40 FSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSV 99
F P + + P LS + S+ S S A L D +P +Y+
Sbjct: 40 FGPSAMVLPLTLSAPNS-SRTLSHSRRHLQRSESHSTATARMPLYDDLIPYG----YYTT 94
Query: 100 EVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD-DPLCR 158
+ IGTP + L+ DT S+L + C C +C P F P S+TY + C + C
Sbjct: 95 RIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMECTCD 154
Query: 159 SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGG 218
S CVY R+Y + G+ + +F ++ R FGC N +G +
Sbjct: 155 SEMM----HCVYDRQYAEMSSSSGVLGEDIVSFGKQSELK-PQRTVFGCENVETGDIYSQ 209
Query: 219 KISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTP 276
+ GI+G LS+ QL + I FS C + G A V
Sbjct: 210 RADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY-------GGMDVGGGAMVLGGISPPAG 262
Query: 277 ILLSDLRP----HFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
++ + P ++ + L EI I + P F DG G I+D+GT ++
Sbjct: 263 MVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVF----DGKYGTILDSGTTYAYLPEPA 318
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEF-DYCYRYDSS-----FKAYPSMTFHLQEADYI 386
++ D I++ L ++ + + D C+ S K +P++ + +
Sbjct: 319 FKAFK---DAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRL 375
Query: 387 -VQPENMYFIEPD-RGRFCVAI--QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
+ PEN F G +C+ I ++ + ++LG +N L++YD + F NC+
Sbjct: 376 SLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435
Query: 443 N 443
Sbjct: 436 E 436
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 86/182 (47%), Gaps = 22/182 (12%)
Query: 82 ELEDIHLPMAK----QDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
E +P++ Q L Y V + +G+ K ++ DT S L W QC+PC+ C++Q P
Sbjct: 46 EASQTQIPLSSGINLQTLNYIVTMGLGS--KNMTVIIDTRSDLTWVQCEPCMSCYNQQGP 103
Query: 138 IFDPRASTTYSEIPCDDPLCRS-PFKCQN---------GKCVYTRRYHVGDVTRGLASRE 187
IF P S++Y + C+ C+S F N C Y Y G T G E
Sbjct: 104 IFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVE 163
Query: 188 TFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSY 247
+F G V FGC +N G FGG +SG++G S LSL SQ G+FSY
Sbjct: 164 ALSF----GGVSVSDFVFGCGRNNKGL-FGG-VSGLMGLGRSYLSLVSQTNATFGGVFSY 217
Query: 248 CL 249
CL
Sbjct: 218 CL 219
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 161/376 (42%), Gaps = 44/376 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y +V +GTP + DT S ++W C C C FD +S++ S +
Sbjct: 78 LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLV 137
Query: 151 PCDDPLCRSPFK-------CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR- 202
C DP+C S F+ Q+ +C YT +Y G T G E+ F + G + +
Sbjct: 138 SCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIANS 197
Query: 203 ---LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEA 255
+ FGCS SG I GI GF LS+ SQL R +FS+CL E
Sbjct: 198 SASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEGNG 257
Query: 256 TSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAF--DIMRDG 313
++ G +V + +P++ S +PH+ L+L IS+ + P F I R
Sbjct: 258 GGILVLG---EVLEPGIVYSPLVPS--QPHYNLYLQSISVNGQTLPIDPSVFATSINR-- 310
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KA 372
G IID+GT + ++ Y + I ++ + P + + CY +S +
Sbjct: 311 --GTIIDSGTTLAYLVEEAYTPFV---SAITAAVSQSVTP--TISKGNQCYLVSTSVGEI 363
Query: 373 YPSMTFHLQ-EADYIVQPENMYFIE----PDRGRFCVAIQD-DPKYSILGAWQQQNMLII 426
+P ++ + A +++PE Y + +C+ Q +ILG ++ + +
Sbjct: 364 FPLVSLNFAGSASMVLKPEE-YLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFV 422
Query: 427 YDLNVPALRFGSENCA 442
YDL + + S +C+
Sbjct: 423 YDLARQRIGWASYDCS 438
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/370 (23%), Positives = 146/370 (39%), Gaps = 49/370 (13%)
Query: 94 DLFYSVEVNIGTPMKPQ--HLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEI 150
D Y + +G P Q HL DT S L W QC PC C ++ PR +
Sbjct: 195 DGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKPRKDNL---V 251
Query: 151 PCDDPLCRSPFK------CQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
+P C + C++ +C Y Y + G+ +++ F + NG +
Sbjct: 252 RSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDI 311
Query: 204 AFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVI 259
FGC D G K GILG + + +SL SQL +R I + +CL ++ I
Sbjct: 312 VFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYI 371
Query: 260 KFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFII 319
G D V + P+L + + + ++S G ++ + D G +
Sbjct: 372 FMGSDL-VPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAML-----SLDGENGRVGKVLF 425
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILR-SLGRQRIPYNASQEFDYCYRYDSSF-------- 370
DTG+ T+ N Y L+ ++ L R ++ + C+R ++
Sbjct: 426 DTGSSYTYFPNQAYSQLVTSLQEVSDLELTRD----DSDEALPICWRAKTNSPISSLSDV 481
Query: 371 -KAYPSMTFHLQ------EADYIVQPENMYFIEPDRGRFCVAIQD-----DPKYSILGAW 418
K + +T + ++QPE+ Y I ++G C+ I D D I+G
Sbjct: 482 KKFFRPITLQIGSKWLIISKKLLIQPED-YLIISNKGNVCLGILDGSNVHDGSTIIIGDI 540
Query: 419 QQQNMLIIYD 428
+ LI+YD
Sbjct: 541 SMRGRLIVYD 550
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 104/407 (25%), Positives = 163/407 (40%), Gaps = 59/407 (14%)
Query: 66 ARANYMA-SMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQ 124
+RA MA S S A +L+ P Y V +NIG P KP L DT S L W Q
Sbjct: 25 SRAATMARSPSSSTAVFQLQGDVYPTGH----YYVTMNIGNPAKPYFLDVDTGSDLTWLQ 80
Query: 125 CQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR-------SPFKCQNGK-CVYTRRYH 175
C PC C P++ P A+ +PC + LC S KC + K C Y +Y
Sbjct: 81 CDAPCRSCNKVPHPLYRPTANRL---VPCANALCTALHSGQGSNNKCPSPKQCDYQIKYT 137
Query: 176 VGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND---NSGFAFGGKISGILGFNASPLS 232
++G+ ++F+ P+R+ P L FGC D A I G+LG +S
Sbjct: 138 DSASSQGVLINDSFSLPMRSS-NIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVS 196
Query: 233 LSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHL 290
L SQL+ + + + +CL + FG D R ++ +
Sbjct: 197 LVSQLKQQGITKNVVGHCL--STNGGGFLFFGDDVVPSSRVT--------------WVPM 240
Query: 291 LEISIGRHIVRFPPGA----FDIMRDGTG--GFIIDTGTPVTFIRNGPYQTLMQRYDQIL 344
+ + G + + PG+ FD G + D+G+ T+ PYQ ++ L
Sbjct: 241 AQRTSGNY---YSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSALKGGL 297
Query: 345 -RSLGRQRIP-----YNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPD 398
+SL + P + + F + + FK+ + A + PEN Y I
Sbjct: 298 SKSLKQVSDPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFSSAKNAAMEIPPEN-YLIVTK 356
Query: 399 RGRFCVAIQDDP----KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
G C+ I D ++++G Q+ ++IYD L + C
Sbjct: 357 NGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGAC 403
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/347 (25%), Positives = 138/347 (39%), Gaps = 36/347 (10%)
Query: 110 QHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFKC-QNG 166
Q ++ DT+S + W QC PC +C Q P++DP S+T++ IPC P C+ NG
Sbjct: 169 QTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNG 228
Query: 167 ------KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKI 220
+C Y Y G T G +T + V FGCS+ G +F +
Sbjct: 229 CSPTTDECKYIVNYGDGKATTGTYVTDTLTM---SPTIVVKDFRFGCSHAVRG-SFSNQN 284
Query: 221 SGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLS 280
+GIL SL Q + FSYC+ + A + G + + TP++ +
Sbjct: 285 AGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSA-GFLSLGGPVEASLK-FSYTPLIKN 342
Query: 281 DLRPHFYL-HLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQR 339
P FY+ HL I + + PP AF G ++D+G VT + Y L
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRAA 396
Query: 340 YDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDR 399
+ + + G P + D CY F +P + + + + +EP
Sbjct: 397 FRSAMAAYGPLAAPV---RNLDTCY----DFTRFPDV--KVPKVSLVFAGGATLDLEPAS 447
Query: 400 GRF--CVAIQDDP---KYSILGAWQQQNMLIIYDLNVPALRFGSENC 441
C+A P +G QQQ ++YD+ + F C
Sbjct: 448 IILDGCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 156/368 (42%), Gaps = 35/368 (9%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCR 158
+++++GTP +P + S W C T +F P ST+++++PC P C
Sbjct: 1 MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCS 60
Query: 159 S----PFKCQ-NGKCVYTRRY-----HVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
+ C + C Y Y GD+ +A+ ++ VRN L+ GC
Sbjct: 61 AFSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDS----VRN-RKVAANLSLGCG 115
Query: 209 NDNSGFAFGGKISGILGFNASPLSLSSQLRN-RIQGLFSYCLVREMEATSVIKFG---RD 264
D+ G SG +GF+ +S QL + F YCL + ++ R+
Sbjct: 116 RDSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTFRGKLVIGNYKLRN 175
Query: 265 ADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
A + + TP++ + Y ++L ISI ++ + P F + +GTGG +IDT T
Sbjct: 176 ASISSS-MAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGF--LSNGTGGTVIDTTT 232
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY--DSSFKAYPSMTFHLQ 381
++++ + Y L+Q +L + + CY +S F ++T+H
Sbjct: 233 FLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLTYHFL 292
Query: 382 EADYIVQPENMYFIEPD----RGRFCVAI----QDDPKYSILGAWQQQNMLIIYDLNVPA 433
+ + +F+ D C+AI P +++G +QQ ++ + YDL
Sbjct: 293 GGAGV--EVSTWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDLEQMR 350
Query: 434 LRFGSENC 441
FG++ C
Sbjct: 351 YGFGAQGC 358
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 161/381 (42%), Gaps = 48/381 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC-----FDQTTPIFDPRASTTYSEI 150
Y V +G+P K ++ DT S ++W C C C FDP +STT + +
Sbjct: 83 LYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALV 142
Query: 151 PCDDPLCRSPFKCQN-------GKCVYTRRYHVGDVTRGLASRETF---AFPVRNGF--- 197
C D C + + + +C YT +Y G T G + + +G
Sbjct: 143 SCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQ 202
Query: 198 ---TFVPRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYC 248
T+ ++F CS +G + GI GF +S+ SQL + QG+ FS+C
Sbjct: 203 ICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLAS--QGITPRVFSHC 260
Query: 249 LVREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFD 308
L + V+ G ++ ++ TP++ S +PH+ L+L IS+ + P F
Sbjct: 261 LKGDDSGGGVLVLG---EIVEPNIVYTPLVPS--QPHYNLYLQSISVAGQTLAIDPSVFG 315
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDS 368
+ G I+D+GT + ++ G Y + ++ R + + + CY S
Sbjct: 316 ASSN--QGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYL-----SKGNQCYLVTS 368
Query: 369 SFK-AYPSMTFHLQ-EADYIVQPENMYFIEPDRGR---FCVAIQDDP--KYSILGAWQQQ 421
S +P ++ + A I+ P++ + G +CV Q P + +ILG +
Sbjct: 369 SVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLK 428
Query: 422 NMLIIYDLNVPALRFGSENCA 442
+ + +YD+ + + + +C+
Sbjct: 429 DKIFVYDIANQRVGWTNYDCS 449
>gi|56202144|dbj|BAD73477.1| chloroplast nucleoid DNA binding protein-like [Oryza sativa
Japonica Group]
gi|125571574|gb|EAZ13089.1| hypothetical protein OsJ_03009 [Oryza sativa Japonica Group]
Length = 316
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 128/311 (41%), Gaps = 45/311 (14%)
Query: 168 CVYTRRYHVGDVTRGLASRE--TFAFPVRNGFTFVPR-LAFGCSNDNSGFAFGGKISGIL 224
C RRY G RG + T A R R + GC+ +G +F G+L
Sbjct: 12 CSAARRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLAS-DGVL 70
Query: 225 GFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSVIKFGRD-ADVRRRDLE------- 273
S +S +S+ +R G FSYCLV + ATS + FG + A RR E
Sbjct: 71 SLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGTASCKP 130
Query: 274 ---------------TTPILLSD-LRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
TP++L RP + + + +S+ +++ P +D+ + GG
Sbjct: 131 APAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQG--GGA 188
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA----- 372
I+D+GT +T + Y+ ++ + L L P FDYCY + S +
Sbjct: 189 ILDSGTSLTMLAKPAYRAVVAALSKRLAGL-----PRVTMDPFDYCYNWTSPSGSDVAAP 243
Query: 373 YPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD--PKYSILGAWQQQNMLIIYDLN 430
P + H + + P Y I+ G C+ +Q+ P S++G QQ L YDL
Sbjct: 244 LPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDLK 303
Query: 431 VPALRFGSENC 441
LRF C
Sbjct: 304 NRRLRFKRSRC 314
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 151/368 (41%), Gaps = 44/368 (11%)
Query: 103 IGTPMKPQHLLFDTASSLVWTQCQ--PCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS- 159
IGTP + Q L+ DT S L W QC + T FDP S+++S++PC PLC+
Sbjct: 86 IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 145
Query: 160 ------PFKCQNGK-CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNS 212
P C + + C Y+ Y G G +E F F N T P L GC+ +++
Sbjct: 146 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTF--SNSQT-TPPLILGCAKEST 202
Query: 213 GFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVRE-----MEATSVIKFGRDADV 267
GILG N LS SQ + FSYC+ + +T G + +
Sbjct: 203 ------DEKGILGMNLGRLSFISQAK---ISKFSYCIPTRSNRPGLASTGSFYLGDNPNS 253
Query: 268 RR---RDLETTP--ILLSDLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
R L T P + +L P Y + L I IG+ + P F G+G ++D+
Sbjct: 254 RGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDS 313
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYDSSF---KAYPSMT 377
G+ T + + Y + + +I+R +G R + Y D C+ + S + +
Sbjct: 314 GSEFTHLVDVAYDKVKE---EIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLV 370
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDD----PKYSILGAWQQQNMLIIYDLNVPA 433
F I+ + + G CV I +I+G QQN+ + +D+
Sbjct: 371 FEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRR 430
Query: 434 LRFGSENC 441
+ F C
Sbjct: 431 VGFSKAEC 438
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 143/362 (39%), Gaps = 56/362 (15%)
Query: 106 PMKPQHLLFDTASSLVWTQCQPC--IRCFDQTTPIFDPRASTTYSEIPCDDPLCR--SPF 161
P Q +L DTAS + W QC PC +C+ QT ++DP S + C P CR P+
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPY 237
Query: 162 K--CQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
C + G+C Y RY G T G + + + + VP+ FGCS+ G
Sbjct: 238 ANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSL---SPTSQVPKFEFGCSHAARGS 294
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRD--- 271
K +GI+ SL SQ + +FSYC G V RR
Sbjct: 295 FSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLG----VPRRSSSR 350
Query: 272 LETTPILLSDLRPHFYLHLLE-ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
TP+L + P Y LE I++ + PP F G +D+ T +T +
Sbjct: 351 YAVTPMLKT---PMLYQVRLEAIAVAGQRLDVPPTVF------AAGAALDSRTVITRLPP 401
Query: 331 GPYQTLMQRYDQILRSLGRQRI----PYNASQEFDYCYRY---DSSFKAYPSMTFHLQEA 383
YQ LRS R ++ P A+ + D CY + S S+ F A
Sbjct: 402 TAYQA--------LRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGA 453
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQ----DDPKYSILGAWQQQNMLIIYDLNVPALRFGSE 439
+ P + F C+A DD I+G Q Q + ++Y++ ++ F
Sbjct: 454 GVQLDPSGVLFGS------CLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRG 507
Query: 440 NC 441
C
Sbjct: 508 AC 509
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 165/379 (43%), Gaps = 48/379 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y +V +GTP + ++ DT S ++W C C C QT+ + FDP +S+T S
Sbjct: 76 LYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGC-PQTSGLQIQLNYFDPGSSSTSSL 134
Query: 150 IPCDDPLCRSPFKC-------QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
I C D CRS + +N +C YT +Y G T G + F T
Sbjct: 135 ISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTN 194
Query: 203 ----LAFGCSNDNSGFAFGGK--ISGILGFNASPLSLSSQLRNRIQGL----FSYCLVRE 252
+ FGCS +G + + GI GF +S+ SQL + QG+ FS+CL +
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSS--QGIAPRVFSHCLKGD 252
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
V+ G ++ ++ +P++ S +PH+ L+L IS+ IVR P F +
Sbjct: 253 NSGGGVLVLG---EIVEPNIVYSPLVPS--QPHYNLNLQSISVNGQIVRIAPSVFATSNN 307
Query: 313 GTGGFIIDTGTPVTFIRNGPYQ----TLMQRYDQILRSL---GRQRIPYNASQEFDYCYR 365
G I+D+GT + ++ Y + Q +RS+ G Q S D +
Sbjct: 308 --RGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIFPQ 365
Query: 366 YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQNM 423
+F S+ Q DY++Q FI + +C+ Q +ILG ++
Sbjct: 366 VSLNFAGGASLVLRPQ--DYLMQQN---FIG-EGSVWCIGFQKISGQSITILGDLVLKDK 419
Query: 424 LIIYDLNVPALRFGSENCA 442
+ +YDL + + + +C+
Sbjct: 420 IFVYDLAGQRIGWANYDCS 438
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 138/369 (37%), Gaps = 76/369 (20%)
Query: 78 NAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP 137
N DI + Y + +++GTP + DT S L+W QC PC C+ Q P
Sbjct: 10 NQLASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEP 69
Query: 138 IFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
+FDP+ S TY T G S ETF G
Sbjct: 70 LFDPKKSKTYK-------------------------------TLGYLSSETFTIGSTEGD 98
Query: 198 -TFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV---REM 253
P LAFGC + N G F K SG++G PLSL QL +++ G FSYCLV +
Sbjct: 99 PASFPGLAFGCGHSNGG-TFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDS 157
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
A+S I FG+ A V + P A +
Sbjct: 158 TASSKINFGKSAVVSGSGTSS-----------------------------PAAAE----- 183
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAY 373
IID+GT +T + P + + +G Q + F CY +
Sbjct: 184 ESNIIIDSGTTLTLL---PRDFYTDMESALTKVIGGQTT-TDPRGTFSLCYSGVKKLE-I 238
Query: 374 PSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPA 433
P++T H AD + P N F++ C ++ +I G Q N L+ YDL
Sbjct: 239 PTITAHFIGADVQLPPLNT-FVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNK 297
Query: 434 LRFGSENCA 442
+ F +C
Sbjct: 298 VSFKPTDCT 306
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 145/334 (43%), Gaps = 31/334 (9%)
Query: 114 FDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRSPFK--CQNGKCVYT 171
DT+S + W C C+ C ++ +F+ ASTTY + C C+ K C G C +
Sbjct: 1 MDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFN 57
Query: 172 RRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPL 231
Y + L S++T VP +FGC +G + + LG
Sbjct: 58 LTYGGSSLAANL-SQDTITLATD----AVPGYSFGCIQKATGGSLPAQGLLGLGRGPL-- 110
Query: 232 SLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-L 288
SL SQ +N Q FSYCL + + + ++ G +R ++ TP+L + RP Y +
Sbjct: 111 SLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR--IKYTPLLKNPRRPSLYFV 168
Query: 289 HLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG 348
+L+ + +GR +V PPG+F G I D+GT T + Y + D +G
Sbjct: 169 NLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAY---IAVRDAFRNRVG 225
Query: 349 RQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQD 408
R + + FD CY A P++TF + + P+N+ C+A+
Sbjct: 226 RN-LTVTSLGGFDTCYTVP---IAAPTITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAA 281
Query: 409 DPK-----YSILGAWQQQNMLIIYDLNVPALRFG 437
P +++ QQQN ++YD VP R G
Sbjct: 282 APDNVNSVLNVIANLQQQNHRLLYD--VPNSRLG 313
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 160/376 (42%), Gaps = 43/376 (11%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT---TPI--FDPRASTTYSEI 150
Y + +GTP + ++ DT S ++W C C C + P+ FDP +S T S I
Sbjct: 51 LYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLI 110
Query: 151 PCDDPLCR-------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR- 202
C D C S QN C Y +Y G T G + F G + +
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNS 170
Query: 203 ---LAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYCLVREM 253
+ FGCS +G + GI GF +S+ SQL + QG+ FS+CL +
Sbjct: 171 SAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLAS--QGISPRAFSHCLKGDD 228
Query: 254 EATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDG 313
++ G ++ ++ TP++ S +PH+ L++ IS+ + P F
Sbjct: 229 SGGGILVLG---EIVEPNIVYTPLVPS--QPHYNLNMQSISVNGQTLAIDPSVFG--TSS 281
Query: 314 TGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK-A 372
+ G IID+GT + ++ Y + I+ R PY + + ++CY SS
Sbjct: 282 SQGTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVR---PYLS--KGNHCYLISSSINDI 336
Query: 373 YPSMTFHLQ-EADYIVQPENMYFIEPDRGR---FCVAIQ--DDPKYSILGAWQQQNMLII 426
+P ++ + A I+ P++ + G +C+ Q +ILG ++ + +
Sbjct: 337 FPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFV 396
Query: 427 YDLNVPALRFGSENCA 442
YD+ + + + +C+
Sbjct: 397 YDIANQRIGWANYDCS 412
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 156/384 (40%), Gaps = 55/384 (14%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSE-- 149
K + V + IGTP + Q ++ DT S L W I+C ++ TP +T+ +
Sbjct: 77 KYSMALVVTLPIGTPPQLQQMVLDTGSQLSW------IQCHNKKTPQKKQPPTTSSFDPS 130
Query: 150 -------IPCDDPLCRS-------PFKCQ-NGKCVYTRRYHVGDVTRGLASRETFAF-PV 193
+PC+ PLC+ P C N C Y+ Y G G RE AF P
Sbjct: 131 LSSSFFVLPCNHPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPS 190
Query: 194 RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM 253
+ P + GC+ + GILG N L SQ + FSYC+ +
Sbjct: 191 QT----TPPIILGCATQSD------DARGILGMNLGRLGFPSQAKIT---KFSYCVPTKQ 237
Query: 254 EATSVIKFGRDADVRRRDLETTPIL-------LSDLRPHFY-LHLLEISIGRHIVRFPPG 305
+ F + +L + +L P Y L L ISIG + PP
Sbjct: 238 AQPASGSFYLGNNPASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPS 297
Query: 306 AFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCY 364
F G+G +ID+G+ T++ + Y + + ++++ +G + + Y D C+
Sbjct: 298 VFKPNAGGSGQTMIDSGSEFTYLVDEAYNVIRE---ELVKKVGPKIKKGYMYGGVADICF 354
Query: 365 RYDS--SFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY----SILGAW 418
D+ + M F ++ IV P+ D G C+ + + +I+G +
Sbjct: 355 DGDAIEIGRLVGDMVFEFEKGVQIVIPKERVLATVDGGVHCLGMGRSERLGAGGNIIGNF 414
Query: 419 QQQNMLIIYDLNVPALRFGSENCA 442
QQN+ + +DL + FG +C+
Sbjct: 415 HQQNLWVEFDLANRRVGFGEADCS 438
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 115/274 (41%), Gaps = 25/274 (9%)
Query: 104 GTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDDPLCR--S 159
GT Q ++ D+ S + W QC+PC C Q P+FDP STTY+ +PC C
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 160 PFK---CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
P++ N +C + Y G G S + + + FGC++ + G AF
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTL---GPYDVIRGFRFGCAHADRGSAF 278
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR--DLET 274
++G L SL Q R +FSYCL + + G + + +
Sbjct: 279 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS 338
Query: 275 TPILLSDLRPHFYLHLLE--ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
TP+L S + P FY LL I GR + PP F + +ID+ T ++ +
Sbjct: 339 TPLLSSSMAPTFYRVLLRAIIVAGRPLA-VPPAVF------SASSVIDSSTIISRLPPTA 391
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY 366
YQ L + + ++ R P + D CY +
Sbjct: 392 YQALRAAFRSAM-TMYRAAPPVSI---LDTCYDF 421
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 115/414 (27%), Positives = 169/414 (40%), Gaps = 84/414 (20%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ----PCIRCFD----QTTPIFDPRASTTYS 148
Y + +NIGTP + +L DT S L W C C+ C D + F P S++
Sbjct: 82 YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141
Query: 149 EIPCDDPLC------RSPF-KCQNGKCV---------------YTRRYHVGDVTRGLASR 186
C P C +P C C + Y G V G+ +R
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTR 201
Query: 187 ETFAFPVRNGFT-----FVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRI 241
+T NG + +P+ FGC G A+ I GI GF LS+ SQL +
Sbjct: 202 DTLRV---NGSSPGVAKEIPKFCFGCV----GSAYREPI-GIAGFGRGTLSMVSQL-GFL 252
Query: 242 QGLFSYCLV-----REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLE-ISI 295
Q FS+C + +S + G A + D++ TP+L S + P+FY LE I++
Sbjct: 253 QKGFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITV 312
Query: 296 GRHIVRFPPGA---FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ-R 351
G P + FD + G GG ID+GT T + P+ Y Q+L L
Sbjct: 313 GNVSATEVPSSLREFDSL--GNGGMKIDSGTTYTHLPE-PF------YSQVLSILQSTIN 363
Query: 352 IPYNASQE----FDYCYRYD-------SSFKAYPSMTFHLQEADYIVQPENMYFI---EP 397
P + E FD CY+ +S PS+TFH +V P+ +F P
Sbjct: 364 YPRDTGMEMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAP 423
Query: 398 DRGRF--CVAIQ-----DDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCANG 444
C+ Q DD + G++QQQN+ ++YDL + F +CA+
Sbjct: 424 GNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCASA 477
>gi|255647724|gb|ACU24323.1| unknown [Glycine max]
Length = 334
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 141/330 (42%), Gaps = 36/330 (10%)
Query: 16 FSVLFLTHFT-----SSESTGFSLKLIPIFSPESPLYP--GNLSQSERIHKMFEISKARA 68
FSV++L +S++ L +IPI+S SP P + S RI M R
Sbjct: 13 FSVIWLMRVNGIDPCASQADNSDLNVIPIYSKCSPFKPPKSDSSWDNRIINMASKDPLRF 72
Query: 69 NYMASMSKPNAFQELEDIHLPMAKQDLF----YSVEVNIGTPMKPQHLLFDTASSLVWTQ 124
Y++++ P+A F Y V V +GTP + ++ DT++ +
Sbjct: 73 KYLSTLVGQKTVSTA-----PIASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVP 127
Query: 125 CQPCIRCFDQTTPIFDPRASTTYSEIPCDDPLCRS--PFKC---QNGKCVYTRRYHVGDV 179
C C C D T F P+AST+Y + C P C C G C + + Y
Sbjct: 128 CSGCTGCSDAT---FSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSF 184
Query: 180 TRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRN 239
+ L +R +P +FGC N +G + + LG SL SQ +
Sbjct: 185 SATLVQDS-----LRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPL--SLLSQSGS 237
Query: 240 RIQGLFSYCL--VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPH-FYLHLLEISIG 296
G+FSYCL + + +K R + + + TTP+L S RP +Y++ IS+G
Sbjct: 238 NYSGIFSYCLPSFKSYYFSGSLKL-RPVG-QPKSIRTTPLLRSPHRPSLYYVNFTGISVG 295
Query: 297 RHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
R +V FP + G IID+GT +T
Sbjct: 296 RVLVPFPSEYLGFNPNTGSGTIIDSGTVIT 325
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/409 (25%), Positives = 168/409 (41%), Gaps = 66/409 (16%)
Query: 89 PMAKQDLFYSVEVNIGT-PMKPQHLLFDTASSLVWTQCQP--CIRC---FDQTTPIF--- 139
P++ ++ Y++ N+G+ P + L DT S LVW C P CI C F+ T P+
Sbjct: 11 PISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITR 70
Query: 140 -------DPRASTTYSEIPCDD--PLCRSPFK------CQNGKCV-YTRRYHVGDVTRGL 183
P ST +S + D + R P C + C + Y G L
Sbjct: 71 SHRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFIAHL 130
Query: 184 ASRETFAFPVRNGFTFVPRLAFGCSN----DNSGFAFGGKISGILGFNASPLSLSSQLRN 239
R+T + F+ FGC++ + +G A G+ G+L A +LS L N
Sbjct: 131 -HRDTLSMSQ----LFLKNFTFGCAHTALAEPTGVAGFGR--GLLSLPAQLATLSPNLGN 183
Query: 240 RIQGLFSYCLV------REMEATSVIKFGR--DADVRRRDLETTPILLSDLRPHFY-LHL 290
R FSYCLV + S + G D R + T +L + +FY + L
Sbjct: 184 R----FSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGL 239
Query: 291 LEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQ 350
IS+G+ + P + R G GG ++D+GT T + Y +++ +D+ + + ++
Sbjct: 240 TGISVGKRTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKR 299
Query: 351 RIPYNASQEFDYCYRYDSSFKAYPSMTFH-LQEADYIVQPENMYFIE----PDRGRFCVA 405
CY + P++T+H L ++ P YF E D R V
Sbjct: 300 ASEVEEKTGLGPCY-FLEGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVG 358
Query: 406 I------QDDPKYS-----ILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
DD + S ILG +QQQ ++YDL + F CA+
Sbjct: 359 CLMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCAS 407
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 152/379 (40%), Gaps = 45/379 (11%)
Query: 85 DIHLPM--AKQDLF--YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFD 140
++ +PM + D Y EV +G+P + L+ DT S W C +
Sbjct: 97 EVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNCSKSFEAVTCASRKCK 156
Query: 141 PRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTF- 199
S +S C P + C+Y Y G +G ++ + NG
Sbjct: 157 VDLSELFSLSVCPKP---------SDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGK 207
Query: 200 VPRLAFGCSNDN-SGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSV 258
+ L GC+ +G F + GILG + S + N+ FSYCLV + SV
Sbjct: 208 LNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSV 267
Query: 259 ---IKFGRD------ADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDI 309
+ G ++RR +L P P + ++++ ISIG +++ PP +D
Sbjct: 268 SSNLTIGGHHNAKLLGEIRRTELILFP-------PFYGVNVVGISIGGQMLKIPPQVWDF 320
Query: 310 MRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR-QRIPYNASQEFDYCYR--- 365
+ GG +ID+GT +T + Y+ + ++ + +SL + +R+ ++C+
Sbjct: 321 --NAEGGTLIDSGTTLTSLLLPAYEAV---FEALTKSLTKVKRVTGEDFDALEFCFDAEG 375
Query: 366 YDSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGAWQQQN 422
+D S P + FH P Y I+ C+ I S++G QQN
Sbjct: 376 FDDS--VVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQN 433
Query: 423 MLIIYDLNVPALRFGSENC 441
L +DL+ + F C
Sbjct: 434 HLWEFDLSTNTVGFAPSTC 452
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 161/378 (42%), Gaps = 47/378 (12%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPI------FDPRASTTYSE 149
Y +V +GTP ++ DT S ++W C C C QT+ + FDP +S+T S
Sbjct: 77 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGC-PQTSGLQIQLNFFDPGSSSTSSM 135
Query: 150 IPCDDPLCR-----SPFKC--QNGKCVYTRRYHVGDVTRGLASRETF----AFPVRNGFT 198
I C D C S C QN +C YT +Y G T G + F
Sbjct: 136 IACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTN 195
Query: 199 FVPRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQG----LFSYCLVRE 252
+ FGCSN +G + GI GF +S+ SQL + QG +FS+CL +
Sbjct: 196 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSS--QGIAPRIFSHCLKGD 253
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
++ G ++ ++ T ++ + +PH+ L+L IS+ ++ F
Sbjct: 254 SSGGGILVLG---EIVEPNIVYTSLVPA--QPHYNLNLQSISVNGQTLQIDSSVF--ATS 306
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK- 371
+ G I+D+GT + ++ Y + + R + + CY SS
Sbjct: 307 NSRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRG-----NQCYLITSSVTD 361
Query: 372 AYPSMTFHLQ-EADYIVQPENMYFIEPDR----GRFCVAIQ--DDPKYSILGAWQQQNML 424
+P ++ + A I++P++ Y I+ + +C+ Q +ILG ++ +
Sbjct: 362 VFPQVSLNFAGGASMILRPQD-YLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKI 420
Query: 425 IIYDLNVPALRFGSENCA 442
++YDL + + + +C+
Sbjct: 421 VVYDLAGQRIGWANYDCS 438
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 155/373 (41%), Gaps = 62/373 (16%)
Query: 84 EDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC---IRCFDQTTPIFD 140
+D+ + + Y + VN+G+P + + DT S LVW +C+ T FD
Sbjct: 88 DDVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFD 147
Query: 141 PRASTTYSEIPCDDPLCRSPFK--CQNG-KCVYTRRYHVGDVTRGLASRETFAFPVRNGF 197
P S+TY + C C + + C +G C Y Y G T G+ S ETF F G
Sbjct: 148 PSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFD-DGGA 206
Query: 198 TFVPR------LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCL 249
PR + FGCS +G +F G++G +SL +QL + FSYCL
Sbjct: 207 GRSPRQVRIGGVKFGCSTATAG-SF--PADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL 263
Query: 250 V-REMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFD 308
V + A+S + FG ADV +TP++ G
Sbjct: 264 VPHSVNASSALNFGALADVTEPGAASTPLV--------------------------GNKT 297
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQE--FDYCY-- 364
+ + I+D+GT +TF+ L D++ R R +P S + CY
Sbjct: 298 VASAASSRIIVDSGTTLTFLDP---SLLGPIVDELSR---RITLPPVQSPDGLLQLCYNV 351
Query: 365 --RYDSSFKAYPSMTFHL-QEADYIVQPENMYFIEPDRGRFCVAI---QDDPKYSILGAW 418
R + ++ P +T A ++PEN F+ G C+AI + SILG
Sbjct: 352 AGREVEAGESIPDLTLEFGGGAAVALKPENA-FVAVQEGTLCLAIVATTEQQPVSILGNL 410
Query: 419 QQQNMLIIYDLNV 431
QQN+ + YDL+
Sbjct: 411 AQQNIHVGYDLDA 423
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 115/274 (41%), Gaps = 25/274 (9%)
Query: 104 GTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQTTPIFDPRASTTYSEIPCDDPLCR--S 159
GT Q ++ D+ S + W QC+PC C Q P+FDP STTY+ +PC C
Sbjct: 71 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130
Query: 160 PFK---CQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAF 216
P++ N +C + Y G G S + + + FGC++ + G AF
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTL---GPYDVIRGFRFGCAHADRGSAF 187
Query: 217 GGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRR--DLET 274
++G L SL Q R +FSYCL + + G + + +
Sbjct: 188 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS 247
Query: 275 TPILLSDLRPHFYLHLLE--ISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGP 332
TP+L S + P FY LL I GR + PP F + +ID+ T ++ +
Sbjct: 248 TPLLSSSMAPTFYRVLLRAIIVAGRPLA-VPPAVF------SASSVIDSSTIISRLPPTA 300
Query: 333 YQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY 366
YQ L + + ++ R P + D CY +
Sbjct: 301 YQALRAAFRSAM-TMYRAAPPVSI---LDTCYDF 330
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 161/404 (39%), Gaps = 54/404 (13%)
Query: 60 MFEISKARANYMASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASS 119
+FE+ RA + + + P +L H ++ +V + +GTP + ++ DT S
Sbjct: 38 LFEL---RARQVPAGALPRPASKLRFHH------NVSLTVSLAVGTPPQNVTMVLDTGSE 88
Query: 120 LVWTQCQPCIRCFD--QTTPIFDPRASTTYSEIPCDDPLCR-----SPFKCQNG--KCVY 170
L W C P ++ F PRAS T++ +PCD CR SP C +C
Sbjct: 89 LSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCRSRDLPSPPACDGASKQCRV 148
Query: 171 TRRYHVGDVTRGLASRETFAF----PVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGF 226
+ Y G + G + E F P+R F + AF S D G +G+LG
Sbjct: 149 SLSYADGSSSDGALATEVFTVGQGPPLRAAFGCM-ATAFDTSPD------GVATAGLLGM 201
Query: 227 NASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLETTPILLSDL---- 282
N LS SQ R FSYC + + + V+ G +D+ L TP+ +
Sbjct: 202 NRGALSFVSQASTR---RFSYC-ISDRDDAGVLLLGH-SDLPFLPLNYTPLYQPAMPLPY 256
Query: 283 --RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
R + + LL I +G + P G G ++D+GT TF+ Y L +
Sbjct: 257 FDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEF 316
Query: 341 DQILRS-LGRQRIPYNASQE-FDYCYRYDSSFKA---YPSMTFHLQEADYIVQPENMYFI 395
+ + L P A QE FD C+R P++T A V + + +
Sbjct: 317 SRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYK 376
Query: 396 EPDR-----GRFCVAIQDDPKYSI----LGAWQQQNMLIIYDLN 430
P G +C+ + I +G Q N+ + YDL
Sbjct: 377 VPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLE 420
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/373 (23%), Positives = 150/373 (40%), Gaps = 49/373 (13%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
+ ++ +V + +G+P + ++ DT S L W C+ +F+P +S+TYS +P
Sbjct: 56 RHNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKS----PNLGSVFNPVSSSTYSPVP 111
Query: 152 CDDPLCRS-------PFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
C P+CR+ P C + C Y G + +TF G P
Sbjct: 112 CSSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVI----GSVTRPG 167
Query: 203 LAFGC--SNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIK 260
FGC S +S K +G++G N LS +QL FSYC + +++ ++
Sbjct: 168 TLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYC-ISGSDSSGILL 223
Query: 261 FGRDADVRRRDLETTPILLSDL------RPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
G + ++ TP++L R + + L I +G I+ P F G
Sbjct: 224 LGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGA 283
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRY----DQILRSLGRQRIPYNASQEFDYCYRYDSS- 369
G ++D+GT TF+ Y L + +LR + + + D CYR SS
Sbjct: 284 GQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGT--MDLCYRVGSST 341
Query: 370 ---FKAYPSMTFHLQEADYIVQPENMYF------IEPDRGRFCVAIQDDPKYSI----LG 416
F P ++ + A+ V + + + E +C + I +G
Sbjct: 342 RPNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIG 401
Query: 417 AWQQQNMLIIYDL 429
QQN+ + +DL
Sbjct: 402 HHHQQNVWMEFDL 414
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 159/383 (41%), Gaps = 40/383 (10%)
Query: 77 PNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT 136
PNA L D L + +Y+ + IGTP + L+ D+ S++ + C C +C +
Sbjct: 72 PNARMRLHDDLL----TNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQD 127
Query: 137 PIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRN 195
P F P S++YS + C+ D C S K +C Y R+Y + G+ + +F +
Sbjct: 128 PRFQPDLSSSYSPVKCNVDCTCDSDKK----QCTYERQYAEMSSSSGVLGEDIVSFGRES 183
Query: 196 GFTFVPRLA-FGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVRE 252
P+ A FGC N +G F GI+G LS+ QL + I FS C
Sbjct: 184 --ELKPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY--- 238
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSD---LR-PHFYLHLLEISIGRHIVRFPPGAFD 308
+ G A V L ++ S+ LR P++ + L EI + +R F+
Sbjct: 239 ----GGMDIGGGAMVLGGMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIFN 294
Query: 309 IMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY---- 364
G ++D+GT ++ + + + SL + R P + + D C+
Sbjct: 295 SKH----GTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYK--DICFAGAG 348
Query: 365 -RYDSSFKAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQ 419
+ +P + + + PEN F G +C+ + + K ++LG
Sbjct: 349 RNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGII 408
Query: 420 QQNMLIIYDLNVPALRFGSENCA 442
+N L+ YD + + F NC+
Sbjct: 409 VRNTLVTYDRHNEKIGFWKTNCS 431
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/428 (23%), Positives = 181/428 (42%), Gaps = 75/428 (17%)
Query: 64 SKARANYMASMSKPNAFQE---LEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFD 115
+ A ++A++ + +A + L + L + L Y + IG+P K ++ D
Sbjct: 43 GRGVAEHLAALRRHDANRHGRLLGAVDLALGGVGLPTDTGLYYTRIEIGSPPKGYYVQVD 102
Query: 116 TASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRS------PFKC- 163
T S ++W C C C ++ +DP S T + C+ C + P C
Sbjct: 103 TGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVGCEQEFCVANSAGGVPPTCP 160
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFVPRLAFGCSNDNSGFAFGG 218
+ C + Y G T G + + +G T + FGC G GG
Sbjct: 161 STSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGC-----GAQLGG 215
Query: 219 KIS-------GILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
+ GILGF S S+ SQL R++ +F++CL + + G +V +
Sbjct: 216 DLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCL-DTVRGGGIFAIG---NVVQ 271
Query: 270 RDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
++TTP++ + H+ ++L IS+G ++ P FD + G IID+GT + ++
Sbjct: 272 PKVKTTPLVPN--VTHYNVNLQGISVGGATLQLPTSTFD--SGDSKGTIIDSGTTLAYLP 327
Query: 330 NGPYQTLMQR-YDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQE----- 382
Y+TL+ +D+ Q +P + Q+F C+++ S +P +TF +
Sbjct: 328 REVYRTLLAAVFDKY------QDLPLHNYQDF-VCFQFSGSIDDGFPVITFSFEGDLTLN 380
Query: 383 ---ADYIVQPEN----MYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
DY+ Q N M F++ V +D +LG N L++YDL +
Sbjct: 381 VYPDDYLFQNRNDLYCMGFLDGG-----VQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIG 435
Query: 436 FGSENCAN 443
+ NC++
Sbjct: 436 WTDYNCSS 443
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 146/363 (40%), Gaps = 38/363 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y +GTP + + D ++ W C P FDP S+TY + C P
Sbjct: 107 YVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPVRCGAPQ 164
Query: 157 C-RSPF-KCQNG---KCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDN 211
C ++P C G C + Y + L ++ A + V FGC +
Sbjct: 165 CSQAPAPSCPGGLGSSCAFNLSY-AASTFQALLGQDALAL--HDDVDAVAAYTFGCLH-- 219
Query: 212 SGFAFGGKI--SGILGFNASPLSLSSQLRNRIQGLFSYCL--VREMEATSVIKFGRDADV 267
GG + G++GF PLS SQ ++ +FSYCL + + ++ G
Sbjct: 220 --VVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGTLRLGPAGQP 277
Query: 268 RRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVT 326
+R ++TTP+L + RP +Y++++ I +G V P A G I+D GT T
Sbjct: 278 KR--IKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFT 335
Query: 327 FIRNGPYQTLMQRYDQILRSLGRQRIPYNAS-QEFDYCYRYDSSFKAYPSMTFHLQEADY 385
+ Y + + RS R R P FD CY S P++TF
Sbjct: 336 RLSAPVYAAVR----DVFRS--RVRAPVAGPLGGFDTCYNVTISV---PTVTFSFDGRVS 386
Query: 386 IVQPENMYFIEPDRGRF-CVAIQDDP------KYSILGAWQQQNMLIIYDLNVPALRFGS 438
+ PE I G C+A+ P ++L + QQQN +++D+ + F
Sbjct: 387 VTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSR 446
Query: 439 ENC 441
E C
Sbjct: 447 ELC 449
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/428 (23%), Positives = 181/428 (42%), Gaps = 75/428 (17%)
Query: 64 SKARANYMASMSKPNAFQE---LEDIHLPMAKQDL-----FYSVEVNIGTPMKPQHLLFD 115
+ A ++A++ + +A + L + L + L Y + IG+P K ++ D
Sbjct: 43 GRGVAEHLAALRRHDANRHGRLLGAVDLALGGVGLPTDTGLYYTRIEIGSPPKGYYVQVD 102
Query: 116 TASSLVWTQCQPCIRCFDQTT-----PIFDPRASTTYSEIPCDDPLCRS------PFKC- 163
T S ++W C C C ++ +DP S T + C+ C + P C
Sbjct: 103 TGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVGCEQEFCVANSAGGVPPTCP 160
Query: 164 -QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG----FTFVPRLAFGCSNDNSGFAFGG 218
+ C + Y G T G + + +G T + FGC G GG
Sbjct: 161 STSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGC-----GAQLGG 215
Query: 219 KIS-------GILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVRR 269
+ GILGF S S+ SQL R++ +F++CL + + G +V +
Sbjct: 216 DLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCL-DTVRGGGIFAIG---NVVQ 271
Query: 270 RDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
++TTP++ + H+ ++L IS+G ++ P FD + G IID+GT + ++
Sbjct: 272 PKVKTTPLVPN--VTHYNVNLQGISVGGATLQLPTSTFD--SGDSKGTIIDSGTTLAYLP 327
Query: 330 NGPYQTLMQR-YDQILRSLGRQRIPYNASQEFDYCYRYDSSF-KAYPSMTFHLQE----- 382
Y+TL+ +D+ Q +P + Q+F C+++ S +P +TF +
Sbjct: 328 REVYRTLLAAVFDKY------QDLPLHNYQDF-VCFQFSGSIDDGFPVITFSFKGDLTLN 380
Query: 383 ---ADYIVQPEN----MYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPALR 435
DY+ Q N M F++ V +D +LG N L++YDL +
Sbjct: 381 VYPDDYLFQNRNDLYCMGFLDGG-----VQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIG 435
Query: 436 FGSENCAN 443
+ NC++
Sbjct: 436 WTDYNCSS 443
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/363 (23%), Positives = 150/363 (41%), Gaps = 33/363 (9%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
+Y+ + IGTP + L+ D+ S++ + C C +C P F P S+TY + C+
Sbjct: 93 YYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCN-- 150
Query: 156 LCRSPFKCQNGK--CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
C + K CVY R Y ++G+ + +F + T R FGC +G
Sbjct: 151 ---MDCNCDDDKEQCVYEREYAEHSSSKGVLGEDLISFGNESQLT-PQRAVFGCETVETG 206
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCL-VREMEATSVIKFGRDADVRRR 270
+ + GI+G LSL QL ++ I F C ++ S+I G D
Sbjct: 207 DLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMI 266
Query: 271 DLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRN 330
++ P D P++ + L I + + F DG G ++D+GT ++
Sbjct: 267 FTDSDP----DRSPYYNIDLTGIRVAGKKLSLNSRVF----DGEHGAVLDSGTTYAYL-- 316
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEF-DYCYRYDSS------FKAYPSMTFHLQEA 383
P + ++R + + F D C+ +S K +PS+ +
Sbjct: 317 -PDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSG 375
Query: 384 -DYIVQPENMYFIEPD-RGRFCVAIQDDPK--YSILGAWQQQNMLIIYDLNVPALRFGSE 439
+++ PEN F G +C+ + + K ++LG +N L++YD + F
Sbjct: 376 QSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRT 435
Query: 440 NCA 442
NC+
Sbjct: 436 NCS 438
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/391 (24%), Positives = 157/391 (40%), Gaps = 53/391 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC---FDQTTPI-------------- 138
Y V V GTP P +L+ DTA+ L W C+ R + +T +
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185
Query: 139 ---FDPRASTTYSEIPCDDP--------LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRE 187
+ P S+++ I C C+SP K ++ C Y ++ G +T G+ +E
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAES--CSYYQQMQDGTLTMGIYGKE 243
Query: 188 TFAFPVRNG-FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
V +G +P L GCS +G + G+L +S + R FS
Sbjct: 244 KATVTVSDGRMAKLPGLILGCSVLEAGGSVDAH-DGVLSLGNGEMSFAVHAAKRFGQRFS 302
Query: 247 YCLVR---EMEATSVIKFGRDADVRR-RDLETTPILLSDLRPHFYLHLLEISIGRHIVRF 302
+CL+ +A+S + FG + V +ET + D++P + + I +G +
Sbjct: 303 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDI 362
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
P +D + GG I+DT T VT + Y + D+ L L R F+Y
Sbjct: 363 PQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPR----VYELDGFEY 418
Query: 363 CYRY----DSSFKAY----PSMTFHLQEADYIVQPENMYFIEPDR--GRFCVAIQDDPKY 412
CYR+ D A+ P +T + + +PE + P+ G C+A + P+
Sbjct: 419 CYRWTFAGDGVDLAHNVTVPRLTVEMAGGARL-EPEAKSVVMPEVVPGVACLAFRKLPRG 477
Query: 413 --SILGAWQQQNMLIIYDLNVPALRFGSENC 441
ILG Q + D +RF + C
Sbjct: 478 GPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 129/290 (44%), Gaps = 37/290 (12%)
Query: 168 CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFN 227
C Y Y G TRG E F G V FGC +N G FGG +SG++G
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKF----GTILVKDFIFGCGRNNKGL-FGG-VSGLMGLG 186
Query: 228 ASPLSLSSQLRNRIQGLFSYCL-VREMEATSVIKFGRDADVRRRDLETTPILLSDL--RP 284
S LSL SQ G+FSYCL E + + + G ++ V R ++PI + + P
Sbjct: 187 RSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYR---NSSPISYAKMIENP 243
Query: 285 HFY----LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRY 340
Y ++L ISIG ++ P G ++D+GT +T + Y+ L +
Sbjct: 244 QLYNFYFINLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTIYKALKAEF 296
Query: 341 DQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHLQ-EADYIVQPENM-YFIEP 397
+ P A D C+ + + P++ H + A+ V + YF++
Sbjct: 297 LKQFTGFP----PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKS 352
Query: 398 DRGRFCVAI-----QDDPKYSILGAWQQQNMLIIYDLNVPALRFGSENCA 442
D + C+A+ QD+ +ILG +QQ+N+ +IYD + F E C+
Sbjct: 353 DASQVCLALASLEYQDE--VAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 148/372 (39%), Gaps = 51/372 (13%)
Query: 100 EVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTP---------IFDPRASTTYSEI 150
V +GTP + DT S L W C C + I+ P AS+T S++
Sbjct: 107 NVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKV 166
Query: 151 PCDDPLCRSPFKCQN--GKCVYTRRY----------HVGDVTRGLASRETFAFPVRNGFT 198
PC+ LC +C + C Y RY V DV L S E + P+R
Sbjct: 167 PCNSTLCTRVDRCASPLSDCPYQIRYLSNGTSSTGVLVEDVLH-LVSMEKNSKPIR---- 221
Query: 199 FVPRLAFGCSNDNSG-FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATS 257
R+ GC +G F G +G+ G +S+ S L S+ + +
Sbjct: 222 --ARITLGCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGDDGAG 279
Query: 258 VIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGF 317
I FG V +R+ TP+ + P + + + +IS+G + FD
Sbjct: 280 RISFGDKGSVDQRE---TPLNIRQPHPTYNVTVTQISVGGNTGDL---EFDA-------- 325
Query: 318 IIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA--YPS 375
+ DTGT T++ + PY + + ++ + +R ++ F+YCY + K+ YP
Sbjct: 326 VFDTGTSFTYLTDAPYTLISESFNSLALD---KRYQTDSELPFEYCYAVSPNKKSFEYPD 382
Query: 376 MTFHLQEADY--IVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQQNMLIIYDLNVPA 433
+ ++ + P + IE D +C+AI SI+G +++D
Sbjct: 383 VNLTMKGGSSYPVYHPLIVVPIE-DTVVYCLAIMKSEDISIIGQNFMTGYRVVFDREKLI 441
Query: 434 LRFGSENCANGR 445
L + +C+ G
Sbjct: 442 LGWKESDCSTGE 453
>gi|125553570|gb|EAY99279.1| hypothetical protein OsI_21243 [Oryza sativa Indica Group]
gi|125605796|gb|EAZ44832.1| hypothetical protein OsJ_29469 [Oryza sativa Japonica Group]
Length = 534
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 116/488 (23%), Positives = 180/488 (36%), Gaps = 88/488 (18%)
Query: 28 ESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIH 87
+ G S +P+++P P + R H ++K M + P + +
Sbjct: 41 QEGGSSSFTLPVWAPHVP----ESGEERREHFRALMAKDMRRMMRQV--PELMSKTDMFE 94
Query: 88 LPM-----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-----------PCIRC 131
LPM Q Y V V IGTP P L +TA+ + W C+ P +
Sbjct: 95 LPMRSALNIAQVGMYVVVVRIGTPALPYSLALETANEVTWINCRLRRRKGKHPGRPHVPP 154
Query: 132 FDQTTPI------------------------FDPRASTTYSEIPCDDPLCRS-PFKC--- 163
T I + P S+++ C C P+
Sbjct: 155 AATTMSIQVDDDGGGGGSGGKSKVTKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCES 214
Query: 164 --QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFGCSNDNSGFAFGGKI 220
QN C Y + +T G+ +E V +G +P L GCS F GG +
Sbjct: 215 PDQNTSCTYYQVMKDSTITSGIYGQEKATVAVSDGTMKKLPGLVIGCST----FEHGGAV 270
Query: 221 S---GILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSVIKFGRDADVRRRDLET 274
+ GIL SP S R G S+CL+ A+S + FG + V+
Sbjct: 271 NSHDGILSLGNSPSSFGIAAARRFGGRLSFCLLATTSGRNASSYLTFGANPAVQAPGTME 330
Query: 275 TPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT----GGFIIDTGTPVTFIRN 330
TP+L D+ + H+ I +G + PP +D G G I+DTGT +T++ +
Sbjct: 331 TPLLYRDVA--YGAHVTGILVGGQPLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVS 388
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY----DSSFKAY----PSMTFHLQ- 381
Y + D L L + I + F+YCY + D A+ PS + +
Sbjct: 389 AVYDPVTAALDSHLAHLPKAEI-----KGFEYCYNWTFAGDGVDPAHNVTIPSFSIEMAG 443
Query: 382 EADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+A +++ E G C+ I P SI+G Q + D LRF
Sbjct: 444 DARLAADAKSIVVPEVVPGVVCLGFNRISQGP--SIIGNVLMQEHIWEIDHMSTVLRFRK 501
Query: 439 ENCANGRQ 446
+ C N +Q
Sbjct: 502 DKCINHQQ 509
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 156/374 (41%), Gaps = 40/374 (10%)
Query: 81 QELEDIHLPMA----KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCI--RCFDQ 134
QE +D P + +D + V V GTP + +L+ DT S W QC C C ++
Sbjct: 109 QESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNK 168
Query: 135 TTPIFDPRASTTYSEIPCDDPLCRSPFKCQNGKCVYTRRYHVGDVTRGL-ASRETFAFPV 193
T F+P S++YS C P N YT +Y ++G+ E P
Sbjct: 169 KT--FNPSLSSSYSNRSC------IPSTDTN----YTMKYEDNSYSKGVFVCDEVTLKP- 215
Query: 194 RNGFTFVPRLAFGCSNDNSGFAFGGKISGILGF-NASPLSLSSQLRNRIQGLFSYCLVRE 252
P+ FGC D+ G F G SG+LG SL SQ ++ + FSYC +
Sbjct: 216 ----DVFPKFQFGC-GDSGGGEF-GTASGVLGLAKGEQYSLISQTASKFKKKFSYCFPPK 269
Query: 253 MEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRD 312
+ FG A L+ T +L +++ L+ IS+ + + F
Sbjct: 270 EHTLGSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLF----- 324
Query: 313 GTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFK- 371
+ G IID+GT +T + Y+ L + Q + P + D CY
Sbjct: 325 ASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQ-EKLLDTCYNLKGCGGR 383
Query: 372 --AYPSMTFH-LQEADYIVQPENMYFIEPDRGRFCVAI--QDDPKY-SILGAWQQQNMLI 425
P + H + E D + P + + D + C+A + +P + +I+G QQ ++ +
Sbjct: 384 NIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKV 443
Query: 426 IYDLNVPALRFGSE 439
+YD+ L FG++
Sbjct: 444 VYDIEGGRLGFGND 457
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 77/156 (49%), Gaps = 11/156 (7%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + IG P +++ DT S + W QC PC C+ Q PIF+P AS +Y+ + C+
Sbjct: 132 YFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASASYAPLSCEAAQ 191
Query: 157 CR--SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSGF 214
CR +C+NG C+Y Y G T G ET G V +A GC ++N G
Sbjct: 192 CRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTI----GVNKVKNVALGCGHNNEGL 247
Query: 215 AFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLV 250
+G++G PLS +QL + FSYCLV
Sbjct: 248 FV--GAAGLIGLGGGPLSFPAQLNSTS---FSYCLV 278
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/167 (32%), Positives = 80/167 (47%), Gaps = 23/167 (13%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPC-IRCFDQTTPIFDPRASTTYSEIPC--- 152
Y V+V G+P + ++ DT SSL W QC+PC + C Q P+FDP AS TY + C
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 153 ----------DDPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR 202
++PLC + + CVYT Y + G S++ +P
Sbjct: 178 QCSSLVDATLNNPLCET----SSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ---TLPG 230
Query: 203 LAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL 249
+GC D+ G G+ +GILG + LS+ Q+ ++ FSYCL
Sbjct: 231 FVYGCGQDSDGLF--GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL 275
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 94/391 (24%), Positives = 155/391 (39%), Gaps = 53/391 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRC---FDQTTPI-------------- 138
Y V V GTP P +L+ DTA+ L W C+ R + +T +
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185
Query: 139 ---FDPRASTTYSEIPCDDP--------LCRSPFKCQNGKCVYTRRYHVGDVTRGLASRE 187
+ P S+++ I C C+SP K ++ C Y ++ G +T G+ +E
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAES--CSYYQQMQDGTLTMGIYGKE 243
Query: 188 TFAFPVRNG-FTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFS 246
V +G +P L GCS +G + G+L +S + R FS
Sbjct: 244 KATVTVSDGRMAKLPGLILGCSVLEAGGSVDAH-DGVLSLGNGEMSFAVHAAKRFGQRFS 302
Query: 247 YCLVR---EMEATSVIKFGRDADVRR-RDLETTPILLSDLRPHFYLHLLEISIGRHIVRF 302
+CL+ +A+S + FG + V +ET + D++P + + I +G +
Sbjct: 303 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDI 362
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
P +D + GG I+DT T VT + Y + D+ L L R F+Y
Sbjct: 363 PQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPR----VYELDGFEY 418
Query: 363 CYRYD--------SSFKAYPSMTFHLQEADYIVQPENMYFIEPDR--GRFCVAIQDDPKY 412
CYR+ + P +T + + +PE + P+ G C+A + P+
Sbjct: 419 CYRWTFAGDGVDLTHNVTVPRLTVEMAGGARL-EPEAKSVVMPEVVPGVACLAFRKLPRG 477
Query: 413 --SILGAWQQQNMLIIYDLNVPALRFGSENC 441
ILG Q + D +RF + C
Sbjct: 478 GPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 89/370 (24%), Positives = 147/370 (39%), Gaps = 40/370 (10%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT---PIFDPRASTTYSEIPC 152
+Y+ V IGTP + L+ DT S++ + C C C P F P S++Y + C
Sbjct: 98 YYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSC 157
Query: 153 DDPLCRSPF-KCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPR-LAFGCSND 210
+ P C + + +C Y R Y ++G+ ++ F NG P L FGC
Sbjct: 158 NSPDCITKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGF--GNGSRLQPHPLLFGCETA 215
Query: 211 NSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSYCLVREMEATSVIKFGRDADVR 268
+G + GI+G PLS+ QL ++ FS C E + G
Sbjct: 216 ETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGA----- 270
Query: 269 RRDLETTPILL---SD-LRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGT 323
+ P ++ SD R ++Y L L EI + + P F +G G ++D+GT
Sbjct: 271 ---IPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF----NGRLGTVLDSGT 323
Query: 324 PVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEA 383
++ + + Q L SL Q +P D C+ S ++ H
Sbjct: 324 TYAYLPDKAFDAFKDAITQQLGSL--QAVPGPDPSYPDVCFAGAGSDSK--ALGKHFPPV 379
Query: 384 DYIVQPENMYFIEPDR---------GRFCVA-IQDDPKYSILGAWQQQNMLIIYDLNVPA 433
D++ F+ P+ G +C+ ++ ++LG +N L+ YD
Sbjct: 380 DFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQ 439
Query: 434 LRFGSENCAN 443
+ F NC N
Sbjct: 440 IGFFKTNCTN 449
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 153/371 (41%), Gaps = 42/371 (11%)
Query: 99 VEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDPLC 157
V + IGTP + Q ++ DT S L W QC + T FDP S+++S +PC+ PLC
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141
Query: 158 RS-------PFKC-QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
+ P C QN C Y+ Y G G RE F P L GC+
Sbjct: 142 KPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQS---TPPLILGCAE 198
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCL-VREMEA----TSVIKFGRD 264
++ GILG N S +SQ + FSYC+ R+ A T G +
Sbjct: 199 AST------DEKGILGMNLGRRSFASQAK---ISKFSYCVPTRQARAGLSSTGSFYLGNN 249
Query: 265 ADVRR----RDLETTPILLS-DLRPHFY-LHLLEISIGRHIVRFPPGAFDIMRDGTGGFI 318
+ R L TP S +L P Y + + I +G + F G G I
Sbjct: 250 PNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTI 309
Query: 319 IDTGTPVTFIRNGPYQTLMQRYDQILRSLG-RQRIPYNASQEFDYCYRYD--SSFKAYPS 375
ID+G+ T++ + Y + + +++R +G + + Y D C+ + + +
Sbjct: 310 IDSGSEFTYLVDEAYNKVRE---EVVRLVGPKLKKGYVYGGVSDMCFDGNPMEIGRLIGN 366
Query: 376 MTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDD----PKYSILGAWQQQNMLIIYDLNV 431
M F ++ IV + + G C+ I +I+G + QQN+ + YDL
Sbjct: 367 MVFEFEKGVEIVIDKWRVLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDLAN 426
Query: 432 PALRFGSENCA 442
+ G +C+
Sbjct: 427 RRIGLGKADCS 437
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 152/379 (40%), Gaps = 60/379 (15%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
FY+V + +G P KP L DT S L W QC PC +C + P++ P +PC D
Sbjct: 56 FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDL----VPCKD 111
Query: 155 PLCRS-----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC S +C+N +C Y Y G + G+ R+ F + NG PRLA GC
Sbjct: 112 PLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCG 171
Query: 209 ND-NSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDA 265
D + G + + GILG +S+ SQL N+ ++ + +C + + FG D
Sbjct: 172 YDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCF--NSKGGGYLFFG-DG 228
Query: 266 DVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG------FII 319
L TP + D H + PG +++ +G +
Sbjct: 229 IYDPYRLVWTP-MSRDYPKH----------------YSPGFGELIFNGRSTGLRNLFVVF 271
Query: 320 DTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
D+G+ T+ YQ L ++ L G+ C+R K+ + +
Sbjct: 272 DSGSSYTYFNAQAYQVLTSLLNRELA--GKPLREAMDDDTLPLCWRGRKPIKSLRDVRKY 329
Query: 380 LQ------------EADYIVQPENMYFIEPDRGRFCVAIQDDPKY-----SILGAWQQQN 422
+ +A + + P Y I G C+ I + +I+G Q+
Sbjct: 330 FKPLALSFSSGGRSKAVFEI-PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQD 388
Query: 423 MLIIYDLNVPALRFGSENC 441
+++Y+ A+ + + NC
Sbjct: 389 KMVVYNNEKQAIGWATANC 407
>gi|115465837|ref|NP_001056518.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|55733881|gb|AAV59388.1| unknown protein [Oryza sativa Japonica Group]
gi|57900669|gb|AAW57794.1| unknown protein [Oryza sativa Japonica Group]
gi|113580069|dbj|BAF18432.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|215697162|dbj|BAG91156.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215768162|dbj|BAH00391.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 535
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 116/488 (23%), Positives = 180/488 (36%), Gaps = 88/488 (18%)
Query: 28 ESTGFSLKLIPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIH 87
+ G S +P+++P P + R H ++K M + P + +
Sbjct: 42 QEGGSSSFTLPVWAPHVP----ESGEERREHFRALMAKDMRRMMRQV--PELMSKTDMFE 95
Query: 88 LPM-----AKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-----------PCIRC 131
LPM Q Y V V IGTP P L +TA+ + W C+ P +
Sbjct: 96 LPMRSALNIAQVGMYVVVVRIGTPALPYSLALETANEVTWINCRLRRRKGKHPGRPHVPP 155
Query: 132 FDQTTPI------------------------FDPRASTTYSEIPCDDPLCRS-PFKC--- 163
T I + P S+++ C C P+
Sbjct: 156 AATTMSIQVDDDGGGGGSGGKSKVTKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCES 215
Query: 164 --QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNG-FTFVPRLAFGCSNDNSGFAFGGKI 220
QN C Y + +T G+ +E V +G +P L GCS F GG +
Sbjct: 216 PDQNTSCTYYQVMKDSTITSGIYGQEKATVAVSDGTMKKLPGLVIGCST----FEHGGAV 271
Query: 221 S---GILGFNASPLSLSSQLRNRIQGLFSYCLVREM---EATSVIKFGRDADVRRRDLET 274
+ GIL SP S R G S+CL+ A+S + FG + V+
Sbjct: 272 NSHDGILSLGNSPSSFGIAAARRFGGRLSFCLLATTSGRNASSYLTFGANPAVQAPGTME 331
Query: 275 TPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT----GGFIIDTGTPVTFIRN 330
TP+L D+ + H+ I +G + PP +D G G I+DTGT +T++ +
Sbjct: 332 TPLLYRDVA--YGAHVTGILVGGQPLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVS 389
Query: 331 GPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRY----DSSFKAY----PSMTFHLQ- 381
Y + D L L + I + F+YCY + D A+ PS + +
Sbjct: 390 AVYDPVTAALDSHLAHLPKAEI-----KGFEYCYNWTFAGDGVDPAHNVTIPSFSIEMAG 444
Query: 382 EADYIVQPENMYFIEPDRGRFCVA---IQDDPKYSILGAWQQQNMLIIYDLNVPALRFGS 438
+A +++ E G C+ I P SI+G Q + D LRF
Sbjct: 445 DARLAADAKSIVVPEVVPGVVCLGFNRISQGP--SIIGNVLMQEHIWEIDHMSTVLRFRK 502
Query: 439 ENCANGRQ 446
+ C N +Q
Sbjct: 503 DKCINHQQ 510
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 81/163 (49%), Gaps = 14/163 (8%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDD 154
FY+V + +G P KP L DT S L W QC PC +C + P++ P +PC D
Sbjct: 56 FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDL----VPCKD 111
Query: 155 PLCRS-----PFKCQN-GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCS 208
PLC S +C+N +C Y Y G + G+ R+ F + NG PRLA GC
Sbjct: 112 PLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCG 171
Query: 209 ND-NSGFAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYC 248
D + G + + GILG +S+ SQL N+ ++ + +C
Sbjct: 172 YDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHC 214
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 110/412 (26%), Positives = 164/412 (39%), Gaps = 79/412 (19%)
Query: 97 YSVEVNIG--TPMKPQHLLFDTASSLVWTQCQP--CIRCFDQTTPIFDPRASTTYS-EIP 151
Y++ N+G +P L DT S LVW C P CI C + P P +TT S +
Sbjct: 48 YTLSFNLGPRAQAQPITLYMDTGSDLVWFPCAPFKCILC--EGKPNASPPVNTTRSVAVS 105
Query: 152 CDDPLC----------------RSPFK------CQNGKCV-YTRRYHVGDVTRGLASRET 188
C P C R P + C N KC + Y G + L R+T
Sbjct: 106 CKSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLY-RDT 164
Query: 189 FAFPVRNGFTFVPRLAFGCS----NDNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGL 244
+ F+ FGC+ + +G A G+ G+L A +LS QL NR
Sbjct: 165 LSLSS----LFLRNFTFGCAYTTLAEPTGVAGFGR--GLLSLPAQLATLSPQLGNR---- 214
Query: 245 FSYCLV------REMEATSVIKFGR--------DADVRRRDLETTPILLSDLRPHFY-LH 289
FSYCLV + S + GR + TP+L + P+FY +
Sbjct: 215 FSYCLVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVG 274
Query: 290 LLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGR 349
L+ IS+G+ IV P + G GG ++D+GT T + G Y +++ +D+ + +
Sbjct: 275 LIGISVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNE 334
Query: 350 QRIPYNASQEFDYCYRYDSSFKAYPSMTFHLQEAD-YIVQPENMYFIEPDRGR------- 401
+ CY Y +S P +T + +V P YF E GR
Sbjct: 335 RARKIEEKTGLAPCY-YLNSVAEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKR 393
Query: 402 --FCVAIQ---DDPKYS-----ILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
C+ + D+ + S LG +QQQ + YDL + F CA+
Sbjct: 394 RVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCAS 445
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 161/379 (42%), Gaps = 50/379 (13%)
Query: 96 FYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQT---TPI--FDPRASTTYSEI 150
Y V +G+P K ++ DT S ++W C C C + P+ FDP +S+T S I
Sbjct: 82 LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 141
Query: 151 PCDDPLC-----RSPFKC--QNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFV--- 200
C D C S C Q +C+YT +Y G T G + F G +
Sbjct: 142 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSS 201
Query: 201 PRLAFGCSNDNSGFAFGG--KISGILGFNASPLSLSSQLRNRIQGL----FSYCLVREME 254
+ FGCS +G + GI GF +S+ SQ+ + QG+ FS+CL +
Sbjct: 202 ASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSS--QGITPKVFSHCLKGDGG 259
Query: 255 ATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGT 314
++ ++ D+ +P++ S +PH+ L+L IS+ + P F +
Sbjct: 260 GGGILVL---GEIVEEDIVYSPLVPS--QPHYNLNLQSISVNGKSLAIDPEVFATSTN-- 312
Query: 315 GGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKA-Y 373
G I+D+GT + ++ Y + + + R + CY SS K +
Sbjct: 313 RGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-----CYLITSSVKGIF 367
Query: 374 PSMTF--------HLQEADYIVQPENMYFIEPDRGRFCVAIQ--DDPKYSILGAWQQQNM 423
P+++ +L+ DY++Q ++ D +C+ Q +ILG ++
Sbjct: 368 PTVSLNFAGGVSMNLKPEDYLLQQNSI----GDAAVWCIGFQKIQGQGITILGDLVLKDK 423
Query: 424 LIIYDLNVPALRFGSENCA 442
+ +YDL + + + +C+
Sbjct: 424 IFVYDLAGQRIGWANYDCS 442
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 89/385 (23%), Positives = 156/385 (40%), Gaps = 32/385 (8%)
Query: 71 MASMSKPNAFQELEDIHLPMAKQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIR 130
+ + PNA L D L + +Y+ + IGTP + L+ D+ S++ + C C +
Sbjct: 67 LGDGAHPNARMRLHDDLL----TNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQ 122
Query: 131 CFDQTTPIFDPRASTTYSEIPCD-DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETF 189
C + P F P S++YS + C+ D C S K +C Y R+Y + G+ +
Sbjct: 123 CGNHQDPRFQPDLSSSYSPVKCNVDCTCDSDKK----QCTYERQYAEMSSSSGVLGEDIV 178
Query: 190 AFPVRNGFTFVPRLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQL--RNRIQGLFSY 247
+F R R FGC N +G F GI+G LS+ QL + I FS
Sbjct: 179 SF-GRESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSL 237
Query: 248 CL-VREMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGA 306
C ++ +++ G A + P+ P++ + L EI + +R
Sbjct: 238 CYGGMDIGGGAMVLGGVPAPSDMVFSHSDPL----RSPYYNIELKEIHVAGKALRVDSRV 293
Query: 307 FDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDYCY-- 364
F+ G ++D+GT ++ + + SL + R P + D C+
Sbjct: 294 FNSKH----GTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYK--DICFAG 347
Query: 365 ---RYDSSFKAYPSMTFHLQEADYI-VQPENMYFIEPD-RGRFCVAIQDDPK--YSILGA 417
+ +P + + + PEN F G +C+ + + K ++LG
Sbjct: 348 AGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGG 407
Query: 418 WQQQNMLIIYDLNVPALRFGSENCA 442
+N L+ YD + + F NC+
Sbjct: 408 IIVRNTLVTYDRHNEKIGFWKTNCS 432
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 95/201 (47%), Gaps = 9/201 (4%)
Query: 245 FSYCLVR-EMEATSVIKFGRDADVRRRDLETTPILLSDLRPHFY-LHLLEISIGRHIVRF 302
FSYCL + SV+ G A + D +TP+L + +P FY L L I +G +
Sbjct: 6 FSYCLTSMDDSKASVLLLGSLAKATK-DAISTPLLTNPSQPSFYYLSLEGIPVGGTQLSI 64
Query: 303 PPGAFDIMRDGTGGFIIDTGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFDY 362
FD+ DG+GG IID+GT +T++ + TL + + S ++ ++S D
Sbjct: 65 EQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEF----ISQSNLQLDKSSSTGLDV 120
Query: 363 CYRY--DSSFKAYPSMTFHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKYSILGAWQQ 420
C+ +++ P + FH + D + E+ + G C+A+ SI G QQ
Sbjct: 121 CFSLPSETTQVEVPKLVFHFKGGDLELPAESYMIADSKLGVACLAMGASNGMSIFGNVQQ 180
Query: 421 QNMLIIYDLNVPALRFGSENC 441
QN+L+ +DL + F C
Sbjct: 181 QNILVNHDLEKETISFVPTQC 201
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 141/340 (41%), Gaps = 37/340 (10%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCDDPL 156
Y + V +GTP K Q L DT SS W C+ C C R STT +++ C +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQSR-STTCAKVSCGTSM 58
Query: 157 C---RSPFKCQNGK----CVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSN 209
C S CQ+ + C + Y G + G+ ++T F + +P +FGC+
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF---SDVQKIPGFSFGCNM 115
Query: 210 DNSGFAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREM-------EATSVIKFG 262
D+ G G + G+LG A P+S+ Q G FSYCL +M + T G
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLG 174
Query: 263 RDADVRRRDLETTPILLSDLRPH-FYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDT 321
R D+ T ++ F++ L IS+ + P F G + D+
Sbjct: 175 GKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFDS 229
Query: 322 GTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEFD-YCYRYDSSFKA-YPSMTFH 379
G+ +++I + L QR ++L G A +E + CY S + P+++ H
Sbjct: 230 GSELSYIPDRALSVLSQRIRELLLRRG------AAEEESERNCYDMRSVDEGDMPAISLH 283
Query: 380 LQEADYIVQPENMYFIE---PDRGRFCVAIQDDPKYSILG 416
+ + F+E ++ +C+A SI+G
Sbjct: 284 FDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 323
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 151/373 (40%), Gaps = 43/373 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +G P +P L DT S L W QC PC C P++ P +P D
Sbjct: 203 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPKDL 259
Query: 156 LCRSPFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
LC+ QN +C Y Y + G+ +R+ NG FGC+ D
Sbjct: 260 LCQELQGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDFVFGCAYD 319
Query: 211 NSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDAD 266
G A K GILG +++ +SL SQL N+ I +F +C+ R+ + G D
Sbjct: 320 QQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGYMFLGDDY- 378
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG---FIIDTGT 323
V R + +TPI S F+ ++ G + MR +G I D+G+
Sbjct: 379 VPRWGMTSTPI-RSAPDNLFHTEAQKVYYGDQQLS--------MRGASGNSVQVIFDSGS 429
Query: 324 PVTFIRNGPYQTLMQ----RYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
T++ + Y+ L+ Y ++ + +P + +F Y D + + + H
Sbjct: 430 SYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVK-QLFKPLNLH 488
Query: 380 LQEADYI------VQPENMYFIEPDRGRFCVAIQ-----DDPKYSILGAWQQQNMLIIYD 428
+ ++ + P+N Y I D+G C+ D I+G + L++YD
Sbjct: 489 FGKRWFVMPRTFTILPDN-YLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVVYD 547
Query: 429 LNVPALRFGSENC 441
+ + + +C
Sbjct: 548 NQQRQIGWTNSDC 560
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 146/368 (39%), Gaps = 52/368 (14%)
Query: 95 LFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTT-PIFDPRASTTYSEIPCD 153
L +++ +N+GTP + S W C PC+ C T P+F +ST+Y+ IPC
Sbjct: 86 LNFAMNLNLGTPPVQHNFTMALNSEFFWAACSPCVDCNVSTNDPLFSSASSTSYTRIPCT 145
Query: 154 DPLCR-SPFKCQNG---------KCVYTRRYHVGDVTRGLASRETFAF--PVRNGFTFVP 201
P C SP N C+Y Y + G + + A P +
Sbjct: 146 SPFCSTSPGFSTNACGSSAVGSTTCLYNFSYSTDYSSAGEMASDVVAMKTPRKTRGNKSL 205
Query: 202 RLAFGCSNDNSGFAFGGKISGILGFNASPLSLSSQLRNR-IQGLFSYCLVREMEATSVIK 260
R++ GC +++ SG++GF + S QL F YC+ + + ++
Sbjct: 206 RMSLGCGRESTTLLGILNTSGLVGFAKTDKSFIGQLAEMDYTSKFIYCVPSDTFSGKIV- 264
Query: 261 FGRDADVRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIID 320
G L TP++++ +Y+ L ISI + FP I+ DGTGG IID
Sbjct: 265 LGNYKISSHSSLSYTPMIVNS-TALYYIGLRSISI-TDTLTFPVQG--ILADGTGGTIID 320
Query: 321 TGTPVTFIRNGPYQTLMQRYDQILRSLGRQRIPYNASQEF---DYCYRYDSSFKAYPSMT 377
+ ++ Y L+Q + +L ++ N + D CY
Sbjct: 321 STFAFSYFTPDSYTPLVQAIQNLNSNL--TKVSSNETAALLGNDICYN------------ 366
Query: 378 FHLQEADYIVQPENMYFIEPDRGRFCVAIQDDPKY----SILGAWQQQNMLIIYDLNVPA 433
+ + D + C+A+ D K +++G +QQ ++ + +DL
Sbjct: 367 VSVNDDD------------AENATVCLAVGDSEKVGFSLNVIGTYQQLDVAVEFDLEKQE 414
Query: 434 LRFGSENC 441
+ FG+ C
Sbjct: 415 IGFGTAGC 422
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 151/373 (40%), Gaps = 43/373 (11%)
Query: 97 YSVEVNIGTPMKPQHLLFDTASSLVWTQCQ-PCIRCFDQTTPIFDPRASTTYSEIPCDDP 155
Y + +G P +P L DT S L W QC PC C P++ P +P D
Sbjct: 204 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPKDL 260
Query: 156 LCRSPFKCQN-----GKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSND 210
LC+ QN +C Y Y + G+ +R+ NG FGC+ D
Sbjct: 261 LCQELQGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDFVFGCAYD 320
Query: 211 NSG--FAFGGKISGILGFNASPLSLSSQLRNR--IQGLFSYCLVREMEATSVIKFGRDAD 266
G A K GILG +++ +SL SQL N+ I +F +C+ R+ + G D
Sbjct: 321 QQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGYMFLGDDY- 379
Query: 267 VRRRDLETTPILLSDLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGG---FIIDTGT 323
V R + +TPI S F+ ++ G + MR +G I D+G+
Sbjct: 380 VPRWGMTSTPI-RSAPDNLFHTEAQKVYYGDQQLS--------MRGASGNSVQVIFDSGS 430
Query: 324 PVTFIRNGPYQTLMQ----RYDQILRSLGRQRIPYNASQEFDYCYRYDSSFKAYPSMTFH 379
T++ + Y+ L+ Y ++ + +P + +F Y D + + + H
Sbjct: 431 SYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVK-QLFKPLNLH 489
Query: 380 LQEADYI------VQPENMYFIEPDRGRFCVAIQ-----DDPKYSILGAWQQQNMLIIYD 428
+ ++ + P+N Y I D+G C+ D I+G + L++YD
Sbjct: 490 FGKRWFVMPRTFTILPDN-YLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVVYD 548
Query: 429 LNVPALRFGSENC 441
+ + + +C
Sbjct: 549 NQQRQIGWTNSDC 561
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 95/397 (23%), Positives = 159/397 (40%), Gaps = 59/397 (14%)
Query: 92 KQDLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIP 151
+ ++ +V V +GTP + ++ DT S L W C + FD AS++Y+ +P
Sbjct: 58 RHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCN-----GSRHDAPFDASASSSYAPVP 112
Query: 152 CDDPLCR--------SPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRL 203
C P C PF C + C + Y GL + +TF G + +P L
Sbjct: 113 CSSPACTWLGRDLPVRPF-CDSSACRVSLSYADASSADGLLAADTFLL----GSSPMPAL 167
Query: 204 AFGCSNDNSGFAFGGKI--SGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKF 261
FGC S + +G+LG N LS +Q R F+YC+ ++
Sbjct: 168 -FGCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTATR---RFAYCIAAGQGPGILLLG 223
Query: 262 GRDADV-----RRRDLETTPIL-LSDLRPHF-----YLHLLEISIGRHIVRFPPGAFDIM 310
G D + ++ L TP++ +S P+F + L I +G ++ P
Sbjct: 224 GNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPD 283
Query: 311 RDGTGGFIIDTGTPVTFIRNGPYQTLMQRY-DQILRSLGRQRIPYNA-----SQEFDYCY 364
G G ++D+GT TF+ Y L + +Q+ RSL P FD C+
Sbjct: 284 HTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACF 343
Query: 365 R-------YDSSFKAYPSMTFHLQEADYIVQPEN--MYFIEPDR-----GRFCVAIQDDP 410
R ++ P + L+ A+ +V +Y + +R G +C+
Sbjct: 344 RGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSD 403
Query: 411 KYS----ILGAWQQQNMLIIYDLNVPALRFGSENCAN 443
++G QQ++ + YDL L F + CA+
Sbjct: 404 MAGVSAYVIGHHHQQDVWVEYDLRNARLGFAAARCAD 440
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/425 (23%), Positives = 167/425 (39%), Gaps = 71/425 (16%)
Query: 37 IPIFSPESPLYPGNLSQSERIHKMFEISKARANYMASMSKPNAFQELEDIHLPMAK---Q 93
+PI P SQ ++F ++R +++ S A + L+D H P K +
Sbjct: 66 LPITQKYGPCSGSGHSQPPSPQEIFGRDESRVSFINSKFNQYAPENLKD-HTPNNKLFDE 124
Query: 94 DLFYSVEVNIGTPMKPQHLLFDTASSLVWTQCQPCIRCFDQTTPIFDPRASTTYSEIPCD 153
D + V+V GTP + L+ DT SS+ WTQC+ C Y+ D
Sbjct: 125 DGNFLVDVAFGTPPQNFTLILDTGSSITWTQCKACT-------------VENNYNMTYGD 171
Query: 154 DPLCRSPFKCQNGKCVYTRRYHVGDVTRGLASRETFAFPVRNGFTFVPRLAFGCSNDNSG 213
D + C T L + F + FG +N G
Sbjct: 172 DSTSVGNYGCD---------------TMTLEPSDVFQ-----------KFQFGRGRNNKG 205
Query: 214 FAFGGKISGILGFNASPLSLSSQLRNRIQGLFSYCLVREMEATSVIKFGRDADVRRRDLE 273
FG + G+LG LS SQ ++ +FSYCL E S++ FG A + L+
Sbjct: 206 -DFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLL-FGEKATSQSSSLK 263
Query: 274 TTPILLS----DLRPHFYLHLLEISIGRHIVRFPPGAFDIMRDGTGGFIIDTGTPVTFIR 329
T ++ +++++L +IS+G + P F + G IID+ T +T +
Sbjct: 264 FTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLP 318
Query: 330 NGPYQTLMQRYDQILR----SLGRQRIPYNASQEFDYCYRYDSSFKA-YPSMTFHL-QEA 383
Y L + + + S GR++ D CY P + H A
Sbjct: 319 QRAYSALKAAFKKAMAKYPLSNGRRK----KGDILDTCYNLSGRKDVLLPEIVLHFGGGA 374
Query: 384 DYIVQPENMYFIEPDRGRFCVAIQD------DPKYSILGAWQQQNMLIIYDLNVPALRFG 437
D + N+ + D R C+A +P+ +I+G QQ ++ ++YD+ + F
Sbjct: 375 DVRLNGTNIVW-GSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFR 433
Query: 438 SENCA 442
S C+
Sbjct: 434 SNGCS 438
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.140 0.433
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,259,584,728
Number of Sequences: 23463169
Number of extensions: 312396738
Number of successful extensions: 758559
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 885
Number of HSP's successfully gapped in prelim test: 1418
Number of HSP's that attempted gapping in prelim test: 752408
Number of HSP's gapped (non-prelim): 2614
length of query: 446
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 300
effective length of database: 8,933,572,693
effective search space: 2680071807900
effective search space used: 2680071807900
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 78 (34.7 bits)